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NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 

1. TECHNICAL FIELD 

The present invention provides novel polynucleotides and proteins encoded by such 
polynucleotides, along with uses for these polynucleotides and proteins, for example in 
therapeutic, diagnostic and research methods. 

2. BACKGROUND 

Technology aimed at the discovery of protein factors (including e.g., cytokines, such as 
lymphokines, interferons, CSFs, chemokines, and interleukins) has matured rapidly over the past 
decade. The now routine hybridization cloning and expression cloning techniques clone novel 
polynucleotides "directly" in the sense that they rely on information directly related to the 
discovered protein (ie. 9 partial DNA/amino acid sequence of the protein in the case of 
hybridization cloning; activity of the protein in the case of expression cloning). More recent 
"indirect" cloning techniques such as signal sequence cloning, which isolates DNA sequences 
based on the presence of a now well-recognized secretory leader sequence motif, as well as 
Various PCR-based or low stringency hybridization-based cloning techniques, have advanced the 
state of the art by making available large numbers of DNA/amino acid sequences for proteins 
that are known to have biological activity, for example, by virtue of their secreted nature in the 
case of leader sequence cloning, by virtue of their cell or tissue source in the case of PCR-based 
techniques, or by virtue of structural similarity to other genes of known biological activity. 

Identified polynucleotide and polypeptide sequences have numerous applications in, for 
example, diagnostics, forensics, gene mapping; identification of mutations responsible for 
genetic disorders or other traits, to assess biodiversity, and to produce many other types of data 
and products dependent on DNA and amino acid sequences. 

3. SUMMARY OF THE INVENTION 

The compositions of the present invention include novel isolated polypeptides, novel 
isolated polynucleotides encoding such polypeptides, including recombinant DNA molecules, 
cloned genes or degenerate variants thereof; especially naturally occurring variants such as allelic 
variants, antisense polynucleotide molecules, and antibodies that specifically recognize one or more 
epitopes present on such polypeptides, as well as hybridomas producing such antibodies. 

The compositions of the present invention additionally include vectors, including expression 
vectors, containing the polynucleotides of the invention, cells genetically engineered to contain such 
polynucleotides and cells genetically engineered to express such polynucleotides. 
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The present invention relates to a collection or library of at least one novel nucleic acid 
sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by 
hybridization (SBH), and in some cases, sequences obtained from one or more public databases. 
4 The invention relates also to the proteins encoded by such polynucleotides, along with therapeutic, 
diagnostic and research utilities for these polynucleotides and proteins. These nucleic acid 
sequences are designated as SEQ ID NOS: 1-13901. The polypeptides sequences are designated 
SEQDDNOS: 13902-27802. The nucleic acids and polypeptides are provided in the Sequence 
8 Listing. In the nucleic acids provided in the Sequence Listing, A is adenosine; C is cytosine; G is 
guanine; T is thymine; and N is any of the four bases. In the amino acids provided in the Sequence 
Listing, * corresponds to the stop codon. 

The nucleic acid sequences of the present invention also include, nucleic acid sequences that 
12 hybridizeto the complement of SEQ ID NO: 1-13901 under stringent hybridizationconditions; 
nucleic acid sequences which are allelic variants or species homologues of any of the nucleic acid 
sequences recited above, or nucleic acid sequences that encode a peptide comprising a specific 
domain or truncation ofthe peptides encoded by SEQ ID NOS: 1-13901. A polynucleotide 
16 comprising a nucleotide sequence having at least 90% identity to an identifying sequence of SEQ 
ID NO: 1-13901 or a degenerate variant or fragment thereof. The identifying sequence can be 100 
base pairs in length. 

The nucleic acid sequences of the present invention also include the sequence information 

20 from the nucleic acid sequences ofSEQ ID NO: 1-13901. Hie sequence information can be a 
segmentof any one of SEQ ID NO: 1-13901 that uniquely identifies or represents the sequence 
infonnationofSEQIDNOS: 1-13901. 

A collection as used in this application can be a collection of only one polynucleotide. The 

24 collection of sequence information or identifying information of each sequence can be provided on 
a nucleic acid array. In one embodiment, segments of sequence information is provided on a 
nucleic acid array to detect the polynucleotide that contains the segment. The array can be designed 
to detect full-match or mismatch to the polynucleotide that contains the segment The collection 

28 can also be provided in a computer-readableformat 

This invention also includes the reverse or direct complement of any ofthe nucleic acid 
sequences recited above; cloning or expression vectors containing the nucleic acid sequences; and 
host cells or organisms transformed with these expression vectors. Nucleic acid sequences (or their 

32 reverse or direct complements) according to the invention have numerous applications in a variety 
of techniques known to those skilled in the art of molecular biology, such as use as hybridization 
probes, use as primers for PCR, use in an array, use in computer-readablemedia, use in sequencing 
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full-length genes, use for chromosome and gene mapping, use in the recombinant production of 
protein, and use in the generation of anti-sense DNA or RNA, their chemical analogs and the like. 
In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1-13901 or novel 

4 segments or parts of the nucleic acids of the invention are used as primers in expression assays that 
are well known in the art In a particularly preferred embodiment, the nucleic acid sequences of 
SEQ ID NO: 1- 13901 or novel segments or parts of the nucleic acids provided herein are used in 
diagnostics for identifying expressed genes or, as well known in the art and exemplified by Vollrath 

8 et aL, Science 258:52-59 (1 992), as expressed sequence tags for physical mapping of the human 
genome. 

The isolated polynucleotides of the invention include, but are not limited to, a 
polynucleotide comprising any one of the nucleotide sequences set forth in SEQ ID NO: 1-13901; a 

12 polynucleotide comprising any of the full length protein coding sequences of SEQ ID NOS: 1 - 
1 3901 ; and a polynucleotide comprising any of the nucleotide sequences of the mature protein 
coding sequences of SEQ ID NOS: 1-13901 . The polynucleotides of the present invention also 
include, but are not limited to, a polynucleotide that hybridizes under stringent hybridization 

16 conditionsto (a) the complement of any one ofthe nucleotide sequences set forth in SEQ ID NO: 1- 
1 3 90 1 ; (b) a nucleotide sequence encoding any one ofthe amino acid sequences set forth in the 
Sequence Listing (e.g., SEQ ID NOS: 13902-27802); (c) a polynucleotide which is an allelic variant 
of any polynucleotides recited above; (d) a polynucleotide which encodes a species homolog (e.g. 

20 orthologs) of any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide 
comprising a specific domain or truncation of any of the polypeptides comprising an amino acid 
sequence set forth in the Sequence Listing. 

The isolated polypeptides of the invention include, but are not limited td, a polypeptide 

24 comprising any ofthe amino acid sequences set forth in the Sequence Listing; or the corresponding 
full length or mature protein. Polypeptides of the invention also include polypeptides with 
biological activity that are encoded by (a) any of the polynucleotides having a nucleotide sequence 
set forth in SEQ ID NO: 1-13901 ; or (b) polynucleotides that hybridize to the complement of the 

28 polynucleotides of (a) under stringent hybridization conditions. Biologically or immunologically 
active variants of any of the polypeptide sequences in the Sequence Listing, and "substantial 
equivalents" thereof (e.g. , with at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% 
amino acid sequence identity) that preferably retain biological activity are also contemplated The 

32 polypeptides of the invention may be wholly or partially chemically synthesized but are preferably 
produced by recombinantmeans using the genetically engineered cells (e.g. host cells) of the 
invention. 
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The invention also provides compositions comprising a polypeptide of the invention. 
Polypeptide compositions of the invention may further comprise an acceptable carrier, such as a 
hydrophilic, e.g., pharmaceutical^ acceptable, carrier. 
4 The invention also provides host cells transformed or transfected with a polynucleotide of 

the invention. 

The invention also relates to methods for producing a polypeptide of the invention 
comprising growing a culture of the host cells of the invention in a suitable culture medium 
8 under conditions permitting expression of the desired polypeptide, and purifying the polypeptide 
from the culture or from the host cells. Preferred embodiments include those in which the 
protein produced by such process is a mature form of the protein. 

Polynucleotides according to the invention have numerous applications in a variety of 
12 techniques known to those skilled in the art of molecular biology. These techniques include use 
as hybridization probes, use as oligomers, or primers, for PCR, use for chromosome and gene 
mapping, use in the recombinant production of protein, and use in generation of anti-sense DNA 
or RNA, their chemical analogs and the like. For example, when the expression of an mRNA is 
1 6 largely restricted to a particular cell or tissue type, polynucleotides of the invention can be used 
as hybridization probes to detect the presence of the particular cell or tissue mRNA in a sample 
using, e.g., in situ hybridization. 

In other exemplary embodiments, the polynucleotides are used in diagnostics as 
20 expressed sequence tags for identifying expressed genes or, as well known in the art and 
exemplified by Vollrath et ah, Science 258:52-59 (1992), as expressed sequence tags for 
physical mapping of the human genome. 

The polypeptides according to the invention can be used in a variety of conventional 
24 procedures and methods that are currently applied to other proteins. For example, a polypeptide 
of the invention can be used to generate an antibody that specifically binds the polypeptide. Such 
antibodies, particularly monoclonal antibodies, are useful for detecting or quantitating the 
polypeptide in tissue. The polypeptides of the invention can also be used as molecular weight 
28 markers, and as a food supplement. 

Methods are also provided for preventing, treating, or ameliorating a medical condition 
which comprises the step of administering to a mammalian subject a therapeutically effective 
amount of a composition comprising a polypeptide of the present invention and a 
32 pharmaceutical^ acceptable carrier. 

In particular, the polypeptides and polynucleotides of the invention can be utilized, for 
example, in methods for the prevention and/or treatment of disorders involving aberrant protein 
expression or biological activity. 
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The present invention further relates to methods for detecting the presence of the 
polynucleotides or polypeptides of the invention in a sample. Such methods can, for example, be 
utilized as part of prognostic and diagnostic evaluation of disorders as recited herein and for the 

4 identification of subjects exhibiting a predisposition to such conditions. The invention provides 
a method for detecting the polynucleotides of the invention in a sample, comprising contacting 
the sample with a compound that binds to and forms a complex with the polynucleotide of 
interest for a period sufficient to form the complex and under conditions sufficient to form a 

8 complex and detecting the complex such that if a complex is detected, the polynucleotide of 
interest is detected. The invention also provides a method for detecting the polypeptides of the 
invention in a sample comprising contacting the sample with a compound that binds to and forms 
a complex with the polypeptide under conditions and for a period sufficient to form the complex 
12 and detecting the formation of the complex such that if a complex is formed, the polypeptide is 
detected 

The invention also provides kits comprising polynucleotide probes and/or monoclonal 
antibodies, and optionally quantitative standards, for carrying out methods of the invention. 
16 Furthermore, the invention provides methods for evaluating the efficacy of drugs, and 

monitoring the progress of patients, involved in clinical trials for the treatment of disorders as 
recited above. 

The invention also provides methods for the identification of compounds that modulate 

20 (j. e. , increase or decrease) the expression or activity of the polynucleotides and/or polypeptides 
of the invention. Such methods can be utilized, for example, for the identification of compounds 
that can ameliorate symptoms of disorders as recited herein. Such methods can include, but are 
not limited to, assays for identifying compounds and other substances that interact with (e.g., 

24 bind to) the polypeptides of the invention. The invention provides a method for identifying a 
compound that binds to the polypeptides of the invention comprising contacting the compound 
with a polypeptide of the invention in a cell for a time sufficient to form a 
polypeptide/compound complex, wherein the complex drives expression of a reporter gene 

28 sequence in the cell; and detecting the complex by detecting the reporter gene sequence 

expression such that if expression of the reporter gene is detected the compound the binds to a 
polypeptide of the invention is identified. 

The methods of the invention also provides methods for treatment which involve the 

32 administration of the polynucleotides or polypeptides of the invention to individuals exhibiting 
symptoms or tendencies. In addition, the invention encompasses methods for treating diseases or 
disorders as recited herein comprising administering compounds and other substances that 
modulate the overall activity of the target gene products. Compounds and other substances can 
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effect such modulation either on the level of target gene/protein expression or target protein 
activity. 

The polypeptides of the present invention and the polynucleotides encoding them are also 
4 useful for the same functions known to one of skill in the art as the polypeptides and 

polynucleotides to which they have homology (set forth in Table 2). If no homology is set forth 
for a sequence, then the polypeptides and polynucleotides of the present invention are useful for 
a variety of applications, as described herein, including use in arrays for detection. 

8 

4. DETAILED DESCRIPTION OF THE INVENTION 

12 4.1 DEFINITIONS 

It must be noted that as used herein and in the appended claims, the singular forms "a", 

"an" and "the" include plural references unless the context clearly dictates otherwise. 

The term "active" refers to those forms of the polypeptide which retain the biologic 

16 and/or immunologic activities of any naturally occurring polypeptide. According to the 

invention, the terms "biologically active" or "biological activity" refer to a protein or peptide 

having structural, regulatory or biochemical functions of a naturally occurring molecule. 

Likewise "immunologically active" or "immunological activity" refers to the capability of the 

20 natural, recombinant or synthetic polypeptide to induce a specific immune response in 

appropriate animals or cells and to bind with specific antibodies. 

The term "activated cells" as used in this application are those cells which are engaged in 

♦ 

extracellular or intracellular membrane trafficking, including the export of secretory or 

24 enzymatic molecules as part of a normal or disease process. 

The terms "complementary" or "complementarity" refer to the natural binding of 
polynucleotides by base pairing. For example, the sequence 5'-AGT-3' binds to the 
complementary sequence 3'-TCA-5\ Complementarity between two single-stranded molecules 

28 may be "partial" such that only some of the nucleic acids bind or it may be "complete" such that 
total complementarity exists between the single stranded molecules. The degree of 
complementarity between the nucleic acid strands has significant effects on the efficiency and 
strength of the hybridization between the nucleic acid strands. 

32 The term "embryonic stem cells (ES)" refers to a cell that can give rise to many 

differentiated cell types in an embryo or an adult, including the germ cells. The term "germ line 
stem cells (GSCs)" refers to stem cells derived from primordial stem cells that provide a steady 
and continuous source of germ cells for the production of gametes. The term "primordial germ 
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cells (PGCs)" refers to a small population of cells set aside from other cell lineages particularly 
from the yolk sac, mesenteries, or gonadal ridges during embryogenesis that have the potential to 
differentiate into germ cells and other cells. PGCs are the source from which GSCs and ES cells 
4 are derived The PGCs, the GSCs and the ES cells are capable of self-renewal. Thus these cells 
not only populate the germ line and give rise to a plurality of terminally differentiated cells that 
comprise the adult specialized organs, but are able to regenerate themselves. 

The term "expression modulating fragment," EMF, means a series of nucleotides which 
8 modulates the expression of an operably linked ORF or another EMF. 

As used herein, a sequence is said to "modulate the expression of an operably linked 
sequence" when the expression of the sequence is altered by the presence of the EMF. EMFs 
include, but are not limited to, promoters, and promoter modulating sequences (inducible 
12 elements). One class of EMFs are nucleic acid fragments which induce the expression of an 
operably linked ORF in response to a specific regulatory factor or physiological event. 

The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or 
"oligonculeotide" are used interchangeably and refer to a heteropolymer of nucleotides or the 
16 sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or synthetic 
origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like material. In the 
sequences herein A is adenine, C is cytosine, T is mymine, G is guanine and N is A, C, G or T 
20 (U). It is contemplated that where the polynucleotide is RNA, the T (mymine) in the sequences 
provided herein is substituted with U (uracil). Generally, nucleic acid segments provided by this 
invention may be assembled from fragments of the genome and short oligonucleotide linkers, or 

o 

from a series of oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic 
24 acid which is capable of being expressed in a recombinant transcriptional unit comprising 
regulatory elements derived from a microbial or viral operon, or a eukaryotic gene. 

The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or 
"segment" or "probe" or "primer" are used interchangeably and refer to a sequence of nucleotide 
28 residues which are at least about 5 nucleotides, more preferably at least about 7 nucleotides, 
more preferably at least about 9 nucleotides, more preferably at least about 1 1 nucleotides and 
most preferably at least about 17 nucleotides. The fragment is preferably less than about 500 
nucleotides, preferably less than about 200 nucleotides, more preferably less than about 100 
32 nucleotides, more preferably less than about 50 nucleotides and most preferably less than 30 
nucleotides. Preferably the probe is from about 6 nucleotides to about 200 nucleotides, 
preferably from about 1 5 to about 50 nucleotides, more preferably from about 1 7 to 30 
nucleotides and most preferably from about 20 to 25 nucleotides. Preferably the fragments can 
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be used in polymerase chain reaction (PCR), various hybridization procedures or microarray 
procedures to identify or amplify identical or related parts ofmRNA or DNA molecules. A 
fragment or segment may uniquely identify each polynucleotide sequence of the present 
invention. Preferably the fragment comprises a sequence substantially similar to any one of SEQ 
IDNOs: 1-13901 . 

Probes may, for example, be used to <ietermine whether specific mRNA molecules are 
present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal DNA as 
described by Walsh et al. (Walsh, P.S. et al., 1992, PCR Methods Appl 1 :241-250). They may 
be labeled by nick translation, Klenow fill-in reaction, PCR, or other methods well known in the 
art Probes of the present invention, their preparation and/or labeling are elaborated in 
Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory, NY; or Ausubel, F.M. et al., 1989, Current Protocols in Molecular Biology, John 
Wiley & Sons, New York NY, both of which are incorporated herein by reference in their 
entirety. 

The nucleic acid sequences of the present invention also include the sequence 
information from the nucleic acid sequences of SEQ ID NOS: 1-13901. The sequence 
information can be a segment of any one of SEQ ID NO: 1-13901 that uniquely identifies or 
represents the sequence information of that sequence of SEQ ID NOS: 1-13901. One such 
segment can be a twenty-mer nucleic acid sequence because the probability that a twenty-mer is 
fully matched in the human genome is 1 in 300. In the human genome, there are three billion 
base pairs in one set of chromosomes. Because 4 20 possible twenty-mers exist, there are 300 
times more twenty-mers than there are base pairs in a set of human chromosomes. Using the 
same analysis, the probability for a seventeen-mer to be fully matched in the human genome is 
approximately 1 in 5. When these segments are used in arrays for expression studies, fifteen- 
mer segments can be used. The probability that the fifteen-mer is fully matched in the expressed 
sequences is also approximately one in five because expressed sequences comprise less than 
approximately 5% of the entire genome sequence. 

Similarly, when using sequence information for detecting a single mismatch, a segment can 
be a twenty-five mer. The probability that the twenty-five mer would appear in a human genome 
with a single mismatch is calculatedby multiplying the probability for a full match (I-t-4 25 ) times the 
increased probability for mismatch at each nucleotide position (3 x 25). The probability that an 
eighteen mer with a single mismatch can be detected in an array for expression studies is 
approximately one in five. The probability that a twenty-mer with a single mismatch can be 
detected in a human genome is approximately one in five. 
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The term "open reading frame," ORF, means a series of nucleotide triplets coding for 
amino acids, without any termination codons and is a sequence translatable into protein. 

The terms te operably linked" or "operably associated" refer to functionally related nucleic 
4 acid sequences. For example, a promoter is operably associated or operably linked with a coding 
sequence if the promoter controls the transcription of the coding sequence. While operably 
linked nucleic acid sequences can be contiguous and in the same reading frame, certain genetic 
elements e.g. repressor genes are not contiguously linked to the coding sequence but still control 
8 transcription/translation of the coding sequence. 

The term "pluripotenf ' refers to the capability of a cell to differentiate into a number of 
differentiated cell types that are present in an adult organism. A pluripotent cell is restricted in its 
differentiation capability in comparison to a totipotent cell. 
12 The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an oligopeptide, 

peptide, polypeptide or protein sequence or fragment thereof and to naturally occurring or 
synthetic molecules. A polypeptide "fragment," "portion," or "segment" is a stretch of amino 
acid residues of at least about 5 amino acids, preferably at least about 7 amino acids, more 
1 6 preferably at least about 9 amino acids and most preferably at least about 1 7 or more amino 
acids. The peptide preferably is not greater than about 200 amino acids, more preferably less 
than 150 amino acids and most preferably less than 100 amino acids. Preferably the peptide is 
from about 5 to about 200 amino acids. To be active, any polypeptide must have sufficient 
20 length to display biological and/or immunological activity. 

The term "naturally occurring polypeptide" refers to polypeptides produced by cells that 
have not been genetically engineered and specifically contemplates various polypeptides arising 
from post-translational modifications of the polypeptide including, but not limited to, 
24 acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. 

The term "translated protein coding portion" means a sequence which encodes for the full 
length protein which may include any leader sequence or any processing sequence. 

The term "mature protein coding sequence" means a sequence which encodes a peptide 
28 or protein without a signal or leader sequence. The "mature protein portion" means that portion 
of the protein which does not include a signal or leader sequence. The peptide may have been 
produced by processing in the cell which removes any leader/signal sequence. The mature 
protein portion may or may not include the initial methionine residue. The methionine residue 
32 may be removed from the protein during processing in the cell. The peptide may be produced 
synthetically or the protein may have been produced using a polynucleotide only encoding for 
the mature protein coding sequence. 
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The term "derivative" refers to polypeptides chemically modified by such techniques as 
ubiquitination, labeling (e.g., with radionuclides or various enzymes), covalent polymer 
attachment such as pegylation (derivatization with polyethylene glycol) and insertion or 
4 substitution by chemical synthesis of amino acids such as ornithine, which do not normally occur 
in human proteins. 

The term "variant"(or "analog") refers to any polypeptide differing from naturally 
occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, e g., 
8 recombinant DNA techniques. Guidance in determining which amino acid residues may be 
replaced, added or deleted without abolishing activities of interest, may be found by comparing 
the sequence of the particular polypeptide with that of homologous peptides and minimizing the 
number of amino acid sequence changes made in regions of high homology (conserved regions) 

12 or by replacing amino acids with consensus sequence. 

Alternatively, recombinant variants encoding these same or similar polypeptides may be 
synthesized or selected by making use of the "redundancy " in the genetic code. Various codon 
substitutions, such as the silent changes which produce various restriction sites, may be 

16 introduced to optimize cloning into a plasmid or viral vector or expression in a particular 

prokaryotic or eukaryotic system. Mutations in the polynucleotide sequence may be reflected in 
the polypeptide or domains of other peptides added to the polypeptide to modify the properties of 
any part of the polypeptide, to change characteristics such as ligand-binding affinities, interchain 

20 affinities, or degradation/turnover rate. 

Preferably, amino acid "substitutions" are the result of replacing one amino acid with 
another amino acid having similar structural and/or chemical properties, i.e., conservative amino 
acid replacements. "Conservative" amino acid substitutions may be made on the basis of 

24 similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues involved. For example, nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar 
neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and 

28 glutamine; positively charged (basic) amino acids include arginine, lysine, and histidine; and 
negatively charged (acidic) amino acids include aspartic acid and glutamic acid. "Insertions" or 
"deletions" are preferably in the range of about 1 to 20 amino acids, more preferably 1 to 10 
amino acids. The variation allowed may be experimentally determined by systematically making 

32 insertions, deletions, or substitutions of amino acids in a polypeptide molecule using 

recombinant DNA techniques and assaying the resulting recombinant variants for activity. 

Alternatively, where alteration of function is desired, insertions, deletions or 
non-conservative alterations can be engineered to produce altered polypeptides. Such alterations 
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can, for example, alter one or more of the biological functions or biochemical characteristics of 
the polypeptides of the invention. For example, such alterations may change polypeptide 
characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover 
4 rate. Further, such alterations can be selected so as to generate polypeptides that are better suited 
for expression, scale up and the like in the host cells chosen for expression. For example, 
cysteine residues can be deleted or substituted with another amino acid residue in order to 
eliminate disulfide bridges. 
8 The terms "purified" or "substantially purified" as used herein denotes that the indicated * 

nucleic acid or polypeptide is present in the substantial absence of other biological 
macromolecules, e.g., polynucleotides, proteins, and the like. In one embodiment, the 
polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, more 

12 preferably at least 99% by weight, of the indicated biological macromolecules present (but water, 
buffers, and other small molecules, especially molecules having a molecular weight of less than 
1000 daltons, can be present). 

The term "isolated" as used herein refers to a nucleic acid or polypeptide separated from 

16 at least one other component (e.g. , nucleic acid or polypeptide) present with the nucleic acid or 
polypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide is found in 
the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a 
solution of the same. The terms "isolated" and "purified" do not encompass nucleic acids or 

20 polypeptides present in their natural source. 

The term "recombinant," when used herein to refer to a polypeptide or protein, means 
that a polypeptide or protein is derived from recombinant (e.g., microbial, insect, or mammalian) 
expression systems. "Microbial" refers to recombinant polypeptides or proteins made in 

24 bacterial or fungal (e.g. , yeast) expression systems. As a product, "recombinant microbial" 
defines a polypeptide or protein essentially free of native endogenous substances and 
unaccompanied by associated native giycosylation. Polypeptides or proteins expressed in most 
bacterial cultures, e.g., E. coli, will be free of giycosylation modifications; polypeptides or 

28 proteins expressed in yeast will have a giycosylation pattern in general different from those 
expressed in mammalian cells. 

The term "recombinant expression vehicle or vector" refers to a plasmid or phage or virus 
or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression vehicle can 

32 comprise a transcriptional unit comprising an assembly of (1) a genetic element or elements 

having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural 
or coding sequence which is transcribed into mRNA and translated into protein, and (3) 
appropriate transcription initiation and termination sequences. Structural units intended for use 
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in yeast or eukaryotic expression systems preferably include a leader sequence enabling 
extracellular secretion of translated protein by a host cell. Alternatively, where recombinant 
protein is expressed without a leader or transport sequence, it may include an amino terminal 
4 methionine residue. This residue may or may not be subsequently cleaved from the expressed 
recombinant protein to provide a final product 

The term "recombinant expression system" means host cells which have stably integrated 
a recombinant transcriptional unit into chromosomal DNA or carry the recombinant 
8 transcriptional unit extrachromosomally. Recombinant expression systems as defined herein will 
express heterologous polypeptides or proteins upon induction of the regulatory elements linked 
to the DNA segment or synthetic gene to be expressed. This term also means host cells which 
have stably integrated a recombinant genetic element or elements having a regulatory role in 

12 gene expression, for example, promoters or enhancers. Recombinant expression systems as 

defined herein will express polypeptides or proteins endogenous to the cell upon induction of the 
regulatory elements linked to the endogenous DNA segment or gene to be expressed. The cells 
can be prokaryotic or eukaryotic. 

16 The term "secreted" includes a protein that is transported across or through a membrane, 

including transport as a result of signal sequences in its amino acid sequence when it is 
expressed in a suitable host cell. "Secreted" proteins include without limitation proteins secreted 
wholly (e.g. , soluble proteins) or partially (e.g. , receptors) from the cell in which they are 

20 expressed. "Secreted" proteins also include without limitation proteins that are transported 
across the membrane of the endoplasmic reticulum. "Secreted" proteins are also intended to 
include proteins containing non-typical signal sequences (e.g. Interleukin-1 Beta, see Krasney, 
P.A. and Young, P.R. (1992) Cytokine 4(2):134 -143) and factors released from damaged cells 

24 (e.g. Interleukin-1 Receptor Antagonist, see Arend, W.P. et. al. (1998) Anna Rev. Immunol. 
16:27-55) 

Where desired, an expression vector may be designed to contain a "signal or leader 
sequence" which will direct the polypeptide through the membrane of a cell. Such a sequence 

28 may be naturally present on the polypeptides of the present invention or provided from 
heterologous protein sources by recombinant DNA techniques. 

The term "stringent" is used to refer to conditions that are commonly understood in the 
art as stringent Stringent conditions can include highly stringent conditions (ie., hybridization 

32 to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 
65°C, and washing in 0.1X SSC/0.1% SDS at 68°C), and moderately stringent conditions (i.e., 
washing in 0.2X SSC/0.1% SDS at 42°C). Other exemplary hybridization conditions are 
described herein in the examples. 
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In instances of hybridization of deoxyoligonucleotides, additional exemplary stringent 
hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate at 37°C (for 
14-base oligonucleotides), 48°C (for 17-base oligos), 55°C (for 20-base oligonucleotides), and 
4 60°C (for 23-base oligonucleotides). 

As used herein, "substantially equivalent" can refer both to nucleotide and amino acid 
sequences, for example a mutant sequence, that varies from a reference sequence by one or more 
substitutions, deletions, or additions, the net effect of which does not result in an adverse 
8 functional dissimilarity between the reference and subject sequences. Typically, such a 

substantially equivalent sequence varies from one of those listed herein by no more than about 
35% (f.e., the number of individual residue substitutions, additions, and/or deletions in a 
substantially equivalent sequence, as compared to the corresponding reference sequence, divided 

12 by the total number of residues in the substantially equivalent sequence is about 0.35 or less). 
Such a sequence is said to have 65% sequence identity to the listed sequence. In one 
embodiment, a substantially equivalent, e.g., mutant, sequence of the invention varies from a 
listed sequence by no more than 30% (70% sequence identity); in a variation of this 

1 6 embodiment, by no more than 25% (75% sequence identity); and in a further variation of this 
embodiment, by no more than 20% (80% sequence identity) and in a further variation of this 
embodiment, by no more than 10% (90% sequence identity) and in a further variation of this 
embodiment, by no more that 5% (95% sequence identity). Substantially equivalent, eg., 

20 mutant, amino acid sequences according to the invention preferably have at least 80% sequence 
identity with a listed amino acid sequence, more preferably at least 85% sequence identity, more 
preferably at least 90% sequence identity, more preferably at least 95% identity, more preferably 
at least 98% identity, and most preferably at least 99% identity. Substantially equivalent 

24 nucleotide sequences of the invention can have lower percent sequence identities, taking into 
account, for example, the redundancy or degeneracy of the genetic code. Preferably, nucleotide 
sequence has at least about 65% identity, more preferably at least about 75% identity, more 
preferably at least about 80% sequence identity, more preferably at least about 85% sequence 

28 identity, more preferably at least about 90% sequence identity, and most preferably at least about 
95% identity, more preferably at least about 98% sequence identity, and most preferably at least 
about 99% sequence identity. For the purposes of the present invention, sequences having 
substantially equivalent biological activity and substantially equivalent expression characteristics 

32 are considered substantially equivalent. For the purposes of determining equivalence, truncation 
of the mature sequence (e.g., via a mutation which creates a spurious stop codon) should be 
disregarded. Sequence identity may be determined, using the Jotun Hein method (Hein, J. 
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(1990) Methods Enzymol. 183:626-645). Identity between sequences can also be determined by 
other methods known in the art, e.g. by varying hybridization conditions. 

The term "totipotenf ' refers to the capability of a cell to differentiate into all of the cell 
4 types of an adult organism. 

The term "transformation" means introducing DNA into a suitable host cell so that the 
DNA is replicable, either as an extrachromosomal element, or by chromosomal integration. The 
term "transfection" refers to the taking up of an expression vector by a suitable host cell, whether 
8 or not any coding sequences are in fact expressed. The term "infection" refers to the introduction 
of nucleic acids into a suitable host cell by use of a virus or viral vector. 

As used herein, an "uptake modulating fragment," UMF, means a series of nucleotides 
which mediate the uptake of a linked DNA fragment into a cell. UMFs can be readily identified 
12 using known UMFs as a target sequence or target motif with the computer-based systems 
described below. The presence and activity of a UMF can be confirmed by attaching the 
suspected UMF to a marker sequence. The resulting nucleic acid molecule is then incubated 
with an appropriate host under appropriate conditions and the uptake of the marker sequence is 
16 determined. As described above, a UMF will increase the frequency of uptake of a linked 
marker sequence. 

Each of the above terms is meant to encompass all that is described for each, unless the 
context dictates otherwise. 

20 

4.2 NUCLEIC ACIDS OF THE INVENTION 

Nucleotide sequences of the invention are set forth in the Sequence Listing. 

The isolated polynucleotides of the invention include a polynucleotide comprising the 

24 nucleotide sequences of SEQ ID NO: 1-13901 ; a polynucleotide encoding any one of the 
peptide sequences of SEQ ID NOS: 13902-27802; and a polynucleotide comprising the 
nucleotide sequence encoding the mature protein coding sequence of the polypeptides of any one 
of SEQ ID NOS: 13902-27802. The polynucleotides of the present invention also include, but 

28 are not limited to, a polynucleotide that hybridizes under stringent conditions to (a) the 
complement of any of the nucleotides sequences of SEQ ID NO: 1-13901; (b) nucleotide 
sequences encoding any one of the amino acid sequences set forth in the Sequence Listing; (c) a 
polynucleotide which is an allelic variant of any polynucleotide recited above; (d) a 

32 polynucleotide which encodes a species homolog of any of the proteins recited above; or (e) a 
polynucleotide that encodes a polypeptide comprising a specific domain or truncation of the 
polypeptides of SEQ ID NOS: 13902-27802. Domains of interest may depend on the nature of 
the encoded polypeptide; e.g., domains in receptor-like polypeptides include ligand-binding, 
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extracellular, transmembrane, or cytoplasmic domains, or combinations thereof; domains in 
immunoglobulin-like proteins include the variable immunoglobulin-like domains; domains in 
enzyme-like polypeptides include catalytic and substrate binding domains; and domains in ligand 
4 polypeptides include receptor-binding domains. 

The polynucleotides of the invention include naturally occurring or wholly or partially 
synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g. , mRNA. The polynucleotides 
may include all of the coding region of the cDNA or may represent a portion of the coding 
8 region of the cDNA. 

The present invention also provides genes corresponding to the cDNA sequences disclosed 
herein. The corresponding genes can be isolated in accordance with known methods using the 
sequence information disclosed herein. Such methods include the preparation of probes or primers 

12 from the disclosed sequence informationfor identification and/or amplification of genes in 

appropriate genomic libraries or other sources of genomic materials. Further 5 f and 3' sequence can 
be obtained using methods known in the art. For example, full length cDN A or genomic DNA that 
corresponds to any of the polynucleotides of SEQ ID NO: 1-13901 can be obtained by screening 

1 6 appropriate cDN A or genomic DNA libraries under suitable hybridization conditions using any of 
the polynucleotides of SEQ ID NO: 1-13901 or a portion thereof as a probe. Alternatively, the 
polynucleotidesof SEQ ID NO: 1-13901 may be used as the basis for suitable primer(s)that allow 
identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries. 

20 The nucleic acid sequences of the invention can be assembled from ESTs and sequences 

(including cDNA and genomic sequences) obtained from one or more public databases, such as 
dbEST, gbpri, and UniGene. The EST sequences can provide identifying sequence information, 
representativefragment or segment information, or novel segment information for the full-length 

24 gene. 

The polynucleotides of the invention also provide polynucleotides including nucleotide 
sequences that are substantially equivalent to the polynucleotides recited above. Polynucleotides 
according to the invention can have, e.g., at least about 65%, at least about 70%, at least about 

28 75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least about 85%, 86%, 87%, 
88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, and even more typically at 
least about 95%, 96%, 97%, 98%, 99%, sequence identity to a polynucleotide recited above. 

Included within the scope of the nucleic acid sequences of the invention are nucleic acid 

32 sequence fragments that hybridize under stringent conditions to any of the nucleotide sequences 
of SEQ ED NO: 1-13901, or complements thereof which fragment is greater than about 5 
nucleotides, preferably 7 nucleotides* more preferably greater than 9 nucleotides and most 
preferably greater than 17 nucleotides. Fragments of, e.g. 1 5, 17, or 20 nucleotides or more that 
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are selective for (Le: specifically hybridize to any one of the polynucleotides of the invention) 
are contemplated. Probes capable of specifically hybridizing to a polynucleotide can 
differentiate polynucleotide sequences of the invention from other polynucleotide sequences in 
4 the same family of genes or can differentiate human genes from genes of other species, and are 
preferably based on unique nucleotide sequences. 

The sequences Calling within the scope of the present invention are not limited to these 
specific sequences, but also include allelic and species variations thereof. Allelic and species 
8 variations can be routinely determined by comparing the sequence provided SEQ ID NO: 1 - 1 390 1 , 
a representative fragment thereof, or a nucleotide sequence at least 90% identical, preferably 95% 
identical, to SEQ ID NO: 1-13901 with a sequence from another isolate of the same species. 
Furthermore, to accommodate codon variability, the invention includes nucleic acid molecules 

1 2 coding for the same amino acid sequences as do the specific ORPs disclosed herein. In other words, 
in the coding region of an ORF, substitution of one codon for another codon that encodes the same 
amino acid is expressly contemplated. 

The nearest neighbor or homology result for the nucleic acids of the present invention, 

16 including SEQ ID NO: 1-13901 can be obtained by searching a database using an algorithmora 
program. Preferably, a BLAST which stands for Basic Local Alignment Search Tool is used to 
search for local sequence alignments (Altshul, S.F. J Mol. Evol. 36 290-300 (1 993) and Altschul 
S.F.etal. J. Mol.BioL 21:403-410 (1990)). Alternatively a FASTA version 3 searchagainst 

20 Genpept, using Fastxy algorithm. 

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are also 
provided by the present invention. Species homologs may be isolated and identified by making 
suitable probes or primers from the sequences provided herein and screening a suitable nucleic 

24 acid source from the desired species. 

The invention also encompasses allelic variants of the disclosed polynucleotides or 
proteins; that is, naturaUy-occurring alternative forms of the isolated polynucleotide which also 
encode proteins which are identical, homologous or related to that encoded by the 

28 polynucleotides. 

The nucleic acid sequences of the invention are further directed to sequences which 
encode variants of the described nucleic acids. These amino acid sequence variants may be 
prepared by methods known in the art by introducing appropriate nucleotide changes into a 

32 native or variant polynucleotide. There are two variables in the construction of amino acid 
sequence variants: the location of the mutation and the nature of the mutation. Nucleic acids 
encoding the amino acid sequence variants are preferably constructed by mutating the 
polynucleotide to encode an amino acid sequence that does not occur in nature. These nucleic 
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acid alterations can be made at sites that differ in the nucleic acids from different species 
(variable positions) or in highly conserved regions (constant regions). Sites at such locations 
will typically be modified in series, e.g., by substituting first with conservative choices (e.g., 
hydrophobic amino acid to a different hydrophobic amino acid) and then with more distant 
choices {e.g., hydrophobic amino acid to a charged amino acid), and then deletions or insertions 
may be made at the target site. Amino acid sequence deletions generally range from about 1 to 
30 residues, preferably about 1 to 10 residues, and are typically contiguous. Amino acid 
insertions include amino- and/or carboxyl-terminal fusions ranging in length from one to one 
hundred or more residues, as well as intrasequence insertions of single or multiple amino acid 
residues. Intrasequence insertions may range generally from about 1 to 10 amino residues, 
preferably from 1 to 5 residues. Examples of terminal insertions include the heterologous signal 
sequences necessary for secretion or for intracellular targeting in different host cells and 
sequences such as FLAG or poly-histidine sequences useful for purifying the expressed protein. 

In a preferred method, polynucleotides encoding the novel amino acid sequences are 
changed via site-directed mutagenesis. This method uses oligonucleotide sequences to alter a 
polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent 
nucleotides on both sides of the changed amino acid to form a stable duplex on either side of the 
site of being changed. In general, the techniques of site-directed mutagenesis are well known to 
those of skill in the art and this technique is exemplified by publications such as, Edelman et al., 
DNA 2:1 83 (1983). A versatile and efficient method for producing site-specific changes in a 
polynucleotide sequence was published by Zoller and Smith, Nucleic Acids Res. 10:6487-6500 
(1982). PCR may also be used to create amino acid sequence variants of the novel nucleic acids. 
When small amounts of template DNA are used as starting material, primer(s) that differs 
slightly in sequence from the corresponding region in the template DNA can generate the desired 
amino acid variant PCR amplification results in a population of product DNA fragments that 
differ from the polynucleotide template encoding the polypeptide at the position specified by the 
primer. The product DNA fragments replace the corresponding region in the plasmid and this 
gives a polynucleotide encoding the desired amino acid variant 

A further technique for generating amino acid variants is the cassette mutagenesis 
technique described in Wells et al., Gene 34:315 (1985); and other mutagenesis techniques well 
known in the art, such as, for example, the techniques in Sambrook et al., supra, and Current 
Protocols in Molecular Biology, Ausubel et al. Due to the inherent degeneracy of the genetic 
code, other DNA sequences which encode substantially the same or a functionally equivalent 
amino acid sequence may be used in the practice of the invention for the cloning and expression 
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of these novel nucleic acids. Such DNA sequences include those which are capable of 
hybridizing to the appropriate novel nucleic acid sequence under stringent conditions. 

Polynucleotides encoding preferred polypeptide truncations of the invention can be used 
4 to generate polynucleotides encoding chimeric or fusion proteins comprising one or more 
domains of the invention and heterologous protein sequences. 

The polynucleotides of the invention additionally include the complement of any of the 
polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, amplified, or 
8 synthetic) or RNA. Methods and algorithms for obtaining such polynucleotides are well known 
to those of skill in the art and can include, for example, methods for determining hybridization 
conditions that can routinely isolate polynucleotides of the desired sequence identities. 

In accordance with the invention, polynucleotide sequences comprising the mature 

12 protein coding sequences corresponding to any one of SEQ ID NO: 1-13901 , or functional 
equivalents thereof, may be used to generate recombinant DNA molecules that direct the 
expression of that nucleic acid, or a functional equivalent thereof in appropriate host cells. Also 
included are the cDNA inserts of any of the clones identified herein. 

16 A polynucleotide according to the invention can be joined to any of a variety of other 

nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et al. 
(1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). Useful 
nucleotide sequences for joining to polynucleotides include an assortment of vectors, e.g., 

20 plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well known in the 
art. Accordingly, the invention also provides a vector including a polynucleotide of the 
invention and a host cell containing the polynucleotide. In general, the vector contains an origin 
of replication functional in at least one organism, convenient restriction endonuclease sites, and a 

24 selectable marker for the host cell. Vectors according to the invention include expression 
vectors, replication vectors, probe generation vectors, and sequencing vectors. A host cell 
according to the invention can be a prokaryotic or eukaryotic cell and can be a unicellular 
organism or part of a multicellular organism. 

28 The present invention further provides recombinant constructs comprising a nucleic acid 

having any of the nucleotide sequences of SEQ ED NO: 1-13901 or a fragment thereof or any 
other polynucleotides of the invention. In one embodiment, the recombinant constructs of the 
present invention comprise a vector, such as a plasmid or viral vector, into which a nucleic acid 

32 having any of the nucleotide sequences of SEQ ID NO: 1-13901 or a fragment thereof is 

inserted, in a forward or reverse orientation. In the case of a vector comprising one of the ORFs 
of the present invention, the vector may further comprise regulatory sequences, including for 
example, a promoter, operably linked to the ORF. Large numbers of suitable vectors and 
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promoters are known to those of skill in the art and are commercially available for generating the 
recombinant constructs of the present invention. The following vectors are provided by way of 
example. Bacterial: pBs^hagescriptPsiXn^pBluescriptSK^pBsKS^NHSa^pNH^a, 
4 pNHl 8a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). 
Eukaryotic: pWLneo, pSV2cat, pOG44, PXTI, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an expression 
8 control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et aL, 
Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein recombinantly. Many 
suitable expression control sequences are known in the art General methods of expressing 
recombinant proteins are also known and are exemplified in R. Kaufman, Methods in 

12 Enzymology 1 85, 537-566 (1 990), As defined herein "operably linked" means that the isolated 
polynucleotide of the invention and an expression control sequence are situated within a vector 
or cell in such a way that the protein is expressed by a host cell which has been transformed 
(transfected) with the ligated polynucleotide/expression control sequence. 

16 Promoter regions can be selected from any desired gene using CAT (chloramphenicol 

transferase) vectors or other vectors with selectable markers. Two appropriate vectors are 
pKK232-8 and pCM7. Particular named bacterial promoters include lad, lacZ, T3, T7, gpt, 
lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV thymidine 

20 kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of 
the appropriate vector and promoter is well within the level of ordinary skill in the art. 
Generally, recombinant expression vectors will include origins of replication and selectable 
markers permitting transformation of the host cell, e.g., the ampicillin resistance gene of E. coli 

24 and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expressed gene to direct 

transcription of a downstream structural sequence. Such promoters can be derived from operons 
encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), a-factor, acid 
phosphatase, or heat shock proteins, among others. The heterologous structural sequence is 

28 assembled in appropriate phase with translation initiation and termination sequences, and 
preferably, a leader sequence capable of directing secretion of translated protein into the 
periplasmic space or extracellular medium. Optionally, the heterologous sequence can encode a 
fusion protein including an amino terminal identification peptide imparting desired 

32 characteristics, e.g. , stabilization or simplified purification of expressed recombinant product 
Useful expression vectors for bacterial use are constructed by inserting a structural DNA 
sequence encoding a desired protein together with suitable translation initiation and termination 
signals in operable reading phase with a functional promoter. The vector will comprise one or 
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more phenotypic selectable markers and an origin of replication to ensure maintenance of the 
vector and to, if desirable, provide amplification within the host Suitable prokaryotic hosts for 
transformation include K coli, Bacillus subtilis, Salmonella typhimurium and various species 
4 within the genera Pseudomonas, Streptornyces, and Staphylococcus, although others may also be 
employed as a matter of choice. 

As a representative but non-limiting example, useful expression vectors for bacterial use 
can comprise a selectable marker and bacterial origin of replication derived from commercially 
8 available plasmids comprising genetic elements of the well known cloning vector pBR322 
(ATCC 37017). Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine 
Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, USA). These 
pBR322 "backbone" sections are combined with an appropriate promoter and the structural 

12 sequence to be expressed. Following transformation of a suitable host strain and growth of the 
host strain to an appropriate cell density, the selected promoter is induced or derepressed by 
appropriate means {e.g. , temperature shift or chemical induction) and cells are cultured for an 
additional period. Cells are typically harvested by centrifugation, disrupted by physical or 

16 chemical means, and the resulting crude extract retained for further purification. 

Polynucleotides of the invention can also be used to induce immune responses. For 
example, as described in Fan et al., Nat Biotech. 17:870-872 (1999), incorporated herein by 
reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies 

20 against the encoded polypeptide following topical administration of naked plasmid DNA or 
following injection, and preferably intramuscular injection of the DNA. The nucleic acid 
sequences are preferably inserted in a recombinant expression vector and may be in the form of 
naked DNA. 

24 

43 ANTISENSE 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules that 
are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide 

28 sequence of SEQ ID NO: 1-13901 , or fragments, analogs or derivatives thereof An "antisense" 
nucleic acid comprises a nucleotide sequence that is complementary to a "sense" nucleic acid 
encoding a protein, e.g., complementary to the coding strand of a double-stranded cDNA 
molecule or complementary to an mRNA sequence. In specific aspects, antisense nucleic acid 

32 molecules are provided that comprise a sequence complementary to at least about 1 0, 25, 50, 

100, 250 or 500 nucleotides or an entire coding strand, or to only a portion thereof. Nucleic acid 
molecules encoding fragments, homologs, derivatives and analogs of a protein of any of SEQ ID 
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NOS: 13902-27802 or antisense nucleic acids complementary to a nucleic acid sequence of SEQ 
ID NO: 1-13901 are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" 
4 of the coding strand of a nucleotide sequence of the invention. The term "coding region" refers 
to the region of the nucleotide sequence comprising codons which are translated into amino acid 
residues. In another embodiment, the antisense nucleic acid molecule is antisense to a 
"noncoding region" of the coding strand of a nucleotide sequence of the invention. The term 
8 "noncoding region" refers to 5' and 3' sequences which flank the coding region that are not 
translated into amino acids (i.e., also referred to as 5' and 3 f untranslated regions). 

Given the coding strand sequences encoding a nucleic acid disclosed herein (e.g 3 SEQ 
. ID NO: 1-1 390 1 ), antisense nucleic acids of the invention can be designed according to the rules 

12 of Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be 

complementary to the entire coding region of a mRNA, but more preferably is an oligonucleotide 
that is antisense to only a portion of the coding or noncoding region of a mRNA. For example, 
the antisense oligonucleotide can be complementary to the region surrounding the translation 

16 start site of a mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 
30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be 
constructed using chemical synthesis or enzymatic ligation reactions using procedures known in 
the art For example, an antisense nucleic acid (e.g, an antisense oligonucleotide) can be 

20 chemically synthesized using naturally occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical stability 
of the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used. 

24 Examples of modified nucleotides that can be used to generate the antisense nucleic acid 

include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 
4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-raboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 

28 inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 

2- methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methyIaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 

32 2-methylthio-N6-isopentenyladenine, uracil-5-oxy acetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouraciL, 5-methyluracil, 
uracil-5-oxyacetic acid methylester, uracil-5-oxy acetic acid (v), 5-methyl-2-thiouracil, 

3 - (3 -amino-3 -N-2-carboxypropy 1) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 
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antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (Le. , RNA transcribed from the 
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 
4 described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding a protein according to the invention to thereby inhibit expression of the 
8 protein, e.g. , by inhibiting transcription and/or translation. The hybridization can be by 

conventional nucleotide complementarity to form a stable duplex, or, for example, in the case of 
an antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in 
the major groove of the double helix. An example of a route of administration of antisense 

12 nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and then administered 
systemically. For example, for systemic administration, antisense molecules can be modified 
such that they specifically bind to receptors or antigens expressed on a selected cell surface, e.g, 

16 by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell surface 
receptors or antigens. The antisense nucleic acid molecules can also be delivered to cells using 
the vectors described herein. To achieve sufficient intracellular concentrations of antisense 
molecules, vector constructs in which the antisense nucleic acid molecule is placed under the 

20 control of a strong pol II or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
-a n omeric nucleic acid molecule. An -a nomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual -uni ts, the 

24 strands run parallel to each other (Gaultier et ah (1987) Nucleic Acids Res 15: 6625-6641). The 
antisense nucleic acid molecule can also comprise a 2'-o-methylribonucleotide (Inoue et al 
(1987) Nucleic Acids Res 15: 6131-6148) or achimeric RNA -DNA analogue (Inoue et al (1987) 
FEBS Lett 215: 327-330). 

28 

4.4 RIBOZYMES AND PNA MOIETIES 

In still another embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a 
32 single-stranded nucleic acid, such as a mRNA, to which they have a complementary region. 
Thus, ribozymes (e.g., hammerhead ribozymes (described in Haselhoff and Geriach (1988) 
Nature 334:585-591)) can be used to catalytically cleave a mRNA transcripts to thereby inhibit 
translation of a mRNA. A ribozyme having specificity for a nucleic acid of the invention can be 
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designed based upon the nucleotide sequence of a DNA disclosed herein (i.e., SEQ ID NO: 1- 
13901 ). For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in 
which the nucleotide sequence of the active site is complementary to the nucleotide sequence to 
4 be cleaved in a SECX-encoding mRNA. See, e.g., Cech et al U.S. Pat No. 4,987,071; and 
Cech et al. U.S. Pat. No. 5,1 16,742. Alternatively, SECX mRNA can be used to select a 
catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules. See, e.g., 
Bartelc/ al, (1993) Science 261:1411-1418. 
8 Alternatively, gene expression can be inhibited by targeting nucleotide sequences 

complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple helical 
structures that prevent transcription of the gene in target cells. See generally, Helene. (1 991) 
Anticancer Drug Des. 6: 569-84; Helene. et al (1992) Arm. N. Y. Acad Sci. 660:27-36; and 

12 Maher (1992) Bioassays 14: 807-15. 

In various embodiments, the nucleic acids of the invention can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g. , the stability, hybridization, or 
solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic 

16 acids can be modified to generate peptide nucleic acids (see Hyrup et al (1996) Bioorg Med 

Chem 4: 5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid 
mimics, e.g. , DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 

20 backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using 
standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996) above; 
Perry-O'Keefe etal (1996) PNAS 93: 14670-675. 

24 PNAs of the invention can be used in therapeutic and diagnostic applications. For 

example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of 
gene expression by, e.g., inducing transcription or translation arrest or inhibiting replication. 
PNAs of the invention can also be used, e.g., in the analysis of single base pair mutations in a 

28 gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in 
combination with other enzymes, e.g, SI nucleases (Hyrup B. (1996) above); or as probes or 
primers for DNA sequence and hybridization (Hyrup et al (1996), above; Perry-O'Keefe (1996), 
above). 

32 In another embodiment, PNAs of the invention can be modified, e.g., to enhance their 

stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
delivery known in the art For example, PNA-DNA chimeras can be generated that may 
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combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition 
enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA portion while the PNA 
portion would provide high binding affinity and specificity. PNA-DNA chimeras can be linked 

4 using linkers of appropriate lengths selected in terms of base stacking, number of bonds between 
the nucleobases, and orientation (Hyrup (1996) above). The synthesis of PNA-DNA chimeras 
can be performed as described in Hyrup (1 996) above and Finn et cd. (1996) Nucl Acids Res 24: 
3357-63. For example, a DNA chain can be synthesized on a solid support using standard 

8 phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g., 

5X4-methoxytrityl)amino-5 f -deoxy-thymidine phosphoramidite, can be used between the PNA 
and the 5' end of DNA (Mag et al (1989) Nucl Acid Res 17: 5973-88). PNA monomers are then 
coupled in a stepwise manner to produce a chimeric molecule with a 5" PNA segment and a 3 f 
12 DNA segment (Finn et al (1 996) above). Alternatively, chimeric molecules can be synthesized 
with a 5' DNA segment and a 3' PNA segment. See, Petersen et al (1975) BioorgMed Chem 
LettS: 1119-11124. 

In other embodiments, the oligonucleotide may include other appended groups such as 
16 peptides {e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the 
cell membrane (see, e.g., Letsinger et al, 1989, Proc. Natl Acad Sci. U.S.A. 86:6553-6556; 
Lemaitre et al, 1987, Proc. Natl Acad Sci 84:648-652; PCT Publication No. W088/09810) or 
the blood-brain banier (see, e.g. , PCT Publication No. W089/10134). In addition, 
20 oligonucleotides can be modified with hybridization triggered cleavage agents (See, e.g. , Krol et 
al, 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., Zon, 1988, Phanru Res. 
5: 539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g , a 
peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
24 cleavage agent, etc. 

4.5 HOSTS 

The present invention further provides host cells genetically engineered to contain the 
28 polynucleotides of the invention. For example, such host cells may contain nucleic acids of the 
invention introduced into the host cell using known transformation, transfection or infection 
methods. The present invention still further provides host cells genetically engineered to express 
the polynucleotides of the invention, wherein such polynucleotides are in operative association 
32 ;with a regulatory sequence heterologous to the host cell which drives expression of the 
polynucleotides in the cell. 

Knowledge of nucleic acid sequences allows for modification of cells to permit, or 
increase, expression of endogenous polypeptide. Cells can be modified (e.g., by homologous 
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recombination) to provide increased polypeptide expression by replacing, in whole or in part, the 
naturally occurring promoter with all or part of a heterologous promoter so that the cells express 
the polypeptide at higher levels. The heterologous promoter is inserted in such a manner that it 
4 is operatively linked to the encoding sequences. See, for example, PCT International Publication 
No. WO94/12650, PCT International Publication No. WO92/20808, and PCT International 
Publication No. W09 1/09955. It is also contemplated that, in addition to heterologous promoter 
DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which 
8 encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or 
intron DNA may be inserted along with the heterologous promoter DNA. If linked to the coding 
sequence, amplification of the marker DNA by standard selection methods results in co- 
amplification of the desired protein coding sequences in the cells. 

12 The host cell can be a higher eukaiyotic host cell, such as a mammalian cell, a lower 

eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell. Introduction of the recombinant construct into the host cell can be effected by 
calcium phosphate transfectioh, DEAE, dextran mediated transfection, or electroporation (Davis, 

16 L. et aL, Basic Methods in Molecular Biology (1986)). The host cells containing one of the 
polynucleotides of the invention, can be used in conventional manners to produce the gene 
product encoded by the isolated fragment (in the case of an ORF) or can be used to produce a 
heterologous protein under the control of the EMF. 

20 Any host/vector system can be used to express one or more of the ORFs of the present 

invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, Cv-1 cell, 
COS cells, 293 cells, and Sf9 cells, as well as prokaryotic host such as E. coli and B. subtilis. 
The most preferred cells are those which do not normally express the particular polypeptide or 

24 protein or which expresses the polypeptide or protein at low natural level. Mature proteins can 
be expressed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate 
promoters. Cell-free translation systems can also be employed to produce such proteins using • 
RNAs derived from the DNA constructs of the present invention. Appropriate cloning and 

28 expression vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et 
al., in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, New 
York (1989), the disclosure of which is hereby incorporated by reference. 

Various mammalian cell culture systems can also be employed to express recombinant 

32 protein. Examples of mammalian expression systems include the COS-7 lines of monkey kidney 
fibroblasts, described by Gluzman, Cell 23:175 (1981). Other cell lines capable of expressing a 
compatible vector are, for example, the C127, monkey COS cells, Chinese Hamster Ovary 
(CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 cells, 3T3 
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cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell strains derived 
from in vitro culture of primary tissue, primary explants, HeLa cells, mouse L cells, BHK, 
HL-60, U937, HaK or Jurkat cells. Mammalian expression vectors will comprise an origin of 
4 replication, a suitable promoter and also any necessary ribosome binding sites, polyadenylation 
site, splice donor and acceptor sites, transcriptional termination sequences, and 5' flanking 
nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for example, 
SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide 
8 the required nontranscribed genetic elements. Recombinant polypeptides and proteins produced 
in bacterial culture are usually isolated by initial extraction from cell pellets, followed by one or 
more salting-out, aqueous ion exchange or size exclusion chromatography steps. Protein 
refolding steps can be used, as necessary, in completing configuration of the mature protein. 
12 Finally, high performance liquid chromatography (HPLC) can be employed for final purification 
steps. Microbial cells employed in expression of proteins can be disrupted by any convenient 
method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing 
agents. 

16 Alternatively, it may be possible to produce the protein in lower eukaryotes such as yeast 

or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or 
any yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 

20 strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial 

strain capable of expressing heterologous proteins. If the protein is made in yeast or bacteria, it 
may be necessary to modify the protein produced therein, for example by phosphorylation or 
glycosylation of the appropriate sites, in order to obtain the functional protein. Such covalent 

24 attachments may be accomplished using known chemical or enzymatic methods. 

In another embodiment of the present invention, cells and tissues may be engineered to 
express an endogenous gene comprising the polynucleotides of the invention under the control of 
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene 

28 may be replaced by homologous recombination. As described herein, gene targeting can be used 
to replace a gene's existing regulatory region with a regulatory sequence isolated from a different 
gene or a novel regulatory sequence synthesized by genetic engineering methods. Such 
regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions, 

32 negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or 
combinations of said sequences. Alternatively, sequences which affect the structure or stability 
of the RNA or protein produced may be replaced, removed, added, or otherwise modified by 
targeting. These sequence include polyadenylation signals, mRNA stability elements, splice 
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sites, leader sequences for enhancing or modifying transport or secretion properties of the 
protein, or other sequences which alter or improve the function or stability of protein or RNA 
molecules. 

. 4 The targeting event may be a simple insertion of the regulatory sequence, placing the 

gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple deletion 
of a regulatory element, such as the deletion of a tissue-specific negative regulatory element 
8 Alternatively, the targeting event may replace an existing element; for example, a tissue-specific 
enhancer can be replaced by an enhancer that has broader or different cell-type specificity than 
the naturally occurring elements. Here, the naturally occurring sequences are deleted and new 
sequences are added. In all cases, the identification of the targeting event may be facilitated by 

12 the use of one or more selectable marker genes that are contiguous with the targeting DNA, 

allowing for the selection of cells in which the exogenous DNA has integrated into the host cell 
genome. The identification of the targeting event may also be facilitated by the use of one or 
more marker genes exhibiting the property of negative selection, such that the negatively 

1 6 selectable marker is linked to the exogenous DNA, but configured such that the negatively 
selectable marker flanks the targeting sequence, and such that a correct homologous 
recombination event with sequences in the host cell genome does not result in the stable 
integration of the negatively selectable marker. Markers useful for this purpose include the 

20 Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine 
phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with 
this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to 

24 Chappel; U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No. 
PCT/US92/09627 (WO93/09222) by Selden et at; and International Application No. 
PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by reference 
herein in its entirety. 

28 

4,6 POLYPEPTIDES OF THE INVENTION 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide 
comprising: the amino acid sequences set forth as any one of SEQ ID NOS: 13902-278Q2 or an 
32 amino acid sequence encoded by any one of the nucleotide sequences SEQ ID NO: 1-13901 or 
the corresponding full length or mature protein. Polypeptides of the invention also include 
polypeptides preferably with biological or immunological activity that are encoded by: (a) a 
polynucleotide having any one of the nucleotide sequences set forth in SEQ ID NO: 1-13901 or 
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(b) polynucleotides encoding any one of the amino acid sequences set forth as SEQ ID NOS: 
13902-27802 or (c) polynucleotides that hybridize to the complement of the polynucleotides of 
either (a) or (b) under stringent hybridization conditions. The invention also provides 
4 biologically active or immunologically active variants of any of the amino acid sequences set 
forth as SEQ ID NOS: 13902-27802 or the corresponding full length or mature protein; and 
"substantial equivalents" thereof (e.g., with at least about 65%, at least about 70%, at least about 
75%, at least about 80%, at least about 85%, 86%, 87%, 88%, 89%, at least about 90%, 91%, 
8 92%, 93%, 94%, typically at least about 95%, 96%, 97%, more typically at least about 98%, or 
most typically at least about 99% amino acid identity) that retain biological activity. 
Polypeptides encoded by allelic variants may have a similar, increased, or decreased activity 
compared to polypeptides comprising SEQ ID NOS: 13902-27802. 

12 Fragments of the proteins of the present invention which are capable of exhibiting 

biological activity are also encompassed by the present invention. Fragments of the protein may 
be in linear form or they may be cyclized using known methods, for example, as described in H. 
U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R. S. McDowell, et al., J. Amer. 

16 Chem. Soc. 1 14, 9245-9253 (1992), both of which are incorporated herein by reference. Such 
fragments may be fused to carrier molecules such as immunoglobulins for many purposes, 
including increasing the valency of protein binding sites. 

The present invention also provides both full-length and mature forms (for example, 

20 without a signal sequence or precursor sequence) of the disclosed proteins. The protein coding 
sequence is identified in the sequence listing by translation of the disclosed nucleotide 
sequences. The mature form of such protein may be obtained by expression of a full-length 
polynucleotide in a suitable mammalian cell or other host cell. The sequence of the mature form 

24 of the protein is also determinable from the amino acid sequence of the full-length form. Where 
proteins of the present invention are membrane bound, soluble forms of the proteins are also 
provided. In such forms, part or all of the regions causing the proteins to be membrane bound 
are deleted so that the proteins are fully secreted from the cell in which they are expressed. 

28 Protein compositions of the present invention may further comprise an acceptable carrier, 

such as a hydrophilic, e.g., pharmaceutical^ acceptable, carrier. 

The present invention further provides isolated polypeptides encoded by the nucleic acid 
fragments of the present invention or by degenerate variants of the nucleic acid fragments of the 

32 present invention. By "degenerate variant" is intended nucleotide fragments which differ from a 
nucleic acid fragment of the present invention (e.g., an ORF) by nucleotide sequence but, due to 
the degeneracy of the genetic code, encode an identical polypeptide sequence. Preferred nucleic 
acid fragments of the present invention are the ORFs that encode proteins. 
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A variety of methodologies known in the art can be utilized to obtain any one of the 
isolated polypeptides or proteins of the present invention* At the simplest level, the amino acid 
sequence can be synthesized using commercially available peptide synthesizers. The 
4 synthetically-constructed protein sequences, by virtue of sharing primary, secondary or tertiary 
structural and/or conformational characteristics with proteins may possess biological properties 
in common therewith, including protein activity. This technique is particularly useful in 
producing small peptides and fragments of larger polypeptides. Fragments are useful, for 
8 example, in generating antibodies against the native polypeptide. Thus, they may be employed 
as biologically active or immunological substitutes for natural, purified proteins in screening of 
therapeutic compounds and in immunological processes for the development of antibodies. 

The polypeptides and proteins of the present invention can alternatively be purified from 

12 cells which have been altered to express the desired polypeptide or protein. As used herein, a 
cell is said to be altered to express a desired polypeptide or protein when the cell, through genetic 
manipulation, is made to produce a polypeptide or protein which it normally does not produce or 
which the cell normally produces at a lower level. One skilled in the art can readily adapt 

16 procedures for introducing and expressing either recombinant or synthetic sequences into 

eukaiyotic or prokaryotic cells in order to generate a cell which produces one of the polypeptides 
or proteins of the present invention. 

The invention also relates to methods for producing a polypeptide comprising growing a 

20 culture of host cells of the invention in a suitable culture medium, and purifying the protein from 
the cells or the culture in which the cells are grown. For example, the methods of the invention 
include a process for producing a polypeptide in which a host cell containing a suitable 
expression vector that includes a polynucleotide of the invention is cultured under conditions that 

24 allow expression of the encoded polypeptide. The polypeptide can be recovered from the 

culture, conveniently from the culture medium, or from a lysate prepared from the host cells and 
further purified. Preferred embodiments include those in which the protein produced by such 
process is a full length or mature form of the protein. 

28 In an alternative method, the polypeptide or protein is purified from bacterial cells which 

naturally produce the polypeptide or protein. One skilled in the art can readily follow known 
methods for isolating polypeptides and proteins in order to obtain one of the isolated 
polypeptides or proteins of the present invention. These include, but are not limited to, 

32 immunochromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography, 
and immuno-affinity chromatography. See, e.g. 7 Scopes, Protein Purification: Principles and 
Practice, Springer-Verlag (1994); Sambrook, et al., in Molecular Cloning: A Laboratory 
Manual; Ausubel et al., Current Protocols in Molecular Biology. Polypeptide fragments that 
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retain biological/immunological activity include fragments comprising greater than about 100 
amino acids, or greater than about 200 amino acids, and fragments that encode specific protein 
domains. 

4 The purified polypeptides can be used in in vitro binding assays which are well known in 

the art to identify molecules which bind to the polypeptides. These molecules include but axe not 
limited to, for e.g., small molecules, molecules from combinatorial libraries, antibodies or other 
proteins. The molecules.identified in the binding assay are then tested for antagonist or agonist 
8 activity in in vivo tissue culture or animal models that are well known in the art In brief, the 
molecules are titrated into a plurality of cell cultures or animals and then tested for either 
cell/animal death or prolonged survival of the animal/cells. 

In addition, the peptides of the invention or molecules capable of binding to the peptides 

12 may be complexed with toxins, e.g., ricin or cholera, or with other compounds that are toxic to 
cells. The toxin-binding molecule complex is then targeted to a tumor or other cell by the 
specificity of the binding molecule for SEQ ID NOS: 13902-27802. 

The protein of the invention may also be expressed as a product of transgenic animals, 

16 eg., as a component of the milk of transgenic cows, goats, pigs, or sheep which are characterized 
by somatic or germ cells containing a nucleotide sequence encoding the protein. 

The proteins provided herein also include proteins characterized by amino acid sequences 
similar to those of purified proteins but into which modification are naturally provided or 

20 deliberately engineered. For example, modifications, in the peptide or DNA sequence, can be 
made by those skilled in the art using known techniques. Modifications of interest in the protein 
sequences may include the alteration, substitution, replacement, insertion or deletion of a 
selected amino acid residue in the coding sequence. For example, one or more of the cysteine 

24 residues may be deleted or replaced with another amino acid to alter the conformation of the 
molecule. Techniques for such alteration, substitution, replacement, insertion or deletion are 
well known to those skilled in the art (see, e.g., U.S. Pat No. 4,5 18,584). Preferably, such 
alteration, substitution, replacement, insertion or deletion retains the desired activity of the 

28 protein. Regions of the protein that are important for the protein function can be determined by 
various methods known in the art including the alanine-scanning method which involved 
systematic substitution of single or strings of amino acids with alanine, followed by testing the 
resulting alanine-containing variant for biological activity. This type of analysis determines the 

32 importance of the substituted amino acid(s) in biological activity. Regions of the protein that are 
important for protein function may be determined by the eMATRIX program. 

Other fragments and derivatives of the sequences of proteins which would be expected to 
retain protein activity in whole or in part and are usefiil for screening or other immunological 
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methodologies may also be easily made by those skilled in the art given the disclosures herein. 
Such modifications are encompassed by the present invention. 

The protein may also be produced by operably linking the isolated polynucleotide of the 

4 invention to suitable control sequences in one or more insect expression vectors, and employing 
an insect expression system. Materials and methods for bacidovirus/insect cell expression 
systems are commercially available in kit form from, e.g., Invitrogen, San Diego, Calif., U.S.A. 
(the MaxBat™ kit), and such methods are well known in the art, as described in Summers and 
8 Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), incorporated herein by 
reference. As used herein, an insect cell capable of expressing a polynucleotide of the present 
invention is "transformed." 

The protein of the invention may be prepared by culturing transformed host cells under 

12 culture conditions suitable to express the recombinant protein. The resulting expressed protein 
may then be purified from such culture (Le., from culture medium or cell extracts) using known 
purification processes, such as gel filtration and ion exchange chromatography. The purification 
of the protein may also include an affinity column containing agents which will bind to the 

16 protein; one or more column steps over such affinity resins as concanavalin A-agarose, 
heparin-toyopearl™ or Cibacrom blue 3GA Sepharose™; one or more steps involving 
hydrophobic interaction chromatography using such resins as phenyl ether, butyl ether, or propyl 
ether; or immunoaffinity chromatography. 

20 Alternatively, the protein of the invention may also be expressed in a form which will 

facilitate purification. For example, it may be expressed as a fusion protein, such as those of 
maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as a 
His tag. Kits for expression .and purification of such fusion proteins are commercially available 

24 from New England BioLab (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and Invitrogen, 

respectively. The protein can also be tagged with an epitope and subsequently purified by using 
a specific antibody directed to such epitope. One such epitope ("FLAG®") is commercially 
available from Kodak (New Haven, Conn.). 

28 Finally, one or more reverse-phase high performance liquid chromatography (RP- HPLC) 

steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl or other 
aliphatic groups, can be employed to further purify the protein. Some or all of the foregoing 
purification steps, in various combinations, can also be employed to provide a substantially 

32 homogeneous isolated recombinant protein. The protein thus purified is substantially free of 

other mammalian proteins and is defined in accordance with the present invention as an "isolated 
protein." 
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The polypeptides of the invention include analogs (variants). This embraces fragments, 
as well as peptides in which one or more amino acids has been deleted, inserted, or substituted. 
Also, analogs of the polypeptides of the invention embrace fusions of the polypeptides or 
4 modifications of the polypeptides of the invention, wherein the polypeptide or analog is fused to 
another moiety or moieties, e.g. , targeting moiety or another therapeutic agent. Such analogs 
may exhibit improved properties such as activity and/or stability. Examples of moieties which 
may be fused to the polypeptide or an analog include, for example, targeting moieties which 
8 provide for the delivery of polypeptide to pancreatic cells, e.g. , antibodies to pancreatic cells, 
antibodies to immune cells such as T-ceils, monocytes, dendritic cells, granulocytes, etc., as well 
as receptor and ligands expressed on pancreatic or immune cells. Other moieties which may be 
fused to the polypeptide include therapeutic agents which are used for treatment, for example, 
12 immunosuppressive drugs such as cyclosporin, SK506, azathioprine, CD3 antibodies and 

steroids. Also, polypeptides may be fused to immune modulators, and other cytokines such as 
alpha or beta interferon. 

16 4.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE IDENTITY 

AND SIMILARITY 

Preferred identity and/or similarity are designed to give the largest match between the 
sequences tested. Methods to determine identity and similarity are codified in computer 

20 programs including, but are not limited to, the GCG program package, including GAP 

(Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group, 
University of Wisconsin, Madison, WI), BLASTP, BLASTN, BLASTX, FASTA (Altschul, S.F. 
et al., J. Molec. Biol 215:403-410 (1990), PSI-BLAST (Altschul SJP. et aL, Nucleic Acids Res. 

24 vol. 25, pp. 3389-3402, herein incorporated by reference), eMatrix software (Wu et al., J. Comp. 
Biol., Vol. 6, pp. 219-235 (1999), herein incorporated by reference), eMotif software (Nevill- 
Manning et al, ISMB-97, Vol. 4, pp. 202-209, herein incorporated by reference), pFam software 
(Sonnhammer et al., Nucleic Acids Res., Vol. 26(1), pp. 320-322 (1998), herein incorporated by 

28 reference) and the Kyte-Doolittle hydrophobocity prediction algorithm (J. Mol Biol, 1 57, pp. 
105-3 1 (1982), incorporated herein by reference). The BLAST programs are publicly available 
from the National Center for Biotechnology Information (NCBI) and other sources (BLAST 
Manual, Altschul, S., et al. NCB NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. Mol. 

32 Biol. 215:403-410 (1990). 

4,7 CHIMERIC AND FUSION PROTEINS 

The invention also provides chimeric or fusion proteins. As used herein, a "chimeric 

protein" or "fusion protein" comprises a polypeptide of the invention operatively linked to 
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another polypeptide. Within a fusion protein the polypeptide according to the invention can 
correspond to all or a portion of a protein according to the invention- In one embodiment, a 
fusion protein comprises at least one biologically active portion of a protein according to the 

4 invention. In another embodiment, a fusion protein comprises at least two biologically active 
portions of a protein according to the invention. Within the fusion protein, the term "operatively 
linked" is intended to indicate that the polypeptide according to the invention and the other 
polypeptide are fused in-frame to each other. The polypeptide can be fused to the N-terminus or 

8 C-terminus. 

For example, in one embodiment a fusion protein comprises a polypeptide according to 
the invention operably linked to the extracellular domain of a second protein. 

In another embodiment, the fusion protein is a GST-fusion protein in which the 

12 polypeptide sequences of the invention are fused to the C-terminus of the GST (i. e. , glutathione 
S-transferase) sequences. 

In another embodiment, the fusion protein is an immunoglobulin fusion protein in which 
the polypeptide sequences according to the invention comprises one or more domains are fused 

16 to sequences derived from a member of the immunoglobulin* protein family. The 

immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical 
compositions and administered to a subject to inhibit an interaction between a ligand and a 
protein of the invention on the surface of a cell, to thereby suppress signal transduction in vivo. 

20 The immunoglobulin fusion proteins can be used to affect the bioavailability of a cognate ligand. 
Inhibition of the ligand/protein interaction may be useful therapeutically for both the treatment of 
proliferative and differentiative disorders, e.g., cancer as well as modulating (e.g., promoting or 
inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of the invention can be 

24 used as immunogens to produce antibodies in a subject, to purify ligands, and in screening assays 
to identify molecules that inhibit the interaction of a polypeptide of the invention with a ligand. 

A chimeric or fusion protein of the invention can be produced by standard recombinant 
DNA techniques. For example, DNA fragments coding for the different polypeptide sequences 

28 are ligated together in-frame in accordance with conventional techniques, e.g., by employing 
blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to 
avoid undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can 

32 be synthesized by conventional techniques including automated DNA synthesizers. 

Alternatively, PCR amplification of gene fragments can be carried out using anchor primers that 
give rise to complementary overhangs between two consecutive gene fragments that can 
subsequently be annealed, and reamplified to generate a chimeric gene sequence (see, for 
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example, Ausubel et al. (eds.) Current Protocols in Molecular Biology, John Wiley & 
Sons, 1992). Moreover, many expression vectors are commercially available that already encode 
a fusion moiety (e.g., a GST polypeptide). A nucleic acid encoding a polypeptide of the 

-J 

4 invention can be cloned into such an expression vector such that the fusion moiety is linked 
in-frame to the protein of the invention. 

4.8 GENE THERAPY 
8 Mutations in the polynucleotides of the invention gene may result in loss of normal 

function of the encoded protein. The invention thus provides gene therapy to restore normal 
activity of the polypeptides of the invention; or to treat disease states involving polypeptides of 
the invention. Delivery of a functional gene encoding polypeptides of the invention to 

12 appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more particularly 
viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of 
physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for example, 
Anderson, Nature, supplement to vol. 392, no. 6679, pp.25-20 (1998). Fof additional reviews of 

16 gene therapy technology see Friedmann, Science, 244: 1275-1281 (1989); Verma, Scientific 
American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). Introduction of any one of 
the nucleotides of the present invention or a gene encoding the polypeptides of the present 
invention can also be accomplished with extrachromosomal substrates (transient expression) or 

20 artificial chromosomes (stable expression). Cells may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. 
Alternatively, it is contemplated that in other human disease states, preventing the expression of 

24 or inhibiting the activity of polypeptides of the invention will be useful in treating the disease 
states. It is contemplated that antisense therapy or gene therapy could be applied to negatively 
regulate the expression of polypeptides of the invention. 

Other methods inhibiting expression of a protein include the introductionof antisense 

28 molecules to the nucleic acids of the present invention, their complements, or their translated RNA 
sequences, by methods known in the art. Further, the polypeptides of the present invention can be 
inhibited by using targeted deletion methods, or the insertion of a negative regulatory element such 
as a silencer, which is tissue specific. 

32 The present invention still further provides cells genetically engineered in vivo to express the 

polynucleotides of the invention, wherein such polynucleotides are in operative association with a 
regulatory sequence heterologous to the host cell which drives expression of the polynucleotides in 
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the cell. These methods can be used to increase or decrease the expression of the polynucleotides of 
the present invention. 

Knowledge of DNA sequences provided by the invention allows for modification of cells to 
permit, increase, or decrease, expression of endogenous polypeptide. Cells can be modified (e.g., by 
homologous recombination) to provide increased polypeptide expression by replacing, in whole or 
in part, the naturally occurring promoter with all or part of a heterologouspromoter so that the cells 
express the protein at higher levels. The heterologous promoter is inserted in such a manner that it is 
operatively linked to the desired protein encoding sequences. See, for example, PCT International 
PublicationNo. WO 94/12650, PCT International PublicationNo. WO 92/20808, and PCT 
InternationalPublicationNo. WO 91/09955. It is also contemplated that, in addition to heterologous 
promoter DNA, amplifiable marker DNA (e.g. , ada, dhfr, and the multifunctional CAD gene which 
encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or 
intron DNA may be inserted along with the heterologous promoter DNA. If linked to the desired 
protein coding sequence, amplification of the marker DNA by standard selection methods results in 
co-amplification of the desired protein coding sequences in the cells. 

In another embodiment of the present invention, cells and tissues may be engineered to 
express an endogenous gene comprising the polynucleotides of the invention under the control of 
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene may 
be replaced by homologous recombination. As described herein, gene targeting can be used to 
replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene 
or a novel regulatory sequence synthesized by genetic engmeeringmethods. Such regulatory 
sequences may be comprised of promoters, enhancers, scaffold-attachmentregions, negative 
regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations 
of said sequences. Alternatively, sequences which affect the structure or stability of the RNA or 
protein produced may be replaced, removed, added, or otherwise modified by targeting. These 
sequences include polyadenylation signals, mRNA stability elements, splice sites, leader sequences 
for enhancing or modifying transport or secretion properties of the protein, or other sequences 
which alter or improve the function or stability of protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the gene 
under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both 
upstream of a gene. Alternatively, the targeting event may be a simple deletion of a regulatory 
element, such as the deletion of a tissue-specific negative regulatory element. Alternatively, the 
targeting event may replace an existing element; for example, a tissue-specific enhancer can be 
replaced by an enhancer that has broader or different cell-type specificity than the naturally 
occurring elements. Here, the naturally occurring sequences are deleted and new sequences are 
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added. In all cases, the identification of the targeting event may be facilitated by the use of one or 
more selectable marker genes that are contiguous with the targeting DNA, allowing for the selection 
of cells in which the exogenous DNA has integrated into the cell genome. The identification of the 

4 targeting event may also be facilitated by the use of one or more marker genes exhibiting the 

property of negative selection, such that the negatively selectable marker is linked to the exogenous 
DNA, but configured such that the negatively selectable marker flanks the targeting sequence, and 
such that a correct homologous recombination event with sequences in the host cell genome does 

8 not result in the stable integration of the negatively selectable marker. Markers useful for this 
purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial 
xanthine-gxianinephosphoribosyl-transferase(gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with this 
1 2 aspect of the invention are more particularly described in U.S. Patent No. 5,272,07 1 to Chappel; 
U.S. Patent No. 5^578,461 to Sherwin et al.; International ApplicationNo. PCT/US92/09627 
(WO93/09222)by Seldenet aL; and International ApplicationNo. PCT/US90/06436 
(WO91/06667) by Skoultchi et al., each of which is incorporated by reference herein in its entirety. 

16 

4.9 TRANSGENIC ANIMALS 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over expressed or 

20 inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244: 1288-1 292 (1989)] . Animals in which the gene is over expressed, under the regulatory 
control of exogenous or endogenous promoter elements, are known as transgenic animals. 
Animals in which an endogenous gene has been inactivated by homologous recombination are 

24 referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be 
prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic 
animals are useful to determine the roles polypeptides of the invention play in biological 
processes, and preferably in disease states. Transgenic animals are useful as model systems to 

28 identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human 
mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT 
Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of a promoter of the 

32 polynucleotides of the invention is either activated or inactivated to alter the level of expression 
of the polypeptides of the invention. Inactivation can be carried out using homologous 
recombination methods described above. Activation can be achieved by supplementing or even 
replacing the homologous promoter to provide for increased protein expression. The homologous 
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promoter can be supplemented by insertion of one or more heterologous enhancer elements 
known to confer promoter activation in a particular tissue. 

The polynucleotides of the present invention also make possible the development, 
4 through, e.g. , homologous recombination or knock out strategies, of animals that fail to express 
polypeptides of the invention or that express a variant polypeptide. Such animals are useful as 
models for studying the in vivo activities of polypeptide as well as for studying modulators of the 
polypeptides of the invention. 
8 In preferred methods to determine biological functions of the polypeptides of the 

invention in vrvo, one or more genes provided by the invention are either over expressed or 
inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 

12 control of exogenous or endogenous promoter elements, are known as transgenic animals. 

Animals in which an endogenous gene has been inactivated by homologous recombination are 
referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be 
prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic 

16 animals are useful to determine the roles polypeptides of the invention play in biological 

processes, and preferably in disease states. Transgenic animals are useful as model systems to 
identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human 
mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT 

20 Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of the polynucleotides of the 
invention promoter is either activated or inactivated to alter the level of expression of the 
polypeptides of the invention. Inactivation can be carried out using homologous recombination 

24 methods described above. Activation can be achieved by supplementing or even replacing the 
homologous promoter to provide for increased protein expression. The homologous promoter 
can be supplemented by insertion of one or more heterologous enhancer elements known to 
confer promoter activation in a particular tissue. 

28 

4.10 USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit one or 
more of the uses or biological activities (including those associated with assays cited herein) 
32 identified herein. Uses or activities described for proteins of the present invention may be 

provided by administration or use of such proteins or of polynucleotides encoding such proteins 
(such as, for example, in gene therapies or vectors suitable for introduction of DNA). The 
mechanism underlying the particular condition or pathology will dictate whether the 
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polypeptides of the invention, the polynucleotides of the invention or modulators (activators or 
inhibitors) thereof would be beneficial to the subject in need of treatment. Thus, 'therapeutic 
compositions of the invention" include compositions comprising isolated polynucleotides 

4 (including recombinant DNA molecules, cloned genes and degenerate variants thereof) or 
polypeptides of the invention (including foil length protein, mature protein and truncations or 
domains thereof), or compounds and other substances that modulate the overall activity of the 
target gene products, either at the level of target gene/protein expression or target protein 

8 activity. Such modulators include polypeptides, analogs, (variants), including fragments and 
fusion proteins, antibodies and other binding proteins; chemical compounds that directly or 
indirectly activate or inhibit the polypeptides of the invention (identified, e.g. , via drug screening 
assays as described herein); antisense polynucleotides and polynucleotides suitable for triple 
1 2 helix formation; and in particular antibodies or other binding partners that specifically recognize 
one or more epitopes of the polypeptides of the invention. 

The polypeptides of the present invention may likewise be involved in cellular activation 
or in one of the other physiological pathways described herein. 

16 

4.10.1 RESEARCH USES AND UTILITIES 

The polynucleotides provided by the present invention can be used by the research 
community for various purposes. The polynucleotides can be used to express recombinant 

20 protein for analy sis, characterization or therapeutic use; as markers for tissues in which the 

corresponding protein is preferentially expressed (either constitutively or at a particular stage of 
tissue differentiation or development or in disease states); as molecular weight markers on gels; 
as chromosome markers or tags (when labeled) to identify chromosomes or to map related gene 

24 positions; to compare with endogenous DNA sequences in patients to identify potential genetic 
disorders; as probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" known 
sequences in the process of discovering other novel polynucleotides; for selecting and making 

28 oligomers for attachment to a "gene chip" or other support, including for examination of 

expression patterns; to raise anti-protein antibodies using DNA immunization techniques; and as 
an antigen to raise anti-DNA antibodies or elicit another immune response. Where the 
polynucleotide encodes a protein which binds or potentially binds to another protein (such as, for 

32 example, in a receptor-ligand interaction), the polynucleotide can also be used in interaction trap 
assays (such as, for example, that described in Gyuris et ah, Cell 75:791-803 (1993)) to identify 
polynucleotides encoding the other protein with which binding occurs or to identify inhibitors of 
the binding interaction. 
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The polypeptides provided by the present invention can similarly be used in assays to 
determine biological activity, including in a panel of multiple proteins for high-throughput 
screening; to raise antibodies or to elicit another immune response; as a reagent (including the 
4 labeled reagent) in assays designed to quantitatively determine levels of the protein (or its 
receptor) in biological fluids; as markers for tissues in which the corresponding polypeptide is 
preferentially expressed (either constitutively or at a particular stage of tissue differentiation or 
development or in a disease state); and, of course, to isolate correlative receptors or ligands. 
8 Proteins involved in these binding interactions can also be used to screen for peptide or small 
molecule inhibitors or agonists of the binding interaction. 

Any or all of these research utilities are capable of being developed into reagent grade or 
kit format for commercialization as research products. 
12 Methods for performing the uses listed above are well known to those skilled in the art. 

References disclosing such methods include without limitation "Molecular Cloning: A 
Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., R F. Fritsch 
and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning 
16 Techniques", Academic Press, Berger, S. L. and A. R Kimmel eds., 1987. 

4.10.2 NUTRITIONAL USES 

Polynucleotides and polypeptides of the present invention can also be used as nutritional 
20 sources or supplements. Such uses include without limitation use as a protein or amino acid 

supplement, use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In 
such cases the polypeptide or polynucleotide of the invention can be added to the feed of a 
particular organism or can be administered as a separate solid or liquid preparation, such as in the 
24 form of powder, pills, solutions, suspensions or capsules. In the case of microorganisms, the 
polypeptide or polynucleotide of the invention can be added to the medium in or on which the 
microorganism is cultured. 

28 4.103 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 

ACTIVITY 

A polypeptide of the present invention may exhibit activity relating to cytokine, cell 
proliferation (either inducing or inhibiting) or cell differentiation (either inducing or inhibiting) 
32 activity or may induce production of other cytokines in certain cell populations. A 

polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Many 
protein factors discovered to date, including all known cytokines, have exhibited activity in one 
or more factor-dependent cell proliferation assays, and hence the assays serve as a convenient 
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confinnation of cytokine activity. The activity of therapeutic compositions of the present 
invention is evidenced by any one of a number of routine factor dependent cell proliferation 
assays for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/1 1, BaF3, 
4 MC9/G, M+(preB M+), 2E8, RB5, DAI, 123, T1165, HT2, CTLL2, TF-1, Mo7e, CMK, 
HUVEC, and Caco. Therapeutic compositions of the invention can be used in the following: 

Assays for T-cell or thymocyte proliferation include without limitation those described 
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M Kruisbeek, D. H. Margulies, E. 
8 M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in 
Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Bertagnolli et al., J. Immunol. 
145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Bertagnolli, 

12 et al., I. Immunol. 149:3778-3783, 1992; Bowman et al., L Immunol. 152:1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node cells or 
thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, 
Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan 

16 eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of mouse 
and human interleiakin-y, Schreiber, R. D. In Current Protocols in Immunology. J. E. e.a Coligan 
eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells 

20 include, without limitation, those described in: Measurement of Human and Murine Interleukin 2 
and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current Protocols in 
Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, Toronto. 1991; 
deVriesetal., J.Exp. Med. 173:1205-1211, 1991; Moreauetal., Nature 336:690-692, 1988; 

24 Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; Measurement of mouse 
and human interleukin 6-Nordan, R. In Current Protocols in Immunology. J. E. Coligan eds. Vol 
1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; Smith et al., Proc. Natl. Aced. Sci. 
U.S.A. 83:1857-1861, 1986; Measurement of human Interleukin 11-Bennett, F., Giannotti, J., 

28 Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 
6.15.1 John Wiley and Sons, Toronto. 1991; Measurement of mouse and human Interleukin 
9— Ciarletta, A., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. 
J. E. Coligan eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 1991. 

32 Assays for T-cell clone responses to antigens (which will identify, among others, proteins 

that affect APC-T cell interactions as well as direct T-cell effects by measuring proliferation and 
cytokine production) include, without limitation, those described in: Current Protocols in 
Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W Strober, 
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Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse 
Lymphocyte Function; Chapter 6, Cytokines and their cellular receptors; Chapter 7, 
Immunologic studies in Humans); Weinberger et aL, Proc. Natl. Acad Sci. USA 77:6091-6095, 
4 1980; Weinberger et aL, Eur. J. Inunun. 1 1:405-41 1, 1981; Takai et aL, X Immunol. 
137:3494-3500, 1986; Takai et aL, J. Immunol. 140:508-512, 1988. 

4.10.4 STEM CELL GROWTH FACTOR ACTIVITY 

8 A polypeptide of the present invention may exhibit stem cell growth factor activity and 

be involved in the proliferation, differentiation and survival of pluripotent and totipotent stem 
cells including primordial germ cells, embryonic stem cells, hematopoietic stem cells and/or 
germ line stem cells. Administration of the polypeptide of the invention to stem cells in vivo or 

12 ex vivo is expected to maintain and expand cell populations in a totipotential or pluripotential 
state which would be useful for re-engineering damaged or diseased tissues, transplantation, 
manufacture of bio-pharmaceuticals and the development of bio-sensors. The ability to produce 
large quantities of human cells has important working applications for the production of human 

1 6 proteins which currently must be obtained from non-human sources or donors, implantation of 
cells to treat diseases such as Parkinson's, Alzheimer's and other neurodegenerative diseases; 
tissues for grafting such as bone marrow, skin, cartilage, tendons, bone, muscle (including 
cardiac muscle), blood vessels, cornea, neural cells, gastrointestinal cells and others; and organs 

20 for transplantation such as kidney, liver, pancreas (including islet cells), heart and lung. 

It is contemplated that multiple different exogenous growth factors and/or cytokines may 
be administered in combination with the polypeptide of the invention to achieve the desired 
effect, including any of the growth factors listed herein, other stem cell maintenance factors, and 

24 specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), Flt-3 ligand (Fit- 
3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL-6, macrophage 
inflammatory protein 1-alpha (MEP-1 -alpha), G-CSF, GM-CSF, thrombopoietin (TPO), platelet 
factor 4 (PF-4), platelet-derived growth factor (PDGF), neural growth factors and basic 

28 fibroblast growth factor (bFGF). 

Since totipotent stem cells can give rise to virtually any mature cell type, expansion of 
these cells in culture will facilitate the production of large quantities of mature cells. Techniques 
for culturing stem cells are known in the art and administration of polypeptides of the invention, 

32 optionally with other growth factors and/or cytokines, is expected to enhance the survival and 
proliferation of the stem cell populations. This can be accomplished by direct administration of 
the polypeptide of the invention to the culture medium. Alternatively, stroma cells transfected 
with a polynucleotide that encodes for the polypeptide of the invention can be used as a feeder 
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layer for the stem cell populations in culture or in vivo. Stromal support cells for feeder layers 
may include embryonic bone marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or 
cultured embryonic fibroblasts (see U.S. Patent No. 5,690,926). 
4 Stem cells themselves can be transfected with a polynucleotide of the invention to induce 

autocrine expression of the polypeptide of the invention. This will allow for generation of 
undifferentiated totipotential/pluripotential stem cell lines that are useful as is or that can then be 
differentiated into the desired mature cell types. These stable cell lines can also serve as a source 
8 of undifferentiated totipotential/pluripotential mRNA to create cDNA libraries and templates for 
polymerase chain reaction experiments. These studies would allow for the isolation and 
identification of differentially expressed genes in stem cell populations that regulate stem cell 
proliferation and/or maintenance. 

12 Expansion and maintenance of totipotent stem cell populations will be useful in the 

treatment of many pathological conditions. For example, polypeptides of the present invention 
may be used to manipulate stem cells in culture to give rise to neuroepithelial cells that can be 
used to augment or replace cells damaged by illness, autoimmune disease, accidental damage or 

16 genetic disorders. The polypeptide of the invention may be useful for inducing the proliferation 
of neural cells and for the regeneration of nerve and brain tissue, i.e. for the treatment of central 
and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic 
disorders which involve degeneration, death or trauma to neural cells or nerve tissue. In addition, 

20 the expanded stem cell populations can also be genetically altered for gene therapy purposes and 
to decrease host rejection of replacement tissues after grafting or implantation. 

Expression of the polypeptide of the invention and its effect on stem cells can also be 
manipulated to achieve controlled differentiation of the stem cells into more differentiated cell 

24 types. A broadly applicable method of obtaining pure populations of a specific differentiated 
cell type from undifferentiated stem cell populations involves the use of a cell-type specific 
promoter driving a selectable marker. The selectable marker allows only cells of the desired type 
• to survive. For example, stem cells can be induced to differentiate into cardiomyocytes (Wobus 

28 et al., Differentiation, 48: 173-182, (1991); Klug et al., J. Clin. Invest., 98(1): 216-224, (1998)) 
or skeletal muscle cells (Browder, L. W. In: Principles of Tissue Engineering eds. Lanza et al., 
Academic Press (1997)). Alternatively, directed differentiation of stem cells can be 
accomplished by culturing the stem cells in the presence of a differentiation factor such as 

32 retinoic acid and an antagonist of the polypeptide of the invention which would inhibit the 
effects of endogenous stem cell factor activity and allow differentiation to proceed. 

In vitro cultures of stem cells can be used to determine if the polypeptide of the invention 
exhibits stem cell growth factor activity. Stem cells are isolated from any one of various cell 
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sources (including hematopoietic stem cells and embryonic stem cells) and cultured on a feeder 
layer, as described by Thompson et al. Proc. Natl. Acad. Sci, U.S.A., 92: 7844-7848 (1995), in 
the presence of the polypeptide of the invention alone or in combination with other growth 
4 factors or cytokines. The ability of the polypeptide of the invention to induce stem cells 
proliferation is determined by colony formation on semi-solid support e.g. as described by 
Bernstein et al., Blood, 77: 23 16-2321 (1991). 

8 4.10.5 HEMATOPODESIS REGULATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of hematopoiesis 
and, consequently, in the treatment of myeloid or lymphoid cell disorders. Even marginal 
biological activity in support of colony forming cells or of factor-dependent cell lines indicates 

12 involvement in regulating hematopoiesis, e.g. in supporting the growth and proliferation of 
erythroid progenitor cells alone or in combination with other cytokines, thereby indicating 
utility, for example, in treating various anemias or for use in conjunction with 
irradiation/chemotherapy to stimulate the production of erythroid precursors and/or erythroid 

16 cells; in supporting the growth and proliferation of myeloid cells such as granulocytes and 

monocytes/macrophages (z.e., traditional CSF activity) useful, for example, in conjunction with 
chemotherapy to prevent or treat consequent myelo-suppression; in supporting the growth and 
proliferation of megakaryocytes and consequently of platelets thereby allowing prevention or 

20 treatment of various platelet disorders such as thrombocytopenia, and generally for use in place 
of or complimentary to platelet transfusions; and/or in supporting the growth and proliferation of 
hematopoietic stem cells which are capable of maturing to any and all of the above-mentioned 
hematopoietic cells and therefore find therapeutic utility in various stem cell disorders (such as 

24 those usually treated with transplantation, including, without limitation, aplastic anemia and 
paroxysmal nocturnal hemoglobinuria), as well as in repopulating the stem cell compartment 
post irradiation/chemotherapy, either in-vivo or ex~vivo (Le., in conjunction with bone marrow 
transplantation or with peripheral progenitor cell transplantation (homologous or heterologous)) 

28 as normal cells or genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 
Suitable assays for proliferation and differentiation of various hematopoietic lines are 
cited above. 

32 Assays for embryonic stem cell differentiation (which will identify, among others, 

proteins that influence embryonic differentiation hematopoiesis) include, without limitation, 
those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al., Molecular 
and Cellular Biology 13:473^186, 1993; McClanahan et al., Blood 81:2903-2915, 1993. 
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Assays for stem cell survival and differentiation (which will identify, among others, 
proteins that regulate lympho-hematopoiesis) include, without limitation, those described in: 
Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. R. I. 

4 Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, N.Y. 1994; Hirayama et al., 
Proc. Natl. Acad. Sci. USA 89:5907-591 1, 1992; Primitive hematopoietic colony forming cells 
with high proliferative potential, McNiece, I. K. and Briddell, R. A. In Culture of Hematopoietic 
Cells. R. I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, N.Y. 1994; Neben et 

8 al., Experimental Hematology 22:353-359, 1 994; Cobblestone area forming cell assay, 

Ploemacher, R. E. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 1-21, 
Wiley-Liss, Inc., New York, N.Y. 1994; Long term bone marrow cultures in the presence of 
stromal cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of Hematopoietic Cells. R. L 
12 Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, N.Y. 1994; Long term culture 
initiating cell assay, Sutherland, H. J. In Culture of Hematopoietic Cells. R. L Freshney, et al. 
eds. Vol pp. 139-162, Wiley-Liss, Inc., New York, N.Y. 1994. 

16 4.10,6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved in bone, cartilage, tendon, 
ligament and/or nerve tissue growth or regeneration, as well as in wound healing and tissue 
repair and replacement, and in healing of burns, incisions and ulcers. 

20 A polypeptide of the present invention which induces cartilage and/or bone growth in 

circumstances where bone is not normally formed, has application in the healing of bone 
fractures and cartilage damage or defects in humans and other animals. Compositions of a 
polypeptide, antibody, binding partner, or other modulator of the invention may have 

24 prophylactic use in closed as well as open fracture reduction and also in the improved fixation of 
artificial joints. De novo bone formation induced by an osteogenic agent contributes to the 
repair of congenital, trauma induced, or oncologic resection induced craniofacial defects, and 
also is useful in cosmetic plastic surgery. 

28 A polypeptide of this invention may also be involved in attracting bone-forming cells, 

stimulating growth of bone-forming cells, or inducing differentiation of progenitors of 
bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or 
periodontal disease, such as through stimulation of bone and/or cartilage repair or by blocking 

32 inflammation or processes of tissue destruction (collagenase activity, osteoclast activity, etc.) 
mediated by inflammatory processes may also be possible using the composition of the 
invention. 
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Another category of tissue regeneration activity that may involve the polypeptide of the 
present invention is tendon/ligament formation. Induction of tendon/ligament-like tissue or 
other tissue formation in circumstances where such tissue is not normally formed, has 
4 application in the healing of tendon or ligament tears, deformities and other tendon or ligament 
defects in humans and other animals. Such a preparation employing a tendon/ligament-like 
tissue inducing protein may have prophylactic use in preventing damage to tendon or ligament 
tissue, as well as use in the improved fixation of tendon or ligament to bone or other tissues, and 
8 in repairing defects to tendon or ligament tissue. De novo tendon/ligament-like tissue formation 
induced by a composition of the present invention contributes to the repair of congenital, trauma 
induced, or other tendon or ligament defects of other origin, and is also useful in cosmetic plastic 
surgery for attachment or repair of tendons or ligaments. The compositions of the present 

12 invention may provide environment to attract tendon- or ligament-forming cells, stimulate 

growth of tendon- or ligament-forming cells, induce differentiation of progenitors of tendon- or 
ligament-forming cells, or induce growth of tendon/ligament cells or progenitors ex vivo for 
return in vivo to effect tissue repair. The compositions of the invention may also be useful in the 

16 treatment of tendinitis, caipal tunnel syndrome and other tendon or ligament defects. The 

compositions ipay also include an appropriate matrix and/or sequestering agent as a carrier as is 
well known in the art. 

The compositions of the present invention may also be useful for proliferation of neural 

20 cells and for regeneration of nerve and brain tissue, L e. for the treatment of central and peripheral 
nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which 
involve degeneration, death or trauma to neural cells or nerve tissue. More specifically, a 
composition may be used in the treatment of diseases of the peripheral nervous system, such as 

24 peripheral nerve injuries, peripheral neuropathy and localized neuropathies, and central nervous 
system diseases, such as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 
lateral sclerosis, and Shy-Drager syndrome. Further conditions which may be treated in 
accordance with the present invention include mecjhanical and traumatic disorders, such as spinal 

28 cord disorders, head trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies 
resulting from chemotherapy or other medical therapies may also be treatable using a 
composition of the invention. 

Compositions of the invention may also be usefiil to promote better or faster closure of 

32 non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular 
insufficiency, surgical and traumatic wounds, and the like. 

Compositions of the present invention may also be involved in the generation or 
regeneration of other tissues, such as organs (including, for example, pancreas, liver, intestine, 
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kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular (including vascular 
endothelium) tissue, or for promoting the growth of cells comprising such tissues. Part of the 
desired effects may be by inhibition or modulation of fibrotic scarring may allow normal tissue 
4 to regenerate. A polypeptide of the present invention may also exhibit angiogenic activity. 
A composition of the present invention may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and 
conditions resulting from systemic cytokine damage. 
8 A composition of the present invention may also be useful for promoting or inhibiting 

differentiation of tissues described above from precursor tissues or cells; or for inhibiting the 
growth of tissues described above. 

Therapeutic compositions of the invention can be used in the following: 
12 Assays for tissue generation activity include, without limitation, those described in: 

International Patent Publication No. WO95/16035 (bone, cartilage, tendon); International Patent 
Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. 
WO91/07491 (skin, endothelium). 
16 Assays for wound healing activity include, without limitation, those described in: Winter, 

Epidermal Wound Healing, pps. 71-112 (Maibach, H. I. and Rovee, D. T., eds.), Year Book 
Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol 
71:382-84 (1978). 

20 

4.10-7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 

A polypeptide of the present invention may also exhibit immune stimulating or immune 
suppressing activity, including without limitation the activities for which assays are described 

24 herein. A polynucleotide of the invention can encode a polypeptide exhibiting such activities. A 
protein may be useful in the treatment of various immune deficiencies and disorders (including 
severe combined immunodeficiency (SCBD)), e.g., in regulating (up or down) growth and 
proliferation of T and/or B lymphocytes, as well as effecting the cytolytic activity of NK cells 

28 and other cell populations. These immune deficiencies may be genetic or be caused by viral {e.g., 
HIV) as well as bacterial or fungal infections, or may result from autoimmune disorders. More 
specifically, infectious diseases causes by viral, bacterial, fungal or other infection may be 
treatable using a protein of the present invention/including infections by HTV, hepatitis viruses, 

32 herpes viruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections such 
as candidiasis. Of course, in this regard, proteins of the present invention may also be useful 
where a boost to the immune system generally may be desirable, Le., in the treatment of cancer. 
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Autoimmune disorders which may be treated using a protein of the present invention 
include, for example, connective tissue disease, multiple sclerosis, systemic lupus erythematosus, 
rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre syndrome, 
4 autoimmune thyroiditis, insulin dependent diabetes mellitis, myasthenia gravis, graft-versus-host 
disease and autoimmune inflammatory eye disease. Such a protein (or antagonists thereof, 
including antibodies) of the present invention may also to be useful in the treatment of allergic 
reactions and conditions (e.g., anaphylaxis, serum sickness, drug reactions, food allergies, insect 
8 venom allergies, mastocytosis, allergic rhinitis, hypersensitivity pneumonitis, urticaria, 
angioedema, eczema, atopic dermatitis, allergic contact dermatitis, erythema multiforme, 
Stevens-Johnson syndrome, allergic conjunctivitis, atopic keratoconjunctivitis, venereal 
keratoconjunctivitis, giant papillary conjunctivitis and contact allergies), such as asthma 
. 12 (particularly allergic asthma) or other respiratory problems. Other conditions, in which immune 
suppression is desired (including, for example, organ transplantation), may also be treatable 
using a protein (or antagonists thereof) of the present invention. The therapeutic effects of the 
polypeptides or antagonists thereof on allergic reactions can be evaluated by in vivo animals 

16 models such as the cumulative contact enhancement test (Lastbom et al., Toxicology 125: 59-66, 
1998), skin prick test (Hoffmann et al., Allergy 54: 446-54, 1999), guinea pig skin sensitization 
test (Vohr et al., Arck Toxocol. 73: 501-9), and murine local lymph node assay (Kimber et al., 
J. Toxicol. Environ. Health 53: 563-79). 

20 Using the proteins of the invention it may also be possible to modulate immune 

responses, in a number of ways. Down regulation may be in the form of inhibiting or blocking an 
immune response already in progress or may involve preventing the induction of an immune 
response. The functions of activated T cells may be inhibited by suppressing T cell responses or 

24 by inducing specific tolerance in T cells, or both. Immunosuppression of T cell responses is 
generally an active, non-antigen-specific, process which requires continuous exposure of the T 
cells to the suppressive agent Tolerance, which involves inducing non-responsiveness or anergy 
in T cells, is distinguishable from immunosuppression in that it is generally antigen-specific and 

28 persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be 

demonstrated by the lack of a T cell response upon reexposure to specific antigen in the absence 
of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 

32 limitation B lymphocyte antigen functions (such as, for example, B7)), e.g. , preventing high 
level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and 
organ transplantation and in graft-versus-host disease (GVHD). For example, blockage of T cell 
function should result in reduced tissue destruction in tissue transplantation. Typically, in tissue 
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transplants, rejection of the transplant is initiated through its recognition as foreign by T cells, 
followed by an immune reaction that destroys the transplant The administration of a therapeutic 
composition of the invention may prevent cytokine synthesis by immune cells, such as T cells, 
4 and thus acts as an immunosuppressant Moreover, a lack of costimulation may also be sufficient 
to anergize the T cells, thereby inducing tolerance in a subject Induction of long-term tolerance 
by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated administration 
of these blocking reagents. To achieve sufficient immunosuppression or tolerance in a subject, it 
8 may also be necessary to block the function of a combination of B lymphocyte antigens. 

The efficacy of particular therapeutic compositions in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
humans. Examples of appropriate systems which can be used include allogeneic cardiac grafts in 

12 rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine 
the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described in Lenschow et 
at, Science 257:789-792 (1992) and Turka et at, Proc. Natl. Acad. Sci USA, 89:1 1 102-1 1 105 
(1992). In addition, murine models of GVHD (see Paul ed., Fundamental Immunology, Raven 

16 Press, New York, 1989, pp. 846-847) can be used to determine the effect of therapeutic 
compositions of the invention on the development of that disease. 

Blocking antigen function may also be therapeutically useful for treating autoimmune 
diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that are 

20 reactive against self tissue and which promote the production of cytokines and autoantibodies 
involved in the pathology of the diseases. Preventing the activation of autoreactive T cells may 
reduce or eliminate disease symptoms. Administration of reagents which block stimulation of T 
cells can be used to inhibit T cell activation and prevent production of autoantibodies or T 

24 cell-derived cytokines which may be involved in the disease process. Additionally, blocking 
reagents may induce antigen-specific tolerance of autoreactive T cells which could lead to 
long-term relief from the disease. The efficacy of blocking reagents in preventing or alleviating 
autoimmune disorders can be determined using a number of well-characterized animal models of 

28 human autoimmune diseases. Examples include murine experimental autoimmune encephalitis, 
systemic lupus erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, murine autoimmune 
collagen arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental 
myasthenia gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 

32 840-856). 

Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as a means 
of up regulating immune responses, may also be useful in therapy. Upregulation of immune 
responses may be in the form of enhancing an existing immune response or eliciting an initial 
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immune response. For example, enhancing an immune response may be usejful in cases of viral 
infection, including systemic viral diseases such as influenza, the common cold, and 
encephalitis. 

4 Alternatively, anti-viral immune responses may be enhanced in an infected patient by 

removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed 
APCs either expressing a peptide of the present invention or together with a stimulatory form of 
a soluble peptide of the present invention and reintroducing the in vitro activated T cells into the 
8 patient Another method of enhancing anti-viral immune responses would be to isolate infected 
cells from a patient, transfect them with a nucleic acid encoding a protein of the present 
invention as described herein such that the cells express all or a portion of the protein on their 
surface, and reintroduce the transfected cells into the patient The infected cells would now be 

12 capable of delivering a costimulatoiy signal to, and thereby activate, T cells in vivo. 

A polypeptide of the present invention may provide the necessary stimulation signal to T 
cells to induce a T cell mediated immune response against the transfected tumor cells. In 
addition, tumor cells which lack MHC class I or MHC class II molecules, or which fail to 

1 6 reexpress sufficient mounts of MHC class I or MHC class II molecules, can be transfected with 
nucleic acid encoding all or a portion of (e.g. , a cytoplasmic-domain truncated portion) of an 
MHC class I alpha chain protein and microglobulin protein or an MHC class II alpha chain 
protein and an MHC class E beta chain protein to thereby express MHC class I or MHC class II 

20 proteins on the cell surface. Expression of the appropriate class I or class II MHC in conjunction 
with a peptide having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T 
cell mediated immune response against the transfected tumor cell. Optionally, a gene encoding 
an antisense construct which blocks expression of an MHC class II associated protein, such as 

24 the invariant chain, can also be cotransfected with a DNA encoding a peptide having the activity 
of a B lymphocyte antigen to promote presentation of tumor associated antigens and induce 
tumor specific immunity. Thus, the induction of a T cell mediated immune response in a human 
subject may be sufficient to overcome tumor-specific tolerance in the subject 

28 The activity of a protein of the invention may, among other means, be measured by the 

following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, 
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. 
32 H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 

Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 
78:2488-2492, 1981; Herrmann et at, J. Immunol. 128:1968-1974, 1982; Handa et al., J. 
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Immunol. 135:1564-1572, 1985; Takai et al., I. Immunol. 137:3494-3500, 1986; Takai et al., J. 
Immunol. 140:508-512, 1988; Bowman et al., J. Virology 61:1992-1998; Bertagnolli et al., 
Cellular Immunology 133:327-341, 1991; Brown etal., J.Immunol. 153:3079-3092, 1994. 
4 Assays for T-cell-dependent immunoglobulin responses and isotype switching (which 

will identify, among others, proteins that modulate T-cell dependent antibody responses and that 
affect Thl/Th2 profiles) include, without limitation, those described in: Maliszewski, J. 
Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production, 
8 Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 
pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, proteins 
that generate predominantly Thl and CTL responses) include, without limitation, those described 

12 in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. 
M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocyte Function 3. 1-3.19; Chapter 7, Immunologic studies in 
Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 

16 1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins expressed 
by dendritic cells mat activate naive T-cells) include, without limitation, those described in: 
Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of Experimental Medicine 

20 173:549-559, 1991; Macatonia et al., Journal of Immunology 154:5071-5079, 1995; Porgador et 
al., Journal of Experimental Medicine 1 82:255-260, 1995; Nair et al., Journal of Virology 
67:4062-4069, 1993; Huang et al., Science 264:961-965, 1994; Macatonia et al., Journal of 
Experimental Medicine 169:1255-1264, 1989; Bhardwaj et al., Journal of Clinical Investigation 

24 94:797-807, 1994; and Inaba et al., Journal of Experimental Medicine 172:63 1-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, proteins 
that prevent apoptosis after superantigen induction and proteins that regulate lymphocyte 
homeostasis) include, without limitation, those described in: Darzynkiewicz et al., Cytometry 

28 13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et al., Cancer Research 
53: 1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991; Zacharchuk, Journal of Immunology 
145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et al., International 
Journal of Oncology 1:639-648, 1992. 

32 Assays for proteins that influence early steps of T-cell commitment and development 

include, without limitation, those described in: Antica et al., Blood 84:1 11-117, 1994; Fine et al., 
Cellular Immunology 155:1 1 1-122, 1994; Galy et al., Blood 85:2770-2778, 1995; Toki et al., 
Proc. Nat. Acad Sci. USA 88:7548-7551, 1991 . 
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4.10.8 ACTTVTN/INHIBIN ACTIVITY 

A polypeptide of the present invention may also exhibit activin- or inhibin-related 
4 activities. A polynucleotide of the invention may encode a polypeptide exhibiting such 
characteristics. Lahibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins and are characterized by their ability to stimulate the 
release of follicle stimulating hormone (FSH). Thus, a polypeptide of the present invention, 
8 alone or in heterodimers with a member of the inhibin family, may be useful as a contraceptive 
based on the ability of inhibins to decrease fertility in female mammals and decrease 
spermatogenesis in male mammals. Administration of sufficient amounts of other inhibins can 
induce infertility in these mammals. Alternatively, the polypeptide of the invention, as a 
12 homodimer or as a heterodimer with other protein subunits of the inhibin group, may be useful as 
a fertility inducing therapeutic, based upon the ability of activin molecules in stimulating FSH 
release from cells of the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885. A 
polypeptide of the invention may also be useful for advancement of the onset of fertility in 
16 sexually immature mammals, so as to increase the lifetime reproductive performance of domestic 
animals such as, but not limited to, cows, sheep and pigs. 

The activity of a polypeptide of the invention may, among other means, be measured by 
the following methods. 

20 Assays for activin/inhibin activity include, without limitation, those described in: Vale et 

al., Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et aL, Nature 
321:776-779, 1986; Mason et al., Nature 3 18:659-663, 1985; Forage et al., Pioc. Natl. Acad. Sci. 
USA 83:3091-3095, 1986. 

24 

410.9 CHEMOTACTIC/CHEMOIONETIC ACTIVITY 

A polypeptide of the present invention may be involved in chemotactic or chemokinetic 
activity for mammalian cells, including, for example, monocytes, fibroblasts, neutrophils, 

28 T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. A polynucleotide of the 
invention can encode a polypeptide exhibiting such attributes. Chemotactic and chemokinetic 
receptor activation can be used to mobilize or attract a desired cell population to a desired site of 
action. Chemotactic or chemokinetic compositions (e.g. proteins, antibodies, binding partners, or 

32 modulators of the invention) provide particular advantages in treatment of wounds and other 
trauma to tissues, as well as in treatment of localized infections. For example, attraction of 
lymphocytes, monocytes or neutrophils to tumors or sites of infection may result in improved 
immune responses against the tumor or infecting agent. 
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A protein or peptide has chemotactic activity for a particular cell population if it can 
stimulate, directly or indirectly, the directed orientation or movement of such cell population. 
Preferably, the protein or peptide has the ability to directly stimulate directed movement of cells. 
4 Whether a particular protein has chemotactic activity for a population of cells can be readily 
determined by employing such protein or peptide in any known assay for cell chemotaxis. 
Therapeutic compositions of the invention can be used in the following: 
Assays for chemotactic activity (which will identify proteins that induce or prevent 
8 chemotaxis) consist of assays that measure the ability of a protein to induce the migration of 
cells across a membrane as well as the ability of a protein to induce the adhesion of one cell 
population to another cell population. Suitable assays for movement and adhesion include, 
without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. 
12 M. Kruisbeek, D. H. Marguiles, E. M Shevach, W. Strober, Pub. Greene Publishing Associates 
and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta Chemokines 
6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. APMIS 103:140-146, 
1995; Muller et al Eur. J. Immunol. 25:1744-1748; Gruber et al. J. of Immunol. 152:5860-5867, 
16 1994; Johnston et al. J. of Immunol 153:1762-1768, 1994. 

4.10.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

A polypeptide of the invention may also be involved in hemostatis or thrombolysis or 
20 thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such 

attributes. Compositions may be useful in treatment of various coagulation disorders (including 

hereditary disorders, such as hemophilias) or to enhance coagulation and other hemostatic events 

in treating wounds resulting from trauma, surgery or other causes. A composition of the 
24 invention may also be useful for dissolving or inhibiting formation of thromboses and for 

treatment and prevention of conditions resulting therefrom (such as, for example, infarction of 

cardiac and central nervous system vessels (e.g., stroke). 

Therapeutic compositions of the invention can be used in the following: 
28 Assay for hemostatic and thrombolytic activity include, without limitation, those 

described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis Res. 

45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 

35:467-474, 1988. 

32 

4.10.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation, proliferation or 
metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the 
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invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. For 
example, the presence or increased expression of a polynucleotide/polypeptide of the invention 
may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing malignancy. 
4 Conversely, a defect in the gene or absence of the polypeptide may be associated with a cancer 
condition. Identification of single nucleotide polymorphisms associated with cancer or a 
predisposition to cancer may also be useful for diagnosis or prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, 
8 inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor growth) 
and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. Therapeutic 
compositions of the invention may be effective in adult and pediatric oncology including in solid 
phase tumors/malignancies, locally advanced tumors, human soft tissue sarcomas, metastatic 

12 cancer, including lymphatic metastases, blood cell malignancies including multiple myeloma, 
acute and chronic leukemias, and lymphomas, head and neck cancers including mouth cancer, 
larynx cancer and thyroid cancer, lung cancers including small cell carcinoma and non-small cell 
cancers, breast cancers including small cell carcinoma and ductal carcinoma, gastrointestinal 

16 cancers including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps 
associated with colorectal neoplasia, pancreatic cancers,- liver cancer, urologic cancers including 
bladder cancer and prostate cancer, malignancies of the female genital tract including ovarian 
carcinoma, uterine (including endometrial) cancers, and solid tumor in the ovarian follicle, 

20 kidney cancers including renal cell carcinoma, brain cancers including intrinsic brain tumors, 
neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central 
nervous system, bone cancers including osteomas, skin cancers including malignant melanoma, 
tumor progression of human skin keratinocytes, squamous cell carcinoma, basal cell carcinoma, 

24 hemangiopericytoma and Karposi's sarcoma. 

Polypeptides, polynucleotides, or modulators of polypeptides of the invention (including 
inhibitors and stimulators of the biological activity of the polypeptide of the invention) may be 
administered to treat cancer. Therapeutic compositions can be administered in therapeutically 

28 effective dosages alone or in combination with adjuvant cancer therapy such as surgery, 

chemotherapy, radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial 
effect, e.g. reducing tumor size, slowing rate of tumor growth, inhibiting metastasis, or otherwise 
improving overall clinical condition, without necessarily eradicating the cancer. 

32 The composition can also be administered in therapeutically effective amounts as a 

portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or 
modulator of the invention with one or more anti-cancer drugs in addition to a pharmaceutically 
acceptable carrier for delivery. The use of anti-cancer cocktails as a cancer treatment is routine. 
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Anti-cancer drugs that are well known in the art and can be used as a treatment in combination 
with the polypeptide or modulator of the invention include: Actinomycin D, Aminoglutethimide, 
Asparaginase, Bleomycin, Busulfan, Carboplatin, Carmustine, Chlorambucil, Cisplatin (cis- 
4 DDP), Cyclophosphamide, Cytarabine HC1 (Cytosine arabinoside), Dacarbazine, Dactinomycin, 
Daunorubicin HC1, Doxorubicin HC1, Estramustine phosphate sodium, Etoposide (V16-213), 
Floxuridine, 5-Fluorouracil (5-Fu), Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide, 
Interferon AIpha-2a, Interferon Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), 
8 Lomustine, Mechlorethamine HC1 (nitrogen mustard), Melphalan, Mercaptopurine, Mesna, 

Methotrexate (MTX), Mitomycin, Mitoxantrone HCL, Octreotide, Plicamycin, Procarbazine HC1, 
Streptozocin, Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine sulfate, Vincristine sulfate, 
Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, Mitoguazone, Pentostatin, 

12 Semustine, Teniposide, and Vindesine sulfate. 

In addition, therapeutic compositions of the invention may be used for prophylactic 
treatment of cancer. There are hereditary conditions and/or environmental situations (e.g. 
exposure to carcinogens) known in the art that predispose an individual to developing cancers. 

16 Under these circumstances, it may be beneficial to treat these individuals with therapeutically 
effective doses of the polypeptide of the invention to reduce the risk of developing cancers. 

In vitro models can be used to determine the effective doses of the polypeptide of the 
invention as a potential cancer treatment. These in vitro models include proliferation assays of 

20 cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1 987) Culture of 
Animal Cells: A Manual of Basic Technique, Wily-Liss, New York, NYCh 18 and Ch 21), 
tumor systems in nude mice as described in Giovanella et al., J. Natl. Can. Inst, 52: 921-30 
(1974), mobility and invasive potential of tumor cells in Boyden Chamber assays as described in 

24 Pilkington et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis assays such as induction 
of vascularization of the chick chorioallantoic membrane or induction of vascular endothelial 
cell migration as described in Ribatta et al., Intl. J. Dev. Biol., 40: 1 189-97 (1999) and Li et al., 
Clin. Exp. Metastasis, 17:423-9 (1999), respectively. Suitable tumor cells lines are available, 

28 eg. from American Type Tissue Culture Collection catalogs. 

4.10.12 RECEPTOR/LIGAND ACTIVITY 

A polypeptide of the present invention may also demonstrate activity as receptor, 
32 receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of the 
invention can encode a polypeptide exhibiting such characteristics. Examples of such receptors 
and ligands include, without limitation, cytokine receptors and their ligands, receptor kinases and 
their ligands, receptor phosphatases and their ligands, receptors involved in cell-cell interactions 
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and their ligands (including without limitation, cellular adhesion molecules (such as selectins, 
integrins and their ligands) and receptor/ligand pairs involved in antigen presentation, antigen 
recognition and development of cellular and humoral immune responses. Receptors and ligands 
4 are also useful for screening of potential peptide or small molecule inhibitors of the relevant 
receptor/ligand interaction. A protein of the present invention (including, without limitation, 
fragments of receptors and ligands) may themselves be useful as inhibitors of receptor/ligand 
interactions. 

8 The activity of a polypeptide of the invention may, among other means, be measured by 

the following methods: 

Suitable assays for receptor-ligand activity include without limitation those described in: 

Current Protocols in Immunology, Ed by J. E. Coligan, A. M Kruisbeek, D. H. Margulies, E. M. 
12 Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- Interscience (Chapter 7.28, 

Measurement of Cellular Adhesion under static conditions 7.28.1- 7.28.22), Takai et al., Proc. 

Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., J. Exp. Med 168:1145-1156, 1988; 

Rosenstein et al., J. Exp. Med. 169:149-160 1989; Stoltenborg et al., J. Immunol. Methods 
1 6 1 75:59-68, 1994; Stitt et al., Cell 80:661 -670, 1995. 

By way of example, the polypeptides of the invention may be used as a receptor for a 

ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be identified 

through binding assays, affinity chromatography, dihybrid screening assays, BIAcore assays, gel 
20 overlay assays, or other methods known in the art 

Studies characterizing drugs or proteins as agonist or antagonist or partial agonists or a 

partial antagonist require the use of other proteins as competing ligands. The polypeptides of the 

present invention or ligand(s) thereof may be labeled by being coupled to radioisotopes, 
24 colorimetric molecules or a toxin molecules by conventional methods. ("Guide to Protein 

Purification" Murray P. Deutscher (ed) Methods in En2ymology Vol. 182 (1990) Academic 

Press, Inc. San Diego). Examples of radioisotopes include, but are not limited to, tritium and 

carbon-14 . Examples of colorimetric molecules include, but are not limited to, fluorescent 
28 molecules such as fluorescamine, or rhodamine or other colorimetric molecules. Examples of 

toxins include, but are not limited, to ricin. 

4.10.13 DRUG SCREENING 

32 This invention is particularly useful for screening chemical compounds by using the 

novel polypeptides or binding fragments thereof in any of a variety of drug screening techniques. 
The polypeptides or fragments employed in such a test may either be free in solution, affixed to a 
solid support, borne on a cell surface or located intracellular^. One method of drug screening 
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utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant 
nucleic acids expressing the polypeptide or a fragment thereof. Drugs are screened against such 
transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can 
4 be used for standard binding assays. One may measure, for example, the formation of 

complexes between polypeptides of the invention or fragments and the agent being tested or 
examine the diminution in complex formation between the novel polypeptides and an 
appropriate cell line, which are well known in the art 
8 Sources for test compounds that may be screened for ability to bind to or modulate (i. e. , 

increase or decrease) the activity of polypeptides of the invention include (1) inorganic and 
organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of either random or mimetic peptides, oligonucleotides or organic molecules. 

12 Chemical libraries may be readily synthesized or purchased from a number of 

commercial sources, and may include structural analogs of known compounds or compounds 
that are identified as "hits" or "leads" via natural product screening. 

The sources of natural product libraries are microorganisms (including bacteria and 

1 6 fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for 

screening may be created by: (1) fermentation and extraction of broths from soil, plant or marine 
microorganisms or (2) extraction of the organisms themselves. Natural product libraries include 
polyketides, non-ribosomal peptides, and (non-naturally occurring) variants thereof. For a 

20 review, see Science 2*2:63-68 (1998). 

Combinatorial libraries are composed of large numbers of peptidies, oligonucleotides or 
organic compounds and can be readily prepared by traditional automated synthesis methods, 
PCR, cloning or proprietary synthetic methods. Of particular interest are peptide and 

24 oligonucleotide combinatorial libraries. Still other libraries of interest include peptide, protein, 
peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide libraries. 
For a review of combinatorial chemistry and libraries created therefrom, see Myers, Cwrr. Opin. 
Biotechnol 8:701-707 (1997). For reviews and examples of peptidomimetic libraries, see 

28 Ai-Obeidi et al, Mol Biotechnol, 9(3):205-23 (1998); Hruby et al., Curr Opin Chem Biol, 

1(1):1 14-19 (1997); Dorner et al., BioorgMed Chem, 4(5):709-15 (1996) (alkylated dipeptides). 

Identification of modulators through use of the various libraries described herein permits 
modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to bind a 

32 polypeptide of the invention. The molecules identified in the binding assay are then tested for 
antagonist or agonist activity in in vivo tissue culture or animal models that are well known in the 
art. In brie£ the. molecules are titrated into a plurality of cell cultures or animals and then tested 
for either cell/animal death or prolonged survival of the animal/cells. 



WO 01/64835 



PCT/US01/04927 



The binding molecules thus identified may be complexed with toxins, e.g., ricin or 
cholera, or with other compounds that are toxic to cells such as radioisotopes. The toxin-binding 
molecule complex is then targeted to a tumor or other cell by the specificity of the binding 
4 molecule for a polypeptide of the invention. Alternatively, the binding molecules may be 
complexed with imaging agents for targeting and imaging purposes. 

4J0.14 ASSAY FOR RECEPTOR ACTIVITY 

8 The invention also provides methods to detect specific binding of a polypeptide e.g. a 

iigand or a receptor. The art provides numerous assays particularly useful for identifying 
previously unknown binding partners for receptor polypeptides of the invention. For example, 
expression cloning using mammalian or bacterial cells, or dihybrid screening assays can be used 

12 to identify polynucleotides encoding binding partners. As another example, affinity 

chromatography with the appropriate immobilized polypeptide of the invention can be used to 
isolate polypeptides that recognize and bind polypeptides of the invention. There are a number 
of different libraries used for the identification of compounds, and in particular small molecules, 

1 6 that modulate (i e., increase or decrease) biological activity of a polypeptide of the invention. 
Ligands for receptor polypeptides of the invention can also be identified by adding exogenous 
ligands, or cocktails of ligands to two cells populations that are genetically identical except for 
the expression of the receptor of the invention: one cell population expresses the receptor of the 

20 invention whereas the other does not The response of the two cell populations to the addition of 
ligands(s) are then compared. Alternatively, an expression library can be co-expressed with the 
polypeptide of the invention in cells and assayed for an autocrine response to identify potential 
ligand(s). As still another example, BIAcore assays, gel overlay assays, or other methods known 

24 in the art can be used to identify binding partner polypeptides, including, (1) organic and 
inorganic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of random peptides, oligonucleotides or organic molecules. 

The role of downstream intracellular signaling molecules in the signaling cascade of the 

28 polypeptide of the invention can be determined. For example, a chimeric protein in which the 
cytoplasmic domain of the polypeptide of the invention is fused to the extracellular portion of a 
protein, whose ligand has been identified, is produced in a host cell. The cell is then incubated 
with the ligand specific for the extracellular portion of the chimeric protein, thereby activating 

32 the chimeric receptor. Known downstream proteins involved in intracellular signaling can then 
be assayed for expected modifications ie. phosphorylation. Other methods known to those in the 
art can also be used to identify signaling molecules involved in receptor activity. 
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4.10.15 ANTI-INFLAMMATORY ACTIVITY 

Compositions of the present invention may also exhibit anti-inflammatory activity. The 
anti-inflammatory activity may be achieved by providing a stimulus to cells involved in the 
4 inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for example, 
cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the inflammatory 
process, inhibiting or promoting cell extravasation, or by stimulating or suppressing production 
of other factors which more directly inhibit or promote an inflammatory response. Compositions 
8 with such activities can be used to treat inflammatory conditions including chronic or acute 
conditions), including without limitation intimation associated with infection (such as septic 
shock, sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfiision injury, 
endotoxin lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 

12 chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting from 
over production of cytokines such as TNF or IL-1 . Compositions of the invention may also be 
useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 
Compositions of this invention may be utilized to prevent or treat conditions such as, but not 

16 limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid 
arthritis, chronic inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1, 
graft versus host disease, inflammatory bowel disease, inflamation associated with pulmonary 
disease, other autoimmune disease or inflammatory disease, an antiproliferative agent such as for 

20 acute or chronic mylegenous leukemia or in the prevention of premature labor secondary to 
intrauterine infections. 

4.10.16 LEUKEMIAS 

24 Leukemias and related disorders may be treated or prevented by administration of a 

therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of the 
invention. Such leukemias and related disorders include but are not limited to acute leukemia, 
acute lymphocytic leukemia, acute myelocytic leukemia, myeloblasts, promyelocyte, 

28 myelomonocytic, monocytic, erythroleukemia, chronic leukemia, chronic myelocytic 

(granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such disorders, see 
Fishman et al., 1985, Medicine, 2d EcL, J.B. Lippincott Co., Philadelphia). 

32 4.10.17 NERVOUS SYSTEM DISORDERS 

Nervous system disorders, involving cell types which can be tested for efficacy of 
intervention with compounds that modulate the activity of the polynucleotides and/or 
polypeptides of the invention, and which can be treated upon thus observing an indication of 
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therapeutic utility, include but are not limited to nervous system injuries, and diseases or 
disorders which result in either a disconnection of axons, a diminution or degeneration of 
neurons, or demyelination. Nervous system lesions which may be treated in a patient (including 
4 human and non-human mammalian patients) according to the invention include but are not 
limited to the following lesions of either the central (including spinal cord, brain) or peripheral 
nervous systems: 

(i) traumatic lesions, including lesions caused by physical injury or associated with 
8 surgery , for example, lesions which sever a portion of the nervous system, or compression 

injuries; 

(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous system 
results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord 

12 infarction or ischemia; 

(iii) infectious lesions, in which a portion of the nervous system is destroyed or 
injured as a result of infection, for example, by an abscess or associated with infection by human 
immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme disease, 

16 tuberculosis, syphilis; 

(iv) degenerative lesions, in which a portion of the nervous system is destroyed or 
injured as a result of a degenerative process including but not limited to degeneration associated 
with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or amyotrophic lateral 

20 sclerosis; 

(v) lesions associated with nutritional diseases or disorders, in which a portion of the 
nervous system is destroyed or injured by a nutritional disorder or disorder of metabolism 
including but not limited to, vitamin B12 deficiency, folic acid deficiency, Wernicke disease, 

24 tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary degeneration of the corpus 
callosum), and alcoholic cerebellar degeneration; 

(vi) neurological lesions associated with systemic diseases including but not limited to 
diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, carcinoma, or 

28 sarcoidosis; 

(vii) lesions caused by toxic substances including alcohol, lead, or particular 
neurotoxins; and 

(viii) demyelinated lesions in which a portion of the nervous system is destroyed or 
32 injured by a demyelinating disease including but not limited to multiple sclerosis, human 

immunodeficiency virus-associated myelopathy, transverse myelopathy or various etiologies, 
progressive multifocal leukoencephalopathy, and central pontine myelinolysis. 
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Therapeutics which are useful according to the invention for treatment of a nervous 
system disorder may be selected by testing for biological activity in promoting the survival or 
differentiation of neurons. For example, and not by way of limitation, therapeutics which elicit 
4 any of the following effects may be useful according to the invention: 

(i) increased survival time of neurons in culture; 

(ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in vivo, e.g. , 
8 choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or 

(iv) decreased symptoms of neuron dysfunction in vivo. 

Such effects may be measured by any method known in the art In preferred, 
non-limiting embodiments, increased survival of neurons may be measured by the method set 

12 forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons may 
be detected by methods set forth in Pestronk et al. (1 980, Exp. Neurol. 70:65-82) or Brown et al. 
(1981, Ann. Rev. Neurosci. 4:17-42); increased production of neuron-associated molecules may 
be measured by bioassay, enzymatic assay, antibody binding, Northern blot assay, etc., 

16 depending on the molecule to be measured; and motor neuron dysfunction may be measured by 
assessing the physical manifestation of motor neuron disorder, eg., weakness, motor neuron 
conduction velocity, or functional disability. 

In specific embodiments, motor neuron disorders that may be treated according to the 

20 invention include but are not limited to disorders such as infarction, infection, exposure to toxin, 
trauma, surgical damage, degenerative disease or malignancy that may affect motor neurons as 
well as other components of the nervous system, as well as disorders that selectively affect 
neurons such as amyotrophic lateral sclerosis, and including but not limited to progressive spinal 

24 muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, infantile and juvenile 
muscular atrophy, progressive bulbar paralysis of childhood (Fazio-Londe syndrome), 
poliomyelitis and the post polio syndrome, and Hereditary Motorsensory Neuropathy 
(Charcot-Marie-Tooth Disease). 

28 

4.10.18 OTHER ACTIVITIES 

A polypeptide of the invention may also exhibit one or more of the following additional 
activities or effects: inhibiting the growth, infection or function of, or killing, infectious agents, 
32 including, without limitation, bacteria, viruses, fungi and other parasites; effecting (suppressing 
or enhancing) bodily characteristics, including, without limitation, height, weight, hair color, eye 
color, skin, fat to lean ratio or other tissue pigmentation, or organ or body part size or shape 
(such as, for example, breast augmentation or diminution, change in bone form or shape); 
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effecting biorhythms or circadian cycles or rhythms; effecting the fertility of male or female 
subjects; effecting the metabolism, catabolism, anabolism, processing, utilization, storage or 
elimination of dietary fat, lipid, protein, carbohydrate, vitamins, minerals, co-factors or other 

4 nutritional factors or components); effecting behavioral characteristics, including, without 
limitation, appetite, libido, stress, cognition (including cognitive disorders), depression 
(including depressive disorders) and violent behaviors; providing analgesic effects or other pain 
reducing effects; promoting differentiation and growth of embryonic stem cells in lineages other 

8 than hematopoietic lineages; hormonal or endocrine activity; in the case of enzymes, correcting 
deficiencies of the enzyme and treating deficiency-related diseases; treatment of 
hyperproliferative disorders (such as, for example, psoriasis); immunoglobulin-like activity (such 
as, for example, the ability to bind antigens or complement); and the ability to act as an antigen 
12 in a vaccine composition to raise an immune response against such protein or another material or 
entity which is cross-reactive with such protein. 

4.10.19 IDENTIFICATION OF POLYMORPHISMS 

16 The demonstration of polymorphisms makes possible the identification of such 

polymorphisms in human subjects and the pharmacogenetic use of this information for diagnosis 
and treatment Such polymorphisms may be associated with, e.g:, differential predisposition or 
susceptibility to various disease states (such as disorders involving inflammation or immune 

20 response) or a differential response to drug administration, and this genetic information can be 
used to tailor preventive or therapeutic treatment appropriately. For example, the existence of a 
polymorphism associated with a predisposition to inflammation or autoimmune disease makes 
possible the diagnosis of this condition in humans by identifying the presence of the 

24 polymorphism. 

Polymorphisms can be identified in a variety of ways known in the art which all 
generally involve obtaining a sample from a patient, analyzing DNA from the sample, optionally 
involving isolation or amplification of the DNA, and identifying the presence of the 

28 polymorphism in the DNA, For example, PGR may be used to amplify an appropriate fragment 
of genomic DNA which may then be sequenced Alternatively, the DNA may be subjected to 
allele-specific oligonucleotide hybridization (in which appropriate oligonucleotides are 
hybridized to the DNA under conditions permitting detection of a single base mismatch) or to a 

32 single nucleotide extension assay (in which an oligonucleotide that hybridizes immediately 

adjacent to the position of the polymoiphism is extended with one or more labeled nucleotides). 
In addition, traditional restriction fragment length polymorphism analysis (using restriction 
enzymes that provide differential digestion of the genomic DNA depending on the presence or 
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absence of the polymorphism) may be performed. Arrays with nucleotide sequences of the 
present invention can be used to detect polymorphisms. The array can comprise modified 
nucleotide sequences of the present invention in order to detect the nucleotide sequences of the 

4 present invention. In the alternative, any one of the nucleotide sequences of the present 
invention can be placed on the array to detect changes from those sequences. 

Alternatively a polymorphism resulting in a change in the amino acid sequence could 
also be detected by detecting a corresponding change in amino acid sequence of the protein, e.g., 

8 by an antibody specific to the variant sequence. 

4.10.20 ARTHRITIS AND INFLAMMATION 

The immunosuppressive effects of the compositions of the invention against rheumatoid 

12 arthritis is determined in an experimental animal model system. The experimental model system 
is adjuvant induced arthritis in rats, and the protocol is described by J. Holoshitz, et at, 1 983, 
Science, 219:56, or by B. Waksman et al., 1963, Int. Arch. Allergy Appl. Immunol., 23:129. 
Induction of the disease can be caused by a single injection, generally intradennally, of a 

16 suspension of killed Mycobacterium tuberculosis in complete Freund's adjuvant (CFA). The 
route of injection can vary, but rats may be injected at the base of the tail with an adjuvant 
mixture. The polypeptide is administered in phosphate buffered solution (PBS) at a dose of about 
1-5 mg/kg. The control consists of administering PBS only. 

20 The procedure for testing the effects of the test compound would consist of intradennally 

injecting killed Mycobacterium tuberculosis in CFA followed by immediately administering the 
test compound and subsequent treatment every other day until day 24. At 14, 15, 18, 20, 22, and 
24 days after injection of Mycobacterium CFA, an overall arthritis score may be obtained as 

24 described by J. Holoskitz above. An analysis of the data would reveal that the test compound 
would have a dramatic affect on the swelling of the joints as measured by a decrease of the 
arthritis score. 

28 4.11 THERAPEUTIC METHODS 

The compositions (including polypeptide fragments, analogs, variants and antibodies or 
other binding partners or modulators including antisense polynucleotides) of the invention have 
numerous applications in a variety of therapeutic methods. Examples of therapeutic applications 
32 include, but are not limited to, those exemplified herein. 

4.11.1 EXAMPLE 
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One embodiment of the invention is the administration of an effective amount of the 
polypeptides or other composition of the invention to individuals affected by a disease or 
disorder that can be modulated by regulating the peptides of the invention. While the mode of 
4 administration is not particularly important, parenteral administration is preferred. An 
exemplary mode of administration is to deliver an intravenous bolus. Hie dosage of the 
polypeptides or other composition of the invention will normally be determined by the 
prescribing physician. It is to be expected that the dosage will vary according to the age, weight, 
8 con&tion and response of the individual patient. Typically, the amount of polypeptide 

administered per dose will be in the range of about O.Oljig/kg to 100 mg/kg of body weight, with 
the preferred dose being about 0. 1 Mg/kg to 1 0 mg/kg of patient body weight. For parenteral 
administration, polypeptides of the invention will be formulated in an injectable form combined 

12 with a pharmaceutical^ acceptable parenteral vehicle. Such vehicles are well known in the art 
and examples include water, saline, Ringer's solution, dextrose solution, and solutions consisting 
of small amounts of the human serum albumin. The vehicle may contain minor amounts of 
additives that maintain the isotonicity and stability of the polypeptide or other active ingredient 

16 The preparation of such solutions is within the skill of the art 

4.12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 
ADMINISTRATION 

20 A protein or other composition of the present invention (from whatever source derived, 

including without limitation from recombinant and non-recombinant sources and including 
antibodies and other binding partners of the polypeptides of the invention) may be administered 
to a patient in need, by itself, or in pharmaceutical compositions where it is mixed with suitable 

24 carriers or excipient(s) at doses to treat or ameliorate a variety of disorders. Such a composition 
may optionally contain (in addition to protein or other active ingredient and a carrier) diluents, 
fillers, salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 
"pharmaceutical^ acceptable" means anon-toxic material that does not interfere with the 

28 effectiveness of the biological activity of the active ingredient(s). The characteristics of the 
carrier will depend on the route of administration. The pharmaceutical composition of the 
invention may also contain cytokines, lymphokines, or other hematopoietic factors such as 
M-CSF, GM-CSF, TNF, IL-1, EL-2, IL-3, EL-4, IL-5, IL-6, IL-7, IL-8, DL-9, IL-10, DL-11, IL-12, 

32 EL-13, IL-14, IL-15, IFN, TNF0, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell 
factor, and erythropoietin. In further compositions, proteins of the invention may be combined 
with other agents beneficial to the treatment of the disease or disorder in question. These agents 
include various growth factors such as epidermal growth factor (EGF), platelet-derived growth 
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factor (PDGF), transforming growth factors (TGF-a and TGF-p), insulin-like growth factor 
(IGF), as well as cytokines described herein. 

The pharmaceutical composition may further contain other agents which either enhance 
4 the activity of the protein or other active ingredient or complement its activity or use in 
treatment. Such additional factors and/or agents may be included in the pharmaceutical 
composition to produce a synergistic effect with protein or other active ingredient of the 
invention, or to minimize side effects. Conversely, protein or other active ingredient of the 
8 present invention may be included in formulations of the particular clotting factor, cytokine, 
lyraphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti- 
inflammatory agent to minimize side effects of the clotting factor, cytokine, lymphokine, other 
hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti-inflammatory agent (such as 

12 IL-IRa, EL-1 Hyl, IL-1 Hy2, anti-TNF, corticosteroids, immunosuppressive agents). A protein 
of the present invention may be active in multimers (e.g., heterodimers or homodimers) or 
complexes with itself or other proteins. As a result, pharmaceutical compositions of the 
invention may comprise a protein of the invention in such multimeric or complexed form. 

16 As an alternative to being included in a pharmaceutical composition of the invention 

including a first protein, a second protein or a therapeutic agent may be concurrently 
administered with the first protein (e.g., at the same time, or at differing times provided that 
therapeutic concentrations of the combination of agents is achieved at the treatment site). 

20 Techniques for formulation and administration of the compounds of the instant application may 
be found in "Remingtons Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, latest 
edition. A therapeutically effective dose further refers to that amount of the compound sufficient 
to result in amelioration of symptoms, e.g., treatment, healing, prevention or amelioration of the 

24 relevant medical condition, or an increase in rate of treatment, healing, prevention or 

amelioration of such conditions. When applied to an individual active ingredient, administered 
alone, a therapeutically effective dose refers to that ingredient alone. When applied to a 
combination, a therapeutically effective dose refers to combined amounts of the active 

28 ingredients that result in the therapeutic effect, whether administered in combination, serially or 
simultaneously. 

In practicing the method of treatment or use of the present invention, a therapeutically 
effective amount of protein or other active ingredient of the present invention is administered to 
32 a mammal having a condition to be treated. Protein or other active ingredient of the present 

invention may be administered in accordance with the method of the invention either alone or in 
combination with other therapies such as treatments employing cytokines, lymphokines or other 
hematopoietic factors. When co- administered with one or more cytokines, lymphokines or other 



64 



WO 01/64835 



PCT/US01/04927 



hematopoietic factors, protein or other active ingredient of the present invention may be 
administered either simultaneously with the cytokine(s), lymphokine(s), other hematopoietic 
factors), thrombolytic or antithrombotic factors, or sequentially. If administered sequentially, 
4 the attending physician will decide on the appropriate sequence of administering protein or other 
active ingredient of the present invention in combination with cytokine(s), lymphokine(s), other 
hematopoietic £actor(s), thrombolytic or antithrombotic factors. 

8 4.12.1 ROUTES OF ADMINISTRATION 

Suitable routes of administration may, for example, include oral, rectal, transmucosal, or 
intestinal administration; parenteral delivery, including intramuscular, subcutaneous, 
intramedullary injections, as well as intrathecal, direct intraventricular, intravenous, 

12 intraperitoneal, intranasal, or intraocular injections. Administration of protein or other active 
ingredient of the present invention used in the pharmaceutical composition or to practice the 
method of the present invention can be carried out in a variety of conventional ways, such as oral 
ingestion, inhalation, topical application or cutaneous, subcutaneous, intraperitoneal, parenteral 

16 or intravenous injection. Intravenous administration to the patient is preferred. 

Alternately, one may administer the compound in a local rather than systemic manner, for 
example, via injection of the compound directly into a arthritic joints or in fibrotic tissue, often 
in a depot or sustained release formulation. In order to prevent the scarring process frequently 

20 occurring as complication of glaucoma surgery, the compounds may be administered topically, 
, for example, as eye drops. Furthermore, one may administer the drug in a targeted drug delivery 
system, for example, in a liposome coated with a specific antibody, targeting, for example, 
arthritic or fibrotic tissue. The liposomes will be targeted to and taken up selectively by the 

24 afflicted tissue. 

The polypeptides of the invention are administered by any route that delivers an effective 
dosage to the desired site of action. The determination of a suitable route of administration and 
an effective dosage for a particular indication is within the level of skill in the art. Preferably for 
28 wound treatment, one administers the therapeutic compound directly to the site. Suitable dosage 
ranges for the polypeptides of the invention can be extrapolated from these dosages or from 
similar studies in appropriate animal models. Dosages can then be adjusted as necessary by the 
clinician to provide maximal therapeutic benefit 

32 

4.12.2 COMPOSITIONS/FORMULATIONS 

Pharmaceutical compositions for use in accordance with the present invention thus may 
be formulated in a conventional manner using one or more physiologically acceptable carriers 
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comprising excipients and auxiliaries which facilitate processing of the active compounds into 
preparations which can be used pharmaceutical^. These pharmaceutical compositions may be 
manufactured in a manner that is itself known, e.g. , by means of conventional mixing, 
4 dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or 
lyophilizing processes. Proper formulation is dependent upon the route of administration 
chosen. When a therapeutically effective amount of protein or other active ingredient of the 
present invention is administered orally, protein or other active ingredient of the present 
8 invention will be in the form of a tablet, capsule, powder, solution or elixir. When administered 
in tablet form, the pharmaceutical composition of the invention may additionally contain a solid 
carrier such as a gelatin or an adjuvant. The tablet, capsule, and powder contain from about 5 to 
95% protein or other active ingredient of the present invention, and preferably from about 25 to 

12 90% protein or other active ingredient of the present invention. When administered in liquid 
form, a liquid carrier such as water, petroleum, oils of animal or plant origin such as peanut oil, 
mineral oil, soybean oil, or sesame oil, or synthetic oils may be added. The liquid form of the 
pharmaceutical composition may further contain physiological saline solution, dextrose or other 

16 saccharide solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol. 
When administered in liquid form, the pharmaceutical composition contains from about 0.5 to 
90% by weight of protein or other active ingredient of the present invention, and preferably from 
about 1 to 50% protein or other active ingredient of the present invention. 

20 When a therapeutically effective amount of protein or other active ingredient of the 

present invention is administered by intravenous, cutaneous or subcutaneous injection, protein or 
other active ingredient of the present invention will be in the form of a pyrogen-free, parenterally 
acceptable aqueous solution. The preparation of such parenterally acceptable protein or other 

24 active ingredient solutions, having due regard to pH, isotonicity, stability, and the like, is within 
the skill in the art A preferred pharmaceutical composition for intravenous, cutaneous, or 
subcutaneous injection should contain, in addition to protein or other active ingredient of the 
present invention, an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection, 

28 Dextrose Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or 

other vehicle as known in the art The pharmaceutical composition of the present invention may 
also contain stabilizers, preservatives, buffers, antioxidants, or other additives known to those of 
skill in the art. For injection, the agents of the invention may be formulated in aqueous 

32 solutions, preferably in physiologically compatible buffers such as Hanks's solution, Ringer's 
solution, or physiological saline buffer. For transmucosal administration, penetrants appropriate 
to the barrier to be permeated are used in the formulation. Such penetrants are generally known 
in the art 
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For oral administration, the compounds can be formulated readily by combining the 
active compounds with pharmaceutically acceptable carriers well known in the art. Such carriers 
enable the compounds of the invention to be formulated as tablets, pills, dragees, capsules, 
4 liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be 
treated. Pharmaceutical preparations for oral use can be obtained from a solid excipient, 
optionally grinding a resulting mixture, and processing the mixture of granules, after adding 
suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients are, in 
8 particular, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose 
preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, 
gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium 
carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, disintegrating agents 

12 may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt 
thereof such as sodium alginate. Dragee cores are provided with suitable coatings. Fortius 
purpose, concentrated sugar solutions may be used, which may optionally contain gum arabic, 
talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer 

16 solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be 

added to the tablets or dragee coatings for identification or to characterize different combinations 
of active compound doses. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 

20 gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or 

sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such as 
lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in 

24 suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 
stabilizers may be added. All formulations for oral administration should be in dosages suitable 
for such administration. For buccal administration, the compositions may take the form of 
tablets or lozenges formulated in conventional manner. 

28 For administration by inhalation, the compounds for use according to the present 

invention are conveniently delivered in the form of an aerosol spray presentation from 
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g. , 
dichlorodifluoromethane, tricUorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or 

32 other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined by 
providing a valve to deliver a metered amount. Capsules and cartridges of, e.g., gelatin for use 
in an inhaler or insufflator may be formulated containing a powder mix of the compound and a 
suitable powder base such as lactose or starch. The compounds may be formulated for parenteral 
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administration by injection, e.g., by bolus injection or continuous infusion. Formulations for 
injection may be presented in unit dosage form, e.g., in ampules or in multi-dose containers, with 
an added preservative. The compositions may take such forms as suspensions, solutions or 
emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, 
stabilizing and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous solutions of 
the active compounds in water-soluble form. Additionally, suspensions of the active compounds 
may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or 
vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or 
triglycerides, or liposomes. Aqueous injection suspensions may contain substances which 
increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Optionally, the suspension may also contain suitable stabilizers or agents which 
increase the solubility of the compounds to allow for the preparation of highly concentrated 
solutions. Alternatively, the active ingredient may be in powder form for constitution with a 
suitable vehicle, e.g., sterile pyrogen-free water, before use. 

The compounds may also be formulated in rectal compositions such as suppositories or 
retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other 
glycerides. In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 
implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as 
sparingly soluble derivatives, for example, as a sparingly soluble salt. 

A pharmaceutical carrier for the hydrophobic compounds of the invention is a co-solvent 
system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic polymer, and 
an aqueous phase. The co-solvent system may be the VPD co-solvent system. VPD is a solution 
of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 80, and 65% w/v 
polyethylene glycol 300, made up to volume in absolute ethanoL The VPD co-solvent system 
(VPD:5W) consists of VPD diluted 1:1 with a 5% dextrose in water solution. This co-solvent 
system dissolves hydrophobic compounds well, and itself produces low toxicity upon systemic 
administration. Naturally, the proportions of a co-solvent system may be varied considerably 
without destroying its solubility and toxicity characteristics. Furthermore, the identity of the 
co-solvent components may be varied: for example, other low-toxicity nonpolar surfactants may 
be used instead of polysorbate 80; the fraction size of polyethylene glycol may be varied; other 
biocompatible polymers may replace polyethylene glycol, e.g. polyvinyl pyrrolidone; and other 
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sugars or polysaccharides may substitute for dextrose. Alternatively, other delivery systems for 
hydrophobic pharmaceutical compounds may be employed. Liposomes and emulsions are well 
known examples of delivery vehicles or carriers for hydrophobic drugs. Certain organic solvents 
4 such as dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity. 
Additionally, the compounds may be delivered using a sustained-release system, such as 
semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent. 
Various types of sustained-release materials have been established and are well known by those 
8 skilled in the art. Sustained-release capsules may, depending on their chemical nature, release the 
compounds for a few weeks up to over 100 days. Depending on the chemical nature and the 
biological stability of the therapeutic reagent, additional strategies for protein or other active 
ingredient stabilization may be employed. 

12 The pharmaceutical compositions also may comprise suitable solid or gel phase carriers 

or excipients. Examples of such carriers or excipients include but are not limited to calcium 
carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and 
polymers such as polyethylene glycols. Many of the active ingredients of the invention may be 

16 provided as salts with phannaceutically compatible counter ions. Such pharmaceutically 
acceptable base addition salts are those salts which retain the biological effectiveness and 
properties of the free acids and which are obtained by reaction with inorganic or organic bases 
such as sodium hydroxide, magnesium hydroxide, ammonia, trialkylamine, dialkylamine, 

20 monoalkylamine, dibasic amino acids, sodium acetate, potassium benzoate, triethanol amine and 
the like. 

The pharmaceutical composition of the invention may be in the form of a complex of the 
protein(s) or other active ingredients) of present invention along with protein or peptide 

24 antigens. The protein and/or peptide antigen will deliver a stimulatory signal to both B and T 
lymphocytes. B lymphocytes will respond to antigen through their surface immunoglobulin 
receptor. T lymphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the antigen by MHC proteins. MHC and structurally related proteins including 

28 those encoded by class I and class H MHC genes on host cells will serve to present the peptide 
antigen(s) to T lymphocytes. The antigen components could also be supplied as purified 
MHC-peptide complexes alone or with co-stimulatory molecules that can directly signal T cells. 
Alternatively antibodies able to bind surface immunoglobulin and other molecules on B cells as 

32 well as antibodies able to bind the TCR and other molecules on T cells can be combined with the 
pharmaceutical composition of the invention. 

The pharmaceutical composition of the invention may be in the form of a liposome in 
which protein of the present invention is combined, in addition to other pharmaceutically 
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acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as 
micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. Suitable 
lipids for liposomal formulation include, without limitation, monogiycerides, diglycerides, 
4 sulfetides, lysolecithins, phospholipids, saponin, bile acids, and the like. Preparation of such 
liposomal formulations is within the level of skill in the art, as disclosed, for example, in U.S. 
Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of which are incorporated 
herein by reference. 

8 The amount of protein or other active ingredient of the present invention in the 

pharmaceutical composition of the present invention will depend upon the nature and severity of 
the condition being treated, and on the nature of prior treatments which the patient has 
undergone. Ultimately, the attending physician will decide the amount of protein or other active 

12 ingredient of the present invention with which to treat each individual patient Initially, the 

attending physician will administer low doses of protein or other active ingredient of the present 
invention and observe the patient's response. Larger doses of protein or other active ingredient 
of the present invention may be administered until the optimal therapeutic effect is obtained for 

16 the patient, and at that point the dosage is not increased further. It is contemplated that the 

various pharmaceutical compositions used to practice the method of the present invention should 
contain about 0.01 ng to about 100 mg (preferably about 0.1 \xg to about 10 mg, more preferably 
about 0.1 ng to about 1 mg) of protein or other active ingredient of the present invention per kg 

20 body weight. For compositions of the present invention which are useful for bone, cartilage, 
tendon or ligament regeneration, the therapeutic method includes administering the composition 
topically, systematically, or locally as an implant or device. When administered, the therapeutic 
composition for use in this invention is, of coiirse, in a pyrogen-free, physiologically acceptable 

24 form- Further, the composition may desirably be encapsulated or injected in a viscous form for 
delivery to the site of bone, cartilage or tissue damage. Topical administration may be suitable 
for wound healing and tissue repair. Therapeutically useful agents other than a protein or other 
active ingredient of the invention which may also optionally be included in the composition as 

28 described above, may alternatively or additionally, be administered simultaneously or 

sequentially with the composition in the methods of the invention. Preferably for bone and/or 
cartilage formation, the composition would include a matrix capable of delivering the 
protein-containing or other active ingredient-containing composition to the site of bone and/or 

32 cartilage damage, providing a structure for the developing bone and cartilage and optimally 

capable of being resorbed into the body. Such matrices may be formed of materials presently in 
use for other implanted medical applications. 
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The choice of matrix material is based on biocompatibility, biodegradability, mechanical 
properties, cosmetic appearance and interface properties. The particular application of the 
compositions will define the appropriate formulation. Potential matrices for the compositions 
4 may be biodegradable and chemically defined calcium sulfate, tricalcium phosphate, 

hydroxyapatite, polylactic acid, polyglycolic acid and polyanhydrides. Other potential materials 
are biodegradable and biologically well-defined, such as bone or dermal collagen. Further 
matrices are comprised of pure proteins or extracellular matrix components. Other potential 
8 matrices are nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, 
aluminates, or other ceramics. Matrices may be comprised of combinations of any of the above 
mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalcium phosphate. Hie bioceramics may be altered in composition, such as in 

12 c^cium-alumirate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and 
glycolic acid in the form of porous particles having diameters ranging from 1 50 to 800 microns. 
In some applications, it will be useful to utilize a sequestering agent, such as carboxymethyl 

16 cellulose or autologous blood clot, to prevent the protein compositions from disassociating from 
the matrix. 

A preferred family of sequestering agents is cellulosic materials such as alkylcelluloses 
(including hydroxyalkylcelluloses), including methylcellulose, ethylcellulose, 

20 hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl-methylcellulose, and 

carboxymethylcellulose, the most preferred being cationic salts of carboxymethylcellulose 
(CMC). Other preferred sequestering agents include hyaluronic acid, sodium alginate, 
poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl polymer and poly(vinyl alcohol). 

24 The amount of sequestering agent useful herein is 0.5-20 wt %, preferably 1-10 wt % based on 
total formulation weight, which represents the amount necessary to prevent desorption of the 
protein from the polymer matrix and to provide appropriate handling of the composition, yet not 
so much that the progenitor cells are prevented from infiltrating the matrix, thereby providing the 

28 protein the opportunity to assist the osteogenic activity of the progenitor cells. In further 

compositions, proteins or other active ingredients of the invention may be combined with other 
agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in 
question. These agents include various growth factors such as epidermal growth factor (EGF), 

32 platelet derived growth factor (PDGF), transforming growth factors (TGF-a and TGF-0), and 
insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary applications. 
Particularly domestic animals and thoroughbred horses, in addition to humans, are desired 
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patients for such treatment with proteins or other active ingredients of the present invention. The 
dosage regimen of a protein-containing pharmaceutical composition to be used in tissue 
regeneration will be determined by the attending physician considering various factors which 
4 modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site of 
damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue (e.g., 
bone), the patient's age, sex, and diet, the severity of any infection, time of administration and 
other clinical factors. The dosage may vary with the type of matrix used in the reconstitution 
8 and with inclusion of other proteins in the pharmaceutical composition. For example, the 

addition of other known growth factors, such as IGF I (insulin like growth factor I), to the final 
composition, may also effect the dosage. Progress can be monitored by periodic assessment of 
tissue/bone growth and/or repair, for example, X-rays, histomorphometric determinations and 

1 2 tetracycline labeling. 

Polynucleotides of the present invention can also be used for gene therapy.. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject Polynucleotides of the invention may also be administered by other known 

16 methods for introduction of nucleic acid into a cell or organism (including, without limitation, in 
the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. 

20 

4.12 3 EFFECTIVE DOSAGE 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to achieve its 

24 intended purpose. More specifically, a therapeutically effective amount means an amount 
effective to prevent development of or to alleviate the existing symptoms of the subject being 
treated. Determination of the effective amount is well within the capability of those skilled in 
the art, especially in light of the detailed disclosure provided herein. For any compound used in 

28 the method of the invention, the therapeutically effective dose can be estimated initially from 
appropriate in vitro assays. For example, a dose can be formulated in animal models to achieve a 
circulating concentration range that can be used to more accurately determine useful doses in 
humans. For example, a dose can be formulated in animal models to achieve a circulating 

32 concentration range that includes the IC 50 as determined in cell culture (i.e., the concentration of 
the test compound which achieves a half-maximal inhibition of the protein's biological activity). 
Such information can be used to more accurately determine useful doses in humans. 
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A therapeutically effective dose refers to that amount of the compound that results in 
amelioration of symptoms or a prolongation of survival in a patient. Toxicity and therapeutic 
efficacy of such compounds can be determined by standard pharmaceutical procedures in cell 
4 cultures or experimental animals, e.g., for determining the LD50 (the dose lethal to 50% of the 
population) and the ED50 (the dose therapeutically effective in 50% of the population). The dose 
ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the 
ratio between LD50 and ED50. Compounds which exhibit high therapeutic indices are preferred. 
8 The data obtained from these cell culture assays and animal studies can be used in formulating a 
range of dosage for use in human. The dosage of such compounds lies preferably within a range 
of circulating concentrations that include the ED50 with little or no toxicity. The dosage may 
vary within this range depending upon the dosage form employed and the route of administration 

12 utilized. The exact formulation, route of administration and dosage can be chosen by the 
individual physician in view of the patienf s condition. See, e.g., Fingl et aL, 1975, in "The 
Pharmacological Basis of Therapeutics", Ch. 1 p.l. Dosage amount and interval may be adjusted 
individually to provide plasma levels of the active moiety which are sufficient to maintain the 

16 desired effects, or minimal effective concentration (MEC). The MEC will vary for each 

compound but can be estimated from in vitro data. Dosages necessary to achieve the MEC will 
depend on individual characteristics and route of administration. However, HPLC assays or 
bioassays can be used to determine plasma concentrations. 

20 Dosage intervals can also be determined using MEC value. Compounds should be 

administered using a regimen which maintains plasma levels above the MEC for 10-90% of the 
time, preferably between 30-90% and most preferably between 50-90%. In cases of local 
administration or selective uptake, the effective local concentration of the drug may not be 

24 related to plasma concentration. 

An exemplary dosage regimen for polypeptides or other compositions of the invention 
will be in the range of about 0.01 jig/kg to 100 mg/kg of body weight daily, with the preferred 
dose being about 0. 1 p.g/kg to 25 mg/kg of patient body weight daily, varying in adults and 

28 children. Dosing may be once daily, or equivalent doses may be delivered at longer or shorter 
intervals. 

Hie amount of composition administered will, of course, be dependent on the subject 
being treated, on the subject's age and weight, the severity of the affliction, the manner of 
32 administration and the judgment of the prescribing physician. 

4.12.4 PACKAGING 
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The compositions may, if desired, be presented in a pack or dispenser device which may 
contain one or more unit dosage forms containing the active ingredient. The pack may, for 
example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may 
be accompanied by instructions for administration. Compositions comprising a compound of the 
invention formulated in a compatible pharmaceutical carrier may also be prepared, placed in an 
appropriate container, and labeled for treatment of an indicated condition. 

4.13 ANTIBODIES 

Also included in the invention are antibodies to proteins, or fragments of proteins of the 
invention. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that contain 
an antigen binding site that specifically binds (immunoreacts with) an antigen. Such antibodies 
include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, Fab, Fab' and F^b^ 
fragments, and an Fab expression library. In general, an antibody molecule obtained from 
humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ from one another 
by the nature of the heavy chain present in the molecule. Certain classes have subclasses as well, 
such as IgG], IgG2, and others. Furthermore, in humans, the light chain may be a kappa chain or 
a lambda chain. Reference herein to antibodies includes a reference to all such classes, 
subclasses and types of human antibody species. 

An isolated related protein of the invention may be intended to serve as an antigen, or a 
portion or fragment thereof, and additionally can be used as an immunogen to generate 
antibodies that immunospecifically bind the antigen, using standard techniques for polyclonal 
and monoclonal antibody preparation. The full-length protein can be used or, alternatively, the 
invention provides antigenic peptide fragments of the antigen for use as immunogens. An 
antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid sequence 
of the full length protein, (for example the amino acid sequence shown in SEQ ID NO: 1351), 
and encompasses an epitope thereof such that an antibody raised against the peptide forms a 
specific immune complex with the full length protein or with any fragment that contains the 
epitope. Preferably, the antigenic peptide comprises at least 10 amino acid residues, or at least 
1 5 amino acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues. 
Preferred epitopes encompassed by the antigenic peptide are regions of the protein that are 
located on its surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of -related protein that is located on the surface of the protein, e.g., a 
hydrophilic region. A hydrophobicity analysis of the human related protein sequence will 
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indicate which regions of a related protein are particularly hydrophilic and, therefore, are likely 
to encode surface residues useful for targeting antibody production. As a means for targeting 
antibody production, hydropathy plots showing regions of hydrophilicity and hydrophobicity 
4 may be generated by any method well known in the art, including, for example, the Kyte 

Doolittle or the Hopp Woods methods, either with or without Fourier transformation. See, e.g., 
Hopp and Woods, 1981, Proc. Nat. Acad Set USA 78: 3824-3828; Kyte and Doolittle 1982, J. 
Mol. Biol. 157: 105-142, each of which is incorporated herein by reference in its entirety. 
8 Antibodies that are specific for one or more domains within an antigenic protein, or derivatives, 
fragments, analogs or homologs thereof, are also provided herein. 

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 
thereof, may be utilized as an immunogen in the generation of antibodies that 
12 immunospecifically bind these protein components. 

Various procedures known within the art may be used for the production of polyclonal or 
monoclonal antibodies directed against a protein of the invention, or against derivatives, 
fragments, analogs homologs or orthologs thereof (see, for example, Antibodies: A Laboratory 
16 Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, Cold Spring 

Harbor, NY, incorporated herein by reference). Some of these antibodies are discussed below. 

5.13.1 Polyclonal Antibodies 

20 For the production of polyclonal antibodies, various suitable host animals (e.g. , rabbit, 

goat, mouse or other mammal) may be immunized by one or more injections with the native 
protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, the naturally occurring immunogenic 

24 protein, a chemically synthesized polypeptide representing the immunogenic protein, or a 

recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated to 
a second protein known to be immunogenic in the mammal being immunized. Examples of such 
immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, 

28 bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can further include an 
adjuvant. Various adjuvants used to increase the immunological response include, but are not 
limited to, Freund's (complete and incomplete), mineral gels (e.g., aluminum hydroxide), surface 
active substances (e.g 7 iysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 

32 dinitrophenol, etc.), adjuvants usable in humans such as Bacille Calmette-Guerin and 

Corynebacterium parvum, or similar immunostimulatory agents. Additional examples of 
adjuvants which can be employed include MPL-TDM adjuvant (monophosphoryl Lipid A, 
synthetic trehalose dicorynomycolate). 



75 



WO 01/64835 



PCT/US01/04927 



The polyclonal antibody molecules directed against the immunogenic protein can be 
isolated from the mammal (e.g., from the blood) and further purified by well known techniques, 
such as affinity chromatography using protein A or protein G, which provide primarily the IgG 

4 fraction of immune serum. Subsequently, or alternatively, the specific antigen which is the 
target of the immunoglobulin sought, or an epitope thereof, may be immobilized on a column to 
purify the immune specific antibody by immunoafiBnity chromatography. Purification of 
immunoglobulins is discussed, for example, by D. Wilkinson (The Scientist, published by The 

8 Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 2000), pp. 25-28). 

5.13.2 Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as used 

12 herein, refers to a population of antibody molecules that contain only one molecular species of 
antibody molecule consisting of a unique light chain gene product and a unique heavy chain 
gene product. In particular, the complementarity determining regions (CDRs) of the monoclonal 
antibody are identical in all the molecules of the population. MAbs thus contain an antigen 

16 binding site capable of immunoreacting with a particular epitope of the antigen characterized by 
a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature. 256:495 (1975). In a hybridoma method, a mouse, 

20 hamster, or other appropriate host animal, is typically immunized with an immunizing agent to 
elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind 
to the immunizing agent Alternatively, the lymphocytes can be immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment thereof or a 

24 fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of human 
origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources 
are desired. The lymphocytes are then fused with an immortalized cell line using a suitable 
fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal 

28 Antibodies: Principles and Practice, Academic Press, (1986) pp. 59-103). Immortalized cell 

lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and 
human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells can 
be cultured in a suitable culture medium that preferably contains one or more substances that 

32 inhibit the growth or survival of the unfused, immortalized cells. For example, if the parental 
cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the 
culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and 
thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient cells. 
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Preferred immortalized cell lines are those that fuse efficiently, support stable high level 
expression of antibody by the selected antibody-producing cells, and are sensitive to a medium 
such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which 

4 can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, 

California and die American Type Culture Collection, Manassas, Virginia. Human myeloma and 
mouse-human heteromyeloma cell lines also have been described for the production of human 
monoclonal antibodies (Kozbor, J. Immunol.. 133:3001 (1984); Brodeur et aL, Monoclonal 

8 Antibody Production Techniques and Applications, Marcel Dekker, Inc., New York, (1987) pp. 
51-63). 

The culture medium in which the hybridoma cells are cultured can then be assayed for 
the presence of monoclonal antibodies directed against the antigen. Preferably, the binding 

12 specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 
enzyme-linked immunoabsorbent assay (ELIS A). Such techniques and assays are known in the 
art. The binding affinity of the monoclonal antibody can, for example, be determined by the 

16 Scatchard analysis of Munson and Pollard Anal. Biochem.. 107:220 (1980\ Preferably, 

antibodies having a high degree of specificity and a high binding affinity for the target antigen 
are isolated. 

After the desired hybridoma cells are identified, the clones can be subcloned by limiting 

20 dilution procedures and grown by standard methods. Suitable culture media for this purpose 
include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. 
Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from the 

24 culture medium or ascites fluid by conventional immunoglobulin purification procedures such 
as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, 
dialysis, or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such as 

28 those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the 
invention can be readily isolated and sequenced using conventional procedures (e.g., by using 
oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and 
light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred 

32 source of such DNA. Once isolated, the DNA can be placed into expression vectors, which are 
then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or 
myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of 
monoclonal antibodies in the recombinant host cells. The DNA also can be modified, for 
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example, by substituting the coding sequence for human heavy and light chain constant domains 
in place of the homologous murine sequences (U.S. Patent No. 4,8 16,567; Morrison, Nature 368, 
812-13 (1994)) or by covalently joining to the immunoglobulin coding sequence all or part of the 
coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 
polypeptide can be substituted for the constant domains of an antibody of the invention, or can 
be substituted for the variable domains of one antigen-combining site of an antibody of the 
invention to create a chimeric bivalent antibody. 

5.13.2 Humanized Antibodies 

The antibodies directed against the protein antigens of the invention can further comprise 
humanized antibodies or human antibodies. These antibodies are suitable for administration to 
humans without engendering an immune response by the human against the administered 
immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 or other antigen- 
binding subsequences of antibodies) that are principally comprised of the sequence of a human 
immunoglobulin, and contain minimal sequence derived from a non-human immunoglobulin. 
Humanization can be performed following the method of Winter and co-workers (Jones et al., 
Nature. 321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al., 
Science. 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. (See also U.S. Patent No. 5,225,539.) In some 
instances, Fv framework residues of the human immunoglobulin are replaced by corresponding 
non-human residues. Humanized antibodies can also comprise residues which are found neither 
in the recipient antibody nor in the imported CDR or framework sequences. In general, the 
humanized antibody will comprise substantially all of at least one, and typically two, variable 
domains, in which all or substantially all of the CDR regions correspond to those of a non-human 
immunoglobulin and all or substantially all of the framework regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at 
least a portion of an immunoglobulin constant region (Fc), typically that of a human 
immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. Struct. BioL 
2:593-596 (1992)). 

5.133 Human Antibodies 

Fully human antibodies relate to antibody molecules in which essentially the entire 
sequences of both the light chain and the heavy chain, including the CDRs, arise from human 
genes. Such antibodies are termed "human antibodies", or "fully human antibodies" herein. 
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Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell 
hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma 
technique to produce human monoclonal antibodies (see Cole, et aL, 1985 In: Monoclonal 
4 Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human monoclonal 

antibodies may be utilized in the practice of the present invention and may be produced by using 
human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80: 2026-2030) or by 
transforming human B-cells with Epstein Ban Virus in vitro (see Cole, et al., 1985 In: 
8 Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, 
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991); 
Marks et al., J. Mol. Biol, 222:581 (1991)). Similarly, human antibodies can be made by 

12 introducing human immunoglobulin loci into transgenic animals, e.g. , mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in humans 
in all respects, including gene rearrangement, assembly, and antibody repertoire. This approach 

16 is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 

5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10, 779-783 (1992)); Lonberg et al. 
(Nature 368 856-859 (1994)); Morrison ( Nature 368. 812-13 (1994)); Fishwild et al,( Nature 
Biotechnology 14, 845-51 (1996)); Neuberger (Nature Biotechnology 14, 826 (1996)); and 

20 Lonberg and Huszar (Intern. Rev. Immunol 13 65-93 (1995)). 

Human antibodies may additionally be produced using transgenic nonhuman animals 
which are modified so as to produce fully human antibodies rather than the animal's endogenous 
antibodies in response to challenge by an antigen. (See PCT publication WO94/02602). The 

24 endogenous genes encoding the heavy and light immunoglobulin chains in the nonhuman host 
have been incapacitated, and active loci encoding human heavy and light chain immunoglobulins 
are inserted into the host's genome. The human genes are incorporated, for example, using yeast 
artificial chromosomes containing the requisite human DNA segments. An animal which 

28 provides all the desired modifications is then obtained as progeny by crossbreeding intermediate 
transgenic animals containing fewer than the full complement of the modifications. The 
preferred embodiment of such a nonhuman animal is a mouse, and is termed the Xenomouse™ 
as disclosed in PCT publications WO 96/33735 and WO 96/34096. This animal produces B 

32 cells which secrete fully human immunoglobulins. The antibodies can be obtained directly from 
the animal after immunization with an immunogen of interest, as, for example, a preparation of a 
polyclonal antibody, or alternatively from immortalized B cells derived from the animal, such as 
hybridomas producing monoclonal antibodies. Additionally, the genes encoding the 
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immunoglobulins with human variable regions can be recovered and expressed to obtain the 

antibodies directly, or can be fiirther modified to obtain analogs of antibodies such as, for 

example, single chain Fv molecules. 
4 An example of a method of producing a nonhuman host, exemplified as a mouse, lacking 

expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent No. 

5,939,598. It can be obtained by a method including deleting the J segment genes from at least 

one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of the 
8 locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy' chain locus, 

the deletion being effected by a targeting vector containing a gene encoding a selectable marker; 

and producing from the embryonic stem cell a transgenic mouse whose somatic and germ cells 

contain the gene encoding the selectable marker. 
12 A method for producing an antibody of interest, such as a human antibody, is disclosed in 

U.S. Patent No. 5,91 6,77 1 . It includes introducing an expression vector that contains a 

nucleotide sequence encoding a heavy chain into one mammalian host cell in culture, introducing 

an expression vector containing a nucleotide sequence encoding a light chain into another 
16 mammalian host cell, and fusing the two cells to form a hybrid cell. The hybrid cell expresses an 

antibody containing the heavy chain and the light chain. 

In a further improvement on this procedure, a method for identifying a clinically relevant 

epitope on an immunogen, and a correlative method for selecting an antibody that binds 
20 immunospecifically to the relevant epitope with high affinity, are disclosed in PCT publication 

WO 99/53049. 

5.13.4 Fab Fragments and Single Chain Antibodies 

24 According to the invention, techniques can be adapted for the production of single-chain 

antibodies specific to an antigenic protein of the invention (see e.g. , U.S. Patent No. 4,946,778). 
In addition, methods can be adapted for the construction of Fab expression libraries (see e.g., 
Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and effective identification of 

28 monoclonal Fab fragments with the desired specificity for a protein or derivatives, fragments, 

analogs or homologs thereof. Antibody fragments that contain the idiotypes to a protein antigen 
may be produced by techniques known in the art including, but not limited to: (i) an F^ 
fragment produced by pepsin digestion of an antibody molecule; (ii) an F a b fragment generated 

32 by reducing the disulfide bridges of an F( a b f )2 fragment; (iii) an F a b fragment generated by the 
treatment of the antibody molecule with papain and a reducing agent and (iv) F v fragments. 

5.13.5 Bispecific Antibodies 
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Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that 
have binding specificities for at Least two different antigens. In the present case, one of the 
binding specificities is for an antigenic protein of the invention. The second binding target is any 

4 other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit 
Methods for making bispecific antibodies are known in the art Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
8 specificities (Miistein and Cuello, Nature, 305:537-539 (1983)). Because of the random 

assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a 
potential mixture of ten different antibody molecules, of which only one has the correct 
bispecific structure. The purification of the correct molecule is usually accomplished by affinity 

12 chromatography steps. Similar procedures are disclosed in WO 93/08829, published 13 May 
1993,andinTrauneckerefa/., 1991 EMBOJ., 10:3655-3659. 

Antibody variable domains with the desired binding specificities (antibody-antigen 
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 

16 preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part of 
the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region 
(CHI) containing the site necessary for light-chain binding present in at least one of the fusions. 
DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin 

20 light chain, are inserted into separate expression vectors, and are co-transfected into a suitable 
host organism. For further details of generating bispecific antibodies see, for example, Suresh et 
aL, Methods in Enzvmology, 121:210 (1986). 

According to another approach described in WO 96/2701 1, the interface between a pair 

24 of antibody molecules can be engineered to maximize the percentage of heterodimers which are 
recovered from recombinant cell culture. The preferred interface comprises at least a part of the 
CH3 region of an antibody constant domain. In this method, one or more small amino acid side 
chains from the interface of the first antibody molecule are replaced with larger side chains {e.g. 

28 tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the large side 
chain(s) are created on the interface of the second antibody molecule by replacing large amino 
acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for 
increasing the yield of the heterodimer over other unwanted end-products such as homodimers. 

32 Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. 

F(ab')2 bispecific antibodies). Techniques for generating bispecific antibodies from antibody 
fragments have been described in the literature. For example, bispecific antibodies can be 
prepared using chemical linkage. Brennan et aL, Science 229:81 (1985) describe a procedure 
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wherein intact antibodies are proteolytically cleaved to generate F(ab')2 fragments. These 
fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to 
stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments 
4 generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB 
derivatives is then reconverted to the Fab' -thiol by reduction with mercaptoethylamine and is 
mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific 
antibody. The bispecific antibodies produced can be used as agents for the selective 
8 immobilization of enzymes. 

Additionally, Fab' fragments can be directly recovered from E. coli and chemically 
coupled to form bispecific antibodies. Shalaby et aL, J. Exp. Med. 175:217-225 (1992) describe 
the production of a fully humanized bispecific antibody F(ab') 2 molecule. Each Fab' fragment 

12 was separately secreted from E. coli and subjected to directed chemical coupling in vitro to form 
the bispecific antibody. The bispecific antibody thus formed was able to bind to cells 
overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic activity 
of human cytotoxic lymphocytes against human breast tumor targets. 

16 Various techniques for making and isolating bispecific antibody fragments directly from 

recombinant cell culture have also been described. For example, bispecific antibodies have been 
produced using leucine zippers. Kostelny et aL, J. Immunol. 148(5): 1547-1553 (1992). The 
leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two 

20 different antibodies by gene fusion. The antibody homodimers were reduced at the hinge region 
to form monomers and then re-oxidized to form the antibody heterodimers. This method can 
also be utilized for the production of antibody homodimers. The "diabody" technology 
described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an 

24 alternative mechanism for making bispecific antibody fragments. The fragments comprise a 
heavy-chain variable domain (Vh) connected to a light-chain variable domain (V L ) by a linker 
which is too short to allow pairing between the two domains on the same chain. Accordingly, 
the Vh and Vl domains of one fragment are forced to pair with the complementary V L and Vh 

28 domains of another fragment, thereby forming two antigen-binding sites. Another strategy for 
making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been 
reported. See, Gruber et al., J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific 

32 antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1 99 1). 

Exemplary bispecific antibodies can bind to two different epitopes, at least one of which 
originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm of an 
immunoglobulin molecule can be combined with an arm which binds to a triggering molecule on 
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a leukocyte such as a T-cell receptor molecule (e.g. CD2, CDS, CD28, or B7), or Fc receptors for 
IgG(Fc R),suchasFc RI(CD64),Fc RII (CD32) and Fc RIII(CD16)soastofocus 
cellular defense mechanisms to the cell expressing the particular antigen. Bispecific antibodies 
4 can also be used to direct cytotoxic agents to cells which express a particular antigen. These 
antibodies possess an antigen-binding arm and an arm which binds a cytotoxic agent or a 
radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody 
of interest binds the protein antigen described herein and further binds tissue factor (TF). 

8 

5.13.6 Heteroconjugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies 

12 have, for example, been proposed to target immune system cells to unwanted cells (U.S. Patent 
No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 92/200373; EP 03089). 
It is contemplated that the antibodies can be prepared in vitro using known methods in synthetic 
protein chemistry, including those involving crosslinking agents. For example, immunotoxins 

16 can be constructed using a disulfide exchange reaction or by forming a thioether bond. 
Examples of suitable reagents for this purpose include iminothiolate and methyl-4- 
meiraptobutyrimidate and those disclosed* for example, in U.S. Patent No. 4,676,980. 

20 5.13.7 Effector Function Engineering 

It can be desirable to modify the antibody of the invention with respect to effector function, so as 
to enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine 
residue(s) can be introduced into the Fc region, thereby allowing interchain disulfide bond 

24 formation in this region. The homodimeric antibody thus generated can have improved 
internalization capability and/or increased complement-mediated cell killing and antibody- 
dependent cellular cytotoxicity (ADCC). See Caron et aL, J. Exp Med., 176: 1 191-1 195 (1992) 
and Shopes, J. Immunol., 148: 2918-2922 (1992). Homodimeric antibodies with enhanced anti- 

28 tumor activity can also be prepared using heterobifunctional cross-linkers as described in Wolff 
et al. Cancer Research, 53: 2560-2565 (1993). Alternatively, an antibody can be engineered that 
has dual Fc regions and can thereby have enhanced complement lysis and ADCC capabilities. 
See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989). 

32 

5.13.8 Immunoconjugates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a 
cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of 
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bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (Le., a 
radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have been 
4 described above. Enzymatically active toxins and fragments thereof that can be used include 
diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain (from 
Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, 
Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPII, and 
8 PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, 
mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of 
radionuclides are available for the production of radioconjugated antibodies. Examples include 

12 Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional 

protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), 
iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), 
active esters (such as disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido 

16 compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as 
bis-(p-dia2X>niiimbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), 
and bis-active fluorine compounds (such as l,5-difluoro-2,4-dinitroben2ene). For example, a 
ricin immunotoxin can be prepared as described in Vitetta et aL, Science, 238: 1098 (1987). 

20 Carbon- 14-labeled l-isothiocyanatobenzyl-3-methyidiethylene triaminepentaacetic acid (MX- 
DTPA) is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See 
WO94/11026. 

In another embodiment, the antibody can be conjugated to a "receptor" (such 
24 streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is 
administered to the patient, followed by removal of unbound conjugate from the circulation 
using a clearing agent and then administration of a "ligand" (e.g. , avidin) that is in turn 
conjugated to a cytotoxic agent. 

28 

4,14 COMPUTER READABLE SEQUENCES 

In one application of this embodiment, a nucleotide sequence of the present invention can 
be recorded on computer readable media. As used herein, "computer readable media" refers to 
32 any medium which can be read and accessed directly by a computer. Such media include, but 
are not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, and 
magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM 
and ROM; and hybrids of these categories such as magnetic/optical storage media. A skilled 
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artisan can readily appreciate how any of the presently known computer readable mediums can 
be used to create a manufacture comprising computer readable medium having recorded thereon 
a nucleotide sequence of the present invention. As used herein, "recorded 11 refers to a process for 

4 storing information on computer readable medium. A skilled artisan can readily adopt any of the 
presently known methods for recording information on computer readable medium to generate 
manufactures comprising the nucleotide sequence information of the present invention. 
A variety of data storage structures are available to a skilled artisan for creating a 
8 computer readable medium having recorded thereon a nucleotide sequence of the present 

invention. The choice of the data storage structure will generally be based on the means chosen 
to access the stored information. In addition, a variety of data processor programs and formats 
can be used to store the nucleotide sequence information of the present invention on computer 

12 readable medium. The sequence information can be represented in a word processing text file, 
formatted in commercially-available software such as WordPerfect and Microsoft Word, or 
represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase, 
Oracle, or the like. A skilled artisan can readily adapt any number of data processor structuring 

16 formats (e.g. text file or database) in order to obtain computer readable medium having recorded 
thereon the nucleotide sequence information of the present inventioa 

By providing any of the nucleotide sequences SEQ ID NO: 1-13901 or a representative 
fragment thereof; or a nucleotide sequence at least 95% identical to any of the nucleotide 

20 sequences of SEQ ID NO: 1-1 390 1 in computer readable form, a skilled artisan can routinely 
access the sequence information for a variety of purposes. Computer software is publicly 
available which allows a skilled artisan to access sequence information provided in a computer 
readable medium. The examples which follow demonstrate how software which implements the 

24 BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. 
Chem. 17:203-207 (1993)) search algorithms on a Sybase system is used to identify open reading 
frames (ORFs) within a nucleic acid sequence. Such ORFs may be protein encoding fragments 
and may be useful in producing commercially important proteins such as enzymes used in 

28 fermentation reactions and in the production of commercially useful metabolites. 

As used herein, "a computer-based system" refers to the hardware means, software 
means, and data storage means used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware means of the computer-based systems of the present 

32 invention comprises a central processing unit (CPU), input means, output means, and data 
storage means. A skilled artisan can readily appreciate that any one of the currently available 
computer-based systems are suitable for use in the present invention. As stated above, the 
computer-based systems of the present invention comprise a data storage means having stored 
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therein a nucleotide sequence of the present invention and the necessary hardware means and 
software means for supporting and implementing a search means. As used herein, "data storage 
means" refers to memory which can store nucleotide sequence information of the present 
4 invention, or a memory access means which can access manufactures having recorded thereon 
the nucleotide sequence information of the present invention. 

As used herein, "search means" refers to one or more programs which are implemented 
on the computer-based system to compare a target sequence or target structural motif with the 
8 sequence information stored within the data storage means. Search means are used to identify 
fragments or regions of a known sequence which match a particular target sequence or target 
motif. A variety of known algorithms are disclosed publicly and a variety of commercially 
available software for conducting search means are and can be used in the computer-based 

12 systems of the present invention. Examples of such software includes, but is not limited to, 
Smith-Waterman, MacPattern (EMBL), BLASTN and BLASTA (NPOLYPEPTTOEIA). A 
skilled artisan can readily recognize that any one of the available algorithms or implementing 
software packages for conducting homology searches can be adapted for use in the present 

16 computer-based systems. As used herein, a "target sequence" can be any nucleic acid or amino 
acid sequence of six or more nucleotides or two or more amino acids. A skilled artisan can 
readily recognize that the longer a target sequence is, the less likely a target sequence will be 
present as a random occurrence in the database. The most preferred sequence length of a target 

20 sequence is from about 10 to 300 amino acids, more preferably from about 30 to 100 nucleotide 
residues. However, it is well recognized that searches for commercially important fragments, 
such as sequence fragments involved in gene expression and protein processing, may be of 
shorter length. 

24 As used herein, "a target structural motif," or "target motif," refers to any rationally 

selected sequence or combination of sequences in which the sequence(s) are chosen based on a 
three-dimensional configuration which is formed upon the folding of the target motif. There are 
a variety of target motifs known in the art. Protein target motifs include, but are not limited to, 

28 enzyme active sites and signal sequences. Nucleic acid target motifs include, but are not limited 
to, promoter sequences, hairpin structures and inducible expression elements (protein binding 
sequences). 

32 4.15 TRIPLE HELIX FORMATION 

In addition, the fragments of the present invention, as broadly described, can be used to 
control gene expression through triple helix formation or antisense DNA or RNA, both of which 
methods are based on the binding of a polynucleotide sequence to DNA or RNA. 
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Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in length and are 
designed to be complementary to a region of the gene involved in transcription (triple helix - see 
Lee et at, NucL Acids Res. 6:3073 (1979); Cooney et al, Science 15241:456 (1988); and Dervan 

4 et al., Science 251:1360 (1991)) or to the mRNA itself (antisense - Olmno, J. Neurochem. 

56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA 
transcription from DNA, while antisense RNA hybridization blocks translation of an mRNA 

8 molecule into polypeptide. Both techniques have been demonstrated to be effective in model 
systems. Information contained in the sequences of the present invention is necessary for the 
design of an antisense or triple helix oligonucleotide. 

12 4.16 DIAGNOSTIC ASSAYS AND KITS 

The present invention further provides methods to identify the presence or expression of 
one of the ORFs of the present invention, or homolog thereof, in a test sample, using a nucleic 
acid probe or antibodies of the present invention, optionally conjugated or otherwise associated 

1 6 with a suitable label. 

In general, methods for detecting a polynucleotide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the polynucleotide 
for a period sufficient to form the complex, and detecting the complex, so that if a complex is 

20 detected, a polynucleotide of the invention is detected in the sample. Such methods can also 
comprise contacting a sample under stringent hybridization conditions with nucleic acid primers 
that anneal to a polynucleotide of the invention under such conditions, and amplifying annealed 
polynucleotides, so that if a polynucleotide is amplified, a polynucleotide of the invention is 

24 detected in the sample. 

In general, methods for detecting a polypeptide of the invention can comprise contacting 
a sample with a compound that binds to and forms a complex with the polypeptide for a period 
sufficient to form the complex, and detecting the complex, so that if a complex is detected, a 

28 polypeptide of the invention is detected in the sample. 

In detail, such methods comprise incubating a test sample with one or more of the 
antibodies or one or more of the nucleic acid probes of the present invention and assaying for 
binding of the nucleic acid probes or antibodies to components within the test sample. 

32 Conditions for incubating a nucleic acid probe or antibody with a test sample vary. 

Incubation conditions depend on the format employed in the assay, the detection methods 
employed, and the type and nature of the nucleic acid probe or antibody used in the assay. One 
skilled in the art will recognize that any one of the commonly available hybridization, 
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amplification or immunological assay formats can readily be adapted to employ the nucleic acid 
probes or antibodies of the present invention. Examples of such assays can be found in Chard, 
T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers, 
4 Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in Immunocytochemistry, 
Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice 
and Theory of immunoassays: Laboratory Techniques in Biochemistry and Molecular Biology, 
Elsevier Science Publishers, Amsterdam, The Netherlands (1 985). The test samples of the 
8 present invention include cells, protein or membrane extracts of cells, or biological fluids such as 
sputum, blood, serum, plasma, or urine. The test sample used in the above-described method 
will vary based on the assay format, nature of the detection method and the tissues, cells or 
extracts used as the sample to be assayed. Methods for preparing protein extracts or membrane 

12 extracts of cells are well known in the art and can be readily be adapted in order to obtain a 
sample which is compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain the 
necessary reagents to carry out the assays of the present invention. Specifically, the invention 

16 provides a compartment kit to receive, in close confinement, one or more containers which 
comprises: (a) a first container comprising one of the probes or antibodies of the present 
invention; and (b) one or more other containers comprising one or more of the following: wash 
reagents, reagents capable of detecting presence of abound probe or antibody. 

20 In detail, a compartment kit includes any kit in which reagents are contained in separate 

containers. Such containers include small glass containers, plastic containers or strips of plastic 
or paper. Such containers allows one to efficiently transfer reagents from one compartment to 
another compartment such that the samples and reagents are not cross-contaminated, and the 

24 agents or solutions of each container can be added in a quantitative fashion from one 

compartment to another. Such containers will include a container which will accept the test 
sample, a container which contains the antibodies used in the assay, containers which contain 
wash reagents (such as phosphate buffered saline, Tris-buffers, etc.), and containers which 

28 contain the reagents used to detect the bound antibody or probe. Types of detection reagents 
include labeled nucleic acid probes, labeled secondary antibodies, or in the alternative, if the 
primary antibody is labeled, the enzymatic, or antibody binding reagents which are capable of 
reacting with the labeled antibody. One skilled in the art will readily recognize that the disclosed 

32 probes and antibodies of the present invention can be readily incorporated into one of the 
established kit formats which are well known in the art. 

4.17 MEDICAL IMAGING 
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The novel polypeptides and binding partners of the invention are useful in medical 
imaging of sites expressing the molecules of the invention (e.g., where the polypeptide of the 
invention is involved in the immune response, for imaging sites of inflammation or infection). 
4 See, e.&, Kunkel et aL, U.S. Pat NO. 5,41 3,778. Such methods involve chemical attachment of 
a labeling or imaging agent, administration of the labeled polypeptide to a subject in a 
pharmaceutically acceptable carrier, and imaging the labeled polypeptide in vivo at the target 
site. 

8 

4.18 SCREENING ASSAYS 

Using the isolated proteins and polynucleotides of the invention, the present invention 
further provides methods of obtaining and identifying agents which bind to a polypeptide 
12 encoded by an ORF corresponding to any of the nucleotide sequences set forth in SEQ ID NO: 
1-13901 , or bind to a specific domain of the polypeptide encoded by the nucleic acid. In detail, 
said method comprises the steps of: 

(a) contacting an agent with an isolated protein encoded by an ORF of the present 
16 invention, or nucleic acid of the invention; and 

(b) determining whether the agent binds to said protein or said nucleic acid. 
In general, therefore, such methods for identifying compounds that bind to a 

polynucleotide of the invention can comprise contacting a compound with a polynucleotide of 
20 the invention for a time sufficient to form a polynucleotide/compound complex, and detecting 

the complex, so that if a polynucleotide/compound complex is detected, a compound that binds 

to a polynucleotide of the invention is identified. 

Likewise, in general, therefore, such methods for identifying compounds that bind to a 
24 polypeptide of the invention can comprise contacting a compound with a polypeptide of the 

invention for a time sufficient to form a polypeptide/compound complex, and detecting the 

complex, so that if a polypeptide/compound complex is detected, a compound that binds to a 

polynucleotide of the invention is identified. 
28 Methods for identifying compounds that bind to a polypeptide of the invention can also 

comprise contacting a compound with a polypeptide of the invention in a cell for a time 

sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a 

receptor gene sequence in the cell, and detecting the complex by detecting reporter gene 
32 sequence expression, so that if a polypeptide/compound complex is detected, a compound that 

binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include compounds which modulate the 

activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to 
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activity observed in the absence of the compound). Alternatively, compounds identified via such 
methods can include compounds which modulate the expression of a polynucleotide of the 
invention (that is, increase or decrease expression relative to expression levels observed in the 
4 absence of the compound). Compounds, such as compounds identified via the methods of the 
invention, can be tested using standard assays well known to those of skill in the art for their 
ability to modulate activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
8 carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be selected 
and screened at random or rationally selected or designed using protein modeling techniques. 

For random screening, agents such as peptides, carbohydrates, pharmaceutical agents and 
the like are selected at random and are assayed for their ability to bind to the protein encoded by 

12 the ORF of the present invention. Alternatively, agents may be rationally selected or designed. 
As used herein, an agent is said to be "rationally selected or designed" when the agent is chosen 
based on the configuration of the particular protein. For example, one skilled in the art can 
readily adapt currently available procedures to generate peptides, pharmaceutical agents and the 

16 like, capable of binding to a specific peptide sequence, in order to generate rationally designed 
antipeptide peptides, for example see Hurby et al., Application of Synthetic Peptides: Antisense 
Peptides," In Synthetic Peptides, A User's Guide, W.H. Freeman, NY (1992), pp. 289-307, and 
Kaspczak et al., Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like. 

20 In addition to the foregoing, one class of agents of the present invention, as broadly 

described, can be used to control gene expression through binding to one of the ORFs or EMFs 
of the present invention. As described above, such agents can be randomly screened or 
rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design 

24 sequence specific or element specific agents, modulating the expression of either a single ORF or 
multiple ORFs which rely on the same EMF for expression control. One class of DNA binding 
agents are agents which contain base residues which hybridize or form a triple helix formation 
by binding to DNA or RNA. Such agents can be based on the classic phosphodiester, 

28 ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric derivatives which have 
base attachment capacity. 

Agents suitable for use in these methods preferably contain 20 to 40 bases and are 
designed to be complementary to a region of the gene involved in transcription (triple helix - see 

32 Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et 
al., Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560 
(1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 
Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA transcription 
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from DNA, while antisense RN A hybridization blocks translation of an mRNA molecule into 
polypeptide. Both techniques have been demonstrated to be effective in model systems. 
Information contained in the sequences of the present invention is necessary for the design of an 

4 antisense or triple helix oligonucleotide and other DNA binding agents. 

Agents which bind to a protein encoded by one of the ORFs of the present invention can 
be used as a diagnostic agent Agents which bind to a protein encoded by one of the ORFs of the 
present invention can be formulated using known techniques to generate a pharmaceutical 

8 composition. 

4.19 USE OF NUCLEIC ACIDS AS PROBES 

Another aspect of the subject invention is to provide for polypeptide-specific nucleic acid 
12 hybridization probes capable of hybridizing with naturally occurring nucleotide sequences. The 
hybridization probes of the subject invention may be derived from any of the nucleotide 
sequences SEQ ID NOS: 1-13901 . Because the corresponding gene is only expressed in a 
limited number of tissues, a hybridization probe derived from of any of the nucleotide sequences 
16 SEQ ID NO: 1-13901 can be used as an indicator of the presence of RNA of cell type of such a 
tissue in a sample. 

Any suitable hybridization technique can be employed, such as, for example, in situ 
hybridization. PCR as described in US Patents Nos. 4,683,195 and 4,965,188 provides 

20 additional uses for oligonucleotides based upon the nucleotide sequences. Such probes used in 
PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both. The 
probe will comprise a discrete nucleotide sequence for the detection of identical sequences or a 
degenerate pool of possible sequences for identification of closely related genomic sequences. 

24 Other means for producing specific hybridization probes for nucleic acids include the 

cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such vectors 
are known in the art and are commercially available and may be used to synthesize RNA probes 
in vitro by means of the addition of the appropriate RNA polymerase as T7 or SP6 RNA 

28 polymerase and the appropriate radioactively labeled nucleotides. The nucleotide sequences may 
be used to construct hybridization probes for mapping their respective genomic sequences. The 
nucleotide sequence provided herein may be mapped to a chromosome or specific regions of a 
chromosome using well known genetic and/or chromosomal mapping techniques. These 

32 techniques include in situ hybridization, linkage analysis against known chromosomal markers, 
hybridization screening with libraries or flow-sorted chromosomal preparations specific to 
known chromosomes, and the like. The technique of fluorescent in situ hybridization of 
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chromosome spreads has been described, among other places, in Verma et al (1988) Human 
Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York NY. 

Fluorescent in situ hybridization of chromosomal preparations and other physical 

4 chromosome mapping techniques may be correlated with additional genetic map data. Examples 
of genetic map data can be found in the 1994 Genome Issue of Science (265:1981f). Correlation 
between the location of a nucleic acid on a physical chromosomal map and a specific disease (or 
predisposition to a specific disease) may help delimit the region of DNA associated with that 

8 genetic disease. The nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier or affected individuals. 

4.20 PREPARATION OF SUPPORT BOUND OUGONUCLEOTIDES 

Oligonucleotides,/.^, small nucleic acid segments, may be readily prepared by, for 
12 example, directly synthesizing the oligonucleotide by chemical means, as is commonly practiced 

using an automated oligonucleotide synthesizer. 

Support bound oligonucleotides may be prepared by any of the methods known to those of 

skill in the art using any suitable support such as glass, polystyrene or Teflon. One strategy is to 
16 precisely spot oligonucleotides synthesized by standard synthesizers. Immobilizationcanbe 

achieved using passive adsorption (Inouye & Hondo, (1 990) J. Clia Microbiol. 28(6) 1469-72); 

usingUVUght(Nagataefa/. f 1985; Dahlen etai, 1987; Morrissey& Collins, (1989) Mol. Cell 

Probes 3(2) 189-207) or by covalent binding of base modifiedDNA (Keller etal, 1988; 1989); all 
20 references being specifically incorporated herein. 

Another strategy that may be employed is the use of the strong biotin-streptavidin 

interactionas a linker. For example, Broude et al (1994) Proc. Natl. Acad. ScL USA 91(8) 3072-6, . 

describe the use of biotinylated probes, although these are duplex probes, that are immobilized on 
24 streptavidin-coated magnetic beads. Streptavidin-coated beads may be purchased from Dynal, 

Oslo. Of course, this same linking chemistry is applicable to coating any surface with streptavidia 

Biotinylated probes may be purchased from various sources, such as, e.g. , Operon Technologies 

(Alameda, C A). 

28 Nunc Laboratories (Naperville, IL) is also selling suitable material that could be used. Nunc 

Laboratories have developed a method by which DNA can be covalently bound to the microwell 
surface termed Covalink NH. CovaLink NH is a polystyrene surface grafted with secondary amino 
groups (>NH) that serve as bridge-heads for further covalent coupling. CovaLink Modules may be 

32 purchased from Nunc Laboratories. DNA molecules may be bound to CovaLink exclusively at the 
5'-end by a phosphoramidatebond, allowing immobilization of more than 1 pmol of DNA 
(Rasmussenef a/., (1 991) Anal. Biochem. 198(1) 138-42). 
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The use of CovaLinkNH strips for covalent binding of DNA molecules at the S'-end has 
been described (Rasmussen et al., (1991). In this technology, a phosphoramidate bond is employed 
(Chu et al., (1 983) Nucleic Acids Res. 1 1(8) 65 13-29). This is beneficial as immobilizationusing 

4 only a single covalent bond is preferred. The phosphoramidate bond joins the DNA to the 
CovaLink NH secondary amino groups that are positioned at the end of spacer arms covalently 
grafted onto the polystyrene surface through a 2 nm long spacer arm. To link an oligonucleotide to 
CovaLinkNH via an phosphoramidate bond, the oligonucleotide terminus must have a 5'-end 
8 phosphate group. It is, perhaps, even possible for biotin to be covalentiy bound to CovaLink and 
then streptavidinused to bind the probes. 

More specifically, the linkage method includes dissolving DNA in water (7.5 ng/ul) and 
denaturing for 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0. 1 M 1-methylimidazole, 

12 pH 7.0 (1-Melm 7 ), is then added to a final concentration of 10 mM 1-Melm 7 . A ss DNA solution is 
then dispensed into CovaLink NH strips (75 ul/well) standing on ice. 

Carbodiimide 0.2 M l-ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), dissolved in 
10 mM 1-Melm 7 , is made fresh and 25 ul added per well. The strips are incubated for 5 hours at 

1 6 50°C. After incubation the strips are washed using, e.g. , Nunc-Immuno Wash; first the wells are 
washed 3 times, then they are soaked with washing solution for 5 min., and finally they are washed 
3 times (where in the washing solution is 0.4 N NaOH, 0.25% SDS heated to 50°C). 

It is contemplated that a further suitable method for use with the present invention is that 

20 described in PCT Patent Application WO 90/03382 (Southern & Maskos), incoiporatedherein by 
reference. This method of preparing an oligonucleotide bound to a support involves attaching a 
nucleoside 3-reagent through the phosphate group by a covalent phosphodiester link to aliphatic 
hydroxyl groups carried by the support The oligonucleotide is then synthesized on the s\q>ported 

24 nucleoside and protecting groups removed from the synthetic oligonucleotide chain under standard 
conditions that do not cleave the oligonucleotide from the support Suitable reagents include 
nucleoside phosphoramidite and nucleoside hydrogen phosphorate. 

An on-chip strategy for the preparation of DNA probe for the preparation of DNA probe 

28 arrays may be employed. For example, addressable laser-activated photodeprotection may be 

employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described by 
Fodoret aJ. (1991) Science 25 1(4995) 767-73, incorporatedherein by reference. Probesmayalso 
be immobilized on nylon supports as described by Van Ness et al. (1 99 1) Nucleic Acids Res. 

32 19(12) 3345-50; or linked to Teflon using the method of Duncan & Cavalier (1988) Anal. Biochem. 
169(1) 104-8; all references being specifically incorporated herein 
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To link an oligonucleotide to a nylonsupport, as described by Van Ness et al (1991), 
requires activation of the nylon surface via alkylation and selective activation of the S'-amine of 
oligonucleotides with cyanuric chloride. 

4 One particular way to prepare support bound oligonucleotides is to utilize the 

light-generatedsynthesisdescribedby Pease etal t (1994) PNAS USA 91(11) 5022-6, incorporated 
herein by reference). These authors used current photolithographic techniques to generate arrays of 
immobilized oligonucleotide probes (DNA chips). These methods, in which light is used to direct 

8 the synthesis of oligonucleotide probes in high-density, miniaturized arrays, utilize photolabile 
5-protectediV^acyl-deoxynucleosidephosphoramidites, surface linker chemistry and versatile 
combinatorial synthesis strategies. A matrix of 256 spatially defined oligonucleotide probes may be 
generated in this manner. 

12 421 PREPARATION OF NUCLEIC ACID FRAGMENTS 

The nucleic acids may be obtained from any appropriate source, such as cDNAs, genomic 
DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC inserts, and RNA, 
including mRNA without any amplification steps. For example, Sambrook et al (1 989) describes 
1 6 three protocols for the isolation of high molecular weight DNA from mammalian cells (p. 
9.14-923). 

DNA fragments may be prepared as clones inM13,plasmid or lambda vectors and/or 
prepared directly from genomic DNA or cDNA by PCR or other amplificationmethods. Samples 
20 may be prepared or dispensed in multiwell plates. About 1 00-1 000 ng of DNA samples may be 
prepared in 2-500 ml of final volume. 

The nucleic acids would then be fragmented by any of the methods known to those of skill 
in the art including, for example, using restriction enzymes as described at 9.24-9.28 of Sambrook et 
24 al (1 989), shearing by ultrasound and NaOH treatment. 

Low pressure shearing is also appropriate, as described by Schriefer et al (1 990) Nucleic 
Acids Res. 1 8(24) 7455-6, incorporated herein by reference). In this method, DNA samples are 
passed through a small French pressure cell at a variety of low to intermediate pressures. A lever 
28 device allows controlled application of low to intermediate pressures to the cell. Theresultsof 

these studies indicate that low-pressure shearing is a useful alternative to sonic and enzymatic DNA 
fragmentationmethods. 

One particularly suitable way for fragmenting DNA is contemplated to be that using the two 
32 base recognition endonuclease, Cv/JI, described by Fitzgerald et al (1 992) Nucleic Acids Res. 

20(1 4) 3753-62. These authors described an approach for the rapid fragmentation and fractionation 
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of DN A into particular sizes that they contemplated to be suitable for shotgun cloning and 
sequencing. 

The restriction endonuclease Cv/JI normally cleaves the recognition sequence PuGCPy 
4 between the G and C to leave blunt ends. Atypical reaction conditions, which alter the specificity of 
this en2yme (CviJI**), yield a quasi-random distribution of DNA fragments form the small 
molecule pUC19 (2688 base pairs). Fitzgerald et al (1992) quantitatively evaluated the 
randomness of this fragmentation strategy, using a CvzJI** digest of pUC19 that was size 
8 fractionatedby a rapid gel filtration method and directly ligated, without end repair, to a lac Z minus 
M13 cloning vector. Sequence analysis of 76 clones showed that CWJI** restricts pyGCPy and 
PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated at a rate 
consistent with random fragmentatioa 

12 As reported in the literature, advantages of this approach compared to sonication and 

agarose gel fractionationinclude: smaller amounts of DNA are required (0.2-0.5 ug instead of 2-5 
ug); and fewer steps are involved (no preligation, end repair, chemical extraction, or agarose gel 
electrophoresis and elution are needed 

16 Irrespectiveof the manner in which the nucleic acid fragments are obtained or prepared, it is 

important to denature the DNA to give single stranded pieces available for hybridization. This is 
achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. The solution is then cooled 
quickly to 2°C to prevent renaturation of the DNA fragments before they are contacted with the 

20 chip. Phosphate groups must also be removed from genomic DNA by methods known in the art 

4 22 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon membrane. 
Spotting may be performed by using arrays of metal pins (the positions of which correspond to an 

24 array of weUsm a imcrotiter plate) to repeated a 
nylon membrane. By offset printing, a density of dots higher than the density of the wells is 
achieved. One to 25 dots may be accommodated in 1 mm 2 , depending on the type of label used. By 
avoiding spotting in some preselected number of rows and columns, separate subsets (subarrays) 

28 may be formed. Samples in one subarray may be the same genomic segment of DNA (or the same 
gene) from different individuals, or may be different, overlapped genomic clones. Each of the 
subarrays may represent replica spotting of the same samples. In one example, a selected gene 
segment may be amplified from 64 patients. For each patient, the amplified gene segment may be in 

32 one 96-welI plate (all 96 wells containing the same sample). A plate for each of the 64 patients is 
prepared. By using a 96-pin device, all samples may be spotted on one 8 x 12 cm membrane. 
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Subarrays may contain 64 samples, one from each patient. Where the 96 subarrays are identical, the 
dot span may be 1 mm 2 and there may be a 1 mm space between subarrays. 

Another approach is to use membranes or plates (available from NUNC, Naperville, Illinois) 
which may be partitioned by physical spacers e.g. a plastic grid molded over the membrane, the grid 
being similar to the sort of membrane applied to the bottom of multiwell plates, or hydrophobic 
strips. A fixed physical spacer is not preferred for imaging by exposure to flat phosphor-storage 
screens or x-ray films. 

The present invention is illustrated in the following examples. Upon consideration of the 
present disclosure, one of skill in the art will appreciate that many other embodiments and variations 
may be made in the scope of the present invention. Accordingly, it is intended that the broader 
aspects of the present invention not be limited to the disclosure of the following examples. The 
present invention is not to be limited in scope by the exemplified embodiments which are intended 
as illustrations of single aspects of the invention, and compositions and methods which are 
functionally equivalent are within the scope of the invention. Indeed, numerous modifications and 
variations in the practice of the invention are expected to occur to those skilled in the art upon 
consideration of the present preferred embodiments. Consequently, the only limitations which 
should be placed upon the scope of the invention are those which appear in the appended claims. 

All references cited within the body of the instant specification are hereby incorporated by 
reference in their entirety. 

5.0 EXAMPLES 

5,1 EXAMPLE 1 

Novel Nucleic Acid Sequences Obtained From Various Libraries 
A plurality of novel nucleic acids were obtained from cDNA libraries prepared from various 
human tissues and in some cases isolated from a genomic library derived from human chromosome 
using standard PCR, SBH sequence signature analysis and Sanger sequencing techniques. The 
inserts of the library were amplified with PCR using primers specific for the vector sequences 
which flank the inserts. Clones from cDNA libraries were spotted on nylon membrane filters and 
screened with oligonucleotide probes (e.g. , 7-mers) to obtain signature sequences. The clones were 
clustered into groups of similar or identical sequences. Representative clones were selected for 
sequencing. 

In some cases, the 5 f sequence of the amplified inserts was then deduced using a typical 
Sanger sequencing protocol. PCR products were purified and subjected to fluorescent dye 
terminator cycle sequencing. Single pass gel sequencing was done using a 377 Applied Biosystems 
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(ABI) sequencer to obtain the novel nucleic acid sequences. In some cases RACE (Random 
Amplification of cDNAEnds) was performed to further extend the sequence in the 5' direction. 

4 53 EXAMPLE 2 

Novel Conrigs 

The novel contigs of the invention were assembled from sequences that were obtained from 
a cDNA library by methods described in Example 1 above, and in some cases sequences obtained 
8 from one or more public databases. The sequences for the resulting nucleic acid contigs are 

designated as SEQ ID NO: 1-13901 and are provided in the attached Sequence Listing. The contigs 
were assembled using an EST sequence as a seed. Then a recursive algorithm was used to extend 
the seed EST into an extended assemblage, by pulling additional sequences from different databases 

12 (/.e., Hyseq's database containing EST sequences, dbEST version 1 14, gb pri 1 1 4, and UniGene 
version 101) that belong to this assemblage. The algorithm terminated when there was no 
additional sequences from the above databases that would extend the assemblage. Inclusion of 
component sequences into the assemblage was based on a BLASTNhit to the extending 

16 assemblage with BLAST score greater than 300 and percent identity greater than 95%. 

Table 3 sets forth the novel predictedpolypeptides (including proteins) encoded by the 
novel polynucleotides (SEQ ID NO: 1-13901) of the present invention, and their corresponding 
nucleotide locations to each of SEQ ID NO: : 1-13901. Tabie3 also indicates the method by which 

20 the polypeptide was predicted Method A refers to a polypeptide obtained by using a software 

program called FASTY (available from http://fasta.bioch. virginia.edu) which selects a polypeptide 
based on a comparison of the translated novel polynucleotide to known polynucleotides (W.R. 
Pearson, Methods in Enzymology, 1 83 :63-98 (1 990), herein incorporatedby reference). Method B 

24 refers to a polypeptide obtained by using a software program called GenScan for human/vertebrate 
sequences (available from Stanford University, Office of Technology Licensing) that predicts the 
polypeptide based on a probabilistic model of gene structure/compositionalproperties (C. Burge 
and S. Karlin, J. Mol Biol, 268:78-94 (1997), incorporated herein by reference). MethodC refers 

28 to a polypeptide obtained by using a Hyseq proprietary software program that translates the novel 
polynucleotide and its complementary strand into six possible amino acid sequences (forward and 
reverse frames) and chooses the polypeptide with the longest open reading frame. 

The nearest neighbor results for SEQ ID NO: 1-13901 were obtained by a BLASTP 

32 version 2.0al 19MP-WashU search against Genpept release 120 and Geneseq database release 
200101 (Derwent), using BLAST algorithm. The nearest neighbor result showed the closest 
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homologueforSEQIDNOS:l-1390L The nearest neighbor results for SEQ ID NO: 1-13901 
are shown in Table 2 below. 

Tables 1,2 and 3 follow. Table 1 showsthe various tissue sources of SEQ ID NOS: 1- 

4 1 390 1 . Table 2 shows the nearest neighbor result for the assembled contig. The nearest neighbor 
result shows the closest homolog with an identifiable function for each assemblage. Table 3 
contains the start and stop nucleotides for the translated amino acid sequence for which each 
assemblage encodes. Table 3 also provides a correlation between the amino acid sequences set forth 

8 in the Sequence Listing, the nucleotide sequences set forth in the Sequence Listing and the SEQ ID 
NO. in USSN 09/515,126 
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TABLE 1 



Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQWNOS: 


adult brain 


GIBCO 


AB3001 


83 544 597-598600-607 616 841 1004 1148 1346 1493 1974 
21382141 2143 2161 22662345 2363 2511 256928762880 
300 1 3099-3 101 3 105-3 106 3 1 10-3 1 1 1 3 1 1 5-3 1 1 7 3 199 3272 
3282 3284 3356 3425 3537 3634 3689 3709 3797 3810 3839 
3899 4006 4021-4022 4025 4043 4194 4201 4253 4277 4297 
4388 4399 4410 4667 4671 4722 4747-4748 4750 4755 4767 
4845 4865 4940 5037 5075 5093 5118 5163 5171-5172 5268 
5481 5523 5553 5656 5724 5894 5902 5938 6052 6170-6173 
6176 6214 6307 6336 6369 6374 6793 6894-6897 6979 7058 
71 69 7455 7492-7493 7495-7499 7501 7504 7577 7586 7761 
7792 7864.7870 8035 8065 8085 81 10 8120 8140 8224 8226 
8298 8372 8427 8452 8456 8535 8648 8672 8674-8679 8681- 
8684 8816 8838-8839 8870 8898 9012 9041 9079 9128 9257 
9264 9304 93 17 9460 9503 95 17 9567 9623 9734 978 1 9792- 
9798 9929 9964 9999 10296 10330 10469-10470 10578 10679 
10778 10786 10895 10984-10986 11032 11052 11069 11130 
11145 11239 11289 11402 11818 11862 11870-11876 11878- 
11881 12017 12037 12127 12160 12294 12363 12375 12405 
12424 12438 12467 12539 12570 12590 12615-12616 12618 
12685 12688 12712 12739 12748 12830 12913 12916 12948- 
12950 13002 13064 13073 13083 13141 13150 13153 13164- 
13166 13257 13391 13456 13479 13489 13492 13494 13499 
' 13501 13503 13560 13595-13596 13627 13645 13679 13782 
13795 13861 13866 13869 13882 


adult brain 


GIBCO 


ABD003 


67 83 142 443 587 598 608-609 61 1 613-624633 731 734 737- 
742 760 799-800 809 1148 1152 1167-1184 1193 1346 1433 
1516 1552 1575 1671 1756 1774 1833 1974 21382145 2176- 
2178 2237 2266 2299-2301 2303-2306 2343 2363 2412 2444 
2449 251 1 2516 2555 2569 2576 2614 2716 2809 2876 291 1 
2926 3001 3093 3114 31 19 3121-3124 3126 3128-3130 3234 
3254-3256 3258-3263 3265-3267 3270-3274 3276-3277 3280- 
3281 3284 3286 3348 3356 3378 3435 3459 3484 3537 3548 
3595 3605 3625 3627 3634 36S6-3697 3700 3702 3709 371 1 
3720 3722 3737 3757 3797 3804 3810 3839 3856 4006 4019 
4025 4040 4055 4057-4058 4060 4078 4194 4201 4246 4253 
4277 4282 4390 4405 4412 443 1 4620 4622 4641 4689 4751- 
4764 4791 4808 4837 4845 4847-4849 4852-4858 4860-4862 
4864-4869 4940 4957 4962 4972 4998 5021 5031 5037-5038 
5040 5076 5093 5108 5118 5167 5169 5171-5172 5251-5261 
5263-5265 5270 5364 5401 5481 5492 5521 5523 5535 5656 
5674 5693 5766 5788 58 17 5906-5909 5938 6005 6027 6057 
6064 6147 6178 6180-6182 6189 6214 6229-6233 6254 6272 
6369 6371 6421-6426 6555 6595 6598 6601 6799 6803 6825 
6836 6886 6894 6913 6972 6995 7058 7104 7130 7133 7148 
7164 7169 7339 7347 7386 7426 7455 7494 7502 7507 7509 
751 1-7512 7516 7520 7584-7587 7590-7596 7598-7601 7603- 
7604 7608 7632 7677 7743 7748 7761 7768 7792 7797 7807 
7815 7839 7849-7861 7864 7870 7930 7937 8035 8065 8067 
8080 8087 8095 81 10 8120 8139-8140 8209 8224 8226 8235 
8246 8262 8285 8298 8320 8323 8336 8354 8361 8365 8370 
8375 8387 8452 8456 8535 8556 8576-8577 8603 8630 8648 
8674 8685-8686 8688-8690 8693 8695 8702 8712 8742 8760- 
8761 8763-8764 8766-8769 8813 8815-8816 8830 8834 8838- 
8839 8848 8863 8870 8898 8921 8943-8944 8951 8989 9010 
9041 9050-9056 9058-9064 9076 9079 9092 9097 9128 9144- 
9145 9257 9264 9271 9278-9279 9304 9315 9317 9455 9466 
9472 94759480 9503 951 1 9517 9525 9539 9689 9734 9773 
9781 9791 9799-9802 9847 9852 9873 9928-9929 9964 9999- 
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10001 10003-10004 10053 10175 10267 10276 10330 10349 
10434 10449 10466 10471-10474 10492 10496 10509 10566 
10578 10597-10599 10645 10679 10788 10891 10948 10988- 
10990 11032 11039 11041-11043 11052 11065 11069 11105 
11108 111301114511167 11196 11203-1120711209 11239 
11399 11401-11402 11406 11459 11470 11604 11606 11642 
11761 1181811862 11877 11882-11884 1188611889-11893 
11944-11946 11981 11988 12016 12019 12022 12037 12083 
12127 12143 12164-12165 12168-12171 12178 12195 12236 
12265 12305 12327 12363 12375 12405 12423-12424 12430 
12438 12546 12570 12590 12594 12612 12615-12618 12630 
12670 12674 12685-12688 12693 12704 12706-12707 12748 
12772 12830 12885 12904 12913 12916 12923 12933 12951 
12956 12993 13001 13020-13021 13038 13047 13064 13072- 
13073 13084-13085 13092 13117 13142 13167-13171 13191 

I^OC/I lOICT IQOiCA nine m r\r\ iT>m fi^riA lojr/* i o A>~ir\ 

UZ34 \5Zol 1j260 13295 13390-13391 13394 13456 13479 
13483 13489 13497 13501 13503 13505-13507 13512 13516 
13546 13551 13555 13575 13590 13592 13597 13613-13614 
13645 13649 13659 13711 13782 13795 13838 13861 13869 
13875 13882 13884-13885 13888 13892 13896 


adult brain 


Ciontech 


ABR001 


142 858 1542 2174 2407 2483 2652 3272 3287 3460 3492 3535 
3595 3737 3839 4005 4060 4282 4434 4791 4972 5040 5293 
5523 5530 5535 5788 5906 6082 6601 6799 6980 7373 7577 
7587 7759 7788 7851 8081-8082 8110 9167 9455 9466 9781 
yyZ& IUhZZ iU//4 lU/yi IWoy 11401 11406 11459 11604 
11607 11791 1181811865 11961 1197912022 12122 12160 
12327 12442 12594 12615 12640 12670 12705 12935 12957 
12985 13047 13197 13257 13456 13511-13512 13546 13554 
13646 13793 13885 13889 13893 


aril lit Krfltn 
auuu Oram 


^loniecn 


/VDKUuO 


00/ 1UU4 \y\Jo 51 12. 5 zoo 3548 401 1 4282 4998 5923 5928 
6374 6730 6815 6867 6890 7067 8365 9264 9729 9780 10776 
11587 11618 12596 12601 12605 12704 12749 12754 12951 
13047 13051 13090 13479 13488 13498-13499 13503 13512 
13575 mS2 


adult brain 


Ciontech 


ABR008 


6 11 21 41 51 88 142 364 376 579 598 651 736 800 1050 1 148 
1184 1251-1265 1291 1346 1404 1479 1529 1543 1671-1674 
1697 1699-1710 1820 1830 1832-1838 1840 1848-1849 1908 
1914 1919 1927 1957 1964 1974 1976 1978-19792005-2006 
2050 2081 2090 2110-2111 2129 2150 2174 22002310 2327 
2342 2408-24 10 2420 2444 2449 2461-2467 2484 2490 2499- 
2506 251 1 2553 2574 2576 261 1 2652 2809 2827 2866 2894 
3032 3207 3535 3591 3610 3634 3715 3722 3737 3766-3770 
3819 4006-4007 401 1 4025 4032 4060 4078 4095 4109 4128 
4343-43 55 4182 4194 4247-4257 4277 4282 4294 4296 431 0 
4330 4348 4355 4360 4381 4395 4399 4411 4431 4543 4641 
4662 4694 4698 4767 4781 4791 4808 4833 4837 4985 5001 
5022 5040 5075 5094 5108 5163 5303-5306 5308-5314 5320 
5380 5523 5553 5615 5625-5626 5634 5638-5644 5701 5706 
5711 5727-5742 5766 5772 5775 5783 5801 5814 5817 5820- 
5821 5829 5837 5851 5855 5858 5864 5867 5874 5885 5890 
5897 5901 5906 5923 6057 6125 6214 6223 6288 6302 6456- 
6464 6545 6598 6601-6602 6624 6676-6685 6699 6726 6728 
6746-6749 6765 6799 6805 6854 6860 6893-6894 7004-7007 
7049 7076 7078 7081-7083 7105 71 17 71 19 7133 7153 7166 
7431 7579 7708 7768 7849 7900-7905 8018 8083-8084 8095 
8110 8196-8208 8262 8288 8312 8320 8331 8336 8356 8375 
8452 R487 JMrtt K6R1 871 ft 87^0 R777 R«1 <; «R17 8830 R83Q 

8963 8965 8983 9010 9097 9100 9102-9108 91 11 9128 9142 
9257 9264 9313 9364 9378-9384 9401 9454-9455 9458 9460- 
9464 9503 9509 951 1 9515-9516 9522 9528-9529 9533 9539 
9542 9544 9573 9577 9646 9773 9780 9924 10000 10025- 
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Origin 


RNA 
Source 


Library 
Name 


SEQ IDNOS: 








10029 10097 10148 10209 10218-10223 10225-10229 10267 
10275-10276 10284 10292 10296 10303 10313 10326 10329- 
10331 10333-10334 10343 10346 10393 10430 10494 10496 
10513 10542 10557 10613-10616 10679 10688 10691 10719- 
10721 10729 10743 10758 10760 10776 10782 10786 10795 
10895 1 1 100 1 1 1 14 1 1 132 1 1 195 1 1240 1 1242-1 1254 1 1401 
1 1406 1 1437 1 1454 1 1459 1 1462 ! 1466-1 1473 1 1475-1 1476 
11519 11532-11535 11550 11553 11555-11556 11559 11588- 
1158911606116151161811621 11627-1162811633 11761 
11791 11807 11818 11932 11955 12006 12039 12041 12092 
12212-12224 12231 12236 12305 12363 12368 12396-12399 
12405 12424 12439 12442 12465-12466 12468 12477 12479- 
12482 12490 12521 12546 12552-12553 12576-12577 12579 
12582 12585 12590 12601 12609 12617-12618 12636 12658 
12707 12725 12735-12736 12749 12754 12776-12779 12859 
12868 12894 12905 12909 12940 12955 12959 12977 12980 
12990 13002 13004-13005 13020 13035 13038 13042-13044 
13047 13051-13052 13056 13062 13073 13082-13083 13196 
13249 13280 13311-13312 13336 13387 13417 13421-13426 
13436 13445 13456 13458 13479-13482 13488 13490 13494- 
13495 13497-13500 13503 13507 13512 13516 13533 13546 
13554-13555 13590 13613 13630 13649 13659 13670 13678 
13713 1j724 13769 13793-13794 13808 13827-13828 13838 
13861 13867-13868 13875 13882 13884-13885 13888-13889 
13893 13896 13898 


aauit orain 


CSontech 


A T»T) A 1 1 


1006 1257 3797 4006 4025 5535 6057 7169 7870 8262 8937 
8966 9257 10778 12736 13394 13679 13793 13861 


adult brain 


DiotJnain 


A DDA1 O 

ABR012 


88 598 1007 1 134 2597 3557 3590 3627 3797 4006 4192 4246 
4282 4391 4940 5523 5535 6288 6338 7138 8110 8898 9076 
9401 9455 9476 1077211061 11114 12989 13394 13511 13S66 


adult brain 


Invhrogen 


ABR013 


5982614 3191 4355 4391 5523 5788 8085 8486 11513 12521 
12989 13861 


adult brain 


Invhrogen 


ABT004 


40 51 598 1050-1057 1148 1777-1778 1947 19762270-2272 
2327 2490 2617 3050 3600-3602 3722 3987 4390-4391 4434 
4543 4689 5031 5157-5159 5167 5169 5466 5505 5682-5683 
5701 5766 5778 5794 5902 6147 6367-6371 6459 6545 6709 
6728 6783 6801 6971 7104 7175 7815 7839 7864 8139 8342 
8345 8355 8363 8372 8452 8633 8963 8975-89769012 9133 
9423-9424 9511 9515 9517 9528 95569827 9949 10260 10267 
10275 10570-10571 10733 10767 11132 11159 11406 11459 
11932 12009 12092 12109-12111 12127 12283 12428 12511 
12579 12605 12725 12747 12830 12885-12886 12910 12913 
12954 12987-12989 13051 13054 13062 13073 13090 13249- 
13253 13438 13445 13456 13489 13500 13512 13516 13533 
13546 13590 13622 13649 13683-13684 13713 13803 13838 
13861 13866 13896 


cultured 
preadipocytes 


Strategene 


ADP001 


1 134 1346 2343 2614 3272 3426 3610 3720 3839 3885 401 1 
4277 4282 4297 4346 4388 4391 4405 4434 4641 4833 4940 
4985 5018 5030 5040 5163 5167 5523 5581 5778 5788 5794 
5895 5951 6082 6147 6272 6607 7067 7141 8093 8235 8285 
8312 8363 8629 8648 8830 8839 9290 9401 9466 9503 9781 
10346 10470 10776 10795 1097I 1 1 108 ll 170 1 1513 1 1818 
12034 12037 12046 I2093 12375 12387 12405 12424 12570 
12636 12670 12674 12688 12735 12749 12913 12940 13126 
13163 13295 13489 13494 13497 13499 13511 13516 13575 

IJUJa UUJj 1JOUU I jOOO-J jOOy 


adrenal gland 


Clontech 


ADR002 


8 83 142225 351 443 551 569 731 864 1 134 1266-1271 1273- 
1274 1276-1292 1294-1295 1381 1391 1544-1545 1658 1671 
1908 1959 1983 2010 2023 2145 21752283 23102328-2334 
2343 2444 2449 2510 2522 2576 3032 3069 3153 3166 3272 
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3378 3416 3548 3625 3709 371 1 3771-3788 3790-3791 3797 
3839 3870 3899 3985 4019 4054 4060 4109 4173 4192 4194 
4201 4253 42774282 43894391 4395-4396 4431 4667 4687 
4694 4783 4957 4966 4985 4998 5037 5108 5293 5316-5323 
5325-5336 5481 5505 5527 5531-5533 5560 5628 5656 5701 
5766 5865 5874 5902 5939 5979 6020 6052 6057 6227 6288 
6354 6405 6449 6465-6482 6521 6603 6726 6894 6979 7008- 
701 1 7155-7156 7204 7604 7634 7845 7871 7906-7912 7915- 
7918 7920-7930 8022 8067 8085-8086 8095 81 10 81 16 8224 
8262 8363 8365 8412 8520 8535 8554 8699 8742 8831 8870 
8950-8951 9002 9010 9012 9097 9109-91 19 9121-9127 9190 
9264 9280 9304 9317 9455 9457 9495 9503 9553 9556 9586 
97099780-9781 9895 9927 10007 10030-10032 10034-10035 
10037-10045 10119 10277 10284 10346 10595 10617-10620 
10623-10627 10645 10675 10679 10760-10761 10766 10774 
10782 11108 11194 11255 11258-11270 1 1289 11400 11406 
11457 11519 11588 11600 11621 11626 1 1818 11952 12080 
12159 12225 12227-12230 12232-12237 12239-12240 12242- 
12249 12251 12305 12363 12375 12404-12405 12424 12439 
12577 12599 12601 12630 12636 12657-12658 12663 12688 
12693 12706 12713 12715 12735 12749 12754 12780-12787 
12848 12863 12909 12913 12923 12957 12990 13006-13007 
13020-13021 13064 13095 13104 13295 13313 13315 13388- 
13389 13391 13456 13473 13494 13507 13515 13546 13613 
13631 13679 13725-13733 13858 13866 13872 13883-13884 
13888-13889 


adult heart 


GIBCO 


AHR001 


51 83 88 94 221239360 366-367404 410-411413 415 458- 
459 461 465-468 471 473-478 486 545-546 559 567 616 625- 
630 743-744 799 802-806 808 810-835 837-842 959 1004 1066- 
1085 1134 1178 1184-1193 1346 1512 1516 1546-1547 1556 
1575 1671 1727 1774 1829 1959 1976-1977 2090 2096 2108 
21 10 2128 2138 2 145-2147 2161 2179 2195-2198 2257 2276 
2278-2281 2302 2307-2309 2363 2398-2399 2409 241 1-2412 
2444 2449 2497 25 1 6 2529 2563 2569 2575-2576 2597 2605 
2614 2617 2762 2809 2816 2879-2880 291 1-2924 2926 2961 
2978-2980 2985-29862993 2995-3002 3032 3042 305 1 3058 
3069 3081 3091-3094 3109 3114 3132-3133 3135 3137-3138 
3141 3191 31963199 3215 3263 32723282-32863317 3349- 
3350 3353-3359 3361-3362 3364-3367 3370-3393 3396-3399 
3403 3406 3425 3451 3465-3466 3479 3500 3503 3537-3538 
3544 3548 3550 3555 3557 3590-3591 3595 3604 3606-3612 
3614-3621 3623-3627 3634 3689 3697-3698 3701-3709 371 1- 
3713 3720 3722 3737 3757 3797 3839 3885 3898 3988-3989 
3996 4005-4006 4008 4010-40 1 1 4019 4021-4022 4025 4040 
4043 4054-4055 4058-4060 4078 4183 4192 4194 4201 4246 
4253 4269 4277 4282 4341 4351 4391 4403 4405 4434 4517 
4543 4553 45904615 4622 4631 4633-46344641 4654 4664 
4666-4667 4672-4675 4688-4689 4693-4694 4738 4740 4755 
4783 4828 4870-4871 4905-4906 4909-4915 4917-49194921- 
4924 4926-4927 4940 4957 4972 4985 4991 4998 5001 5030 
5037-5038 5040 5076 5118 5163 5171-5175 5177-5178 5180- 
5182 5266-5268 5380 5422 5481 5490 5492 5506-5507 5523 
5534-5535 5581 5656 5682 571 1 5740 5766 5788 5895 5908 
5923-5924 5938 6047 6057 6083 6085 6104 61 10-61 1 1 61 17 
6147 6184-6185 6189 6195 6215 6255-6257 6259-6266 6268 
6272 6288 6307 6336 6375-6379 6382 6427-6429 6458 6555 
u j°° «J7j vkw / oozy odd j oooy o/oj o / o / o /yy oiSl j o<> / I- 
6873 6879 6881-6882 6898 6926-6928 6975-6978 6987 7058 
7090 7148 73 13-73 14 7341 7343-7344 7346-7347 7350 7354 
7359 7362 7373 7380 7394 7402 7407-7408 7410 7413 7415- 
7416 7418-74197426 7431 7468 7473 7480 7494 7505 7513- 
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7516 7520 7544 7561 7584 7587 7599 7601-7604 7635 7638- 
7643 7645-7649 7651-7655 7657 1659-1662 7733 7735 7743 
7748 7783-7796 7815 7852 7857 7862-7863 7865-7870 7930 
7933 7983 8062-8065 8067 8087-8088 8093 8095 81 10 81 16 
8120 8139-8140 8224 8226 8235 8262 8298 8336 8344-8345 
8354 8356 8363 8368 8372 8378 8387 8410 8427 8452 8456 
8531-8532 8534-8535 8563-8569 8572 8576-85778592-8593 
8597 8603 8606 8610 8613-86148616-8617 8646 8648 8670 
8681 8691 8698-8699 8702 8712 8742 8756 8760 8763 8795 
8807-8809 881 1-8814 8816-8819 8821-8833 8835-8839 8858 
8863 8870 8898 8921 8927 8936 8939 8943 8946 8950-8951 
8966 8988-8992 8994-8995 9010 9049 9059 9065-9067 9070- 
9072 9076 9097-9098 9167 9190 9257-9260 9262 9269 9281 
9287 9301 9304 9315 9317 9401 9454-9455 9466 9476 9480 
9484 9556 9577 9612 9689 9698 9720-9721 9734 9741-9743 
9747-9750 9758 9781 9791 9804 9871-9882 9884-9885 9928 
9939 9942 9954-9960 9999-10000 10005 10175 10179 10275 
10284 10292 10296 10329-10331 10346 10400 10422 10430- 
10431 10437 10442 10444-10447 10475 10511-10512 10514- 
10521 10557 10576-10577 10616 10645 10679 10691 10729 
10742 10744 10772 10774 10777-10778 10782 10788 10839 
10891 10894-10895 10902 10917-10920 10937 10942 10946 
10948 10969-1097010992 11032 11044-1104511061 11066- 
11074 11108 11114 11132 11145 11153 11 165-11170 11173 
11205 1120811210-11215 11283 11289 11386 11388 11401- 
11402 11406 11462 11559 11565 11576 11596 11606 11615 
116201174411788-1178911793 1181811823-11825 11831- 
US32 11869 11894 11947 11961 11982-1198912000 12006 
12009 12011 12019-12020 12028 12037 12044 12078 12081 
12093 12119-12122 12143 12160 12166 12172-12175 12177- 
12179 12197 12305 12335 12363 12375-1237612383 12387 
12400 12402 12405 12424 12428 12438 12479 12521 12523 
12546 12560 12564 12570 12590 12599 12601 12605 12609 
12611 12616-12618 12653 12662-12663 12670 12674 12688 
12717-12720 12734 12752 12754 12772 12905 12907 12914 
12916-12917 12923 12925 12940 12961 12963-12965 12989 
12991 12993 13020-13021 13033 13035 13072-13073 13082- 
13083 13104 13117 13126 13132 13136 13141-13142 13148 
13169 13203-13210 13212 13260-13261 13293 13295 13326 
13377 13394 13413 13442 13456 13477 13480 13488 13490 
13494-13496 13501-13503 13506-13507 13511 13516 13533 
13568 13570-13571 13575 13582 13592 13613 13624-13630 
13632 13644 13646 13659-13660 13678-13679 13689 13701 
13711-13713 13775 13782 13795 13797 13866-13869 13872 
13882 13884-13885 13893 


adult kidney 


GIBCO 


AKD001 


49 67 83-84 142 354405-407415-429431-432445460462 
479-484 486 488 492-493 524 548-549 598 616 63 1-638 744 
787 809 841 1004 1068 1086 1 160 1 163-1 166 1 171 1184 1 193 
1346 1359 1449 1479 1516 1552 1556 1671 1724 1727 1774 
1826 1858 1914 1974 1976 1978-19792081 2097-20992111- 
2113 2117-21182129-21302138 2145 2148-21502161 2186 
22 18 2266 2268 2302 23 10 2327 2343 2363 2409 2412 2444 
2449 2468 2483 2523 2569 2576 2614 2617 2827 2845 2876 
2910 2915 2926-2931 2933-29342938-2943 2945-2947 2955- 
2956 2976 2981 2983-2984 3001-3016 3018 3053 3109 31 14 
3140 3142-3146 3149-3150 3199 3254 3265 3272 3283-3284 * 

1286 HSO "Wsfx X\l% ~KAS\f\ *XA.1* *XAA*\ 1A&(\ o/ioo ocn-» 

J*>OU JJJJ JJ^U JJJU JJ / O JHUU j4jj O'tHj 3*rOU 3^"^ 

3535 3537 3544 3548 3590-3591 3598 3625 3627 3634 3676- 
3677 3679-3685 3689 3697 3709 3711 3720 3722 3737 3757 
3797 3808 3810 3839 3885 3989 4005-4006401 1 4019 4022 
4025 4040 4043 4054-4055 4060 4078 4109 4192 4194 4201 
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4246 4253 4269 4277 4341 4354 4387-4388 4390 4398 4402 
4405 44104434 4473 45404543 4583 4591-4592 4607 4637 
4640-4641 4649 4654 4665 4668 4676-4677 4680-4685 4688- 
4689 471 1-4712 4751 4758 4766-4768 4783 4808 4833 4836- 
4837 4845 4874 4904 4940 4957 4962-4963 4972 4983 4991 
4998 5022 5037 5040 5076 5093 51 18 5143 5163 5171-5172 
5246-5249 5281 5362 5364 5380 5422 5481 5521 5523 5526- 
5527 5535 5656 5693 5726 5740 5766 5769 5778 5788 5794 
5894 5902 5908 591 1 5916-5917 5923 5928 5938 5990 6005 
6049-6051 6057 6078 6082 6086 6088 6105-6107 61 12-6117 
6120-6122 6147 6159 618^-6189 6215 6257 6272 6288 6369 
6418 6423 6430 6555 6595 6598 6738 6743 6765 6767 6793 
6799 6805 6815 6836 6857 6867-6870 6872-6873 6878 6884 
6m 6893 6899-6900 6931 6947 69766979 6988 7024 7045 
7058 7104 71 18 7155 7275 7281 7306 7312 7338 7347-7348 
7351-7352 7355 7357 7359-7363 7366 7369 7371-7373 7381 
7383 7386 7400 7402 7404-7406 7408 7420 7422 7424 7426- 
7428 7430-7431 7433-7435 7439 7455 7465-7468 7494 7502 
7506 7517-7521 7579 7587 7604 76347639 7642 7733 7735 
7743 7748 7792 7797 7839 7842-7846 7848 7852 7857 7862 
7864 7870 7930 7933 8065 8077 8084 8087 8093 8095 8105 
8110 8116 8120 8139-8140 8156 8224 8226 8235 8262 8320 
8336 8345 8351 8354 8359 8364 8368 8370 8372 8375 8377 
8387 8427 8452 8456 8461 8486 8518 8520 8527 8535 8563- 
8564 8566 8572-8573 8575-8577 8580-8583 8585 8588-8589 
8597 8603 8618-8621 8623 8625-8626 8628 8630 8634 8647- 
8650 8664 8674 8677-8678 8691 8694 8701-8704 8706 871 1 
8722 8740 8742 8763 8792 8798 8806 8812-8813 8816 8830 
8838-8839 8848 8853 8863 8876 8898 8921 8935 8943-8944 
8948 8951 8966 8989 9012 9041 9047-9049 9076 9092 9128 
9 1 37 9 1 67 92 1 8 9244 9249 9257 9264 9287 9304 93 1 5 93 17 
9424 9455-9457 9466 9472 9475 9484 9503 95 1 1 95 1 7 9529 
9536 9542 9544 9558 9612 9683 9699 9703 9722-9723 9725 
9734 9744 9752-9755 9758 9772-9773 9780-9781 9805-9807 
984t 9843 9927-9929 9939 9942 9969 9998-10000 10007 
10175 10275 10284 10287 10292 10319 10346 10376 10423- 
10424 10431 10442 10446 10448 10450 10475 10480 10496 
10542 10557 10645 10679 10774 10778 10782 10788 10895 
10921-10928 10947 10949-10953 10986 10994-10996 11032 
11052 11061 11069 11114 11130 11132 11143 11145 11197- 
11202 11205 11208 11212 11239 11289 11401-11402 11406 
11459 11513 11586 11596 11604 11607 11618 11620 11695 
11711 1175911761 11790-11791 11793-1179411818 11820- 
11822 11834-11836 11857 11865 11869 11895-11899 11950 
11970 12006 12022 12041 12078 12143 12159-12160 12178 
12195 12197 12256 12265 12305 12327 12359 12363 12375 
12387 12405 12424 12428 12438 12442 12467 12511 12521 
12523 12535 12539 12546 12564 12570 12590 12599 12601 
12609-12611 12616 12618 12630 12653 12656 12664 12666- 
12667 12670 12674 12688-12689 12691 12739 12754 12830 
12834 12904 12913-12914 12916-12917 12923 12940-12941 
12951-12952 12956 12972 12976-12977 12993 12999 13002 
13052 13062 13064 13066 13072-13073 13082-13083 13095 
13104 13127 13131 13133 13136 13141 13143-13146 13148 
13154-13155 13169 13172 13184 13195-13196 13249 13254 
13260 13263 13280 13295 13307 13319 13394 13442 13456 

l^d.77 1'3A7Q-1'3Aftn l^/lftft 11AQ(\ l^/ioo l?/to/l iiyinc n^iM 

13503 13506-13507 13512 13515-13516 13546 13551 13554- 
13556 13572-13574 13590 13592 13613 13627 13631 13644- 
13645 13656 13659-13660 13665 13670 13710 13713 13776 
13782 13795 13859-13860 13864 13866-13868 13872 13882- 
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Tissue 


RNA 


Library 


SEQIDNOS: 


Origin 


Source 


Name 








13885 13888 13891-13892 


adult kidney 


Inyitrogen 


AKT002 


67 142 550-552 598 639-641 1004 1015 1493 1516 19472215 
2299 2343 2363 2449 2618 3054-3055 3 152-3153 3 155-3 157 
3434 3535 3590 3709 3797 3808 3839 3885 401 1 4022 4025 
4040 4253 4277 4391 4405 4671 4759 4767 4769 4837 4949 
4972 5001 5035 5037 5052 5108 5526 5581 5615 5726 5788 
5895 6062 6139 6190-6191 6803 6900 6947 6975 7468-7469 
7473 7733 8095 81 10 8139 8262 8323 8361 8363 8375 8520 
8539 8648 871 1 8798 8912 8950 8966 8983 9076 9264 9368 
9510 9517 9665 9703 9734 10175 10476 10791 10989 10997 
11132 11618 11745 11900 12006 12039 12160 12363 12375 
12405 12424 12685 12702 12707 12904 13035 13060 13104 
13136 13295 13394 13456 13488 13495 13501 13512 13533 
13554 13583 13644 13670 13679 13696 13713 13795 13866 
13888 13891 


adulUung 


GBCO 


ALG001 


83-84 553 598 642-644 650 747 975 1004 1009 1015-1022 
1449 1516 18162161 21862215 2343 2444 29683056 3158 
3160-3 162 3345 3466 3503 3519 3566 3568-3573 3595 3709 
3797 3810 3839 3885 4025 4039 4059 4194 4405 4622 4755 
4767 4770 4797 4865 4940 4949 4963 4972 4987 4991 4998 
5001 5117-5129 5171-5172 5233 5401 5481 5581 5724 5788 
5938 5960 6123 6125 6140 6215 6322 6336 6343-6348 6371 
6886 6966^967 7024 7143 7275 7441 7444 7502 7522-7523 
7749-7752 8093 8110 8140 8336 8345 8375 8378 8449 8535 
8612 8622 8635 8648 8651 8674 8702 8707-8708 mSSS39 
8898 8941-8942 8944 8948 8951 9076 9128 9457 9466 9475 
9497 9503 9544 9567 9703 9756 9758 9808-98 10 9843 9873 
9926 9999 10161 10175 10275 10329 10344 10400 10458 
10554-10555 10796 10998 11032 11046 11069 11132 11144- 
11145 11483 11627 11818 11837 11901 12006 12028 12039 
12041 12087-12092 12146 12363 12375 12424 12438 12539 
12570 I2601 12617 12661 12670 12674 12688 12738 12749 
12754 12904 12940 1297413062 13173 13195 13232 13234 
13260 13295 13307 13456 13472 13477 13490 13494 13497- 
13498 13569 13584 13592 13660 13663-13666 13670 13743 
13776 13882 13885 13889 13891 


lymph node 


Clontech 


ALN001 


83 142 364 487 495-497 554 629 645-646 648-650 716 938-951 
953-962 1134 1516 1549 1671 1774 19762138 2225-2232 
2234-2235 2340 2879 3019-30203057 3103 3163-3165 3272 
3356 3498-3505 3507-3513 3515-3516 3538 3548 3628 3697 
4194 4201 4253 4405 4641 4687-4688 4771-4772 4783 4808 
4845 4963 4972 4987 4998 5042-5065 5076 5163 5504 5523 
5835 5895 5917 6027 6142 6192 6272 6288 6308-63 1 1 6313- 
6314 6765 6805 6871 6949-6951 7700-7705 7707-7709 8085 
8088 8091 8110 8235 8375 8387 8432 8629 8631 8633 8648 
8677 8709 8713-8715 8830 8863 8887-8889 8891-8896 8943- 
8944 8966 9010 9076 9111 9128 9142 9222 9455 9472 9520 
9544 9734 9774 9780 981 1 9905-9907 9928 9939 9999 10027 
10129 10296 10439 10452 10501 10543-10545 10679 10777- 
10778 10788 10891 10999-1100011111-11113 11115 11130 
11145 11344 11406 11513 11584 11885 12006 12028 12050- 
12056 12143 12256 12363 12405 12442 12570 12674 12690 
12913 12917 12940 12968-12970 13173 13220-13221 13394 
13400 13492 13503 13511 13533 13630 13642 13645 13713 
13868 13885 13889 13891 


young liver 


GIBCO 


ALV001 


211 498 598 651-653 1008-1014 1193 1264 1575 1976-1977 

91^1 91/S1 9954.9955 99£Q I'XfA 9^?« 9Ano/J07 0«o oci/: 

2961 3021 3059 3093 3166 3215 3272 3356 3378 3426 3479 
3559-3565 3590 3597 36273634 3673 3709 37973810 3885 
3993 4006 401 1 401 9 4025 4194 4246 4253 4277 4422 4426 
443 1 4434 4437-4438 4634 4654 4687 4714 4722 4755 4773- 
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Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








4774 4808 4957 5001 5037 5052 5113-51 15 5163 5167 5171 
5508 55 10 5526 5581 5616 5693 5726 5757 5772 5788 5928 
6125 6141-6142 6193 6288 6340-6342 6589 6765 6815 6886 
6901 7024 71 18 7141 7442 7524 7735 7747-7748 7792 7852 
7870 7924 7930 8066 81 10 8120 8262 8375 8378 851 8 8520 
8648 8652 8691 8702 8716-8718 8816 8838-8839 8938-8940 
9097 91 19 9263-9264 9301 9317 9424 9454-9455 9544 9689 
9703 97349758 9775 9925 9947 10393 10477 10769 10774 
1 1001 1 1 132 1 1 140-1 1 142 1 1 173 1 1208 1 1 3 1 8 1 1406 1 1 587- 
11588 11725 11804 11902 1208 M2086 12118 12160 12178 
12375 12377 12405 12424 12479 12511 12570 12599 12601 
12618 12670 12674 12688 12737 12830 12913 12916 12933 
12953 12977 12993 13047 13062 13073 13326 13169 13174 
13230-13231 13295 13488-13489 33494 13498-1349913502 
13506-13507 13575 13592 13646 13660-13662 13670 13866 
13868-13869 13882 13888 13901 


adult liver 


Invitrogen 


ALV002 


6 25 60 142 598 1004 1213 1296-1301 1381 1493 1513-1514 
1977 2139 2161 2269 2310 2335 2342 2400 2413 2449 2458 
2497 2568-2569 2576 2636 2809 2827 2880 2926 347 1 3484 
3503 3597 3722 3792-3796 3885 3994-3995 4025 4060 4201 
4326 4422 4426 4618 4689 4767 4786 4985 4998 5093 5163 
5337-5339 551 1 5526 5581 5693 5701 5724 5726 5757 5895 
5922-5923 5979 6020 6027 6083 6125 6192 6195 6253 6333 
6346 6483-6485 6716 6765 6797 6975 7169 7422 7468 7614 
7642 7807 7932 8089 81 10 8140 8262 8323 8378 8633 8677 
871 1 8740 8759 8786 8834 8839 8944 9002 9128-9129 9264- 
9265 9282 9466 9484-9485 9517 9677 9700 9773 10007 10046- 
10047 10135 10616 10669 10777 11032 11114 11132 11142 
11194 11271 11389 11462 11502 11560 11587 11602 11818 
11980 11989 12086 12160 12195 12253-12254 12316 12327 
12363 12378-12379 12411 12424 12511 12570 12630 12693 
12706 12788-12789 12840 12863 12913-12914 13047 13062 
13072-13073 13090 13242 13256 13378-13379 13389 13479 
13487 13489 13497 13555 13734-13738 13859 13864 13889 


adult liver 


Clontech 


ALV003 


346 2529 3548 3797 3885 4438 4940 5101 5801 5902 9597 
10769 10778 11587 11927 12086 


adult ovary 


Invitrogen 


AOV001 


6 13 40 67 83 88 142 196 444 522 555-560 577 579 598 609 
654-657 659-665 667-668 708 740 745-749 751-754 756-761 
806 841 871 1004 1023 1028-1034 1036-1039 1041-1047 1055 
1087-1115 1119 1151-1162 1298 1346 1359 1456 1493 1507 
1516 1658 1697 1752 1774 1812 1826 1848 1914 1974 1976- 
1977 1979 1983 2099 21 1 1 2138 2153-21552161 2175 2180- 
2181 2186 2258-2267 2284-2286 2288-2290 2298-2299 2342- 
2343 2449 2483 2523 2529 2555 25692576 2591 2597 2618 
2708 2750 2783 28 1 8 2839 2926 2961-2962 2988 3007 3022 
3031 3060-3061 3063 30693166-3172 3174-3175 3177-3189 
3226 3272 3286 3288-3289 3291-3293 3295-3299 3435 3450 
3460 3479 3486 3503 3535 3577-3589 3612 3627 3629 3631- 
3643 3673 3675 3720 3722 3737 3797 3806 3810 3812-3813 
3839 3885 3985 4006 4012-4014 4023 4025 4040 4043 4060 
4078 4133 4192 4194 4201 4246 42694277 4282 4341 4362 
4381 4383 4385 4388 4390-4391 4396 4399 4402 4405 4410 
4434 4465 4473 4515 4543 4582 46104654 4689 4694 4715- 
4717 4722 4759 4767 4775-4782 4836-4837 4873-4878 4880 
4940 4944 4957 4960 4962-4964 4972 4975 4981 4985 4998 
5001-5002 5018 5030 5037 5040 5045 5075-5076 5093 5108 

ciof 0*20 c i yi^c CI AO C\£.i cifti c»nn on ro<lO C\AC 

51j5oIJo 51Jool4o 5148 5163 5191-5209 5213 5238-5245 
5262 5374 5380 5451 5455 5490 5503 5505 5521 5523 5527 
5560 5581 5674 5701 5724 5757 5778 5783 5788 5794 5817 
5874 5894-5895 5902 5906 5908-5909 5916-5917 5923 5932 
5938 5979 5989-5990 6005 6027 6048 6054 6057 6061 6078 
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Origin 


RNA 
Source 


Library 
Name 


SEQIDNOS: 








6082 6086 6125 6140 6143-6144 6165 6171 6192 6194-6200 
6215 6235-6239 6241 6270 6288 6307 6333 6336 6354-6360 
6362-6364 6374 6385-6392 6410 6412 6415-6417 6458 6461 
6508 6555 6583 6595 6598 6604 6607 6624 6626 6629 6708 
6765 6767 6793 6797 6799 680 L 6805 6825 6860 6870-6871 
6873 6886 6888 6900 6902-6904 6915-6917 6947 6969-6970 
6975 6980-6987 6998 7054 7058 7104 7109 7118 7133 7137 
7141 7155 7166 7169 7171 7191 7204 7281 7386 7408 7431 
7443 7468 7471-7472 7494 7507 7525-7528 7604-7609 7614 
7642 7687 7716 7729 7762-7763 7765-7767 7769-7771 7773- 
7775 7788 7802-7818 7829-7835 7838-7841 7864 7905 7937 
7977 8007 8035 8077 8088 8093 8095 81 10 8120 8134 8139- 
8140 8143 8156 8235 8246 8262 8292 8320 8336 8345 8355 
8358 8363 8365 8368 8370 8372 8375 8387 8444 8452 8456 
8486 8520 8593 8610 8633-8634 8653 8719-8724 8773-8777 
8779 8781-8782 8792 8798 8813 8830 8839 8853 8863 8870 
8876 8898 8928 8944 8950 8956-8961 8963-8971 8989 9004- 
9007 9009-9019 9042-9046 9055 9076 9097 9128 9134 9185 
9264 9283-9284 9291 9313 9358 9424 9445 9454-9455 9457 
9460 9466 9471-9472 9475 9480 951 1 9517 9533 9539 9542 
9544 9553 9565 9577 9586 9612 9626 9677 9694 9703 9722 
9734 9757-9758 9773 9776-9778 9791 9S12-9819 9827 9838 
9843 9848-9853 9922 9927-9940 9942 9962-9970 9989-9992 
9994-9997 10007 10027 10097 10149-10152 10249 10252 
10275 10298 10333-10334 10346 10349 10351 10383 10386 
10417 10438-10439 10452 10459 10470 10478-10480 10493- 
10498 10557-10562 10564-10565 10580-10586 10594-10596 
10645 10658 10676 10679 10767 10772 10774 10778 10782 
10788 10791 10954 10971-10972 11002-1 1004 1 1032 1 1047- 
11052 11061 11069 11083 11108 1 1130 1 1 132 11151-11153 
11155 11157-111581116711170 11172-1117611193-11196 
11274 11343 11369 11406 11411 11431 11588 11596 11600 
11604 11606-11607 11618 11620-11621 11629 11668 11814 
11860 11865 11876 11903-11909 11932 11949-11954 11980 
11985 12000 12006 12017 12022 12033 12039 12080 12083 
12092-12093 12095-12104 12127-12129 12131-12142 12146 
12159-12160 12162-12163 12178 12213 12216 12236 12256 
12260 12305 12327 12363 12368 12371 12375 12379 12387 
12401-12402 12405 12424 12430 12467 12520 12522 12546 
12570 12576 12590 12594 12599 12605 12609 12611 12615 ' 
12617-12618 12630 12636 12643 12657-12658 12663 12670 
12674 12685 12688 12691 12693 12702 12705-12707 12713 
12724 12729 12735 12740-12745 12749 12754-12760 12765- 
12770 12777 12842 12848 12875 12904 12906 12910 12913 
12916-12917 12935 12940 12955 12957 12963 12972 12977- 
12980 12982-12984 12992-12994 12997-13000 13020 13034 
13047 13051 13054 13056 13060 13062 13066 13070 13072- 
13073 13075 13082 13090 13092-13093 13104 13126 13136 
13141 13175-13177 13179-13180 13193-13194 13196-13197 
13202 13236-13241 13243 13245 13249 13254 13263-13275 
13277 13286-13292 13295 13319 13351 13377 13389-13391 
13394 13420 13436 13456 13477 13479 13488 13492 13494- 
13495 13497-13499 13502-13503 13506 13512 13516 13533 
13546 13549 13554-13555 13575 13590 13597 13600-13601 
13613 13616-13620 13627 13631 13644-13645 13649 13659- 
13660 13670 13674-13679 13693-13699 13707-13709 13713 

117X7 HRftl I^Rfvl 1 1R6Q 1^£77 17517^ IIKfio 
ij/OZ, UOUj Ijolr* IjOuO ijOOo'IjOO? iJo/Z 1jo/j JIjOOZ- 

13885 13888-13889 13892 


adult placenta 


Clontecb 


APL001 


669-671 1006 1134 1184 1551 2053 20902156-2158 2342 
2490 2716 3001 3064-3065 3 190 3272 3625 3670 401 9 4055 
4194 4201 4246 4641 47184720-4721 4783-47864957 5523 
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Library 
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SEQIDNOS: 








5536 5581 5788 5895 6145-61466201-6202 6358 6803 7049 
7204 7529 8367 8375 8609 861 1 8630 8725-8727 9097 9480 
9734 9820 10319 10460 10579 10795 10895 11519 11723 
11737 1 1776 11827-1 1828 11959 12403 12546 12570 12663 
12848 12875 12913 13051 13394 13782 13869 13889 


placenta 


Invitrogen 


APL002 


142 561 1023 1551 1976 2449 256926143191 3340 3592 3668 
4060 4346 4767 4787-4789 4892 4985 5536 5674 5693 5757 
5772 5794 5906 6147 6203 6215 6598 6788 6799 6979 7158 
7530-753 1 8139 8361 861 1 8863 9457 9484 9517 9563 9703 
9758 9821 9927 9939 9942 10481 10557 10778 10802 10973 
1 1 006-1 1 01 1 1 1 723 1 173 3 1 1737 1 1776 1 1 807 1 1 866 1 191 0- 
11916 11959 12379 12403 12590 12685 13020 13054 13392 
13489 13533 13554-13555 13867 13882 13888 13891 


adult spleen 


GIBCO 


ASP001 


463 499 562-563 598 672 990-993 995 1004 1020 1346 1515- 
151615561774 1877 19141977 21592161 2215 2248 2257 
2363 2529 2569 2614 2617 2812 2827 3093 3192 3195 3199 
3272 3286 3350 3356 3434 3470 3503 3537 3544-3546 3548 
3604 3634 3689 3709 3720 3722 3796-3798 3812 3839 3885 
4006 40 11 4022 4025 4043 4194 4201 4246 4253 4277 4388 
4391 4396 4405 4434 4641 4689-4690 4781 4957 4987 4998 
5001 5017 5030 5037 5052 5076 5118 5512-5513 5523 5526 
5701 5740 5778 5788 5796 5801 5895 5938 6005 6064 6125 
6140 6147 6192 6204 6272 6329 6458 6551 6590 6607 6873 
6886 6931 6958 7018 7058 7109 7118 7133 7171 7233 7506 1 
7532-7533 7561 7586 7733 7736 7807 7842 7845 7933 8065 
8085 8093 8095 8110 81 16 8139-8140 8226 8235 8262 8323 
8359 8363 8365 8368 8372 8456 8535 8648 8702 8792 8816 
8838-8839 8858 8863 8865 8876 8923-8928 8943 8950-8951 
9128 9257 9264 9317 9455 9457 9466 9474 9544 9560 9773 
9781 9919-9920 9927 9939 9942 10000 10048-10049 10155 
10175 10267 10275 10296 10331 10549 10566 10774 10777 
10828 10944 11032 11061 11108 11113 11132 11173 11272 
11401 11406 11620-11621 11804 11818 11861 11917 12006 
12039 12041 12073-12074 12078 12118 12127 12236 12327 
12375 12387 12402 12405 12424 12426 12511 12546 12560 
12570 12601 12617-12618 12670 12674 12734-12735 12749 
12772 12790 12913 12916-12917 12972 12977 13002 13021 
13066 13082 13084-13085 13169 13228-13229 13256 13260 
13295 13316-13317 13394 13456 13488-13489 13494 13497 
13512 13516 13546 13645 13649 13651 13659 13679 13739 
13776 13864 13866-13867 13888 


testis 


GIBCO 


ATS001 


50 142 500-502564-565 598 673-678963-968 1193 1346 1556 
1671 1826 1968 1977 2160-21612236-2237 2555 2590 2597 
3024 3093 3196-3202 3284 3378 3517-3522 3524 3720 3797 
3839 3885 4006 4025 4055 4109 4253 4354 4377 4388 4405 
4434 4473 4480 4543 4688 4792 4940 4957 4962 4964 4975 
5001 5018 5030 5037 5067-5072 5076 5118 5171 5481 5525 
5535 5740 5757 5788 5907 5924 6125 6147 6257 63 16-6322 
6324 6607 6886 6952 7058 7141 7288 7445 7484 7534-7540 
7677 7710-7711 7713-7717 7735 7792 7815 8087 8110 8120 
8262 8368 8370 8375 8535 8577 8648 8655 8702 8729 8838- 
8839 8848 8898-8906 8936 8939 8966 9076 9087 9244 9264 
9304 9455 9466 9485 9542 9567 9703 9758 9822-9825 9908- 
9910 9928 10462 10475 10478 10482 10551 10583 10679 
10772 10778 10795 10955 10974 11108 11113 11116 11118- 
11120 1113211173 11239 11425 1160611620 11629 11695 
11807 11862 11918-11919 12000 12006 12017 12033 12057- 
12061 12375 12405 12424 12570 12577 12599 12601 12605 
12609 12668 12692 12730-12731 12748 12904 12917 12923 
12929 12935 12940 12956 12971 13047 13051 13073 13090 
13169 13222-13225 13249 13394 13442 13456 13479 13494 
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13497 13502-13503 13506 13512 13515 13533 13549 13555 
13575 13613 13616 13627 13644-13646 13660 13866 13875 
13882 13889 


Genomic 
DNA from 
BAC 63118 


Research 

Genetics 

(CITB 

BAC 

Library) 


BACOOl 


8711 


adult bladder 


Invitrogen 


BLDOOl 


731 1710 1779-1791 2076 2367 2479-2481 4067 4208^214 
4681 4767 4775 5163 5169 5553 55605581 56745684-5688 
5794 6082 6244 6716 6718-6721 6799 6931 7045 7094-7097 
7967 81 10 8226 8249-8258 8364 8648 8737 9012 9097 9425- 
9428 9430 9626 9703 9928 10007 10261 10557 10734-10735 
11459 11504-11506 12000 12212 12512-12513 12515-12517 
12670 12735 12914 13085 13439 13512 13687 13838-13841 
13866 


bone marrow 


Clontech 


BMDOOl 


11 70 83 85 142 150 162-184 186-198200-210230-243 245- 
277 279-281 306 370 373-387 389 433-435 438-440 457 483 
503-510 566 568-569 595-596 598 609 616 679-683 740 770 
843-857 859 861 864 867-876 880-884 886-890 892-893 896 
900 902-920 1000 1004 1116-1320 1122-1126 1128-1134 1184 
1211 1346 1359 1516 1552-1554 1575 1583 1671 1724 1774 
1877 1894 1927 1974 1976 1983 2012-20242031 2033-2038 
2040 2043-2044 2084-2088 2111 2120-2121 2132-2133 2137- 
2138 2161 2163-21642186 2189 2199-22002202-2203 2205- 
2210 2213-2218 2266 2291-2295 2342-2343 2414-2416 2444 
2529 2555 2566 2569 2575-2576 2591 2597 2652 2681-2709 
271 1-2716 2735-2738 2740-2744 2746-2748 2750-2756 2758- 
2759 2761-2764 2766-2768 2770 2772-2781 2783-2787 2806 
2812-2813 2816 2873 2875 2880-2881 2883-2885 2887-2897 
2899 2901 2926 2948 2954 2958 2977 2984 3026-3029 3032 
3068-30693071 3081 3093 3096-30973139 3203-3205 3207 
3238 3257 3272 3282-3283 3286 3317 3345 3356 3404-3418 
3422-3434 3437-3438 3440 3442 3447 3449-3450 3456-3459 
3461-3464 3466-3473 3483 3497 3535 3538 3548 3557 3572 
3588 3593 3600 3604-3605 3610 3612 3625-3627 3634 3644- 
3647 3649-365 1 3653-3657 3709 37 1 1 3722 3725 3727 3737 
3797 3804 3808 3810 3839 3899 4005-4006 401 1 4015-4019 
4023 4025 4040 4043 4058 4060 4129 4132 4192 4194 4201 
4246 4253 4277 4282 4367 4403 4466-4472 4474-4477 4479- 
4484 4486-4490 4492-4497 4509 45 12-4540 4582 4595 4597- 
4606 4608-4615 4622 4642 4648 4650 4654 4667 4691-4695 
4723 4741-4742 4759 4767 4783 4794-4796 4808 4836 4928- 
4929 493 1-4934 4938 4940 4942 49444948 4950-4953 4955- 
4960 4962-4964 4967-4971 4973 4976-4979 4985-4987 4992- 
4998 5000 5004-5016 5030 5037 5052 5075-5076 5093 5143 
5149-5151 5163 5169 5210-5215 5217-52265262 5317 5357 
5503 5523 5535 5537-5540 5560 5604 5695 5740 5748 5766 
5788 5796 5801 5862 5874 5895 5906 5908 5938 5965-5967 
5969-5979 5991-5994 5996-6005 6007-6009 6027 6047 6057 
6065 6067-6068 6071-6072 6082 6086 6089-6091 61 12 6125- 
6127 6148-6152 6195 6214 6233 6257 6269-6276 6278-6280 
6282-6283 6285-6289 6292-6300 6321 6374 6393-6403 6508 
6555 6605 6607 6722 6730 6788 6815-6822 6826-6832 6836 
6859-6861 6867 6870-6871 6873-6874 6905 6915 6929 6933- 
6937 6940-6942 6944-6945 6988-6992 7051 7126 7155 7166 
Vloy /lyy-721U /212-7219 722o 7233-7253 7256-7264 7275 
7317-7318 7320 7322-7326 7347 7349 7373-7374 7379 7382 
7446-7448 7473-7474 7491 7502 7542 7544 7569 7577 7579 
7663-7671 7674 7677-7678 7680-7684 7686 7689-7696 7742 
7768 7791 7815 7819-7824 7847 7864 7930 7995 8065 8085 
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8088 8090-8093 8095 81 10 8120 8134 8139 8224 8226 8235 
8241 8246 8262 8336 8354 8359 8363 8365 8368 8375 8421- 
8427 8429-8440 8444 8452 8456-8460 8463-8465 8467-8476 
8478-8489 8538-8539 8542-8551 8563 8566 8577 8593-8595 
8597 8603 8633 8635-8637 8648 8656-8657 8671 8677 8679 
8701 8792 8796 8817 8830 8840-8845 8847-8849 8852 8855- 
8860 8863-8868 8870 8873 8875 8879-8881 8898 8935 8944 
8963 8965 8995 9010 9012 9020-9031 9049 9051 9066 9076 
9092 9097-9098 9128 9257 9264 9285-9288 9304 9317 9339 
9381 9399 9401 9454-9455 9466 9471-94729474-9476 9516- 
9517 95 19 9529 9544 9622 9630-9643 9646 9650-9665 9689 
9705-9712 9730 9734 9740 9758 9779-9781 9826 9885-9889 
9892-9902 9928-9929 9939 9942 9972-9976 9978-9982 9999- 
10000 10068 10153 10175 10249 10275 10277 10284 10287 
10296 10319 10321 10343-10344 10346 10364-10374 10379- 
10382 10385-10394 10414 10416 10426 10429 10439 10447 
10452 10467-10468 10475 10483 10520 10522-10526 10528- 
10529 10531-10535 10587-10590 10595 10616 10677-10679 
10691 10750 10760 10767 10772 10778 10788 10794-10795 
10828 10831-10837 10846-10847 10849-10851 10853-10858 
10891 10895 10897 10899-10903 10905-10907 10931-10933 
10956-10959 10975-10976 11012 11032 11046 11057 11059 
1 1061 1 1077-1 1082 1 1085-1 1086 1 1088-1 1098 1 1 108 1 1 1 13 
11132 11145 11177-11181 11194 11208 1 1289 11344 11401- 
11402 11513 11618 11620 11631 11673-1167511677-11690 
11695 11697-11704 11706 11708-11715 11736 11742 11762- 
11772 11774 11795-11796 11802-11804 11817-11818 11829 
11839-11842 11863 11920 11946 11992-12002 12004-12008 
12010 12012-12013 12015-12019 12022-12029 12033 12041 
12081 12142-12149 12160 12178 12195 12200 12231 12236 
12283 12305 12308 12316 12327 12363 12368 12404-12405 
12424 12426 12430 12439 12527 12546 12570 12576 12590 
12608 12610 12616 12634-12636 12641-12645 12654-12655 
12657-12659 12669-12670 12674 12721-12724 12754 12761 
12834 12840 12842 12904-12905 12910-12911 12913 12916- 
12917 12922-12923 12925-12928 12933 12938 12956 12966 
12977 12989-12990 12993 12995 13020 13033 13047 13060 
13062 13064 13066 13095 13098 13103-13105 13107 13111- 
13113 13129 13135 13137 13141 13150 13163 13173 13181 
13213-13217 13234 13276-13278 13280 13391 13394 13400 
13456 13472 13477 13490 13492 13494 13498 13501-13502 
13506-13507 13511 13516 13528 13530-13534 13538 13540 
13546 13551-13552 13554-13555 13560 13585-13586 13594 
13613 13630 13633-13639 13644-13646 13649 13659-13660 
13670 13673 13679 13713 13775 13795 13866 13872 13875 
13882-13885 13889 13891 13893 


bone marrow 


Clontech 


BMD002 


51 242 442 654 1004 1134 1841-1904 1908 1927 2023 2107 
2215 2342 2408 2507-2529 2576 2597 2806 2866 3286 3434 
3722 3736-3737 3817 3823 3839 4060 4246 4258-4290 4389 
4396 441 1 4618 4641 4828 4836 4957 4987 5030 5037 5052 
5108 5163 5526 571 1 5743-5772 5774-5804 5813 5895 5939 
6131 6236 6266 6287 6306-6307 6333 6655 6675 6728 6730 
6750-6762 6764-6769 6788 7049 7106-71 17 7137 7156 7738 
8068 8085 8134 8156 8290-8297 8299-8305 8307-8312 8323 
8368 8482 8499 8858 8863 8865 8928 8935 9076 91 1 1 9128 
9465-9502 9626 9700 9703 9923 9928-9929 9942 10277-10280 
10282-10284 10286-10297 10396 10434 10515 10551 10645 
10675 10691 10744-10753 10772 10778 1 1057 11098 11108 
11132 11232 11252 11519 11536-11553 11606 11620 12033 
12039 12146 12260 12305 12387 12402 12405 12500 12554- 
12578 12594 12599 12608 12674 12754 12777 12839 12895- 
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12904 12916 12923 13020-13021 13043 13057-13064 13169 
13249 13446-13454 13479 13492 13494 13513 13533 13555 

iJOjy 13/13 13 1 ij 135U3 l-3o03-13o/j l3o//-13<>o7 13889 

1 1 


bone marrow 


Clontech 


BMD004 


2249 2529 3286 3494 3548 3551 3797 3839 4025 4058 4201 
4277 4282 5052 5108 6545 6961 8262 8898 9474 10000 1 1098 
11818 13021 13893 


bone marrow 


Clontech 


BMD007 


8539 9780 992713021 


adult colon 


Invitrogen 


CLN001 


319 346487 731 799 1792 1848 20502161 24492482-2483 
3431 3901 4215-42174940 495749875163 5239 5560 5689- 
5oy:> 5oo5 591 1 55/23 o/22 6765 7098 7815 7864 7880 8110 
8259-8262 8486 8597 8951 9484 9529 9542 9556 10376 1 1507- 
11508 11617 11869 12127 12236 12424 12518-12523 12601 

lO^lfl 10T7T lOOTiC l"JA#CO ll^O 11AAf\ iicft'r loen 

IZOils IZf H IZyfo 13UOZ 1JU/J 1330/ I344U 13 j(J/ 13512 

13630 13713 13843-13844 13864 13868-13869 13888 


Mixture of 16 
mRNAs* 


Various 
venuors 


CTL016 


6815 107761297713064 13512 


Mixture of 16 

tissues- 

mRNAs* 


Various 
Vendors* 


CTL021 


1671 6738 8432 8648 8863 8944 951 1 10769 13021 13062 
13064 


adult cervix 


BioChain 


CVX001 


50 67 142 158 308 332 346 475 598 654 895 1004 1086 1286 
1449 1516 1671 1698 1701 1711-1756 1758-1776 1828 1848 
1959 2134 2186 2257 2267 2343 2408 2414 2468-2474 2476- 
2478 2608 2716 3002 31363166 3191 3199 3529 3535 3554 
3572 3627 3722 3737 3777 3797 3839 3985 4158-4176 4178- 
4195 4197 4199-4207 4246 4277 4391 4396 4434 4641 4667 
4759 4783 4828 4885 4940 4957 4963 4987 4998 5001 5038 
5075 5108 5163 5293-5294 5455 5481 5523 5552 5581 5646- 
5652 5654-56595661-5671 5673-5681 5687 5701 5711 5723 
5740 5788 5794 5848 5902 5908 5923-5924 5964 6020 6052 
6057 6062 6091 6106 61 12 6125 6129 6181 6350 6371 6374 
6410 6446 6458 6504 6508 6512 6551 6598 6686-6687 6689- 
6705 6707-6715 6788 6873 6893 6917 6998 7008 7045 7078 
7084-7093 7095 7130 7141 7148 7169 7204 7507 7579 7608 
7675 7733 7768 7815 7871 7880 7893 8078 8138 8209-8215 
8217-8236 8238-8242 8244-8248 8298 8345 8370 8444 8456 
8486 8499 8535 8558 8592 8633 8635 8648 8669 8679 8742 
8853 8863 8870 8898 8921 8939 8948 9012 9061 9098 9107 
9128 9137 9153 9304 9308 9317-93 18 9355 9385-9391 9393- 
9403 9405-9406 9408-9418 9420-9422 9457 9466 9475 9510 
9539 9612 9734 9773 9927-9928 9939 9947 9960 101 10 10175 
10230-10256 10258-10259 10267 10274 10319 10329 10344 
10491 10496 10540 10616 10660 10691 10722-10732 10778 
10782 13055 11145 11217 11376 11462 11477-11489 11491- 
11503 115191158411604 11695 11853 1 1869 11891 11980 
12006 12066 12081 12127 12160 12195 12216 12240 12266 
12308 12363 12379 12402 12405 12424 12438 12483-12494 
12496-12510 12579 12605 12610-12611 12617-12618 12643 
12653 12670 12674 12688 12691 12703 12707 12735 12740 
12754 12830 12840 12866 12870-12881 12883-12884 12905 



* The 16 tissue-mRNAs and their vendor source, are as follows: 1) Normal adult brain mRNA (Invitrogen), 2) 
normal adult kidney mRNA (Invitrogen), 3) normal adult liver mRNA (Invitrogen), 4) normal fetal brain mRNA 
(Invitrogen), 5) normal fetal kidney mRNA (Invitrogen), 6) normal fetal liver mRNA (Invitrogen), 7) normal fetal 
skin mRNA (Invitrogen), 8) human adrenal gland mRNA (Clontech), 9) human bone marrow mRNA (Clontech), 
10) human leukemia lymphablastic mRNA (Clontech), 11) human thymus mRNA (Clontech), 12) human lympb 
node mRNA (Clontech), 13) human spinal cord mRNA (Clontech), 14) human thyroid mRNA (Clontech), 15) 
human esophagus mRNA (BioChain), 16) human conceptional umbilical cord mRNA (BioChain). 



Ill 



WO 01/64835 



PCT/US01/04927 



Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQIDNOS: 








12913-12914 12917 12923 12951 12955 12957 12965 12989- 
12990 12993 13002 13020 13035 13045-13050 13062 13064 
13072 13092 13136 13141 13174 1319713254 13283 13307 
13391 13428 13430-13437 13442 13473 13479 13492 13494- 
13495 13497-13498 13502 13532 13554-13555 13575 13590 
13597 13613 13616 13627 13644 13679 13713 13775 13829- 
13837 13866 13868-13869 13872 13884 13888 13891 


diaphragm 


BioChain 


D1A002 


731 1346 3548 371 1 3885 4282 4654 5895 6873 8120 8931 
8936 9455 11132 11818 12405 12609 


endothelial 
cells 


Strategene 

* 


EDT001 


21 51 67 83 332 569 598 609 762 796 1004 1024-1026 1086 
1561 1848 1928 1959 1976-19771983 2138 2161 21662257 
2282-2283 2417 2483 24902555 25692614 2926 3042 3189 
3 191 3272 3300-3303 3426 3494 3503 3548 3574-3576 3605 
3627-3628 3673 3709 3720 3722 3737 3797 3839 3885 4005 
401 1 4019 4055 4133 4 192 4246 4269 4282 4340 4354 4365 
4384 4388 4399 4405 4410-4411 4434 4543 4641 4654 4767 
4797-4799 4802 4881-4882 4885-4886 4888 4940 4957 4964 
4972 4985 4998 5002 5017 5030 5076 5103-5104 5132-5133 
5163 5167 5183-5185 5187-5190 5380 5523 5527 5535 5541- 
5542 5544 5674 5684 5693 5724 5766 5778 5788 5794 5796 
5874 5895 5916 5923 5928 5938 6005 6048 6057 6068 6082 
6165 6205 6215 6240-6241 6307 6321-6322 6349-6353 6383 
6458 6595 6598 6606-6607 6765 6799 6805 6815 6860 6871 
6873 6890 6918 6968 6972 6976 6979-6980 6998 7058 7067 
7104 7113 71 16 7137 7139 7169 7275 7468 7613 7716 7755- 
7757 7759-7760 7797 7799-7801 7930 8077 8084 8093-8095 
8120 8139 8235 8262 8320 8323 8335-8336 8345 8354 8358 
8363-8364 8370 8372 8375 8387 8452 8592 8648 8786 8788 
8792 8813 8863 8898 8944-8955 8965 8996-9001 9051 9076 
9097 9128 9264 9289-9291 9304 93 15 9414 9455-9456 9466 
9472-9473 9475 9484 9504 9517 9529 9542 9563 9570 9626 
9703 9780-9781 9843 99279939 9961 10000 10027 10154 
10267 10285 10321 10330-10331 10342 10344 10349 10496 
10500 10550 10556-10557 10579 10679 10772 10776 10778 
10788 10795 10802 11013 11132 1113611146-11149 11406 
11483 11565 11588 11600 11606-11607 11615 11626 11807 
11818 11932 11955 12006 t2034 12037 12041 12044 12078 
12092-12094 12123-12126 12150 12213 12375 12381 12387 
12405 12411 12424 12426 12522 12570 12576 12590 12601 
12610 12612 12615 12617-12618 12663 12670 12674 12707 
12729 12739 12749 12753-12754 12777 12830 12842 12913- 
12914 12916-12918 12929 12940 12972 12975-12977 13002 
13024 13047 13051 13054 13062 13064 13082-13084 13090 
13092 13094 13123 13126 13136 13195 13235 13263 13380 
13389 13392 13394 13400 13456 13479 13488-13489 13492 
13494-13499 13502 13506-13507 13514 13516 13546 13555 
13568 13575 13590 13592 13613 13616 13621 13630 13649 
13659-13660 13667-13668 13670-13673 13678-13679 13690- 
13692 13713 13796-13797 13838 13866-13869 13872 13882 
13884 13888 13893 


Genomic 
clones from 
the short arm 
of 

chromosome 
8 


Genomic 

DNA 

from 

Genetic 

Research 


EPM001 


150 2023 2327 2490 4109 4783 5503 5560 10267 10760 12017 
12160 12557 12582 12923 13020 13514 


Genomic 
clones from 
the short arm 
of 

chromosome 


Genomic 

DNA 

from 

Genetic 

Research 


EPM003 


5560 12017 12146 
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8 








Genomic 
clones irom 
the short arm 
of 

UIJU UiUUMJUJc 

8 


Genomic 

r\XT A 

UNA 

from 

Genetic 

Kesearcn 


EPM004 


4783 47985560 10817 11926 12017 12160 


esophagus 


BioChain 


ESO002 


999-1000 2449 3272 33 15 3548 3550 3634 3697 3796 4011 
40Z5 4058 420 1 4282 5 106 5 163 5553 6082 6873 7739 9304 
10296 11133 11818 12033 12570 13869 


fetal brain 


Clontech 


FBR001 


51 142 11843664 4060410949405021 5270 5523 5553 6112 
6805 6908 7294 8558 9457 10376 11059 11985 12006 12122 
12160 12754 13438 13507 13888 


fetal brain 


Clontech 


FBR004 


60 2704371 1 4025 4109 4783 5001 6082 7597 9010 9504 
9949 11837 12033 12039 12363 12705 12905 13020 13503 
13512 13891 


fetal brain 


Clontech 


FBR006 


6 60 67 598 800 932 1004 1 170 1793-1794 1796-1797 1799- 
1805 1905-1914 1916-1958 1974 1976 1979 1983 2057 2129 
2 174 222 1 2407 2444 2449 2484-2492 2530-2554 2556-256 1 
2563 2576 2857 3064 3207 3479 3556 3673 3709 3722 4060 
4078 4157 4218-4221 4223-42244277 4291-4334 4338 4355 
4364 4369 4431 4957 5001 5109 5270 5380 5553 5634 5696- 
5706 571 1 5724 5766 5788 5794 5801 5805-5832 5834-5879 
5882-5901 5936 5990 6057 6723-6732 6765 6770-6791 6797 
6805 6894 7049-7050 7100-7102 7105 7118-7123 7125 7127 
7169 7905 8263-8265 8267-8273 8294 8312-8333 8359 8361 
8375 8452 8633 8664 8740 8757 8884 9010 9111 9432-9436 
9503-9516 9518-9545 9547-9551 9556 9570 9577 9780 9895 
9923-9924 9928 9942 10007 10027 10202 10263-10268 10276 
10284 10298-10310 10329 10331 10496 10542 10595 10621 
10736-10737 10755-10761 10772 10774 10795 11108 11132 
11406 11483 11509-11523 11555-11582 11589-11590 11600 
11606 11621 11713 11729 11807 11837 12006 12039 12044 
12092 12113 12218 12231 12236 12327 12363 12398 12405 
12465 12511 12524-1253012576-12577 12579-12601 12729 
12735 12754 12863 12869 12889 12906-12910 12914 12954 
12973 13020-13021 13051-13052 13054 13065 13082-13083 
13427 13445 13455-13470 13488 13490 13496 13498-13501 
13507 13516 13560 I36I3 13630 13649 13708 13713 13769 
13831 13845-13855 13868 13872 13882 13884 13888-13894 
13896-13900 


fetal brain 


Clontech 


FBRs03 


1005 4405 5111 6337 69647742 13084 13864 13891 


fetal brain 


Invitrogen 


FBT002 


51 83 142 321 430 746 932 1054 1058-1065 1493 1833 1947 
2273-2275 2299 2444 2449 2926 3479 3492 3885 4347 4354 
4391 4405 4410 4434 4530 4804 4985 4998 5075 5160-5169 
5380 5428 5466 5750 5788 5801 5895 6132 6215 6371-6374 
6458 659S 6973-6974 7067 7096 7776-7778 7780-7782 7937 
8143 8323 8361 8364 8372 8377 8452 8633 8977-8984 8986 
9010 9142 9264 9332 9457 9474 9503 951 1 9517 9539 9582 
9827 9848 9927 9950-9953 10027 10161 10329 10430 10492 
10573-10575 11014 11160-11164 11406 11628 11742 11814 
11830 11985 12092 12112-12114 12116-12117 12127 12424 
12511 12521 12570 12576 12643 12696 12735 12748-12751 
12754 12830 12835 12913 12957 12977 12990 13002 13020 
13062 13072 13US3 13117 13254-13259 13377 13486 13489 
13496 13499 13507 13590 13649 13685- nR67 
13888 13891 13893 


fetal heart 


Invitrogen 


FHR001 


1001 1004 2250 4025 6334 6765 7740 8933 8935 9457 9544 
10000 11132 12599 12609 13021 13568 13656 13866 


fetal kidney 


Clontech 


FKD001 


142 346 364 511-517 570-572 574 598 685-690 969-970 972- 
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981 983-987 1134 13462123 2167-21692238-22462342 2444 
2483 25 16 2555 2617 2728 2843 2876 3032 3049 3072 3206 
3208 3282-3283 3525-3531 3533-3543 3548 3591 3709 3722 
3797 3839 3878 401 5 401 9 4043 4246 4277 4367 4405 4696- 
4698 4725 4767 4805-4810 4940 4947 4957 4986 4998 5037 
5056 5073-5080 5082-5091 5099-5100 5108 5258 5504 5523 
5560 5923 6005 6207-6208 6225 6272 6288 6325-6332 6478 
6603 6702 6793 6815 6906 6953-6959 7045 7058 7204 7355 
7426 7449-7450 7520 7543-7546 7561 7587 7718-7732 7930 
8077 8097 8262 8375 8387 8452 8520 8638 8658 8736-8737 
8834 8863 8898 8907-8918 8922 8950 9010 9134 9257 9401 
9457 9544 9597 9760 9781 9791 9828-9830 9912 9914-9918 
10296 10440 10484 10546-10548 10772 11108 11121-11129 
11131-11132 11170 11513 11638 11695 11923-11924 12006 
12033 12062-12070 12072 12160 12405 12522 12570 12594 
12599 12605 12626 12663 12670 12732-12733 12749 12848 
12904 129J4 12940-12941 12990 13020 13083 13188 13226- 
13227 13234 13263 13277 13280 13351 13391 13394 13491 
13501 13512 13590 13644 13647-13650 13713 13782 13867- 
13868 13872 13875 


fetal kidney 


Clontech 


FKD002 


3286 5030 5037 5105 11108 12033 12490 12570 13494 13866 


fetal kidney 


Invitrogen 


FKD007 


3272 3806 4025 4253 4277 4654 51 12 5535 5788 5801 8863 
8935 9401 9466 10553 1 1628 11818 13494 13646 13866 


fetal lung 


Clontech 


FLG001 


79 2367 2395 3010 3460 3885 4828 4948 4962 5001 5723 5748 
5902 5908 6186 6738 7051 7067 7677 7759 9264 9553 9700 
10007 10478 11098 12017 12383 12417 12424 12749 12917 
13020 13169 13472 13554 13644 13782 13835 


fetal lung 


Invitrogen 


FLG003 


142 319 364 629 1671 1806-1814 1816-1819 1877 2129 2161 
2169 2367 2449 2493 2529 3191 3503 3610 4109 4225-4234 
4367 4434 4957 5108 5380 5421 5581 5707-5710 5712 5714 
5788 5801 6057 6733-6741 7034 7103 8274-8278 8365 8597 
8948 9264 9327 9437-9442 9444 9466 9510 9525 9530 9539 
9677 9773 9841 10007 10190 10198 10269-10271 10329 11519 
11524-11527 11927 12531-12539 12848 12890 12904 13021 
13072 13249 13445 13472 13489 13551 13575 13649 13670 
13679 13856-13857 


fetal lung 


Clontech 


FLG004 


1003-1004 2597 51106963 9924 10552 11138-11139 12080 
12990 13659 


fetal liver- 
spleen 


Columbia 
University 


FLS001 


-2 4-14 16-22 24 26 28-31 33^6 48-49 51-61 63-68 71-91 93- 
102 104-110 112-124 126-156 158-162282-283 285-290292- 
299 301-304 307-312 314-326 328-338 340-344 346-353 355- 
365 369 390-400 402 436 441 483 557 567 575-585 595 598 
629 673 678 69 1-699 701-702 708 73 1 736 763-767 769-776 
778-786 788-791 793-794 796 925 975 1004 1015 1023 1038 
1068 1104 1134 1144 1184 1192 1216 1264 1298 1346 1482 
1493 1516 1518-1521 1551 1556 1575 1583 1594 1636 1641 
1707 1724 1774 1826-1829 1841 1858 1927 1959 1962-1965 
1967-1972 1974-1979 1981-19982000-20092011 2045-2051 
2053-2055 2057-2058 2060-2063 2065-2083 2089-2094 2 100- 
210121612170 21742184-21942215 2222 22692290 2310 
2342 2409 24 1 1 24 1 4 2444 2449 2458 2483 2490 2497-2498 
2510 2516 2523 2529 2555 2562 2566-2576 2578-2586 2588- 
2591 2593-2601 2604 2607-2608 261 1-2612 2614-2618 2620 
2622-2642 2644 2646-2653 2655-2664 2666 2668 2670-2680 
2696 2750 2788-2793 2795-281 1 2814-2826 2828-2835 2837- 
2842 2844 2846-2848 2851-2858 2860 2862-2871 2876 2878 
2893 2900-2905 2907-2909 2926 2929 2949 2952-2953 2959- 
2960 2984 2992 3032 3058 3069 3073-30763078-3080 3082 
3093 3166 3194 3196 3207 3210-32113213-3214 3217-3225 
3249 3257 3272 3282 3286-3287 3304-3307 33 10-331 1 3314- 
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3317 3319 3321-3332 3334-3346 3356 3419 3426 3434 3446 
3450 3455 3468 3470 3479 3484 3494 3503 3535 3537 3548 
3552 3557 3572 3590 3594 3597 3604-3605 3610 3612 3625 
3627 3634 3668 3670 3709 371 1 3720 3722 3729 3737 3777 
3797 3806 3808 3810 3813 3839 3885 3926 3990 3996-3997 
4006 4009 401 1 4019-4022 4025 4040 4043 4060 4078 4095 
4109 4129 4192 4194 4201 4245-4246 4253 4261 4277 4282 
4297 4335-4338 4340-4357 4359-4361 4364 4366-4367 4370- 
4372 4374-4377 4379^382 4384-4386 4388-4392 4395-4396 
4398-4414 44 1 7-4424 4426-4455 4457-4458 4460-4465 4542- 
4543 4545 4547-4553 4555-4562 4565-4575 4577 4579 4581- 
4582 4585-4588 4593 4596 4607 4616-46294644-4645 4647 
4654 4671 4676 4687 4689 4694 4721 4726-4729 4759 4767 
4775 4783-4784 4788 4790-4791 481 1-4819 4830 4837 4845 
4862 4874 4889-4902 4930 4940 4948-4949 4957 4962-4965 
4972 4985 4998 5022 5029-5030 5037 5040 5075-5076 5092 
5108 51325152 5163 51675171-5172 5335 5380 53985473 
5503 55145516 5523 5526-55275535-55365553 5581 5598 
5604 5616 5674 5684 5691 5693 571 1 5715 5724 5726 5748 
5750 5757 5778 5788 5794 5801 5817 5865 5874 5894-5895 
5902-5914 5916-5919 5921-5928 5930-5949 5951-5954 5956- 
5964 5966 5979 6005 6010 6013-6015 6017-6018 6020-6022 
6024-6025 6028-6032 6034-6048 6052-6062 6064 6068 6073- 
6081 6086 6092-6094 6104 61 12 6125 6135 6140 6147 6149 
6153-6157 6165 6186 6195 6209-621 1 6240-6241 6243-6248 
6250-6253 6272 6276 6287-6288 6307 6313 6338 6371 6374 
6419 6430 6446 6451 6458 6478 6496 6508 6545 6579 6595 
6598 6607 661 1 6624 6629 6642 6658 6695 6726 6728 6730 
6738 6745 6754 6765 6767 6788 6793-6797 6799-6808 6810- 
6815 6833-6843 6845-6847 6850-6856 6858 6860 6862-6865 
6870-6871 6875 6877 6891 6893-6894 6900 6907-6908 6917 
6919-6925 6950 6979 7049 7058 7067 7077 7096 7109 7116 
7118 7128-7131 7133-7135 7137-7141 7144-71477149-7150 
7152-7153 7155-7179 7181-7192 7194-7198 7265-7271 7273- 
7280 7282-7305 7307-73 12 73 16 7327-7336 7375-7376 7383 
7386 7451 7455 7468 7473 7475-7477 7479 7484 7548-7553 
7555-7558 7561 7608 7617-7618 7620-7631 7642 7675 7687 
7695 77167768 7809 781 1 7839 7842 7864 7883 7933 7977 
8069 8093 8095 8105 81 10 8116 8139 8224 8226 8235 8241 
8262 8323 8334-8340 8342-8350 8352-8355 8357-8359 8361- 
8408 8410-841 1 8414-8419 8452 8456 8490-8498 8500-8502 
8504-8526 8529-8530 8537 8540-8541 8553-8560 8563 8566 
8568 8577 8592 8611 8639 8648 8659-8661 8664 8668-8669 
8710 8738-8739 8741 8777 8792-8804 8830 8834 8839 8853 
8858 8863 8865 8876 8898 8926 8935 8939 8950-8951 8957 
8963 9010 9012 9076 9092 9097 91 12 91 19 9128 9257 9264 
9266 9284 9291 9301 9304 9313 9318 9352 9375 9399 9414 
9424 9445 9455 9457 9466 9473-9474 9476 9480 9484-9485 
951 1 9517 9528-9529 9536 9539 9542 9544 9552-9557 9559 
9561-9564 9566-9572 9574-95779579-9630 9646 9655 9666- 
9667 9669-9671 9673 9675-96979700-9701 9703-97049713- 
9715 9718-9719 9722 9726-9729 9734 9752 9758 9761 9777 
9780-9784 9813 9831 9854-9867 9869 9906 9927 9929 9939 
9942-9943 9969 9983 10000 10007 10205 10267 10284 10296 
10305 10311 10313-10316 10318-10325 10327 10329-10331 
10333-10354 10356-10363 10395-10397 10400-10403 10405- 
10410 10413 10417-10422 10427 10446-10447 10463 10470 
10485 10496 10502-10507 10515 10542 10557 10565-10566 
10645 10679 10688 10691 10729 10750 10760 10762 10764- 
10769 10772-10774 10776-10782 10784 10786-10795 10797- 
10815 10817-10830 10859 10861-10869 10871-10879 10881- 
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10893 10908-10916 10929 10934 10943 10948 10968 10971 
10975 10977-10983 31005 11009 11011 11015-11018 1 1028 
1 1031 1 1054-1 1063 3 1094 1 1 108 1 1 1 14 1 1132 1 1136 1 1 142 
1 1 194 1 1208 1 1213 1 1246 1 1390-1 1391 1 1401 1 1406 11411 
11414 U459 11462 11483 11508 11513 11521 11534 11559- 
11560 11565 11573 11585-11603 11606-1161011612-11616 
11618-11638 11640 11642-11657 11659-11667 11669-11670 
11672 11699 11716-11718 11720-11722 11724-11741 11743- 
11758 11760-11761 11775-11787 11797 11799-11801 11818 
11830 11862 11864-11866 11869 11876 11889 11911 11925- 
11928 11958-11961 1 1963-1 1976 11979-11980 12000 12006 
12017 12019 12028 12033-12034 12041 12078 12086 12092 
12122 12127 12159-12160 12178 12216 12218 12236 12245 
12256 12260 12289 12303 12305 12308 12327 12330 12363 
12375 12382 12387 12402-12405 12424 12426 12439 12442 
12465 12470 12476 1251 1 12522-12523 12546 12551 12560 
12570 12576 12590 12603-12602 12604-12619 12621-12633 
12636 12639 12646-12653 12670-12673 12674 12682-12683 
12688 12693 12696 12706 12709-12715 12735 12749-12750 
12772 12790 12823 12830-12832 12835 12842 12849 12855 
12863 12904-12905 12910-12911 12913-12916 12918-12924 
12929 12931-12932 12934-12937 12939-12940 12954-12955 
12965 12973 12976-12977 12989-12990 12993 12999 13020- 
13021 13035 13051 13054-13055 13060 13062 13066-13067 
13069-13087 13089-13095 13097-13102 13114-13117 13119- 
13120 13122-13126 13134-13136 13141 13147 13156 13174 
13193 13195-13196 13198-13201 13254 13260 13264 13277 
13280 13295 13351 13368 13377 13391 13394 13400 13409 
13412 13420 13456 13472-13477 13479-13484 13486-13492 
13494-13496 13498-13499 13501-13510 13512-13516 13518- 
13527 13533 13541-13544 13546-13551 13553-13555 13560 
13575 13587-1358913597 13603-13604 13613 13616 13622- 
13623 13630-13631 13644-13646 13649 13659 13670 13679 
13713 13743 13748 13769 13775 13782 13793 13803 13808 
13818-13819 13858-13860 13864 13866-13869 13872 13882 
13884 13888-1388913891 13893 13901 


fetal liver- 
spleen 


Columbia 
University 


FLS002 


6 16 24 30-31 63 67 81 83 89 95 103 1 15 1 17 126 140 142 147 
150-151 158 162 211225 287 308332 356358390-391438- 
439 483 551 556 641 654 694 701 708 731 788 997 1006 1012 
1047 1082 1151 1154 1178 1184 1208 1212 1356 1480 1507 
1551 1556 1623-1647 1649-1667 1669-1671 1675 1677-1698 
1828-1829 1858 1877 1885 1889 1914 1927 1947 1961 1997 
1999 2028 2057 2070 2092 2098 2138-2139 2154 2161 2174- 
2175 2189 2191 2257 2282 2327 2342 2363 23992403 2409 
2440-2441 2443-2450 2452-2459 2490 2498 2522-2523 2544 
2555 2568 2575-2576 2592 2597 2605 2619 2623 2659 2806 
2809 2812 2860 2869 2879 2903 2926 2932 2988 303 1 3066 
3075 3089 3188-3189 3286-3287 3319 3343-3344 3356 3426 
3535 3548 3552 3554-3555 3583 3591 3610 3625 3634 3673 
3709 371 1 3720 3722 3839 3899 3926 3985 4005-4006 401 1 
4020 4025 4055 4058 4060 4078 4091-41 16 41 18-4126 4128- 
4142 4172 4194 4201 4253 4277 4282434743504353-4355 
4362 4367 4374 4379 4386 439 1 4394 4396 4402 4426 443 1 
4435 4437-4439 4512 4553 4578 4586 4607 4622 4644 4654 
4671 4758 4767 4783 4798 4836 4845 4899 4940 4948 4962- 
4963 4985 4991 4998 5001 5037 5108 5167 5171 5177 5198 
5237 5293-5294 5380 5400 5523 5535-5536 5581-5591 5593- 
5599 5601-5613 5615-5624 5627-5637 5653 5674 5691 5693 
571 1 5724 5726 5733 5748 5757 5772 5778 5794 5817 5874 
5894 5902 5904 5906-5907 5909 591 1 5916-5919 5923-5924 
5927 5929 5932 5938 5941 5948 5957 5959-5960 5962 5964 
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5979 6005 6020 6027 6037-6038 6052 6054 6057 6060-6062 
6064 6068 61 12 6140 6195 6225 6246 6287-6288 6306-6307 
6336 6354 6371 6374 6423 6430 6451 6458 6461 6469 6478 
6496 6508 6538 6545 6563 6580 6583 6595 6598 6607 6629 
6638 6641-6643 6645-6675 6709 6726 6730 6765-6766 6788 
6792-6793 6795 6797-6798 6800-6801 6808 6812 6836 6843 
6850-6851 6855 6860 68646870 6875 6888 69086924 6950 
6958 6968 6998 7003 7018 7045 7049 7056 7071-7079 7095- 
7096 7109 71 16 71 18 7126 7133 7135 7137-7139 7155-7156 
7166-7167 7169 7175 7178 7275 7291 7294 7329 7422 7426 
7478 7608 7627 7687 7695 7716 7815 7839 7871 7893 7914 
7937 7977 8022 8095 8120 8133-8134 8148-8149 8151-8167 
8169-8182 8184 8186-8192 8194-8196 8241 8246 8276 8289 
8298 8336 8339 8343 8345 8349 8355-8356 8361 8363 8365 
8367-8368 8370 8373 8375-8378 8385 8388 8417 8496 8518 
8520 8543 8558 8561 8563 8646 8667 8710 8738-8740 8786 
8803 8813 8865 8926 8946 8948 8963 8970 9010 9049 91 19 
9128 9142 9164 9222 9264 9289 9296 9301 93 17-93 1 8 9320 
9322 9335-9341 9343-9362 9365-9368 9370 9372-9373 9375- 
9377 9399 9455 9457 9466 9472 9475 9480 9483 9495 9526 
9533 9536 9553 9556 9558 9560 9563 9567 9570 9582 9597 
9601 9630 9646 9655 9671 9683 9695 9700 9703 9715 9722 
9729 9733 9752 9758 9783 9843 9848 9855 9880 9936 9942 
9983 10007 10027 10103 10142 10186-10217 10274 10284 
10287 10296 10313-10314 10329 10331 10349 10352 10354 
10400 10405 10430 10439 10496 10500 10507 10542 10621 
10709-10710 10712-10717 10750 10760 10776-10779 10782 
10794 1080910816 10837 10885 10891 10895 10913 10960 
10971 10975 11057 11071 11098 11142-11143 11148 11194 
11246 11401 11406 11430-11453 11455-1146411483 11504 
11508 11556 11560 11589 11596 11615 11618 11626 11631 
11714 11729 11830 11833 11865 11988 12000 12006 12017 
12019-12020 12041 12044 12081 12092 12178 12195 12245 
12256 12277 12297 12327 12363 12383 12402-12403 12405 
12426-12427 12444-12455 12457-12465 12467-12473 12475- 
12476 12520 12546 12576 12599 12607 12611 12614-12615 
12617-12618 12622 12627 12630-12632 12636 12647-12648 
12650 12685 12688 12691 12693 12696 12703 12739-12740 
12743 12763 12808 12830 12834 12842 12849-12858 12860- 
12866 12885 12895 12906 12910 12913 12915 12917 12920 
12922 12929 12933 12940-12941 12954-12955 12957 12965 
12977 12990 12993 12999 13021 13039-13040 13047 13051 
13054-13055 13060 13062 13072-13073 13077 13082-13083 
13085 13092-13095 13098 13117 13123 13136 13173 13195- 
13197 13202 13249 13254 13362 13377 13391 13394 13406- 
13409 13411-13415 13417-13420 13456 13473-13474 13477 
13480 13484 13486 13488-13489 13496-13498 13500 13502 
13504 13506-13507 13511-1351213515 13521 13546 13551 
13554-13555 13560 13575 13590 13597 13613 13616-13617 
13622 13630 13659-13660 13670 13678-13679 13695-13696 
13701 13708 13713 13726 13748 13775 13795-13796 13815- 
13826 13835 13838 13859-13860 13864 13867 13869 13872 
13882-13885 13888-13889 13891 13893 


fetal liYer- 
spleen 


Columbia 
University 


FLS003 


525 2269 2529 26272636 3552 3554-3555 3597 4201 4246 
4253 4422 4426 4434 4438 5536 5801 8935 9536 1 1 136 1 1 142 
11519 11626 11927 12028 12077-12079 12086 12305 12403 
\lt\Z^ 13021 13472 


fetal liver 


Invitrogen 


FLV001 


40 51 6059% 731 1023 1048-1049 1820 19762111 2268-2269 
2449 2483 2529 2568 2576 2614 2809 3207 3220 3468 3595- 
3599 3885 4020 4346 4390 4422 4434 4543 4615 4767 4833 
5092 5152-5155 5270 5505 5526 5553 5581 5715 5724 5726 
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5750 5766 5788 5794 5801 5902 6125 6147 6458 6598 6624 
6645 6765 6799 6805 6862 7104 7856 8070 8156 8359 8361 
8432 8740 8928 8935 8972-89749012 9041 9128 9264 9445 
9474 9485 951 1 9536 9593 9597 9703 9780 9927 9943 9945- 
9946 9948 10007 10137 10275 10334 10529 10557 10566- 
10569 10776 10828 11142 11560 11587 11618 11927 12070 
12086 12105-12108 12127 12218 12236 12327 12424 12511 
1254 J- 12542 12605 12746 12916 12977 12985-12986 13020- 
13021 13060 13064 13135 13246-13247 13472 1347913488- 
13489 13497 13533 13554 13590 13659 13680-13682 13859- 
13860 13882 13888-13889 


fetal liver 


Clontech 


FLV002 


360 996 1 134 3226 5102 9401 


fetal liver 


Clontech 


FLV004 


998 1927 2449 2627 4025 4043 4426 4438 4834 5030 5726 
6730 9474 9942 10769 1 1 142 1 1587 12590 12608 13889 


fetal muscle 


Invitrogen 


FMS001 


40 150 731 894 1264 1555 16J3 1821-1822 1883 1959 2161 
2343 2494-2496 2555 3203 4235-4237 4434 4671 4694 4783 
4885 4940 4985 5037 5045 5481 5716-5718 5724 5788 5902 
5932 6624 6742-6743 6825 7054 7104 7141 7632 7800 8279- 
8281 8863 8935 9204 9264 9446-9449 9451 9474 951 1 9536 
9556 9722 9780-9781 10097 10273-10274 10330 10738-10739 
10750 11098 11406 11528-11530 11560 12305 12544-12546 
12590 12609 12618 12663 12670 12702 12735 12891-12892 
12916 13051 13053 13195 13441-13442 13479 13613 13782 


fetal muscle 


Invitrogen 


FMS002 


337843484434 8932 11132 12033 12570 13782 


fetal skin 


Invitrogen 


FSK001 


60 142 235 319 641 683 800 1015 1050 1346 1774 1823-1825 
2044 2099 2 1 1 1 2161 2215 2223 2280 2367 2401 2408 2513 
2516 2614 2812 2871 2926 3207 3356 3468 3490 3503 3548 
3599 3720 3722 3885 4020 4032 40604109 4238 4240-4244 
4246 4253 4277 4340 4353 4355 4388-4389 4391 4405 4434 
4543 4562 4568 4654 4667 4671 4767 4940 4944 4947 5030 
5037 5075 5163 5198 5296 5380 5481 5514 5581 5656 5674 
5691 5719-5724 5748 5757 5801 5894 5902 5923 6057 6076 
6078 6125 61476215 63366374 64096437-64386551 6563 
6744 6783 6803 6805 6815 6873 6900 6908 6917 6985 6987 
7037 7054 7058 7067 7075 7275 7355 743 1 7484 7507 7587 
7627 7642 7653 7687 7871 7977 8110 8134 8186 8226 8235 
8251 8262 8282-8287 8298 8356 8365 8368 8452 8455-8456 
8539 8597 8648 8679 8813 8863 8898 8921 8935 8939 8948 
8951 8957 8983 9002 9041 9061 9122 9128 9257 9285 9414 
9452-9453 9466 9474 9484 9503 9517 9529 9553 9563 9671 
9677 9703 9714 9781 9841 9927 10007 10175 10296 10343 
10491 10741-10742 10778 10795 10907 10948 11239 11513 
11531 11573 11606 11626 11807 11869 11952 12000 12006 
12160 12190 12218 12256 12327 12363 12371 12375 12378 
12426 12438 12465 12470 12488 12549-12550 12570 12604 
12615 12617 12663 12670 12735 12749 12754 12830 12893 
12904 12910 12916-12917 12976-12977 13038 13084 13090 
13116 13249 13254 13367 13389 13391 13443-13445 13456 
13472 13479-13480 13494 13496 13499 135,05 13512 13516 
13551 13554-13555 13575 13590 13613 13630-13631 13644 
13670 13713 13782 13784 13793 13803 13858 13866 13869 
13882 13891 


fetal skin 


Invitrogen 


FSK002 


1004 3544 4834 5523 9922 9942 1 1 134-1 1 135 12570 13495 
13499 13793 13884 13889 


fetal spleen 


BioChain 


FSP001 


997 5030 9466 11108 12033 12749 13590 


umbilical 
cord 


BioChain 


FUC001 


60 83 89 142 166 567 609 760 997 1302-1304 1306-1307 1309 
1671 1697 1724 1848 1917 1978 2111 2154 2161 22072215 
2315 2343 2444 2569 2576 2591 25972652 2866 2926 3468 
3526 3599 3602 3625 3697 3722 3799-3805 3813 3839 3885 
4025 4060 4173 4197 4246 4277 4340 4364 4387 4391 4395- 
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4396 4405 44 10 4434 4543 4589 4667 4694 4767 4834 4865 
4940 4957 4963 4985 5030 5093 5096 5108 5163 5171 5213 
5313 5340 5342-5344 5380 5481 5521 5526 5627 5656 5674 
5714 5743 5750 5757 5801 5895 5902 5912 5938 6027 6061 
6082 6125 6186 6195 6218 6241 6313 6321 64106458 6486- 
6495 6595 6607-6608 6668 6745 6825 6860 6870 6979 7054 
7141 7260-7261 7275 7329 7355 7373 7383 7444 7579 7586 
7677 7716 7807 7930 7935-7936 7995 8077 8093 8226 8246 
8262 8298 8343 8345 8358 8387 8452 8535 8563 8635 8669 
871 1 8819 8858 8863 8921 8935 8943-8944 8948 8963 9001- 
9002 9012 9097 9130 9132 9134-9138 9264 9303-9304 9313 
9332 9401 9424 9466 9472 9474 9544 9597 9665 9677 9700 
9722 97349758 9834 9841 9848 10050 10175 10183 10267 
10277 10311 10337 10446 10470 10508 10529 10566 10691 
10772 10828 10961 11108 11132 11145 11239 11241 11273- 
11276 11293 11408 11484 11513 11596 11604 11607 11620 
11695 11823 11909 11927 11942 12000 12077 12160 12245 
12255-12256 12292 12305 12308 12363 12383 12404-12405 
12426 12438 125 11 12570 12601 12605 12630 12700 12706 
12754 12875 12913 12916 12935 12957 12993 13002 13009 

11(Yy(\ 11fiO% IQAiO lltldZ I'ilylt niCii i^o? inio 

iJUzu-lJUZl lJUoz 131i5-liJ3o 13141 1325413283 13318- 
13319 13336 13394 13400 13472 13479 13481 13488-13489 
13495-13496 13499 13507 13511-13512 13516 13551 13555 
13568 13575 13590 13613 13630 13644 13713 13740-13742 
13866-13867 13882 13884 13888 


fetal brain 


GIBCO 


HFB001 


51 70 21 1-213 215-222 224-229 445 586-588 598 608 703-710 
712-716 i068 1148 1178 1184 1193 1308 1516 1556 1774 1778 
1957 1974 1976-197720102025-20292134-2135 2145 2161 
2171 2174 2220 2223 2257 2266 2283 2444 2449 2510 2555 
2563 2569 2590 2597 2614 2617 2717 2719-2723 2726 2729- 
2734 2809 2997 3002 3081 3083 3086 3109 3226-3237 3239 
3257 3272 3278 3282-3283 3286 3356 3419 3460 3479 3492 
3495 3544 3548 3557 3590 3604-3605 3625 3627 3634 3673 
3689 3697 3709 3722 3797 3810 3839 3885 4006 401 1 4019 
4022 4025 4040 4054 4059 4095 4194 4201 4253 4277 4282 
4355 4364 4383 4399 4405 4410 4412 4434 4473 4480 4498- 
4504 4507-4508 4517 4543 4654 4689 4694 4730-4733 4735 
4755 4783 4820-4823 4825-4830 4845 4885 4940 4949 4957 
4962 4972 4985 5017 5019 5022 503 1 5037 5108 5132 5167 
5171-5172 5262 5380 5481 5521 5523 5527 5553 5616 5656 
571 1 5740 5788 5801 5894 5902 5906 5908 5923 5938 5979- 
5989 6057 6082 6096 6125 6147 6159 6161-6163 6165 6207 
6212-6213 6215-6222 6241 6257 6288 6336 6369 6374 6419 
6422 6512 6551 6555 6595 659S 6607 671 1 6767 6788 6823- 
6825 6860 6871 6879 6892-6894 6900 6909-6910 7058 71 18- 
71 19 7169 7220 7222 7224-7225 7228-7229 7231-7233 7275 
7426 7431 7444 7481-7483 7485-7488 7560-7567 7569 7608 
7743 7768 7792 7814 7845 7864 7930 7977 7995 8093 8095 
8110 8140 8226 8235 8262 8345 8358 8361 8370 8372 8387 
8441-8443 8445 8447 8450-8455 8520 8535 8558 8597 8603 
8648 8663-8666 8677 8702 8742-8749 8811 8838-8839 8943 
8951 9010 9092 9134 9137 9257 9287 9304 93 17 9455 9457 
9466 9472 9475 9484 9510 9515 9533 9553 95679644-9645 
9647-9649 9734 9758 9781 9785-9786 9791 9832-9837 9927- 
9928 9939 9942 9970 10053 10175 10275 10277 10296 10329 
10375-10378 10434 10464 10486 10496 10645 10679 10691 
10778 10782 10791 10838-10844 10928 11019-11025 11027 
11032 11055 11061 11108 11132 11145 11153 11208 11239 
11343 11483 11513 11588 11596 11604 11606 11620-11621 
11668 11691-1169511818 11867 1186911929-11930 12006 
12033 12039 12041 12044 12047 12066 12078 12197 12218 
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12327 12375 12405 12411 12424 12438 12521-12522 12564 
12570 12576 12582 12590 12599 12601 12604 12611 12626 
12630 12637-12640 12670 12674 12688 12694-12695 12703 
12734 12748 12754 12785 12835 12840 12913 12916-12917 
12929-12930 12935 12940 12946 12959 12989 12993 13020 
13051-13052 13054 13073 13083 13090 13108-13110 13117 
13131 13136 13148 13158 13160 13169 13184-13186 13254 
13263 13277 13295 13389 13394 13438 13456 13477 13479- 
13481 13489-13490 13494-13500 13502-13503 13506 13512 
13516 13535-13536 13555 13575 13590 13592 13605-13606 
13613 13616 13627 13630 13644-13645 13660 13670 13679 
13687 13713 13793 13803 13838 13861 13866 13868-13869 
13875 13888 13891 13896 


macrophage 


Invitrogen 


HMP001 


1002 2253 3548 401 1 4058 4201 4246 4282 5526 8093 8262 
11137 12039 12426 12511 12521 13888 


infant brain 


Columbia 
University 


IB2002 


6 89 142 21 1 276 307 518 589-590 598 644 717-718 720-721 
773 841 921-924 926-932 937 1023 1091 1147-1150 1289 1493 
1522-1525 1557-1564 1724 1778 1928 1947 1976 1978 2097 
2111 2138 21722218-2223 2257 2283 2401 2418-24192444 
2449 2516 2555 2563 25692575 25972627 28092818 2880 
2932 3087 3166 3226 3241-3245 3272 3435 3474 3476-3477 
3481 3483-3486 3488-3489 3492 3494-3495 3497 3538 3544 
3548 3572 3595 3597 3669 3671-3672 3709 3720 3722 3737 
3797 3817 3885 3985 3997-39994006 4011 4022-4031 4040 
4095 4201 4246 4253 4277 4297 4355 4405 4410 4434 4676 
4689 4767 4775 4831-48344837 4885 4962 4983 4998 5017- 
5019 5021-503 1 5035-5037 5040 5095 5143 5164 5167 5169 
5233-5236 5380 5505 5517-5519 5523 5536 5546-5548 5691 
5695 5724 5750 5766 5788 5801 5895 6027 6048 6082 6095 
6111 61196132 6135 61476223 6301-6303 6305-63076374 
641 1 6459 6555 6563 6591-6595 6601 6606-6607 6609-661 1 
6613-6614 6624 671 1 6767 6799 6805 6909 6946 701 8 7052 
7067 71 18 7133 7166 7168-7169 7386 7464 7469 7473 7571- 
7573 7697-7698 7716 7814 7827-7828 7905 7934 7977 8071- 
8073 8084 8098-8104 8235 8320 8331 8351 8354-8356 8364- 
8365 8370 8372 8376 8452 8520 8572 8667-8668 8750-8752 
8813 8830 8863 8883-8885 8950 8958 8963 9040-904 1 9251 
9257 9267 9293-9298 9313 9424 9454 9456-9457 9466 9510 
9514-9515 9533 9542 9556 9576-9577 9597 9626 9646 9722 
9758 9787 9903-99049988 10027 10138-10139 10155-10160 
10284 10296 10319 10346 10496 10536-10542 10557 10670- 
10672 10680 10682 10778 10791 10897 10971 11028-11030 
11052 11100-11109 11122 11132 11192 11392 11403-11404 
11447 11513 11568 11595 11606 11626 11638 11818 11927 
11931-11933 12028 12030-12031 12033-12034 12036-12039 
12041 12047 12049 12146 12155-12157 12160 12327 12344 
12363 12383-12385 12402 12405-12408 12424 12439 12521- 
12522 12570 12590 12601 12611 12615-12616 12626 12639 
12684-12685 12688 12696-12697 12707 12725-12727 12729 
12748-12749 12754 12763-12764 12830 12904 12913 12916- 
12917 12923 12929 12959 12977 12990 12996 13000 13047 
13051 13054 13062 13073 13082-13084 1311713141 13187- 
13188 13196 13218-13219 13249 13257 13277 13284-13285 
13351 13389 13394-13395 13456 13458 13479-13481 13488 
13494-13495 13499 13503 13512 13516 13530 13535 13575 
13607-13608 13613 13616 13641 13649 13659 13679 13705- 
13706 13708 13713 13793 13798-13801 13803 13861 13869 
13872 13875 13883-13884 13888 13891-13893 13896 


infant brain 


Columbia 
University 


IB2003 


6 46 746 1914 1947 1959 1994211122202257 25162962 
3166 3226 3272 3435 3572 3885 4277 4377 4410 4833 4837 
5029-5030 5040 5147 5259 5851 6147 6423 6595 6598 661 1 
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7145 7169 7716 8262 8354-8355 8364 8452 8863 9264 9457 
9515 9556 9577 9626 96469715 9781 9928 9970 10027 10329 
10376 10557 10791 1 1083 1 1289 1 1406 1 1459 11513 1 1663 
11933 12039 12405 12546 12570 12601 12616 12685 12754 
12913 12916 12954 12977 12990 1300O 13056 13062 13195 
13257 13392 13456 13458 13480-13481 13488 13497 13499 
13506-13507 13511-13512 13514 13516 13549 13616-13617 
13649 13793 13861 13884 13888 13891 13896 


infant brain 


Columbia 
University 


IBM002 


1564-1565 19762218 242031664391 5788 6147 6302 10329 
11108 11513 12039 12729 12830 13062 13257 13512 13617 
13803 13859 13861 


infant brain 


Columbia 
University 


IBS001 


927 931 1564 2915 3166 3737 3885 5029 5044 5095 5143 5701 
5894-5895 6135 6307 7018 8452 8963 9424 9556 9626 10296 
10542 11083 11100 11108 11167 12383 12611 12785 13257 
13280 13479 13495 13506 13617 13679 13713 13896 


lung, 
fibroblast 


Strategene 


LFB001 


67 235 552 641 746 1346 2090 2186 2343 2770 2812 2871 
3203 3537 3548 3634 3812 3839 4073 4253 4434 4654 4783 
4786 4821 4940 4957 4963 4985 5163 5172 5492 5523 5788 
591 1 5938 6027 6048 6057 6223 6272 6322 6599 6873 6SS6 
6997 7204 7759 7815 7933 7977 8226 8235 8262 8354 8452 
8648 8742 8791 8830 8863 8898 8944 8948 8951 9000 9076 
9097 9128 9290 9304 9414 9455 9466 9472 9476 9503 9544 
9563 9671 10000 10050 10342 10376 10434 10447 10679 
10760 10857 11055 11094 11289 11402 11607 11818 11837 
12000 12006 12178 12363 12405 12462 12582 12617 12636 
12670 12754 12913 12916-12917 12925 12940 13126 13377 
13394 13446 13456 13494 13497 13499 13502 13575 13613 
13670 13679 13691 13795 13797 13867-13868 13882 


lung tumor 


Invitrogen 


LGT002 


51 69 88 142 158 21 1 444 598 608 722-723 733 795-797 799 
1004 1023 1135-1145 1308 1338 1346 1493 1526-1533 1535 
1566-1569 1626 1654 1816 1841 1848 1860 1976 1983 2090 
2150 2161 2173 2223 2257 2266 2296-2297 2342 2402 2449 
2483 2555 2569 2576 2591 261 1 2623 2724 2809 2868 2880 
2926 3088 3166 3272 3347 3571 3658-3666 3673 3722 3737 
3797 3839 3885 4000-4002 4006 401 1 4025 4032-40344060 
4133 4201 42464253 42774282 4340-4341 43844388 4391 
4399 4402 4405 4434 4543 4622 4671 4686 4767 4783 4791 
4833-4834 4836 4885 4903 4940 4957 4987 4998 5001 5017- 
5018 5030 5037 5052 5108 5163 5210 5227-5232 5422 5451 
5520-5528 5581 5627 5637 5674 5693 5713 5724 5748 5766 
5772 5788 5894-5895 5902 5907 5917 5938 5990 6005 6020 
6094 6129 6147 6171 6224-6225 6239 6241 6253 6322 6336 
6404-6410 6461 6508 6595-6598 6624 6765 6793 6798 6851 
6870 6873 6994 7003 7008 7045-7047 7053-7055 7067 7085 
7109 7116 7139 7141 7158 7169 7187 7468 7517 7524 7579 
7608 7676 7687 7716 7761 7825-7826 8074-8078 8105-8106 . 
8139 8235 8298 8323 8335 8345 8354 8359 8363 8365 8370 
8372 8375 8456 8563 8633 8648 8678 8741 8792 8798 8805 
883 1 8863 8870 8989 9002 9010 9012 9032-9037 9039 9051 
9076 9080 9092 9128 9257 9269-9271 9273-9274 9276 9299- 
9301 9399 9414 9424 9457 9466 9472 9480 9484-9485 9517 
9533 9536 9539 9544 9567 9612 9626 9665 9677 9689 9700 
9703 9758 9773 9784 9788 9838-9841 9870 9928-9929 9942 
9970 9983-9986 10140-10143 10149 10161 10175 10275 10296 
10319 10330 10346 10349 10386 10496 10508 10561 10591- 
10593 10673-10674 10742 10774 10778 10782 10788 10802 
10836 10971 11031-11033 11064 1 1108 11132 11182-11191 
11393-11395 11406 11459 11462 11560 11565 11588 11596 
11604 11606-11607 11615 11618 11629 11821 11823 11845 
11869 11932 11934 11961 31978-11980 12000 12006 12017 
12022 12033 12078 12080 12127 12150-12154 12178 12236 



121 



WO 01/64835 



PCT/USO 1/04927 



Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQIDNOS: 








12327 12363 12371 12379 12386-12393 12409 12424 12521 
12570 12590 12616-12618 12630 12637 12674 12691 12693 
12698-12699 12716 12749 12754 12760 12762 12830 12832- 
12833 12842 12904 12907 12913 12917 12923 12993 13020 
13031-13032 13035 13060 13062 13064 13072 13083 13090 
13136 13150 13193 13197 13202 13234 13277 13279 13281- 
13282 13295-13296 13381-13383 13385-13386 13394 13400 
13456 13477 13479-13480 13484 13488-13489 13492 13494 
13496-13497 13499 13503 13506-13507 13511 13554 13575 
13597 13616 13630 13646 13649 13659-13660 13670 13701- 
13704 13713 13743 13782 13788-13789 13791-13792 13802- 
13803 13864 13867-13868 13872 13882-1388413888 13892 


lymphocytes 


ATCC 


LPC001 


142 316 1034 19772161 23422367 2483 2513 25292932 2962 
3272 371 1 3722 3839 4006 4277 4282 4405 4434 4836 4970 
5037 5239 5451 5706 5796 5924 5941 6048 6147 6301 6765 
6793 6805 6815 7133 7171 7275 7320 7687 7815 8033 8093 
8095 8105 81 10 8246 8262 8345 8365 8368 8370 8535 8664 
8674 8722 8834 8858 8898 8919 8963 9012 9257 9456-9457 
9484 9539 9544 9560 9758 9928 9939 10175 10201 10284 
10333 10496 10520 10529 10679 10744 10766 1 1032 1 1 108 
11508 11513 11519 11853 11862 12033 12160 12363 12387 
12402 12424 12438 12520 12560 12570 12663 12830 12913 
12923 12968 13021 13066 13072 13085 13140 13147-13148 
13394 13479 13488 13495 13499 13507 13575 13867 13869 
13888 


leukocyte 


GIBCO 


LUC001 


21 49 51 67 83 88 94 142 211 316 326 340 368 371 403 438 
443-455 464 485 518-523 525-530 532-535 591-592 616 724- 
726 744 800 841 889 943 1004 1346 1556 1570-1572 1826 
1914 1916 1927 1959 1976-19772095 2103-21072124-2127 
2134 2138 2145 2161 2163 2175 2215 2223 2237 2266 2343 
2363 2412 2421 2449 2483 2529 2555 2569 2575 2597 2617- 
261 8 2695-2696 2827 2876-2877 2899 2903 2961 2965-2975 
3002 3035-3038 3040-3046 3109 3132 3166 3199 3203 3207 
3246-3251 3265 3272 3286-3287 33 12 3344 3356 3361 3378 
3400 3434 3451 3468 3479 3484 3494 3503 3552 3590 3610 
3627 3634 3704 3709 3720 3722 3725 3757 3797 3804 3810 
3839 3885 3985 3996 4006 4009-401 1 4019 4022-4023 4025 
4035-4037 4058 4060 4194 4253 4269 4277 4297 4341 4362 
4388 4390-4391 4396 4399-440044024405 441044124431 
4434 4534 4543 4594 4615 4630 4641 4651-4663 4670 4688- 
4689 4694 4699-4706 4736 4755 4758-4759 4767 4775 4783 
4791 4798 4828 4835-4840 4845 4865 4930 4940 4947-4948 
4957 4962 4964 4972 4976 4985 4991 4998 5001 5003 5017 
5022 5030 5037 5040 5075-5076 5 108 51 18 5143 5163 5171- 
5172 5313 5481 5503 5505 5521 5523 5526 5535 5549 5552 
5656 5691 5724 5726 5740 5750 5766 5772 5788 5794 5796 
5801 5865 5874 5894-5895 5906 5908 5923-5924 5928 5938 
5989 6027 6042 6057 6063-6064 6082 6094-6103 6125 6130- 
6137 6142 6147 6166-6167 6171 6181 6214 6226 6239 6253 
6301 6307 6371 6374 6418 6512 6662 6716 6730 6788 6792 
6799 6815 6836 6860 6873 6876-6877 6886-6887 6945 6975 
6979 7018 7037 7056 7058 7067 71 16 71 18 7137 7155 7158 
7171 7275 7315 7384-7385 7387-7389 7391-7393 7395-7398 
7452-7463 7468 7494 7608 7671 7676 7687 7714 7733 7792 
7815 7845 7864 7870 7905 7930 8093 8107 8110 8120 8139- 
8140 8224 8226 8262 8276 8320 8363-8365 8368 8375 8387 
8432 8452 8456 8520 8535-8536 8539 8562-8563 8577 8579 
8597 8599-8601 8603-8605 8608 8640-8646 8648 8664 8669 
8674 8677 8691 8702 8722 8753 8755 8798 8815-8816 8830 
8838-8839 8858 8863 8870 8876 8898 8943-8944 8948 8951 
9001 9010 9012 9061 9076 9092 9097 9128 9257 9304 9454- 



122 



WO 01/64835 



PCT/USO 1/04927 



Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ1DNOS: 








9456 9466 9472-9475 9484 9497 951 1 9515 9517 9529 9536 
9542 9556 9567 9612 9626 9702-9703 9723 9731-9738 9745 
9752 9758 9762-9770 9780-9781 9789 9791 9808 9832 9842 
9927-9928 9939 9942 9970 10000 10175 10275 10319 10329- 
10331 10333 10345-10346 10412 10428 10430 10433 10435- 
10436 10453-10454 10456 10465 10470 10475 10488 10496 
10513 10566 10645 10679 10683 10691 10774 10776 10778 
10788 10791 10795 10802 10891 10928 10930 10935-10941 
10948 10961-10962 10964-10967 11034 11055 11108 11132 
11136 11145 11239 11289 11405-11406 11459 11483 11519 
11588 11604 11606 11618 11620 11693 11695 11805 11809- 
1 1 8 1 6 11 8 1 8 1 1 83 0 1 1 833 1 1 848 1 1 850- 1 1 856 1 1 868 1 1 927 
11932 11934-11935 11937-11939 11954 11988 12000 12006 
12009 12028 12033 12037 12039 12047 12053 12092 12097 
12118 12127 12143 12159-12160 12178 12186 12327 12336 
12363 12375 12387 12402 12404-12405 12410 12424 12438 
12442 12490 12511 12521-12522 12556 12560 12564 12570 
12590 12599 12601 12605 12615-12617 12636 12643 12653 
12660 12670 12672-12680 12693 12700-12701 12706-12707 
12715 12735 12754 12823 12895 12904 12910 12913 12916- 
12918 12923 12925 12935 12945 12947 12955-12956 12972 
12977 12993 13002 13020-13021 13051 13062 13066 13072- 
13073 13082 13085 13117 13126 13130-13131 13135 13138- 
13140 13147-13151 13161 13169 13189 13254 13295 13391 
13394 13425 13436 13442 13456 13477 13479 13488-13490 
13494-13496 13498-13503 13505-13507 13512 13530 13546 
13550-13551 13554 13557-13564 13575-13581 13590 13592- 
13593 13609 13613 13616 13627 13630 13645-13646 13660 
13679 13713 13782 13803 13864 13866-13869 13872 13883 
13885 13888 13893 


leukocyte 


Clontech 


LUC003 


536 539 541-543 593 728 1552 1927 1974 3089 3252-3253 
3434 3548 3709 371 1 3722 3797 401 1 4019 4040 4060 4194 
4201 4277 4282 4622 4707 4791 484M842 4949 4998 5001 
5030 5345 5895 6052 6138 6227 6419 6595 6754 6765 6767 
6788 691 1-6912 7049 7139 7171 7464 7575-7577 8093 81 10 
81 16 8365 8370 8375 8592 8648 8830 8863 8944 9466 9544 
9612 9758 9771 9773 9790 9929 10326 10346 10679 10779 
10961 11035-11036 1 1132 11401 11513 11853 11940 12160 
12256 12405 12670 12674 12681 12840 12904 12906 13020 
13051 13162 13280 13400 13488 13511 13516 13554 13670 
13679 13860 13869 13883 13889 


melanoma 
from cell line 
ATCC#CRL 
1424 


Clontech 


MEL004 


83 142 360447 841 1061 1346 1516 1573 1724 1959 1963 
2159 2175 2343 2367 25 13 2652 2812 2876 2961 3001 3272 
3345 3604 3627 3722 3754 3839-3840 3985 3988 4040 4194 
4246 4341 4515 4568 4641 4667 4862 4940 5075 5132 5163 
5481 5492 5523 5740 5788 6008 61 12 61 19 6147 6189 6195 
6272 6287 6418 6423 6508 6563 6726 6860 6870 7134 7166 
71 69 7275 7294 7579 7677 7748 783 1 7852 7980 81 10 81 16 
8226 8235 8262 8320 8345 8363 8370 8456 8563-8564 8633 
8838 9001-9002 9128 9296 9304 93 13 9484 9542 9544 9646 
9703 9758 9780-9781 9808 9999 10007 10027 10296 10346 
10470 11145 11239 11289 11401 11406 11508 11596 11606 
11620 11823 11899 11950 12019 12107 12160 12292 12329 
12363 12405 12436 12523 12599 12658 12670 12749 12754 
12774 12842 12930 13020 13296 13394 13458 13489 13498 
13501 13507 13551 13554 13575 13613 13616 13649 13660 
13743 13775 13868 


mammary 
gland 


Invitrogen 


MMG001 


51 67 142 449 594 598 616 708 729-733 1004 1060 1194-1196 
1310-1315 1329 1536-1541 1826 1848 1858 1914 1947 1974 
1976-1978 2174-2175 2283 2299 23 10 2336 2343 2403-2406 
2449 2483 2555 2563 2576 2597 261 1 2614 2617 2812 2827 
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2926 3191 3207 3490 3503 3673 3714-3715 3722 3737 3777 
3796 3807-3819 3885 4003-40044060 4330 4346 4355 4391 
4399 4434 4543 4671 4767 4791 4836-4837 4843 4940 4964 
4975 4985 4998 5001 5017 5019 5022 5052 5108 5155 5163 
5168-5169 5269 5345-5347 5466 5526-5527 5529 5581 5615 
5724 5772 5778 5788 5794 5894-5895 5902 5907 591 1 5917 
5923-5924 6048 6057 6064 6094 6109 6125 6147 6156-6157 
6215 6228 6321 6430 6458 6496-6499 6545 6579 6598-6599 
6607 671 1 6765 6780 6793 6797 6799 6805 6880 693 1 6979 
6987 6996 7012-7013 7048 7054 7057 7104 7141 7158 7169 
7517 7578-7581 7583 7872 7937-7940 8110 8139 8156 8224 
8262 8292 8298 8336 8359 8363 8365 8372 8452 8619 8628 
8646 8648 8722 8758-8759 8782 8813 8815 8863 8882 8939 
8963 8983 9012 9097 91 1 1 9139-9141 9143 9157 9164 9264 
93 13 9327 9347 9466 9484 95 1 1 95 17 9544 9553 9563 9577 
9646 9700 9703 9734 9746 9780 9791 9844-9846 9927-9928 
9942 10006-10007 10051-10055 10144 10146-1014710162 
10285 10330 10346 10439 10490-10491 10496 10542 10557 
10600 10628 10645 10691 10729 10772 10796 10983 11038 
11108 11132 11216-11217 11277-11279 11396-11397 11406 
11459 11513 11596 11606-11607 11626 11628 11823 11830 
11833 11902 11941-11943 11979 12000 12009 12037 12047 
12078 12093 12146 12160 12181 12258-12263 12281 12327 
12371 12383 12387 12395 12424 12426 12438 12442 12484 
12511 12564 12570 12590 12594 12601 12605 12609 12615- 
12616 12618 12621 12653 12670 12691 12699 12702-12703 
12749 12771 12791-12794 12834-12835 12885 12898 12905 
12913 12916-12917 12920 12923 12955 12976-12977 13002 
13020 13035 13047 13062 13064 13066 13072-13073 13084 
13090 13092 13127 13135 13169 13190 13193 13195 13249 
13254 13296 13320-13322 13392 13456 13477 13479-13480 
13488-13489 13494 13497-13499 13505-13507 13512 13514 
13516 13546 13551 13554-13555 13590 13611-13612 13631 
13649 13670 13713 13743-13745 13838 13864 13866-13867 
13869 13872 13875 13882-13885 13888 13891 


induced 
neuron cells 


Strategene 


NTD001 


88 1493 1552 1561 2034 2090 2510 2522 2570 2576 2623 3226 
3272 3839 4006 4058 4282 4384 4694 4783 4888 5103-5104 
5172 5523 5541-5542 5684 5766 5788 5794 5938 6082 6195 
6418 6449 6607 6960 6972 7058 7494 7815 7937 8370 8929- 
8930 8996 9000 9003 9128 9157 9289-9291 9472 9928 10275 
10550 10729 10760 11607 11736 1 1818 12039 12075 12094 
12256 12363 12381 12423 12522 12707 12735 12863 12906 
12958-12959 12963 13020 13052 13054 13196 13202 13310 
13368 13378 13438 13479 13486 13506-13507 13546 13616 
13654 13659 13691 13795 13891 


retinoid acid 
induced 
neuronal cells 


Strategene 


NTR001 


1552 3839 3885 4282 4434 5030 9308 9466 1 1 108 1 1 132 
12034 12039 12405 12424 12570 12905 13047 


neuronal cells 


Strategene 


NTU001 


88 708 1552 1561 1947 2223 2490 2614 3272 3610 3625 3627 
4032 4201 4253 4297 43374434 4543 4622 4641 5104 5163 
5523 5541 5684 5788 6094 6147 6307 6601 6862 7755 8648 
8740 8996 9097 9128 9289 9308 9466 951 1 9544 10333 10779 
lOyoi 11032 11108 1 1 Ij2 12034 12039 12075 12405 12424 
12570 12590 12706 12749 12830 12905 12917 12963 13035 

\*Xf\H1 11A(\f\ 11CA1 ti/ii H/i/ 

\6\)I5 \6Zo\j 134yu 13507 13613 13616 


pituitary 
gland 


Clontech 


PIT004 


1 148 5674 6068 6371 7759 8093 10430 12605 12636 126*58 
12916 13021 13082 13875 


placenta 


Clontech 


PLA003 


3885 4025 5030 5536 571 1 5923 8349 8367 9921 12033 12076 
12403 12511 13655 13885 


prostate 


Clontech 


PRT001 


67 83 142 225 235 379 486 572 616 1286 1316-1329 1538 
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2023 2139 2147 2161 22662290 2337-2344 2346 2367 2422 
24442511 2631 28093001-300230843419 3563 3625 3627 
3797 3810 3821-3827 3829-3835 40194022 4173 4282 4332 
4434 4667 4755 4791 4862 4865 4998 5001 5066 5171 5349 
5352-5356 5358-5361 5363-5364 5481 5581 5656 5674 5723 
5740 5902 6027 6047 6125 6321 6371 6374 6427 6458 6463 
6500-6507 6509 6563 6598 6643 6793 6803 6871 7014-7015 
7058 7104 7275 7320 7608 7635 7733 7842 7852 7864 7941- 
7943 7946-7951 7953-7954 8077 8087 8093 81 10 8224 8226 
8452 8487 8520 8558 8635 8677 8863 8950 8963 8983 8997 
90129145-91489150-91549269 93029317 94669503 9528 
9646 9703 9780 10000 10027 10056-10060 10491 10629 10691 
10777 10891 1114511200 1123911281 11283-1129511344 
11406 11761 11837 11862 12006 12166 12264-12278 12305 
12363 12368 12411-12412 12438 12467 12685 12691 12729 
12734 12795-12798 12800 12830 12863 12904 13010-13011 
13104 13146 13295 13323-13326 13377 13394 13456 13473 
13477 13489 13516 13533 13550 13611 13678-13679 13746 
13866-13867 13884 13889 


rectum 


Invitrogen 


REC001 


6 67 142 683 731 997 1 178 1909 1959 2005 2023 2596 261 1 
2614 2809 2926 3314 3333 3455 3722 3780 3870 4173 4355 
4837 4949 4987 5526 5580 5615 5674 5691 5724 5788 5895 
5909 5924 6057 6112 6195 6765 6805 7815 7833 8095 8664 
8863 9517 9539 9544 9884 9927 10400 10666 1 1401 1 1513 
11606 11985 12160 12327 12428 12693 12848 12910 12977 
13051 13064 13072 13420 13494 13497 13507 13512 13515- 
13516 13866 13869 13884 13888 


salivary gland 


Clontech 


SAJL001 


67 731 800 997 1054 1914 2267 2395 2529 3136 3595 3627 
401 1 4192 4246 4330 4434 4641 4957 4987 5040 5052 5163 j 
545 1 5481 5706 5723 5788 5895 6219 662 1 6801 6900 6975 
7045 7733 8110 8372 8535 8563 8635 8830 8951 9000 9010 
905 1 93 13 9472 9475 9671 9724 9758 9927 10027 1 1 145 
11695 11725 12017 12284 12363 12424 12427 12570 12609 
12670 12674 12693 12977 13035 13307 13554 13617 13867 
13872 13889 13891 


salivary gland 


Clontech 


SALs03 


1516 1724 1858 5030 6186 13657 13864 


skin fibroblast 


ATCC 


SFB001 


2251-2252 5788 6068 12511 


skin fibroblast 


ATCC 


SFB002 


6068 8951 12511 


skin fibroblast 


ATCC 


SFB003 


4025 5895 7741 


small 
intestine 


Clontech 


SIN001 


142 319 627 654 1034 1063 1197-1198 1330-1338 1340-1359 
1575 1646 1774 1814 19782161 2347-235424092876 3046 
3419 3460 3605 3716-3718 3737 3797 3837-3839 3841-3843 
3845-3857 3885 3986 40604201 4301 4351 4385 4568 4689 
4694 5076 5163 5270-5273 5304 5326 5365 5367-5372 5374 
5503 5550 5701 5772 6064 6094 6171 6288 6427 6430-6432 
6438 6510-6522 6598 6615 6793 6815 6997-69987016-7018 
7054 7058 7072 7309 7450 7604 7769 781 1 7873-7876 7955 
7957 7959-7962 7964 8120 8298 8350 8452 8830 8863 8950- 
8951 8966 9010 9073-9075 91 19 9126 9128 9155-9166 9303 
9544 9560 9780 9884 9928-9929 10008-10010 10061-10068 
10097 10262 10330 10351 10601 10630-1063410760 10983 
11061 1121911296-1130811310-11313 11513 1162011693 
12182-12183 12280-12287 12327 12363 12488 12707 12799- 
12801 12922 12991 13012-13014 13035 13051 13064 13297 
13307 13328-13332 13335 13382 13499 13506 13554 13560 
13575 13631 13695 13714 13747-13749 13751 13882 13884 


skeletal 
muscle 


Clontech 


SKM001 


1 104 1346 2363 2367 2495 2555 2876 2880 3555 3634 3722 
4011 4022 4194 4201 4253 4277 4282 4434 4641 4940 4972 
4998 5343 5481 5523 5801 6005 63366873 7408 7995 81 10 
8120 8235 8262 8292 8345 8372 8576 8740 8830 8936 8951 
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Source 


Library 
Name 


SEQ ID NOS: 








9303 9689 10616 10679 11132 11145 11825 11869 11927 
12387 12442 12467 12570 12663 12735 12916 13280 13503 
13872 13885 


skeletal 
muscle 


Clontech 


SKM002 


8535 


skeletal 
muscle 


Clontech 


SKMs03 


6336 6962 8936 


skeletal 
muscle 


Clontech 


SKMs04 


770 1724 3797 4277 6336 12405 13658 


spinal cord 


Clontech 


SPC001 


83 142 390 415 598 668 708 731 11841199-1207 1360-1375 
1377-1396 1516 1574-1576 1595-1596 1849 1927 20702129 
2161 23 1 1-23 14 2345 2355-2368 2423-2424 2430 2484 2529 
2569 2576 2876 3215 3249 3272 3283 3532 3584 3627 3634 
371 1 3719-3722 3737 3839 3860-3884 401 1 4025 4038-4039 
4043 4055-4056 4173 4246 4282 4354 4375 4391 4434 4681 
4767 4781 4808 4964 4985 4998 5037 5163 5233 5274-5277 
5375-5392 5394-5402 5523 5569-5570 5581 5615 5723 5788 
5835 5902 5928 5936 6047 6078 6082 621 1 6288 6374 6433- 
6435 6512 6523-6531 6534 6595 6616 6625-6626 6788 6894 
6979 6999 7018-7026 7126 7166 7359 7473 7642 7653 7807 
7814 7877-7879 7965-7968 7970 7972-7980 8105 8108-8110 
8139 8246 8298 8345 8363 8368 8482 8603 8646 8884 8898 
8981 9010 9012 9076-9078 9098 9167-9179 9184-9189 9264 
9302 9304 9319-9320 9455 9466 9520 9530 9544 9556 9567 
9781 9895 9901 9928 9942 9947 9969 9999 10007 10069- 
10077 10079-10085 10177 10296 10326 10346 10376 10422 
10566 10602 10635-10638 10679 10685-10686 10729 10776 
1 1 132 1 1220 11246 1 1314-11323 1 1325-1 1330 1 1417-1 1418 
11459 11513 11818 12000 12011 12017 12033 12039 12160 
12184-12185 12288-12292 12295-12299 1230M2305 12363 
12375 12383 12387 12402 12413 12442 12468 12527 12605 
12617 12636 12657-12658 12739-12740 12754 12772 12802- 
12809 12830 12835 12841-12842 12905 12923 12940 12976 
13003 13015 13017-13021 13051-13052 13117 13126 13136 
13260 13277 13283 13295 13336-13343 13367 13442 13456 
13473 13477 13481 13495 13497 13499-1350013507 13516 
13659 13670 13713 13715-13716 13748 13752-13759 13803 
13869 13872 13884-13885 13888 13893 13896 


adult spleen 


Clontech 


SPLcOl 


800 1927 4032 4834 6064 6135 6195 6446 6788 6873 7166 
7455 8966 9929 10744 12402 12564 12590 12691 12904 12933 
13082 13500 13506-13507 13516 13575 13864 13869 13883 
13889 


stomach 


Clontech 


STO001 


21 83 142 1004 1208-1215 1217-1219 1397 1399-1405 1671 
23 1 5-23 16 2345 2369-2373 2375 2575-2576 2809 2846 2984 
3136 3166 3537 3610 3698 3723-3725 3839 3885-3897 4057- 
4059 4173 4277 4410 4480 4667 4791 4808 4940 4987 5262 
5278-5281 5283-5284 5403-5405 5407-5424 5481 56565674 
5796 5904 6418 6436-6440 6535-6540 6563 6627-6629 6765 
6940 7000-7001 7027-7030 7064 7135 7509 7604 7880-7885 
7981-7990 8087 81 10 8120 8143 8226 8452 8535 9010 9079- 
9081 9191 9193-9196 9304-9306 9313 9317 9321 9715 10007 
10011-10013 10086-10093 10178-10179 10603-10605 10640- 
10642 1 1069 1 1 167 1 1221-1 1222 1 133 1-1 1337 1 1339-1 1343 
11419 11513 11818 12186-12190 12307-12314 12327 12363 
12425-12427 12438 12617 12773-12774 12810-12811 12834 
13082 13103 13298-13299 13344-13349 13592 13630 13670 
13717 13760-13764 13782 13888 


thalamus 


Clontech 


THA002 


579 598 616 1065 1148 1220-1221 1223-1226 1407-1432 1597 
2266 2317-2319 2340 2342 2376-2378 2380 2431 2444 2555 
3093 3230 3286 3537 3722 3726-3732 3737 3898-3902 3904- 
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Origin 


RNA 
Source 


Library 
Name 


SEQIDNOS: 








3918 3920-3922 4060-4062 4173 4201 4282 4360 4391 5270 
5285-5288 5425-5438 5440-5449 5701 5902 5938 6137 6437 
6458 6461 6541-6544 6546-6550 6630 6755 7031 7095 7119 
7166 7484 7579 7815 78497905 7977 7991-8000 8002-8005 
8126-8128 8134 8363 8558 8664 8786 8890 8930 8963 9082- 
9085 9128 9197-9199 9201-9216 9251 9264 9308 9503 9515 
9556 9646 9703 9928 10014-10017 10027 10094-10100 10102 
10267 10496 10606-10608 10643-10650 10696 10891 11145 
11223-11225 11344-11355 11406-1140711420 11513 11604 
11695 11791 11950 12022 12107 12191-1219912315-12331 
12363 12375 12405 12428-12430 12688 12706 12735 12748 
12754 12812-12813 12815 12835 12914 12959 13020 13023- 
13025 13060 13073 13300 13351-13358 13402-13403 13489 
13496 13506 13512 13719-13720 13765-1376813770 13872 
13883 


thymus 


Clontech 


THM001 


51 142 150 332 346 360 438 546 731 760 895 1004 1104 1227- 
1234 1264 1391 1516 1577-1585 1812 1860 187721292174 
2215 2263 2321-2322 2408 2414 2425 2449 2490 2555 2569 
2575 261 1 2847 2880 3435 3530 3722 3727 3733 3735-3742 
3839 3870 4006 4041 4043-4046 4060 4109 4375 4396 4399 
4434 4667 4671 4759 4791 4885 4976 4987 5108 5289-5290 
5466 5481 5526 5553-5554 5796 5956 5979 6020 6186 6253 
6336 6371 641 1 6438 6441-6447 6617-6621 6765 6788 6797 
6870 6886 6908 6972 6994 7059-7060 7126 7141 7166 7168 
73 10 7383 7450 7494 7632 7716 7779 7887 7889-7891 8087 
81 1 1-81 14 81 17-8120 8139 8226 8343 8368 8370 8452 8456 
8633 8830 8898 8921 8963 9010 9076 9086-9087 9128 9134 
9264 9304 9307-9313 9401 9466 9497 9544 9563 9582 9612 
9626 9646 9758 9781 9927 9937 9969 10018 10065 10163- 
10166 10175 10284 10329 10414 10557 10609-10610 10617 
10679 10687-10691 10742 10760 10772 10776 10778 10881 
10891 11187 11194 1 1226-11229 11274 11406 11408-11410 
11412 11459 11513 11676 11695 11830 11865 11942 12000 
12006 12016 12022 12092 12160 12186 12200-12204 12327 
12363 12414-12417 12427 12462 12470 12490 12564 12693 
12706 12735 12740 12835 12840 12898 12905 12910 12913- 
12914 12916 12977 13020 13036 13051 13062 13111 13126 
13141 13295 13301-13305 13326 13351 13391 13396-13397 
13456 13484 13498 13505 13507 13512 13516 13546 13713 
13803 13805 13808 13866 13869 13885 13888 


thymus 


Clontech 


THMc02 


16 27 51 67 142 390 598 1233 1493 1508 1586-1588 1598-1603 
1724 1841 1918 1927 1959 1976 1979 198920572107 2161 
2223 2290 2407 2426-2428 2432-2433 2444 2487 2569 2617 
3537 3664 371 1 3720 3722 3737 3780 3817 4021 4025 4047- 
4049 4060 4063-4068 4246 4277 4377 4405 4688 4694 4759 
4791 4837 4949 4957 4992 4998 5037 5052 5108 5505 5553 
5555-5564 5571-5572 5683 5766 5772 5788 5796 5894 591 1 
6048 6186 6225 6288 6306 6333 6621 6728 6730 6765 6767 
6780 6788 6815 6867 6873 6979 7049 7061-7062 7166 7169 
7676 7687 7809 8084 8121 8129-8130 8179 8262 8354 8363 
8365 8375 8482 8597 8654 8740 8786 8791 8963 9076 9157 
9257 9264 9314 9323-9324 9401 9454 9466 9519 9529 9536 
9637 9700 9703 9927-99299940 10004 10007 10070 10167 
10169-10172 10180 10182 10267 10284 10326 10331 10383 
10439 10452 10542 10605 10691 10693-10694 10697 10744 
10775 10778 10795 11046 11098 1127411413-11415 11462 
11519 11576 11592 11606 11618 11621 11627-11628 11693 
11807 11814 12022 12034 12044 12080-12081 12086 12160 
12236 12256 12327 12335 12363 12368 12387 12418-12419 
12424 12433 12556 12560 12564 12570 12577 12594 12599 
12612 12663 12735 12754 12836-12839 12844 12905 12913 



127 



WO 01/64835 



PCT7US01/04927 



Tissue 
Origin 
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Library 
Name 


SEQIDNOS: 








12955 12977 13020 13035 13037 13051 13062 13072 13087 
13193 13280 13295 13398 13404 13456 13488 13499-13500 
13503 13507 13516 13533 13535 13546 13611 13613 13630 
13646 13649 13659 13670 13678-13679 13713 13769 13775 
13803 1380S-13808 13810-13811 13869 13872 13882-13883 
13885 13888-13889 


thyroid gland 


Clontech 


THR001 
• 


49 1 1 5 142 360 641 698 800 1004 1 1 34 1 193 1233 1235 1237- 
1241 1329 1433-1471 1473-1476 1478-1479 1481 1589-1591 
1593-1594 1604 1606-1612 1614-1615 1639 1671 1906 1927 
1959 1976 19942090 2099 213421502161 2174 21862215 
2343 2367 2381-2387 2429 2434-2437 2449 2490 2510 2529 
2555 2569 2662 2827 2901 2961 2997 3001 3109 3265 3286 
3425 3490 3503 3560 3643 3698 3722 3737 3743-3747 3797 
3839 3885 3923-3941 3943-3954 3956-3961 40094050-4053 
4060 4069-40824109 4173 4194 4253 4277 4282 4348 4354- 
4355 4376 4391 4396 4412 4434 4641 4689 4694 4755 4759 
4783 4791 4834 4957 4972 4985 4987 5018 5037 5052 5075- 
5076 5108 5147 5163 5171 5270 5292-5294 5317 5380 5450- 
5461 5464-5481 5503 5523 5565-5568 5573-5574 5693 5711 
5724 5757 5772 5788 5894-5895 5917 5923 5938 5959 5966 
6005 6013 6027 6047-6048 6061 6064 6125 6135 6139 6189 
6215 6240 6336 6371 6374 6448-6449 6458 6508 6538 6551- 
6554 6556-6561 6563-6572 6595 6598 6607 6622-6623 6631- 
6636 6793 6803 6815 6873 6899 6955 6958 7032-7038 7040 
7043 7063 7066-7067 7079 71 16 7155 7233 7275 7455 7669 
7743 7792 7839 7845 7857 7892-7893 7910 7930 7989 8006- 
8008 8010-8033 8053 8067 8087 8092-8093 81 10 8116 8120 
8122-8124 8131-8143 8196 8226 8235 8262 8345 8365 8368 
8370 8372 8375 8452 8520 8535 8543 8597 8646 871 1 8760 
8816 8839 8863 8898 8930 8948 8951 8963 8966 9010 9055 
9088-9091 9142 9157 9217-9220 9222-9227 9229-92429244 . 
9257 9264 9304 9315-9316 9325-9330 9368 9401 9455 9466 
9484 9525 9542 9544 9563 9646 9695 9703 9780-9781 9855 
9928 9942 9947 10000 10019 10027 10103-10117 10119-10120 
10173-10175 10183-10185 10346 10376 10440 10470 10496 
10611 10651-10657 10666 10679 10695 10698-10704 10760 
10772 10778 10782 10788 10891 11033 11051-11052 11061 
11063 1111211231 1125611356-1135711359-1136011362- 
11367 11369-1137211376 11385 1140611416 11421-11425 
11462 11513 11588 11605-11607 11620 11833 11869 11932 
11980 11985 12006 12017 12033 12143 12160 12193 12195 
12205 12207-12208 12292 12332-12336 12338-12345 12347 
12349-12356 12358-12360 12363 12368 12405 12420 12422 
12424 12428 12434-12439 12465 12523 12601 12605 12609- 
12611 12617 12636 12645 12670 12691 12707 12740 12749 
12754 12768 12775 12777 12817-12824 12830 12840 12845- 
12848 12863 12911 12940 12955 12977 13020 13026-13028 
13038 13047 13051 13062 13072-13073 13090 13104 13117 
13126 13195 13202 13254 13295-13296 13360-13371 13389 
13391 13399 13412 13479 13488-13489 13492 13494 13498 
13500 13503-13504 13506-13507 13511-13512 13516 13533 
13551 13554 13590 13613 13627 13644 13646 13656 13659- 
13660 13670 13713 13721 13743 13771-13774 13782 13803 
. 13809 13812-13814 13860 13866-13867 13869 13872 13882 
13884-13885 13888 13891 13893 


trachea 


Clontech 


TRC001 


83 438 483 858 1006 1034 1242-1245 1359 1483-1494 1616- 
1619 1621 1671226623242388-2391 24092438-24392444 
2575 3001 3136 3272 3425 3460 3535 3548 3748-3756 3810 
3962-3967 4083-4086 4530 4755 4758 4949 4957 4987 5075 i 
5213 5295-5299 5481-5482 5484 5486-5488 5490-5491 5535 i 
5576-5580 5656 5941 6091 6450-6453 6574-6578 6637 6797 | 
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Library 
Name 


SEQ ID NOS: 








6955 7041-7042 7049 7068 7204 7347 7733 7864 7894 8034- 
8043 8045 8120 8139 8144 8563 8635 8646 9012 9092-9099 
9245-9251 9317 9331 9333 9484 9503 9517 9734 10020-10022 
10027 10097 10121-10123 10125-10127 10284 10344 10478 
10612 10658-10659 10705 10812 10948 11145 11233-11239 

1 1 itx 1 1 1 1 A09 1 1/iOG i lO/io i omo i nno i oo 1 1 iio/v> 
llj/J-ilj/o Il*Kio-l iiy^jL lzuZZ l^ZUy-lzzi 1 12292 

12361-12363 12365-1236612438 12440-12441 12611 12617 

12734-12735 12825-12826 12900 12904-12905 12916 12923 

12965 12972 12976 12989 13029-13030 13306-13308 13372- 

13373 13492 13670 13713 13722 13775-13781 13803 13884 


uterus 


Clontech 


UTR001 


1036 1134 1246-1250 1324 1493 1495-151 1 1622 1671 1928 
2145 2266 2310 2326 2343 2367 2392-23972555 3281 3479 
3720 3757-3760 3762 3969-3986 4088-4090 4192 4201 4246 
4277 4390-4391 4434 45154998 5002 5184 5300-5301 5481 
5492-5504 5656 5695 5712 5794 5936 6116 6475 6579-6587 
6595 6870 7002 7043-7044 73 1 1 7484 7895-7899 8046-8055 
8057-8059 8061 8110 8146-8147 8152 8226 8262 8622 8677 
9008 9092 9137 9252-9257 9334 9370 9414 9466 9539 9703 
10023-10024 10128-10134 10346 10595 10660-10668 10706- 

10708 1 1 14S 1 193Q 1 17RQ 1 1377 1 1370-1 1 1 AQS 1 Uttn 

12367-12369 12371-12374 12442-12443 12570 12670 12693 
12827-12829 12914 13002 13047 13073 13083-13084 13131 
13260 13277 13309-13310 13374-13376 13394 13489 13512 
13713 13782 13784-13787 13866 13869 
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TABLE 2 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Idea tit 










Score 


y 


1 


M36501 


Homo s aniens 


a I Dha-2-macmelobii 1 in 


118 


69 


2 


AF1 18090 


Homo saniens 


PRO2044 


247 


59 


3 


X01683 


Homo saoiens 


aloha 1-antitrvDsin 


544 


78 


4 


L27428 




ri^vprcf* tran<srT*ir»tftCf» 


79 


97 


5 


M22332 


Homn <u*nifMi<3 

* LKJlllXJ oauiciio 


UliAJiL/W U J_/i L/lCUi 


89 


40 


6 


AFO 15539 


Vflj LULU) GUUlla 


rwp/*n 11 a (ipn P 


1 XJ 


11 


7 


X03325 


T-Iomn cani pnc 
ijajiiilt oapiciw 




54ft 


R1 

Oj 


Q 
o 


A R0 10980 


1V1U3 mUSCUlUS 


sprouiy-*f 


bi 

VI 




o 


D88 1 59 


nuiiio sdpieru> 


aceiyi-coenzyme a uansponer 




O / 


10 


O0019R 


oomo Sapiens 


Unman oonrafAH rkrA^Atn QT7^\ TT\ VT^* 

Human secreted protein, oity w invj. 


194 


JO 














1 1 


AT 040^^0 


— 

Homo Sapiens 


/lri7^1A ^ /VTA A(\AA<\ 


1C9 
loZ 


C7 
o2 


19 


A T9 A9 ^/lrt 


Vol vox carteri £ 


hydroxyproJine-rich glycoprotein DZ- 


OJ 


in 
it 






n agar icns is 


rlKOr 






1 J 




Homo sapiens 


reverse transcriptase 


IOC 

liD 


Hi 
01 


14 


T Tyl OQ71 


Homo sapiens 


UKr 1 ; MbK37; putative transposase 


266 


72 








similar to pogo element 






1 J 


uyjjoy 


— ; ; 

Homo sapiens 


putative pi 50 


135 


•in 

37 


10 


VJUZojZ 


Homo sapiens 


rluman secreteu protein, obvj ID NU: 




07 






• 








1 / 


AD J jo 1 


Rattus 


UKr4 


124 


34 






norvegicus 








1 o 


J\r looVOl 


Homo sapiens 


_p ~ — : 

carbon catabolite repression 4 protein 




7< 

75 








homolog 






17 


A TAA7 1 OA 
AJUUZ 1 y\) 


— — ; 

Homo sapiens 


dihydroxy acetone phosphate 


551 


oo 
0 0 








acyltransferase 






90 


VI 97 11 
I 1Z/ lJ 


— 

Mus musculus 


rro-roi-au l rase poiyprotem 


12.1 


45 


91 

Jul 




Homo sapiens 


unnamed protein product 


i /Oil 


oo 
yy 


99 


T TA0071 


Homo sapiens 


UKr l, Mc-ro /, putanve transposase 


77< 


5o 








similar to pogo element 






91 
xJ 


fjni70i 


— ; 

Homo sapiens 


Human secretea protein, oJiy ii-» inu. 


177 


7< 

/5 
















r\r l JOJJU 


~tt i 

Mus musculus 


putative e* i -HZ a i rase 


1 £S 

loo 


JO 




AFT 1QR56 


JuLUillvJ Sapiens 


irivvyioji 




ftl 
OJ 


96 


T TA007d. 


Homo sapiens 


mariner transposase 


1 C7 
15/ 




97 


frfiftOOl 

\j\jKjy\Ji 


Homo sapiens 


riuman secretea protein, oe>v^ iu jnu. 


50 


in 
jU 








AOft9 






98 


AF995771 


numo sapiens 


iai guanine nucleotide dissociation 


19A 
1ZO 


74 








dlUllUiOlUi 






29 


AF1 13685 


1 lXJLllKJ jauiuuj 


PRO0974 

i IVW7 / « 


99 


71 


30 


U83303 


xx\jixi\j oapjL&iio 


linp-1 rpvpr^p trsinQfT'in'tJlie 




50 


31 


Y91577 


XxVlllsJ oayi&llD 


T-Tumsin c^rTPtf»f! nrntp in ipnnpnrp 
XlLUllOXI dwUWlCU pivLviZl Owl^UCIlUC 


979 


75 

/J 














32 


AF003535 


iivjJLiivj oapiwio 


V71.VJL Z. li&C piVJtCUl 


114 


47 


33 


Ml 5386 


Hrtrnri Q^niF»nC 
XiUillLy oaUlClla 


q jinn tti ji — ol ohin 

^aiiiiiia**gi\7iyxu ( 


170 


84 


34 


M19419 

irX X 7*T 1 7 


RyTiic tnncnt liic 
iVllio lutloCUiUo 


r»rn1inp»_r4/*1i calivarv rvrrvtT^in 
pi UlUlC-JJull oittJVaijr piUlCiil 


110 
ill/ 


15 
JJ 


35 


AF911941 


l-I/vm r\ CQni An C 

XlVUiliJ bapiCJlb 


Hr\tYi!JiTi-fnTi1*ainirio T^rAtAin 
W VY UUIllaUl"UiJllulULLUg UiULCUi 


5Rfi 


81 

OJ 














36 


X11885 


^Ti/*/Yf Jori o 
XN iv/CF llailct 


PYt/>ncin (A A 1-690^ 


101 


15 
j j 






tabacum 








37 


U93563 


Homo sapiens 


putative pl50 


127 


58 


38 


U93564 


Homo sapiens 


putative pi 50 


103 


77 


39 


AF069732 


Homo sapiens 


ADA2-like protein 


524 


88 


40 


X61046 


Hydra sp. 


mini-collagen 


101 


34 


41 


AK000322 


Homo sapiens 


unnamed protein product 


566 


80 


42 


G03646 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


57 
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SEQ1D 


Accession No. 


Species 


Description 


Smith- 
v* merman 
Score 


% 

Ideatit 
y 








7727. 






43 


AF071081 


Mycobacterium 
tuberculosis 


proline-rich mucin homolog 


104 


41 


44 


AF092135 


Homo sapiens 


PTD014 


228 


41 


45 


Y73353 


Homo sapiens 


HTRM clone 1870914 protein 
sequence. 


295 


56 


46 


AF1 18082 


Homo sapiens 


PRO1902 


119 


44 


47 


X78926 


Homo sapiens 


zinc finger protein 


442 


52 


48 


X54326 


Homo sapiens 


glutaminyl-tKNA synthetase 


542 


81 


49 


D50645 


Homo sapiens 


SDF2 


321 


95 


50 


M92439 


Homo sapiens 


leucine-rich protein 


344 


80 


51 


U28963 


Homo sapiens 


Gps2 


593 


82 


52 


U41806 


Homo sapiens 


p60 


660 


81 


53 


AF181490 j 


Homo sapiens 


prenylcysteine lyase 


461 


78 


54 


U93570 


Homo sapiens 


putative pi 50 


147 


36 


55 


W73499 


Homo sapiens 


Von WiJlebrand factor. 


529 


76 


56 


AF1 19851 


Homo sapiens 


PR01722 


126 


57 


57 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


115 


61 


58 


AL021939 


Homo sapiens 


dJ352A20.2 (aldehyde dehydrogenase 
family protein) 


422 


90 


59 


L24158 


Homo sapiens 


integrin alpha 9 protein 


117 


71 


60 


Y32157 


Homo sapiens 


Human SH3D1 A protein. 


530 


91 


61 


X61296 


Rattus 
norvegicus 


open reading frame 2 


117 


31 


62 


AK002064 


Homo sapiens 


unnamed protein product 


330 


80 


63 


AB012223 


Canis familiaris 


ORF2 


80 


56 


64 


U93570 


Homo sapiens 


putative pl50 


113 


37 


65 


U15647 


Mus musculus 


reverse transcriptase 


152 


55 


66 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


126 


54 


67 


Y48359 


Homo sapiens 


Human prostate cancer-associated 
protein 56. 


590 


99 


68 


W74879 


Homo sapiens 


Human secreted protein encoded by 
gene 151 clone HTLEF62. 


368 


98 


69 


AF 104921 


Homo sapiens 


succinyl-CoA synthetase alpha subunit 


604 


93 


70 


AF175265 


Homo sapiens 


vacuolar sorting protein 35 


632 


88 


71 


U93571 


Homo sapiens 


p40 


106 


33 


72 


X15324 


Homo sapiens 


angiotensinogen 


330 


84 


73 


Z98204 


Hordeum 
vulgare 


extensin 


111 


38 


74 


Y30713 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


232 


61 


75 


AF1 18092 


Homo sapiens 


PRO2061 


453 


79 


76 


M63175 


Homo sapiens 


autocrine motility factor receptor 


190 


85 


77 


M26361 


Mus musculus 


LINE/Ig H-chain fusion protein 


153 


38 


78 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


171 


58 


79 


X78926 


Homo sapiens 


zinc finger protein 


199 


37 


80 


M77381 


Homo sapiens 


acrosin 


98 


54 


81 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


129 


44 


82 


U93569 


Homo sapiens 


putative pi 50 


94 


38 


83 


AP000616 


Oryza sativa 


similar to RTNG-H2 finger protein 
RHAla(AF078683) 


132 


100 


84 


AF255446 


Crypmecodinium 
cohnii 


Dip 1 -associated protein C 


129 


34 


85 


R59837 


Homo sapiens 


Sequence of human microtubule- 


82 


48 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 










Identic 










Score 


y 








associated protein tau. 






86 


Y02749 


Homo sapiens 


Human secreted protein encoded by 


157 


76 








gene 100 clone HNFIU96. 






87 


AF116712 


Homo sapiens 


PR02738 


91 


58 


88 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


77 


44 








6566. 






89 


Y82742 


Homo sapiens 


DNA replication and repair associated 


315 


79 








protein (DRASP). 






90 


Ml 6961 


Homo sapiens 


alpha-2-HS-giycoprotein 


138 


74 


91 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 


250 


56 








similar to pogo element 






92 


AF220656 


Homo sapiens 


apoptosis-associated nuclear protein 


62 


69 








PHLDA1 






93 


U65928 


Homo sapiens 


Jun activation domain binding protein 


188 


75 


94 


U93568 


Homo sapiens 


putative pi 50 


102 


48 


95 


S80119 


Rattussp. 


reverse transcriptase homolog 


130 


53 


96 


U93563 


Homo sapiens 


putative pi 50 


242 


50 


97 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 


105 


46 








fragment, SEQ ID NO:387. 






98 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


62 








7795. 






99 


X74045 


Equus caballus 


preproalbumin 


289 


65 


100 


AF1 18090 


Homo sapiens 


PRO2044 


269 


90 


101 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


198 


51 








similar to pogo element 






102 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


53 








7214. 






103 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 


225 


82 








fragment, SEQ ID NO:490. 






104 


AF003535 


Homo sapiens 


ORF2-like protein 


114 


47 


105 


AF130079 


Homo sapiens 


PR02852 


133 


56 


106 


AF 130089 


Homo sapiens 


PRO2550 


107 


71 


107 


M63473 


Homo sapiens 


alpha-5 type IV collagen 


131 


45 


108 


AF1 16661 


Homo sapiens 


PR01438 


112 


54 


109 


X92485 


Plasmodium 


pval 


101 


41 






vivax 








110 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


113 


80 


111 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


199 


69 








similar to pogo element 






112 


AF194537 


Homo sapiens 


NAG13 


104 


44 


113 


L27428 


Homo sapiens 


reverse transcriptase 


160 


34 


114 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


137 


56 


115 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 


191 


67 








similar to pogo element 






116 


AF130052 


Homo sapiens 


PRO0956 


163 


47 


117 


L27428 


Homo sapiens 


reverse transcriptase 


117 


36 


118 


U93569 


Homo sapiens 


putative pi 50 


104 


66 


119 


W48351 


Homo sapiens 


Human breast cancer related protein 


96 


66 








BCRB2. 






120 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


57 








4438. 






121 


G03438 


Homo sapiens 


Human secreted protein* SEQ ID NO: 


78 


51 








7519. 






122 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


80 








6613. 






123 


X61296 


Rattus 


open reading frame 2 


94 


36 






norvegicus 








124 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


60 








7882. 







132 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


125 


AF 119900 


Homo sapiens 


PR02822 


168 


68 


126 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


209 


58 


127 


L27428 


Homo sapiens 


reverse transcriptase 


102 


35 


128 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6688. 


99 


63 


129 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


113 


73 


130 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


170 


36 


131 


U93572 


Homo sapiens 


putative pl50 


168 


38 


132 


U37263 


Homo sapiens 


KRAB zinc finger protein; Method: 
conceptual translation supplied by 
author 


155 


57 


133 


G00403 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4484. 


137 


92 


134 


B08918 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID NO:75. 


58 


61 


135 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


128 


66 


136 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


102 


38 


137 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


230 


55 


138 


AK000496 


Homo sapiens 


unnamed protein product 


127 


46 


139 


X53581 


Rattus 
norvegicus 


ORF4 


136 


38 


140 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


158 


48 


141 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


162 


65 


142 


AF090930 


Homo sapiens 


PRO0478 


127 


65 


143 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


141 


58 


144 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ IDNO:387. 


98 


65 


145 


AJ238588 


Sciurus vulgaris 


cytochrome c oxidase subunit III 


417 


72 


146 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


139 


76 


147 


Y36156 


Homo sapiens 


Human secreted protein #28. 


91 


40 


148 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


121 


70 


149 


Y76184 


Homo sapiens 


Human secreted protein encoded by 
gene 61. 


214 


85 


150 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


95 


57 


151 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


126 


66 


152 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


105 


51 


153 


AF1 19900 


Homo sapiens 


PR02822 


116 


62 ; 


154 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


149 


66 


155 


AB009993 


Mus musculus 


collagen al(V) 


105 


36 


156 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


155 


69 


157 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


348 


71 


158 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


146 


54 


159 


AF247705 


Oryctolagus 
cuniculus 


alpha 1 type X collagen 


102 


42 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Wi\ t#»r ma n 

Score 


% 

laeniic 

y 


160 


R95913 


Homo sapiens 


Neural thread protein. 


99 


56 


161 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


112 


52 


162 


X71442 


Rattus 
xiorvegicus 


ORF1; putative 


96 


47 


163 


U93570 


Homo sapiens 


putative pi 50 


11S 


38 


164 


U23515 


Caenorhabditis 
elegans 


weak similarity to adenylyl cyclase- 
associated protein (CAP) and to P. 
chabaudi adami major merozoite surfae 
antigen protein (PIR:A32555). Final 
exon overlaps gene predicted on other 
strand. 


93 


37 


165 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


143 


66 


166 


AF130079 


Homo sapiens 


PR02852 


143 


90 


167 


L27428 


Homo sapiens 


reverse transcriptase 


200 


53 


168 


G04072 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
8153. 


92 


51 


169 


R95913 j 


Homo sapiens 


Neural thread protein. 


116 


54 


170 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


324 


85 


171 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


111 


50 


172 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


111 


69 


173 


AF130089 


Homo sapiens 


PRO2550 


126 


59 


174 


S80119 


Rattus sp. 


reverse transcriptase homolog 


151 


46 


175 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6532. 


113 


53 


176 


U15647 


Mus musculus 


reverse transcriptase 


104 


46 


177 


M24732 


Homo sapiens 


lam in-like protein 


112 


42 


178 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


137 


38 


179 


AF0I0144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


143 


58 


180 


AF194537 


Homo sapiens 


NAG13 


92 


90 


181 


U93564 


Homo sapiens 


putative pl50 


131 


53 


182 


U93574 


Homo sapiens 


putative pi 50 


86 


46 


183 


Y14166 


Gallus gallus 


attachment region binding protein 


91 


40 


184 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


104 


64 


185 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


183 


73 


186 


U93572 


Homo sapiens 


putative pi 50 


139 


64 


187 


M22332 


Homo sapiens 


unknown protein 


79 


41 


188 


Y87202 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNO:241. 


75 


71 


189 


U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


132 


37 


190 


S80119 


Rattus sp. 


reverse transcriptase homolog 


172 


43 


191 


AF194537 


Homo sapiens 


NAG13 


81 


75 


192 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


112 


64 


193 


X92485 


Plasmodium 
vivax 


pval 


96 


40 


194 


R13556 


Homo sapiens 


Protein encoded downstream of hhcJVl 
oncoprotein. 


112 


45 


195 


L27428 


Homo sapiens 


reverse transcriptase 


141 


37 


196 


U93570 


Homo sapiens 


putative pi 50 


201 


41 


197 


X92485 


Plasmodium 
vivax 


pval 


120 


48 


198 


AF130089 


Homo sapiens 


PRO2550 


137 


60 
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SEQ U> 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


199 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase submit) 


377 


84 


200 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


119 


52 


201 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


151 


68 


202 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


247 


78 


203 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


126 


56 


204 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO250. 


362 


80 


205 


R59842 


Homo sapiens 


ApoE4Ll protease. 


100 


82 


206 


AF161356 


Homo sapiens 


HSPC093 


78 


62 


207 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4581. 


111 


48 


208 


U83280 


Leishmania 
donovani 


39 kDa antigen 


121 


53 


209 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


84 


80 


210 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


146 


72 


211 


M69197 


Homo sapiens 


haptoglobin-related protein 


344 


92 


212 


AF034611 


Homo sapiens 


mtrmsic factor-B12 receptor precursor, 
cubilin 


123 


37 


213 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


129 


65 


214 


V00662 


Homo sapiens 


cytochrome oxidase I 


485 


87 


215 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


124 


80 


216 


U35312 


Mus musculus 


nuclear receptor co-repressor 


115 


47 


217 


L26953 


Homo sapiens 


chromosomal protein 


143 


77 


218 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


224 


70 


219 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7534. 


125 


75 


220 


AB018114 


Arabidopsis 
thaliana 


RING finger protero-like 


lil 


38 


221 


D38112 


Homo sapiens 


ATPase subunit 6 


475 


84 


AHA 

222 


V00662 


Homo sapiens 


cytochrome B 


466 


77 


223 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


175 


85 


224 


AF010144 . 


Homo sapiens 


neuronal thread protein AD7c-NTP 


125 


38 


225 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


86 


65 


226 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


130 


54 


iZI 


T TAAf AA 

U09500 


Homo sapiens 


cytochrome b 


274 


62 


228 


AF081104 


Mus musculus 
domesticus 


ORF2 


111 


36 


229 


AF090942 


Homo sapiens 


PRO0657 


88 


57 




G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


76 


57 


231 


G03472 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7553. 


101 


66 


232 


B12310 


Homo sapiens 


Human secreted protein encoded by 
gene 10 clone HDPGP94. 


116 


54 


233 


AF010400 


Homo sapiens 


transaldolase-related protein 


253 


77 


Zi4 


At \yl9lo 


Helicoverpa 
arrrrigera nuclear 
polyhedrosis 
virus 


basic DNA-binding protein BDBP 


137 


50 



135 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: . 








Water man 


Identit 










Score 


y 


235 


A I?nQfYCY2 1 


Homo sapiens 


PRO0483 


123 


75 


236 


U499/3 


Homo sapiens 


ORFl; MER37; putative transposase 


185 


86 








similar to pogo element 






ZJ 1 




Homo sapiens 


ORFl; putative 


99 


40 


238 


UU0368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


58 








4449. 






239 


A PA 1 A 00"i 

Ar0l4oo3 


Homo sapiens 


NADH dehydrogenase subunit 2 


305 


65 


240 


G03102 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


60 


44 








7183. 






Z41 


X o64 /Z 


Homo sapiens 


Human gene 52-encoded protein 


79 


50 








fragment, SEQ ID NO:387. 






*"i ao 

242 


T 7 1 CCAn 

U 15647 


Mus musculus 


reverse transcriptase 


117 


47 


oa-5 i 
243 


a mi m a a i 

Ar 010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 


56 


*"IA A 

244 


G03133 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


94 


45 








7214. 






245 


AK023542 


Homo sapiens 


unnamed protein product 


82 


38 


O A ^ 

246 


X55702 


Drosophila 


polycomb protein 


84 


31 






melanogaster 








247 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


65 








7870. 






248 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


70 








6613. 






249 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 


142 


63 








IDNO:1051. 






250 


Y17832 


Human 


env protein 


297 


71 






endogenous 










retrovirus K 








ZD I 


U9356o 


Homo sapiens 


p40 


103 


46 


252 


A T7AA 1 11A 

AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


201 


47 


253 


AF090895 


Homo sapiens 


PRO01I7 


139 


60 


KA 

254 


V7°AO OO/ 

Y02886 


Homo sapiens 


Fragment of human secreted protein 


185 


72 








encoded by gene 90. 






255 


L27428 


Homo sapiens 


reverse transcriptase 


156 


40 


if/ 
256 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 


157 


59 








similar to pogo element 






257 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


70 








4414. 






258 


A 171 A/1 0*T 

AF 194537 


Homo sapiens 


NAG13 


141 


38 


259 


OA 1 T*» 

B0 1372 


Homo sapiens 


Neuron-associated protein. 


115 


71 


260 


G00637 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


113 


53 








4718. 






26) 


M22332 


Homo sapiens 


unknown protein 


78 


45 


262 


A 17 A 1 A 1 A A 

AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


70 


78 


263 


X61296 


Rattus 


open reading frame 2 


108 


36 






norvegicus 








Z04 


At 01 6099 


Mus musculus 


endonuclease/reverse transcriptase 


178 


45 


265 


G03303 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


63 








7384. 






266 


Y01154 


Homo sapiens 


Protem sequence Seq Id 54 from 


116 


84 








tifrtnAA i <v>A 

WO9901020. 






ZO/ 


UU3/86 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


52 








7867. 






Zoo 


Ar 1 19855 


Homo sapiens 


PRO 1847 


74 


70 


zoy 


ATI A A AAT 

Ar 10990 / 


Homo sapiens 


S164 


85 


61 


270 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


63 








AAA A 
44 11-. 






271 


X92485 


Plasmodium 


pval 


107 


72 






vivax 








272 


AF194537 


Homo sapiens 


NAG13 


167 


51 



136 
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S£QU> 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


07^ 

Z 1 0 




— rr : 

Homo sapiens 


p40 


1 Ail 

104 


40 


z it 




Homo sapiens 


reverse transcriptase 


142 


56 


77 '< 

ZlJ 


vjUUjj4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


60 








a Ate 

4435. 






77A 
z /o 


Y617Q6 
-AX>lZyo 


Kamis 


open reading frame 2 


96 


A O 

48 






norvegicus 








777 
Z/ / 




Homo sapiens 


rKt>0483 


t A A 

140 


65 


77ft 
Z /O 


U4yy / j 


fiomo sapiens 


UKr 1 ; MER37; putative transposase 


231 


66 








similar to pogo element 






77Q 

z/y 


/vrxjUUoy 


— : 

Homo sapiens 


Tin /~vo c 

PRO2550 


164 


60 


OQA 

ZOV , 




y _ - 

Homo sapiens 


PTH-responsive osteosarcoma Dl 


98 


58 








protein 






7C1 
Zol 


JLzzj4o 


Homo sapiens 


collagen type XA^IH alpha I 


92 


38 


7Q7 
ZOZ 


A DAA AZ 1 C 


Oryza sativa 


similar to RING-H2 finger protein 


130 


68 








RHAla (AF078683) 






zoJ 


A T? 1 1 /;t i c 

Arllo715 


Homo sapiens 


T\T> /"VOC40 A 

PR02829 


160 


75 


Z©4 


A DAI Al A A 

Ar 010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


111 


53 


IOC 

zoj 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


53 








7871. 






Zoo 


W48351 


Homo sapiens 


Human breast cancer related protein 


120 


53 








BCRB2. 






7C7 

zo/ 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


147 


66 








similar to pogo element 






Zoo 


U 93 572 


Homo sapiens 


putative pi 50 


125 


32 


AAA 

289 


A T Alf\A 

AL050399 


Arabidopsis 


putative proline-rich protein 


142 


44 






thaliana 








zyu 


VA14QC 


Plasmodium 


pval 


147 


43 






vivax 








291 


A DAilT^AA 

AB 047600 


Macaca 


hypothetical protein 


172 


66 






fascicularis 








707 
ZyZ 


vjuj/oy 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


55 






— 


TOTA 






zyj 




Pongo pygmaeus 


INAUH aenydrogenase suburut 5 


342 


71 


zy4 


A CAAAn/l'* 

Aruy0942 


Homo sapiens 


PRO0657 


99 


66 


zyo 


Mol loD 


Bos taunis 


glutamic acid-rich protein 


114 


44 


zyo 


K/fl 1. 1 AA 

Ml J1UU 


Rattus 


unknown protein 


107 


43 






norvegicus 








ZyJ 


A IT A 1 A 1 /f sf 

Ar010144 


Homo sapiens 


neuronal thread protem AD7c-NTP 


114 


50 


zyo 


X92485 


Plasmodium 


pval 


93 


78 






vivax 








7QQ 

zyy 


UU3 1 /z 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


75 








7253. 








ATJA1 Al A A 
ATU1UI44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


115 


42 




T 7/i^7l 
L,Z**jZl 


Homo sapiens 


transformation-related protein 


117 


60 


OA/} 


V8^A17 


Human 


1 TOO 


219 


49 






herpesvirus 6 










uy^ JO / 


Homo sapiens 


putative pi 50 


130 


48 




ID AOQ 1 O 


Homo sapiens 


Human secreted protein sequence 


72 


61 








encoded by gene 28 SEQ ID NO:75. 










Volvox carteri £ 


hydroxyproune-rich glycoprotein DZ- 


153 


68 






nagariensis 


T TT> J^T> 






3Afi 


JLOol iz 


Homo sapiens 


cytochrome c oxidase subunit 3 


532 


79 


JU / 


VJU3/14 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


53 














IAS 
J I/O 


T 0*7/100 

L.Z ftlZX 


Homo sapiens 


reverse transcriptase 


151 


72 


309 


M69297 


XaUIHU bdpi CRS 


ADC 1 




4J 


310 


X9248S 


Plasmodium 


pval 


81 


60 






vivax 








311 


L27428 


Homo sapiens 


reverse transcriptase 


103 


41 



137 
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SEQ ID 

NO: 


Accession Na 


Species 


Description 


Sroith- 

Watermao 

Score 


% 

Identit 

y 


3 IZ 


A 171 OAft'TQ 

Ar 13UU/9 


Homo sapiens 


PR02852 


135 


49 


313 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


110 


58 


314 


Aruyuy2o 


Homo sapiens 


PRO0470 


88 


48 


3 I j 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


175 


64 


3 10 


uy33oo 


Homo sapiens 


putative pi 50 


148 


46 


317 


AF1 19855 


Homo sapiens 


PR01847 


IQ8 


84 


^ id 


P60839 


Homo sapiens 


Sequence of human serum albumin 
(HSA) on plasmid pXL53. 


175 


50 


319 


W46424 


Homo sapiens 


Human macrophage stimulating protein 
(MSP). 


257 


69 


JZU 


ALU49547 


Homo sapiens 


dJ34F7.2 (CREB-RP (G13)) 


247 


64 


321 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


110 


66 


3zz 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


102 


48 


323 


AF090930 


Homo sapiens 


PRO0478 


141 


72 


324 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


126 


44 


325 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


146 


59 


326 


R59842 


Homo sapiens 


ApoE4Ll protease. 


95 


60 


327 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


84 


61 


328 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


123 


66 


5Zy 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


125 


65 


33U 


AF 1 19855 


Homo sapiens 


PR01847 


121 


80 


331 


L78671 


Homo sapiens 


CoxIIZD-Ioop DNA fusion protein 


364 


71 


332 


AK000496 


Homo sapiens 


unnamed protein product 


145 


41 


333 


D00570 


Mus musculus 


open reading frame (1 96 AA) 


153 


53 


334 


AF1 19855 


Homo sapiens 


PR01847 


116 


74 


IOC 

335 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


50 


336 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


129 


56 


337 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


107 


59 


*3 *5 O 

338 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


115 


72 


339 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 1 02, variant a dna, 
complete virion genome. 


117 


50 


1A A 

J4U 


AK022217 


Homo sapiens 


unnamed protein product 


127 


70 


341 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


115 


75 


342 


AF118086 


Homo sapiens 


PR01992 


141 


73 




X92485 


Plasmodium 
vivax 


pval 


96 


59 


344 


AF 106677 


Drosophiia 
melanogaster 


dissatisfaction 


90 


48 


IAS 


U 12693 


Homo sapiens 


cytochrome oxidase subunit II 


239 


91 


346 


L27428 


Homo sapiens 


reverse transcriptase 


95 


56 


347 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


133 


69 


348 




nomo sapiens 


xiuman secrete a protein, o.bQ ID NO: 
6613. 


81 


51 


349 


X51616 


Volvox carteri 


SULFATED SURFACE 
GLYCOPROTEIN 185 


110 


41 



138 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smitb- 

Waterroan 

Score 


% 

Identit 

y 


350 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


158 


55 


351 


U49973 


Horao sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


183 


60 


352 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


151 


51 


353 


R95913 


Homo sapiens 


Neural thread protein. 


95 


56 


354 


AF061340 


Artibeus 
jamaicensis 


cytochrome c oxidase subunit III 


346 


70 


355 


AF090895 


Homo sapiens 


PRO0117 


126 


60 


356 


AFO 16099 


Musmusculus 


endonuclease/reverse transcriptase 


121 


48 


357 


AF1 18086 


Homo sapiens 


PR01992 


159 


73 


358 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


85 


89 


359 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


98 


50 


360 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


110 


57 


361 


M13100 


Rattus 
norvegicus 


unknown protein 


122 


34 


362 


Y36203 


Homo sapiens 


Human secreted protein #75. 


108 


63 


363 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 1 
7133. 


83 


75 


364 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


212 


65 


365 


AF 130051 


Homo sapiens 


PRO0898 


136 


71 


366 


AF068294 


Homo sapiens 


HDCMB45P 


188 


65 


367 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


225 


70 


368 


S80119 


Rattus sp. 


reverse transcriptase homolog 


188 


45 


369 


U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


75 


48 


370 


AF1 18082 


Homo sapiens 


PRO1902 


98 


79 


371 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


106 


40 


372 


AF014903 


Pan troglodytes 


NADH dehydrogenase subunit 2 


169 


41 


373 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


93 


48 


374 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


94 


66 


375 


G03107 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7188. 


90 


80 


376 


U93568 


Homo sapiens 


putative pi 50 


140 


56 


377 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


140 


50 


378 


AF090942 


Homo sapiens 


PRO0657 


154 


66 


379 


U93568 


Homo sapiens 


putative pi 50 


149 


36 


380 


U93570 


Homo sapiens 


p40 


184 


57 


381 


L27428 


Homo sapiens 


reverse transcriptase 


128 


60 


382 


AF194537 


Homo sapiens 


NAG13 


114 


35 


383 


AF1 16712 


Homo sapiens 


PR02738 


109 


56 


384 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


405 


81 


385 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


116 


63 


386 


M22334 


Homo sapiens 


unknown protein 


124 


39 


387 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


97 


52 


388 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


327 


94 


389 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


157 


70 


390 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


129 


62 


391 


U49973 


Homo sapiens 


OBF1; MER37; putative transposase 
similar to pogo element 


259 


66 



139 
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SEQQ) 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


0/ 
/o 

Identit 

y 


392 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


137 


63 


393 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


129 


54 


394 


AK023582 


Homo sapiens 


unnamed protein product 


148 


46 


395 


M22332 


Homo sapiens 


unknown protein 


128 


41 


396 


AF118086 


Homo sapiens 


PR01992 


160 


71 


397 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


101 


52 


398 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


199 


66 


399 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


234 


78 


400 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


107 


40 


401 


AF000996 


Homo sapiens 


ubiquitous TPR motif, Y isoforra 


116 


61 


402 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


145 


37 


403 


AF1 18082 


Homo sapiens 


PRO1902 


97 


55 


404 


AF202635 


Homo sapiens 


PP1200 


126 


55 


405 


V00662 


Homo sapiens 


cytochrome oxidase I 


352 


68 


406 


AF229067 


Homo sapiens 


PADI-H protein 


129 


71 


407 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


197 


38 


408 


L26251 


Trypanosoma 
brucei 


CR5 


95 


46 


409 


LT49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


310 


62 


410 


X92485 


Plasmodium 
vivax 


pval 


96 


68 


411 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


128 


40 


412 


Y19192 


Talpa europaea 


cytochrome oxidase subunit I 


431 


83 


413 


M10546 


Homo sapiens 


cytochrome oxidase I 


299 


86 


414 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


220 


81 


415 


AF205385 


Pan troglodytes 


NADH dehydrogenase subunit 5 


296 


89 


416 


X58438 


Mus musculus 


proline rich protein 


112 


50 


417 


U70932 


Peromyscus 
leucopus 


reverse transcriptase 


89 


51 


418 


V00662 


Homo sapiens 


cytochrome oxidase III 


200 


84 


419 


AFO 17789 


Homo sapiens 


putative transcription factor CAI50 


120 


41 


420 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


183 


69 


421 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


166 


44 


422 


AF 130051 


Homo sapiens 


PRO0898 


158 


59 


423 


R96S0Q 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


86 


52 


424 


D38116 _j 


Pan paniscus 


cytochrome c oxidase subunit 3 


342 


82 


425 


U93570 


Homo sapiens 


putative pi 50 


133 


41 


426 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 1 


165 


67 


427 


D13951 


Nicotiana 
tabacum 


extensin precursor 


140 


42 


428 


L27428 


Homo sapiens 


reverse transcriptase 


104 


34 


429 


R95913 


Homo sapiens 


Neural thread protein. 


118 


49 


430 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


206 


97 


431 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


88 


55 


432 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


98 


54 


433 " 


J05042 


Oryctolagus 


alpha- 1 (VJH) collagen precursor 


91 


48 



140 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 


Accession No. 


Species 




Smith- 


/© 


NO: 








Waterman 


Identit 










Score 


y 






cuniculus 








434 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


56 








7348. 






435 


U93572 


Homo sapiens 


putative pi 50 


118 


38 


436 


U93569 


Homo sapiens 


putative pl50 


100 


30 


437 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


81 








7437. 






438 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


71 








6613. 






439 


U52077 


Homo sapiens 


mariner transposase 


187 


52 


440 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


45 








7214. 






441 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 


71 


442 


AE003727 


Drosophila 


CGI 67 18 gene product 


301 


48 






melanogaster 








443 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


221 


74 








clone HPMBQ32. 






444 


U35730 


Mus musculus 


jerky 


159 


26 


445 


X53581 


Rattus 


ORF3 


192 


46 






norvegicus 








446 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


52 








4497. 






447 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


156 


38 








similar to pogo element 






448 


AF 1 94537 


Homo sapiens 


NAG13 


315 


70 


449 


W48351 


Homo sapiens 


Human breast cancer related protein 


93 


66 








BCRB2. 






450 


X92099 


Brugia pahangi 


collagen 


126 


44 


451 


AF090930 


Homo sapiens 


PRO0478 


88 


60 


452 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6S, 


93 


40 








SEQIDNO:163. 






453 


AF081114 


Mus musculus 


ORF2 


108 


32 






domesticus 








454 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


173 


65 








7871. 






455 


S801 19 


Rattus sp. 


reverse transcriptase homolog 


197 


54 


456 


G02535 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


89 


68 








6616. 






457 


R95913 


Homo sapiens 


Neural thread protein. 


114 


48 


458 


U09116 


Homo sapiens 


ORF1, encodes a 40 kDa product 


160 


39 


459 


X92485 


Plasmodium 


pval 


99 


52 






vivax 








460 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


52 








8172. 






461 


S80119 


Rattus sp. 


reverse transcriptase homolog 


138 


48 


462 


Y91577 


Homo sapiens 


Human secreted protein sequence 


211 


67 








encoded by gene 2 SEQ ID NO:250. 






463 


X97707 


Pongo pygmaeus 


stopcodon created by 


229 


76 






abelii 


posttranscriptional polyadenylation 






464 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


119 i 


67 


465 


L27428 


Homo sapiens 


reverse transcriptase 


154 


40 


466 


AK000496 i 


Homo sapiens 


unnamed protein product 


140 


69 


467 


G00637 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


114 


61 








4718. 






468 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


69 








4570. 






469 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


286 


79 


470 


D38112 


Homo sapiens 


N ADH dehydrogenase subunit 4 


448 


86 


471 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


296 


79 



141 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO- 








Waterman 


Identit 










Score 


y 


472 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


102 


48 








SEQ ID NO: 163. 






473 


AL080253 


Arabidopsis 


putative snRNP protein 


103 


42 






thaliana 








474 


X99452 


Lycopersicon 


extensin-like protein Dif54 


108 


25 






esculentum 








475 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


68 


34 








7214. 






476 


AB012223 


Canis farailiaris 


ORF2 


78 


66 


477 


AF130089 


Homo sapiens 


PRO2550 


113 


71 


478 


G03652 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


390 


97 








7733. 






479 


AF210651 


Homo sapiens 


NAG18 


146 


80 


480 


AB029309 


Homo sapiens 


Npw38-binding protein NpwBP 


103 


40 


481 


AF1 94537 


Homo sapiens 


NAG13 


118 


31 


482 


R96800 j 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


115 


45 


483 


L27428 


Homo sapiens 


reverse transcriptase 


184 


47 


484 


U93570 


Homo sapiens 


putative pi 50 


101 


50 


485 


AF194537 


Homo sapiens 


NAG13 


213 


52 


486 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


162 


82 








SEQ ID NO: 163. 






487 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


204 


86 








SEQ ID NO: 163. 






488 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


140 


53 








SEQIDNO:163. 






489 


U93574 


Homo sapiens 


putative pi 50 


86 


54 


490 


W48351 


Homo sapiens 


Human breast cancer related protein 


83 


56 








BCRB2. 






491 


AJ27I872 


Nicotiana 


extensin 


220 


47 






sylvestris 








492 


U11288 


Drosophila 


diaphanous protein 


113 


33 






melanogaster 








493 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


184 


70 








8172. 






494 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


68 








7795. 






495 


AF1 19900 


Homo sapiens 


PR02822 


148 


65 


496 


AB026542 


Homo sapiens 


WASP-family protein 


96 


38 


497 


D86853 


Catharanthus 


extensin 


104 


34 






roseus 








498 


AF025467 


Caenorfiabditis 


contains similarity to drosophila DNA- 


109 


47 






elegans 


binding protein K10 (NID:g8148) 






499 


G00492 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


67 








4573. 






500 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


62 








4497. 






501 


AF1 19901 


Homo sapiens 


PR02831 


116 


82 


502 


AF238235 


Entamoeba 


diaphanous protein 


120 


41 






histolytica 








503 


M22332 


Homo sapiens 


unknown protein 


123 


49 


504 


AF1 19851 


Homo sapiens 


PR01722 


204 


52 


505 


X61296 


Rattus 


open reading frame 2 


107 


45 






norvegicus 








506 


AF1 18082 


Homo sapiens 


PRO1902 


145 


52 


507 


W48351 


Homo sapiens 


Human breast cancer related protein 


103 


68 






BCRB2. 






508 


X55685 


Lycopersicon 


extensin (class I) 


175 


39 






esculentum 








509 


X92485 


Plasmodium 


pval 


117 


52 



142 
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SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% | 

ldennt 

y 






vivax 








510 


AF090942 


Homo sapiens 


[PRO0657 


95 


77 


511 


U93569 


Homo sapiens 


putative pi 50 


120 


54 


512 


U93574 


Homo sapiens 


putative pi 50 


140 


49 


513 


AJ242540 


Volvox carteri t 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


196 


63 


514 


L27428 


Homo sapiens 


reverse transcriptase 


132 


37 


515 


U93565 


Homo sapiens 


putative pi 50 


101 


45 


516 


U93574 


Homo sapiens 


putative pi 50 


178 


35 


517 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


81 


27 


518 


AF053538 


Alvinella 
pompejana 


fibrillar collagen chain FApl alpha 


112 


36 


519 


X52235 


Homo sapiens 


ORFII 


148 


35 


520 


AF130051 


Homo sapiens 


PRO0898 


98 


61 


521 


L02106 


Drosophila 
melanogaster 


ribonucleoprotein 


143 


40 


522 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


202 


70 


523 


U93570 


Homo sapiens 


putative p!50 


159 


43 


524 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO.163. 


210 


100 


525 


L27428 


Homo sapiens 


reverse transcriptase 


128 


38 


526 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


96 


65 


527 


X53581 


Rattus 
norvegicus 


ORF4 


130 


42 


528 


U93570 


Homo sapiens 


putative pi 50 


195 


35 


529 


AF130089 


Homo sapiens 


PRQ2550 


132 


43 


530 


AK024455 


Homo sapiens 


FLJ00047 protein 


126 


54 


531 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


123 


61 


532 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


210 


44 


533 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


109 


42 


534 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


120 


62 


535 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


124 


37 


536 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


122 


50 


537 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


118 


43 


538 


A23786 


Beta vulgaris 


chitinase 1 


91 


33 


539 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


116 


39 


540 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


102 


67 


541 


S80119 


Rattus sp. 


reversie transcriptase homolog 


191 


50 


542 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


155 


77 


543 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


125 


91 


544 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


146 


62 


545 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


74 


45 


546 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


94 


75 


547 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


510 


86 


548 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


146 


68 



143 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Ideotit 

y 








RHAla(AF078683) 






549 


Y08061 


Homo sapiens 


Human c-ruyb protein fragment 


128 


82 


550 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


153 


77 


551 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7077. 


121 


56 


552 


X92485 


Plasmodium 
vivax 


pval 


103 


50 


553 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


129 


46 


554 


L27428 


Homo sapiens 


reverse transcriptase 


149 


44 


555 


AF194537 


Homo sapiens 


NAG13 


157 


45 


556 


Y13247 


Homo sapiens 


FBI 9 protein 


106 


42 


557 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


100 


54 


558 


GO 1984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


176 


68 


559 


S80119 


Rattus sp. 


reverse transcriptase homolog 


113 


43 


560 


AY008270 


Homo sapiens 


cholestery I ester transfer protein 


107 


95 


561 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


140 


63 


562 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


347 


68 


563 


AF025467 


Caenothabditis 
elegans 


contains similarity to drosophila DN A- 
binding protein K10 (NID:g8148) 


93 


40 


564 


D38114 


Gorilla gorilla 


cytochrome c oxidase subunit 3 (COIH) 


329 


74 


565 


Y36156 


Homo sapiens 


Human secreted protein #28. 


153 


56 


566 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


131 


48 


567 


D38I12 


Homo sapiens 


cytochrome c oxidase subunit 3 


406 


94 


568 


AF130079 


Homo sapiens 


PR02852 


101 


55 


569 


AF081114 


Mus musculus 
domesticus 


ORF2 


123 


40 


570 


L22030 


Glycine max 


hydroxyproline-rich glycoprotein 


65 


45 


571 


D86853 


Catharanthus 
roseus 


extensin 


168 


39 


572 


AF104415 


Mus musculus 


gene trap locus-13 


179 


66 


573 


AF130089 


Homo sapiens 


PRO2550 


114 


56 


574 


X67863 


Mus musculus 


T2 


115 


42 


575 


S80119 


Rattus sp. 


reverse transcriptase homolog 


101 


28 


576 


S80119 


Rattus sp. 


reverse transcriptase homolog 


150 


57 


577 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


142 


74 


578 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


106 


57 


579 


G0379O 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


46 


580 


L24521 


Homo sapiens 


transformation-related protein 


110 


38 


CO 1 

581 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


537 


84 


582 


AF090895 


Homo sapiens 


PRO0117 


127 


80 


583 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


145 


70 


584 


X55681 


Lycopersicon 
esculentum 


extensin (class 1) 


112 


38 


585 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


473 


60 j 


586 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


144 


68 


587 




Araneus 
diadematus 


fibroin-3 


124 


30 


588 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


150 


75 


589 


U93567 


Homo sapiens 


putative pl50 


225 


47 



144 
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SEQ ID 
NO: 




opcCIO 


Lrcscnpnon 


Smith- 
Waterman 
Score 


% 

Jdcntit 

y 


590 


X71602 


Nicotiana 
tabacum 


extensin 


147 


33 


591 


X57527 


Homo sapiens 


alpha 1(VIII) collagen 


103 


42 


592 


G03 112 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7193. 


75 


47 


593 


R28916 


Homo sapiens 


Type III procollagen (prior art). 


116 


48 


594 


R95913 


Homo sapiens 


Neural thread protein. 


116 


37 


595 


U11880 


Petromyzon 
marinus 


cytochrome oxidase subunit I 


127 


52 


596 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4438. 


84 


62 


597 


L27428 


Homo sapiens 


reverse transcriptase 


158 


40 


598 


M55251 


Canis familiaris 


glycoprotein 80 


559 


86 


599 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


132 


100 


600 


Ml 8094 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


143 


33 


601 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


154 


54 


602 


X7348I 


Drosophila hydei 


mstl0I(2) 


107 


42 


603 


M81321 


Macaca 
fascicularis 


proline-rich protein 


114 


39 


604 


X05561 


Homo sapiens 


alpha- 1 chain precursor (AA -27 to 
917) (2953 is 2nd base in codon) 


109 


42 


605 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


82 


62 


606 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


83 


64 


607 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


107 


44 


608 


L27428 


Homo sapiens 


reverse transcriptase 


147 


43 


609 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


113 


61 


610 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


151 


82 


611 


U93568 


Homo sapiens 


putative pi 50 


144 


32 


612 


AB022223 


Arabidopsis 
thaliana 


extensin protein-like 


186 


58 


613 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


108 


49 


614 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


133 


36 


615 


X92485 


Plasmodium 
vivax 


pval 


120 


42 


616 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


175 


89 


617 


U83280 


Leishmania 
donovani 


39 kDa antigen 


111 


51 


618 


AL160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


137 


67 


619 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


120 


53 


620 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


92 


45 


621 


AF130089 


Homo sapiens 


PRO2550 


123 


34 


622 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


133 


59 


623 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


127 


45 


624 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


205 


66 



145 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Wo fprm 4 n 

Score 


% 

1 A an fit- 

y 


625 


X07882 


Homo sapiens 


Po protein 


119 


39 


626 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


70 


100 


627 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


141 


51 


628 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


141 


54 


629 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


115 


46 


630 


X63368 


Homo sapiens 


HSJlb 


151 


52 


631 


AF130089 


Homo sapiens 


PRO2550 


155 


47 


632 


X92485 


Plasmodium 
vivax 


pval 


102 


61 


633 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


102 


39 


634 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


171 


74 


635 


X92485 


Plasmodium 
vivax 


pval 


95 


73 


636 


S80119 


Rattussp. 


reverse transcriptase homolog 


114 


58 


637 


U15647 


Mus musculus 


reverse transcriptase 


170 


42 


638 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


130 


76 


639 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718, 


83 


36 


640 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


96 


34 


641 


X61296 


Rattus 
norvegicus 


open reading frame 2 


166 


35 


642 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


127 


35 


643 


U23172 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24_HUMAN, P23246) and D. 
melanogaster No-on-transient A protein 
(PIR:JH0162) 


115 


35 


644 


AF081111 


Mus musculus 
domes ticus 


ORF2 


168 


33 


645 


AK027208 


Homo sapiens 


unnamed protein product 


90 


51 


646 


AF016099 


Mus musculus 


endonuclease/reverse transcriptase 


101 


59 


647 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


149 


76 


648 


AF273441 


Pongo pygmaeus 


NADH dehydrogenase subunit 3 


121 


58 


649 


L27428 


Homo sapiens 


reverse transcriptase 


173 


58 


650 


AF119851 


Homo sapiens 


PR01722 


176 


53 


651 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 


168 


68 


652 


AF130089 


Homo sapiens 


PRO2550 


130 


36 


653 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-1. 


126 


41 


654 


AK000385 


Homo sapiens 


unnamed protein product 


195 


63 


655 


AB041881 


Rattus 
norvegicus 


cytoplasmic dynein heavy chain 


158 


100 


656 


X61047 


Hydra sp. 


mini-collagen 


60 


38 


657 


M22332 


Homo sapiens 


unknown protein 


100 


50 


658 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


143 


62 


659 


AF1 94537 


Homo sapiens 


NAG13 


95 


48 


660 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


120 


50 


661 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


86 


32 


662 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


169 


53 


"663 


W48351 


Homo sapiens 


Human breast cancer related protein 


120 


72 



146 
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SEQ n> 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Identit 










Score 


v 
7 








BCRB2. 






664 


U15647 


Mus xnusculus 


reverse transcriptase 


148 


50 


665 


R95913 


Homo sapiens 


Neural thread protein. 


161 


59 


666 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


80 








7001. 






667 


G03267 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


117 


44 








7348. 






668 


AB018705 


Mus musculus 


ORF2 


115 


32 


669 


D381I2 


Homo sapiens 


NADH dehydrogenase subunit 4 


280 


75 


670 


X53581 


Rattus 


ORF4 


71 


39 






norvegicus 








671 


U49973 


Homo saDiens 


ORF1* MER37* outative transnosase 


201 


66 








similar to pogo element 






672 


R95913 


Homo saDiens 


Neural thread Drotein. 


144 


78 


673 


U49973 


Homo saDiens 


ORF1* MER37* outative transnosase 


164 


46 








similar to pogo element 






674 


AF1 18082 


Homo saDiens 


PRO1902 


137 


49 


675 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


43 








7214. 






676 


D00570 


Mus musculus 


ODen readinc frame ^25 1 AA} 


112 


72 


677 


AF194537 


Homo sapiens 


NAG13 


238 


56 


678 


M13100 


Rattus 


unknown protein 


146 


51 






norvegicus 








679 


U15647 


Mus musculus 


reverse transcriptase 


123 


54 


680 


R95913 


Homo sapiens 


Neural thread protein. 


145 


55 


681 


R59842 


Homo sapiens 


ApoE4Ll protease. 


107 


63 


682 


AF130089 


Homo sapiens 


PRO2550 


94 


51 


683 


AF010144 


Homo saDiens 


neuronal thread Drotein AD7c~NTP 


129 


69 


684 


AF093748 


Homo saoiens 


KH tvoe SDlicine reeulatorv Drotein: 


93 


50 








KSRP 






685 


U93569 


Homo saniens 


DUtative d 1 50 


133 


58 


686 


G03095 


Homo sanien^ 


Human secreted Drotein SEO ID NO* 


117 


64 








7176. 






687 


Y36243 


Homo sanien*? 


Human secreted Drotein encoded hv 

AJLUUIUU JwUlvVvu UiVlvlU VUwUUwU J 


69 


73 








gene 20. 






688 


AFl 16712 


Homo sapiens 


PRQ2738 


133 


56 


689 


G02872 


Homo sanien^ 

X IvHJV *J*1 tllVilQ 


Human secrpted nmtein SEO ID NO* 


134 


53 








6953. 






690 


U93563 


Homo sapiens 


putative pi 50 


132 


49 


691 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


349 


70 








similar to pogo element 






692 


AF090895 


Homo sapiens 


PRO0117 


115 


63 


693 


AF130089 


Homo sapiens 


PRO2550 


132 


80 


694 


S80119 


Rattus sp. 


reverse transcriptase homolog 


101 


43 


695 


U15647 


Mus musculus 


reverse transcriptase 


120 


64 


696 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


59 








7870. 






697 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


128 


72 


698 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


182 


47 


699 


D38112 


Homo saniens 


cytochrome c oxidase subunit 1 


459 


83 


700 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


73 








4484. 






701 


AF003535 


Homo sapiens 


ORF2-like protein 


125 


49 


702 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 


89 


41 






elegans 


binding protein K10 (NID:g8148) 






703 


L27428 


Homo sapiens 


reverse transcriptase 


255 


50 


704 


AF130089 


Homo sapiens 


PRO2550 


87 


55 


705 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


52 



147 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








7133. 






706 


Y79140 


Homo sapiens 


Human haemopoietic stem cell 
regulatory protein SCM3. 


211 


88 


707 


U15647 


Mus musculus 


reverse transcriptase 


94 


47 


708 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


153 


66 


709 


G03807 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7888. 


107 


50 


710 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


163 


64 


711 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


146 


65 


712 


U93565 


Homo sapiens 


putative pi 50 


108 


33 


713 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


108 


37 


714 


M24732 


Homo sapiens 


lamin-like protein 


92 


35 


715 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


306 


79 


716 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


462 


85 


717 


AF004715 


Homo sapiens 


jerky gene product homolog 


100 


42 j 


718 


X92485 


Plasmodium 
vivax 


pval 


84 


48 


719 


AF130089 


Homo sapiens 


PRO2550 


132 


74 


720 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


125 


43 


721 


AK024455 


Homo sapiens 


FLJ00047 protein 


108 


68 


722 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


146 


46 


723 


X83413 


Human 
herpesvirus 6 


U88 


269 


41 


724 


X92485 


Plasmodium 
vivax 


pval 


117 


43 


725 


X92485 


Plasmodium 
vivax 


pval 


97 


41 


726 


G02538 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6619. 


102 


73 


727 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


118 


43 


728 


A23786 


Beta vulgaris 


chitinase 1 


91 


33 


729 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


107 


49 


730 


G03114 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7195. 


131 


59 


731 


W49717 


Homo sapiens 


Protein polymer adhesive substrate 
PPAS1-C. 


148 


29 


732 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


95 


45 


733 


X96731 


Ostertagia 
circumcincta 


articular collagen 


104 


37 


734 


AF130089 


Homo sapiens 


PRO2550 


118 


40 


735 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


166 


100 


736 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


494 


86 


737 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


188 


69 


738 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


593 


91 


739 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


102 


62 



148 



WO 01/64835 



PCT/US01/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


740 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


111 


54 


741 


A23786 


Beta vulgaris 


chitinase 1 


106 


36 


742 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


71 


72 


743 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


184 


61 


744 


AE003629 


Drosophila 
melanogaster 


CG 17108 gene product 


76 


36 


745 


U93563 


Homo sapiens 


putative pi 50 


145 


50 


746 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


68 


66 


747 


AF217973 


Homo sapiens 


unknown 


113. 


79 


74S 


AC006161 


Arabidopsis 
thaliana 


putative CENP-B/ARS-binding 
protein-like protein 


112 


31 


749 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7077. 


106 


47 


750 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla (AF078683) 


115 


54 


751 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


99 


72 


752 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


85 


48 


753 


X92485 


Plasmodium 
vivax 


Dval 


89 


73 


754 


U93563 


Homo sapiens 


putative pi 50 


186 


68 


755 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla (AF078683) 


116 


50 


756 


AF 194537 


Homo sapiens 


NAG13 


138 


40 


757 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


112 


79 j 


758 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


119 


61 


759 


AF130079 


Homo sapiens 


PR02852 


138 


40 


760 


X92485 


Plasmodium 
vivax 


pval 


88 


77 


761 


U93050 


Mus musculus 


pohy(A) binding protein II 


95 


35 


762 


V01555 


Human 
herpesvirus 4 


BYRFI, encodes EBNA-2 (Dambaugh 
et al, 1984; Dillner et al, 1984) 


80 


41 


763 


V00662 


Homo sapiens 


ATPase 6 


337 


83 


764 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


117 


41 


765 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


423 


81 


766 


R95913 


Homo sapiens 


NeuraJ thread protein. 


114 


66 


767 j 


V00662 


Homo sapiens 


cytochrome oxidase I 


223 


83 


768 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


268 


83 


769 


V00662 


Homo sapiens 


cytochrome oxidase I 


357 


81 


770 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


296 


71 


771 


AF026211 


Caenorhabditis 
elegans 


Similar to cuticular collagen 


95 


39 


772 


AK024455 


Homo sapiens 


FLJ00047 protein 


108 


53 


773 


X92485 


Plasmodium 
vivax 


pval 


95 


39 


774 


AF130051 


Homo sapiens 


PRO0898 


123 


38 


775 


ABO 12223 


Canis familiaris 


ORF2 


174 


51 


776 


AB028664 


Paralichthys 
olivaceus 


cytochrome oxidase subunit-3 


268 


57 


777 


V00662 


Homo sapiens 


cytochrome oxidase I 


436 


85 


778 


X52235 


Homo sapiens 


ORFII 


125 


47 


779 


U93569 


Homo sapiens 


putative pi 50 


235 


49 



149 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description" 


Smith- 
Waterman 
Score 


< 

Identit 

y 


780 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


262 


76 


781 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


239 


70 


782 


AF090942 


Homo sapiens 


PRO0657 


146 


69 


783 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


85 


66 


784 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


209 


62 


785 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


184 


60 


786 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


158 


60 


787 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


293 


86 


788 


AF130089 


Homo sapiens 


PRO2550 


153 


64 


789 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


131 


57 


790 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


100 


51 


791 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


116 


80 

ox/ 


792 


Y36203 


Homo sapiens 


Human secreted protein #75. 


104 


77 


793 


AF265575 


Homo sapiens 


ubiquitous TPK-motif protein Y 
isoform 


117 


58 


794 


AF090930 


Homo sapiens 


PRO0478 


130 


68 


795 


AKQ24455 


Homo sapiens 


FLJ00047 protein 


114 


61 


796 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


141 


70 


797 


AF130089 


Homo sapiens 


PRO2550 


273 


85 


798 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


134 


66 


799 


AF130051 


Homo sapiens 


PRO0898 


162 


80 


800 


G03560 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7641. 


179 


70 


801 


Y14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


75 


51 


802 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


103 


34 


803 


D38484 


Hylobates 
syndactylus 


Cytochrome C oxidase subunit 1 

(coxn 


263 


70 


804 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


112 


41 


805 


X92485 


Plasmodium 
vivax 


pval 


103 ! 


38 


806 


AF194537 


Homo sapiens 


NAG 13 


285 


51 


807 


AF121360 


Drosophila 
melanogaster 


DNZDHHC/NEW1 zinc finger protein 
11 


179 


47 


808 


X92485 


Plasmodium 
vivax 


pval 


131 


46 


809 


D13951 


Nicotiana 
tabacum 


extensin precursor 


88 


41 


810 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


174 


73 


811 


U93565 


Homo sapiens 


putative pi 50 


118 


38 


812 


AF 11 8082 


Homo sapiens 


PRO1902 


88 


54 


813 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


132 


42 


814 


AL359782 


Trypanosoma 
brucei 


probable similar to ring h2 ringer 
protein rhal a. 


114 


50 


815 


Y12713 


Mus musculus 


Pro-Pol-dU TPase polyprotein 


113 


43 


816 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


148 


40 



150 



WO 01/64835 



PCT/US01/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 

waterman 

Score 


% 

Identit 

y 


817 


AF217449 


Schistosoma 
mekongi 


NADH dehydrogenase subunit 6 


102 


37 


818 


AB012223 


Canis familiaris 


ORF2 


101 


54 


819 


X71602' 


Nicotiana 
tabacum 


extensin 


162 


45 


820 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


154 


36 


821 


R95913 


Homo sapiens 


Neural thread protein. 


153 


61 


822 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


125 


61 


823 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


364 


57 


824 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


121 


48 


825 


AF194537 


Homo sapiens 


NAG13 


224 


58 


826 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


437 


72 


827 


AF051782 


Homo sapiens 


diaphanous 1 


108 


38 


828 


AF194537 


Homo sapiens 


NAG13 


92 


45 


829 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


492 


75 


830 


M64791 


Rattus 
norvegicus 


salivary proline-rich protein 


110 


46 


831 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


108 


31 


832 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


117 


34 


833 


U93564 


Homo sapiens 


putative pi 50 


84 


40 


834 


U93563 


Homo sapiens 


putative pi 50 


262 


54 


835 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


214 


80 


836 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


107 


48 


837 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


158 


76 


838 


AF194537 


Homo sapiens 


NAG 13 


153 


60 


839 


U52077 


Homo sapiens 


mariner transposase 


344 


67 


840 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
b hiding protein K10 (NID:g8148) 


104 


46 


841 


W90847 


Homo sapiens 


Human lymphocyte targeted peptide 
#15. 


130 


92 


842 


M64791 


Rattus 
norvegicus 


salivary proline-rich protein 


114 


36 


843 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


196 


43 


844 


X92485 


Plasmodium 
vivax 


pval 


102 


73 


845 


X61048 


Hydra sp. 


mini-collagen 


106 


41 


846 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


92 


48 


847 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


112 


54 


848 


G03438 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7519. 


128 


68 


849 


AF194537 


Homo sapiens 


NAG 13 


141 


46 


850 


U43360 


Peromyscus 
maniculatus j 


reverse transcriptase 


121 


45 


851 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


153 


58 


852 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


220 


60 



151 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Ideatit 

y 


853 


X92485 


Plasmodium 

IVfcJilJVUl Villi 

vivax 


nvn 1 
pvai 




HD 


854 


AF I 34305 


Homo sapiens 


Scar3 






855 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


343 


68 


856 


S80119 


Rattus sp. 


IwVClOV li «U [avl Ipicioc ilV/lilUIl/g 


1 so 


00 


857 


AF130089 


Homo saDiens 




1 19 


4A i 


858 


AK024372 


Homo sapiens 


unnamed protein product 


129 


50 


859 


G03790 


Homo Qanipn*: 

XI. Willi/ duUlWilO , 


riuindu secreiea proitJin, z>X-\£ uj inu. 
7871. 


1 (\1 
IVD 


/o 


860 


D38112 


Homn <sanif*rn: 

AJ.VJ11HJ OtlUlt/llo 


lyiuuiiroine c oxiuase buuuniT j 


*7TO 


Q{\ 


861 


G03133 




fiuman secreiea proiem, onv^ W) jnu. 

7914 


O/C 


44 


862 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
ounudi iu pogo element 


83 


40 


863 


AF210651 


Homo sapiens 


NAG18 


122 


63 


864 


AF0 16099 


IVlUo ILlUoCU 1US 


endonuc lease/reverse transcriptase 


109 


51 


865 


X55685 


Lycopersicon 

Coc. UlCIl lUill 


extensin (class I) 


115 


34 


866 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

44/o. 


107 


80 


867 


X92485 


Plasmodium 
vivax 


pval 


105 


41 


868 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AA 19. 


127 


62 


869 


X92485 


Plasmodium 

VlVtlA. 


pval 


105 


38 


870 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AA7SL 


90 


56 


871 


Y27571 

X 4ml J t 1 


tTrtron cflntAnc 
XlUIUU oapiCILb 


Human secreted protein encoded by 

(tana "KJ r\ *\ 

gene xno. j. 


too 
125 


72 


872 


G03133 


T-Tnmn Qftr»if*nc 
xxuiiiu oajjidio 


nuuQan secreiea proiem, ocy uj vi\j. 
1*714. 

14. It. 


lz3 


36 


873 


AF130089 


Homo sanien^ 


x ivv^JJu 


1£A 


oZ 


874 


AF1 18082 


Homo sapiens 


PRO1902 


143 


65 


875 


U93564 


A LKJillKJ aa^Ji&iio 


pUuuIVC piJU 


low 


A A 

44 


876 


M10546 


Homo sapiens 


cytochrome oxidase I 


248 


75 


877 


U83303 


xiumw aapLCLio 


line l reverse u anscnpiase 


1Z/ 


CO 
JZ 


878 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7ft71 
ro /l. 


106 


35 


879 


U11288 


Drosophila- 

mAlanAOQctoi* 

Uic laQOgaSicT 


diaphanous protein 


93 


46 ! 


880 


AL359782 


1 X J UOXlvJjiJLllCt 

bmcei 


pruoauie oUQiiar io ring'-iiz linger i 

jL>l vlwiil J -Ltd l do 


/l 


/IT 

47 


881 


B08942 


Homo sapiens 


Human secreted protein sequence 
encoded hv i»pne 1 R SPO TO NO*00 

wlIAAJVlwU Uj gGlXw XO OXLV^ XX-/ rNW.77\ 


95 


40 


882 


AF130089 


Homo sapiens 


PRO2550 


137 


44 


883 


AF090942 


Homo sanieir? 


PRO0657 


149 
itz 




884 


G03787 


Homo <;anipr)c 


"Human Cf»rrf»tAr? nrntpin CPH iTi "NTfV 

IX 111 lieu l oCCICLCLl L71ULCUX, OJ_.V^f 1 1 J 1^(U. 

7868 






885 


X61296 


Rattus 
norvefficus 


open reading frame 2 


106 


43 


886 


Y19767 




SFO TD "WO frnm WOQ9999zn 

OXJV^ lis iSKJ tOJ li villi W v77mZZHj . 


Q1 

71 


J / 


887 


X14963 


Homo sapiens 


collagen-like protein (447 AA) 


130 


51 


888 


G0245S 


numu oapicns 


numan secreiea proiein, otix^ uj inu. 
6536. 


1 1*7 

1 17 


55 


889 


L25616 


Homo sapiens 


CGI protein 


150 


62 


890 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


171 


66 


891 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 


107 


40 



152 



SEQ ID 


Accession No. 


Species 


Description 


SmHh- 
Waterman 

(•nr. 

ocore 


% 

Identit 

y 








fragment, SEQ ID NO:387. 






892 


X52318 


Bos taurus 


histone H2A Z < AA 1-1271 


356 


*70 


893 


V00662 


Homo sapiens 


URF 1 fNADH Hehvdro^enase subunifi 


X \JC 


A£L 
40 


894 


D38112 


Homo sapiens 


cvtocliromp r nYirlase ^iibunit 1 


173 

X r J 


/ J 


895 


X92485 


Plasmodium 
vivax 


pval 


109 


4 / 1 


896 


L76159 


Homo sapiens 


FRG1 ffene nroduct 


I \J\J 


JJ 


897 


D50926 


Homo sapiens 


The KJAA0136 eene Droduct is novel 


280 


8Q 


898 


X04011 


Homo sapiens 


orecursor nnJvrv*ntide 


1 14 

i x ^ 




899 


M90656 


Homo sapiens 


gamma-ghitamylcysteine synthetase 


101 


90 


900 


R95913 


Homo sapiens 


Neural tliTPaH r>rntf*5n 




7^ 


901 


L27428 


Homo sapiens 


rPVPftlP fTJVnC/ N l"irvtac^ 




H / 


902 


Y12713 


Mus musculus 


Pro-Pol-HI TTPa^f* nn Ivnmtfin 

A lv 1 vl U\J IX OOw yJKJXj IJX UltsXXl 






903 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493 


105 


76 


904 


AF010144 


Homo sapiens 


neuron si fhrpaH nrntpin A P)7r'-"KIT r P 


1ZO 


J / 


905 


AF130089 


Homo sapiens 


PRO2550 


88 


82 


906 


U49973 




v-nvr i , iviiiivj / , puiauve umisposase 
similar fn nnoA plempnt 


Tin 


en 

5U 


907 


L27428 


Homo sapiens 


rpvpr^p tran<:n-fnf5icp 

l^YddG (XOXlJUipuoC 


1 Id. 


Oh 


908 


G00377 


Homo saniens 


Humfln cprrrptprf nmtetn QpO TT> "MO* 

4458. 




/l 


909 


U93570 


Homo <ar>ien i ? 


mttsi+iw r»l ^(1 
jpuiciuvv 


i 


4i 


910 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


91 


63 


911 


L27428 


Homo ^anif*n<s 


UPVPrQP f-T-^n crn"ntQcr» 
ICVCIiC UiiiU>tnpifU>g 


1 05 


4o 


912 


X92485 


Plasmodium 
vivax 


pval 


91 


60 


913 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7R70 


103 


64 


914 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870 

/ O /V. 


96 


64 


915 


Y64890 


Homo sanien^ 


Human ^* P^T rplntprl nn1vrtf»ntir1f» QPO 
AlLUxidil J GO 1 ICxalCU pOiy pep tlQc oCy 


171 


ei 
Dl 


916 


G02501 


Homo sapiens 


Human SftreteH nrotein <5FO TT> MO- 

6582. 


87 


I / 


917 


M12099 


Mus musculus 


rirnlinp-rir*fi rwr»tf*in 

J/l 1 1.1/11 JJl V/bClU 


170 


44 


918 


Ml 5530 


Homo sapiens 


B-cell growth factor 


88 


51 


919 


AF1 30079 


Homo sanien<t 


PT*07R^7 


i 

1 Do 


DO 

oo 


920 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874 


103 


77 


921 


X53581 


Rattus 
norve*ricti<5 


ORF4 


124 


32 


922 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148 


no 


50 


923 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


99 


36 


924 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


80 


47 


925 


AC006127 


Homo sapiens 


BRG-1 -HUMAN ; nuclear protein 
GRB 1 ; homeotic gene regulator, SNF2- 
X3I11 A, lVll IKJ i l\„ UKU W Irl AINU 
TRANSCRIPTION ACTIVATOR; 

JTV/oOJLDLiXi VjLUD/iL 

TRANSCRIPTION ACTIVATOR 
SNF2L4 


442 


92 


926 


L17318 


Rattus 
norvegicus 


proline-rich proteoglycan 


126 


39 


927 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


321 


76 



153 



WO 01/64835 PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 






... 




Score 


y 


yzo 


AUUUoJol 


Arabidopsis 


putative CENP-B/ARS-binding 


97 


35 






thaliana 


protein-like protein 






92y 


AU2o /5 


Daucus carota 


put precursor 


1 12 


38 


95U 


U9556.? 


Homo sapiens 


putative pi 50 


125 


70 


931 


AJ3U 12225 


Can is familiaris 


ORF2 


202 


50 


932 


AF053538 


Alvinella 


fibrillar collagen chain FApl alpha 


114 


37 






pompejana 








933 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


57 








7871. 






934 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 


283 


54 








similar to pogo element 






935 


U41017 


Caenorhabditis 


repetitive region; weakly similar to E. 


107 


33 






elegans 


gracilis major membrane skeletal 












protein (PIRA434 17) 






936 


U47855 


Araneus 


fibroin-3 


109 


33 






diadematus 








937 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


43 








7214. 






938 


W48351 


Homo sapiens 


Human breast cancer related protein 


108 


50 








BCRB2. 






939 


M13101 


Rattus 


unknown protein 


121 


40 






norvegicus 








94U 


G00689 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


117 


46 








4770. 






941 


A T/\ 1 A1 A A 

Ar 010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


88 


62 


942 


L27428 


Homo sapiens 


reverse transcriptase 


86 


45 


943 


U93564 


Homo sapiens 


putative pi 50 


279 


40 


944 


m 9+4+ A AAA 

Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 


110 


48 








ID NO: 1051. 






945 


L27428 


Homo sapiens 


reverse transcriptase 


238 


66 


946 


AF 194537 


Homo sapiens 


NAG13 


146 


47 


947 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


55 








6619. 






948 


AC006161 


Arabidopsis 


putative CENP-B/ARS-binding 


113 


37 






thaliana 


protein-like protein 






949 


AF194537 


Homo sapiens 


NAG 13 


106 


66 


950 


Y 14437 


Homo sapiens 


Human secreted protein encoded by 


74 


75 








gene 27 clone HSA WA27. 






951 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


78 


66 








8148. 






952 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


85 


72 


953 


T Til 401 O 

U44838 


Glycine max 


extensm 


145 


39 


954 


R13556 


Homo sapiens 


Protem encoded downstream of hhc_M 


131 


39 








oncoprotein. 






955 


/""•A*} OA 1 

003801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


53 








7882. 






950 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


74 








4435. 








A T71 1AAQA 

Ar 150089 


Homo sapiens 


PRO2550 


1 OA 

120 


37 


95o 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


71 








7124. 






509 


A rAAAAilO 

Ar 090942 


Homo sapiens 


PRO0657 


83 


63 


960 


L27428 


Homo sapiens 


reverse transcriptase 


121 


30 


961 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


68 








/ /yj. 






962 


AF090942 


Homo sapiens 


PRO0657 


138 


61 


963 


U83280 


Leishmania 


39 kDa antigen 


101 


53 






donovani 






964 


U93566 


Homo sapiens 


p40 


102 


31 



154 



WO 01/64835 



PCT/USO 1/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


965 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


63 








7867. 






966 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


40 








4770. 






967 


AF090930 


Homo sapiens 


PRO0478 


158 


80 


968 


AB012223 


Canis familiaris 


ORF2 


94 


36 


969 


AF0 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


127 


54 


970 


X53581 


Rattus 


ORF4 


163 








norvegicus 








971 


G03453 


Homo sapiens 


Human secreted orotein SEO ID NO* 


109 


56 








7534. 






972 


G03114 


Homo sapiens 


Human secreted orotein SEO ID NO* 


82 


47 








7195. 






973 


Y69166 


Homo sapiens 


A mature human N-acetvlslvcosaminvl 


95 


80 








transferase protein. 






974 


G03095 


Homo sapiens 


Human secreted orotein- SEO ID NO* 


74 


51 








7176. 






975 


U93574 


Homo sapiens 


putative ol50 


140 


43 


976 


W48351 


Homo ^aniens 


Human Wrryi^t cancer related rwntein 


79 










BCRB2 






977 


Y08062 


Homo saoiens 


Human PR0245 orotein fragment 


116 


55 








derived from DNA35638. 






978 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


63 








7492. 






979 


AF130114 


Homo sapiens 


PR02459 


121 


61 


980 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


99 


40 


981 


AF 116909 


Homo sapiens 


unknown 


115 


42 


982 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 " 


75 








7795. 






983 


Y14674 


Plasmodium 


glutam ate— cysteine ligase 


106 


66 






falciparum 








984 


AF229067 


Homo sapiens 


PADI-H protein 


152 


60 


985 


AF 119900 


Homo sapiens 


PR02822 


142 


48 


986 


G03714 


Homo saniens 

AAV/HIV l/UUiVUO 


Human secreted nrotein SEO TD NO* 


1 19 


70 








7795. 






987 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 


106 


60 








oncoprotein. 






988 


G03119 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


51 








7200. 






990 


U35730 


Mus musculus 


jerky 


133 


29 


991 


AF113685 


Homo sapiens 


PRO0974 


136 


63 


992 


U52077 


Homo sapiens 


mariner transposase 


497 


77 


993 


Z97211 


Schizosaccharom 


kinesin-like protein 


197 


47 






yces pombe 








994 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


124 


68 


995 


U93563 


Homo sapiens 


putative pi 50 


157 


50 


996 


AB015802 


Acetobacter 


similar to cellulose complementing 


140 


54 






xylinus 


protein of A. xylinum ATCC23869 






997 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


60 








6444. 






998 


D38116 


Pan paniscus 


cytochrome c oxidase subunit 1 


352 


75 


999 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 


524 


78 








1 (NDF-1). 






1000 


V00662 


Homo saoiens 


IJRF 1 fNADH dehvdroerenase subunifi 


377 


70 


1001 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


50 








8153. 






1002 


AL390U4 


Leishmania 


extremely cysteine/valine rich protein 


249 


61 






major 








1003 


M14702 


Murine leukemia 


pol polyprotein 


206 


48 



155 



WO 01/64835 



PCT/TJS01/04927 



SEQ1D 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 






virus 








1004 


Z21507 


Homo ^aniens 


human plAnoatiAn fa rtAr-1 -Helta 
iiuiijoii wuugauv/n lauiui i viwta 


511 


O J 


1005 


L27428 


Homo sapiens 


reverse transrrintpj^ie 


176 




1006 


D38112 


Homo sapiens 


NADH dphvdmppna^e suhunit S 


332 


77 


1007 


AF090895 


in/iiiv/ aapieio 


PRflfll 17 
rivuu 11/ 




oo 


1008 


G03102 


Homo ^anien^ 


Human tzprrptpfi nrAtpin SPO TCm NO* 
nuuwu ocuicicu i/iv/LtAii, oi_iV^ it./ iiv,/. 


140 


6S 








7183. 






1009 


U44838 


Glvcine max 


pvtencin 


166 




1010 


AF251290 

i»*l <bJiii7V/ 


Pla^mnrlinm 


<rlnr«mip nrirtLrion nrAtpfn 
giuuuiiii* avviU'lluu j/iv/icm 


1 14 








falcinamm 








1011 


L27428 


Homo ^aniens 


rpvpr*sp trflTi<ir*riAtJ4QP 

IvYvldv U CUlOvi 41/IOjO 


114 

l It 




1012 


AF130089 


Homo <;anipn<i 




1 14 


77 


1013 

1 \J kJ 


G04097 

VJ V^TV/7 


Hatyi a cnntpnc 

nuinu sapiens 


Human e*wrt*ff*/4 nrAtAtn Q C (~\ ill ~\T(~\ m 

riuinan secreieQ proiem, ocy iu in w. 


oi 










Oil J. 






1014 




Hatha caniAnc 
numu oajptciLo 


P"RO01 17 


07 


Oj 


1015 




tfrini n caniAnc 
riuiiiu bapicllo 


PP07S*\rt 

nvv/zj ji/ 




R1 


1016 


AF07Q/567 


A^f*-o apt! r*^tni o 


oiifAAnf^m a /** i **4 ocft citnitt*iif III 

cyiocnrome c oxjoase suDunii 111 


77A 








aUi aiUb 








1017 


AFO^^QRS 

/■\..T uJJ/OJ 


fVn r*ti A/*<»Pf*a 


pyrroiicione-ncn antigen 




/La 






vol villus 








1018 


Y36166 


1-1 Am a conipnc 
ITLv/ino SapicUb 


T7rQmnot%(' nf human CAAfAta/l rM*r\i~Atn 

riagnicDi oi liUiiidn becreiea protein 


1 14 


*f0 








cn^ocicu. t/jr gene j. 






1019 

11/1.7 




nunio Sapiens 


PRO0663 


1 17 




1020 


Y2 1 1 66 

L Ctl k 1/1/ 


Hatvia csnipnc 
ClUII 11/ bapicllo 


Hllmltl V**->17 nrAtn-An/*nAoriB tniiHnt 

jntuiuaii vcll proio-oncogene rnuiuni 




7A 
JO 








proiein iragiDcflt ih. 






1071 




nomo sapiens 


neuronal uireaa protein au /c-in i r 


1 A(\ 
141/ 


77 




AT AAQAOQ 


Arabidopsis 


extensin-like protein 


1 AC 


34 






+ r*k nit OHO 

III all all a 








Vfi77aQ 


jiomo Sapiens 


UiimATn naAT > AT > A/*i «%«*^\.^A*fn an aa/"1a/*2 l^** # 

nuinan secreiea proiem encoaea oy 












gene mu cione riiNriijyo. 






1074 


VJUJ / l*f 


nomo Sapiens 


Mumaji secreiea proiem, ocy iu inu. 




DO 








770 •> 






107S 


M1714n 


iiouio Sapiens 


envelope protein 


1 


^7 
OZ 


1076 


T I4QQ71 


nuino sapiens 


L/Kr l , JMiixO /, p uiative uHnsposase 




<c 

JO 








sinniai to pugu eieineni 






1027 


T T4QQ7'"; 


n oiiiu sapiens 


wrvr i , j\irix\_) / , puiduve uansposase 


zoo 


A7 
0/ 








sunuax to pogo element 






1028 


r\.r v*+ 1 jju 


DUUU ekillallo 


in/a_l/it ucnyurugenase suuunit j 


07 


11 


1079 




nomo sapiens 


riuinan secreiea protein, ocy iu iNw. 




JO 














lUJv 


VjVZ*toZ 


— : 

Homo sapiens 


xiuman secretea protein, ocaj iu INU. 


/o 


Oi 














10^1 


VJUvOj / 


rtomo sapiens 


Unman o r>r*t-&tor\ nrntpin CHT^ TT^l KTf^i- 

numan secreiea proiem, oJza^ iu inw. 


7A 


JO 








471 R 






1032 


X90*S68 


ITUJllU dapiCIlo 


I I vj LC Jit SCt|UvIlue allU alUlUUlUi/Il 


JO/ 


inn 








aVallal/lC iUUll Via o W Ioi*~rI Ul, avallaUie 












at nrp<:pnf *Wa p-mail frnm 
at uiGowui vta v> man 1.1 v/ in 












LABEIT/2)£MBL-Heidelberf? DE 






1033 


G02654 


Homo sanien^ 


Human secreted nrotein SEO ID NO* 


96 


40 








67TS 

V/ / JJ. 






1034 


AF010144 


Homo Q3nif*n*5 


neuronal thread nrotein AD7e-i\ITP 


^16 


60 

uv/ 


1035 


AF 194537 


Homn <sanipn<! 
iiv/iiii/ □ai/iv^uo 


NAG 13 


708 


s? 

Jx. 


1036 


L27428 


Hattia canipnc 
i iv/ tin,/ saj/iciio 


rpvprcp tranQmnta^e 


166 


CI 
J 1 


1037 




Hattia eaniptiQ 
llv/lilU tHUJieilo 


Human hrpaQt ranrpr rplatpH ArAtpin 
in ui ii an i/ii/Ooi "woiivw iv/iaLcu jjiuicin 


107 

lvi / 


/I/I 








BCRB2. 






1038 


AF116638 


Homo sapiens 


PR01546 


56 


61 


1039 


U93570 


Homo sapiens 


putative pi 50 


138 


40 


1040 


AF130089 


Homo sapiens 


PRO2550 


150 


91 


1041 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 


68 



156 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smitb- 


% 


NO: 








Waterman 


Identit 










Score 


y 


1042 




Hnmn oflmpn<; 

IiUli.lv/ oOpiGUS 


Uliman oevr»r*a4-^/^ nmtam ttJ 0 

riurnan secreieu proiem ttzo* 


07 
y l 


A 1 
41 


I V/*+J 


U7JJOO 


Urvm f\ com otic 

xiorno sapiens 


puiauve pi jU 


IZ4 


34 


1044 




T-Tr\rM s\ conipnc 
XiUJLllv sopidlo 


wivr i, iviiixvD /, puiative iransposase 




/I 








Qimilyir tn nnoA plpmpnf 

OLLUllOI LU IJU^U ClClliCIIL 






1045 


U93563 


X WjlXlXJ oCipi.V'llO 


r»n+?»ii vp r\1 ^A 
pUtaUVC pi JU 


746 


J4 


1046 


G00416 


Homo <anipn^ 


XJ.UlU.Oll oCv-i CICLl piuiciii, OJUv^ 1L/ lx\J. 




6A 

OH 








4497 






1047 


U93563 


Homo oanipnc 


puiauvv piju 


177 


ou 




AF11A1 14 
AT I Jul 14 


riomo sapiens 




1 1 H 


Of 


1049 


IT19Q1Q 


lVllib liiUbLUlUS 


ducnyiy i cyclase type v 11 


1 7A 
1 /U 


/TO 

Oo 




A r'AAQACy! 


Leishmania 


T 1 

Lo4ji. 1 


1 AA 

129 


OA 

30 






major 








luJ 1 


YQQA67 

j\yynof 


ivieaicago 




1 1A 
1 10 


io 






tn m r*it fi 1 1 si 








1052 


G017A6 


noiuu sapiens 


numan secreiea proiem, oev^ il* wkj. 


1 JU 


7A 

/o 














1053 


AF1 1671? 

AT HO/ 1<6 


XaOIIIU SdpicIlS 




1 AO 


AA 
44 


in*A 




Homo sapiens 


rapamycin binding protein 


168 


56 


IVJJ 


TT1 *\fJLl 
UJ J04/ 


jvjus mus cuius 


reverse transcriptase 


QC 
00 


OT 

37 


in<6 

1UJO 


AT AO/iAQC 


Homo sapiens 


cU4 ] 7M 14.z (novel serine/tnreonine- 


1 AA 

190 


72 








protein kinase (ortholog of mouse and 












rat MAK (male germ cell-associated 












Inn qc n\i 
AJLllaoC J J 






AUJ / 


A FAOAQAO 


— : 

Homo sapiens 


JrKUUO j / 


103 


^o 

63 


1UJO 


A FA 1 A 1 


xiomo sapiens 


neuronal uireaa proiem aij/c-in if 


15o 


Oo 




/vrUol 1 14 


Mus mus cuius 


UKrZ 


1j4 


47 






domesticus 








in 

1UOU 




Homo sapiens 


Human secreted protem encoded by 


1 OA 

180 


83 








gene 44 cione xi 1 uj\uzz. 






1UU 1 




* eruinyscus 


reverse uranscripiase 


IZO 


43 






XllOUll* LLldlUo 








1062 




ixuiiio sapiens 


Human cPw*PAtAH nrntotn QPfl TF\ XTO* 

xi uiii oil secreieu pro Lcin 7 ocy lu iS\j. 


54 










/Of 1. 






1063 


III 5647 


iviUS UIUSCUIUS 


reverse ixanscnpcase 


JO 


OC 
JO 


1064 


T TCn *67 

U7JJO f 


riomo sapiens 


puiauve pi ju 


100 

Izo 


CO 


1065 


Y1AA7Q 

Z 1HH /y 


xiuziio sapiens 


rragmenr or numan secretec proiem 


1 *2A 


OO 








encuueu oy gene 14. 






1066 


X92485 


cm Art 1 ii im 

A ld^lllULllUill 


• pval 


1 1 0 
1 12* 


oz 






vivax 








1067 




Wattia canipnc 

XxlrlUU SOpiGllo 


p*tu 


161 
lO 1 


45 


1068 


D38112 


CTvuiU sapiens 


cyujwiruuic c o/uuase suounit d 


J4U 


ftA 
54 


1069 


\J7JJ / V 


riuiiiu sapiens 


puEanve p i du 


1 A*7 


CO 
D9 


1070 




f* K a 1 i n aI nKn c 


cyiociiromc c oxidase suDunii 111 




^71 
fi 






IUUC1 t>UialuS 








1071 


U93567 


T-TAmA cnni anc 
julviuuvj oapiciio 




oo 


Oft 


1072 


G04091 


Unmn canipno 


rxuiilail SCCiclCU piuiciii, 3X3.v^ ILy INV_/. 


11)5 

lUJ 


OO 








817? 






1073 


Y86248 




TrTumsin cprrptpr) nrAtpin HPT-TppAR 
nuaju.au 5CU1CLCU pi utciii nLnrruft, 


1^6 

IjO 










CCD Tn>Jf"V163 






1074 


U93572 


Rnmn c;iniVnc 
nuuiu bapjClio 


puiauve pi JU 


1 AA 
14U 


JJ 


XV / J 


VJUZOJZ 


xiomo sapiens 


riuman secreieu proiem, on\i uj nu. 


OO 


43 








6Q11 






1076 


AT AA06HR 


/VTEOluOpSlS 


extensin-like protein 


10 j 


yj 






maiiana 








1077 


Y86248 


Homo saoien^ 


Human secreted orotein HCHPF68 


94 


66 

OQ 








SEQIDNO:163. 






1078 


G02493 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


81 


57 








6574. 






1079 


G04091 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


83 


35 



157 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








8172. 






1080 


AF1 62149 


Mycoplasma 
bovis 


variable surface lipoprotein 


103 


41 


1081 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


148 


75 


1082 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


121 


42 


1083 


U93564 


Homo sapiens 


p40 


97 


42 


1084 


AF229067 


Homo sapiens 


PADI-H protein 


145 


61 


1085 


U49973 


Homo sapiens 


ORF1; MER37; putative trahsposase 
similar to pogo element 


221 


60 


1086 


U88573 


Homo sapiens 


NBR2 


165 


67 


1087 


Y36156 


Homo sapiens 


Human secreted protein #28. 


93 


72 


1088 


AF194537 


Homo sapiens 


NAG13 


142 


62 


1089 


B08976 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID NO: 133. 


93 


62 


1090 


AF194537 


Homo sapiens 


NAG13 


155 


40 


1091 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


103 


35 


1092 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


195 


40 


1093 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


91 


46 


1094 


X53581 


Rattus 
norvegicus 


ORF4 


106 


62 


1095 


AL 160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


129 


51 


1096 


Y86473 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:388. 


72 . 


33 


1097 


U40342 


Mus musculus 


mnein 


152 


44 


1098 


M24732 


Homo sapiens 


lamin-like protein 


92 


37 


1099 


X92485 


Plasmodium 
vivax 


pval 


111 


67 


1100 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


156 


86 


1101 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


116 


90 


1102 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


158 


71 


1103 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


97 


36 


1104 


U93572 


Homo sapiens 


putative pi 50 


168 


56 


1105 


U93570 


Homo sapiens 


putative pi 50 


96 


40 


1106 


L27428 


Homo sapiens 


reverse transcriptase 


188 


43 


1107 


X53581 


Rattus 
norvegicus 


ORF4 


141 


43 


1108 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


344 


77 


1109 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


126 


68 


1110 


U93569 


Homo sapiens 


putative pl50 


156 


38 


mi 


AF 118086 


Homo sapiens 


PR01992 


135 


54 


1112 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWI 8. 


71 


63 


1113 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


124 


62 


1114 


L27428 


Homo sapiens 


reverse transcriptase 


200 


40 


1115 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7077. 


101 


50 


1116 


L27428 


Homo sapiens 


reverse transcriptase 


122 


70 


1117 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


65 



158 



WO 01/64835 PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 














1118 


W48351 


Homo san tens 


Human Krc^Qct ponrAr rplafpH rvrotpln 

JLAULIJCUl UlCdol UllltCl IGlaiVMJL JJiULCLLl 

BCRB2. 


1 lU 


03 


1119 


D38484 


Hvlobates 
syndactylus 


r^vtfiphmmR C* iyyiHsiqp cnVinnir 1 

(COXI) 




07 


1120 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


115 


76 


1121 


G03786 


Homo sanien^ 


Human cprrpfoH nrntpin **RO 1T> MO* 

7867. 




OO j 


1122 


AFO 13990 


Homo saoien^ 


nniniiitin tormina] hvHrnlacp 

VIUIVJUILUI ICllJJUlal JU.J'UlVJltloC 


1 UJ 


SO 


1123 


G03786 


Homo saniens 


Human cprrplwl nrntpin SPO TO WO- 

7867. 


94 


«/-! 


1124 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571 


154 


65 


1125 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


106 


53 


1126 


AF 130089 


Homo <s*mipn<; 


XT IVUXJ *J V 


if* 

JO 


79 


1127 


G02832 


Homo ^smipn^ 


Human cprrpfpH nrr»tptti QPO TR MO* 

6913. 


09 




1128 


■ Y14479 


Homo saniens 


Frapmenf of human ^ecrptpd orofpin 
encoded bv £ene 14 


80 

WW 


SQ 


1129 


AF1 19855 


Homo sapiens 


PR01847 


146 


70 


1130 


AF1 94537 


Homo saniens 


NAG 13 


1 ox 




1131 


L27428 


Homo ^anien^ 


revprse trtw\ srri nta qp 




3R 
Jo 


1132 


U93570 


Homo sapiens 


nutativp n150 




3d 


1133 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


157 


68 


1134 


AJ004810 


Zea mav» 


cvtochrome P450 mnnnnyvaprifl^ 


79 


o / 


1135 


AFO 10 144 


Homo ^anien^ 


neuronal thread nrotpin ADTr-MTP 


i JO 


/u 


1136 


GO1502 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5583. 


81 


73 


1137 


ABO 18705 


Mus musculus 


ORF2 


138 

X JO 


36 


1138 


L20321 


Homo saniens 

A A. UU1 V/ JUL/ AVI lfcj 


nrotein ^erine/thn*oninp Jcina^p 

VIA* 1A1 Swl UIW UJU VUUUlv Mllfljfc 




63 


1139 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2 


92 


60 


1140 


AF1 94537 


Homo <vanien«i 


NAG 13 


1 1 


33 


1141 


U93564 


Homo sapiens 


putative pi 50 


135 


51 


1142 


D86853 


fa His* ran th n c 

roseus 


pvtpncin 






1143 


D00570 


IV/f HQ mnscuhis 


oopn readinc fhmp f951 A 


913 


sn 


1144 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284 


62 


38 


1145 


Z70684 


Caenorhahriiri<5 
elegans 


F28D1 8 

JL -CO IS 1 . O 


IDS 


39 


1146 


G03133 


Homo saniens 


Human secreted nrotein. SEO ID NO* 
7214. 


79 


54 


1147 


Y36156 


Homo sapiens 


Human secreted protein #28. 


151 


62 


1148 


A23786 


Beta vulgaris 


chitinase 1 


93 


37 


1149 


AF129756 


Homo sapiens 


BAT2 


177 


52 


1150 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


147 


73 


1151 


L27428 


Homo sapiens 


reverse transcriptase 


77 


31 


1152 


U34044 


Homo sapiens 


selenium donor protein 


238 


48 


1153 


AK024455 


Homo saniens 


FLJ00047 nrotein 


78 
/ o 


78 


1154 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 

Similar to nocro plpmpnt 


197 


67 


1155 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


120 


64 


1156 


U25281 


Rattus 
norvegicus 


SH3 domain binding protein 


98 


34 


1157 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


138 


54 



159 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


&roith- 
Watermao 


% 

Ideotit 

y 


1158 


AF194537 


Homo sapiens 


NAG 13 


106 


48 


1159 


U49973 


Homo sapiens 


ORF1* niltaHvp trancno'ia'if- 
similar to pogo element 


357 


AO 


1160 


U93572 


Homo sapiens 


p40 


89 




1161 


G03203 


Homo saoiens 


Human <if-rri*tf-H nrofpin ^PO TT\ MO* 

7284. 


184 


Oo 


1162 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


123 


57 


1163 


AF116712 


Homo sapiens 


PR02738 




AQ 

oy 


1164 


V00662 


Homo sapiens 


cytochrome oxidase I 


465 


69 


1165 


G00416 


Homo saniens 


Human cprrpteH nrntpin ^T70 TTl WO* 

4497. 


HO 


0 1 


1166 


L26163 


Mus musculus 


histone Hie 


104 


37 


1167 


X70343 


Nicotian a 

sylvestris 


vAlviiulli 


yj 




1168 


AF130051 


Homo sapiens 


PRO0898 


117 


43 


1169 


G03714 


Homo *;anien*i 


Human CP/*r/»t<»rl nrr\fpin TT\ ~K1(~\- 
14U111CU1 dCUClCU pi t>LCUl, ODy LL/ IN W. 

7795. 






1170 


L27428 


Homo <*ariien<j 


TTPVPrcp fr*i>ncr , t*f nt"-?ic<* 
jcvcioc u an bdipuibe 


1/10 

my 


2*2 


1171 


G00497 


Homo **anif*n*j 


Hum An CPr>rp>t^^ nrAfpin CCA Tr> XTfV 

n union sccreiea protein, oiji^ il/ lNL/. 
4578. 




OZ 


1172 


AF030277 


rrap^e1anhii<i 

spekii 


cyiUi/uTUIUC UXIUaoC b LI Dull 11 111 


zoo 


34 


1173 


L22030 

■ lit i . V_f \J 


Ivn'np mov 

VJljrVUJG ItlOA 


h l*(W\fr\irf\ linn r"i/»h (yltmnnrAfam 

nyaroxyproiine-ncn glycoprotein 


0/ 




1174 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
diiuiiar io pugo element 


168 


45 


1175 


U43627 


ArahiHnriQic 

thaliana 


CAlClloUl 


111 

111 


42 


1176 


U43627 


thaliana 


^vtpncin 

cAiciioin 


Oft 


oo 
Zy 


1177 


U93565 


Hrtmft QAmVnc 


putative pi JU 


07 


JO 


1178 


J01047 


C*aen orh a hri i ti c 
elepan** 


PAllacfpn 
V/Uiiciiicii 


ion 
lUo 




1179 


G02532 


Homo sanienQ 


Human ^pprptprl nrntpin ^FO TO "MO* 

■1 1 1111 A ell L owwlblCU LUvflGUAj OCy JUL* IN Vw'- 

6613. 


1 /u 


Oo 


1180 


AFO 16099 


Mus musculus 


finrfnniip1pfl<!/*/rpvprcp tranc^rirrfac** 
wum/miMicojw tcvviaO u dii^ct i uidoc 


111 

I 13 


JO 


1181 


AF010144 


Homo saniens 


neuronal thread nrntein A T")7r-"MT r P 

11VU1 VJllui Ull VsClll ^Ji ULullJ fix/ / ^ IN 1 1 


lid 


03 


1182 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


107 


62 


1183 


Y86248 


Homo <?anipn<5 


Hittnfln cp***r<*-t*»H nrntAin UfHDI</SQ 
ixuiiiau oclicicli piuicui riv^xxi ruo, 

SEQ ID NO: 163. 


19/1 
lZ«f 


4!) 


1184 


D38112 


Homo *sanien<: 


IN/VL'n UClljrvlIVig*wilc2*3C OUUUillL 




OO 


1185 


U87607 


Rattus 
norvegicus 


nntative R^JA nin/lrnor nrntpin 1 

pUUlUTV 1 VI 'fix UlliLlUlg piULCUl 1 


1UO 


41 


1186 


Y86573 


Homo sapiens 


Human gene 91-encoded protein 
fragment SEO ID NO490 


381 


75 


1187 


D38112 


JJomo sapiens 


cytochrome c oxidase subunit 1 


434 


79 


1188 


U83303 


Homo sapiens 


line-1 reverse tran*j*r*rint , a ip 

AlXlw X IwTUJv U UUOvl 11/uUw 


t j 




1189 


G03789 


Homo saniens 


Human <tf*r , rptp_H nrntpin SlPO TTl "MO* 

7870. 


1 SO 


Oo 


1190 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


146 


68 


1191 


AF1 18086 


Homo sapiens 


PR01992 


146 


81 


1192 


W12842 


Homo Q3rii^n? 


Tnin/*atpH nrn^alnha 1 / II f*h?iin 
1 1 uutaicu pi tr*ctiuiia 1 f ULl. J HJ.au I. 


in/; 

I I/O 


JJ 


1193 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine-* 
and proline-rich 


67 


34 


1194 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7637. 


116 


74 


1195 


Y30681 


Homo sapiens 


Splice variant ZAP- IB protein of the 
human tumor suppressor gene ZAP-1 . 


132 


82 



160 



WO 01/64835 



PCT/US01/04927 



Seq id 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

v 


1196 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


75 


48 


1197 


U49973 


Homo sapiens 


ORPi; MER37; putative transposase 
similar to pogo element 


254 


73 


1198 


U93570 


Homo sapiens 


p40 


103 


30 


1199 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


145 


39 


1200 


AK024455 


Homo sapiens 


FU00047 protein 


115 


56 


1201 


AF090942 


Homo sapiens 


PRO0657 


88 


64 


1202 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


124 


70 


1203 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


113 


46 


1204 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


81 


41 


1205 


U35730 


Mus musculus 


jerky 


107 


27 


1206 


U15647 


Mus musculus 


reverse transcriptase 


191 


45 


1207 


U15647 


Mus musculus 


reverse transcriptase 


124 


50 


1208 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


140 


58 


1209 


AF010144 


Homo- sapiens 


neuronal thread protein AD7c-NTP 


91 


54 


1210 


AF 11 9900 


Homo sapiens 


PR02822 


160 


81 


1211 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


122 


68 


1212 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


45 


1213 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


125 


61 


1214 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


102 


61 


1215 


G03560 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7641. 


101 


46 


1216 


AJ242540 . 


Volvox carteri £ 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


270 


58 


1217 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ EDN0.387. 


105 


46 


1218 


AC002483 


Homo sapiens 


putative product from mRNA sequence 
CG003 from BRCA2 region; match to 
U50534 (NID:gl685103) 


378 


97 


1219 


AF090895 


Homo sapiens 


PRO0117 


130 


58 


1220 


API 13685 


Homo sapiens 


PRO0974 


117 


60 


1221 


X61295 \ 


Rattus 
norvegicus 


LI retroposon, a portion of its ORF2 
sequence 


126 


50 


1222 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


148 • 


70 


1223 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


99 


56 


1224 


U93574 


Homo sapiens 


putative pl50 


93 


44 


1225 


AF130051 


Homo sapiens 


PRO0898 


133 


69 


1226 


U93563 


Homo sapiens 


putative pi 50 


125 


47 


1227 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


220 


47 


1228 


U93564 


Homo sapiens 


putative pl50 


116 


47 


1229 


W21733 


Homo sapiens 


NIP-1 encoded by clone 59. 


138 


63 


1230 


U15647 


Mus musculus 


reverse transcriptase 


105 


42 


1231 


U93563 


Homo sapiens 


putative pi 50 


299 


54 


1232 


R95913 


Homo sapiens 


Neural thread protein. 


138 


51 


1233 


AF130079 


Homo sapiens 


PR02852 


203 


70 


1234 


X53581 


Rattus 
norvegicus 


ORF3 


106 


60 



161 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


INU:' 






Water man 


Identit 










Score 


y 


1235 


AFl 18086 


Homo sapiens 


PRO 1992 

* A ** A ✓ ^ 


144 


81 


1236 


X92485 


Plasmodium 


pval 


125 


71 






vivax 








1237 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


116 


75 


1238 


U93572 


Homo sapiens 


Dutattve nl50 


133 


40 


1239 


G01249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


69 


56 








5330. 






1240 


AF130089 


Homo sapiens 


PRO2550 


136 


41 


1241 


G00397 


Homo sapiens 


Human secreted nrotein SEO ID NO* 


109 


j j 








4478. 






1242 


G03714 


Homo saoiens 


Human secreted oratein SFO ID NO- 


161 


7*? 








7795. 






1243 


G02752 


Homo sapiens 


Human secreted nrotein SEO ID NO* 


87 


45 








6833 






1244 


G03787 


Homo sapiens 


Human secreted nrotein SEO ID NO* 


128 


58 








7868. 






1245 


U93570 


Homo sapiens 


nutalive nl 50 


161 


SO 


1246 


Z70684 


Caenorhabditis 


F28D1.8 


121 


45 






eleeans 








1247 


AF257305 


Homo sapiens 


ASH1 


576 


89 


1248 


G03263 


Homo ^aniens 


HiinmiTfi Qpprf*tf»f? nrntpin TO MO* 




Oo 








7344. 






1249 


AF025467 


Caenorhabditis 


contains ^imilaritv to dro^onhila TTNA- 


107 








elegans 


bindine: nrotein K10 fN!D*e8148^ 






1250 


AP000616 


Oryza sativa 


similar to RING-H2 fin per nrotein 

tJAAAAA W4 %AV AV^ll w 4 1 it iUJCwl UlvtvJJl 


130 


91 








RHAla(AF078683) 






1251 


U63542 


Homo sapiens 


FAP protein 


116 


61 


1252 


G04091 


Homo sapiens 


Human secreted nrotein SEO ID NO* 

11ULUUU JWVI vlvu UI v>>vUl) *JLJ\S kLS iiv/> 


109 


5^ 
j j 








8172. 






1253 


AF068294 


Homo sapiens 


HDCMB45P 


251 




1254 


AF090895 


Homo sapiens 


PRO01 17 


111 


60 


1255 


G02827 


Homo saoiens 


Human secreted nrotein XPO ID TvIO* 


201 


7** 








6908. 






1256 


AF010144 


Homo sapiens 


neuronal thread nrotein AD7c~MTP 


1 16 


51 


1257 


AB033032 


Homo sapiens 


KIAA1206 nrotein 


115 


80 


1258 


• G00365 




Human cerrpteH nrotein TT*j MO* 




OU 








4446. 






1259 


G03801 


Homo ^anienc 


Human Q^rreteH nrntpin TO MO* 


1j / 


Oo 








7882. 






1260 


AB032906 


Hvlohate*; 


flonamine rerpntor 04. 

UUUal i 1 Ul V 1 C ^ v LF lAJi JL^T 










nileatiis 








1261 


AF022985 


Ca en o rhflhH iri c 


Ouiiiiai Lw ouiiagcii, tuucu ICli Uy 


J vO 


JO 








ClCgtulS L>LJrir\. jrKJJU.J, CC/Ucu KJi Oj 












P elppani rDNA vlrftfin'S *<• rnHpH for 












hv n eleannQ cDNA vk"71ftd rtnH*v1 












for bv C eleeans cDNA vk55f* 5* 












coded for by C. elegans cDNA 












yk66d5.5; coded for by C. elegans 












cDNAyk71e4.5 






1262 


U93566 


Homo sapiens 


p40 


182 


39 


1263 


L20096 


Manduca sexta 


ribosomal protein s7 


227 


59 


1264 


AFl 19901 


Homo sapiens 


PR02831 


103 


71 


1265 


D87446 


Homo sapiens 


Similar to a C.elegans protein encoded 


219 


97 








in cosmid C27F2 flJ40419) 

AAA VVUiUJlU V ' A 4# I TVTT A S J 






1266 


D90279 


Homo sapiens 


collagen alpha 1(V) chain precursor 


120 


42 


1267 


L27428 


Homo sapiens 


reverse transcriptase 


111 


41 


1268 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: [ 


126 


68 








4493. 






1269 


AF016099 


Mus musculus 


endonuclease/reverse transcriptase 


102 


34 



162 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


mcc&bion it ©• 


Species 


Description 


Smith- 
Waterman 

T T mail 

Score 


% 

Ideotit 
y 


1270 


AFI30089 


Homo sapiens 


PRO2550 


96 


69 


1271 


U12707 


Homo sapiens 


Wiskott-Aldrich syndrome protein 


121 


45 


1272 


AF165310 


Homo sapiens 


ATP cassette binding transporter 1 


243 


100 


1273 


R95913 


Homo sapiens 


Neural thread protein. 


110 


70 


1274 


X92485 


Plasmodium 
vrvax 


pval 


106 


65 


1275 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7253. 


104 


56 


1276 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


53 


36 


1277 


X03725 


Mus musculus 


ORF2(466aa) 


103 


41 


1278 


U93570 


Homo sapiens 


putative pi 50 


98 


43 


1279 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


159 


83 


1280 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


167 


71 


1281 


AJ271871 


Nicotiana 
sylvestris 


putative extensin 


105 


36 


1282 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


119 


32 


1283 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


124 


66 


1284 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


135 


50 


1285 


A3 1039 


Nicotiana alata 


PRP3 


112 


36 


1286 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


116 


72 


1287 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


289 


67 


1288 


S80119 


Rattus sp. 


reverse transcriptase homolog 


112 


33 


1289 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


203 


76 


1290 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


151 


53 


1291 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


128 


82 


1292 


AF130089 


Homo sapiens 


PRO2550 


127 


62 


1293 


AF003535 


Homo sapiens 


ORF2-like protein 


101 


48 


1294 


Y19610 


Homo sapiens 


SEQ ID NO 328 from W09922243. 


100 


42 


1295 


L27428 


Homo sapiens 


reverse transcriptase 


126 


36 


1296 


G00437 


Homo sapiens t 


Human secreted protein, SEQ ID NO: 
4518. 


140 


71 


1297 


L24433 


Oncorhynchus 
mykiss 


complement component C3 


359 


31 


1298 


AC004381 


Homo sapiens 


SA gene 


443 


55 


1299 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


180 


64 


1300 


Y14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


135 


66 


1301 


D84391 


Mus musculus 


reverse transcriptase 


106 


48 


1302 


D13951 


Nicotiana 
tabacum 


extensin precursor 


134 


42 


1303 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


73 


1304 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


93 


38 


1305 


Y13620 


Homo sapiens 


BCL9 


102 


39 


1306 


U93567 


Homo sapiens 


putative pi 50 


245 


63 


1307 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


130 


70 


1308 


AL355774 


Streptomyces 
coelicolor A3(2) 


putative integral membrane protein 


136 


40 


1309 


W54966 


Homo sapiens 


Synthetic human type III collagen 


124 


41 



163 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








SYN-C3. 






1310 


M20670 


Plasmodium 
vivax 


circumsporozoite protein 


107 


34 


1311 


AF151366 


Arabidopsis 
thaliana 


arginine/seririe-rich protein 


114 


36 


1312 


G03099 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7180. 


76 


43 


1313 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


109 


85 


1314 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


111 


37 


1315 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


123 


62 


1316 


R95913 


Homo sapiens 


Neural thread protein. 


98 


58 


1317 


AF 113685 


Homo sapiens 


PRO0974 


170 


47 


1318 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


122 


61 


1319 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


99 


68 


1320 


AJ242540 


Voivox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


270 


56 


1321 


U93569 


Homo sapiens 


putative pi 50 


124 


37 


1322 


AF090931 


Homo sapiens 


PRO0483 


111 


85 


1323 


G04091 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
8172. 


85 


44 


1324 


Ml 8094 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


131 


43 


1325 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


144 


54 


1326 


AF 194537 


Homo sapiens 


NAG13 


125 


49 


1327 


L27428 


Homo sapiens 


reverse transcriptase 


111 


45 


1328 


U93568 


Homo sapiens 


putative pi 50 


112 


30 


1329 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


110 


51 


1330 


L27428 


Homo sapiens 


reverse transcriptase 


142 


53 


1331 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


98 


88 


1332 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


113 


55 i 


1333 


X71602 


Nicotiana 
tabacum 


extensin 


113 


35 


1334 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


129 


41 


1335 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


112 


50 


1336 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


102 


68 | 


1337 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


95 


51 


1338 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


114 


60 


1339 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4460. 


142 


55 


1340 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


123 


31 


1341 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7001. 


118 


70 


1342 


X71629 


Mus musculus 


msgl 


106 


57 


1343 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. \ 


105 


79 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1344 


G03435 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7516. 


116 




1345 


AF161356 


Homo sapiens 


HSPC093 


88 


88 


1346 


J01435 


Rattus 
norvegicus 


ATPase 


348 


66 


1347 


U93563 


Homo sapiens 


putative pi 50 


142 


36 


1348 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


87 


66 


1349 


AF090942 


Homo sapiens 


PRO0657 


152 


54 


1350 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


143 




1351 


U93572 


Homo sapiens 


putative pi 50 


113 


84 


1352 


X92485 


Plasmodium 
vivax 


pval 


130 


70 


1353 


X61047 


Hydra sp. 


mini-collagen 


105 


36 


1354 


AF115549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


128 


46 


1355 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


91 


39 


1356 


AF194537 


Homo sapiens 


NAG13 


148 


63 


1357 


AF130079 


Homo sapiens 


PR02852 


139 


73 


1358 


X53581 


Ratals 
norvegicus 


ORF4 


208 


43 


1359 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


195 


50 


1360 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


328 


69 


1361 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


126 


65 


1362 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6532. 


97 


47 


1363 


X53581 


Ratals 
norvegicus 


ORF4 


no 


35 


1364 


U93569 


Homo sapiens 


putative pl50 


123 


.41 


1365 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


114 


63 


1366 


X61294 


Ratals 
norvegicus 


LI retroposon, a portion of its ORF2 
sequence 


153 


43 


1367 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


69 


80 | 


1368 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


80 


46 


1369 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


113 


26 


1370 


X92485 


Plasmodium 
vivax 


pval 


106 


46 


1371 


U90946 


Dictyostelium 
discoideum 


myosin heavy chain kinase B 


114 


62 


1372 


L27428 


Homo sapiens 


reverse transcriptase 


98 


61 


1373 


GO3800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


124 


80 


1374 


U49974 


Homo sapiens 


mariner transposase 


137 


57 


1375 


AC004891 


Homo sapiens 


contactin-like; similar to U87224 
(PID:gl857710) 


234 


58 


1376 


AP010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


132 


60 


1377 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


317 


51 


1378 


AF1 18082 


Homo sapiens 


PRO 1902 


102 


42 


1379 


AF1 15549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


186 


45 


1380 


U93567 


Homo sapiens 


putative p i 50 


116 


38 


1381 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 


218 


47 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 

v 








similar to pogo element 






1382 


B08918 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 28 SEQ ID NO:75. 


87 


66 


1383 


AF090895 


Homo sapiens 


PRO0117 


68 


82 


1384 


U93570 


Homo sapiens 


putative pi 50 


178 


39 


1385 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


132 


43 


1386 


AF130089 


Homo sapiens 


PRO2550 


142 


35 


1387 


L27428 


Homo sapiens 


reverse transcriptase 


163 


49 


1388 


X61296 


Rattus 
norvegicus 


open reading frame 2 


123 


44 


1389 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


138 


67 


1390 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


147 


57 


1391 


U93570 


Homo sapiens 


putative pi 50 


110 


28 


1392 


R95913 


Homo sapiens 


Neural thread protein. 


104 


35 


1393 


L27428 


Homo sapiens 


reverse transcriptase 


101 


39 


1394 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


147 


47 


1395 


AF216972 


Homo sapiens . 


p8 protein 


118 


49 


1396 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


160 


58 


1397 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


312 


62 


1398 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


97 




1399 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


125 


54 


1400 


X92485 


Plasmodium 
vivax 


pval 


124 


39 


1401 


U93563 


Homo sapiens 


putative pi 50 


131 


36 


1402 


X67863 


Mus musculus 


T2 


160 


48 


1403 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


94 


39 


1404 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


84 


73 


1405 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


71 


71 


1406 


Ml 8094 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


138 


31 


1407 


AF 134304 


Homo sapiens 


Scar2 


118 


40 


1408 


Y08061 


Homo sapiens 


Human omyb protein fragment 


121 


82 


1409 


U93574 


Homo sapiens 


putative pi 50 


179 


43 


1410 


U93563 


Homo sapiens 


putative pi 50 


98 


43 


1411 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


124 


46 


1412 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


149 


56 


1413 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


148 


48 


1414 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


155 


82 


1415 


G03267 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7348. 


92 


52 


1416 


AF1 19855 


Homo sapiens 


PR01847 


82 


70 


1417 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


130 


34 


1418 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


99 


39 


1419 


AF130079 


Homo sapiens 


PR02852 


114 


69 


1420 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


133 


52 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% | 

Identft 
y 


1421 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


115 


44 


1422 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867/ 


111 


77 


1423 


R95913 


Homo sapiens 


Neural thread protein. 


128 


80 


1424 


L26953 


Homo sapiens 


chromosomal protein 


104 


34 


1425 


U83280 


Leishmania 
donovani 


39 kDa antigen 


105 


51 


1426 


Y 19684 


Homo sapiens 


SEQ CD NO 402 from W09922243. 


98 


75 


1427 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


149 


40 


1428 


AF090895 


Homo sapiens 


PRO0117 


111 


75 


1429 


AF1 19855 


Homo sapiens 


PR01847 


88 


56 


1430 


AF229067 


Homo sapiens 


PADI-H protein 


157 


51 


1431 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


509 


79 


1432 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


333 


62 


1433 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


149 


57 


1434 


AF161356 


Homo sapiens 


HSPC093 


180 


46 


1435 


U93570 


Homo sapiens 


putative pl50 


116 


48 


1436 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


171 


73 


1437 


U83280 


Leishmania 
donovani 


39 kDa antigen 


106 


80 


1438 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


103 


73 


1439 


U15647 


Mus musculus 


reverse transcriptase 


233 


44 


1440 


L27428 


Homo sapiens 


reverse transcriptase 


78 


40 


1441 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


115 


68 


1442 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


77 


63 


1443 


M22332 


Homo sapiens 


unknown protein 


153 


62 


1444 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


102 


40 


1445 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


101 


66 


1446 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6596. 


125 


74 


1447 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


90 


66 


1448 


X76208 


Drosophila 
melanogaster 


protein 33-specific exons 


123 


48 


1449 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


178 


79 


1450 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


109 


45 


1451 


S80119 


Rattus sp. 


reverse transcriptase homolog 


115 


54 


1452 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


70 


63 


1453 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


105 


40 • 


1454 


AK024455 


Homo sapiens 


FU00047 protein 


109 


53 


1455 


AC007258 


Arabidopsis 
thaliana 


Hypothetical protein 


105 


37 


1456 


AF194537 


Homo sapiens 


NAG13 


208 


52 


1457 


U63542 


Homo sapiens 


FAP protein 


111 


84 


1458 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


111 


65 


1459 


AF090931 


Homo sapiens 


PRO0483 


84 


43 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


1460 


R95913 


Homo sapiens 


Neural thrrarl nrntem 
nwuiai iuicau pi UlCill. 


lrt6 
loo 


/CO 


1461 


R95913 


Homo sapiens 


Neural thread protein. 


109 


40 


1462 


U93564 


Homo sapiens 


putative pi 50 


237 


42 


1463 


AB029309 


Homo ^anient 


iNpwjo-Dinamg protein lNpwor 


Q7 


j>7 


1464 


U44838 




pvtpnem 
CAICIlSill 


07 


33 


1465 


AL050341 


IaUuIO oapiClLo 


dJ39G22. 1 (rearranged L-myc fusion 
sequence (ZN-15 related zinc finger 
pr utcin j j 


121 


45 
* 


1466 


L27428 


Homo sapiens 


reverse transcriptase 


94 


34 


1467 


AF0 10144 


JTJLUt 11 IS ocxpitiio 


neuronal tnreaa protein ad/c-.n 1 r 


ill 
117 


58 


1468 


Y17221 


Homo sapiens 


Human secreted protein (clone fk317- 

3J. 


98 


48 


1469 




nomo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


123 


55 


1470 




Urvm r\ canipnc 
nuinu odpiCllS 


xiuman secreted protein, ofc.^) ID NU: 
7871. 


147 


67 


1471 


AF1 ftQ0fl7 


nomo Sapiens 


Q1 HA ~~ 
O 104 


133 


35 


1472 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
cione nrlVliiy^Z. 


142 


36 


1473 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

QT?r\ rr\ vifV 1 /CO 
dEV^ ID NU.163. 


152 


90 


1474 




nomo sapiens 


Hum an secreted protein, SnQ ID NO: 

7714 
/Zl*h 


157 


45 


1475 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


398 


58 


1476 


U49974 


Homo sapiens 


mariner transposase 


201 


59 


1477 


T 140074 


nomo sapiens 


mariner transposase 


206 


60 


1478 


VY*tOJJ 1 


nomo sapiens 


Human breast cancer related protein 


117 


72 


1479 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


82 


1480 


U93567 


Homo sapiens 


putative pi 50 


202 


42 


1481 


kiY7*7/% 

fvUZ J /o 


Homo sapiens 


salivaiy proline-rich protein 1 


101 


46 


1482 


U87607 


Rattus 
norvegicus 


putative RNA binding protein 1 


100 


37 


1481 


no^7on 


nomo sapiens 


Human secreted protein, SEQ ID NO: 

fo/ 1. 


124 


75 


1484 


\JUuJZ.O 


nomo sapiens 


numan secretea protein, ocxi tD NO: 
4409. 


1 AO 

108 


62 


1485 


U15647 


Mn<: miicoiiluc 


reverse u ansenptase 


1 1 c 

115 


73 


1486 


AF194537 


Homo sapiens 


NAG13 


132 


42 


1487 


Ml 1902 


Mus muscuius 


proline-rich salivary protein 


118 


40 


1488 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/A 14. 


102 


64 


1489 




nimupie 
sclerosis 

retrovirus 


poiyprotein 


i in 

110 


48 


1490 




nomo odpiCUb 


unnamea protein proauci 


114 


37 


1491 


G03786 


Uaitia canipnc 
rxviuvs aapiviio 


Human OA/^rotA/l nrAfdiM Q TT"\ "VT/~\. 

nuuiaii secretea protein, ocxi ID INO: 

7R/S7 
/oo /. 


133 


50 


1492 




nuiliu aapivjuo 


vivr j , ivuciivD / , putative transposase 
ouiiiioi io pogo cicrncni 


1 QA 

lo4 


47 


1493 


Y12713 


Mus muscuius 


Pro-Pol-dUTPase poiyprotein 


222 


53 j 


1494 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


94 


50 


1495 


AF109907 


Homo sapiens 


S164 


259 


45 


1496 


R95913 


Homo sapiens 


Neural thread protein. 


110 


51 


1497 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


299 


80 
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SEQW> 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


1498 


G03258 


Homo <5aoiens 


Human <ze*rrrtr>A nrntpm &FO TT> KO- 

1J.U1110U oCMGlCU JJlULClil, OJL-»\^ IX-/ JNV/. 

7339. 


l Jo 


OJ 


1499 


L26953 


Homo "saniens 


c n norm OQrt tin £i 1 nT/^f"<*iri 

vau umuwuial pi UtC 111 


104. 


Of 


1500 


AF090895 


Homo sapiens 


PRO0117 


145 


68 


1501 


U93572 


Homo ^lanien*; 




1 13 




1502 


G04091 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
8179 


133 


50 


1503 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6SXR 


143 


56 


1504 


G00328 




T-Tlimiln cor- rotor? nrnfpm QTTf\ lit "KT/"^- 

riLiiiiau secreieu proieiu, otA^ lxJ i\vj. 
4409. 


i in 

i iy 


69 


1505 


W00838 


Homo ^fmifriQ 


i uiiiuui necrosis iacior*reiaicci gene 
product 


1 n^ 


jZ 


1506 


AF 109907 

*XX Iv//vf 




OAO*T 




43 


1507 


U49973 


X LKJILIKJ Odp IClli 


v*vr i , ivle.iv) f 3 puuiu vc inmsposase 
similar to pogo element 




/IT 


1508 


R13556 


Homo QfiniPnc 


x lutein eucouca uownbu cdiii oi nnc ivi 

An c t\ nr n tf> i n 
UXivUJ/l UICU1. 




/CO 

oy 


1509 


AK000241 


Homo ^anienR 


nnn^mfvl nrntPtn rirrwHtiot" 


10/ 




1510 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
66 n 


153 


60 


1511 


S62928 


Homo <; aniens 




1 17 
13 / 


ID 
J7 


1512 


AB012223 


Canis familiaris 


ORF2 


116 


40 


1513 


G03807 




numaii socreiea protein, ocy jlu invj. 

/ ooo. 




59 


1514 


AF220264 


Homo sapiens 


MOST-1 


108 


80 


1515 


X53581 


Rattiic 
lvaima 

norvegicus 




yo 


A A 

44 


1516 


V0066 / > 


TJVvmo cjhiimic 


i*yu?uijroinc oxidase in 


All 


74 


1517 


U49973 


Homo canipnc 

AT.UlliU ba.pi Clio 


v/KT i , lvituvj / j puianve uansposase 
similar to pogo element 




70 


1518 


U49973 


Homo Raoif^tm 


OWP 1 * IV/iThIJ17* tyi froncnncQoo 
\JL\Sr l , IVUCJVj / , pULalivc uaJlSpOSaSe 

similar to pogo element 




/u 


1519 


M24732 


Hattia cflnipnc 

X LKJIUU da.pJ.C113 


idiiiifi'iiKc protein 


1 AT 
10/ 


A A 

44 


1520 


G03133 




numofl secreieu protein, osz\i iu jnvJ. 
7214. 


111 


51 


1521 


W48351 


TJr\m f\ ca nipnc 


numan ore as i cancer reiateG protein 
BCRB2. 


oi 


68 


1522 


AF 194537 


X1AJJ11V/ jdjJiCllo 


1N/WJ 1 J 


1 lo 


A'x 

42 


1523 


X92485 


P1ai5modiiim 
c man mmuiii 

vivax 


nvfl 1 
pvai 




A*} 


1524 


G04091 


Homo saDiens 


Human <tf»rrptpH nrntpin <IPn TO MO* 

8172. 


JZ3 


JU 


1525 


AB041228 


Homo sapiens 


G protein-coupled receptor TGR-1 


220 


100 


1526 


AF010144 


Homo sapiens 


neuronal thread nrotein Aiy7r-NTP 


1*19 
lOO 




1527 


U52077 


Homo sapiens 


mariner tnanwiosase 


2^7 


JO 


1528 


L27428 


Homo sapiens 


reverse transcriptase 


189 


40 


1529 


Y86573 


Homo saDiens 


Human trpnp Q1-pnfmrfpH nrntpin 
, x in in cm g&uc J/ 1 **iwiiiA/ucu jji UtCiii 

fragment, SEQ ID NO:490. 




to 


1530 


L13610 


Mus musciilu^ 

LliUJ UiUJUUtUiJ 


XJTlT'i vo|>v/IioC vivlliwlJI. (/iliUlUg lovLUi £, 


on 


17 


1531 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


64 


1532 


U11288 


Dro^onhila 

XSX. vO\JL/li lid 

m elan n<»a5tfpr 


uidpiiaiiuub pruiein 


IJO 


1 0 

38 


1533 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


429 


63 


1534 


L27428 


Homo sapiens 


reverse transcriptase 


249 


55 


1535 


U93570 


Homo sapiens 


putative pi 50 


114 


31 


1536 


AF130089 


Homo sapiens 


PRO2550 


111 


47 



169 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith. 

Waterman 

Score 


% 

Idcntit 

y 


1537 


X55687 


Lycopersicon 


extensin (class II) 


63 


28 


1538 


U15647 


Mus musculus 


reverse transcriptase 


110 


42 


1539 


AK024455 


Hnmn conipn c 
nuiiiti 5apicii5 


r LtJuuuH / protein 


i jy 


80 


1540 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

714^ 
/ 1*0. 


128 


48 


1541 


AC024788 


Caenorhabditis 

CICgallb 


Hypothetical protein Y46E12A.d 


80 


46 


1542 


W48351 


Homo sapiens 


Human breast cancer related protein 


132 


73 


1543 


tt JV17Z 


numo Sapiens 


Amino acid sequence of salivary 
pruiein v/Wii- i . 


90 


32 


1544 




nonio sapiens 


Human secreieu protein, bev^ ID JNU: 
7052. 


71 


63 


1545 


IV7J7 1 D 


nunio sapiens 


iNeurai mreaa protein. 


1 18 


55 


1546 


D381 16 


ran palUSCUS 


cytochrome c oxidase sub unit 1 


A« ft 

218 


78 


1547 


D381 12 


nunio sapiens 


cytochrome c oxidase sub unit 1 


370 


71 


1548 


W48351 


Homo sapiens 


Human breast cancer related protein 


119 


65 


1549 


AJ004810 


Zeamays 


cytochrome P450 monooxygenase 


140 


70 


1550 


AF1 1 


Homo sapiens 




115 


47 


1551 


YR694R 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


172 


75 


1552 


Y01 577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


366 


94 




WtUl 1 J 


Homo sapiens 


Human alpha-2(IV) collagen protein. 


117 


59 


1554 




Homo sapiens 


Human secreted protein, SEQ ID NO: 

AAQ1 


138 


76 


1555 


U42471 


Mus musculus 


Wiscott-AIdrich Syndrome protein 
homolog 


113 


40 


1556 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


502 


89 






Homo sapiens 


JrKUzjSO 


98 


55 


1558 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
aenveairom UN Ai do Jo. 


109 


43 


155Q 




— = : 

Homo sapiens 


UKr l , MbKj /, putative transposase 
similar to pogo element 


183 


52 


1560 




Homo sapiens 


UKr I ; McKj /; putative transposase 
similar to pogo element 


180 


52 


1561 




ivius muscuius 


KlLdlD 


254 


85 


1562 


M63421 


Drosophila 
m eianogas ier 


csp32 


104 


39 


1563 


G01652 


rnjiiiu sapiens 


Unman c or* r atari nrAtain CCA TT\ "KT/^v. 

numan secreteu protein, ocv£ m no: 
7733. 


129 


69 


1564 


U49973 




vjtvr i , jvldivj / , puianve uansposase 

o liU ll al IU ClvlUCUL 




59 


1565 


AJ132106 


Bos taurus 




1 l*f 


Af\ 
W 


1566 


AL390114 


T .fti^h mania 

major 


cAucLLieiy \*y btcuic/ valine ncn proiein 


1 io 
i iy 


oo 


1567 


AF161356 


A A will V/ csaJJlCUo 








1568 


AF1 19851 


Homo sapiens 


PR01722 


94 


72 


1569 


L27428 


nuiiiu sapiens 


reverse transenptase 


1 AT 


48 


1570 




Xyycupersicon 
esculentum 


excensin-iiKe proiein jl/iiiu 


1U4 


32 


1571 


vni4nn 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


126 


59 


1572 


X73481 


Drosophila hydei 


mstl01(2) 


105 


41 


1573 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein Kl 0 (NJD:g8148) 


133 


50 


1574 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


51 



170 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

v 








8144. 






1575 


D38112 


Homo sapiens 


NADH dehydrogenase subunrt 4 


323 


83 


1576 


AF062008 


Caenorhabditis 
elegans 


unknown 


111 


54 


1577 


X92485 


Plasmodium 
vivax 


pval 


81 


57 


1578 


U93570 


Homo sapiens 


p40 


102 


33 


1579 


AF090944 


Homo sapiens 


PRO0663 


132 


59 


1580 


AL137798 


Homo sapiens 


dJl 182A14.5.1 (novel gene (isoform 
1)) 


182 


53 


1581 


X92485 


Plasmodium 
vivax 


pval 


120 


44 


1582 


G0409I 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


160 


47 


1583 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


135 


54 


1584 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


119 


52 


1585 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


82 


55 


1586 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


152 


60 


1587 


D28482 


Homo sapiens 


SCR2 


390 


83 


1588 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


122 


59 


1589 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


117 


50 


1590 


AF 118078 


Homo sapiens 


PR01848 


118 


59 


1591 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


56 


78 


1592 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


118 


80 


1593 


U93571 


Homo sapiens 


p40 


170 


77 


1594 


Y12713 


Mus musculus _j 


Pro-Pol-dUTPase polyprotein 


230 


51 


1595 


L27428 


Homo sapiens 


reverse transcriptase 


138 


45 


1596 


X97707 


Pongo pygmaeus 
abelii 


stopcodon created by 
posttranscripuonal polyadenylation 


139 


61 


1597 


X98710 


Homo sapiens 


COL1A1 and PDGFB fusion transcript 


107 


31 


1598 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


97 


58 


1599 


AF210651 


Homo sapiens 


NAG18 


86 


89 


1600 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


148 


48 


1601 


J03770 


Mus musculus 


homeobox protein 


99 


35 


1602 


AF1 19901 


Homo sapiens 


PR02831 


119 


56 


1603 


AL031673 


Homo sapiens 


dJ694B14.1 (PUTATIVE novel KRAB 
box protein with 18 C2H2 type Zinc 
finger domains) 


233 


44 


1604 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


66 


1605 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7348. 


89 


44 


1606 


D88461 


Rattus rattus 


N-WASP 


123 


43 


1607 


AF090942 


Homo sapiens 


PRO0657 


107 


61 


1608 


U35730 


Mus musculus 


jerky 


105 


34 


1609 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


108 


45 


1610 


B06334 


Homo sapiens 


Human subtilisin-kexin isoenzyme I. 


474 


84 


1611 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


101 


90 


1612 


AK024455 


Homo sapiens 


FLJ00047 protein 


83 


55 


1613 


D86853 


Catharanthus 


extensin 


123 


39 



171 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


fcmith- 

Watermao 

Score 


% ~~ 

Ideatit 
v 






roseus 








1614 


AFI19851 


Homo sapiens 


PR01722 


135 


59 


1615 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


142 


57 


1616 


AF1 94537 


Homo sapiens 


NAG13 


154 


63 


1617 


AF1 19851 


Homo sapiens 


PR01722 


91 


62 


1618 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


92 


48 


1619 


AKG24455 


Homo sapiens 


FLJ00047 protein 


147 


60 


1620 


AF217973 


Homo sapiens 


unknown 


116 


67 


1621 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


102 


41 


1622 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


108 


40 


1623 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


135 


44 


1624 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


322 


62 


1625 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


356 


72 


1626 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


113 


65 


1627 


X92485 


Plasmodium 
vivax 


pval 


90 


45 


1628 


AF090895 


Homo sapiens 


PRO0117 


156 


61 


1629 


AF1 16661 


Homo sapiens 


PR01438 


87 


54 


1630 


M13100 


Rattus 
norvegicus 


unknown protein 


109 


76 


1631 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


147 


60 


1632 


AF1 19851 


Homo sapiens 


PRO 1722 


107 


70 


1633 


M64792 


Rattus 
norvegicus 


salivary proline-rich protein 


109 


46 


1634 


L27428 


Homo sapiens 


reverse transcriptase 


109 


38 


1635 


AF1 18082 


Homo sapiens 


PRO 1902 


80 


40 


1636 


R95913 


Homo sapiens 


Neural thread protein. 


118 


88 


1637 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


79 


60 


1638 


U93570 


Homo sapiens 


putative pi 50 


128 


54 


1639 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


127 


69 


1640 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


171 


68 

I/O 


1641 


AF194537 


Homo sapiens 


NAG 13 


140 


66 


1642 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


101 


50 


1643 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


117 


33 


1644 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


123 


50 


1645 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


96 


70 


1646 


L27428 


Homo sapiens 


reverse transcriptase 


86 


84 


1647 


X92485 


Plasmodium 
vivax 


pval 


137 


40 


1648 


U15647 


Mus museums 


reverse transcriptase 


93 


68 


1649 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


131 


41 


1650 


AF1 16712 


Homo sapiens 


PR02738 


107 


57 


1651 


G02639 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6720. 


139 


72 . 


1652 


X05472 


Rattus 


ORF3 


84 


51 



172 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

TT 41CT UtHD 

Score 


% 

1 A art tit 

y 






norvegicus 








1653 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


122 


71 


1654 


U93566 


Homo sapiens 


p40 


117 


52 


1655 


AF217536 


Homo sapiens 


truncated mevalonate kinase 


141 


70 


1656 


AF090895 


Homo sapiens 


PRO0117 


125 


60 


1657 


X92485 


Plasmodium 
vivax 


pval 


114 


45 


1658 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


192 


61 


1659 


Y86473 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:388. 


73 


30 


1660 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250\ 


331 


74 


1661 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


127 


69 


1662 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


312 


67 


1663 


S80119 


Rattus sp. 


reverse transcriptase homolog 


99 


59 


1664 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


106 


45 


1665 


M76729 


Homo sapiens 


pro-alpha- 1 type V collagen 


172 


47 


1666 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


97 


54 


1667 


AF169388 


Mus musculus 


alpha 4 collagen IV 


84 


38 


1668 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


84 


66 


1669 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


428 


80 


1670 


M29622 


Mus musculus 


open reading frame 2 


74 


46 


1671 


W90838 


Homo sapiens 


Human lymphocyte targeted peptide 
#6. 


98 


100 


1672 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


55 


58 


1673 


AF090931 


Homo sapiens 


PRO0483 


72 


39 


1674 


AF051782 


Homo sapiens 


diaphanous 1 


116 


49 


1675 


U57361 


Rattus 
norvegicus 


collagen XII alpha 1 


108 


48 


1676 


AF 182844 


Homo sapiens 


VPS28 protein 


395 


95 


1677 


L27428 


Homo sapiens 


reverse transcriptase 


189 


47 


1678 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


142 


53 


1679 


U93565 


Homo sapiens 


putative pi 50 


214 


40 


1680 


AK002129 


Homo sapiens 


unnamed protein product 


128 


57 


1681 


X03145 


Homo sapiens 


pot ORF V 


93 


48 


1682 


X63005 


Mus musculus 


proline-rich protein 


108 


38 


1683 


AFi 18082 


Homo sapiens 


PRO1902 


117 


42 


1684 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


157 


58 


1685 


R95913 


Homo sapiens 


Neural thread protein. 


92 


66 


1686 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


111 


61 


1687 


X61296 


Rattus 
norvegicus 


open reading frame 2 


104 


38 


1688 


AB012223 


Canis familiaris 


ORF2 


98 


39 


1689 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


278 


72 


1690 


U52077 


Homo sapiens 


mariner transposase 


175 


56 


1691 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


83 


46 



173 



SEQID 
NO: 


Accession No. 


Specks 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 

y 


1692 


AF061128 


P lasm od inm 

a mju i wui uiii 

falciDamni 


iiici t/zuiie suriaCc proiciii i 




44 


1693 


X77722 


Homo sapiens 


interferon nlnha/h^tn rpeenfror 






1694 


M13100 


Rattus 
norvegicus 


unknown protein 


94 


40 


1695 


AF202635 


Homo s aniens 


PP1200 


1 14 

1 l*T 


ou 


1696 


AK001116 


Homo sapiens 


unnamed protein product 


127 


53 


1697 


G00403 


Homo sanien^ 


Human cf»r*rr»tf»rl nrntpin QPO TTi NfV 
nuiiiaii bcureicu pi uicui, ocy ll/ pivj. 

4484. 


lOZ 




1698 


AF 118078 


Homo <?anien^ 

IIUUIU OCipi^<110 






4Z 


1699 


X92485 


Plasmodium 

XT &CU111V/VUU111 

vivax 






4y 


1700 


M63819 


Plasmodium 

falcinamm 


malaria antigen 


101 


64 


1701 


AF090930 


Homo saniprK 






/O 


1702 


AB009993 


Mus musculus 


collagen al(V) 


94 


40 


1703 




J-rC loJUUXlallla 

major 


extremely cysteine/valine rich protein 




66 


1704 


AF130089 


Homo sapiens 


PRO2550 


145 


38 


1705 


X83413 


Tinman 
n. mucin 

hf*rnp<ivirii<i /5 


UOO 


1 1 J 




1706 


S60088 


Homo ^anipiT? 


nntfltivp nHViAcinn mn1iviilA= A T^TV^T Y 
JpUuxllVC aUilcolUU inuicuuic — /aj^/ivlua. 


i 

i ji 


OO 


1707 


GO 1246 


Homo <saniPTK 

I lull 11/ oCLJJldlCi 


Unman cfv^rpterl nrntpin CT70 TH "KTfi* 
xiullldil bCuicLCU prOLciLI, oEy LU WKJ. 

5327 


1 /O 


DZ 


1708 


AF130079 


Homo sapiens 


PR02852 


174 


41 


1709 


Y28682 


Homo cnntpnc 


xt uiii aii ppo^z, j bcv/reu?u protein. 


jj / 


yy 


1710 


M14423 


N/fllR milQPIllllC 
1V1UO XllUoV'UlUo 


piu-aipiid- 1 type *• conagen 


t LZ 


"2/1 

34 


1711 


D13623 


Rattus sp. 


p34 protein 


128 


45 


1712 


VJutvwJ 


nuuju oapiciio 


Unman eor»r*A*»rI nrntain CT?/"^ TT"\ "KT/^\. 

nunian secreted proiein, oiii^ UJ inu. 
8144. 


54 


41 


1713 


X9767S 


JlUillU bap I Clio 


piaKOpnillQ SO 




60 


1714 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


51 


1715 


U4997'? 


JxUIIlU bapiCJlb 


uivT i , jvldivj / , pucauve transposase 

OUilllul us pv/^U d CI 11 Clit 


1 /l 


48 


1716 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438 

If JO. 


99 


63 


1717 


S80119 


Raftiic en 

IValLUj JU, 


npvprcf* trsmc/TTntricA Vir\m r\1r\rr 
icvci ac iiouac* lpuaac liuiiiuiug 


1 ACi 


J J 


1718 


L27428 


Homo sapiens 


reverse transcriptase 


103 


46 | 


1719 


G03790 


RnnriA cjinipnc 
XAUHIU xipicio 


Human c^frAt^H r*rr\tT»iTi CTJO XT\ KTr*4- 
ax uiii cm acuicicu pjuicui, oX3>v/ 1L/ LNv/. 

7871. 


134 


/o 


1720 


AF 130089 

rvi UUU07 


ilv/lllU CWipiGllb 


PRH95^ft 

riv\//>jju 


1 0£ 


JJ 


1721 


G03652 


T-fntYio QAmVnc 

XXUllll/ ZXOlilCllo 


T-fumnn cArrAtAd nmtp»in QTTfi ITi \J/^- 
rxULilltlll OCwI CUGU UllllCill, OJC»V^ II-/ liVy. 

7733. 


14/ 


4y 


1722 


G03703 


Homo ranter! q 


Wnman <;pfr<»tpd nrotpin TFi "MT*^- 
nuiiicui ocwcivu pi ULCiit, OC/V/ 1 1 J liV^. 

7784. 


ill 


/z 


1723 


AFO 16099 


JVfne itiiicriiIiic 

lTlUo UlUdwUlUd 


pndonti^?A$icp/riP\/prcA fr^ncr^rintao** 
wiiuuiiubicoo wi oy ci sc u oiiov/i ipiaoC 




AA 


1724 


X92485 


Plasmodium 


pval 


96 


62 


1725 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 

oiuiiioi iaj pugu vJCIUCUl 


249 


52 


1726 


M81321 


fascietilarii 


i^rrtlin**— rioH nrntpin 

piuiiiic-i it/ii pruicui 




AK 
4J 


1727 


G02507 


Homo <ianipn^ 


Hiimfln ^f*rr**tftd nrnt#*m ^T?0 1 1 "i Xin* 

XXU.L1KU1 dCUlClvU. piUlGJxi, OJ3*V^ LL/ llV/. 

6588. 


1 10 


oy 


1728 


U93564 


Homo sapiens 


p40 


129 


58 


1729 


U93574 


Homo sapiens 


putative pi 50 


113 i 


76 


1730 


AF 130089 


Homo sapiens 


PRO2550 


136 


61 


1731 


L05608 


Cercopithecine 


glycoprotein gl 


100 


46 



174 
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S£QH> 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






hpmfivinnj 9 








1732 


U15647 


Mus musculus 


reverse transcriptase 


138 


36 


17T? 


1107574 


t-I pvm c\ ccrniDTio 
rivjlIIO SapidlS 


puiauVe pi3U 


187 
»o/ 


4*5 


1734 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7705 


155 


64 


1735 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoueu oy gene \l i . 


102 


64 


17^6 

1 / JO 


Yft9QQQ 
i \)L,yyy 




rragmeni 01 n union secreieu piuieni 
encoded by gene 12 1. 


1 1R 


64 


1717 


frftft4Qft 


nujnu sapiens 


Unman co^rvwit&H rwwtetTl QPD TTl "NJfY 

riuman secreieu protein, ony 11/ inw. 

4571 


lift 


JO 


1738 


AF090942 


Homo sapiens 


PRO0657 


163 


55 


1770 


KJ 1 iZoo 


t-TObOplllld. 

m f* 1 a n f\ on cf 
lllCIullUgoolC/l 


cuapnanous proiein 


140 


/16 


174ft 


1 97498 


nuiiiti sapieiis 


reverse uauscTipuxSc 


1D& 






Oft948S 


rioniu sapiens 


numao secreieu protein, oDy ils inv^. 
6566. 


/ v> 


^4 


1749 


VI 97 17 
I Uf 1j 


iviub inuscuius 


rro-roi-uu 1 r cisc poryproiein 






1743 


ACG03682 


Homo sapiens 


R28830J 


179 


63 


1744 


Y6^t 6<* 

AO 3 1 03 


v oivox canen 


extensin 


17^ 
1 /J 


4Q 
Hy 


1745 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

44AO 


114 


62 


1 /ho 




xiomo sapiens 


Unman oa^rAfo^ nmtam FP\ "XT/^\« 

riuman secreieu proiein, oci^ ilj rvu. 
6720. 


SO 


A 1 
*fl 


1747 


UUj /O/ 


riomo sapiens 


Unmon oawaia/l nfAtain CCA fT"\ "HTA* 

riuman secreieu proiein, onvj iu inu. 
7868. 


1 1 s 

1 13 




1748 


I TQ7 5/15 


Homo sapiens 


putative pi 50 




17 


1740 


flft4ft67 


riomo sapiens 


riuman secreieu proiein, oxiv^ xxj inw. 
ft i/i a 


191 
1ZI 


4^ 


1750 


Y09000 


nuiiiu sapiens 


IT'po rrm anf nf linman ca/^rAtaH nrnfain 

riaguieiii ux nuuidu secreieu prmcui 
encoded by gene 12 1 . 






17S1 




rivjiiiu sapiens 


TiAiirrtTial t'liT'ooH TM'rt'frfrln A l~Y7r v _"XJ 1 't* 
iicuj uiiai iur cou pi uiciii r\u f in a r 


R7 
o / 




1752 


L27428 


Homo sapiens 


reverse transcriptase 


133 


42 




U7JJ /U 


riomo sapiens 


puiauve pi ju 


196 


^7 


1754 


U22376 


Homo sapiens 


alternatively spliced product using exon 

1 7 A 


134 


50 






riomo sapiens 


uivti, miikj /, puiauve transposase 
similar to pogo element 


17^ 


76 
/O 


i /JO 


ArwuOIO 


v/ryza saxiva 


simiiar co KiiNvj-riz ringer pro rein 
RHAla(AF078683) 


1 ?3 


6ft 


1757 


T I5478R 


1V1US UlUbCUlLLS 


wisKuu-/viuncu oyiiuruiiie r ruieui 


1 <I8 

1 JO 


44 


1758 


Y86579 


Homo sapiens 


Human gene 92-encoded protein 

frofttnani QCH TT> "MA* AO A 

iragmeni, ocy \u iNC/.*»yo. 


109 


88 


1759 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

rvui« 


123 


73 


1760 




nuiilll oapiciio 


riUlIiau gCIlC 7Z-"C11WJUCU piuicui 

fragment, SEQ ID NO:496. 


i n 


09 
y& 


1761 


i oqj iy 


nuiiiu sapiens 


it mil aii gene yz-cncuucu pruicin 
fragment WO TD NO-4Q6 

AiaQiMCiii, Oily iaj" iik/.t^u. 


1 1ft 


81 


1762 


U08020 


Mus musculus 


collagen pn>alpha-l type I chain 


105 


34 


1767 




numo sapiens 


XTUiLiaii bevreieu pi uiciii, ocy jjl^ inv. 
7R71 

/o / 1. 


Oft 


4^ 


1764 


U93569 


Homo sapiens 


putative pi 50 


148 


36 


176^ 
1 too 


V86948 


Homo sapiens 


riuman secreieo protem ri^rirroo, 
SEOIDN0163 


1 1 6 
1 10 


43 


1766 


X02873 


Daucus carota 


put precursor 


112 


47 


1767 


X92485 


Plasmodium 
vivax 


pval 


100 


45 


1768 


R95913 


Homo sapiens 


Neural thread protein. 


96 


65 



175 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 










Waterman 


Idcntit 










Score 


y 


1769 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


51 








4718. 






1770 


X97675 


Homo sapiens 


plakophiiin 2b 


115 


70 ! 


1771 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


58 








4718. 






1772 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


85 








6613. 






1773 


AF130089 


Homo sapiens 


PRO2550 


158 


69 


1774 


U23552 


Ailuropoda 


cytochrome b 


206 


78 






melanoleuca 








1775 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-MTP 


98 


53 


1776 


AF1 16715 


Homo sapiens 


PR02829 


134 


67 


1777 


AC008054 


Leishmania 


L8453.1 


1 14 








major 








1778 


AF037364 


Homo sapiens 


paraneoplastic neuronal antigen MAI 


397 


73 


1779 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSR 


132 


60 


1780 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


168 


64 








656L 






1781 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


80 


36 








similar to pogo element 






1782 


AF174482 


Homo sapiens 


poly comb 3 


133 


46 


1783 


U93563 


Homo sapiens 


putative pi 50 


196 


70 | 


1784 


AF010144 


Homo sapiens 


neuronal thread nrotein AD7c-NTP 


142 




1785 


AF194537 


Homo sapiens 


NAGI3 


1 14 

1 1*T 


JO 


1786 


U21123 


is 1 uj ksyii l la 


pnfl nnlvnpntiHf* 


ion 


AA 






melanogaster 








1787 


AF200187 


cercopithicine 


EBNA2-like protein 


108 


30 






herpesvirus 15 








1788 


G02872 


Homo s aniens 


Human secreted nrotein SEO ID NO" 


97 










6953. 






1789 


M64792 


Rattus 


salivarv oro line-rich nmtein 


198 


4ft 






norvegicus 








1790 


X92485 


Plasmodium 


pval 










vivax 








1791 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


1 14 
i it 


34 

i 


1792 


U49973 


Homo sapiens 


ORF 1 1 MER3 7 j putative transposase 


197 


81 








similar to doso element 






1793 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


71 








4409. 






1794 


AF104923 


Homo sapiens 


putative transcription factor 


142 


59 


1795 


W48351 


Homo sapiens 


Human breast cancer related protein 


133 


60 








BCRB2. 






1796 


AC003113 


Arabidopsis 


F240L6 


57 


62 






thaliana 








1797 


M22332 


Homo sapiens 


unknown protein 


118 


29 


1798 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


100 


71 








gene 44 clone HTDAD22. 






1799 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


154 


37 






major 








1800 


U93570 


Homo sapiens 


p40 


103 


56 


1801 


X99452 


Lycopersicon 


extens in-like protein Dif54 


101 


28 






esculentum 








1802 


L27428 


Homo sapiens 


reverse transcriptase 


102 


34 


1803 


L27428 


Homo saoiens 


reverse transcriotase 


141 




1804 


Mi 8933 


Mus musculus 


alpha- 1 type-Ill collagen precursor 


118 


30 


1805 


X92485 


Plasmodium 


pval 


106 


67 






vivax 








1806 


G03411 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


118 


65 








7492. 







176 
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SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1807 


G04091 


Homo sapiens 


Human secreted nrotein SEO TD NO* 
8172. 


105 

JvJ 


j j 


1808 


X97675 


Homo sapiens 


plakophilin 2b 


154 




1809 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


130 


53 


1810 


X92485 


Plasmodium 
vivax 


pval 


133 


54 


1811 


G03807 


Homo ^aniens 


Human ^prret^H nrnfpin ^PO ID XfO* 

7888. 


191 


oo 


1812 


Y17833 


Human 
endogenous 
retrovirus K 


env protein 


119 


81 


1813 


AF1 19851 


Homo saniens 


PRO 1722 


IjU 


SR 

Do 


1814 


X53581 


Rattus 
norvegicus 


ORF4 

VIM *T 


1Sfc 
1 JO 


sn 


1815 


G03473 


Homo ^aniens 


Human cprrptpfi -nrntpin QRO ITJ "MO* 

7554. 


111 




1816 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


153 


68 


1817 


M19155 


Plasmodium 
i way lyviimii 

falciparum 






JU 


1818 


AF 118082 


Homo sapiens 


PRO1902 


90 


75 


1819 


W40353 


Homo ^aniens 


Hitman iincoppifiAH nrrktf»in ■fi-nm 

XXUXLLCU1 LUOpCViXlCU JJ1 vJlClil U ULI1 

US5702907 


i in 


SO 


1820 


U93563 


Homo sapiens 


putative pi 50 


114 


35 


1821 


U41538 


Caen orhabd 

v/UWI IV/i >1C< V U ALIO 

elegans 


nrnline rirh 




so 

DjL 


1822 


AF010144 


Homo saoiens 


neuronal thread rwntpin A r>7r-1^'TP 

UVtu V/iuU till VsCUX l/lUlvllI tWJ t\» \r\ lx 


1S4 


*tj 


1823 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


131 


53 


1824 


AF130089 


Homo sapiens 


PRO2550 


178 


*tu 


1825 


AF090944 


Homo sapiens 


PRO0663 


103 


45 


1826 


AC003U3 


Arabidopsis 
thaliana 


F2401.18 


107 


40 


1827 


AF1 94537 

ill 1 jTJyJ i 




IN /\VJ I J 


us 

OJ 


no 
Z6 


1828 


AF009668 


multinlf* 
sclerosis 
associated 
retrovirus 


inn 1 vnrnt *» i n 


1 as 
loJ 


A 1 

41 


1829 


AF016099 


Mus mus cuius 


endonuclease/reverce tran<;crinta<:e 


155 


A? 


1830 


X69465 


Sus scrofa 


ryanodine receptor 1 


516 


86 


1831 


G00454 


Homo sapiens 


Human ^errpfpH nrnfpin *\PO in "KIO* 
4535. 


1UO 


An 


1832 


U88966 


Homo sapiens 


raDamvcin associated Drotein FRAP2 


434 


8Q 


1833 


M19155 


Plasmodium 
falciparum 


S-antisen Drecursor 


105 




1834 


AF085809 


Mus musculus 


synapsin lb 


98 


33 


1835 


AK023003 


Homo sapiens 


unnamed protein product 


393 


81 


1836 


Y41740 


Homo sapiens 


Human PRO701 protein sequence. 


429 




1837 


M36913 


Zeamays 


cell wall protein (put); putative 


72 


3S 


1838 


X63005 


Mus musculus 


proline-rich protein \ 


98 


40 


1839 


X83413 


Human 
heroesviriis 6 


U88 


149 


45 


1840 


AF134304 


Homo sapiens 


Scar2 


sn 

o / 


J J 


1841 


AC024772 


Caenorhabditis 
elegans 


contains similarity to Mus musculus j 
alpha-NAC, muscle-specific form 
(GB:U48363) 


131 


25 


1842 


AB002366 


Homo sapiens 


KIAA0368 


153 


75 


1843 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


140 


47 



177 



WO 01/64835 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


1844 


AL035526 


Arabidopsis 
thaliana 


extensin-Jilve nrotein 




33 


1845 


D26156 


Homo sapiens 


hSNF2b 


91 


34 


1846 


M14228 


Gallus galius 


c-beta-3 beta-tubuiin 


598 


83 


1847 


AK022217 


Homo sapiens 


unnameH nrntetn nmrfiifif 


97 


JO 


1848 


AJ250042 


Homo sapiens 


Rab5 GDP/GTP exchange factor 
homolopiift 


174 


83 


1849 


W23949 


Homo sapiens 


Human nhnsnhoinnsitiHe 3fYH-lrinase 
dIOI subunit 


143 
i*tj 




1850 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


101 


63 


1851 


AB017114 


Homo saniens 


AD 3 


1 13 


inn 


1852 


D00570 


Mus mus cuius 


onert rearfina frame f751 AA1 

v/jj^ii i vauiiig xi cull v y*--s l Art j 


174 




1853 


A67510 


Mils musculus 


MUS Ml f^n IT T TS (iFNOMTP DMA 
CONTAINING N ALLELE OF FV1 
GENE. 


1 33 

ID J 




1854 


U49974 


Homo sapiens 


mariner transposase 


214 


82 


1855 


U93569 


Homo saniens 


p40 


95 


31 

31 


1856 


D89729 


Homo sapiens 


CRM1 nrotein 


475 


on 


1857 


AF090895 


Homo sapiens 


PRO0117 

A JLV-VwxV/ A A J 


89 

07 


36 


1858 


AFO 15926 


Homo sapiens 


ezrin-radixin-moesin binding 
nhosnhonrotein-Sfl 


117 


73 


1859 


D13721 


Gallus gallus 


NF-kB p65 subunit 


223 


56 


I860 


K03204 


Homo ^aniens 




1 1 1 
ill 


37 


1861 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMRO30 


195 


52 


1862 


U49973 


Homo ^aniens 


ORF1 ' MFR37* rmfrative trancnncace 

similar to pogo element 


3**1 




1863 


G04091 


Homo saniens 


Human serreterl nrotein ^Pf*> IT} T\JO* 

sin. 


so 

ov 




1864 


L27428 


Homo saniens 




R4 


31 


1865 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


106 


46 


1866 


U53585 


Mycobacterium 
avium 


fibronectin attachment protein 


86 


36 


1867 


AF255446 


Crypthecodinium 
cohnii 


Dip 1 -associated protein C 


134 


45 


1868 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


81 


63 


1869 


AP000001 


Pvrococctis 
horikoshii 


235aa lnnt* hvnothetinal nrotein 


IvO 




1870 


M13100 


Rattus 
norvegicus 


unknown protein 


121 


53 


1871 


X67863 


Mus musculus 


T2 


101 


35 


1872 


S80119 


Rattus so. 


reverse transcriptase homolog 


151 


43 


1873 


W73633 


Homo sapiens 


Human secreted protein clone. 


140 


44 


1874 


U57053 


Homo sapiens 


myosin-ID 


203 


82 


1875 


W48351 


Homo sapiens 


Human breast cancer related nrotein 
BCRB2. 


92 


54 


1876 


M21097 


Homo saniens 


CD19 differentiation antigen 


432 


79 


1877 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


150 


52 


1878 


U25281 


Rattus 
norvetncus 


^T-T3 rinmain hinrlin** nrotein 

iJLXJ UVJlllcuil UllILUllg £JlvJlClll 


IVO 


30 


1879 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


117 


48 


1880 


X73113 


Homo sapiens 


fastMyBP-C 


599 


77 


1881 


AX028128 


Homo sapiens 


unnamed protein product 


162 


43 


1882 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


62 



178 
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SEQ n> 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Ideatit 

y 








7870. 






1883 


Y00664 


Homo ^aniens 

AAV/llAV/ OuA-Sl V* UUJ 


onen reading frame 1 (AA 1 - ftn~i 

WUwli 1 vttUIIIK AA CAIAA*— A 1 Uvff 


74 

/*T 


14 


1884 


U86587 


IrlUJ 111 UOvU IUO 


nliosijhatiHvl inositol 1-lcina^ft cfltalvHr* 

UUVOI/IIOUUV Aa1aV/911wA J JVAllOOO MMfllV 

subunitpl 10 delta 




77 


1885 


X67337 


Homo saniens 


Human nre-mRNA cleavage factor I 6R 

AAUAA1CAAA Ul w 1AAAXA 1 XX VlV/flYUtl/ AOAvLvA A VrO 

kDa subunit 




17 


1886 


Y13829 


Homo sapiens 


MBNL nrotein 


117 


63 


1887 


G01931 


Homo sapiens 


Human secreted d rote in SEO ID NO* 
6012. 


119 


80 


1888 


RS5913 


Homo sapiens 


Neural thread protein. 


91 


64 


1889 


L39059 


Homo saoiens 


transcrioHon factor SL 1 


92 




1890 


M69297 


Homosapiens 


ORF3 


169 


48 


1891 


R95913 


Hnmn Q^nipnc 


^Jpiir^l thrpc»rl nrn^Ain 
IN ecu aJ lill Call pi uiCJJJ. 


01 




1892 


Z2S2GI 


Saccharomyces 

vCl O V Id IQXZ 


ORF YKL202w 


95 


51 


1893 


G0Q427 


AlvlIlU <M^/lCUo 


Ritmnn Gf*rTP*tf*A nrntpin CpA TTl lSJO* 

4508. 


on 


J 1 


1894 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 

similar to nnoo pIpmAnt 

OAAAlllCU K.KJ ^JKJQKJ GIGaAAGAaI 


183 


73 


1895 


Y10055 


Homo ^aniens 


nho^nhoino^iHHp l-Hna^fi 




OA 
OH 


1896 


AF093775 


Miic miic/Niliic 


alnha-artinin 1 


j i j 


O J 


1897 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


82 


66 


1898 


M21904 


Homo ^anien^ 


4T«9 h<*y*w phnin nnfiopn 
*t* *m liwcivjf uua.ui antigen 


jOu 


ko 

Ow 


1899 


AB028997 


Homo saniens 

lXvllIV «9UL/lV>AAO 


KIAA 1 074 wotein 


14S 


3R 

JO 


1900 


U97553 


murid 

herpesvirus 4 


tinlrnown 




41 


1901 


AB011142 


Homo sapiens 


KIAA0570 protein 


209 


95 


1902 


AF194537 


Homo ^yiTiien*; 


NAG 13 


161 

1U1 




1903 


U93564 


Homo sapiens 


putative pl50 


142 


33 


1904 


M29399 


r iu oaa^jiguo 


ciyuuutjic iiicnipraiic piULCiii uouu. h~£ 




on 


1905 


Y27400 


llnmA CAntATIC 
jrkKJLliiJ OCLylGlLo 


rilUUail a H J v XCUUwlaoe iUuCUOUaJ 


904 


0/ 


1906 


Y28503 


Homo ^anieriQ 


HfiFHl Human (irowtfi Fnrtnr 

A AVJ A lij flUUtnU VJtUnlU val/lVJl 

Homologue 3. 


io / 


inn 


1907 


AF128625 


Homo ^aoienQ 


fTV^AO-KtnWitia nrntAin Winacp> H/*to 
V^i->V->*TZ. i/iiiviiiig jui uiciu Mil do c ucia 




o7 


1908 


AB029147 


Cucumis sativus 


expressed in cucumber hypocotyls 


98 


41 


1909 


XI 3783 


Hnmn canipnc 


cupiid-i type l, uoiiagen ^/ if 


87 
o / 


41 

HJ 


1910 


AF24063G 


A/Tn« miicr*iiliic 
lriuo iXittoUliluo 


II1UUUL V/UniaHting VJ 1 rabC ai/UV<Uing 

protein 1 




4R 
HO 


1911 


AF 129075 


Homo Qflni^nQ 


SUBUNIT (TCP-1-THETA) 


^1 1 


00 
y\j 


1912 


AF011450 


lV4ii55 milieu lus 


rvrn* W nnllAC^n 
tjrpo v iA/ii<tgvii 


87 


9R 


1913 


Y14482 


Homo ^anierK 


Fraomprif of human cpfrefcH nrntpin 

encoded bv eene 17 


194 




1914 


AF171230 


Vigna 
unguiculata 


phosphatidic acid phosphatase beta 


112 


53 


1915 


AF0 16099 


Mus musculus 


endonuclease/reverse transcriptase 


100 


47 


1916 


M94131 


Homo sapiens 


mucin 


97 


37 


1917 


X13885 


Nicotiana 
tabacum 


extensin (AA 1-620) 


120 


34 


1918 


AF 186605 


Homo <ianien<; 


Mf L2 nrotein 


1 1 


70 


1919 


M12130 

ATA lAla/V 


IMllQ miiof*iilii<! 

1TAUO ILlUOvlilUj 


T?"WA nnlvrnprii^** TT 
AVi^iiT. puijiucicuv AA 


4QK 




1920 


AL049794 


Homo sapiens 


dJ777L9.1 (novel protein similar to 
mouse kines in-like proteins KIF I A and 
KIFIB) 


514 


90 


1921 


D83703 


Homo sapiens 


peroxisome assembly factor-2 


233 


64 


1922 


Y11922 


Homo sapiens 


Human 5 1 EST secreted protein SEQ ID 
No: 522. 


162 


77 



179 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Ideotit 

y 


1923 


AC002481 


Homo sapiens 


similar to nitrogen permease regulator, 
similar to P39923 (PID:g730170), 
match to AA233630 (NlJJ.gl<$5oS33) 
ana A/V07740Z {isii^.gzu33 14 /) 


223 


79 


1924 


D82060 


Homo sapiens 


membrane protein with histidine rich 
cnarge clusters 


115 


40 


1925 


U49974 


Homo sapiens 


mariner transposase 


195 


61 


1926 


S80119 


Rattus sp. 


reverse transcriptase homolog 


104 


40 


19// 


000354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


*TA 

79 


68 


1 aoo 

192o 


S796j9 


Homo sapiens 


EXTl=putarjve tumour 
suppressor/hereditary multiple 
exostoses candidate gene 


430 


88 


1010 


A3o003 


Brugia pahangi . 


major protein component of the 
mico filarial sheath 


1 AA 
104 


A*} 

43 


1930 


AFH9855 


Homo sapiens 


PR01847 


132 


64 




AJ005577 


Homo sapiens 


6-phosphofructo-2-kinase 


326 


AA 

90 


ion 

iy3z 


A* 02026 1 


Santalum album 


proline rich protein 


A^7 

93 


33 


1V33 


AJ2 /2ZU4 


Homo sapiens 


hypothetical protein 


321 




1934 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
KlBUNULLfcUrKUlbLN u 


116 


39 




Aoly /l 


unidentified 


MCSP 


32o 


79 


1 no/; 


f TAO C£A 

U9J5o4 


Homo sapiens 


putative pi 50 


217 


54 


1937 


U97553 


murid 

herpesvirus 4 


unknown 


113 


38 


1938 


AF194537 


Homo sapiens 


NAG13 


90 


37 


1 A** A 

1939 


AF226044 


Homo sapiens 


HSNFRK 


403 


85, 


1940 


X83413 


Human 
herpesvirus 6 


U88 


303 


50 


i rk a i 

1941 


AJ007628 


Rattus 
norvegicus 


ELK channel 1 


112 


38 


1 A^O 

1942 


AF 1 04328 


Arabidopsis 
thaliana 


cell wall-plasma membrane linker 
protein homolog 


135 


35 


1 QA1 


Ar 130089 


Homo sapiens 


PRO2550 


93 


63 


1 QAA 
1U44 


U031 33 


Homo sapiens 


T T ^ j\. — 1 ^ a * pn/\ ill VT/'N. 

Human secreted protein, bbQ ID NU: 
7214. 


1 AO 

102 


34 


194!) 


iv /f i aa i o 
M. 194 19 


Mus musculus 


proline-rich salivary protein 


121 


40 


l;/4o 


Uj73 lo 


Homo sapiens 


histone acetyltransferase 


132 


73 


1QA7 


ATI /ciom 


Homo sapiens 


numan type aviu collagen 


fy 


^A 

34 


l74o 


ArSOZU /4o 


Arabidopsis 
thaliana 


protein kinase-like protein 


11/ 


30 


1QAQ 


AJUl I /jo 


Homo sapiens 


frill k 

mi id 


zvy 


53 




Aril 1 Ul*4- £ * 


xioroo sapiens 


neuronal uireau protein /ylj / c-in ir 


i 

133 


AR 
HO 




API 1 CK/^Q 
nr 1 17307 


xiomo sapiens 


patcnea l 


100 


oy 


1952 


AB002107 


Homo sapiens 


hPer 


118 


39 


1QC7 


A70534 


Homo sapiens 


zinc finger protein Hsal2 


430 


7A 

/O ! 


1QCA 

iy3*f 


I iyo4i 


Homo sapiens 


oaKj iu jjy uom WvJ77Z/Z43. 


O/i 


04 




uyjDoy 


Homo sapiens 


putative pi 50 


ID/ 


44 




nflA AOl 


Homo sapiens 


Muman secreted protein, otiv£ iu NO. 
01 /z. 


03 


33 


1957 


ABO 13729 


Mus musculus 


semaphorin Y 


106 


38 . 




A /"'AACa/IA 


Homo sapiens 


17 A A 

rAA 


33J$ 


OO 


1959 


AJ223075 


Homo sapiens 


TRIP protein 


598 


95 


1900 


AC004022 


Homo sapiens 


serum paraoxonasearylesterase 3 


147 


62 


1961 

1 7U I 


a xi(\icni& 

ATU/O/ /0 


urosopnua 
melanogaster 


neiicase L'v^miinvj a. 


13 / 


43 


1962 


AF265555 


Homo sapiens 


ubiquitin-conjugating BIR-domain 
enzyme APOLLON 


587 


77 


1963 


AF229642 


Mus musculus 


DXImx46e protein 


127 


91 



180 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


1964 


U72520 


Mus muscuJus 


mena protein 


89 


31 


1965 


W40309 


Homo sapiens 


Human ITAK protein. 


179 


32 


1966 


AJ388S57 


Ponio fiimilmriQ 
v^dlllo xoiliiucuio 


zauu linger protein 


£9£ 




1967 


1 1 -J 


XXi/iiiV OaX/IC»"0 


Human OVPT7-19 


994 




1968 


Y 17852 


X-IllTTl QT| 
XX Ui 11 all 

endogenous 
retrovirus PC 


pui pruieiu 


1 Sil 


AO 


1969 


Y41245 


Homo sapiens 


Human Y218 protem. 


220 


78 


1970 

17 l\f 


ftJOVJ* /JO 


olio ouuia 


OXU«U3 




5D 


1971 

17 1 1 




UluliCUlCl 


vuxiiaxxio suniicu ny io puyiutyauixi/caiiy 

UV/VlUlill-llKC UlOLCiU^^CllC iU.IVl7ri I ~J 


lUO 


19 


1972 

17 /Z. 






Human c#»rr*H-#»H nrnfAm QpO TF\ XJO* 

xiiuiiaxi bet/rcieu proiem, ojcv^ ul/ ii w. 
4414. 


199 

1 Z.Z, 


^7 


1973 


AL3 57472 


iwiiiv oai/ivuo 


VPS33B 


1 19 


11 


1974 


AF151902 


Homo ^aniens 


CGI- 144 nrotpin 


1 17 

l IX 


OS 


1975 


AL137260 


Homo ^aniens 

XXl/XIlxr OCl^JiVIlO 


hvnotherica! nrntpin 


148 


QO 


1976 


LT89505 


Wotyio oarvipnc 
xivriixu oaLsidio 


XXlCUIV 


4flK 

*fUO 


RQ 

07 


1977 


U67328 


\4^iic mu^ciilus 


NTPT-like nrotein 




71 


1978 


AK026435 


Homo sapiens 


unnamed protein product 


601 


94 


1979 

17 f 7 


Y141 18 


xXtflxxl/ OCtplCrl(5 


pcIUXxaUxIlai auipUI LCl 


^ft-7 


yo 


1980 


UvJOjv 


nUIllU 2><ipiCLU> 


MPK/T4 

lVx^lYlH- 


^ /U 


on 


1981 


AF1 18000 
rVi 1 l OU7U 


XiUillU odpicuo 


PRO9044 




54 


1989 




numu bdpicixb 


kj^/kjsj small nuciear rjoonucicoproiein 
hPrp3 




A1 


1981 


AR011 154 


xxuiuv jap ic lib 


1CTAA0SR9 nmtpin 


49 n 


oU 


1984 


A B0 11429 


iiuuiu aajjiwiio 




901 


o/ 


1985 


AL390156 


Homo sapiens 


hypothetical protein 


230 


97 


1986 


A 194954ft 


Y OIYOa. CailCn I- 

n ii cm n An c i c 
XlClgUl lciiaxo 


nyaroxyproiine-ncn glycoprotein xj/l," 

XiXxVJx 




4/1 
44 


1987 


Ml 2523 

IV X X£, J£*J 


XTXlS 1X1 17 OCLL/lGllO 


o] lr*fMrYiimin Vpn^7ifi 
aXIVcUUUIlXiil Y CllCZila 


41 1 




1988 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


63 


1989 

I7U7 


X JU1JU 


UAmn eanipnc 

XXVXlllU StaLIlGlId 


Hlimnn et*f*ri*tc*r\ r\»r»tp>i*n OD 51 
XX Hill all ^CvieLCU piUiCUl trx.O. 


1 <V7 


5U 


1990 


AF1 61356 


Homo sapiens 


HSPC093 


156 


57 


iyy x. 




nomo sapiens 


xiuman secreteu protein, ocxi lu inu. 
7S94 




(1 


1992 


AF1 19851 


Homo sapiens 


PR01722 


126 


52 


109^ 
lyyo 




xiomo sapiens 


numan secreieu proiein, oeKc *^ inl/. 
7214. 


TOO 

Zzo 


ou 


1QQ4 




xxuiiiu sapiens 


neuronal tnreau protein au / c-in i jr 


1 1 1 
1 1 J 


4U 


1995 


fi07469 


XXvPXllvr aapiCUo 


Human c<*nv»tf»/1 rvrr\t<=»in QPO TY\ "MO- 

xxiuiicui bcui eieu proiein, ocy jlu inv-'. 
6550. 


i 16 


CO 
JO 


1996 


G00328 


X lvJlllx/ CKlpi^llj 


Hitman ^^rrpfpfl nrntpin WO Tfi WH- 
xxuiiicixi oevjeveu pi utcm, ojcv^ xi_/ i>u. 

4409. 


1^7 
I J / 


J / 


1997 


X75068 


XJ\JO LCLLll Uo 


nlflcmal^mimnl nnrm 
piaouiaiexixiiicu ixi 


OJ 


OJ 


1998 


Y64869 


fTomo cftnierw 


Human S* F^T related nolvnentiHe ^PO 
xxuixiau j coi icioicu pviypcL/uuc ox_»v^ 

ID NO* 1030 


09 




1999 


Y91577 


Homo saoiens 


Human secreted nrotein ^eonenre 

AXUlliCUX SvVJvlVAl UlUUolll dWUUvll\<C 

encoded bv eene 2 SEO ID NO-250 


275 


7Q 
iy 


2000 


API 18078 


Homo saniens 


PRO 1 848 


1 1 1 
in 


57 


2001 


S80119 


Rattussp. 


reverse transcriptase homolog 


120 


61 


2002 


AB011110 


Homo Qanien<! 


KIAA0538 nrotein 


146 


71 


2003 


Ml 5530 

IrX X J JJ\f 


X XKJlll\J OO-LJl^llO 


orrkW+Vi fiir+Ar 
X>^v>wXX ^1 w Vr ul XCUrli/i 


04 

y*r 


04 


2004 


AF225918 


Mus musculus 


intestinal cell kinase 


216 


77 


2005 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


125 


85 


2006 


D84391 


Mus musculus 


reverse transcriptase 


135 


38 1 


2007 


AF090930 


Homo sapiens 


PRO0478 


101 


86 



181 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2008 


G03807 


Homo saoiens 


Human secreted nrotein SRO TD NO* 
7888. 


156 


lie i 


2009 


Y02671 


Homo sapiens 


Human secreted orotein encoded bv 
gene 22 clone HMSJW18. 


165 




2010 


AF1 13685 


Homo sapiens 


PRO0974 


124 


63 


2011 


AF130079 


Homo sapiens 


PR02852 


141 


78 


2012 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


94 


51 


2013 


AF090944 


Homo ^aniens 


PRO066'? 


Of 




2014 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 

derived frnm nMA^^R 


151 


58 


2015 


G03797 


Hottio <?anien<i 


Human c/vrAtpH r&mtfMTi QCO TY\ XTfV 

7878. 


77^ 




2016 


G02485 


Homo Qanieni 


Uiinian c*»or»tfl>/4 nrrttAtn QCO TT^ "MTV 

nuiuaii cjcui c leu pru icin, otiv^ wj iNvy. 

6566. 


117 


/o 


2017 


L26953 


Homo QanipnQ 


/-> f-» rrtm pio r\rr» d 1 r\twf'<*i*i 

ciuuiiiubumai pruicin 




/4 


2018 


G03646 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7777 

/ ML 1 . 


68 


57 


2019 


G03646 




Human as4 nrMoin CCA TT*k XTfY- 

xiurnan secreiea proxein, odv "-i JNL/: 
7727. 


04 


42 


2020 


U49973 


WfMYiri caniVn q 


v^tvT i , ivir/Kj /, puiauve iransposase 

qittiiIht tri rv~\or\ plAmAnt 

OUUllcU WJ |JUgU ClCJIlClll 


334 


o3 


2021 


L26953 


Homo sapiens 


chromosomal protein 


112 


64 


2022 


W48351 


TTrvmn canmno 


xiurnan Dreast cancer related protem 
BCRB2. 


ion 

139 


69 


2023 


U93569 


IlvfillU oapiCUo 


pUiailVC pi j\) 




OA 

89 


2024 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7X74 


80 


72 


2025 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


142 


71 


2026 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


77 


66 


2027 


AJ242540 


VoIvoy f*jirtpri € 

nagariensis 


uy uruxyproiine-ncn glycoprotein JL/Z«- 
HRGP 




15 


2028 


U49973 


xiuiuu japiciio 


III? 17 1 * Rill? U^^f* mifofttfA fmn^v\Anrv/i/i 

um i , MDio /, puuiuve iransposase 
similar to pogo element 




on 


2029 


AF1 94537 


Homo sapiens 


NAG13 


158 


57 


2030 


L26251 


Trypanosoma 
bmcei 


CR5 


no 


41 


2031 


AF197913 


Helicoverpa 
anuigeni nuciear 
polyhedrosis 
virus 


basic DNA-binding protein BDBP 


149 


55 


2032 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


80 


2033 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795 


157 


82 


2034 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


133 


75 


2035 


G03646 


Homo sapiens 


Human secreted nrotein SEO TD NO* 
7727. 


104. 


o / 


2036 


G00328 


Homo ^aniens 


Human <;prrpted nrotpin ^T?0 W\ "KTfY 

4409. 


117 


71 
/ 1 


2037 


AF1 30089 


Homo saniens 


PRO2550 




JJ 


2038 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


81 


73 


2039 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


66 


2040 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


159 


68 



182 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Ideotit 

y 




tin 1*379 


jTHJiIIU Sap I Clio 




95 


79 


ZCrtZ 




Human 
riuiiiaii 

herpesvirus 4 


RYRF1 mryvipc PRNA-2 ^Dambau^h 
et al, 1984; Dillner et al, 1984) 


108 


11 


ZU*K? 




nun Iv sapiens 


Human <^rrptpri nrotein SEO ID NO' 
6596. 


125 


65 


ZU*F* 


API 1RAR6 


\-Jf\TT\ n canipnQ 
AXUillU SapiCild 


PRO 1999 


103 


64 


2045 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

vOlJ. 


146 


64 


2046 


W50192 


Homo sapiens 


Amino acid sequence of salivary 

F>rnfr»in rON-l 

pi UICJJJ V>V-»1 1 1 . 


108 


51 


2047 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638 


133 


68 


2048 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 

JLJi>/\ J UOUipiClC VUlvll gCUUiUC 


155 


64 


2049 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
buniiax to pugo eienicui- 


125 


53 


ZXJDkJ 


AT ^Qnl Id 


LiC ISilmcUlIa 

major 


*>Ytr*>mf»1t/ /*vctiPifi<»/v5?I?np rir*h tirnfpin 
cAucuicjy i/jroLwuic/ vai iuc uui yi kjujux 


1 12 




ZUJ J 


VJUJZUJ 


jrioiuo sapiens 


T-Titmsin cpTTpfpri nrntpin QpH TFi T\IO* 
nUUlaU SCvIClCU. piUiviU, ODy J.JL/ 1NU. 

7284. 


70 


69 


IX) J J. 


Y/ClOa^ 
AOllHO 


nyara sp. 


trim i— liio^n 
IL11I1 JagCll 


99 


46 


2053 


X83413 


Human 
nerpesvirus o 


U88 


148 


73 


2054 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

(COM 


75 


63 


ZUJJ 


nn9^9 

VJUZjjZ 


rioino sapiens 


Wnrrnan c#»r*r«*tfvi r%rr*tAin ^PH TFI iM{^)- 
JTllillldli Sd/ldCu piviiciu, jcy IX/ Hv. 

6613 


lOd 


7S 


?fl56 


XI 6594 


O iftvfwfp Hum 

JL/ 1\# L jr 1 IU1 1 

discoideuxn 


rnriin^ reirioti TAA I - 437) 


104 


51 


90S7 






Nptiral thrf*af1 Timtpin 
A^iwUiai uii^au jji\f\\ii 1 1 ■ 


83 


77 


2058 


AF 118080 


Homo sapiens 


PRO1880 


126 


61 






xtujuo sapiens 




136 


59 


2060 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OJOO. 


120 


70 


ZUOl 


Yft 1 0 1 R 


jL/rosopniia 
melanogaster 


boJivdj y gioQU giue pruicni 


179 
l ty 


40 


ZUOZ 


UUUUZ7 


OavCuaTOmyceb 

cerevisiae 


Vhr9 1 7rn 


\ 10 


46 


ZUOJ 




flOluU bapicllb 


PTJnOROX 

r R.V/V070 


14"? 


RO 


9fl£a 
ZUCK4 


AT UUO vO I 


jtiomo sapiens 


putcenLai giowui nunnouc isuiuiin 
hGH-V3 


191 


OJ 




VJVZ*tU7 


nomo sapiens 


Human cprrpfpri nrnfp in SFO TD NO* 

6490. 


129 


66 


2066 


U82303 


Homo saniens 


unknown 


92 


68 


2067 


AJ242540 


Volvox carteri f. 
nflffflrifinsi*; 


hy droxyproline-rich glycoprotein DZ- 
HTIGP 


368 


79 


2068 


L27428 


Homo saniens 


reverse transcriptase 


186 


58 


2069 


AF 130051 


Homo saniens 


PRO0898 


164 


73 


2070 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to do*?o element 


271 


73 


2071 


AF130051 


Homo sapiens 


PRO0898 


110 


56 


9fl79 


/VTU 1 UlTT 


riAJUiU sapiens 


npinnnal thrpad nrotptn AD7r»NTP 

UwLU KJiSCil UU WIU pi wL^Ul r\±J IV 111! 


117 


67 I 


2073 


AF157706 


Human 

neipos virus od 


B4 


104 


49 


2074 


AT 049608 


ArahidoiiQis 

/•J UU lUWUOlO 

thaliana 


extens in-like nrotein 

VAI VI Will AJUVV* M* VtiV^iAA 


120 


41 


2075 


AP002460 


Arabidopsis 
thaliana 


gene_i±F 1 D9.26~unknown protein 


150 


27 


2076 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


130 


85 



183 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith. 

Waterman 

Score 


% 

Identit 

y 




Ml 1 KQ7 


ivius museums 


proime-ncn salivary protein 


IOC 


42 






nomo Sapiens 


Human secreted protein encoded oy 
gene zz cione ruvioJ wio. 


OO 


75 


2079 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


185 


47 




/VJUUOh / v 


Homo sapiens 


cartilage-associated protein (CASP) 


13V 


OA 

84 


9fiR1 
ZUO 1 




Homo sapiens 


unnamed protein product 


132 


8:> 


2082 


U93564 


Homo sapiens 


putative pi 50 


137 


67 


one** 
2U53 


U02332 


Homo sapiens 


Human secreted protein, obvj ID NU: 
6613. 


99 


71 


/A) OH 


D013/2 


Homo sapiens 


Neuron-associated protein. 


t in 

148 


81 


one** 


AT 1 £Am 

AJ-. 1 003 / 1 


Leishmania 
major 


probable (nnv-o) ul 102, variant a 
DNA, complete virion genome 


1 1 A 

114 


76 


20oo 


T fA C\(\T) 

U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


255 


63 




Ar 1 94j3 / 


Homo sapiens 


XT A 4** 1 1 

NAG13 


122 


78 


2088 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


108 


74 




A 171 1AA*7Q 

Ar 1300/9 


Homo sapiens 


T1DAOOM 

PRU2852 


1 in 

149 


77 


2090 


U93574 


Homo sapiens 


p40 


260 


96 


2091 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


99 


76 


2U92 


A T? 1 A A f OT 

Ar 194537 


Homo sapiens 


NAG13 


125 


51 


2uy3 


T T/IAATI 

U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


319 


72 


2094 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


159 


56 


2U9j 


T TA-5 CT> 

U93572 


Homo sapiens 


putative pi 50 


195 


57 


2096 


X83413 


Human 
herpesvirus 6 


U88 


132 


57 


2U9 / 


Ar 010400 


Homo sapiens 


transaldolase-related protein 


463 


89 


2098 


U93563 


Homo sapiens 


putative pi 50 


128 


35 


2099 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


160 


43 


2100 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


133 


43 


2101 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


97 


71 


2102 


K02401 


Homo sapiens 


chorionic somatomammotropin 


628 


90 


2103 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


141 


80 


1 1 A/1 

2104 


U49973 


Homo sapiens' 


ORFl; MER37; putative transposase 
similar to pogo element 


309 


63 


2105 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


192 


37 


n a*c 
2100 


Y59772 


Homo sapiens 


Human normal ovarian tissue derived 
protein 49. 


261 


89 


ziu / 


Ar 20203 1 


— — - 

Homo sapiens 


lNM23-xio 


OoO 


1 AA 
100 


2108 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


127 


54 




F 77/V7Q 


-—■ : 

Glycine max 


hydroxyproline-rich glycoprotein 


121 


30 


2110 


D26135 


Homo sapiens 


diacylglycerol kinase gamma 


172 


100 


9111 
2111 


U499/3 


Homo sapiens 


UKr 1 , MJiK3 7j putative transposase 
similar to pogo element 


159 


56 


2112 


X65488 


Homo sapiens 


hnRNPU protein 


117 


70 


2113 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


116 


35 




A pi 'JAACl 
/\T 1 jUUJ 1 


Homo sapiens 


JrivwOoyo 


yj 


02 


2115 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


260 


53 


2116 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


148 


70 



184 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Water ma ii 
Score 


% 

Identit 

y 


2117 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


304 


71 


2118 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


70 


66 


2119 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


118 


84 


2120 


S79410 


Mus museums 


nuclear localization signal binding 
nrotein 


94 


94 


2121 


U49973 


Homo saniens 


ORF1 * MFR17* mitative tran<avrca<?e 
similar to pogo element 


304 


73 


2122 


Ml 1901 


Rattus 
norvegicus 


nrnlinp— rir»li ^nlivarv nmtpin 


81 

OJ 




2123 


AK025047 


Homo «w*nienQ 


iinnamftH r\rr\tf»in r»mrlnr+ 
umuuiivu yi uicui yi uuuti 






2124 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452 


95 


72 


2125 


AF138883 


Bos taurus 


type II collogen cyanogen bromide 


103 


40 


2126 


Y86248 


Homo <uir>i£*n<; 


Human cprrpted nrntpin H<"" , H~PFfi& 

SEQIDNO:163. 


186 


inn 


2127 


G03114 


i±\JUlKJ Oa|/lUlo 


tinman cpprrtral nrntpin ^PO TD MO* 
7195 


1 17 




2128 


AF194537 


Homo sapiens 


NAG13 


126 


66 


2129 


ABO 15802 


xylinus 


cimiljir tr> opIIiiIivza fnmnlpmf^nHncr 
auuiicu iv ucnuiuic vuiiipiciiiciiiiijg 

protein of A. xylinum ATCC23869 


100 




2130 


AF187147 




drehrin A 

Ui. wl HI A 


1 10 

1 1 V 


1R 


2131 


W54966 


Homo sapiens 


Synthetic human type III collagen 
SYN-C1 

will 


148 


42 


2132 


L36341 


AsnerPi litis 
nidulans 


rppiilatnrv nrntein 


no 




2133 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


119 


58 


2134 


AF1 16689 


Homo sanierK 


PR02168 


1 13 

1 1 J 




2135 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


191 


38 


2136 


M81321 


fascicularis 


tji uiiaic-i il>ju jjivusiii 




di 


2137 


A18812 


Rrassiffl ruirvii<; 




106 




2138 


AL359782 


Trypanosoma 

%JX Uvwi 


probable similar to ring-h2 finger 

LVl wlGlll I Lid I U. 


104 


41 


2139 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SFO ID NO- 163 


191 


92 


2140 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO- 163 


J96 


90 


2141 


Y57285 


Homo saniens 


Human GPCR nrotein fHGPRP^ 
sequence (clone ID 2214673). 


507 


78 


2142 


AJ242540 


Volvox carteri fl 
nagariensis 


hvdroxvDroline-rich fflvcoorotein DZ- 
HRGP 


201 


47 


2143 


AF090944 


Homo sapiens 


PRO0663 


164 


51 


2144 


Y86573 


Homo sapiens 


Human eene 9 1-encoded nrotein 
fragment, SEQ ID NO:490. 


390 


76 


2145 


Y86248 


Homo saoiens 


Human secreted nrotein HCHPK68 
SEQIDNO:163. 


198 

I/O 


70 


2146 


R96800 


Homo ^aniens 


Human hi^tiocvti?-^ecreited fiactor HSF 


1 19 




2147 


AJ242540 


Volvox nartpri f 
nagariensis 


VivHrnYvrrmliTif 1 — ripVi &1vrfiT>mt"pifi T}7- 

HRGP 




4^ 


2148 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


91 


60 


2149 


AF090942 


Homo sapiens 


PRO0657 


129 


66 


2150 


AF130089 


Homo sapiens 


PRO2550 


372 


82 


2151 


AC009991 


Arabidopsis 


unknown protein 


81 


59 



185 



WO 01/64835 PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






Iflallalla 








2152 


AF090942 


Homo sapiens 


PRO0657 


93 


53 


oi *i 


K.UJ17 


rioino sapiens 


Partial Human Natural Kilier receptor. 


215 


89 


2154 


AC008075 


Arabidopsis 
thaliana 


F24J5.4 


139 


36 


01 5<\ 
Zl 33 


AT *\Q(\\ \A 


Leishmania 
major 


extremely cysteine/valine rich protein 


1 AO 

148 


50 


01 

Zl DO 


C7Q4 in 


Mus musculus 


nuclear localization signal binding 
protein 


112 


58 


0 1 CO 
Zl5/ 


a vn.O/M<< 
AJvUz44jD 


Homo sapiens 


FLJ00047 protein 


152 


66 


2158 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A CI 

4457. 


85 


80 


01 ^0 


(jUjZjo 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


165 


75 


oi^n 




— : 

Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


1 a e 

145 


45 


oi£i 

Z101 


A CI TOO An 


Homo sapiens 


rKUzozz 


138 


71 


2162 


AJ223953 


Homo sapiens 


hPTTG 


106 


62 


01 ai 


AKA)zJ54z 


Homo sapiens 


unnamed protein product 


76 


52 


2164 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AA^C 

4425, 


118 


84 


2165 


U41038 


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 


137 


65 


2166 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


348 


88 


2167 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


108 


84 


ZlOd 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5327. 


114 


48 


01 AO 

zioy 


A DAAA£ 1 X 

ArUUUOlO 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


125 


67 


01 on 
Zl /u 


Ar UUUoy 


Homo sapiens 


PRO2550 


142 


65 


0171 
Zl /i 


V 1 O/C AO 


Homo sapiens 


SEQ ID NO 327 from W09922243. 


103 


65 


Ol OO 

Zl /z 


a un 1 n 1 >i yi 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 


39 


2173 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


91 


50 


2174 


D38435 


Homo sapiens 


homologue of yeast PMS 1 


314 


96 


O 1 7C 


Ar 119851 


Homo sapiens 


PRO 1722 


230 


69 


2176 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


100 


63 


2177 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


137 


44 


2178 


U23172 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:(^24_HUMAN, P23246) and D. 
melanogaster No-on-transient A protein 

/DTD • TTJTA 1 AO\ 
\r IK-JriUlOZ/ 


139 


42 


717Q 

Zl /If 


AT a/Q£ar 


AlaDiaopSlS 
in ail alia 


exiensm-iiKe protein 


ZOO 


57 


01 ftn 


T 10118 


Kattus 
norvegicus 


p =~u s 

proline-nch proteoglycan 


1 AO 

148 


A A 

40 


01 Rl 


Aivno/iA';^ 

ArwUZ^Jj 


Homo sapiens 


171 TAAn^*7 nrr-ifnin 

rLJUUU4/ protein 


97 


63 


2182 


R95913 


Homo sapiens 


Neural thread protein. 


100 


69 


0 1 

zlS3 


AF266479 


Homo sapiens 


rectachrome 1 


148 


81 


2184 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

HO JO. 


64 


70 


2185 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


107 


40 


2186 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


110 


84 



186 



WO 01/64835 



PCT/US01/04927 



SEQ n> 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


9187 




("valine oaJlii<; 


OlylKX I ^ y j uUllagdl 


103 


4"? 


2188 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


115 


36 


91 ftO 

2159 




UnmA cQntPtlC 
xrOlIlU dctpivUd 


"MP A-U/979 


971 


inn 

1UU 


2190 


AF130089 


Homo sapiens 


PRO2550 


137 


68 


2191 




v oivox caxxen 


extensin 


1 1j 


ft) 


2192 


U499 /3 


rtomo sapiens 


uivr 1 , ivic,rvj / , puis live transposasc 
similar to pogo element 


1 <ZA 




2193 


nAl r 7Q9 

uUi/o/ 


riomo sapiens 


Unman roni-a+cJ nrrt+oin QPA 1T\ XTO" 

riuman secreted protein, oc\</ uj rixj. 

/ oOO. 


14o 




2194 


U52077 


Homo sapiens 


mariner transposase 


257 


53 j 


2195 


D 1 1 A 


Homo sapiens 


Human secreted protein encoded by 
gene 10 clone HDPGP94. 


OS 


70 


2190 


n?oi io 
U3ol 12 


Homo sapiens 


iNAUri aenyarogenase suDunit 3 


i en 

loU 


77 


2197 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 cione riiUAxJZJ.. 


121 


65 


2198 


Y 3 63 60 


— =-= : 

Homo sapiens 


Fragment of human secreted protein 
encoueu oy gene j. 


120 


04 


2199 


UOioO/ 


— : 

Homo sapiens 


Human secreiea protem, otiv^ Jul* jnu. 

/OOO. 


1A1 


o / 


2200 


AJ007042 


Homo sapiens 


TRX5 protein 


264 


75 


2201 


Ar lJUODl 


Homo sapiens 


rKvUo95 


71 


£1 


2202 


A V(V)AAZZ 

AJsa)244jj 


Homo sapiens 


T?T TAAA/17 nrntain 


1 j3 


OA 


2203 


U83303 


Homo sapiens 


iine-1 reverse transcriptase 


95 


52 


2204 


G00328 


Homo sapiens 


Human secreted protem, obv^ iu JNU. 

44U9. 


19ft 

125 


5/ 


2205 


AF090895 


Homo sapiens 


PRO0117 


163 


69 


2206 


G03806 


Homo sapiens 


T T . nanra4ar1 nmtain PEA T"T\ 'VT/~\« 

Human secreiea protem, oby Uj NU. 
7887. 


1 ZA 
I D4 


09 


2207 


K95913 


Homo sapiens 


Neural thread protein. 


1 A1 
103 


50 


2208 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/MAO 


144 


78 


2209 


\v4a3Dl 


_ 

Homo sapiens 


Human breast cancer related protein 
BCRB2. 


1 AA 
lOO 


03 


2210 


003790 


Homo sapiens 


Human secreiea protem, oc\i id inu. 
7871. 


111 
111 


/IS 


2211 


Ar 090931 


Homo sapiens 


JrKU04o3 


03 


on 


2212 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone rir Mt>vj32. 


100 


74 


2213 


Y02o93 


_ ; 

Homo sapiens 


Human secreted protein encoded by 
gene 44 cione ri 1 uiwjaz.. 


OS 


^1 
41 




\47£A71 
JV1 / OO / 1 


T 1/^ AnAfCI A ATI 

L,ycopciMCon 

f^cfiili^ntiim 

Wu UlCiivtUtl 


c A LCI IS 111 ^CldSa 11 ) 


117 




2215 


X03717 


Homo sapiens 


pot unidentified reading frame 


98 


54 


991/^ 

X^IO 


RQSQ1 3 


nulllU *>a|JlvLu> 


ncui ai uiicau piuiciii. 


1U7 


48 


991*7 


AT 1 1 OUOO 






1JO 


79 


99151 


AFARIAftA 
/VTUO 1*+o*t 


JtlV/IIlvJ SapiCEla 


aipiut'tUUUUll IbUlUlIil 1 


j*tj 


95 


991Q 




XiOIiiU attpiCllo 


Human Qprrpfr^H nmfpin 91 RO TD MO* 

4488 


157 


84 

Or 4 


9990 






fllnha-tiihiilin 

Allsll a. luu lllui 


571 


94 


2221 


AP071172 


Homo sapiens 


HERC2 


187 


84 


9999 


K/f 177R/* 


nUlUU oapiCllo 


crli5i-^PT*iv«»r1 nPYin rvr**f»iirCAt* 




OJ 


2223 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


588 


85 


2224 


AK02o072 


Homo sapiens 


unnamea protem proaucc 


lOO 
199 


S7 


2225 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7871 

/Oil* 


161 


61 


2226 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


126 


41 


2227 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


124 


63 



187 



WO 01/64835 



PCT/US01/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


Alio 


A 171 1 Off ^ 


: 

Homo- sapiens 


PRO 1847 


108 


74 




vJl/J IjJ 


Homo sapiens 


Human secreted protein, obvj ID NU. 


122 


55 








TO 1 A 

7214. 










— r : 

Homo sapiens 


rK(JUoo3 


1 A £ 

146 


61 


Z/j 1 


Y 144oZ 


Homo sapiens 


Fragment of human secreted protein 


152 


74 








encoded by gene 17. 






1151 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


59 








/871. 






1155 


f^AT70A 


— — : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


58 








/8/L 






H5h 


vjuj /yu 


— — : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


60 








9271 
lot 1. 






oo^ 
H5j 




Homo sapiens 


FLJ00047 protein 


133 


71 


11 Jo 


Yo7103 


Homo sapiens 


Human secreted protein sequence SEQ 


99 


59 








ID NO: 142. 






1151 


U87607 


Rattus 


putative RNA binding protem 1 


111 


38 






norvegicus 








115 o 


OUUiy / 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


119 


63 








4478. 






11 5 y 


000357 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


75 


65 








A AIO 

4438. 








L27428 


Homo sapiens 


reverse transcriptase 


136 


40 




da^qaa 
KyooOO 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


122 


82 


2242 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


95 








4409. 








U00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


79 


48 








A AtC 

4435. 








Cr0032d 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


75 








4409. 








T T A0 1 1 X 


Homo sapiens 


ORF1, encodes a 40 kDa product 


346 


87 




T TOT3A1 

Uo2303 


Homo sapiens 


unknown 


155 


79 


2247 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


45 








6536. 






OO/IQ 


Y02671 


XT - 

Homo sapiens 


Human secreted protem encoded by 


105 


47 








gene 22 clone HMSJW18. 






OO/IQ 


Y ooz45 


— — ; 

Homo sapiens 


Human secreted protein HCHPF68, 


161 


63 
















A Ffi 101 AA 


— : 

Homo sapiens 


neuronal tnreaa protein AD /c-N iJr 


124 


62 


09 SI 


YJM0A/£ 
A5 IzUO 


Drosophila hydei 


nistone 


101 


71 




AT71 1 

AT 1 jjjoI 


Danio rerio 


proteasome subunit beta 7 


92 


52 


2253 


A T7 A 1 /W A A 

AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


172 


60 




A TTAIMOOC 


Homo sapiens 


cytochrome P450 2E1 


114 


46 


oo<< 


U4yy73 


Homo sapiens 


ORF1 ; MER37; putative transposase 


136 


64 








similar to pogo element 








AT 1T?ftA1 


Caenorhab cutis 


VI C"G1 A 1 

I I Dili A. J 


147 


90 






eiegans 








29S7 


UH77 / J 


riomo sapiens 


uivt i , Miiivj /, putative transposase 


OAT 

191 


59 








similar to pogo element 






^JO 




nOuio sapiens 


putative p i ju 


218 


66 




T OO/^CA 

• L.Z20MJ 


Homo sapiens 


early lymphoid activation protein 


82 


55 


99 £n 


At iy4jj / 


Homo sapiens 


XT a rt 1 -i 
NA013 


117 


56 


99/: i 


U43360 


Peromyscus 


reverse transcriptase 


130 


82 






maniculatus 










OUOjOO 


Homo sapiens 


Human secretea protein, bbQ ID NO: 


86 


66 








4447 






2263 


AL021918 


Homo sapiens 


b34I8. 1 (Kruppel related Zinc Finger 


372 


61 








protein 184) 






2264 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


122 


71 






major 









188 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2265 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


123 


79 


2266 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO* 163 


204 


95 


2267 


L29029 


Chlamydomonas 
reinhardtii 


amino acid feature: Rod protein 
domain aa 26/5 468* amino acid 
feature* 2 lobular nrotein domain aa 32 
..265 


141 


58 


2268 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


499 


84 


2269 


A11693 


Homo sapiens 


start codon not included 


594 


87 


2270 


AB014554 


Homo sapiens 


KIAA0654 protein 


141 


63 


2271 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


112 


60 


2272 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


118 


54 


2273 


Y20852 


Homo sapiens 


Human neurofilament-H mutant protein 
fragment 1 1 

AA UglllvllV A A . 


125 


35 


2274 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7534. 


92 


66 


2275 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


198 


64 


2276 


U15647 


Mus musculus 


reverse transcriptase 


100 


72 


2277 


AJ004810 


Zea mays 


cvtochmme P450 monooxv^enase 


190 




2278 


G00328 


Homo saniens 


Human secreted nrotein SEO ID NO* 
4409. 


111 




2279 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4670. 


112 


68 


2280 


L27428 


Homo sapiens 


reverse transcriptase 


150 


60 


2281 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


142 


40 


2282 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


321 


54 


2283 


D00570 


Mus musculus 


on en reading frame f25 1 AA^ 


169 




2284 


Y02887 


Homo saoiens 


Fragment of human secreted nrotein 

A. 1 tlgUlwUli vi 11UI 1 Hill iJVVlWlvU Lit VVwill 

encoded by gene 90. 


86 


S4 


2285 


AL359782 


TrvDanosoma 
brucei 


nossible f hhv-6i ul 1 02 variant a dna. 

t*\J»J*9*.\fl\* IMUT J ***- A wA#j VCAAACAUfc (A %AJ 111, 

complete virion genome. 


144 


50 


2286 


AF1 19901 


Homo sapiens 


PR02831 


116 


82 


2287 


G03797 


Homo sapiens 


Human secreted nrotein. SEO ID NO* 
7878. 


126 


73 


2288 


AF130089 


Homo sapiens 


PRO2550 


102 


75 


2289 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


113 


70 


2290 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


127 


64 


2291 


L26953 


Homo sapiens 


chromosomal protein 


137 


53 


2292 


AF161356 


Homo sapiens 


HSPC093 


100 


57 


2293 


G00328 


Homo sapiens * 


Human secreted protein, SEQ ED NO: 
4409. 


138 


68 


2294 


X03145 


Homo sapiens 


pot OfcFI 


120 


43 


2295 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


83 


52 


2296 


AK024455 


Homo sapiens 


FU00047 protein 


98 


66 


2297 


G00262 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4343. 


123 


73 


2298 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ED NO:250. 


349 


73 


2299 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


109 


61 



189 



WO 01/64835 PCT/US01/04927 



SEQ1D 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 


Identit 










Score 


y 


2300 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


81 








5327. 






2301 


Ar2M290 


Plasmodium 


glutamic acid-rich protein 


112 


40 






falciparum 






n AO 

2302 


At I > jziz 


Pisum sativum 


extensin 


on 

89 ; 


36 


2303 


AF 130089- 


Homo sapiens 


PRO2550 


83 


50 


2304 


1 TA*"i All 

U42471 


Mus musculus 


■it j ■ mw Alt" \ r"» 1 » ■ 

Wiscott-Aldrich Syndrome protein 


84 


51 








homolog 






2305 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


86 


30 


2306 


AP002031 


Arabidopsis 


gene_id:K3D20.3~ 


168 


44 






thaliana 








2307 


AF157321 


Homo sapiens 


30 kDa protein 


309 


64 


2308 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


170 


57 








6613, 






2309 


M76546 


Helianthus 


hydroxyproline-rich protein 


198 


42 






annuus 








2310 


Y27607 


Homo sapiens 


Human secreted protein encoded by 


207 


100 








gene No. 41. 






2311 


AF 130089 


Homo sapiens 


PRO2550 


106 


66 


2312 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


60 








4518. 






2313 


R10755 


Homo sapiens 


Non-A non-B hepatitis specific 


114 


80 








antigenic protein encoded by 












phageclone lambda HC2533. 






2314 


A »^kftftftft-« ft 

AB032910 


Hylobates 


dopamine receptor D4 


108 


40 






muelleri 








2315 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


159 


59 








gene No. 107. 






2316 


AK024455 


Homo sapiens 


FLJ00047 protein 


134 


58 


2317 


AF238235 


Entamoeba 


diaphanous protein 


103 


51 






histolytica 








2318 


U15647 


Mus musculus 


reverse transcriptase 


101 


37 


2319 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


117 


51 


2320 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


163 


78 








SEQ ID NO: 163. 






2321 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


48 


• 






7871. 






2322 


D00570 


Mus musculus 


open reading frame (196 AA) 


122 


39 


2323 


A MA ft ft f 1 

AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


122 


75 








RHAla (AF078683) 






2324 


ftft >*ft ft 

G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


79 








6720. 






2325 


IT j i Oft fl 

U44838 


Glycine max 


extensin 


126 


36 


2326 


X Aft T 4 ftft 

Y87103 


Homo sapiens 


Human secreted protein sequence SEQ 


99 


59 








ID NO: 142. 






2327 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 


408 


79 








fragment, SEQ ID NO .490. 






2328 


AF 194537 


Homo sapiens 


NAG 13 


181 


66 


2329 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


405 


75 








similar to pogo element 






2330 


VC1 CO 1 

X53581 


Rattus 


ORF4 


11/: 

116 


A C 

45 






norvegicus 








2331 


L26953 


Homo sapiens 


chromosomal protein 


108 


67 


2332 


L27428 


Homo sapiens 


reverse transcriptase 


117 


76 




OUUoJ/ 


Homo sapiens 


Human secreted protein, bbQ ID NU. 


OA 

84 


CA 

50 








4718. 






2334 


AF191687 


Homo sapiens 


alanine-glyoxylate aminotransferase 


100 


37 








homolog 






2335 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


84 


68 



190 



SEQID 


Access on No. 


Species 


Description 


Smitb- 


% 


NO: 




Wa terra an 


Ideotit 








Score 


y 


2336 


AL390114 


Leishmania , 


extremely cysteine/valine rich protein 


1 06 


53 






major 








2337 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


168 


65 








SEQID NO: 163. 






2338 


R95913 


Homo sapiens 


Neural thread protein. 


164 


67 


2339 


AF090942 


Homo sapiens 


PRO0657 


122 


81 


2340 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


t\/\ 
90 








8159. 






2341 


AF090942 


Homo sapiens 


PRO0657 


124 


60 


2342 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


83 






7124. 






2343 


ABO 13454 


Rattus 


NaPi-2 beta 


143 


77 






norvegicus 








2344 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


127 


73 








gene 44 clone HTDAD22. 






2345 


AJ242540 


Volvox carteri f. 


* * i * • * \ i * T\nr 

hydroxyproline-nch glycoprotein DZ- 


146 


45 






nagariensis 


HRGP 






2346 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


120 


40 


2347 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


55 






7124. 






2348 


Y36203 


Homo sapiens 


Human secreted protein #75. 


138 


78 


2349 


L26953 


Homo sapiens 


chromosomal protein 


131 


57 


2350 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


65 








6613. 






2351 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


144 


58 






terminal, clone XEH8c} 






2352 


AF130051 


Homo sapiens 


PRO0898 


155 


70 


2353 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


76 






4438. 






2354 


L27428 


Homo sapiens 


reverse transcriptase 


141 


71 


2355 


L26953 


Homo sapiens 


chromosomal protein 


128 


66 


2356 


W48351 


Homo sapiens 


Human breast cancer related protein 


148 


75 








BCRB2. 






2357 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 15 


62 








7867. 






2358 


B12310 


Homo sapiens 


Human secreted protein encoded by 


115 


50 






gene 10 clone HDPGP94. 






2359 


M76546" 


Helianthus 


hydroxyproline-rich protein 


103 


43 






annuus 








2360 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


ox 


OA 






encoded by gene 17. 






2361 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 


95 


TO 

72 








isoform 






2362 


M64793 


Rattus 


salivary proline-rich protein 


117 


41 






norvegicus 








2363 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 


98 


80 








sequence. 






2364 


AL451017 


Neurospora 


related to Ul SMALL NUCLEAR 


152 


45 






crassa 


RIBONUCLEOPROTEIN C 






2365 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


85 


55 








gene 22 clone HMSJWlo. 






2366 


G03790 


Homo sapiens 


TT j. J nDA Tr\ XT/A. 

Human secreted protein, SEQ ID NO: 


1 A1 
101 


TT 








787 1. 






2367 


AF093097 


Homo sapiens 


putative RNA-bmdmg protein Q99 


aid 

248 


AT 

97 


Z3oa 


T KTATT 


Homo sapiens 


manner transposase 


777 


7d 


2369 


X07882 


Homo sapiens 


Po protein 


102 


38 


2370 


U44838 


Glycine max 


extensin 


102 


32 


2371 


ABO 12223 


Canis familiaris 


ORF2 


158 


60 


2372 


AF025467 


Caenorbabditis 


contains similarity to drosophila DNA- 


104 


42 



191 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






elegans 


binding protein (INLUigo 14oJ 






Ad id 




Homo sapiens 


Human secreted protein, obQ iu NU. 
8172. 


10/ 


4U 


JLD /** 


UIWU / o 


riomo Sapiens 


Human secreted protein, oeq iu inu. 
8159. 


1UO 


an 
67 


jLD 1 J 


f\T 1 1 OUoO 


Homo Sapiens 


rKU 1 




65 


2376 


W48351 


Homo sapiens 


Human breast cancer related protein 


95 


55 


JLd / / 




— ; ; 

Homo sapiens 


Human secreted protein, obQ ID NU: 

7870 


1 O 1 
121 


59 


2378 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


141 


70 






— : 

Homo sapiens 


Neural thread protein. 






2380 


U93572 


Homo sapiens 


putative pi 50 


124 


53 


7*381 i 




Homo sapiens 


Human secreted protem, SEQ ID NO: 
6619. 


195 


66 


ZdoZ 




Homo sapiens 


mariner transposase 


282 


67 


2383 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

O 1 A O 

8148. 


113 


43 


Zjo4 


AXJ4z;>4l) 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


112 


37 




vjUj/ 14 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

T7A< 


88 


75 


2386 


AF130089 


Homo sapiens 


PRO2550 


160 


62 


ZJo/ 


Woooz/ 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HrMoQJz. 


166 


61 


73R51 


API qaarq 


— ~ ; 

Homo sapiens 


rKOz35U 


W3 


71 


2389 


AB027890 


Schizosaccharom 
yces pom be 


Hypothetical protein 


116 


100 


2390 


U93570 


Homo sapiens 


putative pi 50 


151 


54 


zjy i 


Yiy /o/ 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


110 


71 


2392 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


109 


67 


2393 


AE001381 


Plasmodium 
ialciparum 


hypothetical protein 


94 


34 


2394 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
obQ ID NO; 163. 


156 


62 


7^o^ 




— j 

Homo sapiens 


Human secreted protem, SbQ ID NO: 
7133. 


57 


56 




U4z4 / I 


Mus museums 


Wiscott-Aldrich Syndrome protein 
homolog 


i f\—i 
107 


43 


2397 


AF210651 


Homo sapiens 


NAG18 


162 


54 


7^QSi 


\yfl 1QA1 


Kauus 
norvegicus 


proline-rich salivary protein 


1 lo 


34 


7^QQ 
**jyy 




rtoroo sapiens 


cyiocnrome c-iute poiypcpiiue 


1 i j 


OS 


2400 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A AAA 
4440. 


141 


54 


2401 


M13100 


Rattus 
norvegicus 


unknown protein 


148 


46 






Homo sapiens 


Human secreted protem, oxiQ iv xmu, 
7874. 


All 
1JJ 


70 




VQ 1 C'7'7 

yyj j// 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


JOO 


94 






Homo sapiens 


PRO0974 


i n 

112 


56 


2405 


Y36495 


xlliillU oapiciis 


FlCl^lUCUL Ul IJUlXlali aCUCICU piUlCUI 

encoded by gene 27. 




/%7 
o / 


2406 


R41001 


Homo sapiens 


Human myotonic dystrophy gene 
protein. 


207 


68 


2407 


AL390114 


Leishmania 


probable (hhv-6) ul 102, variant a 


127 


63 



192 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






major 


DNA, complete virion genome 






2408 


GOjIOI 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7182. 


"3 TO 

378 


95 


2409 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 


250 


86 


2410 


U72520 


Mus musculus 


mena protein 


115 


40 


2411 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
KHAla (AF078683) 


132 


A A 

44 


Oil n 
2412 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


10y 


85 


2413 


AF1 19901 


Homo sapiens 


rKU2oj 1 


i p 

1 13 


66 


2414 


R95913 


Homo sapiens 


Neural thread protein. 


93 


57 


2415 


B01372 


Homo sapiens 


Neuron-associated protein. 


104 


83 


2416 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


130 


50 


2417 


J02459 


bacteriophage 
lambda 


K (tail component; 199) 


720 


92 


2418 


J04694 


Mus musculus 


alpha- 1 type IV collagen 


103 


43 


2419 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


145 


61 


2420 


W26496 


Homo sapiens 


CD2 associated intracellular protein 
CAIPLS02-21. 


115 


80 


2421 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; giycine- 
and proline-rich 


145 


38 


2422 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


158 


73 


2423 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


156 


70 


2424 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


122 


61 


2425 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


95 


59 


2426 


U93565 


Homo sapiens 


putative pi 50 


1 A1 

147 


75 


2427 


A T Ifk/W 1 A 

AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


""I A/Z 

246 


53 


2428 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


138 


61 


2429 


\Tt ClTI /ll 

Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


112 


75 


2430 


D13892 


Homo sapiens 


carboxyl methyltransferase 


1 O 1 

181 


68 


2431 


X97675 


Homo sapiens 


plakophilin 2b 


131 


65 


2432 


G02896 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6977. 


108 


70 


2433 


X03145 


Homo sapiens 


pot orf in 


180 


82 


2434 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


77 


86 


2435 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


421 


67 


2436 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


111 


38 


2437 


AF210651 


Homo sapiens 


NAG18 


128 


72 


2438 


A TT\ 1 /AftA 

ArO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


232 


A £ 

46 


2439 


AL160493 


Leishmania 
major 


probable (hhv-6) ull02, variant a 
DNA, complete virion genome 


122 


53 


2440 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


136 


80 


0/1/1 1 




Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


139 


54 


2442 


AF090895 


Homo sapiens 


PRO0117 


115 


61 


2443 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


125 


70 



193 



WO 01/64835 PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


nAAA 
Z'Hrt 




— — : : 

Homo sapiens 


Human secreted protein, SEQ ID NO. 
4675. 


/I 


80 






Homo sapiens 


Human secreted protein, &dU_ ID NU. 
7871. 


51 


37 






Homo sapiens 


"DO /~\A 1 1 "7 

I'KUU 117 


60 


64 


2447 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4DZ3. 


103 


86 


z'tho 


W*fo3J i 


— : 

Homo sapiens 


Human breast cancer related protein 
BCRB2. 


94 


61 


04210 


L.Z 


Homo sapiens 


succinate dehydrogenase flavoprotein 
subunrt 


loo 


A 1 

91 


Z**JU 


A FAQA04O 


— ; 

Homo sapiens 


rKUUOD / 




71 


2451 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


87 


94S9 


rv.U.5 1 /V 


: 

Homo sapiens 


pro-alpha- 1 type-I collagen 


1 OA 

120 


44 


2453 


AF 11 8082 


Homo sapiens 


PRO1902 


143 


46 


OA^A 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


114 


68 


Z*Oj 




Homo sapiens 


T»T> CCA 

PRO2550 


125 


89 


2456 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


134 


71 


Z*0 / 


Y J0Z43 


Homo sapiens 


Human secreted protein encoded by 
gene 20. 


93 


75 


Z*Oo 


VftOO/1 ^ 


Rattus 
norvegicus 


putative integral membrane transport 
protein ^ 


166 


46 


2459 


AF 130052 


Homo sapiens 


PRO0956 , 


75 


48 




OUUizo 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


162 


76 


Z401 




Homo sapiens 


Human histiocyte-secreted factor HSF. 


123 


69 


2462 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


157 


48 


2463 


X83413 


Human 
herpesvirus 6 


U88 


236 


50 


2464 


G00517 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4598. 


181 


53 


2465 


X83413 


Human 
herpesvirus 6 


U88 


218 


50 


2466 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


147 


57 


2467 


X83413 


Human 
herpesvirus 6 


U88 


196 


53 


2468 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

ok\£ UJ NU: 163. 


165 


61 


2469 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
ol /Z. 


101 


46 


2470 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


221 


56 


9471 


n(\f\Aoi 

VJUU*fZ / 


— : 

Homo sapiens 


Human secreted protein, i>by ID NO: 
4508. 


136 


63 


9470 


AljJjy /5Z 


Trypanosoma 
brucei 


possible (nnv-oj ul iuz, variant a ana, 
complete virion genome. 


122 


94 


7471 


API &\ 


Homo sapiens 


rtorlAJyo 


1 A A 

120 


CA 

60 


2474 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


142 


40 


2475 


Y86248 




SEQIDNO:163. 


1 JU 


35 


2476 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6596. 


78 


48 


2477 


AL160371 


Leishmania 


probable (hhv-6) u 1 102, variant a 


86 


46 
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SEQI0 


Accession No. 


Species 


Description 


Smitb- 


% 


NO: 






Waterman 


Identit 










Score 


y 






major 


i ~ 

DNA, complete virion genome 






24 /o 


001478 


Homo sapiens 


riuman secreteu proiem, o£.v< iu imu. 


yo 


DO 














2479 


A t All £ AC 

AL033545 


Arabidopsis 


extensin-like protein 


114 


4U 






thai i an a 








a OA 

2480 


AL3 59782 


Trypanosoma 


possible (hhv-6) ul 102, vanant a dna, 


142 


59 






brucei 


complete virion genome. 






2481 


AL390114 


Leishmania 


probable (nnv-o) ul luz, vanant a 


I AC 

l4o 


oz 






major 


DNA, complete virion genome 






2482 


ATI i^nAO 

AL 132902 


Caenornaoditis 


Y71 A 1215.4 


1 iCA 

lou 


0 / 






elegans 








o>i oo 
248J 


AT?* Cil CAO 

Ar 154502 


Homo sapiens 


: tz p -p r-5 

quiescent cell proline dipeptidase 


Aid 


CQ 
OO 


24X4 


A T7A 1 A^OiC 

AJrOlO^zo 


Drosophila 


snort torm 01 CriLr 


JO 


4Z 






melanogaster 








2485 


\1 TO t\Al\£ 

W80406 


Homo sapiens 


A secreted protein encoded by clone 


t in 

1 iy 


7A 








dn4U 






2486 


W31186 


. , . - ,. , 
Homo sapiens 


Human pi 60 polypeptide lo(J.z. 


1 0 


AA 


2487 


W48351 


Homo sapiens 


Human breast cancer related protein 


1 AO 

103 


IA 

/4 








BCRB2. 






2488 


L26953 


Homo sapiens 


chromosomal protein 


1 OA 


AO 
OO 


2489 


Y21418 


Homo sapiens 


Human high mobility group protein 


96 


AC 

46 








HMGI-C mutant tragment l. 






2490 


Y91577 


Homo sapiens 


Human secreted protein sequence 


3oo 


94 








..«>J*il |__ . O CT?A TT*\ \TA.OCA 

encoded by gene 2 SEQ ID NO:250. 






2491 


G00427 


Homo sapiens 


Human secreted protein, bbvj LD NU: 


1 <o 
158 


OA 
/O 








/I <AO 






2492 


U 12707 


Homo sapiens 


Wiskott-Aldrich syndrome protein 


1 AA 
144 


>lO 
4/ 


2493 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


1 |<7 

117 


0 1 
81 








encoded by gene 17. 






2494 


X92485 


Plasmodium 


pval 


1 10 


A/f 
04 






vivax 








24V3 


003 / 14 


Homo sapiens 


riuman secreieu proiein, aiivj uj inu. 


yz 


AA 
04 








OOO^. 

/ /yj. 






24V7 


A 1?1 ai AQ1 

Ar lol4oi 


Homo sapiens 




Zl 1 


Oft 


2498 


VACAA/T 


Homo sapiens 


S-protein 


zy© 


no 

yz 


2499 


AB032911 


Hylobates agiiis 


dopamine receptor D4 


OA 

89 


>IO 

42 


2500 


M98502 


Mus musculus 


pMLZ-4 


325 


OA 

yu 


2501 


Y94920 


Homo sapiens 


Human secreted protein clone 


</CA 

5o9 


o5 








pm412_12 protein sequence SEQ ID 












In U:4o. 






/JuZ 


ATIAAOA^A 


riomo sapiciib 


Oram luiK prutciii*"! 






o<ao 
Z5U3 


Arl)646U4 


Homo sapiens 


rLbUJ protein 


ZZ4 


J / 


Z3U4 


■7AAOOO 


Schizosaccharom 


putative dtia-directed ma polymerase in 


Jo4 


Al 

01 ! 






yces porabe 


130 kd polypeptide (ec 2.7. / .o) 






ZOID 


mi iyui 


Rattus 


proiine-ncn salivary protein 


I*!-/ 


A^ 






norvegicus 








O^AA 


a in 1 aaaa 


jvius muscuius 


j_»-oOX3 proiem 


jOO 




ZOU/ 


AoJ413 


Human 


Uoo 


ZUJ 


AC i 
40 






nci pes virus 0 








0<A© 

ZDUo 


A IOOO/IOC 
AJZ / /4Z5 


vjloooaera 


. _ — . 

putative cuticular collagen 




AA ! 
4U ! 






f% oil inn 

pajiiaa 








o^aq 
zouy 


A QAIOOI A 

AJtsui /yiy 


Homo sapiens 


pepuayiargnime aeiminase type v 


1 AR 
l*lo 


51 






Homo sapiens 


U4/ut>-associaiea kina spue in g iactor 


JOJ 


OO 

y / 


oci 1 
2511 


a nioMn 

AJ238520 


Homo sapiens 


putative transcription factor-like 


*700 

722 


1 AA 
100 








mir*I~5ii" rpoii i?it"Ar 

nuclear rcgiiid-iui 






2512 


AL390736 


Homo sapiens 


bA209J19.1.1 (GW1 12 protein) 


557 


84 


2513 


D84223 


Homo sapiens 


leucyi tRNA synthetase 


1113 


100 


2514 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


174 


56 








clone HPMBQ32. 
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SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2515 


UoOoui 


Homo sapiens 


clathrin heavy chain 2 


111 


92 


2516 


AJ388557 


Canis familiaris 


zinc finger protein 


826 


56 


2517 


AB02725 1 


Homo sapiens 


zinc finger protein (ZFD25) 


631 


85 


2518 


AK023160 


Homo sapiens 


unnamed protein product 


168 


54 


2519 


D57326 


Mus musculus 


GSG2 


575 


73 


2520 ' 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


100 


42 


2521 - 


X67688 


Homo sapiens 


transketolase 


120 


66 


2522 


AF074086 


Homo sapiens 


protease 


390 


86 


2523 


AF220509 


Homo sapiens 


transcription associated factor 
TAFII31L 


801 


99 


2524 


AL078463 


Homo sapiens 


dJ365I19.1 (K1AA0456) 


374 


92 


2525 


AF038995 


Mus musculus 


putative RNA helicase RCK 


160 


93 


2526 


M60618 


Homo sapiens 


nuclear autoantigen 


116 


75 


2527 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


101 


57 


2528 


X68790 


Homo sapiens 


Bactericidal /Permeability Increasing 
Protein 


136 


90 


2529 


A01592 


Homo sapiens 


haemoglobin A beta chain 


508 


94 


2530 


U89277 ! 


Homo sapiens 


polyhomeotic 1 homolog 


404 


79 


2531 


X90845 


Rattus 
norvegicus 


alphall spectrin 


507 


87 


2532 


AL1 37081 


Arabidopsis 
thaliana 


phenylalanine-tRNA synthetase-like 
protein 


178 


33 


2533 


Y 18046 


Homo sapiens 


FGFR1 oncogene partner (FOP) 


363 


100 


2534 


W54966 


Homo sapiens 


Synthetic human type HI collagen 
SYN-C3. 


113 


42 


2535 


X61047 


Hydra sp. 


mini-collagen 


102 


38 


2536 


AB006330 


Mus musculus 


SOX5 


559 


94 


2537 


AC006283 


Arabidopsis 
thaliana 


En/Spm-like transposon protein 


149 


33 


2538 


AF196779 


Homo sapiens 


JM11 protein 


139 


57 


2539 


AK000741 


Homo sapiens 


unnamed protein product 


233 


47 


2540 


Y18046 


. Homo sapiens 


FGFR1 oncogene partner (FOP) 


315 


84 


2541 


U54996 


Homo sapiens 


HZW10 


371 


81 


2542 


G02631 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6712. 


99 


45 


2543 


M95610 


Homo sapiens 


alpha-2 IX collagen 


93 


34 


2544 


AF071173 


Mus musculus 


Herc2 


349 


82 


2545 


AB000516 


Homo sapiens 


DSIFpl60 


600 


93 


2546 


AL355178 


Homo sapiens 


dJ947L8.1.6 (novel CUB and Sushi 


395 


90 






(SCR repeat) domain protein) 






2547 


U93574 


Homo sapiens 


putative pi 50 


235 


76 


2548 


X80035 


Oryctolagus 
cuniculus 


cysteine rich hair keratin associated 
protein 


99 


31 


2549 


U58088 


Homo sapiens 


TT /IT TT O 

Hs-CUL-2 


592 


86 


2550 


L26953 


Homo sapiens 


chromosomal protein 


131 


63 


2551 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


99 


61 


2552 


U39412 


Homo sapiens 


alpha SNAP 


159 


54 


2553 


AF248651 


Homo sapiens 


RNA-binding protein BRUNOL4 


569 


93 


2554 


AJ006519 


Rattus 
norvegicus 


ASIC-beta 


164 


73 




L)y35/0 


Homo sapiens 


putative pi 50 


loy 


52 


2556 


AF1 04328 


Arabidopsis 
thaliana 


cell wall-plasma membrane linker 
protein homolog 


105 


29 


2557 


AB028975 


Homo sapiens 


KIAA 1052 protein 


.610 


83 


2558 


AJ245621 


Homo sapiens 


CTL2 protein 


286 


47 
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SCQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Ideotit 

y 


2559 


AF090944 


Homo sapiens 


PRO0663 


174 


78 

to 


2560 


Y17832 


Human 
endogenous 
retrovirus K 


pol protein 


411 


63 


2561 


AF109907 * 


Homo sapiens 


S164 


279 


44 


2562 


A09561 


synthetic 
construct 


human serum albumin 


680 


91 


2563 


U48359 


Gallus gallus 


kinesin light chain 


575 


81 


2564 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


435 


88 


2565 


D38112 


Homo sapiens 


NADH dehvdro^ enase subimit 4 


623 


89 


2566 


AF154916 


Chlamydomonas 
reinhardtii 


variable flagellar number protein 


117 


36 


2567 


AF181657 


Drosophiia 
melanogaster 


BcDNA.LD34475 


261 


42 


2568 


M12530 


Homo sapiens 


transferrin precursor 


693 


87 


2569 


U96915 


Homo sapiens 


sin3 associated polypeptide pl8 


651 


93 


2570 


R74205 


Homo sapiens 


Human death associated nrotein DAP- 
2. 


285 


27 


2571 


AL078593 


Homo sapiens 


dJ210Bl 1 OQAA0680) 


107 


37 


2572 


AF293405 


Phaseolus 
coccineus 


seed-micropylar-endothelium-specific 
nrotein 

i/l \3 Lv 111 


88 


39 


2573 


G03114 


Homo saniens 


Human secreted nrotein SKO ID NO* 
7195. 


110 


49 


2574 


AB024601 


Pseudomonas 
aeruginosa 


uridvlvl transferase 


120 


37 


2575 


AB050893 


Anadara nodifera 


cytochrome c oxidase subunit 1 


111 


79 


2576 


G03172 


Homo saniens 


Human secreted nrotein SEO ID NO* 
7253. 


130 


72 


2577 


MI7697 


Homo sapiens 


glutamate dehydrogenase 


369 


61 


2578 


AF045640 


Caenorhabditis 
elegans 


CI 1D2.4 gene product 


264 


43 


2579 


U71382 


Homo sapiens 


OB binding protein- 1 


273 


86 


2580 


AFU6661 


Homo sapiens 


PR01438 


114 


61 


2581 


AK000496 


Homo sapiens 


unnamed nrotein nroduct 


153 


55 i 


2582 


A67510 


Mus musculus 


MUS MUSCULUS GENOMIC DNA 
CONTAINING N ALLELE OF F V 1 
GENE. 


190 

17V 


JO 


2583 


Y44851 


Homo sapiens 


Human CD39-L66 protein. 


205 


97 


2584 


AB029151 


Homo sapiens 


D29 


253 


73 


2585 


AF039023 


Homo sapiens 


Ran-GTP binding protein; RanBP6 


765 


94 


2586 


AF054180 


Homo sapiens 


hematopoietic cell derived zinc finger 
protein 


116 


34 


2587 


AF283645 


Homo sapiens 


folate transporter/carrier 


580 


82 


2588 


AF090930 


Homo sapiens 


PRO0478 


132 


67 


2589 


AC002339 


Arabidopsis 
thaliana 


putative ABC transporter 


238 


34 


2590 


M34668 


Homo sapiens 


protein tyrosine phosphatase (EC 
3.1.3.48) 


627 


86 


2591 


G02532 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6613. 


141 


63 


2592 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40_3. 


142 


63 


2593 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 
oncoprotein. 


146 


54 


2594 


AF047437 


Homo sapiens 


sperm acrosomal protein 


526 


74 


2595 


U28789 


Mus musculus 


PACT 


528 


71 


2596 


AB002366 


Homo sapiens 


KIAA0368 


615 


83 


2597 


AL049610 


Homo sapiens 


(U1055C14.2 (KIAA0026 
(transcription factor-like protein 


704 


93 
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SEQ n> 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








MRGX)) 






zo yo 


Ul/lZ /y 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

D30U. 


385 


79 


zjyy 


AFAQARQ5 

ATU7U07J 


— : 

Homo sapiens 


rKUU 11/ 


142 


61 






ivius niuscuius 


seizure-related gene product 6 
precursor 


1 1 A 

119 


76 


2601 


U20158 


Homo sapiens 


SLP-76 


108 


53 


zouz 


rv*Ri 19 

UjOl 1Z 


xiomo sapiens 


nauh dehydrogenase subunrt z 


242 


78 


9*m 

xOvj 


rv*Ri 19 


riomo sapiens 


NADH dehydrogenase subunit 2 


233 


73 


ZOU4 


\ 9ARAA 
JL£4oU4 


Homo sapiens 


pZi 


259 


62 


9£A*\ 
ZOUj 


V191 A9 


Homo sapiens 


Human :> EM secreted protein SEQ ID 

xi/v 4 1 < 
JNU. 41 j. 


132 


92 






nomo sapiens 


OCpl A J 

oxiU14JL 


614 


81 


2607 


AF202635 


Homo sapiens 


PP1200 


105 


47 


ZOlro 


ArZ/ /i/4 


Homo sapiens 


enhancer of polycomb 


254 


85 


ZOU? 


XUj4o4 


Homo sapiens 


raf protein (aa 1-648) 


600 


82 


9^ in 


T T1 fT^T 
U 1 JOi f 


Homo sapiens 


CD40 binding protien 


465 


82 


9A1 1 
ZOl 1 


U4UZOJ 


Trypanosoma 
cruzi 


ATPase subunit 6 


AO 

98 


31 


9619 


APAQA0A9 

rvrvyvy^z 


xiomo sapiens 


t>T>/^AiCC*J 

r KUUoD / 


125 


A O 

48 


9fil^ 


vaa£69 
V uuooz 


Homo sapiens 


cytochrome oxidase I 


605 


82 


9614 


u*foyzu 


Homo sapiens 


metaxin 


Tit 0 
748 


94 


9fil^ 


VI 07^7 


Homo sapiens 


ID NO 475 trom W09y2zz43. 


113 


87 


9616 


AK"A9^A47 


Homo sapiens 


unnamed protein product 


173 


58 


9a*17 
xO 1 / 


L/i joOO 


Homo sapiens 


alpha- catenin 


569 


96 


2618 


U20536 


Homo sapiens 


cysteine protease Mch2 isoform alpha 


588 


87 


zoiy 


V UUOOZ 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


608 


86 


9A9A 

ZOZU 


AT71 Q(\t£Q 

At lyUloo 


Homo sapiens 


serum albumin precursor 


522 


78 


2621 


A06977 


Homo sapiens 


albumin 


607 


93 


2022 


Of A CO A 

R14584 


Homo sapiens . 


TGF beta 1 binding protein encoded by 
clone BPA 13. 


339 


81 


2623 


X5669Z 


Xenopus laevis 


42Sp48 


117 


47 


Z0Z4 


VITA C 1 

W4S351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


131 


57 


Z0Z3 


VAT QQ1 


Homo sapiens 


proiine-rich protein Gl 


116 


32 


zozo 


OKI 


Homo sapiens 
_ — . 


Human breast cancer related protein 
BCKBz. 


89 


58 


9 AO 9 
ZOZ / 


AAA970. 

Auuz/y 


synthetic 
construct 


Human serum albumin 


564 


95 


9/19 B 
ZOZo 


AJvUZlolJ 


Homo sapiens 


unnamed protein product 


214 


70 


9^9 Q 
ZOZy 


MZjOU 


Homo sapiens 


nucleophosmin 


A 

486 


82 


9/11 A 


A 17941 47/1 
ArZ4J4Z4 


Homo sapiens 


SOZNA beta isoiorm 


256 


98 


2631 


G00506 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/i ^57 


81 


60 




AAfiQ77 


— ; 

Homo sapiens 


albumin 


A CI 

45/ 


<tA 

74 




aa/*Q77 
Auoy / / 


riomo sapiens 


albumin 


563 


A*? 

93 


2634 


M81088 


Rattus 
norvegicus 


EF-l-alpha 


165 


68 


96^ 


AT 150^5*7 


Homo sapiens 


— - — : — : 

hypothetical protein 


496 


82 


2636 


A03758 


Homo sapiens 


serum albumin 


576 


91 


ZOJ/ 


uUZo /Z 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
oyjj. 


127 


39 






Homo sapiens 


putative pi 50 


186 


38 


2639 


U68729 


lS/f f*1 c\\ H n ovn p 
incognita 


LrULlV/lC JJi Cjjl UWV/ll£lgCXl 


1 1 ^ 




2640 


AB033056 


Homo sapiens 


KIAA 1230 protein 


269 


94 


2641 


AF1 16712 


Homo sapiens 


PR02738 


115 


61 


2642 


W48353 


Homo sapiens 


Human breast cancer related protein 


124 


72 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 












y 








BCFLT2. 






2643 


G02455 


Homo saniens 


Human secreted nrotein SEO ID NO* 


98 


54 








6536. 






2644 


G03438 


Homo saniens 


Human secreted nrotein SEO ID NO: 


127 


64 








7519. 






2645 


AK025116 


Homo saniens 


unnamed Drotein oroduct 


104 


64 


7646 


X92485 


Plasmodium 


pval 


100 


47 






vivax 












Hnm/i canipnc 

i J.LU vl\J oupiCilo 


PRO2550 


122 


51 


9648 


AF1 94557 


Homo saniens 


NAG13 


279 


83 1 


7640 


v 14482 


Homo saniens 

liVliiv 0 1 1 L> 1 V^XJUj 


Fraoment of human secreted nrotein 

X 1 uullwUV UUUKUl OWWXW»*»NX fc/X X»* 


90 


66 








encoded bv f?ene 17 








G03438 


Homo saniens 


Human secreted nrotein SEO ID NO* 

XXIXIXIIXIJI VIVU fcX* VfcWXXXj ii/ x ^ V**. 


148 


63 








7519. 






96S1 


AF1 16661 


Homo opmipTiQ 


PRO 143 8 


112 


46 


96S"9 


T T6"* 549 


Haiti cjanipni 
fluiilU oapiGiio 


FAP nrotein 

X / xx piVrwLU. 


128 


79 




AT "2<07CO 


i ry panuoUQid 


pUSMUlG ^JU1V**0 ^ UX Iv^, vol lain a LHUI, 




46 






hnicpi 


enmolete virion Pennine 








I 0 JxJOZ 


llnmn cam'pnc 
rxUlUU odpiClld 


Tir+prlf** iVfn 1 rvMiv^rHnor pn wmi* 

XlxlCl iCUlVlJl 1 Willi Vd IXUg Cllx^jr 


139 


72 








homoloonip fTOFT nrotern seouence 










nomo sapiens 


xiuuidn bcureucu pruicui 9 ocy iu i~ w. 


164 


72 








6613 








IVJ J\)D I 


nuinu sapiens 


OmdllCI UCpalULiCllULal UilvAjpi ULCilJ 


145 


SI 








Hihcm i ©ene nreoduct 






96^7 


VJvvUJ / 


Hnmo ^ar»iftn<*. 
xxviiivi oapi&iio 


Human secreted nrotein SEO TD NO* 

x x Lillian jtv/iviwu pivivui) uuy ixv * » vy . 


166 


64 








4718. 






ZOjo 


R9 5*799 


numu sapiviis 


Human cpr-nptRii nmtfMn cfMTiipnf^' 
XXIIIllflll oCul VvLvU yi VK&Ul OV>v^lxWllVvt/ 


91 

S 1 


88 








encoded hv ffene 9 SEO ID NO # l 1 1 








VQ9AR5 


x iooUlUUlUill 


pval 


141 


60 






vivax 


£3 






9660 


Y76198 

X /UI70 


Homo saniens 

xxvyl llv/ O dpi til o 


Human secreted nrotein enended bv 

l L\M 111(111 JwvlwUAJ LV1 \J LwJLlL WllwVUWU J 


68 


68 








cene 7~? 






9661 

ZOO X 


Y3flX77 

X JuOZ-i 


Homo ssniens 


Human secreted nrotein encoded from 

xxLuxicui avwvwu yriuiwixx viiwuwu uuiu 


96 


59 








pene 12 






7669 


AF09093 1 


Homo saniens 


PRO0483 


114 


74 


7663 


G03807 


Homo saniens 

11 Will W OUfJlwAU 


Human secreted nrotein. SEO ID NO - 


129 


57 








7888. 






2664 


AF090930 


Homo sapiens 


PRO0478 


170 


78 


2665 


V00662 


Homo saoiens 


URF 1 fNADH dehvdro eenase subunit } 


460 


77 


2666 


D38112 


Homo saniens 


cytochrome c oxidase subunit 1 


368 


67. 


2667 


AF09093 1 


Homo saoiens 


PRO0483 


90 


88 


2668 


AF010144 


Homo saDiens 


neuronal thread protein AD7c-NTP 


121 


66 j 


2669 


G00397 


Homo saniens 


Human secreted Drotein. SEO ID NO* 


169 


73 








4478. 






2670 


AF 194537 


Homo sapiens 


NAG 13 


170 


42 


2671 


X86003 


Raltus 


n eiiron-deri ved ornhan receotor 


104 


50 






norvegicus 








2672 


U63542 


Homo sapiens 


FAP protein 


139 


75 


2673 


AF090930 


Homo saoiens 


PRO0478 

A i\wVrr / \J 


130 


69 


2674 


G03564 


Homo saoiens 


Human secreted Drotein. SEO ID NOr 


161 


75 








7645. 






2675 


AK024455 


Homo sapiens 


FU00047 protein 


105 


53 


7676 


G017Q7 


XXUlxlU ^dlJ it-lio 


Human secreted nrotein SEO ID NO* 


142 


67 








7878. 






2677 


AK000496 


Homo sapiens 


unnamed protein product 


110 


67 


2678 


AFI 16715 


Homo sapiens 


PR02829 


151 


78 


2679 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


72 








4718. 






2680 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


81 



199 



SEQ1D 
NO: 


Accession No, 


Species 


Description 


Smiths 

Waterman 

Score 


% 

Identit 

y 








ool3. 






ZOO 1 , 


Y7A1QR 


— : 

Homo sapiens 


Human secreted protein encoded by 
gene 75. 


131 


75 




vl /OOlKfr 


Homo sapiens 


180 kDa bullous pemphigoid antigen 
zaype avu collagen 


96 


37 


2683 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


85 


55 




VjuU*HJ / 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


127 


80 


OARS 


a cnofiRQS 
Ar\jy\)oyj 


Homo sapiens 


PKUUl 17 


81 


51 




fy\) 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/o/l. 


119 


52 


2687 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


122 


41 


2688 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


145 


71 






Homo sapiens 


PRO2550 


149 


78 


2690 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


102 


47 


9AQ1 


UUJU4J 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


U6 


74 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7145. 


130 


60 




Ar04133U 


Bodo saltans 


NADH dehydrogenase subunit 5 


106 


26 


2694 


AE001158 


Borrelia 
burgdorferi 


conserved hypothetical integral 
membrane protein 


81 


37 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


162 


76 


zoyo 


A DAAA/C 1 £ 

ArUUUolo 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


114 


100 


9AQ7 


AT 1 yHjj 1 


Homo sapiens 


NAG 13 


126 


90 


2698 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


140 


69 


2699 


U93563 


Homo sapiens 


putative pi 50 


138 


37 




Ar 090928 


Homo sapiens 


PRO0470 


177 


57 




n a 1*71 /i 
OUj / 14 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


75 






Plasmodium 
vivax 


pval 


116 


50 


2703 


AK024455 


Homo sapiens 


FLJ00047 protein 


103 


57 






vjronila gorilla 


NADH dehydrogenase subunit 2 (ND2) 


137 


81 


2705 


AF090895 


Homo sapiens 


PRO0117 


105 


51 






Homo sapiens 


PRO0663 


121 


45 


97A7 
Z/U/ 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
/ 344. 


104 


57 


2708 


nrn 1 79 


jiojiio sapiens 


xiuman secretea protein, ocvj id nu: 
7253. 


157 


85 


2709 


D^81 19 

«7 OHZ> 


HAmn cantonc< 

nuiiiu oapicUb 


is/\L/xi aenyczrogenase suounii i 


zoj 


OA 

84 


2710 


AFl^flfWO 


Homo sapiens 




125 


48 


271 1 


iVIZZjjf 


jtiomo sapiens 


unknown protein 


131 


50 


2712 


G00416 


Homo sapiens 


Human secreted protein, SEQ CD NO: 

AAQH 

*¥*yl. 


123 


58 


27 n 


QR01 10 
OOU117 


rvanus sp. 


reverse transcriptase homolog 




57 


2714 


AF090931 


Homo sapiens 


PRO0483 


98 


85 


771 S 


VJU1Z*K> 


Homo sapiens 


Human secretea protein, SEQ ID NO: 
5327. 


104 


79 


2716 


L00016 


TTomo canipnc 


UiXJ 


£yj 


yj i 


2717 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


135 


82 


2718 


M81321 


Macaca 
fascicular is 


proline-rich protein 


117 


38 



200 



WO 01/64835 



PCT/USO 1/04927 



SEQ H> 
NO: 


Accession No. 


Species 


Description 


Smith- 
Wa term an 
ocore 


% 

fdentit 

y 


2719 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

V>7 lj. 


124 


57 


2720 


AL359782 


Trypanosoma 
brucci 


probable similar to ring-h2 ringer 

r\rotpin rhola 
U-IULCIU rild. 1 a.. 


147 


56 


2721 


Y02749 


Homo ^anien*; 


Xiuiiidn secreieu proiem euvuucu uy 
gene 100 clone HNFIU96. 




fin 
o/ 


2722 


Y02749 


Homo <iaoipn<; 


Hltmon OA^rato/l nrv"if/»in Pn 

xi uiii au Sccreiea proiciu entuueu uy 
gene 100 clone HNFIU96. 


190 




2723 


D38112 


Homo cnnipTiQ 


cyiucnrome c o/uoasc suuunii 1 


*?07 


Do 


2724 


Y86248 


Homo ^ani#*n<; 


nuiiiaii scGreieu proicin nv-ixr roo, 


1 »9 

1 Oj£ 




2725 


Y86248 


Homo ^aniens 


SEQ ID NO: 163. 






2726 


W97293 


Homo ^arii^n*; 


An snnPYin Kinrltncr nmtMn A RP_ 1 ^ 
I\JU aliUCAill OJIlUing prOlCUl ^nnDr" 1 )% 


10^ 

I7J 


fU 


2727 


LT12690 


Homo sapiens 


cytochrome oxidase subunit II 


547 


85 


2728 


V00662 


XiUUUU oapiCllo 


cyxoenrome oxidase i 




oo 


2729 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


110 


56 


2730 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/ooZ. 


124 


75 


2731 


AL359782 


Trypanosoma 

Kn ir*Ai 


probable similar to ring-h2 finger 
protein rnaia. 


113 


57 


2732 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4.71 R 


149 


51 


2733 


AL359782 


Trypanosoma 
bmcei 


probable similar to ring-h2 finger 
protein rhala. 


101 


44 


2734 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encooeu oy gene z oc\i al' invazdu. 


288 


81 


2735 


Z74036 


Caenorhabditis 

f*\t* crane 


contains similarity to Pfam domain: 
jrru 1^71 ^vrOuagen triple neux repeat 
\zu copies//, ocore jo.j, Ju- 
vatue=3.5e-14,N=3 


117 


37 


2736 


G04063 




Unman e£»r*rA+»rl *\i»/vf'al« C ri"\ 

fiuman secrereu protein, oxiv^ iu fSL/. 
8144. 


I J** 


/CO 


2737 


AK024455 


1 LUUIU aajJICllD 


FT TftftfW17 nrntAin 

rj-*juuu t f/ proiem 


1 JO 


ou 


2738 


G00577 


Hnmn canipn c 
nxjlilKj oapjClib 


Human secreceu protein, o.dv£ iu inw. 
4658. 






2739 


AP002460 


rVl aUlUUpolo 

thaliana 


gene ltLr 1 jjy.zo~unKiiown proiein 




on i 


2740 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


358 


69 


2741 


G02832 


Hnmn canipnc 


nuuid.il sccreieu protein, jny iu inu. 
6913. 


14 J 




2742 


L27428 


Homo QanipnQ 




1 10 

1 17 




2743 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


76 


63 


2744 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
eene 22 clone HMSJW1 8 


155 


79 


2745 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

UUIJ. 


114 


51 


2746 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SFOTDNO-16^ 


159 


75 


2747 


G00437 


Hnmn <2nr%ipn<: 

*■ avfiaj v oaJJldiij 


I-Tnmnn Qf»rn=»fpH nrntpin QPO TFi XTO- 

4518. 


10Z 


70 


2748 


W48351 


Hnmn oani^no 
XAV/Uiu 5apidJ±> 


nuiiidii orcdbi cancer reiaieu proicin 
BCRB2. 


Iu/ 


ACS 


2749 


AK024455 


Homo sapiens 


FLJ00047 protein 


165 


69 


2750 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


167 


72 


2751 


M22332 


Homo sapiens 


unknown protein 


208 


46 



201 



WO 01/64835 



PCT/US01/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Identit 










Score 


y 


2752 


R13556 


Homo ^anipn^ 


XiwlGLll CIliAJUCU UVJVTIldU vcUll Ul lUlv *• 


125 


JO 








on co nrotein 






2753 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 


150 


65 








oncoprotein. 






2754 


AF220264 


Homo saDiens 


MOST-1 


130 


52 


2755 


AK024455 


Homo sapiens 


FLJ00047 Drotein 


128 


58 


2756 


AF130051 


Homo saniens 


PRO0898 


162 


71 


2757 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 

iirAJU'i.i Uwiiyui viiiviiujw tjuvuiiiw «^ 


259 


86 


2758 


G03714 


Homo <*finiprw 

XXl/lXIU OdJ^lV'liO 


Human oftrrptftd nrotein ^PO ID NO* 




Jo 








7795. 






2759 


AJ243666 


Homo saniens 


NTCE-5 Drotein 

IIIVJJ J L*iU twill 


118 


84 


2760 


D38112 


Homo saniens 


>JA rjH dehvdrooenase <mhunit 4 


j j j 




2761 


G02482 


Homo ^aniens 


Human ferreted nrotein SFO TDNO- 


1 10 


10 








6563. 






2762 


U 12690 


Homo QflnipnQ 


/» vtTV*TirAm P AVirlstQP Q1lV\llT1lt II 
l/j mUUUiUC UAlUOoC jUUUIUI 11 


7^7 


oo 


2763 


Y73345 


Unmn cflnipnc 


HTPlUf pJaiip 41K75J1 nmfpin cpntipnrp 
n i ivivi i>iunc hjo^oj piuicm sequence. 




^?7 
O/ 


2764 


R13556 




l IUIC1U CUvAKiCU UUWIlbli Cam Ul 1U1U XVA 


1 JO 










An^nn rATP in 






2765 


G02812 




Unman cPf*rATPrl nrAtAin OCA fT*\ XTO* 

riUiiiaii secreieu protein, ocy il/ in\j. 


1^7 
I 


JO 








6911 






2766 


X92485 


X UldlliUUluiil 


uval 


140 


JJ 






V1VOA 








2767 


AF1 13685 


UnniA canipnQ 


PR O007/1 


1^1 

U 1 


Al 
Oj 


2768 


G00178 


numu bapieiib 


Human sccreiea protein, ocxi xxj n\j. 


too 


DO 








AAf\a 






2769 


G03741 


nuiuo sapiens 


fiuman sccrereo protein, oc^ iiy inia 


i *;^f 
I j4 


*71 
/ 1 








7^94 






2770 


X97707 


jrongo pygmaciis 


siopcouon creaiea oy 




o7 






sthpl 1 1 


puMuanscripuoiiaj poiyaucnjriauon 






2771 


Y02671 


Haiti/\ cantAnc 
nuLUv/ aapicno 


riunian secreiea proiem encuuea oy 




^o 














2772 


G00637 


Homo QanipnQ 


Human opprptpd nmtpiri QPO TO MO* 


14S 


ou 








4718. 






2773 


AF090944 


Homo saniens 


PRO0663 


118 

HO 




2774 


L26953 


Hnmr* Qanipn«i 


rhrnm Acnmsil ^rftfpin 
viuviiiii/auiiiai jjiuicui 


1 17 
1 tz 


JO 


2775 


AF090930 


Homo ^arvipn^ 




177 


^0 

J7 


2776 


AK024455 


Hnm r» QanipnQ 
liv/nnj oajjiwiio 


FT J00047 nrotpin 


1 ^J. 
l j*f 


JO 


2777 


R59843 


t-Inm r\ ennipne 
iiuiuu sajjivixa 


Ar»AP4T v7 nrAt*»i»cf> 
xVpujCrrl^AZ piVJlCaoC, 




47 


2778 


W481S1 


xiuinu bapicno 


Unman nroact pan^Ar Tt*\^t&i\ nrntoin 

riuiiidn oreaoi cancer reiaiea protein 


1 A1 


JJ 














2779 


Y02671 


J.1VU1V/ aapicuo 


Human cp/»rptpH nrAtpin ptipaHpH K\/ 
xiujuau ocuiCLcu. piuicui cubuueu uy 


111 


(LA ' 
O** 








aene 22 clone HMSJW18 






2780 


G03172 


Homo ^a"nipns 


Human Rprrpfpd nrotein ^FO TH NO- 


oy 


S6 








7253. 






2782 


D38113 


Pan troplodvte^ 


"MAT^H dehvdro^ena^e suhunit 4 

iiru/ii uwttjrvu vgviisow ouuuiin » 


1 


97 
y t 


2783 


Y02749 


Homo <ianien<; 


Human <»pcreted nrotein encoded hv 

JLlUlilall JbwlvlvU |/lwfcCLLl vUwum J 


124 


oo 








eene 100 clone I1NFIU96 






2784 


AF010144 


Homo sapiens 


neuronal thread nrotein AD7c-NTP 


123 


49 


2785 


U43360 


Perorayscus 


reverse transcriptase 


175 


50 






maniculatus 








2786 


W58700 


Homo saniens 


Human ST-1 nartial ^eouence 


219 


100 


2787 


R96800 


Homo saniens 


Human hi^tioevte-^ecreted factor H^F 


T37 


90 


2788 


D38112 


Homo saniprw 


fvtorhromp r nvida^p ^nhnnit ^ 


519 
ji 


76 
t o 


2789 


G04072 


Hatha cnniAnQ 


Human QpprpfpH nrnfpin QPH TTi MO* 
xxUJUcUl odlClcU piUlCUl, Oily xxV INV-f . 


T04 










8153. 






2790 


AF130089 


Homo sapiens 


PRO2550 


121 


39 


2791 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


63 








7519. 






2792 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


50 



202 



WO 01/64835 



PCT/USO 1/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 








7133 






2793 


Ml 0546 

XYXX KJ*J**\J 


Hom c\ caniAnc 


pvtApHrnm p nYiHa^e I 


153 


R0 


2794 


D381 12 


UnmA <*anif-nQ 
xxvjixivs aapibxia 


rvtoohrnmp c oviHa^e subunit 1 


461 


84 


7795 


AFAQAQ31 


xxuiiiix oopiciio 




114 


76 


7796 


OA37R9 


xxumu ddpiciio 


Wnman cpr-rpfpH nrntein SFO ID NO' 


107 

X \J t 


53 








7R7ft 

rO/U. 






2707 
z /*/ / 


tori 17 

UO 0 1 1 z 


xxuiiio sapiens 


iN/viJxi ucnyurugenase duuuihi x> 


1 14 


7R 
/o 


77QR 


oai7qa i 

vJuj / y\j 


nouiu Sapiens 


xiuman secreieu proicm, »jx>v^ xl/ in^». 


IK 
1 1-* 


71 








7R71 






77Q9 

jz/yy 


OA7515 


XxUfliO Sapiens 


Hitman c*v*n»tpH nr Afpin ^PH TT) NO* 
xiuiiioiJ ocurcicii pi UIC1U, OCV^ JUL/ 1 « w. 


1 13 


79 

ty 














7RAA 


VA1AAA 


riuiiiu Sapiens 


oeC/fcLcu proiein ciiutiueu tijr gcuc io 


1 17 
11/ 


4R 








clnnp HNHFO20 
viLMic xxxn nr \j£*y . 






7RA1 
ZOU J 


YQ7ARS 


r laSIuOuluin 


pval 


07 
y i 


RA 
oo 






\fi\rav 

VlVaA. 








7RA7 

ZoUZ 


Ti3R1 17 
Ujoi IZ 


xiuiiiu bdpiciio 


cyiocm uine c oxiuose suounu j 


471 


RA 
oo 


7R03 


afi iaa51 


riumo sapiens 


ppnnROR 
rxsAJUoyo 


1 (R 

1 1 O 


7R 
/o 


zou-+ 


VJVJJDO 


Homo sapiens 


Human secreted protein, ocki id in*j. 


7ft 

/o 


7Q 








7/117 






7RA5 
zouj 


VA77AQ 
X Uz f ** y 


xiomo sapiens 


TJ« *rv» co/H*ata/i nrn^otn en^A/iAn V*\ t 

tiuman secreiea proiem encoueu oy 


1 A3 


69 








gene 100 clone HNFIU96. 






9RA6 
ZoUO 


nn3Ai 5 


riomo sapiens 


iruman secrcicu proiem, uj inw. 


1A7 
l vz 


67 
oz 














7 RAT 
ZoU/ 


JVUZ*f v 1 


riomo sapiens 


cnoxionic somaiomammoiropm 


*\A3 


RR 
oo 


zoUo 


I lOJZZ 


Corvus 


cytochrome oxidase subunit I 


Jo/ 


£A 

ou 






nnguegus 






jLO\)y 


oai iqa 


nomo Sapiens 


Unman ra^roto/l nfAtotti CDA T 11 ^Tfl* 

xiuman sccrcico protein, ocy xli in v/. 


-;55 


Q3 








JZ/J. 






7R1A 


RA314R 


Ufnmn Q^nipne 
xxuixiv" dapiviis 


Hi imgn npumnal fiiffiprpnrisiHnn 'fortor- 
ixuiiiaii livtUviiai uixAciciiLiauuii iauiv/i*" 


399 


77 














7R1 1 

ZO 1 1 


AF007R76 

r\X v v / OZU 


|-Tr\rn ct <;3nipnc 

X 1 will IS oapiCllS 


VtAY p*r»ci Ion 
UOA CpjIIVJll 


94 

y" 


39 


7R17 

ZO 1Z 


AF0101 Ad 


lxuxxiu sapiens 


IlCUXUILai UXl CdU UIUlCllX r\JL//W**lN li 


770 

ZZV" 


67 


7R13 
ZOU 


III 7600 


nomo sapiens 


cyroenrome oxidase suuuiui ix 


A9S 


RI 

Ol 


7R1A 

ZO I*f 


nn3Ai i 


numu sap ic us 


xiuiriaii secreiea protein, acy xl/ in w. 


1 17 
1 IZ 


4R 
to 








7/4Q7 
/47Z- 






7R1S 

ZO 1 J 


nAV?5R 
vtUJZjo 


riomo Sapiens 


TXnrvton corrotAH nrntoin CCA 1T\ WO 1 

riuman secreiea proiem, ocvj uj nvj. 


107 

1UZ 


75 








7339 






7R16 - 

ZO ID 


tyiri 19 

JLOO 1 IZ 


xiomo Sapiens 


cyiocnrome c oxidase suDunn j 


A3 7 


R3 


7R17 
Zol / 


HA^7Q7 


liOino sapiens 


xiuman secreieo proiem, ocy iix in v. 


143 


OO 








7R7R 






2818 


X55733 


Hftmn <;anipn<: 
X lUiilVJ oapiciio 


initskriAn fi»<*tAT* ATA 
iiiiiatiuii lautui *Ti-> 


458 


80 


7R1Q 
zo i y 


VJUJ OV/ 1 


riUIIlU SapiCILS 


Human c^rrptpH nrnt^m WO TFl NO* 
it iii ii ail sevreicu pioieixx, ocy iu i~ v_/. 


137 


65 








7882 






2820 


D38112 


Homo sanien^ 


rvtorhrome c oxidafie subunit 3 


347 


68 


2R21 


AF010144 


Xiuiiivj sapivito 


npurAnal tbrf^H nrofpfn Ar>7r-NTP 
lie ui ui lai uii cau pi i/iciu r\jL/ 1 v n x x 


94 

yt 


62 


2822 


R 13556 


Wnmn CanipTiQ 

XXLIlUvJ o<Xpiv>ll3 


Protpiri PtirwHeH H a wn otrftfi m of hhn M 

I 111 LO ill v^XX*a>l/UCHJ UVJVVXIOU V>OXXX v»X XxXlV> xvx 


118 


54 








onconrotein 






2823 


AF 130079 

rvx 1 JUU / 7 


riviiuv/ oapi^uo 


PR02852 


143 


54 


2R24 


O0032R 


Xxuiiiv/ oapiciis 


Human cprrptorl nrotAin SFO TT) NO* 
xxuiiiaxx ocuvicu piv/tciix, ocy ix/ nv. 


140 


52 








4409. 






2825 


Y76198 


Homo Qanien<; 


Human <»Rct*eteH nrotein encoded bv 


161 


67 








€i"<*nf» 7S 

£vllC / ^. 






2826 


AF1 19R51 




PRO] 722 


104 

X X/T 


70 


2827 


D49489 


xiv/iiiu oapiciis 


human PS 
iiuxxiau x j 


523 

JX.J 


91 

^ X 


2828 


Ml 0546 

ATX X VrfTA/ 


Homo ^anifin*: 


cvtochrome oxidase I 


206 


81 


2829 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 


138 


42 








oncoprotein. 






2830 


AF090931 


Homo sapiens 


PRO0483 


97 


80 


2831 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


65 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Idcntit 

y 








TOT 1 

7871. 






2832 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


112 


35 






Plasmodium 
vivax 


pval 


103 


43 




KJzUlv 


Homo sapiens 


RpI5-TIA-l. 


152 


54 




uUUjj r 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


153 


59 


ZojO 


W**5Jj1 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


148 


59 


Zo J / 




Homo sapiens 


PKU0483 


149 


81 


ZOjo 




Homo sapiens 


PRO2550 


123 


49 




Ar 090942 


Homo sapiens 


t»t> /"v/\/- ri 

PRO0657 


135 


54 


2o4U 


O03 787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


124 


54 




Ar 090895 


Homo sapiens 


PRO0117 


146 


55 


2842 


AF132200 


Homo sapiens 


PR01751 


98 


74 


2843 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


194 


75 


2844 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


61 


59 


2845 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


171 


37 


2846 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


155 


69 


2847 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


141 


58 


Z54o 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


108 


65 


2845/ 


T T1 £ t\t\ 

U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


441 


77 


2850 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


490 


88 


2851 


K02401 


Homo sapiens 


chorionic somatomammotropin 


488 


90 


2852 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


110 


49 


2853 


U83280 


Leishmania 
donovani 


39 kDa antigen 


131 


47 


2854 


AF090944 


Homo sapiens 


PRO0663 


173 


72 


2855 


■t r r\r\ f f-* 

V00662 


Homo sapiens 


ATPase 6 


392 


71 


2856 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


147 


50 


2857 


Y85062 


Homo sapiens 


Interleukin I converting enzyme 
homologue (ICEL) protein sequence. 


113 


76 


2858 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


178 


84 


2859 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


192 


92 


2860 


Z81068 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF00102 (Protein-tyrosine 
phosphatase), Score=232.1, E- 
value=2.6e-66,N=l~cDNA EST 
yk299h6.3 comes from mis 
gene-cDNA EST yk420b4.3 comes 
from this gene-cDNA EST yk439g6.3 
comes from this gene-cDNA EST 
yk299h6.5 comes from this 
gene-cDNA EST yk420b4.5 comes 
from this gene-cDNA EST yk439g6.5 
comes uum uiis gene 


104 


36 


2861 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


444 


85 


2862 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


168 


61 


2863 


U93564 


Homo sapiens 


putative pi 50 


132 


34 ! 
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SEQID 


Accession No. 


Species 


Description 


Smitb- 


% 


NO: 








Waterraan 


Identit 










Score 


y 


Zo04 


WoU4UO 


Homo sapiens 


; , y . 

A secreted protein encoded by clone 


152 


















vjvj /y / 


riomo sapiens 


riuman secreteo proiein, ocv( li^ 


1Z4 


76 








to /o. 






zooo 


r;fn 179 

KjKJ J 1 / z 


-—_ ; 

Homo sapiens 


riurnan secrete Q protem, oc\i jjj inu. 




46 








79^1 






9R67 


A FOOftO^O 
r U7W/JV 


noniu Sapiens 


rtsxjyjH I o 


101 


ol 


iOUO 


VMK99 
I J vozz 


riomo sapiens 


Xjll'mo-r* Parana/] t~\-vc°\\ c\ 1 1-» otl/>A/iA/l £rr\m 

riuman secreieu proieui encoaeu rrom 


1 17 

11/ 










gene xz. 






2869 




ft/inc miicr>iiliic 


reverse uaiiscr lp utse 


107 


A A 


2870 


U09500 


XlUlllU aoUiCliS 


ujciii uixie u 


oy 1 * 


oo 


9871 




riomo sapiens 


cytochrome b 


£L\ A 

014 


yz 


zo /Z 


Ujo 1 1Z 


Homo sapiens 


NADH dehydrogenase subunit 4 


C 1 c 

516 


87 


ZO 13 


L/JO 1 1Z 


Homo sapiens 


NADH dehydrogenase subunit 4 


4 jo 


o4 




LiJo J LZ 


Homo sapiens 


NADH dehydrogenase subunit 4 


516 


92 


Zo/J 


r juoZZ 


Homo sapiens 


Human secreted protein encoded from 


87 


55 








gene lz. 






Zo /O 


nioi 17 
JJiol 1Z 


Homo sapiens 


NADH dehydrogenase subunit 5 


624 


96 


9R77 


vjUZjJo 


Homo sapiens 


riuman secreted protein, ^fc^f ID NU: 


160 


58 








6619. 






Zfi /o 


i no 1 7/i 
UJZl /4 


Dictyostelrum 


non-receptor tyrosine kinase 


120 


33 






discoideum 








7870 
Zo /V 


r*vooi 17 
L/Jo 1 1Z 


Homo sapiens 


A irase subunit o 


141 


54 


ZooU 


UU.JU43 


Homo sapiens 


* - ^1 ^_ _ _ a ^ ?^ op/\ l|\ T/"^\^ 

Human secreted protein, SEQ ID NO: 


140 


84 








719>t 
/ 1Z4. 






9flft1 

ZOO 1 


Y 144oZ 


Homo sapiens 


Fragment of human secreted protein 


93 


79 








encoded by gene 17. 






ZooZ 


v uuooz 


— = 

Homo sapiens 


Airase o 


1 *>A 

130 


87 


Zoo J 




Leishmania 


probable proteophosphoglycan 


117 


50 






major 








9 c ft/1 


API 1 0ft^l 


Homo sapiens 


"DU/^1 777 

rKUi /Zz 


*7A 

ly 


55 


Zoo j 


Y09AR^ 
AJ7Z*foj 


Plasmodium 


pval 


106 


48 






vivax 


, 






ZOOO 


V uuooz 


Homo sapiens 


ATPase 6 


127 


87 


Zoo / 


PkQ Q 1 1 O 

UjoI 1Z 


Homo sapiens 


NADH dehydrogenase subunit 4 


191 


83 


ZOOo 


VJUj ;o7 


riomo sapiens 


riuman secreted protem, bbC^ ID NU: . 


too 

138 


CO 

58 






— — 


7ft 7 A 
/o/U. 






zoo? 


AlfZ4o3 


Plasmodium 


pval 


107 


63 






vivax 








9 Ron 
zoyu 




Homo sapiens 


— — 

cytochrome oxidase I 


163 


OA 

89 




Aruyuy4z 


Homo sapiens 


PKO0657 


72 


37 


9 ft 09 
zo^z 


\juuh 10 


riomo sapiens 


riuman secreted proteui, obi^ 1L) NU: 


136 


56 








AAQ1 






9R0T 

ZO>J 


UUJ /?u 


riomo sapiens 


riuman secreted protein, oby ID NU: 


1 ^A 

\5y 


66 








/o71. 






9804 




riomo sapiens 


ITT rAOA/f 7 nrnfoin 

rJLJUuu*t'/ protein 


13V 


37 


ZoJO 


AT 7^Q7fi9 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


111 


A A 

44 






brocei 


complete virion genome. 






9RQ£ 

Z0270 




riomo sapiens 


riuman secreted protein, oc\i lu rsu: 


1 oo 

loZ 


62 








001 J. 






2807 




nuiiiu sapiens 


numan secreieu proiein, oily il/ inu. 


14 J 


77 








770^ 






2898 


rv?Ki 19 

J O 1 I Z 


Hatha compnc 


cyiocnrome c oxiuase suounn ^ 


1AQ 

Z4o 


c* 


2899 


V VUUUZ 


iTUiilU SaLJlCIlo 


uivr h ^iN/vuxi uenyarogenase suounitj 


Joo 


oo 


2900 


X07882 


Homo sapiens 


Po nrntefn 


146 


41 


2901 


V00662 


Homo sapiens 


cytochrome B 


517 


91 


2902 


D38115 


Pongo pygmaeus 


cytochrome c oxidase subunit 3 


378 


67 


2903 


AF090931 


Homo sapiens 


PRO0483 


117 


67 


2904 


AF003540 


Homo sapiens 


KrueppeJ family zinc finger protein 


99 


54 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2905 


AF130079 


Homo sapiens 


PR02852 


111 




2906 


D38U2 


Homo sapiens 


cytochrome c oxidase subunit 1 


427 


79 


2907 


AF202635 


Homo sapiens 


PP1200 


140 


66 


2908 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


1 16 

I 1 17 


At 
*# I 


2909 


U63542 


Homo sapiens 


FAP protein 


155 


75 


2910 


D38114 


Gorilla gorilla 


cytochrome c oxidase subunit 1 fCCYTt 


446 


79 
/Z 


2911 


AP002031 


Arabidopsis 
thaliana 


aene id*K3D20 3~ 


1 to 


jj 


2912 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


435 


Ov 


2913 


D38116 


Pan paniscus 


cytochrome c oxidase subunit 1 


469 




2914 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


480 


Ov 


2915 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


488 

too 


K6 
ou 


2916 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


519 


80 


2917 


D38112 


Homo sapiens 


cytochrome c oxidase stihnntt *\ 


177 


ou . 


2918 


D38112 


Homo saoiens 


cytochrome c iwirffl<:p Qiihunit 1 


^19 


rye 

/J 


2919 


D38112 


Homo saoiens 


cvtochmmp c ayiHucp QiiHnriit 1 






2920 


G03797 


Homo saoiens 


Human <;pf^rPtpH nrn+pJn TVS XTfV 

7878. 




OJ 


2921 


V00662 


Homo saniens 




DCS* 


Q9 

o / 


2922 


D38112 


Homo saoiens 


cvtnpJirrMTiP r* oyiHsiqp cuHiinit ^ 

vjriuvmuillv v vAlUdjC ollUuilll D 


DJU 


514 
o*t 


2923 


AF157321 


Homo saoiens 


"XCi IrOa r»rntpin 

JV RJU'd pi IJ LCI 1 1 




Oo 


2924 


G00397 


Homo saoiens 


Human <»pfTPtpH nrotpin ^FO ITS TJTV 

4478. 


1 74 


/ 1 


2925 


D38U2 


Homo sapiens 


NADH dehydrogenase ^nhnnit4 


50^ 


OO 


2926 


U47624 


Xenopus laevis 


alpha(E)-catenin 


243 


94 


2927 


AF010144 


Homo saoiens 


npnronal thrparl nrntpin A TY7 rv/MTP 


i k 




2928 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


476 


78 


2929 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


465 


81 


2930 


G02994 


XJrvTYi/x caniAnc 
IlUIflU cktJJlCIlo 


riuinan secreiea protein, otiv^ ID wut 
7075 


1 CI 

151 


58 


2931 


D16480 


Homo sapiens 


enoyl-CoA hydratase/3-hydroxyacyl- 
ucDyuiT/gcnase aipna~suDunii oi 
trifunctional protein 


345 


67 


2932 


D38112 


XXXJlllW JO^/ICJlJ 


ri/xL/rL Qcnyarogenase suounn 


Zld 


AC 

95 


2933 


AP000616 


OrVTJi <5»tiva 


similar tn T? rXTr^-T-TO "fin o-or nrntoin 
oiiiiiiai lu iviin \j~nz j.mgcr uroicin 

KHAIa f AF07868^ 




oy 


2934 


X83427 


Omithorhynchus 
anatinus 


cytochrome c oxidase subunit 1 


460 


82 


2935 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


527 


83 


2936 


D38112 


Homrt ^anien<; 

X X\JLLk\J OttplCUd 


MADH HpfivHr/vjprmcp cnKunif A 
i^*r\xsxx UGUjrUlUgGLlaov oUUUIUl *t 




oo 


2937 


Y86248 


Hnmn ^aniens 


Human <sf»prptpH nrntpin yPMPPAS 
iiiujiaii ocviviv/VA piuiciii xTv^rir roo, 

SEQIDNO:163. 


1 53 


/ / 


2938 


X15081 


Crithidia 
fasciculata 


MTJRF2 nrotein f AA 1-14JN 


10<\ 


49 
*fZ 


2939 


AF194537 


Homo saoiens 


NAG13 


1 51 


^9 
JZ 


2940 


AJ271872 


Nicotiana 
sylvestris 


extensin 


1fi5 ! 


^7 


2941 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAlafAF078683^ 


134 


91 


2942 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


^X7 


60 


2943 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


509 


79 


2944 


Y86248 


Homo saoiens 


Human <ieerpteo l nrotpin WCWPFfiR 
SEOIDNO-163 


1 08 
lUo 


yj 


2945 


J01415 


Homo sapiens 


MTND4 


429 


76 


2946 


AC010793 


Arabidopsis 
thaliana 


F20B17.16 


99- 


41 


2947 


AF1 16712 


Homo sapiens 


PR02738 


132 


60 


2948 


X89658 


Homo sapiens 


CAP- 18 protein 


136 


49 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


2949 


V00662 


Homo saDiens 


cytochrome oxidase I 


449 


86 


2950 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


438 


82 


2951 


D38112 


Homo ^aniens 


NADH riehvdropenase subunit 4 


363 


76 


2952 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla f AF078683"* 


129 


92 


2953 


X56015 


Crithidia 

OTlCATifiltl 

vIluVUvlU 


NADH dehydrogenase subunit 5 


102 


32 




G00577 


Unmft <%ftmen<i 

X XVUllKJ JQ^/IUIO 


Hitman ferreted nrotein SRO TO NO" 
4658. 


110 


68 


2955 


Y76287 


Homo sapiens 


Fragment of human secreted protein 
encoded bv &ene 20 


345 


78 


2956 


D38112 


Homo sapiens 


cytochrome <c oxidase subunit 3 ! 


340 


71 




man? 

xJ J Ol li 


Homo cantPfiQ 


evtorhrome r oYirfa^e ^lihimit 1 

V» Jf IXJKjlXX UJUw V«« JUUUUll X 


456 


83 






Hnmft QflnipriQ 
nuuiv D<xpiciio 


Hum an secreted nrotein SEO ID NO 

6566. 


102 


76 


2959 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


96 


84 


2960 


D38112 


Homo ^aniens 


cvtochrome c oxidase subunit 1 


400 


70 


2961 


X69978 


Homo sapiens 


XP-G factor 


539 


88 


7Q£7 


AF9ftt6R7 


T-Inm/% canipTic 

noniu sKtpiciio 


nml flptin reoiilatorv element— hind in i> 

protein 


453 


77 




nun 17 
xJj 0 1 1 z. 


TIrtmA car»if*nc 
ITUIIIU oapiCIlo 


rvtorhromp r OYidaQe ^nhnnit 1 


561 






X 00^*rO 


xxvJiiiyj jdjJiciiD 


Human secreted nrotein HCHPF6R 
SEQ ID NO: 163. 


187 

XV I 


92 






xilflllU aajJlvilo 


A c^prpfpd nrrvtein pti coded riv r*1rme 
A ocisit/icu piuibiu wii^uudu vjy \*i\jxi\j 

dh40 3. 


1 19 


63 






T-fnrrin QArvipnc 
xiwiiiv/ aa^iiviu 


mariner rran<inosa<»e 


469 


83 


2967 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ8 1 


130 


53 


2968 


AL359782 


Trypanosoma 

nriiPM 

1/1 Ubvi 


probable similar to ring-h2 finger 

nrotein rha 1 a 

Lfl xJLGlXl i net Id. 


103 


42 


2969 


X92485 


Plasmodium 

VIVOA 


pval 


144 


42 


2970 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6516 


127 


63 


2971 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795 


129 


63 


2972 


Y17221 


Homo sapiens 


Human secreted protein (clone fk317- 
3, 


91 


56 


707-1 




T-Tr\rn r\ Q?ir\iPn<J 


Human secreted nrotein SRO ID NO* 
6613. 


101 


72 


2974 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


130 


68 


2975 


AF130089 


Homo sapiens 


PRO2550 


167 


46 


2976 


V00662 


Homo ^aniens 

J> XXIXXXXf Ou^ilVUd 


cvtochrome oxidase I 


539 


79 


2977 


AP000616 

i*X WW 1 V 


Orvza sativa 


similar to RING-H2 fineer nrotein 
RHAla (AF078683) 


121 


44 


2978 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


126 


72 


2979 


U93568 


Homo sapiens 


putative pi 50 


122 


30 


2980 


M12099 


Mus musculus 


proline-rich protein 


119 


39 


29R1 


73R19R 


miiopiiliic 

ATXlXo lUlUwUllu 


hictnne HI 


117 


36 






f apu ot^i 51 hH iri q 

elegans 


contains cim ilaritv to drosonhila DNA- 

\ M i flUflMiiT dlXXXUUl Itjf *XJ UIUJUUIUIO X^x^rX 

bmding protein K10 (NID:g8148) 


172 


51 


2983 


AC002291 


Arabidopsis 
thaliana 


extensin 


110 


35 


2984 


M10546 


Homo sapiens 


cytochrome oxidase I 


295 


92 


2985 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


171 


65 
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SEQ1D 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


2986 


U 18985 


Womr» QnniPnQ 
1 1, win v sapiciia 


— _ 

tnadin 


410 


o3 


2987 


AF 155232 


Pfcum sativum 


GALCTIXoIIl 


1 50 

1 J7 


A 1 

41 


2988 


X03 145 


Uftnin «inr>ipn<i 
1 1 ui n\j odjjiciia 


pui. \ss\r v 


1 11 


44 


2989 


U 12690 


Worno sar)if*n<! 


\yj UXOllI (JI1IC UaIIuuC oUUUilll XX 




9A 
54 


2990 


G03789 


Horrio sanieric 


Human <zr>rrf*tp*A rvmtp'in QPO TO MO* 
Xa 1X111 Oil oCUiCiCU piUtClIX, OCy 11/ rivA 


177 
IZ/ 










7X70 






2991 


D38112 


Homo saoien"? 


f*vrrtCnrrtnrip /%viHnc<* Qiil^iiTiit^ 1 
t# jr ujv/ixi uiiic c UAiuaoc ouuuxm x 






2992 


V00662 


Uaitia cnnipnc 
rxuiiiis oaxyiciio 


uxxjt h ^iNAUti aenyoxogenase suouniy 


4/1 


75 


2993 


D38113 


P*»n trAtylrvH/frAc 
rail uugiVAjyico 


cyiocnrome c oxiuase suounii 1 


< 1 1 
jl 1 




7QQ4 


tori 17 


jtioino Sapiens 


cytochrome c oxidase subunit 1 


301 


Oft 

79 


7QQS 


tori 17 


riomo Sapiens 


cytochrome c oxidase subunit 1 


5zo 


78 


i*y y%j 




jvicsocriceuis 


cytochrome c oxidase chain I 


634 


82 






aUIatUS 








7QQ7 
L>yy / 


vr/;74.r 

I OOZ*tO 


rioino sapiens 


riuman secreted protein HCrlProoy 


1 CO 

158 


76 








SEQ ID NO: 163. 






OQQQ 

£»yyo 


Y77R 1 /£ 
A / /o lO 


IvaUUS 


T>T> \/Vk«+/» 1 

rK-VDetal 


103 


50 






norvegicus 


: 






7QQQ 


V VJUOOZ. 


nonio Sapiens 


cytochrome oxidase I 


535 


00 
78 




T 7 BOHR 


Nicotiana 


extensin 


1 AC 

146 


0 0 
38 






tabacum 








3001 


TO R 11 7 


fiOEuo Sapiens 


, ~ r— 

NADH dehydrogenase subunit 2 


4V4 


OO 

57 




X O0Z4O 


Homo sapiens 


Human secreted proteui rlCrlrroo, 


11 1 


66 








SEQ ID NO: 163. 






J VAX J 


V67470 


riorno sapiens 


Np70 protein carboxy terminal region. 


in 

117 


AC 

46 




AT 75Q7R7 


i rypanosoma 


probable similar to ring-h2 finger 


93 


iio 

47 






brucei 


protein rhala. 






3oos 




/vraoiaopsis 


extensin 


110 
118 


37 






tnaiiana 








juUD 




Homo sapiens 


Wiskott-Aldrich Syndrome protein 


150 


44 






Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


73 








/oJ7. 






:>UUo 


\l/5/1Q<£ 


Homo sapiens 


Synthetic human type III collagen 


111 


39 














iooq 




riomo sapiens 


riuman secreted pro tern, bbQ ID NO: 


1 AA 

109 


84 








711/1 






3om 


VR£7AR 


~— : 

Homo sapiens 


riuman secreted protem HCHrroo, 


212 


00 
77 














301 1 


AUJ / 1 0 


Pninnc /Initio 


extensin 


izy 


/|o 

4z 


^ni7 

JU1Z 




Homo sapiens 


riuman secreted protein, SKQ ID NO: 


71 


76 








**H J J. 






301 ^ 


77R17R 

LOO IZO 


ivius muscuius 


nistone rii 


1 A>l 

104 


3z 




AjOuIU 


oorgnum oicoior 


hydroxypro line-rich glycoprotein 


1 AO 


34 


Jul J 


I OOZ45 


Homo sapiens 


riuman secreted protein HCrlrroo, 


128 


80 








oJCV^ xX» NvJ.lOJ. 








I j*tUOO 


riomo sapiens 


riisxone rii isoronn rii.j pANv^A- 


00 


34 








icaciive iragmeni ^resiuues oy-zzo «. 






3017 


X556R5 

AJJUOJ 


T vrAnAraiAnn 

JuycopeibivAJu 


exiensin ^cinss i) 


i4Z 


Jz 






fKiPiilpntiim 








3018 


AJ1 33050 


Paniilimc aroiic 
x dUUlllUo OlgUo 


ty luoiiroinc c oaiuosc buounii 1 


175 
1ZD 


71 
/I 


3019 




nomo sdpicnb 


riunidn scctclcu proicLn, onv^ ixj inc 


1 /J 


Oo 








767R 
/o /o. 






3020 


U7JJO" 


riomo sapiens 


putative p 1 JU 


0*7 

yJ 


J / 


3071 


TOR1 1£ 
UjOI ID 


Pan paniscus 


cytochrome c oxidase subunit 1 


55j 


OO 


3077 


VJUvJDJ 


riomo sapiens 


riuman secreiea protein, ori,Q jlu nu. 


1 AH 

147 


7o 








4446. 






3023 


U97674 


Mesocricetus 


cytochrome c oxidase chain I 


632 


82 






auratus 








3024 


AF155232 


Pisinn sativum 


extensin 


181 


42 


3025 


V00662 


Homo sapiens 


cytochrome oxidase I 


567 


85 



208 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


3026 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 










7868. 






3027 


W00838 


Homo sapiens 


Tumour necrosis factor-related gene 


141 


67 








product 






3028 


AF130089 


Homo sapiens 


PRO2550 


149 


55 


3029 


AF090944 


Homo sapiens 


PRO0663 


168 


76 


3030 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


122 


92 


3031 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


507 


91 


3032 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


351 


87 


3033 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


99 


75 


3034 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


113 


85 


3035 


U93567 


Homo sapiens 


Dutative ol 50 


'173 


41 


3036 


X92485 


Plasmodium 


pyal 


122 


41 






vivax 








3037 


AF202635 


Homo sapiens 


PP1200 


128 


67 


3038 


Y02671 


Homo sapiens 


Human secreted nrotsin encoded hv 

A 1UU1UU Jvwl VIVU Ul Vr IwULl VUVVfUwU \J J 


151 










gene 22 clone HMSJW18. 






3039 


D38112 


Homo sapiens 


cvtochrome c oxidase subunit 1 

» J U/VUl W VAlUUOw OUUUU1L I 


415 


74 

/*T 


3040 


D86853 


Catharanthus 


extensin 


198 


40 






rose us 








3041 


M76546 


Helianthus 


hvdroxvDroline-rich nrotein 




47 






arm u us 








3042 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


377 


95 


3043 


S80119 


Rattus sp. 


reverse transcrintase homolo*? 


106 


50 


3044 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


55 








7284. 






3045 


Y36156 


Homo sapiens 


Human secreted nrotein #28 


91 


58 

JO 


3046 


U93574 


Homo sapiens 


Dutative d!50 


177 


49 


3047 


AF063866 


Melanoplus 


ORF MSV233 hvnothetieal nrotein 










sanguinipes 












entomopoxviius 








3048 


D38112 


Homo sapiens 


cvtochrome c oxidase subunit 3 


568 


87 


3049 


G02994 


Homo sapiens 


Human secreted nrotein SEO ID NO* 


144 


63 








7075. 






3050 


G03714 


Homo sapiens 


Human secreted nrotein SEO ID NO- 


144 


80 








7795. 






3051 


AK027208 


Homo saoiens 


unnamed nrotein nroduet 


145 


72 


3052 


G03790 


Homo sapiens 


Human secreted nrotein SEO ID NO* 


166 


75 








7871. 






3053 


AF 118082 


Homo sapiens 


PRO1902 


84 


50 


3054 


G04072 


Homo saoiens 


Human secreted nrotein SFO ID NO- 


lift 

no 


41 








8153. 






3055 


X92485 


Plasmodium 


pval 


132 


52 






vivax 








3056 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


74 








4484. 






3057 


AL390U4 


Leishmania 


extremely cysteine/valine rich protein 


99 


72 






major 








3058 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


627 


89 


3059 


G00332 


Homo sapiens 


Human secreted nrotein SEO ID NO* 


98 


57 








4413. 






3060 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


38 








4770. 






3061 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


159 


80 








gene No. 107. 






3062 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


529 


80 


3063 


AF130051 


Homo sapiens 


PRO0898 


159 


71 


3064 


AF195418 


Musmuscuhis 


ODZ3 


386 


94 


3065 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 


109 


51 
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SEQ1D 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Ideotit 










Score 


y 








cione HCACJ81 






3066 


D38112 


Homo saniens 


T4AOH dehvdropena*^ ^nhunit 2 


338 


80 


3067 


AL3 59782 


Trvnanosoma 


orobable similar to rint*-h2 fint?er 


120 


50 






brucei 


protein rhal a. 






3068 


U 15647 


Mus musculus 


reverse transcrintase 


134 


41 


3069 


D38112 


Homo saniens 


cytochrome c oxidase suhun it 3 


397 


92 


3070 


AF 1 30089 


Homo saniens 


PRO2550 


149 


55 


3071 


U93565 


Homo sanien<; 


niitati ve n 1 50 


143 


37 


3072 


G03133 


Homo <»anien<; 


Human secret f»H nrotein ^FO lO "NO* 


1 1? 

1 XA' 


45 








7214. 










UArMn comAno 

nuxnu ^apiciis 


PRO0478 


1 7S 
I jD 


/O j 


3074 


AF 130089 

ill IJUUQ7 






1 S6 


SR 


3075 


Y86472 


xx\sixhj □apiciid 


HiiTYiiiTi ctatip 59«.*»ti/*oo i pH T^rotp»in 


OKI 


03 








fraompnt *\FO IT) "WO ^87 






3076 


1183780 


JUClblllllailia 


J7 KX-'a allUgcn 


OR 


3 1 






Hon ovarii 






3077 


D38112 


Homo oani^nc 

1 1UI11U dcl|IICJl) 


**vtochromp /* oviHncf* cnhiinit "X 


fi?6 


OO 


3078 


AF1 16712 


Homo <isinif"r»<i 

XX\JklX\J oaiJl\tHo 


PR 027^8 


1 14. 


5Q 


3079 


U42471 


IvTiiq nrinc/Miluc 


\A/icr*ott- AlHriph SvnHmni<* nrof^rn 
vv liUAi-rtiui i^u oy&iuiuiiiG Luincuj 


1 19 










liomoloar 






3080 


D38112 


Homo ^aniens 




AOO 


ou 


JwO 1 


TORI 19 


noEQo sapiens 


rs/\uri aenyorogenase suounii h 


DZj 


oo 


3082 


r>3xi i? 

L/ J 011x> 


rnjixiu sapiens 


CyiUUIll OlDC C OAIOaoC SUDlulll J 




70 

Iy 


JvOJ 


TIA9471 


IVJLUb U1U5CU1U5 


w iscoLW\iancn oynarorne protein 


1 sn 


*fl 








homnloty 






3084 


Y86248 


Homo onntArtQ 
nvuiv/ dapiwia 


Hnrnnn QArr^t^d nrotpin HP HPF f\ R 
nuiJiaii dccicicu pruicui nv^nrroo, 


101 


07 














3085 


U 12690 


Homo saoiens 


cytochrome oxidase summit Tl 


537 


78 


30K6 


H3R1 19 


I-J/xfY^ f\ conionc 

IavJOIU SopivQS 


/\irase suDunii o 




30 


^0R7 


YRAO/11 
A.50U3 1 


oUS SCiOIH 


homologue of proline/arginine-rich 


1 A/I 
lU4 


57 








anuoacieriai protein 






3088 


K"ft99A7 


ivallUi* 


proiinc-ncn protein 


IAS 


/li 














3089 


WtOJJ X 


flOIflU Sapiens 


riurna/i urease cancer reiaicci proiem 




HO 








XJ \^X\J-J4, » 






3090 


V00662 


Homo c^niPTiQ 

XXKJlllU OajJlWJO 


ovtoi*hromp ovidacp T 




RA 
o*f 


3091 


Ml 8094 

XVX K \j\J S T 


X M LCU>Vy VI 119 


hvHmyvnrnltn^-ri^h oJwvmrotpin 




*f3 






vulparK 

V uigcu io 








3092 


D38113 


Pan troplrxivte*; 


cvtochrome c oxidase summit 1 


392 


59 


3093 


D3JU 16 


Pan nanicr*iic 


rvt/vhrAmp c oviHiicp cuKiinit 1 




RA 
OH 


3094 


D38112 


Homo «?flnif*n^ 


pvtor* M mm p- c oxida<*f» nihtinit 1 


471 


76 


3095 


R63935 


Hrtmo ennipne 
XHJlilU bapicuo 


nyJ<I neural thrpaH nrotptn HRd 
\_»i>to iiciuai uiicau piuicui not. 


joy 


QR 
76 


3096 


L26953 


Homo QHrtif n q 
a avjiiiv/ Dapiv/iio 


fKiromocomnl nrot^in 
V/iiivjiiiUoUiiiai pi viciii 


1 1S 
1 1 j 


57 


3097 


G00591 


Hr\mo ci>nif*nc 
riuiiiu bapiwio 


Human c^nrptAd nrotAin QPO 1V\ NH- 
nuiuaii acucicu pi \j ocy xxv inv/. 




3 / 








4672 






3098 


Y86248 


Homo cjmif*riQ 


Human secreted nrotein HPHPFfiR 

mil iiwii jcuvim uiuicui xx\*jlxx x uo. 


159 


01 
^ l 








^FO ID "NO* 161 

oi_»v^ XXJ liV/.lUJ. 






3099 


009755 


Hr«Tirt canlpnc 

nuinu bapicnb 


Hitman CA/*rAt*v1 nrotAin QFO TPi XTPi* 

xi Hindu bccrcixxi protein, oxz,\£ iu 


1 J J 


70 














3100 


AO006233 


A rahiHonciQ 
r\l au luupolo 


hvnothptical nrotein 


65 








fh a liana 








3101 


AF289098 


flaHrsictiQ 
v/iauiaoiio 




107 

lv/ 


^5 






fcpntnlrpa 

ftVAllUIVGd 








3102 


AF155232 


Pisum sativum 

X UUUl «J 114,1 ▼ Ulll 


extensin 


142 


43 


3103 


AF130089 


Homo saoiens 


PRO2550 


122 


81 


3104 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


143 


80 








SEQIDNO:163. 






3105 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 


104 


59 








sequence. 







210 



seq n> 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


3106 


Y86248 


Homo saDiens 


Human secreted nrotein HCHPF68 
SEQIDNO:163. 


112 


f\j 


3107 


Y86248 


Homo saoiens 


Human secreted nrotein HCHPF68 
SEQ ID NO: 163. 


117 


79 


3108 


D38112 


Homo saoiens 


NADH dehvdroeenase subunit 4 


334 




3109 


U43627 


A rabidonsis 
thaliana 


extent in 

VAvUUJ 111 


112 




3110 


AP000616 


Orvza. sativa 


similar tA RrWfi-H9 finper nrotein 
RHAla CAF078683^ 


1 18 

1 io 


inn 

luv 


3111 


AF1 30089 


Homo sapiens 


PRO2550 


137 


37 


3112 


Y86248 


Homo sanien^ 


Human secreted nrotein HCHPF6R 
SEQ CD NO: 163. 


220 




3113 


A18812 


Rra^sioa naroi<i 




1Z.O 


19 


3114 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

OXvV( 1*-/ 1NV/. lUJ- 


176 


63 


3115 


G03172 




T-Tiiman c *v t*£»rl rr»tr>i n QPf^ 1 1 XTfV 
jnniiicui ocuictcu jjiv/icm, ouy jll/ nu. 

7253. 






3116 


X92485 


Plasmodium 
vivax 


pval 


148 


45 


3117 


D38112 


T-T rxrri r\ canipnc 
HUI IL\J oauiciio 


r*\rf A/Hirnm A aviHopo cniniinit 1 1 

c_y lociiruinc u oxiuobts suDunit 1 


j /** 




3118 


U23172 


Caenorhabditis 


similar to myoblast cell surface antigen 

fcp.peoyi UTT1V/rA"M P719/I/r\ anH n 
illCldllUgailCl 1NO-UU-U alioICIlt r\. (JrULClll 

(PIR:JH0162) 


125 


41 


3119 


U43627 


AraHidonsis 
thaliana 

UUU1UUU 




iz.y 


19 


3120 


AF205385 


Pan troglodytes 


NADH dehydrogenase subunit 5 


171 


82 


3121 


Y12950 


Homo QaniPTiQ 


rVllllllU ai/lU aCl|U.CliCC SJl a 11 Lull all 

secreted nentide 


Oft 


so 


3122 


AF130051 


Homo saoiens 


PRO0898 


134 


89 1 


3123 


M81321 


Macaca 
fascicularis 


nroline-rich nrotein 


158 


48 


3124 


U93563 


Homo sapiens 


nutative n 1 SO 


126 


32 


3125 


D38114 


Gorilla gorilla 


NADH dehvdrnpenav suhnnit 1 fNT)]^ 


337 


92 


3126 


AF130089 


Homo sapiens 


PRO2550 


155 


77 


3127 


Y86248 




Human <f*rrt*tf*A nrnfpin T-l/^T-IPFfiR 
XI. ul 11 all scurcicu. pruicm riv^rix r oo, 

SEQIDNO:163. 




/o 


3128 


AF090901 


Homo *\anieti<% 




1V7 


^i 


3129 


AF003736 


Murine leukemia 
virus 


reverse transcriptase 


164 


40 


3130 


U25281 


Rattus 
norveinciis 


SH3 domain binding protein 


88 


39 


3131 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO* 163 


116 


71 


3132 


D86853 


Catharanthus 
roseus 


extensin 


110 


33 


3133 


B03148 


Homo saoiens 


Human neuronal differentiation fartor- 
1 (NDF-1). 


506 


76 


3134 


V00662 


Homo saoiens 


cytochrome oxidase T 

\djf IUWU VUiw UAlUiUV< X 


583 


88 

oo 


3135 


Y36366 


Homo sapiens 


Fragment of human secreted protein S 
encoded hv Pene 3 


103 


68 


3136 


AJ242540 


Volvox carteri £ 
nagariensis 


hvdroxvnroline-rich fHvconrntein TYZ^ 

HRGP 


282 


6(1 

\}\J 


3137 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


119 


54 


3138 


AF1 18082 


Homo sapiens 


PRO1902 


118 


51 


3139 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


270 


87 


3140 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


103 


48 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 


3141 


D38112 


Homo sapiens 


NADH dehydrogenase subunit5 


353 


74 


3142 


G02485 


Homo sapiens 


Human secreted nrotein. SEO ID NO* 
6566. 


126 


77 
1 1 


3143 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


115 


72 


3144 


AF090930 


Homo sapiens 


PRO0478 


138 


73 


3145 


AF155232 


Pisum sativum 


extensin 


110 


34 


3146 


M69008 


Homo sapiens 


alpha- 1 type XIII collagen 


107 


37 


3147 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


103 


61 


3148 


D38112 


Homo sapiens 


cvtochrome c oxidase subunit 1 


505 


84 


3149 


M77194 


Rat leukemia 
virus 


do lvm erase 


167 


15 


3150 


AP002543 


Arabidopsis 
thaliana 


eblAAD23015 l~£ene id*F15M7 16— si 
milar to unknown protein 


105 


31 


3151 


AK024455 


Homo sapiens 


FLJ00047 protein 


109 




3152 


K02576 


Homo sapiens 


salivarv nroline-rich nrotein 1 

1 1 ▼ *** J T** V/ AAl K\J I IVAI fJl WWII 1 A 


89 


39 

J7 


3153 


J04794 


Homo sapiens 


aldehyde reductase (EC 1.1.12) 


137 


58 


3154 


G03714 


Homo saniens 


Human secreted nrotein SFO ID NO* 
7795. 


115 


7ft 


3155 


AF010144 


Homo sapiens 


neuronal thread nrotein AD7c-NTP 


160 




3156 


X92485 


Plasmodium 
vivax 


pval 


104 


50 


3157 


AF194537 


Homo sapiens 


NAG13 


207 


W 


3158 


AF063693 


Mus musculus 


tvoe XIII collagen 


104 


17 i 


3159 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHA1 a (AF078683} 


130 


91 


3160 


AF020191 


Mus musculus 


proline-rich protein 13 


107 


51 


3161 


AF 194537 


Homo ^aniens 


NAG 13 


17A 


76 


3162 


G02535 


Homo saniens 


Human secreted nmtein SFO TF> MO* 

6616. 


mi 


41 
HI 


3163 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


161 

Iwl 


58 


3164 


K03205 


Homo sapiens 


salivarv nroline-rich nrotein nrecursor 


139 


46 


3165 


U93570 


Homo sapiens 


Dutative nl50 


151 


68 


3166 


Y 19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


144 


71 


3167 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


147 


66 


3168 


L27428 


Homo sapiens 


reverse transcriptase 


94 




3169 


X97675 


Homo sapiens 


plakophilin 2b 


103 


76 


3170 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0:I63. 


115 


54 1 


3171 


AK024455 


Homo sapiens 


FLJ00047 protein 


98 


61 


3172 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


146 


38 


3173 


D38U6 


Pan paniscus 


NADH dehydrogenase subunit 4 


467 


81 


3174 


X92485 


Plasmodium 
vivax 


pval 


88 


69 


3175 


L26953 


Homo sapiens 


chromosomal protein 


124 


78 


3176 


AF042169 


Homo sapiens 


putative ATP-dependent mitochondrial 
RNA helicase 


223 


95 


3177 


AF130079 


Homo sapiens 


PR02852 


96 


44 


3178 


AF2732I7 


Mus musculus 


cell proliferation related protein CAP 


97 


36 


3179 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


90 


69 


3180 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


146 


78 


3181 


AF1 18082 


Homo sapiens 


PRO1902 


150 


58 


3182 


U60315 


Molluscum 
contagiosum 


MC107L 


101 


44 



212 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identic 
v 






virus subtype 1 








3183 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


148 


61 


3184 


U93565 


Homo sapiens 


putative pl50 


265 


58 


3185 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


117 


79 


3186 


U87607 


Rattus 
norvegicus 


putative RNA binding protein 1 


167 


39 


3187 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


160 


63 


3188 


S80119 


Rattus sp. 


reverse transcriptase homolog 


152 


49 


3189 


X05300 


Rattus 
norvegicus 


ribophorin I 


122 


46 


3190 


B01372 


Homo sapiens 


Neuron-associated protein. 


206 


80 


3191 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


130 


83 


3192 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


135 


80 


3193 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and prolme-rich 


98 


34 


3194 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


140 


87 


3195 


AF022985 


Caenorhabditis 
elegans 


Similar to collagen 


97 


40 


3196 


D38112 


Homo sapiens 


cytochrome c oxidase summit 1 


452 


81 


3197 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


110 


58 


3198 


X99452 


Lycopersicon 
esculentum 


extensin-like protein Dif54 


100 


36 


3199 


D86853 


Catharanthus 
roseus 


extensin 


119 


36 


3200 


AF025424 


Rattus 
norvegicus 


RNA polymerase 1 127 kDa subunit 


215 


74 


3201 


U93563 


Homo sapiens 


putative p!50 


359 


77 


3202 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


112 


37 


3203 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4581. 


153 


84 


3204 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


128 


39 


3205 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


154 


76 


3206 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


127 


64 


3207 


X97675 


Homo sapiens 


plakophiiin 2b 


142 


51 


3208 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


91 


58 


3209 


AB026512 


Ecnomiosa sp. 
Ecnl 


cytochrome c oxidase subunit I 


254 


62 


3210 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


163 


59 


3211 


D38112 


Homo sapiens 


ATPase subunit 6 


447 


78 


3212 


V00662 


Homo sapiens 


ATPase6 


482 


83 


3213 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


151 


66 


3214 


D38112 


Homo sapiens 


ATPase subunit 6 


400 


72 


3215 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


569 


88 


3216 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


472 


82 


3217 


AF090930 


Homo sapiens 


PRO0478 


96 


64 


3218 


G03172 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


46 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


Identit 

y 








7253. 






3219 


AF006061 


Homo sapiens 


placental growth hormone isoform 
hGH-V3 


290 


68 


3220 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


115 


63 


3221 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


441 


71 


3222 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


111 


59 


3223 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


497 


81 


3224 


L26953 


Homo sapiens 


chromosomal protein 


95 


59 


3225 


U83280 


Leishmania 
donovani 


39 kDa antigen 


117 


90 


3226 


AF239615 


Manihot 
esculenta 


CRANTZ hydroxyproline-rich 
glycoprotein 


124 


43 


3227 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 4 


442 


74 


3228 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


172 


67 


3229 


X92591 


Mus musculus 


transcription factor 


109 


43 


3230 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


81 


54 


3231 


D38U5 


Pongo pygmaeus 


cytochrome c oxidase subunit 1 


476 


72 


3232 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


100 


54 


3233 


U96416 


Dennyus 
distinctus 
timjonesi 


cytochrome b 


95 


40 


3234 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


161 


86 


3235 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


101 


40 


3236 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


149 


75 


3237 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


532 


76 


3238 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


576 


89 


3239 


U93567 


Homo sapiens 


p40 


155 


32 


3240 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


79 


3241 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


145 


65 


3242 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


76 


62 


3243 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


138 


69 


3244 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


115 


39 


3245 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


138 


54 


3246 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


153 


69 


3247 


U58736 


Caenorhabditis 
elegans 


Similar to curicular collagen 


107 


37 


3248 


U87607 


Rattus 
norvegicus 


putative RNA binding protein 1 


148 


45 


3249 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


113 


85 


3250 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


166 


61 


3251 


U15183 


Mycobacterium 
leprae 


proline-rich antigen 


109 


40 


L3252 


AF 130089 


Homo sapiens 


PRO2550 


151 


.73 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


3253 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


131 


58 


3254 


Y86248 


Wnmn <j?inififi? 


Human opra-Pter! nrntpin WPHPFr»8 
xxiiiuau dCvivtcu L/i etc ill nbi xi x iso, 

SEQIDNO:163. 


162 


Ov 


3255 


AL3 59782 


1 1 yuaiivo villa 

brucei 


nrnKaHlp simitar tn rimx-h*!? frn&pr 
n rote in rhal a 


108 


57 


3256 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


156 


88 


3257 


Y86248 


Hftmn <;anipn<l 

IIU1I1U oa^/ltiUUJ 


Human <&rre\pA nrntpin HPHPFfiR 
SEOIDNO*163 


206 


7J 


3258 


Y36203 


Homo sapiens 


Human secreted protein #75. 


111 


77 


3259 


W03642 


TTnmn <ianiprK 


Hum an f^annnhinniH f»PR "M-tprmrnal 

XXXXXxXOXm WiiAlJdUAilV/m VJX IV. IN LCI III ill C* I 

sequence. 


98 


57 


3260 


D381 12 


Wnmn canipnc 

XL UIX l\J OQ^J Iv^lio 


pvtnfhrnmp n /yv5Hc»cp ciihnnit 1 

V. J UA4JU UillC K, UAiUaoC jUULUllt 1 


345 


67 


3261 


Y86248 

< 


Homo sapiens 


Human secreted protein HCHPF68, 

OJDy 1U INU. JO J. 


97 


56 


3262 


L38908 


Nicotiana 


extensin 


155 


40 


3263 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
cncoucu oy gene jj. 


95 


70 


3264 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


198 


87 


3265 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


115 


69 


3266 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/ ooo. 


158 


59 


3267 


I jujuu 


XIOIUU bapiCIlo 


riaguiciH xJx uuiiiaii bcuicicu pruiciu 

encoded by gene 3. 


ins 








T-Tomn cimipnc 
flULUU oa.JJlCllo 


"Human cpprpfpH nrntpin T-JPt-TPRfift 
xuiuMU 2>ouicicu LviuLcui nv^nrruo, 

SEQ ro NO: 163. 


1 so 




3969 


roan? 


T-Tnmn csinipnc 


TvFAT"*IW HpttvnVnopn oca ciihiintt 1 
INr\L/0 UCUjf Ul UgGUaaw MIL) LIU 11 i 


JLMxM 


QS 


3270 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

OHy xxj INV/.lvJ. 


138 


73 


3271 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


90 


3272 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


193 


95 


3273 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encuueu oy gene 3. 


111 


52 


3274 


Y8624& 


Wnmn caniprio 

JLLU11I4J CWJjJlCllo 


Unman cprrptpn 1 nrntpin T-JPHPP^R 
xxiim<m dcoicicu yisji&ixi xxk^xxx ryo, 

SEQIDNO:l63. 


14ft 

1HO 


o / 


3275 


Y86248 


liuiuu oa^Jiciio 


TTiiman cpprpfpH nrntpin T-TPHPFAR 
ij.il hi <ui ocu ctcu luuiclii nunr ruo, 

SEQCDNO:163. 


1r»l 

1U1 


ou 


3276 


S67928 


X 1\JIU\J aajJlCllo 


PRRIft/f nrntpin nnprtiininr 

.T IVU 1 lvl JJi \J ICUl LJX d/Ul jUI 


104 


^4 


3277 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913 

V7 1J . 


154 


55 


3278 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


166 


100 


3279 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO- 163 


212 


91 


3280 


Y86248 


Homn *umien<! 


Human secreted nrotein HCHPF68 
SEO ID NO * 163 


118 


50 


3281 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SFO TDNO-163 

OUy XXJ liV/.lUJ. 


130 


46 


3282 


Ml 0546 


Homn ^anipn<5 


rvtnphmmp nviHs^p T 


303 


95 


3283 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


455 


81 


3284 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


502 


83 


3285 


AF1 18086 


Homo sapiens 


PR01992 


99 


88 


3286 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


489 


83 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 

y 


3287 


V00662 


Homo saDiens 


URF 1 OJADH dehvdroeenase summit^ 


485 


Oo 


3288 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


124 


73 


3289 


AL359782 


Trypanosoma 
brucei 


possible (hh v-6) u 1 1 02, variant a dna, 
complete virion genome. 


158 


60 


3290 


X92485 


Plasmodium 
vivax 


Dval 


99 


58 


3291 


AF1 16712 


Homo sapiens 


PR02738 


102 


51 


3292 


AF1 19851 


Homo sapiens 


PR01722 


99 


86 


3293 


AF090930 


Homo saoiens 


PRO0478 


141 


79 

/Z. 


3294 


G03798 


Homo saoiens 


Hum an QprreteH nrotein SPO TT") MO* 

AXLLLliCLLl OwvlvlvU l/lvlvlU) vj1_jV^ mJLJ IiV/, 

7879. 


1 SI 

LJ 1 


OO 


3295 


M64793 


Rattus 
norvegicus 


salivarv nro line- rich nrotein 


107 


HI 


3296 


AK024455 


Homo saoiens 


FLJ00047 nrotein 


07 




3297 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874 


156 


60 


3298 


AC007654 


Arabidopsis 
thaliana 


T19E23.7 


97 


49 


3299 


Y17221 


Homo sapiens 


Human secreted protein (clone fk3 17- 


96 


44 


3300 


AJ243905 


Caenorhabditis 
eleoans 


SF1 protein 


104 


42 


3301 


AF000298 


C!»enorhahditi<; 

VuvUVI UvUs VAUIO 

elegans 


u/pali* ciroi laritv tn rvil In optic* olvr*inA_ 

and oroline-rich 


yo 


HI 


3302 


X89453 


Rattus 
norvegicus 


DRPLA 


OJ 


OJ 


3303 


U10099 


Homo sapiens 


POM-ZP3 


227 


52 


3304 


D38112 


Homo saoiens 


cvtochrome c oxidase <*iihiinit ^ 


55^ 
j j j 


RO 

07 


3305 


L27428 


Homo saoiens 


reverse rra n vxi nla q p 

l^VWLOb U.CUlOX/1 l^SlClOv-' 


144 




3306 


D38112 


Homo saoiens 


cvtochrome c oxida^p ^nhiinit *\ 


JOO 


71 


3307 


AF2 17374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


439 


78 


3308 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


'439 


82 


3309 


D38112 


Homo ^anien^ 


nvtfifihromft c oviHacp ciiKiinif" 1 

V/JrlUUlll UUJ6 V OUUUllJLt- 1 


409 


oo 


3310 


G00352 


Homo saoiens 


Human cecretpri nrotein ^FO TD MO* 

4433. 


OJ 




3311 


X53581 


Rattus 
norvegicus 


ORF4 


140 




3312 


V00662 


Homo sapiens 


cytochrome B 


487 


88 


3313 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


451 


80 

Ow 


3314 


V00662 


Homo sapiens 


cytochrome oxidase I 


524 


88 


3315 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunift 


518 


87 

o / 


3316 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


115 


50 


3317 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


356 


69 


3318 


U 12693 


Womo ^aniftn<5 




477 


to 


3319 


Z67990 


Caenorhabditis 

elegans 


contains similarity to Pfam domain: 

PF0 14514 rMpmatnHp nirirlp mllncpn 
N-tprrninal domain^ Score= i 77 5 K- 
value=0.0001,N=l 


75 


40 


3320 


GO 1931 


Homo <;anien<% 


Human <%pcrPted nrotein SFO TO "NO* 

xi mucin ovvi^ivu pivivuX| »jj-»v^ ii*/ 

6012. 




75 


3321 


AF090930 


Homo sapiens 


PRO0478 


166 


65 


3322 


X92485 


Plasmodium 
vivax 


pval 


104 


44 


"3323 ' 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


513 


80 


3324 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


483 


79 



216 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 

Waterman 

ocorc 


% 

Identit 

y 


3325 


AF130089 


Homo sapiens 


PRO2550 


121 


Rft 


3326 


AC026234 


Unknown 


Contains weak similarity to an 
unknown protein T23E18.5 


191 


fits 


3327 


V00662 


Homo sapiens 


cytochrome B 


439 


72 


3328 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


316 


75 


3329 


AF091711 


Homo sapiens 


splice variant AKAP350 


106 


52 


3330 


U93565 


Homo sapiens 


putative pl50 


125 


45 


3331 


AF1 19851 


Homo sapiens 


PR01722 


88 


72 


3332 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


TO/ 


o 1 


3333 


V00662 


Homo sapiens 


cytochrome oxidase I 


544 


90 


3334 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


47 S 


Rft 


3335 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded bv £ene 17 


168 


66 


3336 


AF146688 


Takifugu 
rubripes 


serine-threonine kinase 9 


101 


43 


3337 


G03801 


Homo sapiens 


Human secreted nrotein SEO ID NO* 
7882. 


101 
Ivl 


j\) 


3338 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


110 


50 


3339 


V00662 


Homo sapiens 


cytochrome oxidase I 


458 


Rft 
ou 


3340 


K02401 


Homo sapiens 


chorionic somatomammotropin 


209 


97 


3341 


G00412 


Homo sapiens 


Human secreted nrotein SEO JD NO* 
4493. 




S9 


3342 


X65165 


Volvox carteri 


extensin 


146 


41 


3343 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


1 19 


SI 


3344 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


559 


84 


3345 


U12690 


. Homo sapiens 


cytochrome oxidase subunit II 


508 


RQ 


3346 


Ml 1897 


Mus musculus 


proline-rich salivary protein 


96 


41 


3347 


U92455 • 


Mus musculus 


WW domain binding protein 7; WBP7 


113 


JO 


3348 


U83280 


Leishmania 
donovani 


39 kDa antigen 


105 


40 


3349 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


113 


38 


3350 


D38112 


Homo sapiens 


cytochrome c rotiHasfc ^nhiinif 1 

VJ iUVUl UllLv \* vAiUllOw out/ Mil it 1 


549 


70 i 


3351 


B03148 


Homo sapiens 


Human neuronal differentiatinn ftrtnr- 
1 (NDF-1). 




RR ! 
oo 


3352 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


117 


85 


3353 


A18812 


Brassica napus 


extensin 


103 


31 


3354 


M10546 


Homo sapiens 


cytochrome oxidase I 


125 


78 
/ o 


33S5 


M81321 


Macaca 
fascicularis 


proline-rich protein 


107 


44 


3356 


M10546 


Homo sapiens 


cytochrome oxidase I 


284 


90 


3357 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


513 


80 


3358 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


541 


79 


3359 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


543 


87 [ 


3360 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


563 


86 


3361 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
787L 


179 

LIS 


57 


3362 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


125 


62 


3363 


AF016099 


Mus musculus 


endonuclease/reverse transcriptase 


151 


46 


3364 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


569 


82 


3365 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


102 


40 


3366 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


143 


76 


3367 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6490. 


145 


84 



217 



WO 01/64835 PCTYUS01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 

iM;ore 


% 

Idcntit 

y 


3368 


D38112 


Homo sapiens 
-X 


cytochrome c oxidase subunit 1 


587 




3369 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


549 


82 


3370 


AL035526 


Arabidopsis 
thaliana 


ex ten sin -like protein 


130 


7Q 


3371 


J04543 


Homo sapiens 

_E 


synexin 


101 


40 


3372 


Z29573 


Didelphis 
virginiana 


cytochrome c oxidase subunit 3 


154 


71 


3373 


U93568 


Homo sapiens 


putative pi 50 


176 


59 


3374 


L28748 


Bos taurus 


putative 


146 


fn 
\j / 


3375 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


512 


75 


3376 


D38112 


Homo sapiens 


cytochrome c oxidase ^uhimft 1 




/o 


3377 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


135 


51 


3378 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain 1 


557 


82 


3379 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SE0IDNO163 


120 


35 


3380 


X92485 


Plasmodium 
vivax 


pval 


194 


Jo* 


3381 


G02538 


Homo sapiens 


Human ^ecretfiH nrntpin RPO TFl NO* 

6619. 




7Q i 

/o 


3382 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


108 


67 j 


3383 


AP000616 


Oryza sativa 


similar to RTNG-H7 finder nrntpin 

RHAla(AF078683) 


1 1 s 


OO 


3384- 


X98296 


Homo sapiens 


uhinuitin hvdrolaop 


91£ 

A JO 




3385 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


494 


75 


3386 


Y86472 


Homo sapiens 


Human pene S9-enrnH<»H nrntpin 

fragment, SEQ ID NO:387. 


I DD 


OO 


3387 


AL359782 


Trypanosoma 
brucei 


nrobable similar tn rtno-W) finow 
protein rhal a. 


1 1£ 

1 io 




3388 


W73624 


Homo sapiens 


Human secreted nrntein elnnpflwQ7' 1 


777 


OA 


3389 


G03807 


Homo sapiens 


Human secreted nrntein ^FO TH WH* 

7888. 


141 


/O 


3390 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


73 


61 


3391 


U35730 


Mus museums 


ierkv 


1 S7 




3392 


U54788 


Mus musculus 


Wiskott-Aldrich Svndrome Protein 


100 

lwu 


41 


3393 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


483 


77 


3394 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


539 


XI 


3395 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


608 


87 


3396 


X71602 


Nicotiana 
tabacum 


extensin 


104 


jj 


3397 


L26953 


Homo sapiens 


chromosomal protein 


100 


Ovr 


3398 


AF197832 


Cyanocitta 
cristata 


cytochrome oxidase I 


488 


76 


3399 


X53581 


Rattus 
norvegicus 


ORF4 


140 


42 


3400 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


597 


85 


3401 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


656 


89 


3402 


L26953 


Homo sapiens 


chromosomal protein 


117 


VfJ 


3403 


AF052298 


Drosophila 
silvestris 


Y box rjrotein 


114 


70 


3404 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


133 


53 


3405 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


151 


65 


3406 


D38112 


Homo sapiens 


cytochrome c oxidase subunit I 


277 


82 


3407 


U01849 


Trypanosoma 


ORF2 


94 


30 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






brucei 








3408 


G00328 


Homo sapiens 


Human qecretejrl nrotein SEO ID NO* 
4409. 


115 


65 


3409 


Y86248 


Homo sapiens 


Human secreted nrotein HCHPF68 
SEQID NO: 163. 


189 


86 


3410 


U93570 


Homo sapiens 


putative pi 50 


233 


52 


3411 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


104 


44 


3412 


AF090944 


Homo sapiens 


PRO0663 


97 


56 


3413 


X61048 


Hydra sp. 


mini-collagen 


128 


44 


3414 


AF 118086 


Homo sapiens 


PR01992 


128 


81 


3415 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


145 


61 


3416 


AF1 16695 


Homo sapiens 


PR02221 


169 


69 


3417 


U31778 


Human 

nani 1 lorn a virus 
type 20 


nntaKvp 






3418 


X92485 


Plasm od him 
vivax 


pval 


97 


54 


3419 


U12690 


Homo •^aniens 


cvtochromp oxida^f* «%iiftniiit 11 


195 

1 y -J 


94 


3420 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
pene No 1 07 


144 


69 


3421 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


139 


71 


3422 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6490. 


111 


63 i 


3423 


X15917 


Paramecium 
aurelia 

tun vim 


ORF4 protein (AA 1-156) 


93 


41 


3424 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


99 


69 


3425 


G00333 


Homo saDtens 


Human secreted nrotein SEO TD NO* 

M- 1.11 1 J Hill OvVlvlvU Lst vlwlll] iJLfV/ A X-/ llv/. 

4414. 


1 15 — 


60 


3426 


Y14486 


Homo sapiens 


cvtosolic serine 
hvdroxvmethvltransfetase 


196 


61 


3427 


U93569 


Homo saoiens 


putative p 1 50 


110 


44 


3428 


G03415 


Homo saDiens 


Human secreted orotein SEO ID NO* 
7496. 


120 


67 


3429 


ABO 12223 


Canis familiaris 


ORF2 


169 


44 


3430 


X97675 


Homo sapiens 


plakophilin 2b 


118 


60 


3431 


AF220264 


Homo sapiens 


MOST-1 


153 


72 


3432 


X64173 


Zea 

diploperennis 


hydroxyproline-rich glycoprotein 


104 


41 


3433 


AF1 13685 


Homo sapiens 


PRO0974 


104 


72 


3434 


AB002306 


Homo sapiens 


KIAA0308 


282 


83 


3435 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


113 


83 


3436 


ABO 12223 


Canis familiaris 


ORF2 


108 


43 


3437 


AK024455 


Homo sapiens 


FLJ00047 protein 


100 


65 


3438 


U52077 


Homo sapiens 


mariner transposase 


381 


64 


3439 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


425 


84 


3440 


AK024455 


Homo sapiens 


FLJ00047 protein 


123 


63 


3441 


G02624 


Homo saniens 


Human secreted Drotein SEO ID NO* 
6705. 


90 


78 


3442 


Y36156 


Homo saniprK 


Human <5f*CTPted nrotein 


126 


71 

f X 


3443 


AF191032 


Myxine 
glutinosa 


RING3 


84 


63 


3444 


W48927 


Homo sapiens 


Schwannomin-binding protein C- 
terminal fragment 


141 


57 


3445 


Y08061 


Homo sapiens 


Human c-myb protein fragment 


88 


65 
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SEQ IB 

nut 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


3446 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


157 


67 


3447 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


281 


86 


3448 


AF1 12481 


Homo sapiens 


RAD54B nrotein 


392 


o / 


3449 


G00497 


Homo saoiens 


Human secreted nrotein SEO ID NO* 
4578. 


98 


j j 


3450 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


133 


90 


3451 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


418 


82 


3452 


AF130079 


Homo ^anien 1 ? 


PR02852 


1 10 


ou 


3453 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
ene No 1 07 

gvllV r»V/. 1 u f . 


143 


73 


3454 


AB032254 


Homo sapiens 


bromodomain adjacent to zinc linger 
domain 7 A 

\X\J ill Cull. jwX 


447 


85 


3455 


G02480 


Homo saoiens 


Human <5eereted nrotein SPO FT) NO* 
6561. 


134 




3456 


G03787 


Homo saoiens 


Human <secreteri nrotein SFO ID NO* 
7868. 


79 


6R 


3457 


D38112 


Homo saoiens 


cvtochrome c oxidase subunit 3 


545 


84 


3458 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


134 


83 


3459 


D38112 


Homo saoiens 


NADH dehydrogenase subunit 2 


277 


SO 


3460 


AK000867 


Homo saoiens 

A4Vll.lV *^V*A^lwJ.X4j 


unnamed nrotein nroHuot 

UlllIIUllMl 1/1 Viwlll ul UVllivl 


474 




3461 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
£ ene No 101 


123 


52 


3462 


AF041330 


Bodo saltans 


NADH dehvdroeenase subunit 5 


89 


30 


3463 


U93568 


Homo sapiens 


putative pi 50 


100 


46 


3464 


AL390114 


Leish mania 
major 


eYtremplv cvctptne/valini* rich nrotein 


1 Sfi 




3465 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


340 


88 


3466 


Y86248 


l±\JHl\J Jd^/lC/UJ 


SEQ ID NO: 163. 


1 10 
i iy 


inn 


3467 


BO 1372 


pinmn oanien<i 






7*^ 


3468 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO* 163 


103 


40 


3469 


G00357 


Homo <«inipn<» 


Human "Secreted nrotein SFO TD NO* 
4438. 


139 


Ou 


3470 


W54282 


Homo ^aniens 


Protein ^pniipnrj* nf tbp Hi-alnha 

l J VJlt/il J dvl|Uvlilsv Ul Ulw Ul aipjla 

haemoglobin eerie contained in nSS 1 


?07 
/ 




3471 


AF1 16661 


Homo sapiens 


PR01438 


119 


47 


3472 


G00416 


Homo saoiens 


Human secreted nrotein SEO ID NO* 
4497. 


129 


J7 


3473 


AF130089 


Homo sapiens 


PRO2550 


108 


86 


3474. 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


113 


80 


3475 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


590 


79 


3476 


AF 118086 


Homo sapiens 


PR01992 


126 


69 


3477 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


147 


47 


3478 


J01415 


Homo sapiens 


MTND4 


482 


78 


3479 


X03404 


Bos tauras 


alpha subunit (aa 1-394) 


583 


89 


3480 


D38112 


Homo sapiens 


cvtochrome c oxidase subunit 1 


514 


84 


3481 


G03628 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7709 


242 


80 


3482 


G00407 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4488. 


121 


61 


3483 


X83427 


Ornithorhynchus 
anatinus 


cytochrome c oxidase subunit 1 


508 


78 


3484 


Y07754 


Homo sapiens 


Human secreted protein fragment 


549 


93 
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SEQID 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








encoded from gene 1 1 . 






3485 


X97567 


Homo sapiens 


pori 


388 


69 


3486 


X97567 


Homo sapiens 


porl 


608 


84 


3487 


K02401 


Homo sapiens 


chorionic somatomammotropin 


641 


93 


3488 


AF010144 


Homo sapiens 


neuronal thread nrotein AD7c-NTP 


1 35 


*71 


3489 


S62941 


Homo sapiens 


Ps 2=basic nrnline-rich nroteinfPRRII 

uUJll/ yjl WILLI t> 111/11 piUlCUl^i XVJJllj 

nrecursor nrofein=ha^ie nmlinfi-rirfi 

proteins (Ps, PmF, PmS, and Pe) 
precursor) {C-terminal} 


1 16 


JO 


3490 


S74728 


Homo sapiens 


antiquitin=26g turgor protein homolog 


549 


84 


3491 


L13329 


Homo sapiens 


iduronate-2-sulfatase 


564 


85 


3492 


X79535 


Homo sapiens 


beta tubulin 


620 


oo 


3493 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


578 


87 
o / 


3494 


U09823 


Oryctolagus 
cunicuhis 


elongation factor 1 alpha 


631 


89 


3495 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


616 


90 


3496 


M12140 


Homo sapiens 


f*nvelonp nrafpin 






3497 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


123 


70 


3498 


W48352 


Homo sapiens 


Human hrea<rt" ranppr rplafprl nmti»m 

BCFLTL 


I V*T 




3499 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


134 


54 


3500 


Y36156 


Homo sapiens 


Human secreted Drotein #28 


143 


75 


3501 


AF1 13685 


Homo sapiens 


PRO0974 


77 


ol 


3502 


G00365 


Homo sapiens 


Human secreted Drotein SKO ID NO- 
4446. 


157 




3503 


AF090944 


Homo sapiens 


PRO0663 


164 


£1 

O 1 


3504 


AF090931 


Homo sapiens 


PRO0483 


103 


70 


3505 


U93570 


Homo sapiens 


p40 


95R 


HI 


3506 


AB016601 


Drosophila 
alpina 


cytochrome c oxidase suhuiiit T 


lUO 


oo 
oz 


3507 


AB016601 


Drosophila 
alpina 


cytochrome c flYida<»f* ciihnnit I 


1051 
IVJo 


ft7 


3508 


AF194537 


Homo sapiens 


NAG13 




/O 


3509 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMB032 


153 


60 


3510 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


102 


37 


3511 


AC008113 


Arabidopsis 
thaliana 


F12A21 10 


on 


/1ft 


3512 


M64791 


Rattus 
norvegicus 


salivarv nroline-rich nrntein 


104 


Jj 


3513 


AF090944 


Homo sapiens 


PRO0663 


115 


57 


3514 


AF240630 


Mus musculus 


IO motif containing GTPase activating 
protein 1 


152 


fn 
\jf 


3515 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


123 


48 


3516 


AF090942 


Homo sapiens 


PRO0657 


128 


47 


3517 


M10546 


Homo sapiens 


cytochrome oxidase I 




JO 


3518 


AF255661 


Crypthecodinium 
cohnii 


Dinap 1 -interacting protein 5; Dip5 


108 


36 


3519 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SE0IDNO163 


209 


93 


3520 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


102 


38 


3521 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


131 


64 


3522 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


105 


42 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 






brucei 


comnlete virion penome 






3523 


R95913 


Homo sapiens 


Neural thread protein. 


118 


72 


3524 


Y86248 


Homo sapiens 


Human secreted nrotein HCHPF68 
SEQ ID NO: 163. 


115 




3525 


L27428 


Homo sapiens 


reverse transcriptase 


162 


50 


3526 


Y02749 


Homo sapiens 


Human seerefprl nrntpin enroHed hv 

eene 100 clone HNFIU96. 


1 18 

x I O 




3527 


AF090930 


Homo sapiens 


PRO0478 


146 


uu 


3528 


G00528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4609. 


169 


50 


3529 


AF194537 


Homo sapiens 


NAG13 


119 


oo 


3530 


U93564 


Homo sapiens 


Dutative ol50 


914 




3531 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


208 


63 


3532 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


275 


92 


3533 


AF090931 


Homo saniens 


PRO0483 


19£ 

1ZO 


OO 


3534 


W05278 


Homo sapiens 


Tumour necrosis factor-related gene 


117 


85 


3535 


G03076 


Homo sapiens 


Human c/>rrpfp^H r»rnt*»in [H MO* 

7157. 


i /J) 


77 


3536 


U21123 


Drosophila 
melanogaster 


ena oolvneotide 


I 17 

I I / 


4S 


3537 


AF209061 


Eubranchipus sp. 


cvtochrome c rvxida<tf» T 




Rft 


3538 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


174 


85 


3539 


AF010144 


Homo sapiens 


neuronal thread nrntfMn Ari7r-TsITP 

11V*- LU Ullcll Ull vail tjl. UlClXl .rt-U / L It X X 


1 to 




3540 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 

on rnnrntpi n 

vllV* U \JX \J 1C 111. 


88 


50 


3541 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


128 


71 


3542 


U74612 


Homo sapiens 


henatfWivte mtrlAar fnrtnr-^/frkrlr h<*aH 

homolog 11A 


1 1 1 
111 


77 


3543 


G03714 


Homo sapiens 


Human <seeretf»H nrnt<*in ^PO TO "KID* 

iiuiiiau dwi^icu |Ji u id ll, Ouy XL/ 

7795. 




AO 


3544 


D38112 


Homo sapiens 


cvtochrome c rtYidacp Qiihiinit 1 


47ft 




3545 


G03172 


Homo saoierK 


TTuman <seeretf*rl nrntein SPO in MO* 

J.J.LU110U dWiWIVlCU yl VJldlX, OCy XLf i^tv/. 

7253. 




<\4 


3546 


AL359782 


Trvoanosoma 

A. A, J I/U41VJVI I HI 

brucei 


nrfihahle similar tn rina-h9 fincrpt* 

j/tuuauib oiuuiiai IV/ uiig, 1JL& llllgd 

nrotein rhala. 






3547 


D38113 


Pan troglodytes 


cvtochrome c oxidase siiniinif 1 


SK4. 


OJ 


3548 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


635 




3549 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


661 




3550 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


607 


86 


3551 


AK024455 


Homo sapiens 


FLJ00047 protein 


88 


51 


3552 


W88957 


Homo sapiens 


Polypeptide fragment encoded by gene 
128. 


500 


86 


3553 


D38113 


Pan troglodytes 


NADH dehydrogenase subunit 4 


228 


95 


3554 


AF130092 


Homo sapiens 


PRO2620 


165 




3555 


AL121845 


Homo sapiens 


dJ583P15 S 1 /novel nrotein fi«?offn-m 




01 


3556 


G03062 


Homo sapiens 


Human secreted Drotein SEO ID NO* 
7143. 


170 


79 


3557 


D38112 


Homo sapiens 


cvtochrome c oxidase subunit ^ 


500 


518 
oo 


3558 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


578 


84. 

0"T 


3559 


AK000385 


Homo sapiens 


unnamed protein product 


138 


47 


3560 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


112 


61 


3561 


X97675 


Homo sapiens 


plakophilin 2b 


123 


65 


3562 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


62 
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SEQ ID 


Accession No- 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Idcntit 








Score 


y 














3563 


Y86248 


Homo sapiens 


Human secreted protein nurtrroo, 


1 Oft 


OA 






ID NU.loi. 






3564 


X86029 


Vigna 


extensin-like protein 


141 


JO 






unguiculata 








3565 


G03714 


Homo sapiens 


Human secreted protein, otKl Nu. 


J 1 
















3566 


AP000616 


Oryza sativa 


snuilar to KING-nz tmger protein 


1 1A 

1 1U 


Q/f 
74 






RHAia QArU7oOo3j 






3567 


Y86248 


Homo sapiens 


Human secreted protein JtiUrtrroo, 




Ol 






cr?n in wn« i A'l 

oJZ>v^ 1U INC- ID J- 






3568 


Y86248 


Homo sapiens 


Human secreted protein HUHrroo, 




/Z 








CCA TT* "M/V 1 /LI 






3569 


S62928 


Homo sapiens 


PRB1M protein precursor 


lvZ 


AA 


3570 


D86853 


Catharanthus 


extensin 




AA 






roseus 


_ 






3571 


U93574 


Homo sapiens 


putative pi 50 


0*70 

Lly 


JO 


3572 


Y02785 


Homo sapiens 


Human secreted protein encoded by 




oZ 






gene 51 clone HUisJiXo.). 






3573 


U45964 


Herpesvirus 


LMP1 


1^/ 


dD 






papio 








3574 


Y08319 


Homo sapiens 


. : . , 

kinesin-2 


ZD'* 




3575 


X97675 


Homo sapiens 


plakophilin 2b 


1J4 


34 


3576 


AF003151 


Caenorhabditis 


contains similarity to an KJNA 


137 


43 






elegans 


recognition motif 






3577 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


oo 








4658. 






3578 


U80447 


Caenorhabditis 


coded for by C. elegans cDNA 


117 


50 






elegans 


ykl87f6.5; coded for by C. elegans 










cDNA ykl 87f63; coded for by C. 












1 _TW T A .,1,1 /IZTfl 1 c 

elegans cDNA ykl46tl 1 J5 






3579 


U23172 


Caenorhabditis 


similar to myoblast cell surface antigen 


120 


31 






elegans 


(SP:CS24_HUMAN, P23246) ana D. 










melanogaster No-on-transient A protein 












/nn> • run i Afy\ 






3580 


AJ249395 


Globodera 


NADH-ubiquinone oxidoreductase 




3D 






pallida 


subunit 4 






"t CO 1 

3581 


U091 lo 


Homo sapiens 


l/kt J , encoaes a *w kusl proaucc 


110 




DJOZ 


AV /O / J 


iiomo sapiens 


pidKUpillllIl £A3 


125 


73 


3583 


L26953 


Homo sapiens 


chromosomal protein 


1A1 
lui 




3584 


S80343 


Homo sapiens 


arginyl-tKlNA syntnetase, Argio 


1 1 A 
1 1U 


Of 


3585 


X97675 


Homo sapiens 


plakopnilin zd 


1 10 


OA 


3586 


X92485 


Plasmodium 


pval 


1Z4 


en 
jU 






vivax 








3587 


G03714 


Homo sapiens 


Human secreted proiein, ocxi lu issj. 


1ZO 


S6 

JU 








/ Jyj. 






TfQO 

ODOO 


UU3 1 IZ 


: 

Homo sapiens 


Human secreted proiem, oiivc *^ 


116 


56 








/ZD j. 






neon 

jjoy 


T TiCa</IO 

UojMZ 


Homo sapiens 


f Ar protein 


19R 
iz>o 


75 






Homo sapiens 


cytocnrome c oxitiase !>ucjuuil j 




oo 




DJol lz 


Homo sapiens 


iM/VL/ri aenyurogeiiabe auuuiui i 




87 
o / 


*5 COO 

3592 


K02401 


Homo sapiens 


chorionic somatomammotropin 




oO 


3593 


G00689 


Homo sapiens 


Human secreted protein, uj nu. 


IAS 
1U5 


Af\ 
4U 






4770. 






ICQ/I 


VUUOOZ 


. . 

Homo sapiens 


cyiocnrome oxiuaoc 1 




76 


3595 


AC008262 


Arabidopsis 


F4NZ10 


136 


31 






thaliana 








3596 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


52 








7871. 







223 



WO 01/64835 PCT/US01/04927 



jvn- 


Accession No. 


Species 


Description 


Smitb- 

Watermaa 

Score 


Identit 

y 


3597 


AF 11 6645 


Homo sapiens 


PRO 1708 


623 


91 


3598 


M2302& 


Human 
herpesvirus 4 


nuclear antigen precursor 


121 


35 


3599 


W88816 


Homo sapiens 


Polypeptide fragment encoded by gene 
58. 


100 


33 


3600 


X92485 


Plasmodium 
vivax 


pval 


156 


Jy 


3601 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


122 


78 


3602 


AK000496 


Homo sapiens 


unnamed protein product 


150 


68 


3603 


AF083929 


Mus musculus 


ES18 


98 


40 


3604 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


537 


83 


3605 


U12690 


Homo sapiens 


cytochrome oxidase subunit n 


548 


8<5 


3606 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO* 163 


158 


57 


3607 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded bv ffene 2 SEO ID NO250 


429 


75 


3608 


G03356 


Homo sapiens 


Human secreted orotein SEO ID NO* 
7437. 




OJ 


3609 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


oo 


3610 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


224 


80 


3611 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


481 


70 


3612 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


166 


Q9 


3613 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


582 


83 


3614 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


167 


91 


3615 


U 12690 


Homo sapiens 


fvtnphmmp nviHnc^ cuKitnit FI 

Vjr lASVslLl UAlUdoC otlUUllll 11 


JoZ 


/ / 


3616 


U23172 


Caenorhabditis 
elegans 


similar tr» mvnKlitct rt>\ 1 cnrfa^o ontmon 
OlUlLlcu l\J LUyUUldol veil bUTIaCC aUllgen 

(SP:CS24 JIUMAN, P23246) and D. 
melanogaster No-on-transient A protein 
(?TR:JH0\62) 


1 A1 


/1A 

4U 


3617 


D381I2 


Homo sapiens 


cvtochrome c <widfi<tt* •uiHiimf* 1 


jyi 


OJ 


3618 


AB037275 


Cynomolgus 
Epstein-Ban* 
Virus TsB-B6 


EBNA-1 


119 


42 


3619 


AF061944 


Homo sapiens 


kinase deficient protein KDP 


581 

-JO 1 


01 


3620 


Y34068 


Homo sapiens 


Histone HI isoform HI. 5 pANCA- 
reactive fragment (residues 69-226^ 


100 


33 


3621 


AL049608 


Arabidopsis 
thatiana 


extensin-like protein 


105 


27 


3622 


AP000616 


Oryza sativa 


similar to RTNO-H2 finder nmtein 

n*m*i wi w ivuiv< life iiiigwi uj v/ivlll 

RHAla(AF078683) 






3623 


S71569 


Neocallimastix 
patriciarum, 
Peptide, 860 aa 


Xylanase B, XYLB {EC 3.2.1.8} 


108 


43 


3624 


U23172 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24 JIUMAN, P23246) and D. 
melanogaster No-on-transient A protein 
(PIR:JH0162) 


101 


T7 


3625 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


390 


83 


3626 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


582 


84 


3627 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 

w J WJ will UiUW V/ vAlUUOw JUuUlill >J 


637 

V J f 


90 


3628 


AJ242540 


Volvox carteri f. 
nagariensis 


hvdroxvnroline-rich plvconrotein XY7 ~ 
HRGP 


I W7 


30 

J7 


3629 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


142 


77 


3630 


U93564 


Homo sapiens 


putative pi 50 


330 


90 


3631 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


148 


42 



224 



WO 01/64835 PCTYUS01/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Ideatit 










Score 


y 


JOJZ 


vjUzjjZ 


— — — . 

Homo sapiens 


Unman ctwrofarl nmfoin QCA 1T\ WH* 

riuman secreted protein, ocy ij-j inu. 


1 AA 


fin 








661 1 






l£ll 


VJvZJOU 


XIOIHO SdpJcIlS 


Unman c*»/»r*»i"*»rl r\t*rvtAin QPO TTl ^tff"V 

xi union secreted pi uicui, ox>v< xnvj. 


1J7 


OJ 








6441 






1£1A 
JOJ*f 


TT7AA71 


iTxaSCOlOSOm o 


lyioinronie c oxidase suuuuji i 


94.R 


79 
/Z 






cn 'Palifni-mfl 1 
Sp. v^aJltumia 








1A1^ 
jojj 


nAAi/ci 


xiomo sapiens 


Human secretea prorein, onv^ iu vtu. 


191 
XZi 


fi7 
5 / 








AAAO 






JO JO 


xv.1 jjjO 


xiomo sapiens 


Jrroiem encoded aownsir earn 01 miu_ivi 


19R 


OJ 








uueoproiein. 






1a"17 




TiUlIlU bd.pi Clio 


Unman cf>rr*»tf»H nrrvtAin PnfftfiPff Kv 

xiunian scureicti pi uicm ciiuuucu uy 


194 


6^ 

VI _J 








gvllC X>Z. 1/1UUC X1J.VXOJ VV 1 O. 








YAR069 


liUIllU Sapiens 


ri mil an lswjahj piuicin uaguicui 


1 J7 


51 
^ i 








uCllVCU JUL Will L/linJJUJO. 






1610 


AOjJoZ 


ivius m us cuius 


aipna-z coiiagen 


1 A9 
IK) c. 


JJ 




A FAQA049 

AT vy uynz 


xiomo sapiens 


r IvVJUOD / 


07 


/o 


JO*t I 


W ooOZ / 


xiomo sapiens 


oecreiea proiem encoueu oy gene y** 


1HO | 


07 








cione xiirivLDV<*)z. 






1/C49 


UUZ J Jo 


— : 

Homo sapiens 


Human secreted protein, SEQ ID NO; 


1 1R 


JJ 








00 17. 






lA/i'l 
J\rtJ 


Y1 4^7/C 
/O 


Murine leuKemia 


gagpl5 protein 


114 
1 J^f 


44 






virile 
V JU lib 










AFiion5i 


nomu SapieUS 


PRO0R0R 
* s\\j\Joyo 


177 


4R 


l/vd/\ 

JUtU 


VJUJUOf 


numu sapiens 


Unman cfv*n>tAH nrntpin WO TD'MO* 


190. 


50 








7145 
/ i*tj« 






1647 


a vtiaair< 
/vxvuuu j oj 


FLOmO Sapiens 


u nri allien proiem prouuuL 


1 fK 
1UJ 


40 

*fU 


164R 


VA0669 


nomo sapiens 


cytownxnue OAiuase i 




Rl 

OJ 


164Q 


AF090944 


jnujuiti oaiJiciio 




129 


73 


165fi 




numo sapiens 


c1\.\-/VOOj 


190 


/J 


16^1 

jOj 1 


W4R1S1 


nomo sapiens 


xiumau urease cancer reiaxea protein 


RR 


60 
ou 








BCRB2. 






1659 


AFOQOQIA 


rTuinu Sapiens 




Q9 


57 
j / 


jojj 


UUil /Z 


Homo sapiens 


riuman secreted protein, bJbv^ iu inu. 


145 


/J 








79^^ 










ricninococcus 


rs/vuri aenyarogenase suounic z 


1 AA 


19 
JZ 






muiiuocu ions 






JO J J 


UU1Z40 


Homo sapiens 


Jriuman secreiea proiem, ocv^ tL' jno. 


10 J 


05 








^^97 

JJZ/. 






1656 


AF161156 


XlAJiUU oOLIlCJLIo 






4R 


1657 


A FftQAQin 


MAtnA com AtlO 

nomo sapiens 


ilvUu't / 0 


1 SR 

1 JO 


R5 i 
OJ 


165 X 

JUJ o 


f!01ft41 


nuiuu sapiens 


xiumau secreieu proiem, ocy xxy iiu. 


19R 


R9 








7194 






3659 


TTQ1574 

U7JJ / *♦ 


nuuiu aOLJlCllo 


rmtativp nl SA 
UUlaUVC piJu 


117 


44 


166n 
JOOU 


I UZD / I 


xiomo sapiens 


riuman secreted proiem encooea oy 


140 

i*fy 


CI 

jl 








ffene 27 clone HMSJW18 






1661 

JOO 1 


VA9671 
I UZO / 1 


xiomo sapiens 


xiuman secreiea proiem encoaea oy 


/;7 

O/ 


7A 








a<»np 99 rlnnp HM^TWI R 

gCJIC /,. XXIVlvXP VV 1 O. 






1669 


VR694R 


X-4 /*\rvi s\ Romano 

xiomo sapiens 


riuman secreted proiem xioxixroo, 


yy 


A9 

oz 








OXJrVJJ XL/ INvJ 1 . 1 OJ. 






16£1 
jOOj 


AT AC1 A17 


_ 

Neurospora 


-olrtioJ +/v f n CK>f ATT "MT in PAP 

related to U 1 oJVx/VLfLr r^uuJL»xiAxv 


1 <4 
1 J4 


4j 






1/IaSad 


XvLDv/lN Uv^LDUr Xvw 1 XillN 






joo4 


YZ/j/1 


Homo sapiens 


Human secreted protein encoded by 


17/; 
lio 


/ / 








gene no« j. 






166^ 


W4R1*\1 
WHOjJ 1 


xiomo sapiens 


xiuman oreasi cancer reiatea proiem 


19R 
IZo 


<s0 

jy 








BCRB2 






3666 


U47855 


Araneus 


fibroin-3 


109 


39 






diadematus 








3667 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


604 


88 


3668 


K02401 


Homo sapiens 


chorionic somatomanimotropin 


636 


93 



225 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


3669 


AF083929 


Miis muscultiii 


ESI 8 




4t> 


3670 


J01415 


Homo sapiens 


MTND4 




S7 


367J 


AK025047 


Homo sapiens 


unnamed protein product 


160 


60 


3672 


G03793 


Homo saniprK 


Human CAfrptoH nmtpin CRH TT\ XI 
i lLuiiaJi bcuicicu protein, ocy lu IN w. 

7874. 


1UO 




3673 


AB019038 


Homo sapiens 


beta- 1, 4 mannosyltransferase 


766 


94 


3674 


U74612 


Homo sapiens 


hepatocyte nuclear factor-3/forkhead 

Vir\mr\1 r\cr 1 1 A 

nomoiog nj\ 


113 


69 


3675 


U80447 


f a pn txrYi si KH it ic 


coaea ior oy c eiegans cdjna 
JK.10/IO.J, coaea ror oy eiegans 
cDNA ykl 87f6.3; coded for by C. 

wGgaila yK.l*fOIi 1 -J 


ill 
1 IJ 


39 


3676 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


197 


68 


3677 


U52077 




manner uansposase 


f AA 


74 


3678 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
oca in Mri'i^i 


174 


97 


3679 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


191 


86 


3680 


AC003113 


Arabidopsis 

thai icma 


F2401.6 


102 


83 - 


3681 


AF005370 


Alcelaphine 

lipmpcvimc 1 
ilCipoSVUUb X 


putative immediate early protein 


153 


42 


3682 


G01502 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

JJOJ. 


111 


62 


3683 


AP002543 


/U aUIUUpoib 

thaliana 


*>KI A A r>9^0l *T 1 rtana i#4- C 1 C\if*7 1^ 

go|/\AjLizji/ 1 j. i~gene_ia.r I 5M7. 16~si 
milar to unknown protein 


131 


A A 

40 


3684 


G03790 


IJAmrt CflniAnc 
nuulU 2>apicnb 


riuman secreted protein, ocv£ it/ NU. 
7871. 


1 AC 

146 


64 


3685 


U93572 


jiuino sapiens 


puiauve pi5U 


t CO 

158 


43 


3686 


Y3406K 


numu sapiens 


xiistone Hi isotorm HI. 5 pANCA- 
reactive iragmeni \resiaues oy-xZO^. 


98 


43 


3687 


W80406 


Hnmn ^aniPTiQ 


/\. bcucLcu pro te hi encoaca oy cione 
dh40 3. 


1 A/C 

1U0 


54 


3688 


L26953 


llvlUV/ DClL/ivllo 


V/iiiuiiivJjUHla.1 uiuiem 




/O 


3689 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


563 


87 


3690 


R96R00 


iromo bap lens 


riuman nisuocyie-secretea iactor JriM\ 


1 Aft 

109 


67 


3691 


Y86248 


T-T<VY\/\ conionc 

numu ixtpieiib 


Unman CA^rata/l n«>/\fni« Y T/"^T TTIT?/* O 

riuman secreiea protein riUrirroo, 
SEQIDNO:163. 


1 AA 

122 


75 


3692 


G03 172 


Wrimr\ conipnc 
liuuiu oopicub 


ri union secreiea proiein, oily IJL/ ML). 


1 cc 


or 


3693 


M20789 


Horn o Qanipn^ 


aipixa** i type i euiiagen 


1 A 1 

141 


43 


3694 


Y86248 


Homo ^anipn^ 


Unman cpcrpfprl nrnfpin 

nuiiicui ocuieicu pruicui nv^.rxr roo, 
SEO ID NO- 163 


1 AO 
lOZ 


50 


3695 


AF2 10651 


Homo saDiens 


NAG18 


194. 


Of 


3696 


AF130079 


Homo sapiens 


PR02852 


94 


53 


3697 


Y19192 


Talna euronaea 


cvtochrome ovi Ha^p <5nhiiTiit I 


/lift 


55 


3698 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


414 


86 


3699 


D38112 


Homo saniens 


cvtochrome p oYiHa<;p ^iiriiinit 1 


ash 


/0 j 


3700 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

vJJO. 


109 


42 


3701 


AF2 17374 


piCUllrfl 


cyLuciuouie oxidase suuunii 1 




or 

o5 


3702 


AF025467 


Caenorhabditis 

p1p<rnn« 


contains similarity to drosophila DNA- 
uinomg proiem iviu ^iNJJLr.goi*ioj 


106 


48 


3703 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


522 


79 


3704 


D67066 


B03 taurus 


N-WASP 


219 


42 


3705 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


100 


51 


3706 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


44 



226 



WO 01/64835 PCT/US01/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 


Identit 










Score 


y 








7R70 
/ 0 ty. 






3707 


Z70684 


C* a pn nrharMi iti<5 




ins 


17 






p] f»fr*jnC 
ClCgdllO 








3708 


G03790 


1 IvvlllV/ oapivlu) 


nun i dii secreicQ piutcui, ocy llj n\j. 


lxj 


7A 








7871. 






3709 


D381 12 

l/JOl IX 


F-Fnrnn C5*r»ipn<* 
xxviixiu ixi^/it-iio 


\*y luuiii 0111 c c OAiuase miuuiiii x 




oo 


3710 


AF2 17374 

^VX XI t «/ # *T 


A cantha«ri"pr 


L-yiuuuuiuc ti/viuaoc; duuuuii 1 


S19 

J IX 








plane i 








371 1 
j / 1 1 


TjIR 1 1 9 


Haitia cantpnc 
XI kJimj ^apiCllo 


lvrvLJii Qcnyurogenabe suduiiii z 


1 


oU 


3711 


AF1 97854 


N^pl ith rpr\n ic 
1 VAC! lull cpiUo 


vyiociuome oxidase x 


1 ftO 


RI 






lnnntiic 
luiiotuo 








3713 


AP000616 


Orvza saliva 


oinjiiar 10 i\Ji xiz linger pruicm 


190 
lx> 










RHAla ^AF078683^ 






3714 


AB029027 

^\xjvx.71/x t 


HAmA caniAnc 
nuinu oapidio 


KJAA 1 104 nmfpin 
xvl/lr\l IVt pi U LCI 11 


141 
iti 


ri 


3715 


T 269S1 


1 i ypaiiusuiiid. 


V^lVJ 


71 


11 
Ji 






brucei 








1716 

J / 1\J 


T?Q-\011 

ssyjy to 


rruiiiu sapiens 


lNeuToi uircaa piuiein. 


11/ 




1717 
3 / 1 / 


1100107 


riomo Sapiens 


numan secreiea proiein, otiv^ lu inw. 


0*s 
yj 


0/ 








*fH /<>. 






3718 


V161 19 

X JDl IX 


xiomo Sapiens 


CiXienuea numan secreteu protein 


ins 

lUo 


si 








cpnupnrp QCH TPl "MO 407 






371Q 
d i vy 


R070S7 


noino sapiens 


omajier nepauieeiiuiar oucoproiem 


119 

1 JX 


3D 








^iintni ) gene picuuucu 






3720 


A/T64991 


Rnc tannic 


i v pr oieuj 


1Q4 




3721 

3 /X. 1 


Y09671 

I V/XU / 1 


liOiiiu sapiens 


LXiimcm tA^'tffa/i nrAfom e*nr*r\rl^n www 

xiuin au secreicti proiein encoucu oy 


1 1 ^ 
1 ID 


7^ 








gvllC XX tlUliC XUVJLOJ w 1 0. 






3727 


G03041 


nviuu oapicns 


ri uiii aii secrcica protein, oui^ iu i> w. 


1 1 R 
l lo 


0^ 








7194 






3723 


AF302773 


xxvuivs dauiviid 


n in Pin— T m iQA'fArm 
U ill V 111 JUlll louiuiuu 


1S7 


S7 

3/ 


3724 


AK024455 


Hnmrt <janipn«i 


FT J00047 Timtefn 


134 


59 


3725 


Y02671 


xxuuiu oapidio 


T-Ti 1 m n n cp/*rpt"pH rrnfpin pnr*Arlprl hv 
11. Ill 11 all dCCICLCU piVJlvlll wllwUllwU Ujr 


161 

1U1 


Rn 








eene 22 clone HMSJW18 






3726 


Y12713 

A lib f J J 


K4tiQ miiQr>iiliiQ 

JLYlUo lilUoLrLIlUd 


r 1 yj^L ui uu x raoc puijpiuiciii 


191 

1X1 


S9 

3X 


3727 


AF220264 


XXUllll/ oOpiGIl^ 


MO^T.1 

iTlvy O l • t 


86 
ou 


RR 
oo 


-5 / XO 


T 99a9R 
UX/*+x5 


xiomo sapiens 


reverse transcriptase 


JlO 


Ox 


3729 


T 183303 


Haiti a canipnc 


llHC-l 1CV CISC 11 allot,! ipidckC 


1 19 
1 lx 




T7in 


WAJtt C 1 

WtOJJ I 


Homo Sapiens 


Human breast cancer related protein 


J4j 


oo 














3731 


W4R1S1 


HnmA canipnc 
X1UIUU bapicllb 


oiuiiau ureasi cancer reidieu proieiu 


1 1R 
l lo 


30 














3732 


G03043 


Haiti a cartipnc 
X1.U111U aapiGiio 


U nm9n cpprptprl «mtpJn WO rn MO* 

xxuiiiaii secreicu pioieiu, ODy ix> in w. 


111 


03 








7124 






3733 


AF090930 


Homo saniens 


PRO0478 


148 


71 


3734 


Y02671 


ffnmft **ariipnQ 
xiviiiu oapituo 


Wiimnn cprrptprl nrAtpin pnnorlpH liv 
xxuiuaii jwuwicu piwiciii cixuv/u&u ujr 


140 

ltu 


79 

/ X 








eene 22 clone HMSJW1 8 






3735 


U93569 


Homo sariiens 


nntative tj! SO 


270 


59 


3736 


U93568 


Homo ^ariiprm 


TfMitnrivp nl SO 

UUlaUVC I/1JU 


1S1 


1R 

JO 


3737 


fr09491 


rTfilYlA CitniPnc 
xxVlxllS oapicilo 


lxuiiiaii seuicicu pi uicm, oc\i ii> inw. 


194 


R9 
ox 








6574 






3738 


X92485 


Pla^mnHiiim 


nva 1 
p yet i 


101 


44 






V1VOA 








3739 

•j 1 jy 


(,02538 


Hatha 'Janfpn'v 
x 11/11 ivj oapi&iio 


Human cprmfpfl nrnfpin *\FO IFl WD- 
XXUluaD ibCClClCLl pi UlClil, OX-»V^ 1U IN W« 


146 


3 1 








6619 

W17. 






3740 


AF090944 


l LVJlllKJ oajjiviio 




194 

lx*t 


30 


3741 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


94 


47 








6613. 






3742 


AF1 18086 


Homo sapiens 


PR01992 


124 


61 


3743 


AF083929 


Mus musculus 


ES18 


108 


35 


3744 


L34685 


Arabidopsis 


cell wall protein 


120 


34 



227 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






th a liana 








3745 


AF090931 


Homo sapiens 


PRO0483 


117 


54 


3746 


AF090930 


tf/Arn o cnntAtK! 

rXLIlllLF OCUJlCllD 




1 1£L 
1 JO 


15 


3747 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


59 


3748 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AAAQ 


92 


44 


3749 


AP000616 


Oiyza sativa 


similar to RING-H2 finger protein 

JKJrlAia ^ArU/oOoJ ) 


98 


42 


3750 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene n clone hmoj wio. 


165 


73 




/Vr l/VvOIvJ 


v/ryza saliva 


sun iiar to KJNCj-rLc linger protein ■ 
RHAla(AF078683) 


111 

1 13 


87 




Uv/J / Or 


Homo sapiens 


Human secreted protein, SbQ ID NO: 
/oOo. 


96 


CO 

58 


3753 


U93574 


Homo sapiens 


putative pi 50 


157 


60 




AJOvl J 


l_*J lLDlUla 

uncopeici 


NADH dehydrogenase subunit 5 


103 


30 


3755 


M80613 


Homo sapiens 


putative 


114 


26 


J /JO 




riomo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


119 


54 


J / j / 


UjoI lZ 


Homo sapiens 


N AJDH dehydrogenase subunit 2 


210 


89 


3758 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/o / 1. 


91 


58 


3759 


AF025467 


Caenorhabditis 
eiegans 


contains similarity to drosophila DNA- 
binding protein Riu (jNiL>:gol48) 


110 


46 


3760 


M13100 


Rattus 
norvegicus 


unknown protein 


119 


46 


J /Oi 




Caenorhabditis 
ciegdui> 


contains similarity to drosophila DNA- 
Dinaing protein jviu ^iNiiJ:go 145) 


135 


52 


3762 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 

sSX\J\ 1 a \ J\r U / oOoo ) 


132 


84 


3763 


Ml 1900 


Mus musculus 


15-kDa proline-rich salivary protein 


125 


40 






Homo sapiens 


fAr protein 


131 


72 


3765 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AAQ1 

449 J. 


143 


76 . 


3766 


X92485 


Plasmodium 
vivax 


pval 


119 


54 


3767 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein i^uin-z. 


81 


56 


3768 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/4VZ. 


98 


65 


3769 


AB005216 


Homo sapiens 


Nek, Ash and phospholipase C gamma- 
Dinaing protein ri Ar4 


487 


90 


3770 


AF026211 


Caenorhabditis 
eiegans 


Similar to cuticular collagen 


102 


37 


3771 


WHO J J 1 


Wnmn conipnc 
XIUIIIU bdpiCIlb 


nuroan oreasi cancer reiatea protem 
RPR TV? 


91 


53 


3772 






niuiid.li scli cicu proiein, ocy iu in w. 

6566. 


IZo 


/O 


3773 






"FT Iflfl ft/17 rwntein 

ri^juui/ 4 */ protein 


lift 


DO 


3774 


AF090895 


Homo sapiens 


PRO0117 


152 


60 


3775 


rvT iJlM/J 1 


nonio oapiCuo 


rl\.V_/U070 


ill 


78 


3776 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


100 


63 


3777 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


187 


75 


3778 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


130 


56 



228 
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SEQID 


Accession No. 


Species 


Description 


SmittV 


% 


NO: 








Watermao 


Identit 










Score 


y 


3779 




T \z/*f\rw»rcir*An 
l_»jr UOpCJ 3 1 vUil 


CALCilMIl ^LldSS 1) 




DO 






£SC1]1 PTltl JTtl 

MjwUlvii tUJ u 








3780 


L2742R 


nomo sapiens 


iOVCl oc uaiisuripcase 


1 1 A 
1 io 




3781 


Y71066 


nuiiio Sapiens 


nuiiiau lucinDranc uauspoii piuicm, 


155 


oO 








lYi 1 IvJT- 11. 








AT 1QA1 la 


: : 

Leishniania 


extremely cysteine/valine ncn proiein 




Of 






major 










T I4QQ71 


fiumo sapiens 


v/ivr i , xvlckj / , puiauve uausposase 


AA9 


/J 








cirri ilsir tt\ nnon plpmpnt 






3784 


G00333 


Jl will v Sapiens 


Human cpr-rpfpH nrntpin QPO TO "NJPk- 


197 


A7 
Of 








4414. 






3785 


Y67470 


numo Sapiens 


iNp/u proiein caTOOAy iciiuuiai region. 


1 90 

izy 


30 


3786 


AF010I44 


T-J/vmrv outvptic 
nuiiiu oapicu:> 


npiiroiml tfirpaH Tirrvfpm Ari7r>«MTTP 
11CU1 KJlLa.1 UU CaU Ul UlC-lil J-\±J / C _ iN 1 JT 


I ^9 
1 JZ 


7R 
/o 


3787 


OOv/l 17 


JxallUS Sp. 


reverse uanscnpiase nomoiog 


lill 


A(\ 


3788 


AF090949 


liUUlU oapiCllo 


PRO0fiS7 
riw^uo j / 


I j*f 


^7 
0/ 


3789 


H017 1 4 


nomo sapiens 


Unman ci»/>r»4A^ nrnt^in Q"Cf~\ f r\ XI/^\. 

riuman secreiea proiein, on\i id 


1 oc 


OD 








770^ 






3790 


API 1£££1 


U/>tv»/"\ com one 

nomo sapiens 


ppniAIR 

JTlvWl*t^O 


19Q 
lZo 




T791 




riomo Sapiens 


salivary proiine-ncn proiein precursor 


1 1A 


35 


-3709 
J /yz 




Homo sapiens 


Human secreted protein, bbv MU: 


IZ/ 


74 














3793 


WJVl7A 


_ - 

Homo sapiens 


Amino acid sequence of salivary 


1 AI 










protein ouin - 1 . 






17Q4 
o /y*t 


/\r vuuzy o 


L/aenOrtlaDOJuS 


weak similarity to collagens; glycine- 


175 


45 






p? poanc 

eicgaiis 


anu proiine-ncn 






37Q5 


K01905 


nomo Sapiens 


salivary proiine-ncn protein precursor 


1 IT 
I I / 


A A 


**70fi 
o iy%j 


I OOZ*rO 


riorno Sapiens 


Unman rofrolo^ v\m4-ain T Tf~*X TT»T?/Z O 

xiuman secreteo pro tern nOxirrOo, 


Ioj 


00 
OO 














3797 
■j fy i 


tori 19 

L7J O 1 1 Z 


xiorno sapiens 


cyiocnrome c oxiuase suDunrt j 


Al A 


OA 






nomo sapiens 


xiuman secreteo proiein, oni^ iu v*\ji 


i </c 
IDO 










rr in 
00 iy. 






3799 

^ / y^ 


AT 1SQ7R9 


1 rypanosoilla • 


prooaoie sunuar to nng-nz ringer 


71 
/ 1 


DZ 






nmr>oi 

urucei 


protein rnaia. 






3800 


O040Q1 


nomo sapiens 


Unman <•» «■> <4 nrntain Q C Tffc "M/™\» 

xiuman secreiea protein, ocv^ iu jnvj. 


1Z/ 


/ / 














^ROI 


A pi *JAA7Q 


riomo sapiens 


DDA08<O 


104 


<A 

34 




AT 1^0789 

i\Juojy/5z 


Trypanosoma 


probable similar to ring-h2 finger 


1 1 o 

118 


45 






orucei 


protein rhala. 






1R01 


U4ZD5U 


Paramecium 


rro-ncn, rAFK {^ZUAJ; similar to 


1 A 1 
101 


io 






bursaria 


Arabidopsis anter-specific Pro-rich 










^nioreiia virus 1 


protem, corresponus to owiss-rrot 












A /«/*AO0«nM 1kTi»mV\a'i" DA'A/CAO 

/vccessiou iNumoer r*tuouz 






3804 


Y86248 


numty sapiens 


tinman cpTTPt-pH nrntoin Uf^TJf PP/a 52 

nuuidn secreieu piuieui n^rLrroo, 


1 fj 


ft7 
of 








SKO ID NO- 163 






3805 


G03172 


Homo saniprK 


Hiimfln Qftpn*lif*HI ArAfpin 5sT*!0 IT1 INTO* 


1X1/ 


79 








7953 






3806 


D38112 


Horn a <U)nipn<i 




995 


R5 

OJ 


3807 


U88587 


TvJir^rvtinnn fllntn 
iiiwuauci ciiaio 


190 l^*T*la cK/Ip orK//»nrkmt , piri 


1 1R 
1 lo 


JO 


3808 


UJO] IX 


fiuuiii sapiens 


cyiov^uronie c oxiuase suoumc i 


190 

izy 


519 
5Z 


3809 

jo\ty 




xiomo sapiens 


xiuman secreted protein, oni^ inu. 


1ZZ 


O/ 








7194 






3810 


Y8694R 


nuiiiu sapiens 


Unman cpprpfpH nrntpin MPUPPAft 

xiuuian ocercu?u proiein nv^rxrroo, 


911 
Z1J 


07 

y / 








OIjV< 11-/ 1> w. 1UJ. 






3811 


G03041 


T-TAmrk isnipnc 


PTnm^n oprrpfr»H nmtpm QPO rn "KJO- 

Xl Ul 11 all oCUl GLk/Ll piULClllj \JLj\£ 1 L7 IN W. 


1UO 


OJ 








7124. 






3812 


AF1 69974 


Homo sapiens 


seme racemase 


153 


56 


3813 


L27428 


Homo sapiens 


reverse transcriptase 


175 


43 


3814 


W50193 


Homo sapiens 


Amino acid sequence of salivary 


109 


43 








protein CON-2. 
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PCT/USO 1/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


3815 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


176 


78 


3816 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


112 


65 


3817 


AF1 19901 


Homo sapiens 


PR02831 


113 


55 


3818 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


99 


89 


3819 


AL1 60371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


91 


66 


3820 


X92485 


Plasmodium 
vivax 


pval 


108 


46 


3821 


AK024455 


Homo sapiens 


FLJ00047 protein 


79 


59 


3822 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


178 


87 


3823 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


99 


50 


3824 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1037. 


86 


94 


3825 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


138 


53 


3826 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


88 


3827 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


77 


61 


3828 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


139 


51 


3829 


K03208 


Homo sapiens 


salivary proline-rich protein precursor 


161 


41 


3830 


U44838 


Glycine max 


extensin 


108 


ATI 


3831 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


136 


71 


3832 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


205 


84 


3833 


AL163912 


Arabidopsis 
thaliana 


glycine-rich protein atGRP-7 


117 


■J 9 


3834 


AF130089 


Homo sapiens 


PRO2550 


138 


45 


3835 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


92 


76 


3836 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


70 


3837 


X56010 


Sorghum bicolor 


hydroxyproline-rich glycoprotein 


98 


37 


3838 


X68249 


Xenopus laevis 


Proline rich protein 


92 


66 


3839 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


262 


92 


3840 


R63235 . 


Homo sapiens 


CNS neural thread protein HB4. 


186 


100 


3841 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


103 


64 


3842 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


116 


62 


3843 


AF164614 


Homo sapiens 


envelope protein 


508 


77 


3844 


AF130051 


Homo sapiens 


PRO0898 


130 


73 


3845 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


144 


74 


3846 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


119 


68 


3847 


M13100 


Rattus 
norvegicus 


unknown protein 


129 


46 


3848 


X97675 


Homo sapiens 


plakophilin 2b 


145 


67 


3849 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


141 


65 


3850 


K03206 


Homo sapiens 


salivary proline-rich protein precursor | 114 


42 
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SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


JOJ i 


U7JJ / *t 


T-I r>m n c^nipnQ 


mitoht/0 r\t ^(t 

puutuve pi ju 


I OJ 


oz 


3852 


AF090931 


Homo sapiens 


PRO0483 


97 


78 


JOJJ 




nomo Sapiens 


\Jtvr i , MJcrJKJ /, putative uansposase 
similar to pogo element 


770 

z/y 


CA 
oil 


JOJ** 




U/vm e\ cant An c 

nomo Sapiens 


putative pi jv 


11^ 
i 


35 


3855 


M13100 


Rattus 
nor vcgicus 


unknown protein 


100 


57 


3856 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/Z03. 


166 


63 


3857 


Y48576 


Homo sapiens 


Human breast tumour-associated 
protein 3 /, 


117 


49 


3858 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/ooo. 


147 


45 


3859 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

77^3 

/Z33. 


150 


71 


3860 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7A7^ 
ill I J. 


165 


69 


3501 




~— : 

Homo sapiens 


Unman na/tM^ ncr>fni« QPA TT\ XT /A. 

riuman secreted protein, ocvj u-* rs\j. 
7124. 


111 

131 


o^ 




VA7A71 
I UZO / I 


Homo sapiens 


Human secreted protein encoded by 
gene zz cione xiivioJ w 10. 


1 11 


ol 


3863 


AK000241 


Homo sapiens 


unnamed protein product 


100 


48 


3504 


I jOI 1Z 


Homo sapiens 


Extended human secreted protein 

contionAD CPA TA "MA AQ7 

sequence, ocv^ invj. 4y /. 


1 O 1 

1Z1 


OA 

oU 


3865 


AK024455 


Homo sapiens 


FLJ00047 protein 


124 


69 


JOOD 


VJUZ53Z 


xiomo sapiens 


riuman secreieu protein, ocy h-j in la 
6913. 


87 
of 


CA 


3JIA7 
jOO / 


AP1 1 A717 
AT 1 10/ 1Z 


nomo sapiens 


DT> A773C 


yz 


4o 


3868 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprote in . 


165 


62 


3©oy 


/^AAyM/: 
VJUU410 


rtomo sapiens 


riuman secreieu protein, ocvj H-J inva 
4497. 


IOC 

lZo 


OZ 


Jo /v 


YZ/50& 


riomo sapiens 


Human secreted protein encoded by 
gene No. 107. 


QA 




3o 1 1 


/\JVUZ443j 


Homo sapiens 


T7T TAAA/fT nrnfain 

r JL»Juuu4/ protem 




•70 

IZ 


jo /Z 


/\JZ4Z34U 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


14/ 


AA 


35T73 
JO / 3 


r; ao 70 A 
VJU3 /JrU 


nomo sapiens 


tTnmon ca/«ra^a^ nrA^atn CCA IA MA- 

xiuman secreieu protem, ocv^ iu inv/. 

7R71 


1 1 1 


CI 


JO /*+ 




nomo sapiens 


ppAAA7« 


170 
1ZU 


03 


3875 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

77<K 


145 


70 


3876 


D88674 


Homo sapiens 


antizyme inhibitor 


147 


75 


3877 


AF7/>3744 
rlTZOJ / *r4 


xzomo sapiens 


cid oz - in leracnng proiein cxvoiin 


7 17 


37 


3878 


Y36112 


Homo sapiens 


Extended human secreted protein 

cpnitf»rifv» CCA TT> MA 407 

seuuence, onv^ jll/ inva *r7 f « 


140 


82 


JO / 7 


UUujJ't 


nouiu sapiens 


PTuman cprrptorl nrnfpin CCA TA XT A' 

numdu bcoicieu pruiein, oca^ hj invj. 
4435. 


oZ 


3J 


388n 


I Ul lOl 


nomo sapiens 


roiypepuue rxagmeni encoueu oy gene 
12. 


144 
144 


ou 


JOO I 


ACA1 AI/t/1 
rVTl/1 vl*r4 


numo Sapiens 


riAiimna) f^Viroo/1 nrntAtn A 1^17/* Xi'i'u 

neurunai uucaa protein /iiv / c-in i r 


1 77 
1ZZ 


ou 


3882 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4035- 


134 


75 


JOOJ 


A/fOI-301 
XViOl JZI 


lviacaca 
fascicularis 


pro line-rich protein 


1 A1 


A\ 


3884 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


109 


56 


3885 


AF013214 


Bos taurus 


acidic ribosomal phosphoprotein PO 


177 


94 • 


3886 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


77 
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SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Ideotit 

y 








4484. 






3887 


Y59860 


Homo sapiens 


Human nonnal uterus tissue derived 
protein 23. 


137 


72 


3888 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded bv ffenp 1 7 


147 


72 


3889 


M64792 


Rattus t, 
norvegicus 


salivary proline-rich protein 


104 


40 


3890 


AF156228 


Drosophila 
melanogaster 


salivary gland secretion protein 


104 


35 


3891 


Y01158 


Homo saoiens 


Secreted nrofpin pnmripH Hv optip 1 ft 

clone HCACJ81. 


on 


A1 


3892 


Y01400 


Homo saDiens 


Secreted nrotpin enrorlprf hv apnp 1 % 
clone HNHF029 


too 


7R 

JO 


3893 


D38112 


Homo sapiens 


"NADH dehydrogenase Qiihnnit S 




£S 

OJ 


3894 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


34 


3895 


X02873 


Daucus carota 


put. precursor 


104 


48 


3896 


AF090942 


Homo <ifinien<; 




1 7*5 




3897 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


195 


77 


3898 


X92485 


.Plasmodium 
vivax 


pval 


111 


62 


3899 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571 


107 


70 


3900 


AL 160493 


T ei^h mania 

UvMUUl GUI J CI 

major 


•nmViflhlA ^Kh\/.-/£\ till AO von ant o 

pruuauic ^lliiy-o j ui luz, varicuii a 
f)NA cnmnletp virion opnnmp 

i-^i I^Tl, YUJlyJl gullibility 


OA 


Ol 


3901 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


163 


83 


3902 


AF090942 


Homo sapiens 


PRO0657 


134 


65 


3903 


AF220264 


Homo <i?inipTi<5 


K/ro<*T-i 


*1A 




3904 


G04072 


Homo QanienQ 


Human cpprptpH nrntpin QPO m Wn* 

8153. 






3905 


G03043 


Homo ^ar)ien<? 


Human oprrpfpH nrnfpin QPH TD WO' 

7124. 


19Q 
1 £y 


5D 


3906 


AK025047 


Homo <ianiftn<; 


UUIIOUICU pi tJLCLLI pUJUUCl 


197 
1Z/ 




3907 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124 


135 


86 


3908 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718 


134 


72 


3909 


X77816 


Rattus 
norvegicus 


PR-Vbetal 






3910 


G02654 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6735. 


114 


70 


3911 


AK024455 


Homo sapiens 


FLJ00047 protein 


135 


61 


3912 


U93567 


Homo saoiens 


rjutative ol 50 






3913 


W54966 


Homo sapiens 


Synthetic human type III collagen 
SYN-C3. 


96 


40 


3914 


X94976 


Brassica napus 


cell wall-plasma membrane linker 
nrotein 


104 


36 


3915 


P92219 


Homo sapiens 
fliiimarA 

\i mi nun j 


CR1 protein. 


125 


73 


3916 


X92485 


Plasmodium 
vivax 


pval 


QA 


Of 


3917 


Y36495 


Homo saoiens 


Frapment of* human ^prretpd nrnfpm 

encoded by gene 27. 


1 0S 




3918 


AF1 18086 


Homo sapiens 


PR01992 


130 


87 


3919 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


125 


80 


3920 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


79 



232 
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SEQIO 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








4493. 






3921 


AF090930 


Homo sapiens 


PRO0478 


128 


64 


3922 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


114 


75 


3923 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


130 


65 


3924 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNO:103. 


134 


71 


3925 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


102 


58 


3926 


D63487 


Homo sapiens 


The K1AA0 1 53 pene nrnduct k related 
to a putative C.eleaans eene encoded in 
cosmid F42A8. 


141 


1 \J\J 


3927 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


117 


37 


3928 


X58438 


Mus musculus 


proline rich protein 


136 


36 


3929 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


152 


71 


3930 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


132 


73 


3931 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


86 


48 


3932 


W48351 


Homo sapiens 


Human breast cancer related Drotem 
BCRB2. 


110 


65 


3933 


X92485 


Plasmodium 
vivax 


pval 


116 


36 


3934 


X97675 


Homo sapiens 

— ZZ. — 


plakophilin 2b 


129 


82 


3935 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
?339 


84 


58 


3937 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


107 


56 


3938 


AF090895 


Homo sapiens 


PRO0117 


166 


68 


3939 


K02550 


Oncorhynchus 
raykiss 


70-kilodalton heat shock protein 


104 


38 


3940 


AK024455 


Homo sapiens 


FLJ00047 protein 


112 


74 


3941 


L26953 


Homo sapiens 


chromosomal protein 


92 


67 i 


3942 


U63542 


Homo sapiens 


FAP protein 


135 


75 


3943 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


139 


80 


3944 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


141 


67 


3945 


AK024455 


Homo sapiens 


FU00047 protein 


123 


76 


3946 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


92 


52 


3947 


W48927 


Homo sapiens 


Schwannom in-binding protein C- 
terminal fragment 


103 


56 


3948 


AF252293 


Homo sapiens 


PAR3 


161 


44 


3949 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 1 00 clone HNFIU96. 


125 


47 


3950 


Y86248 


Homo s^iens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


128 


59 


3951 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


174 


87 


3952 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


152 


75 


3953 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


168 


64 


3954 


R95913 


Homo sapiens 


Neural thread protein. 


110 


54 


3955 


AF 130089 


Homo sapiens 


PRO2550 


122 


63 


3956 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


119 


48 
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SEQ ID 


Accession No. 


Species 


Description 


Smith. 


% 


NO: 








Waterman 


Identit 










Score 


y 


3957 


Y36203 


Homo saoiens 


Human cArrftpd nrntpifi &7S 

XXU111CU1 j^WlblvU UlUlvilJ IT 1 J • 


121 


ID 


3958 


Y86248 


Homo saoiens 


Human secreted" nrotein HCHPF68 

XXUXJ1CUJ jbUvlMl 1/J vlvlil 1 vJOj 


177 
if/ 


61 
Ol 








SEO ID NO- 163 






3959 


G00328 


Homo saoiens 


Human secreted nrotein SEO ID NO* 


U J 


oy 








4409. 






3960 


U93563 


Homo saoiens 


outative nl SO 


227 


o^ 


3961 


D86853 


C^ath a ran th 1 1 

V_^Cll_llCXl tUllllUO 


PYt¥*ncin 


1O0 


/in 






roseus 








3962 


U12390 


Olonino vertor 

>w lull ii igj y v»i^ tut 




1 Al 


Hi 






oSoortl 








3963 


GO 1828 


Homo aniens 


Humeri w*prptf*d r\rr\tf»iii ^FO TTi MfV 

X JLLULICUl aCL/lC/LCLI JJlUlV/111, OLiy 1JL/ 1 1 Vv. 


107 










5909 






3964 


G00397 


Homo sapiens 


Human secreted nrotein. SEO ID NO* 


121 
i«ii 


x!9 








4478. 






3965 


G03714 


xiv/mu oapiviia 


Hum Jin c/>r*rf»tf>/-* nrntptn QI?fi XV\ "NffV 

xi ui 11 ail bcuieieu piuicm, ocy iu 


1 ^0 


00 








77QS 






3966 


Ml 1901 


Ratine 


pruimc-i ivn salivary pruicin 










norvefficus 








3967 


AF229126 


Homo saniens 


acetylcholinesterase collaoen-lilfp tail 

(i-\*\jLj I VI iv/ 111 1 V#o Iwi Qow \AJ t 1 OfcVM 1 IxxO Will 


1 OR 


36 








subunit isofbrm VII 






3968 


AJ006770 


f>ri^finiim 
\/ivci aiiciuiuiu 


pvfpncin 
CALCIlolil 


or 

OO 


OO 


3969 


AF130089 


Homo sanierK 


X A XA_/X. J _l V/ 


19R 


R9 


3970 


U93563 


Homo <?anierK 


mitativp nl SO 

^/UUxLlVVv pi JV/ 


00 


Ho 


3971 


AK000496 


Homo saoiens 


unnamed nrotein nrodnet 

UUlluUim UlUlGxll UlUUUvi 


1 ^xl 


71 
/ 1 


3972 


Z70684 


Caenorhahd itis 


F28D1 ft 


1 OR 


A9 






elegans 








3973 


AF010144 


Homo saoiens 

XXV/UXW OUULwlliP 


neuronal thread nrotein A TY7e-"MTP 

11CU1 KJUCLl Ull CttU JJ1 vslvlxl ni^ / V IN 1 x 


l j f 


79 I 
/Z 


3974 


AK024455 


Homo ^aniens 


l JL/Jv/v/v/*-r / pi ULolll 


1 ^7 


Oj 


3975 


AK024455 


Homo c:anipn<t 




IZo 


o3 


3976 


X01918 


TYro<:nnh tin 

1_/1 \SoU£/lllI<x 


oaij vdiy giauu giuc protein 


OO 


HU 






melanogaster 








3977 


G02872 




Human c w rf» t(>f\ T\rt~\tf* i n QPA TT~^ XT/^\« 
llUlUdll dCLrlCLCU pruLcLU, ocy 1U IN w. 


1 jO 


HO 








6953 






3978 


AF229067 

* ***** y w/ / 


Homo ^aniens 

XXl/lxlv/ jUpiUil J 


PADT-H nrotein" 

X /TLX-/1 11 JJlvJlGlxl 


107 


R7 
Of 


3979 


M64793 


lVOllUo 


oallvcUy J/l UlULC-'rivU pi U161U 


1 5(0 


Ho 






norvepieus 

xlvl TvglvUd 








3980 


S80119 


Ra.ltus so 


reverse h*Jin<scririt!iQe homolnc 

lv¥ CI jv U CUuU lUUuC llUlxlV/lv/^ 






3981 


AF090901 


numu dapicua 




1U0 


Oil 

yn 


3982 


AF025467 


C\sk en orh ahd iti c 

V-KlvllVl llox/ HI 11 a 


COlllalXla bUUllaTiiy lO UlUSOpilUa JLTNA- 


111 


HZ 








UxlxUlilg JpiUlCiU XVlU ^INXL/.gO lHo) 






3983 


X92485 


Plasmodium 

X HIillllVWUI II 


pval 


109 


30 






vivax 








3984 


G02640 


Homo sapiens 


Human secreted orotein SFO ID NO* 


127 


6A 








6721. 






3985 


AB012162 


Homo sapiens 


APCL protein 


190 


42 


3986 


AF016099 


Mus musculus 


endonuc lease/reverse transcriptase 


139 


65 


3987 


U93570 


Homo sapiens 


putative pi 50 


170 


71 


3988 


D38112 


Homo saoiens 


MADH dehvdropena^e <iiihnnit *^ 


902 


1 j 


3989 


D38112 


Homo sapiens 


NADH dehvdroeenase subunit 5 


244 


85 


3990 


V00662 


Homo sapiens 


URF 4 rNADH dehvdroffenase Dublin \i\ 


420 


92 i 


3991 


D38112 


Homo sapiens 


NADH dehydrogenase subunit /l 


272 




3992 


L38908 


Nicotiana 

L. ~ 1VV mmAfcl 


evtensin 

VWVXylU 111 


1 14 
i it 


40 






tabacum 








3993 


AF010144 


Homo sapiens 


neuronal thread nrotein AF>7r-iSrTP 

xx will V/Ucxx li 11 ocivj ^Jl v itill rVi/ / v IN JLx 


1 16 


64 

OH 


3994 


AF090944 


Homo sapiens 


PRO0663 


99 


,39 


3995 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


75 








7888. 






3996 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


574 


87 


3997 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


488 


76 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Ideniit 

y 


3998 


U52077 


Homo saniens 


manner transposase 


475 


70 


3999 J 


U93570 


Homo sapiens 


p40 


ill 


34 


4000 


U 15647 


Mnc mn^culus 

&'1U«J IUIUvUIUu 


reverse transcriptase 


137 


43 


4001 


AF1 16712 


Homo ^aniens 


PR02738 


105 


52 


4002 


AF113685 


Homo ^aniens 

All/lUv \J\XXJX\jxx& 


PRO0974 


125 


53 


4003 


W48351 


Homo <5aoiens 


Human breast cancer related protein 
BCRB2. 


92 


78 


4004 


AF1 16715 


Homo <;anifin<? 


PR02829 


115 


73 


4005 


D38112 


Homo QAniftn^ 

XX villi \J «XlLflt/llD 


NADH dehydrogenase subunft 2 


2S8 


RO 

Ou 


4006 


D38112 


Homo <5anieii<i 


cvtochrome c oxidase suhnnif 1 


614 


S6 


4007 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


106 


45 


4008 


U90268 


Homo sapiens 


Kritl 


152 


52 


4009 


V00667 


xxUllli/ oajJlCilo 


TTpp j, fMATiW rlehvrtroaenac^ onWiinitr 
uivr *+ ^lNrturx ucuyuiugciiaoc ouuiiiii l ) 


AT) 




4010 


B03148 


Homo sapiens 


Human neuronal differentiation fector- 


304 


100 


4011 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 />JTYF-1 i 


472 


98 


4012 


W50193 


Homo sapiens 


Amino acid sequence of salivary 


102 


50 


4013 




XTOIIIU i>a.plC115 


T-Tnman c/*rrrtf*H nrntein ^FO IT) MO- 
IT lull oil dCUCiicu jjiulciu, ocy kXJ nv. 

4343. 




7^ 


4014 


G(ttR01 

VJvJOv I 


l luniu dapiciio 


Witman cf>rr#»teH nmtpin 9PO TTi MO* 
it m 11 oil ^cvicicu wiuigui, ocy xly llU. 

7882 


1 1 X 


4.7 


4015 


U 12690 


Homo ^anipnc 


cvtochrome 03rida?ie QiiHiintt TT 


522 


81 

O 1 


4016 


G02480 


Homo ^flnipn^ 

XlVlXlvF OtljJlVTlJ 


Human arreted nrotein SEO ID NO* 

x nit mm jt»wi VrfixAi L/iv*iviii, UL/y y\j \r\ vy« 

6561. 


147 
i * i 




4017 


AF010144 


Homo ^aniens 


neuronal thread nrotein AD7c-NTP 


106 


57 


4018 


AB003476 


Homo sapiens 


gravin 


549 


91 


4019 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


391 


93 


4020 


U44949 


Xenonn<* laevis 


Ton a nellueida A plvconrotein hnmolop 


143 


32 


4021 


D38113 


Pan troglodytes 


cvtochrome c oxidase subunit 1 


571 


88 


4022 


D3R1 H 

L/JOl 1J 


Don irf\crlr\riirt(*c 

Fall UtJglLJU.jrl.Ca 


UyUJCllI UlllG L» UAIUaoC MlUUlllL 1 




o / 




vJv*tv / o 


riv/iiio sapiens 


Human c(*rrf*tf>A nrnf-pin QWl Til XI fY 

xiuniau sccrcicu pruLciii, ojza^ il/ iNv/- 
8159 


IOj 


K& 
oo 


4024 


AL390114 


Xvwidiixxi auia 

major 


wAUvllldjr vjfolGUlW VaiillC 111/11 LVlVJtGlxl 


131 

1 J X 


JO 


4025 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


583 


86 


4026 


AF090895 


Homo sanipnq 


PRO0117 

X i\\>/V 11/ 


104 


49 


4027 


AF130056 


Homo sapiens 


PRO 1367 


80 


60 


4028 


X97675 


XTULXIV OdLMCllo 


nlnVnnhilin 7H 
UlCuvUplllllll 


i j / 


Oj 


4029 


X67337 


Unntn ciiniptiQ 

XXvMllvf OOlJlOllO 


xxuiiicux jji w~xiix\a^i^v wivardgv xa^iui 1 uo 

kDa subunit 


1 14 

1 1*T 




4030 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


110 


74 


4031 


033U2 


Homo QflnipnQ 


ATPfiop oiihiinit fi 




RO 


4032 


^70718 


kk C U 1 1 uc 11 U \J LU a 

nnlch errimus—se 

uuiwiiu x miuj 

a urchins, tests, 
Peptide, 632 aa 


xxUllLUU i/Uxlogvil aipilcl liv ailu Jtv 

chains 






4033 


Y12713 


Mus musculus 

XTAU** illUUVUlUil 


Pro-Pol-dUTPase Doivorotein 

X X\J X \JX \XKS XX ****** L/l\/fc-wU.l 


355 


66 


4034 


AC004497 


Homo sapiens 


MX2 


464 


58 


4035 


G00333 


Haiti n caniptiQ 

XX vr J 111/ dCL|7lv>XXO 


Human QPorptpH nrofpin SPO TTi XTO* 

11 111 11 Oil oOLl CLA^Vl JpiVLClll, OJL>y IT/ 1> \-/ . 

4414. 


1ST 
i j i 


ox 


4036 


AF 117888 


Homo sapiens 


myosin-IXa 


113 


34 


4038 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


100 


48 


4039 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


153 


73 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






~U : 

Homo sapiens 


NADH dehydrogenase subunit 4 


j>38 


86 


4041 


AF1 16661 


Homo sapiens 


PR01438 


136 


53 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


120 


84 




LfOO 11/. 


Homo sapiens 


cytochrome c oxidase subunit 3 


493 


85 


AfiAA 


MOO SI A 


Homo sapiens 


glutamine:fructose-6-phosphate 
amidotransferase 


261 


72 


*t\J*tJ 




nomo sapiens 


Jriurnan secreted protein, abQ ID NO: 

AAHQ 
44/0. 


102 


64 


4046 


AFl 30089 


Homo sapiens 


PRO2550 


143 


78 






Homo sapiens 


rKU0o57 


| in 

119 


41 


4048 


R95913 


Homo sapiens 


Neural thread protein. 


138 


47 * 


ACiAQ 


ArU10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


111 


59 


4050 


X77962 


Tetrahymena 
thermophila 


fibrillarin 


105 


49 


A(\S 1 


ru\fvxi 9 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4419. 


144 


71 




Yft77Q/l 


— — — : 

Friend murine 

leukemia virus 


Pr65 


107 


37 




i uuyy4 


Homo sapiens 


Human CSR3 protein sequence. 


109 


38 


4054 


AL12I585 


Homo sapiens 


bA504H3.1 (SNX5 (sorting nexin 5)) 


299 


74 




VUUooZ 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


314 


80 




CjU343o 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


65 


54 






Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


187 


67 


4058 


AF205385 


Pan troglodytes 


NADH dehydrogenase subunit 5 


202 


89 


40oy 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


187 


100 


hUOU 


Uo3j42 


Homo sapiens 


FAP protein 


142 


73 


4061 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


158 


50 


4062 


Y95028 


Homo sapiens 


Human clone vp7_l ORF2, SEQ ID 
NO: 128. 


123 


44 


4063 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


76 


65 


4064 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


165 


46 


Af\£L C 

4Uo5 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


147 


93 




UU3U43 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

' 719/1 


109 


76 


4067 


AB035523 


Gallus gallus 


avenalll 


101 


48 






Arabidopsis 
thaliana 


Pto kinase interactor, putative 


105 


43 




ArZlUODI 


Homo sapiens 


XT A/^1P 

NA018 


97 


77 


dfrcn 

*RI/U 




Kattus 
norvcgicus 


salivary proline-rich protein 


111 


43 


4071 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

00 iy. 


158 


65 


407? 




nomo sapiens 


salivary proline-rich protein 1 


131 


43 


4073 


AF038007 


Homo sapiens 


FIC1 


153 


96 




M3J225 


T» 

lrypanosoma 
brucei 


ATPase 6 


103 


37 


4075 


U54788 






1 10 


Af* 


4076 


M81321 


Macaca 
fascicularis 


proline-rich protein 


104 


36 


4077 


X52235 


Homo sapiens 


ORFn 


192 


39 


4078 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


83 



236 



WO 01/64835 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 1 
Identit 

y 








AAfiQ * 
*MtUy. 






*wj iy 




nomo Sapiens 


w Aoi -tamiiy protein 


OS 

yo 


A 1 

41 


4080 


L17318 


JRattus 
norvegicus 


proline-rich proteoglycan 


105 


42 


4081 


AF130051 


Homo sapiens 


PRO0898 


117 


42 


40oZ 


At 1300oy 


Homo Sapiens 


JrJvOzOjO 


ft* 
OJ 


TO 


4083 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/OOO. 


117 


48 


4084 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CUn- 1 . 


89 


40 


4085 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

//.JO, 


163 


80 


4086 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AAQA 

44o4. 


144 


74 


4Uo 1 


Ar0o5o91 


Homo sapiens 


multidrug resistance-associated protein 
3A 


i on 
lyf 


/IO 

49 


40oo 


00x490 


Homo sapiens 


Human secreted protein, 6bQ 1U NO: 
6571. 


i no 
Ivy 


oz 


4059 


ArO 10144 


Homo sapiens 


_ - ^ 1 mi . jt j.^ * - A T\*T««. "HI" l'l> 

neuronal thread protein AD/c-N lr 


139 


51 


4090 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHrOz9- 


103 


65 j 


4091 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


159 


68 


4092 


X924o5 


Plasmodium 
vivax 


pval 




3o 


4093 


U93569 


Homo sapiens 


p40 


205 


40 


4094 


I 1/ZOJZA 


African swine 
fever virus 


cd2 homologue 


OO 

yo 


45 


4095 


A/1413 


Homo sapiens 


nr pi 

ELE1 


675 


98 




O04U /Z 


Homo sapiens 


Human secreted protem, oci^ id no. 

o 103. 


i4y 


£n 
00 


4097 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


116 


70 


4oyo 


Aruyoy30 




Homo sapiens 


JFK004/0 


1 AQ 

i4y 


/o 


4099 


AF090942 


Homo sapiens 


PRO0657 


124 


56 | 


A 1 Aft 

4100 


r>M, C£A 

U03564 


Homo sapiens 


Human secreted protein, bbQ ID NO: 
7645. 


139 


o2 


>1 1 A1 

4101 


JUZ0953 


Homo sapiens 


chromosomal protein 


104 


54 


4102 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

TOO 

7253. 


128 


70 


4103 


U03 / VU 


Homo sapiens 


Human secreted protem, obQ ID NO: 
7871. 


Oil 

94 


CO 

58 


4104 


T TT3 1 OO 


Mus rousculus 


Rho~guanine nucleotide exchange 
factor 


T7n 
3/U 


5o 


4105 


AF090931 


Homo sapiens 


PRO0483 


104 


90 


H-100 


run 70 a 

vJV/j /yo 


Homo sapiens 


Human secreted protem, on^ id nu. 
/ o #y. 


Ijj 


Id. 


4107 


AF1 18082 


Homo sapiens 


PRO1902 


145 


49 


din© 




jtiomo sapiens 


uoiquitous iJrK-moni proiein x 
isoform 


1 10 


^n 


*noy 


U10J04 


Homo sapiens 


p40 


539 


oi 
91 


/ii in 
4110 


U03 /o9 


Homo sapiens 


Human secreted protem, oJtQ ID NO: 
7870. 


144 


54 


Al 1 1 
4111 


rA?CA7 


Homo sapiens 


Human secreted protein, bJaQ ID NO: 

/ OOO. 


138 


65 


4112 


AF217536 


Homo sapiens 


truncated mevalonate kinase 


91 


73 


4113 


G03203 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7284. 


120 


36 


4114 


M17522 


Paracoccus 


cytochrome cl precursor (EC 1.10.2.2) 


101 


41 
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Identit 
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denitrificans 








4115 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


121 


59 


A t 1 £1 

4116 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


148 


73 


4117 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


80 


85 


4118 


X04758 


Homo sapiens 


1 I y« W\ ft f A A l A AAvN 

pro- alpha (V)collagen (AA 1099) 


106 


39 


4119 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


122 


47 


4120 


AF130089 


Homo sapiens 


PRO2550 


132 


68 


4121 


A HA AAA Al A\ 

AF090942 


Homo sapiens 


PRO0657 


136 


48 


4122 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


115 


66 


4123 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


72 


48 


4124 


X92485 


Plasmodium 
vivax 


pval 


104 


54 


4125 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


131 


59 


4126 


AK024455 


Homo sapiens 


FLJ00047 protein 


86 


61 


4127 


AF 119855 


Homo sapiens 


PRO 1847 


99 


68 


4128 


S80119 


Rattus sp. 


reverse transcriptase homolog 


129 


39 


4129 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


156 


62 


4130 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


135 


72 


4131 


R86406 


Homo sapiens 


Human matrix metalloprotease 
MMPm la- 


108 


83 


4132 


L16461 


Chlamydomonas 
reinhardtii 


structural wall protein 


87 


37 


4133 


Y30713 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


232 


95 


4134 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


121 


69 


4135 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


135 


70 


4136 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


134 


42 


4137 


Y15173 


Human 

papillomavirus 
type 75 


E4 protein 


101 


38 


4138 


A Y*t f\f\OZ\ 

AF130089 


Homo sapiens 


PRO2550 


130 


35 


4139 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, - 
SEQIDNO:163. 


111 


38 


4140 


Y36203 


Homo sapiens 


Human secreted protein #75. 


120 


•48 


4141 


AF130089 


Homo sapiens 


PRO2550 


139 


36 


4142 


S80119 


Rattus sp. 


reverse transcriptase homolog 


170 


47 


4143 


U54636 


Staphylococcus 
aureus 


protein A 


135 


35 


4144 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


157 


55 


4145 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


112 


54 


4146 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

TOO/I 

7284. 


117 


37 


4147 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct 


98 


61 


4148 


AF130051 


Homo sapiens 


PRO0898 


124 


88 


4149 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


67 
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Smith- 


% 


NO: 






Waterman 


Identit 










Score 


y 








4754. 






4150 


Z29481 


Homo sapiens 


3-hydroxyantiiranilic acid dioxygenase 


244 


62 


4151 


M81321 


Macaca 


proline-rich protein 


172 


50 






fascicularis 








4152 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


66 








4497. 






4153 


AF1 19851 


Homo sapiens 


PR01722 


128 


62 


4154 


AB037826 


Homo sapiens 


KIAA1405 protein 


194 


60 


4155 


M 12099 


Mus musculus 


proline-rich protein 


124 


37 




CJ03714 

\J\JJ / 1*T 


J-Irim c\ Qat>ipn<i 
ni/iiiu odpit/iio 


Human secreted nrotein SEO ID NO* 


122 


46 








7795. 






4157 


Ml 1901 

1Y1 1 1 71/ X 


i vail no 


oroline-rich salivarv nrotein 


124 


41 






norvefncns 








4158 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


83 








4446. 






4159 


G03172 


Homo saoiens 


Human secreted protein, SEQ ID NO: 


141 


74 








7253. 






4160 




11 V XlOllUlUO 


hvdroxvnroline-rich nrotein 


135 


39 






OU11UUJ 








4161 


U35730 


JY1US UJUoVUllU 


ierkv 


116 


31 


4162 


781525 


vOtUU 1 lid u u 1 1 10 


contains similaritv to Pfam domain* 

Wx/Xl m I IIJ J1JU1HU IV J L Vy x 1UU1 Vlwl 1 twill 


130 


50 








PF01391 f Collagen triole helix reoeat 












(20 copies)), Score=35.5, E-value=4e- 












07,N=2 






4163 


AF1 13685 


Homo saoiens 


PRO0974 


129 


45 


4164 


G04063 


Homo saoiens 


Human secreted protein, SEQ ID NO: 


168 


71 








8144. 






4165 


U49973 


Homo ^aniens 


ORF1; MER37; putative transposase 


137 


70 








similar to pogo element 






4166 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


175 


72 


4167 


G03790 


Homo saoiens 


Human secreted protein, SEQ ID NO: 


86 


45 








7871. 






4168 


AB000505 


Daucus carota 

vuuvvii wen vw 


unnamed protein product 


101 


43 


4169 


U87607 


Rattns 


putative RNA binding protein 1 


113 


30 






norvegicus 








4170 


AF1 19900 

iU 1 x.s s w 


Homo saoiens 


PR02822 


154 


59 


4171 


AE001381 


Plasmodium 


hvnothetical nrotein 


106 


33 






falciparum 








4172 


AE000034 


Mveon lasma 


bifunctionai threonine dehydrogenase* 


100 


28 






pneumoniae 


galactosyltransferase 






4173 


U12390 


Cloning vector 


beta-galactosidase alpha peptide 


100 


47 






pSportl 








4174 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


68 








6561. 






4175 


AK024455 


Homo sapiens 


FLJ00047 protein 


116 


60 


4176 


Ml 1759 


Lycopersicon 


cell wall hydroxyproline-rich 


94 


44 






esculentum 


elvcoDTotein 






4177 


AK024455 


Homo sapiens 


FLJ00047 protein 


102 


59 


4178 


AY007557 


Mycobacterium 


fibronecun-attachment protein FAP-P 


98 


42 






avium subsp. 












paratubercuiosis 








4179 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 


115 


69 








clone HNHF029. 






4180 


M12100 


Mus musculus 

1TAUJ AiAUiJVUAUu 


oroline-rich nrotein MP-3 


132 


44 


4181 


AF090930 


Homo sapiens 


PRO0478 


138 


63 


4182 


G03172 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


152 


75 








7253. 






4183 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


184 


84 








SEQIDNO:163. 
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4184 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


150 


43 


4185 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


128 


51 


4186 


AF090944 


Homo sapiens 


PRO0663 


124 


49 


4187 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


93 


62 


4188 


D38112 


Homo sapiens 


ATPase subunit 6 


439 


78 


4189 


AF130051 


Homo sapiens 


PRO0898 


89 


57 


4190 


AF090930 


Homo sapiens 


PRO0478 


137 


83 


4191 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


100 


35 


4192 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


212 


91 


4193 


Y13247 


Homo sapiens 


FBI 9 protein 


142 


46 


4194 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


268 


93 


4195 


AF 124729 


Mus musculus 


acinusS 1 

viv MM 1 UJVJ 


140 


49 


4196 


AJ277425 


Globodera 
Dallida 


putative cuticular collagen 


156 


43 


4197 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SE0IDNO163 


161 


77 


4198 


Z70684 


Caenorhabditis 
elegans 


F28D1 8 


97 


4^ 
*t j 


4199 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


119 


36 


4200 


X62379 


Mus musculus 


form in, isoform IV 


115 




4201 


D38113 


Pan troglodytes 


NADH dehydrogenase subunit 4 


224 


93 


4202 


M14820 


Trypanosoma 
brucei 


ORF2 bases 1807-2850; first start 
codon at 2032; putative 


103 


29 


4203 


U93572 


Homo sapiens 


p40 


182 


34 


4204 


D82026 


Silene latifolia 


glycine-rich protein 


98 


49 


4205 


AF003151 


Caenorhabditis 


contains similarity to an RNA 

ropn o n i 'Ki nn m/vt"iT 
IClHJgUlUUU lilUUl 


107 


40 


4206 


L07545 


L'V/ 1 Oil ill 1 CUllcL 

tarentolae 


£\ wdb iiiocrLcu. diier ni J oy ^ ni 
10459 in genomic sequence (M10126)) 
to correct -1 frame<»hift nrfthahlv rhif* tn 
pel compression 

fc*** VVUlUlwulVU 




Jj 


4207 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


104 


38 


4208 


AF229126 


Homo sapiens 


acetvlchoHne^tera.se collagen-like tail 

subunit isoform VII 


7U 


7S 


4209 


AF090895 


Homo sapiens 


PRO0117 

M A\V/V AMI 




vO 


4210 


X92485 


Plasmodium 
vivax 


pval 


104 


35 


4211 


G02828 


Homo saniens 


Human secreted orotein. SEO ID NO* 
6909. 


173 


# j 


4212 


G02828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6909. 


151 


69 


.4213 


AF1 15549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


122 • 


43 


4214 


AF090944 


Homo sapiens 


PRO0663 


97 


39 


4215 


X12544 


Homo sapiens 


3 HLA-DR B protein precursor (AA - 
29 to 267) 


111 


60 


4216 


M69065 


human 
herpesvirus 2 


ORF1 


89 


38 


4217 


AF090895 


Homo sapiens 


PRO0117 


133 


78 


4218 


BO 1372 


Homo sapiens 


Neuron-associated protein. 


135 


46 


4219 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


71 


47 


4220 


U93567 


Homo sapiens 


p40 


170 


35 


4221 


D00570 


Mus musculus 


open reading frame (251 AA) 


108 


28 


4222 


X97675 


Homo sapiens 


plakophilin 2b 


122 


47 


4223 


AF270937 


Plutella 
xylostella 


PxORF73 peptide 


99 


54 



240 



WO 01/64835 



PCTVUS01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 
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granulovirus 








4224 


A T1 ^AAOO 


Homo sapiens 


rK.Uz55U 


1 A A 

144 


69 


4225 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
loot. 


125 


82 


4226 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homo log 


99 


62 


4227 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct 


172 


59 


4228 


W4S35 1 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


lol 


*7/1 ! 

74 


4229 


AT flAQtZII 

AJLU49537 


Homo sapiens 


aJ 1 104110.1 (breleiain A-innioitea 
guanine nucleotide-exchange protein 2) 


Z1Z 


9U 


4230 


A T1 1 OAO< 

Arl loUSo 


Homo sapiens 


DI>/^l1 OAO 

rKU199z 


1 JU 


09 


4231 


M81321 


Macaca 
fascicularis 


proline-rich protein 


130 


41 


4232 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/ca< 
6595. 


138 


68 


4233 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


132 


56 


4234 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


110 


52 


4235 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


124 


32 


4236 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
o5oo. 


119 


42 


4237 


A9349S 


- - ■ ■ ■ . —J- 
Homo sapiens 


O 1 /^lufnmin A n !yl O toll D*-*-iJ-fkTn 

z i-vjiutamic Aciu-iucn rrotein 


30o 


jo 


423 o 


K95913 


Homo sapiens 


Neural thread protein. 


I3y 


55 


423 y 


uUU3Zo 


Homo sapiens 


riurnan secreted protein, onvj iu wu 
4409. 


lit A 
140 


03 


4Z4U 


A9/o/5 


Homo sapiens 


piaKopnnin zo 


1 Aft 


*7C « 
/5 


4241 


Y36156 


Homo sapiens 


Human secreted protein #28. 


137 


65 


4242 


AK00U385 


Homo sapiens 


unnamed protein product 


131 


33 


4243 


AJ252253 


human 
herpesvirus 2 


glycoprotein G-2 


107 


36 


4244 


Y99447 


Homo sapiens 


Human PRO 1556 (UNQ764; amino 
acici sequence ocxi iu nuj tz. 




100 


4Z4D 


Y0339B 


; 

Homo sapiens 


Human im. ligano injlo protein 
sequence. 


AO A 

4Z4 


0/1 
o4 


4246 


L00016 


Homo sapiens 


urf4 


222 


90 


4Z4/ 


ATI 1A*Z7Ci 

Ar 1345/9 


Zea mays 


arabinogalactan protein 


1^4 


39 


*»Z4o 


ZJ440J 


Zea mays 


extens in-like protein 


IZj 


J D 


*tZ4y 


/\rUlU144 


Homo sapiens 


neuronal tnreau proiein /\l//c-jn ir 


108 
iZo 


OZ 




JUZOiO 


r IHSITlUUJUlTl 

yoelii 


circumsporuzoiic pruLcin 


1 ID 
1 IU 






AVAO/1>1<< 
/\JvUZH*tJJ 


nOIDO SapiCub 


CT \(\(\C\An nmtpin 

r Ljj\j\jvh* i p ruic iii 


J JU 


OJ 


4252 


U02570 


Homo sapiens 


CDC42 GTPase-activating protein 


566 


93 




UJol 1Z 


fiomo sapiens 


cytoenrome c oxiaase suounii i 


44^7 




4ZD4 


K13D30 


Homo sapiens 


jrToiem encoucu aownsueam oi nnc_jvi 
oncoprotein. 


111 
111 


OJ 


4Z05 


Mol 143 


Bovine 
herpesvirus 1 


latency-related open reading frame 1\ 
putative 


1 A1 
1U1 


4Z 




A Cfk \t\ \ AA 

ArUlU144 


Homo sapiens 


neuronal tnreaa protein J\3J /c^ri ir 


11j 


43 


4257 


Y06294 


Homo sapiens 


Human transcription regulator MOP6 
partial sequence. 


410 


79 


4Z5o 


I 3 0495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


i n 
1 12 


OI 


4259 


D90252 


Human 

papillomavirus 
type 5b 


E4 protein 


110 


32 
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4Z0U 




Homo sapiens 


elastic titin 


660 


95 


4261 


AF090930 


Homo sapiens 


PRO0478 


114 


71 


4262 


AF 132209 


Homo sapiens 


prepro-major basic protein homolog 


422 


72 


4Zco 


Ar04J 10Z 


Pneumocystis 
carinii 


surface glycoprotein A 


121 


29 


4264 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


132 


55 


4265 


AF090895 


Homo sapiens 


PRO0117 


159 


62 


42oo 


Ar 090930 


Homo sapiens 


PRO0478 


143 


50 


4267 


U03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


119 


34 


42oo 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


106 


61 


4zoy 


DZJooO 


Homo sapiens 


ribosomal protein 


493 


73 


4Z /U 


API ^AAOA 

Ar 130089 


Homo sapiens 


PRO2550 


119 


39 


4271 


U93564 


Homo sapiens 


putative pi 50 


121 


37 


4Z/Z 


A DAI "5^1 C 

AJB033615 


Mus musculus 


phospholipase C-L2 


485 


79 


4273 


AF006082 


Homo sapiens 


Arp2 


488 


85 


4Z/4 


Y 12293 


Mus musculus 


lun 


104 


33 


4275 


X54289 


Bos taurus 


cGMP-dependent protein kinase 
(isoform I beta) 


561 


81 


4276 


AF1 19855 


Homo sapiens 


PR01847 


155 


71 


4277 


J01415 


Homo sapiens 


MTND4 


372 


63 


4278 


AB021078 


Plasmid CoIIb- 
P9 


ybbA 


101 


30 


4279 


G00328 


Homo sapiens 

- 


Human secreted protein, SEQ ID NO: 
4409. 


110 


73 


4280 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


110 


73 


4281 


AF1 19851 


Homo sapiens 


PR01722 


no 


60 


4282 


D38116 


Pan panisctLs 


NADH dehydrogenase subunit 1 


346 


92 


4283 


Y15908 


Homo sapiens 


DIA-12C protein 


109 


52 


4284 


Y30681 


Homo sapiens 


Splice variant ZAP-IB protein of the 
human tumor suppressor gene ZAP- 1 . 


122 


76 


4285 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


121 


42 


4286 


AK024455 


Homo sapiens 


FU00047 protein 


135 


57 


4287 


Y11525 


Homo sapiens 


CCAAT/enhancer binding protein 
alpha 


100 


32 


4288 


AF033122 


Homo sapiens 


non-p53 regulated PA26-T1 nuclear 
protein 


128 


81 


4289 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


111 


39 


4290 


W48351 


, Homo sapiens 


Human breast cancer related protein 
BCRB2. 


140 


58 


4zy i 


AJ/4Z540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


132 


34 


4zyz 


X0U4JZ 


Zea mays 


prolin rich protein 


118 


42 


4zyj 


U09367 


Homo sapiens 


zinc finger protein ZNF136 


461 


60 


4294 


AF038960 


Homo sapiens 


SKD1 homolog 


146 


82 


4295 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


128 


54 


4296 


U66561 


Homo sapiens 


kruppel-related zinc finger protein 


512 


89 


4297 


AF043706 


Caenorhabditis 
elegans 


contains similarity to granulans 


104 


55 


4298 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 


65 


4299 


AF152510 


Homo sapiens 


protocadherin gamma A3 short form 
protein 


520 


87 



242 



WO 01/64835 



PCT/USO 1/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Store 
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Ideotit 
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4300 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

S177 
332/. 


142 


72 




OU3 / oy 


riomo Sapiens 


numan sccreitxi pruieui, ocy jul* inw. 
7K70. 




sn 


Aim 
43U2 


/1lLU.>j44U 


ATaDiaOpSlS 
UlaJlctlld. 


puLdiive piUlGill 




J 1 


Aim 




xiomo sapiens 




655 


96 


43 


API 1 £717 


t-Tr»rri a canipnc 

noiiiu ^apicno 


JT JVV/6 / JO 


140 


54 


43UO 




Uatyia cq nipnc 

riuiiiu bapiCUb 


am i H AnVi ncnh Ai*fhnQvltran<lfpi*5i^p 
alillUU UllUo UllVl iVvi>j au. at io iw dot- 


182 


92 


4306 


L20450 


Mus musculus 


DNA-binding protein 


470 


64 


43U/ 


AT?71 178£ 


jvius museums 




i / j 


Q7 


4308 


X76850 


Mus musculus 


MAP kinase-activated protein kinase 2 


154 


69 


4309 


com m 

SoOl 19 


Rattus sp. 


reverse transcriptase homolog 


1ZO 


JJ 


4310 


X67337 


Homo sapiens 


Human pre-mRNA cleavage factor 1 68 
KJL/d suounu 


1 1 A 
1 lu 


32 


ai 1 1 
431 1 


uU3 /o / 


riomo sapiens 


Unman c*>r>-n»tf»H nrnlpin QPfl in NO* 

riuman sucreicu pruicin, ociv^ iu 
7868. 




JO 


4312 


7 a mm 


Ovis anes 


Keraiinocyre growin lacior 




7^ 


4313 


AB020700 


Homo sapiens 


KIAA0893 protein 


569 


87 


4314 


AF 124727 


Homo sapiens 


acinusS 


vlOQ 


00 


4315 


W 55027 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


55 


17 ! 


4316 


AL 160493 


Leishmania 
major 


prooaoie tjinv-oj ui wz 3 vanani a 
DNA, complete virion genome 


199 
1ZZ 


j3 


4317 


* pi i AAA A 

AF 119900 


Homo sapiens 


T>T> f\0 Q'VJ 


13 J 


Do 


4318 


U74612 


Homo sapiens 


hepatocyte nuclear fector-3/fork head 
nomoiog i i a 


1 A/1 


7A 

/II 


4319 


O03043 


Homo sapiens 


Uiimon oanr»t»H nrnfpin CCA TT| WO* 

riuman secreiea pro tern, oc\£ Ji-' inva 
7124. 


1 17 
11/ 




432U 


A T>AO*CA^/l 

AdU20Uj4 


Homo sapiens 


Drain linger pruiem 


AM 


QA 


4321 


AF090944 


Homo sapiens 


PRO0663 


133 


71 


4322 


A CI £CC1 1 

Ar 165513 


Homo sapiens 


vacuolar prorem sorung j isororm 


717 


07 


4323 


Y99418 


Homo sapiens 


Human PR013 17 (UNQ783) amino 
acid sequence obi^ id in <j.2 / /. 


532 


97 


4324 


AF163772 


Leishmania 
major 


7138.7 


142 


37 


4325 


OA*) O AT 

G03807 


Homo sapiens 


T T— maIai«i CCA JT\ XI/"\« 

riuman secreted protein, oc\i lu NU. 
/ooo. 


125 




4326 


W40054 


Homo sapiens 


P300/CBP-associated transcriptional 
coiacior rvv^/vr v>-iermiiiui>. 


175 


94 


432/ 


Ar2I /4l 1 


riomo sapiens 


neuroiigm ^ lsoioim xiiNjLij 






4328 


U94836 


Homo sapiens 


ERPROT 213-21 


175 


87 


4329 


Y 20763 


Homo sapiens 


Human neurofilament-M mutant 
protein fragment 45. 


jUl 


»7 
0/ 


/loin 
433 U 


AUA777/I S 


T_T/-v s\ nAt\l An c 

riomo sapiens 


If T A A 1 17 A. ArntPin 
jsjj\j\ i jzh- proiein 






4331 


M91563 


Rattus 

v\r\t^* ram /»i 1 c* 

norvegicus 


NMDA receptor subtype 2C 


116 


39 


4332 


G03704 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

77»^ 
/ /W. 


286 


76 


4333 


AC006841 


Arabidopsis 
thaliana 


Mutator-like transposase 


130 


70 


4334 


AP000373 


Arabidopsis 
thaliana 


jasmonate inducible protein; 
myrosinase binding protein-like 


130 


53 


4335 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 

A cf'opcinn "KTi ifY»T%^i* OA77A51* TV>f^rtiA/T* 

/accession iNumoer ruz /oo, lvicuiuu. 
conceptual translation supplied by 
author 


377 


90 


4336 


AJ277426 


Globodera 
pallida 


putative cuticular collagen 


112 


37 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


4337 


U 16802 


Rattus 
norvegicus 


Ca2+-dependent activator protein; 
calcium-dependent actin-binding 
protein 


560 


79 


4338 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


118 


56 


4339 


AB024520 


Homo sapiens 


notcb4 


93 


33 


4340 


AF 128406 


Homo sapiens 


nuclear prelamin A recognition factor 


241 


100 


4341 


W19771 


Homo sapiens 


Beta-1 integrm modulator B 171. 


169 


100 


4342 


AL096828 


Homo sapiens 


dJ963E22. 1 (Novel protein similar to 
NY-REN-2 Antigen) 


666 


88 


4343 


D50312 


Homo sapiens 


uKATP-1 


542 


80 


4344 


U84487 


Homo sapiens 


CX3C chemokine precursor 


371 


78 


4345 


U93569 


Homo sapiens 


putative pl50 


111 


47 


4346 


AL161755 


Streptomyces 
coelico!orA3(2) 


putative serine/threonine protein kinase 


102 


30 


4347 


AK026162 


Homo sapiens 


unnamed protein product \ 


377 


98 


4348 


AF221759 


Homo sapiens 


Maml 


314 


47 


4349 


AF165926 


Homo sapiens 


NUP155 


147 


84 


4351 


U50185 


Rattus 
norvegicus 


PP-1M 


144 


52 


4352 


D50455 


Rattus 
norvegicus 


phodpholipase C delta4 


196 


70 


4353 


X54131 


Homo sapiens 


protein-tyrosine phosphatase 


261 


87 


4354 


AF151850 


Homo sapiens 


CGI-92 protein 


255 


92 


4355 


G03996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8077. 


105 


95 


4356 


U93564 


Homo sapiens 


putative pi 50 


171 


72 


4357 


AP001507 


Bacillus 
halodurans 


unknown conserved protein 


159 


34 


4358 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


153 


60 


4359 


AF072697 


Mus musculus 


SHYC 


177 


97 


4360 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


129 


69 


4361 


M98776 


Homo sapiens 


keratin 1 


449 


77 


4362 


AK024436 


Homo sapiens 


FUT00026 protein 


671 


91 


4363 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


155 


49 


4364 


D13896 


Rattus 
norvegicus 


cytoplasmic dynein heavy chain 


327 


92 


4365 


X79389 


Homo sapiens 


glutathione transferase Tl 


164 


96 


4366 


AB006458 


Mus musculus 


alpha-D-mannosidase 


177 


56 


4367 


X01060 


Homo sapiens 


put. transferrin receptor (aa 1-760) 


160 


96 


4368 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


103 


54 


4369 


AL353715 


Homo sapiens 


bK3184A7.3.1 (helicase-like protein 
NHL) 


485 


100 


4370 


AF001631 


Oryctolagus 
cuniculus 


glucose-regulated protein GRP94 


118 


92 


4371 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


113 


80 


4372 


AL121741 


Schizosaccharom 
yces pombe 


putative negative regulator of vesicle 
formation 


200 


41 


4373 


M96982 


Homo sapiens 


U2 snRNP auxiliary factor small 
subunit 


279 


70 


4374 


U47741 


Homo sapiens 


CREB-binding protein 


123 


96 


4375 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


127 


75 


4376 


Y12781 


Homo sapiens 


transducin (beta) like 1 protein 


1056 


90 



244 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 

MA. 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


4377 


W69424 


Homo sapiens 


Human secreted protein bg 1 40 1 . 


183 


97 


4378 


AL096700 


Homo sapiens 


dJ499B10.2 (phosphorylase kinase, 
ah>ha2(liver)(PYK)) 


639 


86 


4379 


AF177390 


Manduca sexta 


antennal specific membrane protein 
AMP 


378 


51 


4380 


X97675 


Homo sapiens 


plakophilin 2b 


156 


75 


4381 


R33713 


Homo sapiens 


PgllOl. 


104 


100 


4382 


AB015473 


Arabidopsis 
thaliana 


gene_id:MCM23. 1 -unknown protein 


113 


61 


4383 


AF1 16715 


Homo sapiens 


PR02829 


133 


50 


4384 


AL357472 


Homo sapiens 


VPS33B 


676 


99 


4385 


AF090931 


Homo sapiens 


PRO0483 


155 


58 


4386 


U92645 


Gecarcinus 
lateralis 


alpha- 1 -tubulin 


511 


75 


4387 


Y36095 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 480. 


202 


90 


4388 


M64923 


Bostaurus 


CIO protein 


218 


95 


4389 


X98264 


Homo sapiens 


M-phase phosphoprotein 4 


197 


100 


4390 


AK000264 


Homo sapiens 


unnamed protein product 


239 


73 


4391 


M12937 


Homo sapiens 


ferritin heavy subunit 


662 


84 


4392 


P92219 


Homo sapiens 
(human) 


CR1 protein. 


116 


80 


4393 


X16135 


Homo sapiens 


L protein (AA 1-558) 


759 


99 


4394 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


148 


58 


4395 


AC005591 


Homo sapiens 


PkB-like 


170 


97 


4396 


AF161426 


Homo sapiens 


HSPC308 


313 


77 


4397 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


146 


61 


4398 


AF104921 


Homo sapiens 


succinyt-CoA synthetase alpha subunit 


679 


88 


4399 


AF257330 


Homo sapiens 


COBW-like protein 


586 


90 


4400 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


116 


65 


4401 


AF1 19851 


Homo sapiens 


PR01722 


101 


79 


4402 


M55542 


Homo sapiens 


gu any late binding protein isofonn I 


230 


76 


4403 


Y07752 


Volvox carteri 


pherophorin-S 


459 


88 


4404 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


113 


44 


4405 


U07786 


Sus scrofa 


betaactin 


519 


90 


4406 


A06977 


Homo sapiens 


albumin 


586 


88 


4407 


L20755 


Cuscuta reflexa 


hybrid proline-rich protein;cytokinin- 
induced;haustoria 


112 


41 


4408 


AB002299 


Homo sapiens 


KIAA0301 


612 


98 


4409 


AB001424 


Mus musculus 


KIF17 


104 


47 


4410 


M88108 


Homo sapiens 


p62 


574 


92 


4411 


AL121673 


Homo sapiens 


bA305P22J2 (novel protein) 


415 


71 ! 


4412 


AF064553 


Mus musculus 


NSD1 protein 


224 


64 


4413 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


107 


36 


4414 


A06977 


Homo sapiens 


albumin 


505 


82 


4415 


A06977 


Homo sapiens 


albumin 


596 


90 


4416 


Y87116 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 155. 


123 


50 


4417 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


103 


85 


4418 


GO3053 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7134. 


100 


54 


4419 


AF049606 


Mus musculus 


transcription factor NF-ATc isofonn b 


114 


90 
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SEQU> 


Accession No. 


Species 






0/ 


NO: 








Waterman 


Identit 










Score 


y 


4420 


Z11190_cdl 


Homo sapiens 


1 l-DEC-1998 Interleukin-3 coding 


354 


97 ! 








sequence from b2HFLS20 W cDNA 












library. 






4421 


W64469 


Homo sapiens 


Human secreted protein from clone 


203 


100 








CW795 2. 






4422 


M12523 


Homo sapiens 


alloalbumin Venezia 


350 


94 


4423 


AF 130077 


Homo sapiens 


PR02619 


561 


89 


4424 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


161 


64 








gene 44 clone HTDAD22. 






4425 


Y91577 


Homo sapiens 


Human secreted protein sequence 


393 


91 








encoded by gene 2 SEQ ID NO:250. 






4426 


A09561 


synthetic 


human serum albumin 


651 


96 






construct 








4427 


AL133215 


Homo sapiens 


bA108L7.5 (novel protein similar to 


392 


100 








Plasmodium POM1 and C. elegans 












F46Gll.l(Tr:Q20485)) 






4428 


W33663 


Homo sapiens 


Human puromycin-sensitive 


172 


96 








aminopeptidase (PSA)-68. 






4429 


AB021654 


Homo sapiens 


DD2/bile acid-binding 


184 


81 








protem/AKRl C2/3alpha- 












hydroxysteroid dehydrogenase type 3 






4430 


W63683 


Homo sapiens 


Human secreted protein 3. 


114 


42 


4431 


AY008763 


Homo sapiens 


sentrin/SUMO-specific protease 


447 


96 


4432 


U52965 


Homo sapiens 


ENX-1 


176 


94 


4433 


AF 180470 


Mus musculus 


Kiaa0575 


423 


77 


4434 


X17206 


Homo sapiens 


put LLRep3 protein (AA 1-221) 


581 


99 


4435 


Y70929 


Homo sapiens 


Human zilla4 splice variant protein. 


621 


100 


4436 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


71 








4497. 






4437 


A06977 


Homo sapiens 


albumin 


610 


97 


4438 


A00279 


synthetic 


Human serum albumin 


621 


94 






construct 








4439 


L29028 


Chlamydomonas 


amino acid feature: N-glycosylation 


105 


36 






eugametos 


sites, aa41 43, 46 48, 51 53, 72 












74, 107 .. 109, 128 .. 130, 132 .. 134, 












158 160, 163 .. 165; amino acid 












feature: Rod protein domain, aa 169 












340; amino acid feature: globular 












protein domain, aa 32 168 






4440 


AK024455 


Homo sapiens 


FLJ00047 protein 


112 


65 


4441 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 


133 


69 








similar to pogo element 






4442 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


60 








6953. 






4443 


AF1 13685 


Homo sapiens 


PRO0974 


101 


54 


4444 


W03627 


Homo sapiens 


Human follicle stimulating hormone 


181 


42 








GPRN-terminal sequence. 






4445 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


76 








7871. 






4446 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


54 








4493. 






4447 


L02867 


Homo sapiens 


paraneoplastic antigen 


136 


78 


4448 


D38435 


Homo sapiens 


homologue of yeast PMS 1 


108 


66 


4449 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


65 








A A AO 

4409. 






4450 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


68 


4451 


AF220264 


Homo sapiens 


MOST-1 


101 


40 


4452 


AF130079 


Homo sapiens 


PR02852 . 


97 


59 


4453 


AF116715 


Homo sapiens 


PR02829 


139 


69 
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SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


4454 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


144 


53 


4455 


AF130087 


Homo sapiens 


PR02411 


108 


73 


4456 


AF279891 


Homo sapiens 


dead box protein 15 


108 


58 


4457 


L08258 


Strongylocentrot 
us purpuratus 


kinesin light chain isoform 4 


176 


94 


4458 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


116 


78 


4460 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


146 


74 


4461 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


156 


52 


4462 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6688. 


118 


72 


4463 


AF1 19900 


Homo sapiens 


PR02822 


144 


80 


4464 


M15530 


Homo sapiens 


B-ccll growth factor 


92 


76 


4465 


Y17833 


Human 
endogenous 
retrovirus K 


env protein 


107 


62 


4466 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


132 


78 


4467 


AF090895 


Homo sapiens 


PRO0117 


1 12 


79 


4468 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


146 


64 


4469 


X97675 


Homo sapiens 


plakophilin 2b 


119 


78 


4470 


AF1 18081 


Homo sapiens 


PRO 1900 


119 


74 


4471 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


121 


60 


4472 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


63 


4473 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQEDNO:163. 


205 


73 


4474 


AF130114 


Homo sapiens 


PR02459 


121 


67 


4475 


AF178534 


Homo sapiens 


talin 


213 


67 


4476 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


122 


92 


4477 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ8L 


99 


66 


4478 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


142 


62 


4479 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


153 


55 


4480 


AF217374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


130 


100 


4481 


G03714 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
7795. 


110 


57 


4482 


Y20983 


Homo sapiens 


Human glial fibrillary acidic protein 
GFAP wild type fragment 9. 


1 1 A 

no 


63 


4483 


AF221552 


Oryza sativa 


proline-rich protein RiP-15 


119 


33 


4484 


L25941 


Homo sapiens 


integral nuclear envelope inner 
membrane protein 


110 


84 


4485 


AF130089 


Homo sapiens 


PRO2550 


161 


81 


4486 


U39742 


Caenorhabditis 
elegans 


coded for by C. elegans cDNA 
yk25e53; coded for by C. elegans 
cDNA yk25e53; similar to repeat 
guanylate kinase domain of D. 
melanogaster lethal(l) discs large- 1 
tumor suppressor protein 
(SP:DLG1 JDROME, P31007) and R. 
norvegicus postsynaptic density protein 


98 


72 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








95 (PSD-95) (SP:PSD9_RAT, P31016) 






A A OT 

4487 


Y 86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


89 


81 


A AOO 

4488 


X97675 


Homo sapiens 


plakophilin 2b 


127 


88 


4489 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


105 


71 


4490 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


160 


64 


4491 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


122 


77 


4492 


U83280 


Leishmania 
donovani 


39 kDa antigen 


107 


76 


4493 


AF023142 


Homo sapiens 


pre-mRNA splicing SR protein rA4 


141 


40 


4494 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


119 


40 


A AftC 

4495 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


98 


45 


AAdC 

4496 


ATI inAon 

AF130089 


Homo sapiens 


PRO2550 


132 


75 


4497 


AF090930 


Homo sapiens 


PRO0478 


160 


76 


4498 


AF 11 9900 


Homo sapiens 


PR02822 


158 


55 


4499 


AFl 16661 


Homo sapiens 


PR01438 


126 


44 


4500 


AF 116661 


Homo sapiens 


PR01438 


118 


42 


4501 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


113 


48 


4502 


M81321 


Macaca 
fascicularis 


proline-rich protein 


154 


44 


4503 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


154 


96 


4504 


D44596 


Saccharomyces 
cerevisiae 


Mdjlp heat shock protein 


93 


42 


4505 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


163 


76 


4506 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


207 


93 


4507 


U93564 


Homo sapiens 


p40 


520 


86 


4508 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121 . 


135 


55 


4509 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


109 


77 


Ac t n 

4510 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


146 


66 


4511 


L27428 


Homo sapiens 


reverse transcriptase 


143 


87 


4512 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


105 


80 


4513 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


97 


75 


A CI A 

4514 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


60 


4515 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


120 


73 


A C 1 C 

4516 


AF130051 


Homo sapiens 


T^TX A A A A 

PRO0898 


141 


86 


4517 


U12690 


Homo sapiens 


cytochrome oxidase subunit n 


185 


94 


4518 


G02597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6678. 


130 


100 


4519 


AK024455 


Homo sapiens 


FLJ00047 protein 


152 


68 




Ar/10651 


Homo sapiens 


XT APIO 

NAG 18 


161 


no 

88 


4521 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


140 


68 


4522 


AF217374 


Acanth aster 
planci 


cytochrome oxidase subunit I 


131 


76 



248 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


4523 


U72520 


Mus musculus 


mena protein 


102 


42 


4524 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


129 


75 


4525 


AL1 60493 


Leishraania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


108 


84 


4526 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


139 


76 


4527 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 


89 


4528 


AF09093 1 


Homo sapiens 


PRO0483 


91 


88 


4529 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


111 


51 


4530 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6596. 


130 


92 


4531 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


123 


64 


4532 


L26953 


Homo sapiens 


chromosomal protein 


102 


79 


4533 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


88 


54 


4534 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


122 


40 


4535 


AF090944 


Homo sapiens 


PRO0663 


113 


53 


4536 


U15647 


Mus musculus 


expressed in select embryonal 
carcinoma cells and testicular cells; 
similar to Swiss-Prot Accession 
Number PI 1260 


119 


35 


4537 


L13858 


Homo sapiens 


guanine nucleotide exchange factor 


150 


81 


4538 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


138 


68 


4539 


AF090930 


Homo sapiens 


PRO0478 


104 


53 


4540 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


112 


64 


4541 


K02401 


Homo sapiens 


chorionic somatomammotropin 


672 


96 


4542 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


171 


67 


4543 


J04208 


Homo sapiens 


inosine-5 f -monophosphate 
dehydrogenase (EC 1.1.1 205) 


455 


92 


4544 


K02401 


Homo sapiens 


chorionic somatomammotropin 


397 


88 


4545 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


155 


66 


4546 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


161 


68 


4547 


R24056 


Homo sapiens 


hGH variant #8 - 172Arg 174Thr 
176Tyr 178His. 


117 


62 


4548 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


111 


68 


4549 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


124 


65 


4550 


AF1 19855 


Homo sapiens 


PR01847 


152 


68 


4551 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


153 


70 


4552 


M59217 


Homo sapiens 


alpha- 1 type XIII collagen 


98 


57 


4553 


R83119 


Homo sapiens 


Human cisplatin resistance protein. 


136 


96 


4554 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


161 


72 


4555 


O02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


91 


60 


4556 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


70 


4557 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 


101 


44 
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SEQ1D 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Idectit * 










Score 


y 








isoform 






4558 




Homo sapiens 


T>T> /\1 CCA 

PRO2550 


150 


OJ 


4559 


Wooo27 


Homo sapiens 


Secreted protein encoded by gene 94 


161 


56 








clone HPMBQ32. 






4560 


W48351 


Homo sapiens 


Human breast cancer related protein 


81 


64 








t>CRB2. 






4561 




Homo sapiens 


Human secreted protein, bbQ ID NU: 


1 Id 


DO 








ooiy. 






45oz 


003 1 14 


Homo sapiens 


Human secreted protein, bbQ ID NO. 


1 A/C 
1UO 










T1 AC • 

fly j. 






4563 


AF090931 


Homo sapiens 


PRO0483 


145 


73 


ACCA 

4564 


AJ242540 


Volvox carten £ 


hydroxyproline-rich glycoprotein DZ- 


O CO 

358 


11 






nagariensis 


HRGP 






4565 


AF130089 


Homo sapiens 


PRO2550 


175 


85 ! 


4566 


Z93244 


Homo sapiens 


bICl I6F5.2 (PUTATIVE RhoGAP 


125 


61 








(CDC42 GTPAse Activating Protein) 












LIKE protein) 






A c a 

4567 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


56 








A A'lO 

4478. 






4568 


AFI30089 


Homo sapiens 


PRO2550 


125 


75 


4569 


AF130089 


Homo sapiens 


PRO2550 


172 


82 


4570 


AF22596I 


Rattus 


RhoGEF glutamate transport modulator 


149 


78 






norvegicus 


GTRAP48 






4571 


W48351 


Homo sapiens 


Human breast cancer related protein 


143 


64 








BCRB2. 






4572 


G02640 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


70 








6721. 






4573 


G03786 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


91 


73 








7867. 






4574 


X92485 


Plasmodium 


pval 


111 


52 






vivax 








4575 


U93567 


Homo sapiens 


p40 


151 


71 


4576 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 


90 


57 








clone HNHF029. 






4577 


AC007508 


Arabidopsis 


F1K23.4 


143 


42 






thaliana 








4578 


G03807 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


145 


53 








7888. 






4579 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


175 


73 








7874. 






A<.Qf\ 




Homo sapiens 


Human secreted protein, ocQ id inu. 


1 An 
142 


/£> 








44iy. 








A 171 1AATA 


Homo sapiens 


JrKUZojZ 


1 IT 
1 1 / 


TA 
/If 




ATTAQAQIA 


Homo sapiens 




1 AO 

iuy 


CT 


A<Q1 

4 J 83 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


134 


TA 

/y 


A<QA 

43 84 


uUi 172 


Homo sapiens 


Human secreted protem, oJbQ ID NU. 


1 AQ 

i4y 


oz 








7253. 






4585 


AF1 18090 


Homo sapiens 


PRO2044 


175 


97 


A CO * 

4586 


AL080243 


Homo sapiens 


El A binding protein p300; match: 


144 


93 








proteins: Sw:Q09472 Sw:Q92793 












Sw:P45481 Wp:CE00571 












Wp.CJbzlll / JLnUt/13oo Wp.CliUoojo 












Wp.UfcU05/U Wp:CllUo4jJ lr.U44U/0 






*lj<S/ 


Y55U62 


Homo sapiens 


InterJeukin 1 converting enzyme 


1 A(\ 

14U 










hnmnlAoii^ C\CV*\ \ r>rA t&\n ^eaiience 






4588 


AJ131245 


Homo sapiens 


Sec24B protein 


131 


81 


4589 


W02105 


Homo sapiens 


Human L-asparaginase. 


559 


100 


4590 


L26953 


Homo sapiens 


chromosomal protein 


146 


71 


4591 


M16976 


Glycine max 


N-75 


125 


36 
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Accession No. 


Species 


Description 


Smith- 


% 


NO: 








tt *t icrm*llJ 


men ut 










Score 


y 


4592 


AFl 19851 


Homo sapiens 


PR01722 


118 


61 


4593 


AF090931 


Homo sapiens 


PRO0483 


99 


85 


4594 


M20789 


Homo sapiens 


alpha-1 type I collagen 


103 


41 


4595 


G00381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


72 








4462. 






4596 


AF130089 


Homo sapiens 


PRO2550 


124 


71 


4597 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


65 








4497. 






4598 


AFl 18082 


Homo sapiens 


PRO1902 


122 


64 


4599 


AK023034 


Homo sapiens 


unnamed protein product 


156 


100 


4600 


AFl 18082 


Homo sapiens 


PRO1902 


148 


62 


4601 


AFl 19855 


Homo sapiens 


PR01847 


130 


50 


4602 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


77 








4484. 






4603 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


65 








7075. 






4604 


Y02749 


Homo sapiens 


Human secreted protein encoded by 


135 


54 








gene 100 clone HNFIU96. 






4605 


AL160371 


Leishmania 


probable (hhv-6) ul 1 02, variant a 


123 


75 






major 


DNA, complete virion genome 






4606 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


74 








4658. 






4607 


V00672 


Pan troglodytes 


reading frame protein 4 


194 


72 


4608 


X86791 


Sus scrofa 


beta-globin 


156 


87 


4609 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


130 


88 








clone HPMBQ32. 






4610 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


146 


90 








SEQ ID NO: 163. 






4611 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


68 








7870. 






4612 


AB029309 


Homo sapiens 


Npw38-binding protein NpwBP 


120 


38 


4613 


W48351 


Homo sapiens 


Human breast cancer related protein 


148 


53 








BCRB2. 






4614 


AK024372 


Homo sapiens 


unnamed protein product 


124 


63 


4615 


AC005360 


Homo sapiens 


FAA 


175 


91 


4616 


D38113 


Pan troglodytes 


ATPase subunit 6 


204 


69 


4617 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 


122 


45 








oncoprotein. 






4618 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


81 








4478. 






4619 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


90 








4478. 






4620 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


50 








7124. 






4621 


S79410 


Mus musculus 


nuclear localization signal binding 


107 


45 








protein 






4622 


AF126163 


Homo sapiens 


HHLA3 protein 


124 


75 


4623 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


69 








5327. 






4624 


U38184 


Trypanosoma 


ATPase subunit 6 


102 


45 






cruzi 








4625 


U01849 


Trypanosoma 


ORF1 


97 


36 






brucei 








4626 


G03058 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


57 








7139. 






4627 


M33228 


Trypanosoma 


ATPase 6 


126 


37 






brucei 








4628 


AFl 18086 


Homo sapiens 


PR01992 


113 


60 


4629 


AF041330" 


Bodo saltans 


NADH dehydrogenase subunit 5 


149 


41 
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Seqid 

NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


4630 


X12544 


Homo sapiens 


3 HLA-DR B protein precursor (AA - 
29 to 267) 


108 


48 


4631 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


111 


46 


4632 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:J63. 


124 


60 


4633 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


151 


60 


4634 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


138 


100 


4635 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


135 


56 


4636 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


189 


100 


4637 


X67320 


Rattus 
norvegicus 


Hld-Histone 


122 


40 


4638 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


181 


76 


4639 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


96 


73 


4640 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


136 


54 


4641 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


123 


70 


4642 


AJ242540 


Volvox carteri £ 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


190 


56 


4643 


M29580 


Homo sapiens 


zinc finger protein 7 (ZFP7) 


103 


38 


4645 


U47741 


Homo sapiens 


CREB-binding protein 


207 


94 


4646 


AL035461 


Homo sapiens 


dJ967N21.5 (novel MCM2/3/5 family 
member) 


213 


80 


4647 


G03435 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7516. 


76 


60 


4648 


AF227948 


Homo sapiens 


HBV pX associated protein-8; XAP-8 


402 


86 


4649 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


143 


86 


4650 


U93574 


Homo sapiens 


putative pi 50 


121 


33 


4651 


A02739 


synthetic 
construct 


chloramphenicol acetyitransferase 


622 


99 


4652 


AF190449 


Mus musculus 


putative transcription factor ALF-4 


274 


83 


4653 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


108 


76 


4654 


X15187 


Homo sapiens 


precursor polypeptide (AA -2 1 to 782) 


763 


98 


4655 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


128 


91 


4656 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


169 


77 


4657 


BO 1372 


Homo sapiens 


Neuron-associated protein. 


135 


83 


4658 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 


60 


4659 


AC004850 


Homo sapiens 


vacuolar assembly protein VPS41 
homolog (S53) 


187 


92 


4660 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


181 


92 


4661 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


138 


58 


4662 


Y14479 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


154 


78 


4663 


M69180 


Homo sapiens 


nonmuscle myosin heavy chain-A 


106 


84 


4664 


AF130089 


Homo sapiens 


PRO2550 


117 • 


65 


4665 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


133 


91 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


4666 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


62 


4667 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein ; 
RHAla(AF078683) 


111 


78 


4668 


AF1 13695 


Homo sapiens 


PR01365 


82 


71 


4669 


AF003540 


Homo sapiens 


Krueppe] family zinc finger protein 


111 


64 


4670 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


134 


49 


4671 


AL109658 


Homo sapiens 


dJ776F14.1 (ortholog of mouse P47) 


110 


100 


4672 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


129 


62 


4673 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


138 


60 


4674 


X51591 


Homo sapiens 


beta-myosin heavy chain (1151 AA) 


442 


97 


4675 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


148 


76 


4676 


D86971 


Homo sapiens 


no similarities to reported gene 
products 


133 


100 


4677 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


142 


43 


4678 


AF044205 


Gossypium 
birsutum 


proline-rich protein precursor 


117 


39 


4679 


D38116 


Pan paniscus 


NADH dehydrogenase subunit 4 


243 


98 


4680 


AF 113685 


Homo sapiens 


PRO0974 


117 


74 


4681 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


193 


90 


4682 


L26163 


Mus musculus 


histone Hie 


111 


37 


4683 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


145 


78 


4684 


Y86248 


Homo sapiens 


.Human secreted protein HCHPF68, 
SEQIDN0.163. 


112 


53 


4685 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


110 


60 


4686 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


153 


72 


4687 


D38112 


Homo sapiens 


NADH 'dehydrogenase subunit 6 


165 


100 


4688 


W50922 


Homo sapiens 


Amino acid sequence of a heterogenous 
ribonucleotide protein. 


139 


100 


4689 


Y00281 


Homo sapiens 


precursor 


789 


100 


4690 


AF210651 


Homo sapiens 


NAG18 


142 


65 


4691 


L26953 


Homo sapiens 


chromosomal protein 


124 


72 


4692 


L26953 


Homo sapiens 


chromosomal protein 


124 


72 


4693 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


166 


80 


4694 


L27428 


Homo sapiens 


reverse transcriptase 


144 


87 


4695 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


165 


74 


4696 


W29474 


Homo sapiens 


Human histone HI isoform H1S-1. 


93 


41 


4697 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


126 


56 


4698 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


138 


62 


4699 


G00413 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4494. 


141 


54 


4700 


G03714 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7795. 


162 


70 


4701 


AF003540 


Homo sapiens 


Krueppel family zinc finger protein 


112 


88 


4702 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


159 


59 


4703 


AF241242 


Mus musculus 


T-cell-specific T-box transcription 
factor T-bet 


294 


75 


4704 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


152 


80 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 








SEQ ID NO: 163. 






4705 


Ar 03 8963 


Homo sapiens 


RMA helicase 


102 


70 


4706 


AF000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHA la (AF078683) 


148 


63 


4707 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


145 


70 


4708 


AF289099 


Maackia 
amurensis 


ENOD2f 


113 


42 


4709 


AF 130087 


Homo sapiens 


PR024] 1 


124 


57 


4710 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


188 


86 


4711 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 


86 


4712 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163, 


143 


78 


4713 


AF273133 


Ochotona 
macrotis 


NADH dehydrogenase subunit 4 


154 


76 


A-l 1 A 

4714 


G03453 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
7534. 


104 


48 


4/15 


Z18361 


Ovis aries 


trichohyalin 


120 


33 


4716 


AF130089 


Homo sapiens 


PRO2550 


152 


75 


4717 


AF119851 


Homo sapiens 


PRO 1722 


143 


60 


All O 

4718 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


145 


45 


4719 


K02401 


Homo sapiens 


chorionic somatomammotropin 


619 


96 


4720 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0.163. 


114 


52 


4721 


J03756 


Homo sapiens 


hGH-V2 


515 


74 


4722 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


194 


93 


4723 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


128 


56 


4724 


D49490 


Homo sapiens 


protein disulfide isomerase-related 
protein (PDIR) 


213 


55 


4725 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


138 


66 


4726 


K02401 


Homo sapiens 


chorionic somatomammotropin 


657 


98 


4727 


D87459 


Homo sapiens 


Similar to Volbox carteri extensin 
(S22697) 


162 


80 


4728 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


99 


34 


4729 


AF090944 


Homo sapiens 


PRO0663 


151 


63 


4730 


M32305 


Human 

papillomavirus 
type 47 


E1/E4 fusion protein 


106 


40 


4731 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


133 


57 


4732 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


76 


4733 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
bmdmg protein K10 (NID:g8148) 


114 


43 


4734 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


133 


72 


4735 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


113 


43 






Homo sapiens 


Human secreted protein, it) NO. 
6913. 


AO 

yy 


jy 


"4737 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


92 


47 


4738 


AP000616 


Oryza sativa 


similar to RJNG-H2 ringer protein 


133 


73 



254 



WO 01/64835 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 








DUA 1« i kT2f\HQ£Q1\ 

KriAia {ArU /oOoi ) 






4739 


AF044205 


Gossypiura 
hirsuturn 


proiine-rich protein precursor 


105 


42 




O/0OJ4 


i^rycrojagus 
cuniculus 


— . 

alpna-tropomyosin 


ISO 


517 

Of 






rionio Sapiens 


xiuman secreteu proiem, oc\i ll> vwj. 
7870. 


1S1 


79 




HU1 j/Z 


Homo sapiens 


Neuron-associated protein. 


1 SO 

1 J7 




4743 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

OCA TT\ XIA«1JC3 


122 


100 


4744 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
otAj IXJ NU.IOJ. 


176 


82 


4745 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
obvj ID JNU.loJ. 


154 


78 


4746 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

OCA VT\ XT A- 1 £1 


164 


80 


4747 


AP000616 


Oryza saliva 


similar to R1NG-H2 finger protein 

nil A In / A TJAOQ/^CJ^ 

KHAla (ArU/ooo3J 


120 


100 


4748 


AT 1<fl700 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


117 
11/ 


SI 


4749 


i 86248 


Homo sapiens 


Human secreted protein liurlrroo, 
cca rri ma- i /ci 


OAO 


01 

7J 


4750 


AF155232 


Pisum sativum 


extensin 


172 


40 


4751 


ArUUUolO 


Oryza sativa 


similar to miNO-MZ ringer proiem 
RHAla(AF078683) 


1 1T 


7A 
/O 


4752 


A t?i iaaoo 


Homo sapiens 


"DO AO CCO 


i r\A 


70 
/U 


4753 


W48351 


Homo sapiens 


Human breast cancer related protein 

npnno 


154 


50 


4754 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

9 1 SO 
810 J. 


167 


50 


4755 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
KHAla (ArU /ooo j) 


138 


100 


AKH 
4/00 




Homo sapiens 


Human secreted proiem, oc\i iu in*j. 


105 


DO 


4757 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


102 


45 


4758 


D38112 


Homo sapiens 


NADH dehydrogenase summit 5 


242 


94 


4/jV 


AUAyl/CAOC 

Ar04oyJ5 


Homo sapiens 


rLr lip nomoiog 


OAA 


0/1 


4760 


M76546 


Helianthus 
annuus 


hydroxypro line-rich protein 


165 


41 


4761 


Y30681 


Homo sapiens 


Splice variant ZAP-1B protein of the 
numan tumor suppressor gene zAr-i . 


115 


60 


/i7/jo 

4/0^ 


IZ/oOo 




Homo sapiens 


Human secreieci proiem encoueu oy 
gene No. 107. 


1 /o 




4/0j 




xiomo sapiens 


Unman CArrAtpH rn-ntofn QPA TT\ "KTfV 

Human secreiea proiem, ocy ul? invj. 
7124. 


l^d. 




*t /04 


KJOir 13 


Homo sapiens 


rseurai inreau proiem. 


1 ->o 


7fl 




OAYM/IO 
VJUU44Z 


Homo sapiens 


Unman cf*r*ra+e*rl r\rrtit>\n QPfl TPk MO* 

Human secreiea proiem, ocv£ *jj r*\J. 
4523. 


uy 


My 


H/OO 


X B0Z4i> 


xiomo sapiens 


iruman secreteu pruieixi n\^nirroo 9 
SEQ ID NO: 163. 






A7£7 




Mus mus cuius 


uansaiQoiase 




inn 


4768 


AF230279 


Caenorhabditis 
elegans 


SW13-like protein; PSA-1 


106 


38 


4760 
i t\jy 




f-Trvm (\ caniAnc 
riAJluv aapicDo 


Wiiman c^rrefpH nrnfpin SFO TT> NO* 

riUIIlaJl bCCfClCU piUlClll, OLy jJLf 11V-/. 

7882. 


1 Id 




4770 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


136 


67 


4771 


X92485 


Plasmodium 


pval 


85 


64 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






vivax 












nomo Sapiens 


jV*iuuvcu rcunopauiy pi oiclii yv^* 

terminal f \r\r\f* VP t-T RrX 


OS 




4773 


AC002398 


Homo sapiens 


F25965 3 


96 


39 






rxomo Sapiens 


u nmfln cprrAfpH nrnfpin HPHPFfiR 
nuuiau bcurcieu proiem nunrroo, 

SEQ ID NO: 163. 




7S 






jtiomo sapiens 


cy to Kin c recepior-iiKe niuieuuic y 


1 \0 


1 vv 


4776 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


74 


4777 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


75 


Anno. 
*f / to 


/Z 


jiomo sapiens 


xiuman secreieu protein, ojdv inv/. • 
7253. 


117 
lit 




AH1Q 
4/ 17 




Homo sapiens 


UDiquiious i r iv-moui protein i 
isoform 


mi 




Anon 
4/oU 


AxJUu/ooU 


Homo sapiens 


IV1A/\U4UU 


1 1 Q 

1 iy 


OO 


4/51 




Homo sapiens 


A-iinKea reonopainy protein \\^- 
terminal, cione AJiri.oc/ 




OO 




nni R7R 


nomo Sapiens 


Unman ca/*r»tA/1 nrntAin CCA TTk NO* 

xiuiiidii secreiea proiciii, ocy \xj v\\j. 
5909. 




84. 


H/OJ 


iVJLfcZ>J J*f 


xiumo sapiens 


unKiiown pro ic m 


^1 1 

Oil 




4784 


Y36203 


Homo sapiens 


Human secreted protein #75. 


350 


76 


4/0 J 


VAOyl A1 


Homo sapiens 


chorionic somatomammotropin 


1 

loo 


07 


4/00 


vjUUJoo 


Homo sapiens 


Human secreted protein, obv^ lu iNw" 
4449. 


1 ^7 


£LA 


4/0/ 


vjUUVo> 


Homo sapiens 


Human secreted proxem, oev^ inu. 

JUOO. 


1 ft7 
lUZ 


0/ 


4788 


M38451 


Homo sapiens 


growth hormone 


264 


94 


4/oy 


I O0Z4o 


Homo sapiens 


Human secretea protem HL/Hr r oo, 
SEQIDNO:163. 


l^O 


77 


4/yo 




Homo sapiens 


chorionic somatomammotropin 


7^A 




4791 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OJOO. 


127 


88 


4792 


Y36112 


Homo sapiens 


Extended human secreted protein 
sequence, E>tA£ iu xs*j. 47/. 


125 


78 


4793 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 

KnAla ^Arll /BOa?; 


131 


51 


AHQA 
4/V4 


cm\ia 1 1 


: 

Homo sapiens 


xiuman secretea protem, ocy wj isix 
7492. 




S7 


4 /Vj 


X41 &AQ< 

Miouyj 


Phaseolus 
vulgaris 


Vit»4rAVtfiMV\1inA i*is*li rrli/AAnrAtain 

nyaroxyproune-ricn giycoproiem 






4 /yo 




U/"*Tti/\ Pan lane 

nomo Sapiens 


lT LUI1 all pilUapnUiipaoC Z> tirrLZ. 


1j 1 


^0 
07 


4797 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


77 


4798 


AL132841 


Caenorhabditis 

vIGgdllb 


Y15E3A.3 


120 


51 


4799 


AL009I46 


Drosophila 
melanogaster 


alternatively spliced form 


79 


94 


4oUU 


5 /4ZZ 1 


xiomo sapiens 


Lis. iacior^cyiOivme uown-reguiaiing 
HLA class II 


10X 


R0 

Ov 


480] 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
JK-U protein ^nuciear ma-Dinaing 
protein) 


121 


100 


4802 


U28971 


Caenorhabditis 


similar to RD tandem repeat region of 
l^D nrAtf*in rnnclear rna-bindinff 

XVJU' UIUWIU ^UUvlvfll Ilia uuiuuig 

protein) 


94 


90 


4803 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


101 


90 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Identit 










Score 


y 


tOV/t 




tiorno sapiens 


nlAiirol fnrAa/i m*/\t'Atvi 

iNcuiai inreau proiem. 


1 J7 


71 


480S 


AU1UHJ 


I-T-ifHro en 

riyuTii sp. 


m t n 1 0 (tan 

nun i-cu iiagen 


19A 
izo 


/CO 

00 


4806 


Y19S44 


UAmn rein tpn C 

riomo sapiens 


j riJL»/\*L/i\. 15 proiem prcvuisur ^/v/\ - 


1 1 A 


30 








90 to 267i 






4807 


W88697 


rtomu sapiens 


oeui clcu proiem cqcuucu ojf gene y*-r 




R4 i 








elnne HPMR039 

ItlUJlw .til iVADv^J^- 






4808 


D38112 


nUiilU oaplvJld 


iNrtX/Ji Licuyui UgGliOoe oUUUll.il V/ 




04 


4809 




nojiiu SapiCIlS 


wiruiiiusonjai proiem 


00 
y\j 


Ou 


4810 


AP00061 6 


\JTySa SallVa 


similar 10 KiiNO-riz linger proiem 




7< 








"DTI A 1 5, / A TTA7R/'iScTl 






HO 1 1 




nomo sap 1 ens 


cnromosomai proiem 


191 
1Z1 


/;9 
oz 


4019 


V97Qfl7 


Homo sapiens 


Human secreted protein encoded by 


Iz4 


71 








gene ino. i iy„ 






4813 


f*037R7- 


— : 

Homo saypiens 


Human secreted protein, SEQ ID NO: 


/o 


4o 








/500. 






4ft 14 
HOI** 


11/yiOQCI 

WHojjI 


— — ; 

Homo sapiens 


Human breast cancer related protein 


1 AA 


78 








DtKDZ. 






tOlJ 


API ^70A1 
AJr 1 ,>ZyO 1 


— ; 

Homo sapiens 


chromatin-specific transcription 


1 

1 jZ 


yo 








elongation iacior rAui i^u KUa 












suouniL 






481/) 


TIQ3S6Q 

U7JJO7 


noiiiu sapiens 


puraiive p i du 




oZ | 


4817 




Homo sapiens 


jsjLAAoo / j proiem 


1ZD 


O/ 


HOlO 


vjUZ-jjZ 


riomo sapiens 


riuman secretea protem, &e\1 ID Nu. 


11 A 

1 14 


90 








001 j. 






Holy 


nAAO^/l 

UUUJ54 


Homo sapiens 


Human secretea protein, SEQ ID NO: 


129 


45 














4890 


AT l<Q7ft7 


— 

Trypanosoma 


probable similar to ring-h2 finger 


1 1/< 
I J4 


>f 0 






bmcei 


proiem rnaiE. 






4891 


W483^1 


riomo sapiens 


Human breast cancer related protein 


IZy 


ol 








DWkIUDZ. 






4822 


R96800 


numi? sapiens 


Unman n 1 ofi A/*\/tA_c A<^AtA/i "Pa/^f/M^ UOp 

nuiucui nisCTocyie-secreicu iacior nor. 


137 


ft3 


HOZj 


ijuznyu 


Homo sapiens 


Human secretea protein, dc-Q ID NO. 


lzo 


55 








AS71 






4894 


a vi Ti^s^ 
/\r j dooj 


: : 

Homo sapiens 


JrKUuy /4 


1 1 A 

1 lU 


54 


AST)* 


CJ0JDHZ 


Homo sapiens 


FAP protein 


1 JO 


TA 

70 


4896 


Yft3zl13 


riuman 


Uoo 


i in 
Jiu 


54 






Herpesvirus o 








4897 
toz / 


r?0 1707 

OUJ /7 / 


riomo sapiens 


numan secretea protein, oc,ki id nkj. 


Jjz 


7o 








787R 
/o/o. 






4898 


T I7rf£0Q 

LI / 00\/7 


nomo sapiens 


iiDosoniai uj proiein 




so 


4.R9Q 




Homo sapiens 


Human breast cancer related protein 


1 1A 

1 JO 


5 / 








DV^IV-DZ. 






4830 


APOOAIAI 


/vlaOluOpSlS 


putauve v^tiiNi^-J3//\jNjk>-Dinaing 


1/11 
141 


J 1 






tViiilismA 
u 1 ail an a 


px ULCiii liAC proiem 






4831 




xiomo sapiens 


riuman secretea protein, oiiv^ id ihkj. 


1/1*2 


f 1 








7£37 






4839 


T 104 180 


D off-lie 

IxalLUS 


uuo 


1 <*& 
105 


70 

ly 






n nn/poi on c 

nurvcgiiyus 








4833 


104011 


riomo sapiens 


IvlU lVlL^or ^riO L..J .1.3, EX* J.J.4.y, CA^ 




yl 








3 A 3^ 






483d 


V11307 
1 1 Joy 1 


nomo sapiens 


Annno acia sequence or proiem 


Oj4 


A"7 
9/ 








PPH33A 
x^rv\Jj^*r. 






483 S 


f303zl3fc 


llUlIiU sapiens 


numan sccreieu proiem, on^ uj inl/. 


1 A7 
1U/ 


5j 








7^10 






4836 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


464 


93 








5327. 






4837 


AF272833 


Homo sapiens 


misato 


413 


87 


4838 


AF130089 


Homo sapiens 


PRO2550 j 


132 


71 


_4839_j 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


54 
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WO 01/64835 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 








4414. 






4840 


G00437 


Homo saDtens 


Human secreted nrotein ^FO ID NO* 
4518. 


89 


80 


4841 


W77354 


Homo salens 


Human telomere repeat binding factor 
2 truncated nrotein 


106 


100 


4842 


AF090942 


Homo sapiens 


PRO0657 


160 


75 


4843 


AF090944 


Homo sanienc. 


PRO0663 


202 


80 


4844 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO- 163 


167 


81 


4845 


Y86248 


Homo sanien^ 


Human ^pprptftH nrofpin HCT-TPFriS 
SEO ID NO* 163 


166 


74 


4846 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID "NO* 163 


228 


97 


4847 


UI2690 


Homo sapiens 


cytochrome oxidase subunit II 


198 


95 


4848 


AB023485 


IVTiiQ nmicciiliic 

1.VAUO HlUovUiUo 


rran^rrirition "fiartor C* A 1 SOh 


118 


42 


4849 


Z70684 


Caenorhabditis 

plptxanc 


F28D1.8 


98 


42 


4850 


AF025467 


Caenorhabditis 

plp<*anc 


contains similarity to drosophila DNA- 

hinHino* nrntpin K 10 fivinVoX 1 450 


106 


45 


4851 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
cfo m no- 1 m 


179 


92 


4852 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


145 


82 


4853 


Y8624R 


nvjiuu aapiciio 


Unman qprrptftH nrotein HPHPFfiR 

SEQIDNO:163. 


123 


77 






Wrmnn cani^nc 
flUillU baJJlCllb 


Human CArrptftH nrotpin QPO \Y\ NO* 
nuiiiau scurcLou piuicui, ocy ixj nv. 

7143. 




j j 


4855 




U(\mf\ canipnc 


hpmatrvn/MpHf* nrr\(Tf>n itrvr Vina CP 
liviiiaivjJi/icuv piu^ciiiuji jvuuuv 


i in 


R1 


4856 


AC011708 


Arabidopsis 

thaliana 
maligna 


putative RING zinc finger protein 


121 


43 


4857 


M81321 


Macaca 

■fa cp toi il arlc 
loot* 'i/iuaj lo 


proline-rich protein 


126 


35 


"OJO 


IT49471 


1Y1U5 lUUbOUlUb 


vvii>vou.»/\iuriwi oynuruuic proiem 

homolog 


iftd 


Aft 


48*59 


I OOZiHO 


J1UII1U Sapid IS 


Human cprrpfpH nrntpin HPHPF6R 
XTuinan bcuivicu pi u it/ ill nviirroo, 

SEQID NO: 163. 


iyj 


/J 


4860 


X65165 


VnlvAV partpri 


pvtpncin 


74R 




4861 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


79 


4862 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO* 163 


185 


92 


4863 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO* 163 


142 


75 


4864 


AP000616 


Orv*«i ^ativa 

V-/1 jf tjoi jail v cl 


similar to R1NG-H2 fineer nrotein 
RHAla(AF078683) 


132 


100 


4865 


Y86248 


Homo saDiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


163 


63 


4866 


AB023229 


Homo sapiens 


KIAA1012 protein 


208 


92 


4867 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


144 


80 


4868 


L26953 


Homo sapiens 


chromosomal protein 


126 


70 


4869 


U63542 


HoThO QJinipnc 


FAP nrotein 


134 


40 


4870 


D86853 


Catharanthus 


extensin 


102 


48 


4871 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


152 


69 


4872 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


136 


55 


4873 


G0O403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


156 


84 



258 
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SEQ ID 


Accession No. 


Species 


Description 


Smitb- 


% 


NO: 








Watennao 


Identit 










Score 


y 


4R7/1 

HO /H 


VI 7R^7 


n urn an 


env protein 


ziy 


O/ 






cnuugcnuus 












retrovirus FC 








487S 

*to f J | 


AF717517 


nUIIlU SapidiS 


uuwiiaiauicrizcci uunc inaiiuw piutein 


Rl 
01 


IR 

JO 








BM041 






HO t\) 


T 7747R 


UAmA on r>i on r* 

XT t> ITlt) SapicuS 


reverse transcriptase 


in*; 


Ol 


4R77 


1V1UH / 7Z. 


Battue 
JtvallUb 


soiivai y pruLLnc-ncn pruicm 


IhO 


HO 






nurvcgicus 








HO/0 


A PA 1 A 1 4/1 

ATvIUIhh 


Homo sapiens 


neuronal inreaQ proiein /\jl//c-in ir 


oo 


JO 


4R7Q 


XZ/oOo 


tiomo sapiens 


numan secrexeu proxem encoded oy 


1UD 


77 








m*r*c± XTn 1 A*7 

gene xno. iu/. 






HooU 


KjoolO 


jtionio sapiens 


Human c-myc far upstream element 


ff7 
Of 


3y 








(FUSE) binding protein (FBP)variant 












fW\TTI PPT /tlnnp "2 1 1 A 

irum i dl ciune j i- iv. 






4RR1 

HOOl 


HA175R 


tiomo Sapiens 


iruman secrexeu proxem, oe\j \aj 


i 

l jU 


A7 








7110 






4RR7 

HOOZ 


AP17<;777 


urosopnua 


oAin i domain protein oivjuvi c,t\ 


o4 


on 

oy j 






mcianogasicr 






4RR1 


A P 1 7AAR7 


Homo Sapiens 


poiycomo j 


130 


*I7 


/1RSM 
hooh 




Homo sapiens 


Amino acid sequence of a human 


147 


OO 








gastric cancer antigen protein. 






HOOJ 


/"1A7QOA 


Homo sapiens 


riuman secreted protein, ocki wj in w. 


1 At 


/I 








7fini 

/wl. 






HOOO 




— . 

Caenorhabditis 


I? | /CLJ 1 1 O nana tmw)i«<4 

r 1 on 1 1 «z gene proauci 


yj 


1 AA 






eiegans 






>l DOT 

4887 


AF261918 


Homo sapiens 


dismtegrin metalioprotemase with 


262 


72 








uiiuLu uubpunuiu repeals 






ARRR 

HOOO 




oactenopnage 


-TV \ vaii component, i yyj 


01U 


1 AA 






InrnKfln 
iaiiiL/ua 








4889 


V9541 5 


T-J rvm r» cani'pnc 
nuiiiu dapiCUb 


nuiiiaii caiuium vnannci ovv** 


710 

ziy 


RQ 

oy 






2/CRAC-l 






489A 


RA1177 


numu sapiens 


INCUTUQ-abbUClaieU prUlcUl. 


1 J*r 


RA 
ov 


4891 

T07 1 


yvvj*-r fix 


Valine cnlltic 

VJCIIIUO gall US 


pollnarpn^QlnliQ.l t\/rw» fY 
C U 1 1 age 1 l a ipi la J lypc -LA. 


1 17 
1 1 z 


41 
H 1 




074ft 1 
rvu zh u 1 


HAmA cam ATI c 


f*Yt/^FirMii/* crttrmtrtTYi arn in rttrrtt\iri 

vuuj luuii/ suuiaiuniaiiiuiuu upiu 


J JO 


R7 
of 


4891 


AF010144 


numu sapiens 


uvUiUiiai uucaupiuiciu t\xj / u~in ir^ 


(OR 

1UO 




4894 


(101714 


nuiiiu sapiens 


Hitman CArrptArl nrntpin QPO IT) MO' 

nun id 1 1 scureieu pruicin } ony ll/ inw. 


110 

1 J7 


oz 








779S 






489 S 




numu sapiens 


viuumusumdi pruLcin 


01 


Rl 
ol 


HOi/O 


A T77774.A 


Homo sapiens 


i\jrr> 1 1 o l Deta protein 


1 /V? 

IUZ 


00 


4R97 


R01A7R 


numu sapiens 


numan pnuspnoupase z nrrLZ. 


14*^ 

iHJ 


R7 
o / 


4RQR 

Hoy© 


AI 1507R7 

/\L»jjy /oz 


Trypanosoma 


possiuie ^nnv-oj ui iuz, variant a ona, 


131 


Of 






brucei 


wuiupicic viriun gciiuuic. 






Hoyy 


nA75*57 


Homo sapiens 


Human secreted protein, SEQ ID NOr 


1 


/ZA 

OH 








^ii 






4Qaa 


f TOA1 AA 


— _ — 

Homo sapiens 


iroquois-class homeodomain protein 


1 AO 


Hi 
















UZyy^O 


numu sapiens 


" I 'Hi C IT An A 1C C1TY11IUP t/\ tl"A 

l nib gene is similar iu uc 


17A 
1ZU 


71 i 
Id 








ATM*A0"f»Tlf»r5ffil S47^ 






4902 


fiooii7 


numu sapiens 


Utrnifln <ex*Tf*ip'A nrnt^in QPO TFi MO* 
numan ouciclcu pruicuij ocy iu 


1 1^ 

1 ID 


Id 

fVJ 








4413. 






4901 


I oy 100 


numu sapiens 


a mature numan iN~ai/etyigiycosaroinyi 


1 A7 


R7 
5Z 














4904 


AP1 19900 

rvr 1 1 77UU 


Homo saniens 


PR07R77 


1 57 


57 


4905 


iro?<£5 


numu sapiens 


puianvc pi du 


1/1 £ 
l*+0 


<^ 
jj 


4906 


GO 0401 


Homo saniens 


Hirman <»ei^rptftH nrntpin SFO IF) NO* 


170 


67 








4484. 






4907 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


160 


76 








encoded by gene 17. 






4908 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


136 


55 



259 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








2>CKl iXJ NO. 103. 






4909 


AF 155232 


a i5urn sativum 


extensin 


QQ 

9o 


/*«> 
43 


4910 


U 15647 


Mus musculus 


reverse transcriptase 


196 


50 


491 1 


G 02994 


Homo sapiens 


Human secreted protein, oEQ ID NO: 

7fl75 
/v / 0. 


1 A A 

144 


56 


4912 


AF225419 


numo sapiens 


TJQr* Apr: 


Z45 


ol 


4913 




raeiianuius 
annuus 


hydroxyproline-rich protein 


1 1 A 

1 10 


47 


4914 


AI 1 173X2 


numo sapiens 


ojooiJLZZ.z (novel protein ) 


111 

312 


100 


4915 


AB037814 


Homo sapiens 


KIAA 1 393 protein 


192 


68 




i-ZJo 1 1Z 


Homo sapiens 


NADH dehydrogenase subunit 4 


169 


89 


4917 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SbQ ID NO: 163. 


105 


90 


4918 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 
KiiAla (Ar 078683) 


116 


86 


4910 


I oOZ*fO 


-— ; 

Homo sapiens 


Human secreted protein HCHPF68, 
obvj tlJ NO: 163. 


132 


100 


4920 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
xlKOr 


183 


65 


4991 


AF000Q44 


Homo sapiens 


FKOU663 


1/4 1 

141 


57 


497? 


ArU7U7J 1 


Homo sapiens 


jrXO04d3 


i c r\ 

159 


55 






Homo sapiens 


Human secreted protein HCHJProo, 

ollv^ 1L> INU. 103. 


128 


81 


4924 


I TOO ^00 


xiorao sapiens 


cytochrome b 


276 


94 


4925 


AB002377 


Homo sapiens 


KIAA0379 protein 


304 • 


98 






Homo sapiens 


unknown protein 


201 


62 


4927 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


65 


4Q7SI 


ATM "2AAQA 


Homo sapiens 


PRO2550 


116 


82 


4929 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


94 


41 


4930 


W75159 


Homo sapiens 


Human secreted protein encoded by 
gene 45 clone HTTDS54. 


294 


82 


4931 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


96 


74 


4Q^O 


VJUZ40U 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6541. 


101 


57 




X UZO / 1 


Homo sapiens 


Human secreted protein encoded by 
gene zz clone hmoJ w i 5. 


142 


70 


4034 


AF1 300K7 
r\T iJUUtW 


— : 

Homo sapiens 


r KOZ4 1 1 


137 


80 


4935 


G00637 


Homo sapiens 
-— 


Human secreted protein, SEQ ID NO: 
4/io. 


132 


56 


~7JU 


AR03?fiO4 
-rYXj Uj^Ovtt 


Mus musculus 


iviiwi (piwij 


313 


CA 

50 


4917 


"X~74Qx7 
-/V /'ryot 


ri Ohio sapiens 


2 , -5 f oligoadenyiate binding protein 


1 >f A 
144 


A^ 

96 


4938 


AF0546SX 


xi Ohio sapiens 


Kappa l immunogJODUim ugni cnain 
variable region 


iZO 


on 


4939 


xvyuouv/ 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


11/1 

134 


67 


4940 


U22230 


Felis catus 


ribosomal protein S17 


118 


100 


4Q41 


/\x3uzoyjo 


Homo sapiens 


KiAAlOJi protein 


COT 

527 ; 


88 


4942 


G00429 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4510. 


281 


86 


4943 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


152 


71 






— 

Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1 037. 


108 


95 


4945 


G02490 


Homo saoien^ 


Human secreted nrotpin ^FO ID NO* 
6571. 


170 




4946 


AF090895 


Homo sapiens 


PRO0117 


125 


65 


4947 


D13866 


Homo sapiens 


alpha-catenin 


575 


100 


4948 


L01664 


Homo sapiens 


lysophospholipase 


166 


96 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 


4949 


AF1 18085 


Homo sapiens 


PR01975 


572 


96 


4950 


AK024455 


Homo sapiens 


FLJ00047 protein 


95 


81 


4951 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


113 


58 


4952 


AB0U142 


Homo sapiens 


KIAA0570 protein 


120 


88 


4953 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


108 


39 


4954 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


175 


69 


4955 


AK024455 


Homo sapiens 


FU00047 protein 


119 


64 


4956 


AF1 16715 


Homo sapiens 


PR02829 


142 


75 


4957 


M31520 


Homo sapiens 


■ribosomal protein S24 


158 


100 


4958 


D38113 


Pan troglodytes 


ATPase subunit 8 


98 


64 


4959 


AF130089 


Homo sapiens 


PRO2550 


156 


75 


4960 


AP000616 


Oxyza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


136 


95 


4961 


S61950 


Oryctolagus 
cuniculus 


alpha 2(1) procollagen, alpha 2(1) {C- 
terminal} 


159 


93 


4962 


AF070540 


Homo sapiens 


putative nuclear protein 


513 


93 


4963 


S61950 


Oryctolagus 
cuniculus 


alpha 2(1) procollagen, alpha 2(1) {C- 
terminal} 


159 


93 


4964 


AF146191 


Homo sapiens 


FRG1 


343 


92 


4965 


Y16624 


Homo sapiens 


Human phosphoprotein 32 (pp32) 
protein sequence. 


114 


100 


4966 


X67813 


Canis familiaris 


signal recognition particle,72 kDa 
subunit 


127 


100 


4967 


S79410 


Mus m us cuius 


nuclear localization signal binding 
protein 


127 


63 


4968 


U83303 


Homo sapiens 


Iine-1 reverse transcriptase 


117 


69 


4969 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


101 


80 


4970 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


173 


69 


4971 


D38112 


Homo sapiens 


ATPase subunit 6 


112 


91 


4972 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


209 


97 


4973 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


162 


80 


4974 


AL050399 


Arabidopsis 
thaliana 


putative proiine-rich protein 


75 


91 


4975 


LI 1668 


Bos taurus 


cyclophilin-40 


134 


100 


4976 


M55169 


Homo sapiens 


tripeptidyl peptidase II 


385 


90 


4977 


AF090944 


Homo sapiens 


PRO0663 


154 


88 


4978 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


102 


62 


4979 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


128 


70 


4980 


G00721 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4802. 


104 


95 


4981 


R54656 


Homo sapiens 


Prostaglandin 12 production stimulating 
protein B. 


101 


100 


4982 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


124 


38 


4983 


AF273053 


Homo sapiens 


CTCL tumor antigen se89-l 


281 


96 


4984 


M15530 


Homo sapiens ■ 


B-cell growth factor 


117 


[58 


4985 


D13641 


Homo sapiens 


mitochondrial outer membrane protein 
19 


534 


94 


4986 


L11316 


Mus musculus 


ect2 


528 


92 


4987 


X51755 


Homo sapiens 


open reading frame (458 AA) 


108 


90 


4988 


M61877 


Homo sapiens 


erythroid alpha spectrin 


129 


87 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


A AOQ 


A57316 


Saccharomyces 


carboxypeptidase s 


134 


50 






cerevisiae 












Homo sapiens 


Human secreted protein, obQ ID NU: 


151 


78 








A A 1 o 








YXa?4R 


no mo sapiens 


riuman secretea protein xiurirroo, 


1 T2 
1 15 


61 








oca jn XI A- 1 £/X 








A/TC/lftlQ 


i> os laiiius 


beta-adrenergic receptor kinase 


172 


t AA 

100 






Homo sapiens 


chromosomal protein 


IOC 

135 


86 


/1QO/1 


YO^T^ 

Ay / O /5 


Homo sapiens 


plaKopniun 2b 


124 


63 


45#iO 


I Jt) J 1Z 


Homo sapiens 


Extended human secreted protein 


153 


77 








sequence, ocy lU lN\J. 4y/. 






4^?0 


A 171 1AAOQ 

/Vxuuuoy 


Homo sapiens 


PRO2550 


175 1 


48 


/JQD7 


O0z50 i 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


95 


68 








6582. 






AQQQ 

4yyo 


VO/C0 1 A 
Yoozl4 


Homo sapiens 


Nuclear transport protein clone hfb341 


312 


98 








protein sequence. 








G03681 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


95 








7762. 






->UUU 


At 1 5296 1 


Homo sapiens 


chromatin-specific transcription 


127 


96 








elongation factor FACT 140 kDa 












subunit 






5UU1 


U /0063 


Homo sapiens 


acid ceramidase 


315 


96 




A 1?AA 1 1ll 

ArU41373 


Rattus 


clathrin assembly protein short form 


163 


63 






norvegicus 








5003 


AK0OI84I 


Homo sapiens 


unnamed protein product 


468 


83 


5U04 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


61 








A At A 

4414. 






5005 


S79410 


Mus musculus 


nuclear localization signal binding 


116 


47 








protein 






jUOD 


003133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


55 








7214. 






jUO / 


vjUU33o 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


76 








A A 1 A 

44 19. 






->UUo 


£15005 


Homo sapiens 


CENP-E 


281 


84 


f AAA 

5009 


a F%« aaaa 

AF130089 


Homo sapiens 


PRO2550 


122 


75 


5010 


AL049698 


Homo sapiens 


dJ470B24. 1 . 1 (myeloid/lymphoid or 


77 


33 








mixed-lineage leukemia (trithorax 












(Drosophila) homolog); translocated to, 












A SAT? AT\ fZm^A?^ 1 W 

4 (AF-6) (isoform l)) 






jUI 1 


U02382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


62 








6463. 






5Utz 


AL023780 


Schizosaccharom 


DNA bmding protem 


145 


36 






yces pombe 










T> 1 *KClT 

KI3550 


Homo sapiens 


Protein encoded downstream of hhc_M 


93 


46 








oncoprotein. 








UUJoUO 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


43 








TOO 1 

7881. 






Jul J 


A tja 1 A 1 A A 


Homo sapiens 


neuronal thread protein AD7c-NTP 


1 A A 

104 


55 




TT1 <£LAH 


Mus musculus 


expressed in select embryonal 


1 1 A 

119 


35 








carcinoma cells and testicular cells; 












similar to Swiss-Prot Accession 












iNumuer ri izou 








D1U545 


— — : — 

Homo sapiens 


Human aspartate protease psn-like4 


/ i A 

649 


AO 

98 








protein. 






5018 


AF089745 


Homo saniens 


FTC506-hindiTio; nrotein 






5019 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


634 


97 


5020 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


603 


92 


5021 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


623 


94 


5022 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


671 


100 
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SEQ ID 


Accession No. 


S pedes 


Description 


Smith- 


% 


NO: 






Waterman 


Idcntit 








Score 


y 


5023 I 


Alt 130089 


Homo sapiens 




ivy 


71 


5024 


A DS\1 *7 A CO 

ABO 13452 


Homo sapiens 


A lraseii 


£.\JO 


77 


CAO.C 

5025 


R27654 


Homo sapiens 


U.nMHM JinlltHtm />tlAHMa1 770Q0/1 H 

Human calcium cnannei z /you/ io. 


1 911 

lOJ 


100 


5026 


Y36156 


Homo sapiens 


Human secreted protein ffZo. 


11/1 
1 14 


OH 


5027 


G03714 


Homo sapiens 


Human secreted protein, bbv^ lu isu. 




Id 








/ fyj. 






5028 


G00577 


.. . . 
Homo sapiens 


Human secreted protein, bbv^ iv phj. 


1 AO 


5Q 








40JO. 






5029 


OO wtOOO 

S74728 


Homo sapiens 


anucjuiun— zog turgor protein nomoiog 






5030 


L41498 


Homo sapiens 


kmgation factor 1 -alpha 1 




o/ 


5031 


AB040672 


Homo sapiens 


UDP-OaLNAc: polypeptide in- 




70 








acetyigaiactosaininyitraiisierase 






5032 


A TOO *%t\A O 

AJ223948 


Homo sapiens 


KJNA neiicase 


J /I 


OA 


5033 


AF021935 


Rattus 


mytonlc dystrophy kinase-reiated 


o/u 


07 






norvegicus 


v^uC4Z~ DuiQing Kinase 






5034 


AF 150755 


Mus musculus 


microtubule-actin crosslink ing factor 


co/r 


RO 


5035 


AF 1 50755 


Mus musculus 


microtubule-actin crosslinking factor 


£ft9 


oy 


5036 


AF081484 


Homo sapiens 


alpha-tubulin isofonn 1 


CQQ 


OQ 
07 


5037 


U09823 


Oryctolagus 


elongation factor 1 alpha 


/Oo 


0£ 






cuniculus 









5038 


AE003621 


Drosophila ; 


CGI 33 84 gene product 


T7Q 








melanogaster 








5039 


G00379 


Homo sapiens 


Human secreted protem, abvj ID WU. 


1U/ 


Do 








4460. 






5040 


AF047469 


Homo sapiens 


arsenite translocating a. i rase 






5041 


I'll 

G03133 


Homo sapiens 


Human secreted protein, bbv^ id riL>. 


1U7 


41 








TOI il 

7214. 






5042 


AJ007798 


Homo sapiens 


stromal antigen 3, \p i auj j 


Zoo 


100 


5043 


L26953 


Homo sapiens 


chromosomal protein 


lZ*r 


fn 
o/ 


5044 


AB005047 


Homo sapiens 


otto j i__ 

SH3 binding protem 


1UJ 


oo 


5045 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1/11 


/o 








7124. 






5046 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


164 


o3 








7519. 






5047 


W48351 


Homo sapiens 


Human breast cancer related protein 


1 O 1 

121 


03 








T > »/T> YiO 






5048 


G03793 


Homo sapiens 


Human secreted protem, bbvj ID nu. 












7R74 

/O/H. 






5049 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 


ion 
1ZU 


50 








clone Hwyljoi. 






5050 


Z25535 


Homo sapiens 


nuclear pore complex protein hnupl53 


1 O 


8ft 
oo 


5051 


J03176 


Bradyrhizobium 


cytochrome b/cl precursor 


07 


15 






japonicum 








5052 


X85995 


Homo sapiens 


iminunogiODUiin ugni cndin 






5053 


G02994 


Homo sapiens 


Human secreted protein, on\i iv vi\j. 


1 


7r» 








/0/j. 






5054 


B0 1372 


Homo sapiens 


Neuron-associated protem. 


10Q 
ivy 


SO 

Ovi 


5055 


AJ242540 


Volvox carten r. 


hydroxyproline-rich glycoprotein DZ- 


9AO 

ZUy 


54 






nagariensis 


HKUr 






5056 


G03371 


Homo sapiens 


Human secreted protem, onv^ jlu inlj. 


110 
1 1U 


51 

J 1 






* 








5057 


G03790 


Homo sapiens 


Human secreted protem, ony iu ino. 




J/ 








75/1. 






5058 


X92474 


Homo sapiens 


cn-iuu 




100 


JUJ7 


i 9749B 


nviiiu iy<ipiClU> 


rr»v<»rc/> trancr^rintTXK* 


264 


75 


5060 


S79410 


Mus musculus 


nuclear localization signal binding 


119 


44 








protein 






5061 


L25616 


Homo sapiens 


CGI protein'' 


119 


77 


5062 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


62 



263 



seq n> 

NO: 


Accession No. 


Species 


Description 


Smith- 

Watciman 

Score 


Ideotit 

y 








7871. 






5063 


Y02671 


Homo sanien<i 


Hnm an cp»r"r*»t<»rl r»rnt*»in &nt*nt^f*t\ Yi\f 
1 ltuiioii ov^ivicu jpjuivJJu cuwv/uuvi \jy 

gene 22 clone HMSJW18. 


164. 

1 Krr 


so 


5064 


AF1 18082 


Homo sapiens 


PRO 1902 


175 


51 


5065 


Y14482 


Homo sapiens 


Frapmpnr of human Qecrpted nrotf»in 
encoded bv eene 17 


84 


51 


5066 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla (AF078683^ 


108 


100 


5067 


AF155232 


Pisum sativum 


extensin 


137 


45 


5068 


X63005 


Mlic mUSdl1ll<5 

i'lUJ uiugvUIUJ 


nml in P-ri^h nrrttein 

piVJIIIlV-I lull JJlULCUl 




is 


5069 


AJ242540 


Volvox carteri f. 
napariensis 


hydroxyproiine-rich glycoprotein DZ- 
HRGP 


307 


56 


5070 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO- 161 


173 


89 


5071 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7K82 

/ OOJJ. 


153 


68 


5072 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SFO ID NO- 161 


164 


73 


5073 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 

nunc niN ixr \jj.y. 


109 


73 


5074 


G03172 


Homo Qant**n«: 


7253. 


191 




5075 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


113 


72 


5076 


U 12690 


Homo saniens 


cvtnchrnmft ovioJiti* cnniinif" TI 


l si 
i j t 


ftft 
oo 


5077 


AF090944 


Homo sapiens 


PRO0663 


138 


78 


5078 


Y02671 




TjTi irn can c^PtY*tp^l nrntpin f*r\r>f\f\f*f\ K\/ 

ouiiicui scurcicu protein cncuucu uy 
gene 22 clone HMSJW18. 


1 so 


7/L 


5079 


X97675 


Homo «5an?pnQ 


nlatfnnViili'n 7K 
piaiwpnilJll Z,\j 


170 
i fy 


yj 


5080 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4475? 
■ « / 0. 


123 


76 


5081 


AF240630 


Mus musculus 


IQ motif containing GTPase activating 

pruicui 1 


176 


59 


5082 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

OJ-»V^ WL/ 1UJ . 


100 


54 


5083 


S79410 


Mus musculus 


nuclear localization signal binding 


99 


59 


5084 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


101 


57 


5085 


Y27907 


Homo sapiens 


Human secreted protein encoded by 


118 


77 


5086 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


93 


69 


5087 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


88 


80 


5088 


AF000298 


Caenorhabditis 
departs 


weak similarity to coilagens; glycine- 
arid oroKne-rich 

VU.1VJ |/1 UUllv X Ivll 


112 


57 


5089 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


87 


61 


5090 


G04078 


Homo ^anien*; 


Human secreted orotein SRO ID NO- 
8159. 


74 


61 


5091 


Y02999 




T^rn om pn t rtf hiirrisiri CAf*tf*f pH TMV%t*»in 

f IflglUCUL ilULllCUl oCl/lClCli piUlClil 

encoded by gene 121. 


ri 


7R 


5092 


AF065484 


Homo canipnc 


cnrrino npvin 1 A 

5U1 1X1 JlGAJAl 1 A 


17? 


o 1 


5093 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0.163. 


139 


90 


5094 


G02532 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6613. 


164 


78 


5095 


U02313 


Mus musculus 


protein kinase 


147 


84 



264 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


S0Q6 


R17SS6 

IVl>> J JO 


mo mo sapiens 


r roHsiii cficoucu uuwiii>u caixi oi jonc ivi 

v/li l/VJ JJ1 V IC Jul • 


1 90 

izy 


A9 
0/ 


5097 


U64849 


r Caenorhabditis 

f*lf* era n<z 


Contains similarity to Pfam domain: 
PF00646 iF-box"* Score=98 7 F- 
vahie=4.3e-05, N=l 


197 


39 


S098 

JU70 


AF006010 


numo sapieiis 


pi VgCoUll UlUUUvu JJiUlGlll 


174 
l /*t 


oo 


5099 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

74S9 


119 


55 


5100 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7194 


132 


82 


5101 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

44R4 


159 


77 


5102 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 

terminal r Innp "VT7T-1 Rr*\ 


99 


70 


5103 


J02459 


bacteriophage 


J (tailmost specificity; 1 132) 


485 


96 






Dactenopnage 
lambda 


ri ^laii component, ojj j 


OZJ 


Ha 


Sift 1 ? 


AFfi9S4A7 


v^acnomaouius 
elegans 


contains sumiaxiiy to urosopniia uri/\- 
binding protein K10 (NID:g8 148) 


1UO 


AO 


SI 06 


AF17ftfiRQ 

rVT 1JUU07 


xiomo sapiens 




191 
1Z1 


7S 


5107 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
SI so 

O 1 J7. 


135 


$1 


J 1VJO 


PQ991Q 

r^i^o 17 


riomo sapiens 
(human) 


v^Ki proiem. 


197 


&£ 
oo 


S10Q 




no mo sapiens 


xiunian secreiea proiem, ocy uj vhsj. 


Oj 


78 
to 


5110 


U41751 


Mus musculus 


EI24 


560 


83 


Sill 


vJVj / l*f 


no mo sapiens 


niuQaii secrcieu proiem, ojdv^ llj in w. 
7795. 


1 Sft 


oy 


S1 19 


AF1 IftftftQ 


nomo sapiens 


pt>n9ssn 




oo 


5113 


G03084 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

71/*S 


160 


72 


5114 


AF052831 


Trypanosoma 
cruzi 


unknown 


107 


70 


5115 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna,* 
complete virion genome. 


128 


68 


5116 


X55684 


Lycopersicon 
csvuieuiuin 


extensin (class I) 


60 


29 


5117 


W80406 


Homo sapiens 


A secreted protein encoded by clone 

Ah Aft *3 


124 


64 


5118 


nruvuuiu 


KJlyjuei saliva 


cim ilor +r\ RTMf~l_W7 finer at nrrttAtn 

similar 10 x\JiN\j-fx<& nuger proiem 
RHAla(AF078683) 


1 1 s 
1 1 j 


oo 


5119 


G03172 


l XKJlklKJ Od|Jl.&lL3 


Human QprrpfpH nrntpin SPO IF) NO* 

7253. 


112 


7Q 


5120 


AUJ / lO 


Pmmic HiiIpic 
jriuiiuo uuiuxa 


Avtpncin 
CAUCUS ill 


Q4 

.7*+ 


X\ 
jj 


5121 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 
RFT A 1 a f AF07fc68^ 


132 


100 


5122 


AJ007714 


Homo sapiens 


lysine-ketoglutarate reductase 

/ iiatLUdl upillC UCliy ill UgCUdaC 


154 


96 


5123 


AJ007714 




I\#cii*ip_ 1/'p+rkty1iiteiT**it - *> taHiipI^qp 
lyoixic-jvCLugjiuiai aic iCviu*wiaoc 

/saccharopine dehydrogenase 


1S8 

1 JO 


oo 


5124 


AT 7SQ7R9 


J lypallVbVIIla 

brucei 


prooooie sinuiar to ring-iiz linger 
protein rhala. 


IV7 


44 


5125 


AF 13 0089 


Homo 4 Mr>ien<i 


PRO2550 


112 


46 


5126 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


129 


60 


5127 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


99 


68 



265 



WO 01/64835 PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 




• 




Waterman 


Identit 










Score 


y 






riomo sapiens 


Human secreted protein, obQ ID NU: 


143 


57 








AAQA 

4454. 






S19Q 


WRRfi97 


— : 

Homo sapiens 


Secreted protein encoded by gene 94 


t A A 

14U 


55 








cione xiriviDv^JZ. 






JlJv 


T094.SQ 


udctenopnage 


xi ^ian component, ojjj 




1UU 






lambda 








sill 


nnni9 r 


T-T/YtV* /V rani a«t o 

riomo sapiens 


xiuman secreteo protein, otsiyj uj wu. 


1 11 

Izi 


"in 

/o 








A a tad 
44Uy. 










xi 01110 Sapiens 


uiNA lopoisom erase 11 jtiv^ j.yy. i^) 


lt>3 


96 


sm 


AT 119R41 


Caenorhabditis 


VI CC1 A 1 

Y15E3A.3 


147 


85 






cicgdns 










n0ftR97 


j&scnericnia coil 


Serine/Threonme protein phosphatase 


329 


46 








(JbC i.l.i.lo;. 






S11S 




; 

Homo sapiens 


ximnan secreted protein, ob^ lu p*u. 


15y 


62 






_ 










vjUU4U / 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


161 


91 








44oo. 






st n 


VJU4U/0 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


96 














S11R 
Ji JO 


VJVOU4J 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


67 








T1 *)A 

/124. 






S13Q 

juy 


uUj /y / 


■— . — : 

Homo sapiens 


Human secreted protein, SbQ ID NO: 


93 


31 








7o7o. 






SI 4ft 




— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


46 














S141 


VA9671 


xiomo sapiens 


Human secreted protein encoded by 


loo 


67 








(tana Mrmo TJV/IC ni/1 O 

gene zz cione rixvioj wio. 






SI 49 


A FOODS'* S 


no mo sapiens 


flD TTO 1 lira tw/\tam 

vJtvrz-iiKc protein 


lOo 


50 




AF9S07Q7 

Ar/jy /yz 


Homo sapiens 


bWHLDeta protein 


CO A 

684 


no 

98 






Homo sapiens 


Human secreted protein #75. 


130 


63 


S14S 


A 171 7AACO 


Homo sapiens 


r*T> C CA 

rKUzoSO 


151 


76 




ArUUUOlO 


Oryza sativa 


similar to RING-Hz ringer protein 


128 


52 








RHAla (AF078683) 






SI 4*7 


\jSjo 1 IL 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


85 








TO CI 

7zoJ. 






SldR 
Jl4o 


AT 1*C<y7ttO 


— . — 

Trypanosoma 


probable similar to ring-h2 finger 


1A1 

101 


✓•1-7 
67 






bmcei 


protein rhal a. 








W4o\2j1 


Homo sapiens 


Human breast cancer related protein 


1 to 

118 


56 














51S0 


AFI^OOSI 


xiomo sapiens 


PDHOROR 
Irrv^JOoyo 


RA 


/z 


S1S1 


R070S7 


xiomo sapiens 


omaiier nepaiocenuiar oncoprotein 


lOD 


03 








^nncfri^ gent/ prcOUUCl. 






5152 


X56S32 


T-T/itti f\ canipnc 


miicrlp cn^nnr ATirkiocA 
lUUaLflC 3UCCU.ll* CUUlOoC 


S71 


on 
yu 






xiomo sapiens 


nepaiocyie nucieor lacior *f 


j44 


OT 

yi 


51 S4 


TT19Q10 


i vi lib muscuius 


auenyiyi cyclase type v u 


1 C& 
1 jO 


IJ 


si ss 


flOl 1 79 


xiomo sapiens 


Unrnon carrAto^ nrAfpin OCA Tr\ XT A. 

xiuman secreted protein, ocy iu invj. 


01 

yi 


/O 








79 si 

/ZJJ. 








VJVnU 7 X 


xiomo sapiens 


Unman oa/*rofan nrAtein OCA TTl XT fi« 

xiuman secreica protein, jny xli nu. 


i ii 


SA 








R1 SI 
0 1 j j . 






5157 


Y08387 


±X\JXH\J oajjlvllo 


mil- nHnntin— rAlfitpH rvrAtpin 9 
lull auaLFiiiJ j ciatwU |ji uiciu x. 


9S9 


R£ 


5158 


tt13SS/i 


XIOUIU bdpiCiLo 


rruLcm encuueu uownaircdiii oi iiiiu ;vi 


19/\ 

1ZO 


00 














51SQ 


AFO0O049 


LJAnt a c* am an a 

xiomo sapiens 


pp AA/CC7 
Jrlv\JU0 J / 


uy 


1 1 




AF000049 


Homo sapiens 


DT>AA#C<T 
r JKJJUO J / 


lOA 

1ZU 


/U 


5161 


D67066 


Bos taums 


N-WASP 


135 


42 


5162 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


73 








7124. 






5163 


AF044496 


Canis familiaris 


acidic ribosomal phosphoprotein PO 


112 


78 


5164 


W48351 


Homo sapiens 


Human breast cancer related protein 


106 


58 



266 



WO 01/64835 



PCT/US01/04927 



SEQ LD 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
v 








BCRB2. 






5165 


AJ250095 


Bordetella 
bronchiseptica 


pertactin (P.68) 


119 


64 


5166 


U96411 


Mus musculus 


otogelin; MLEMP 


609 


90 


5167 


AF I 16238 


Homo sapiens 


pseudouridine synthase 1 


237 


87 


5168 


G0O376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


151 


78 


5169 


AL050342 


Homo sapiens 


(U655K7.1 (novel protein) 


263 


100 


5170 


AK024455 


Homo sapiens 


FLJ00047 protein 


177 


72 


5171 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


94 


100 


5172 


L38908 


Nicotiana 
tabacum 


extensin 


101 


47 


5173 


AF055904 


Myxococcus 
x an thus 


unknown 


107 


36 


5174 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


143 


59 


5175 


Y00337 


Homo sapiens 


Human secreted protein encoded by 
gene 81. 


123 


86 


5176 


AJ010957 


Hippopotamus 
amphibius 


NADH1 


212 


90 


5177 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


84 


76 


5178 


U83280 


Leishmania 
donovani 


39 kDa antigen 


128 


90 


5179 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


183 


45 


5180 


L27428 


Homo sapiens 


reverse transcriptase 


106 


46 


5181 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


144 


45 


5182 


R83119 


Homo sapiens 


Human cisplatin resistance protein. 


142 


96 


5183 


R95330 


Homo sapiens 


Tumor necrosis factor receptor 1 death 
domain ligand (clone ITU). 


475 


71 


5184 


AF090930 


Homo sapiens 


PRO0478 


121 


57 


5185 


AF193613 


Homo saoiens 


cell recognition molecule Casnr2 


338 


50 


5186 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


102 


36 


5187 


AL031118 


Homo sapiens 


dJ153G14.3 (novel C2H2 type Zinc 
Finger protein) 


465 


62 


5188 


AF183144 


Mus musculus 


left-right dynein 


199 


83 


5189 


J02459 


bacteriophage 
lambda 


J (taiLhost specificity; 1 132) 


748 


95 


5190 


AF193613 


Homo sapiens 


cell recognition molecule Caspr2 


229 


50 


5191 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


135 


65 


5192 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


120 


84 


5193 


Y00311 


Homo sapiens 


Human secreted protein encoded by 
gene 54. 


112 


49 


5194 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


126 


56 


5195 


AF104261 


Mus musculus 


Pax transcription activation domain 
interacting protein PTIP 


487 


72 


5196 


G00333 


Homo sapiens 


Human secreted Drotein. SEO ID NO* 
4414. 


132 


65 


5197 


G03043 

i 


Homo sapiens 


Human secreted protein, SEQ CD NO: 
7124. 


130 


80 


5198 


X55777 


Homo sapiens 


52kD protein 


106 


30 


5199 


AF092170 


Homo sapiens 


polycystic kidney disease 2 related 


122 


91 



267 



WO 01/64835 



PCT/US01/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Identit 










Score 


y 








protein 






520U 


A CAQ9 1 *7A 

Aruyzi /U 


— — __ 

Homo sapiens 


polycystic kidney disease 2 related 


1 JO 


A£ 

96 








protein 






com 




no mo sapiens 


Jtiiunan secretea protein, oci^ iu lskj. 


10 f 


Of 








/o7y. 






CO AO 


AT UU VV 1 0 


uryza sauva 


similar to kunvj-Jtiz ringer protein 


i in 

J; IU 


ol 








KilA 1 a ^Ar U / oOoj ) 






com 


P05011 


_ . 

Homo sapiens 


Neural thread protein. 


vo 


ol 


^OA/f 

D_£U4 


VJVIZjVF/ 


rtomo sapiens 


Human secretea protein, oc^ iu inu. 


OJ 


JJ 








OJOO. 








Mj iy4/0 


rioiiiO sapiens 






1UU 


COAA 


vjUzoiz • 


Homo sapiens 


Human secretea protein, bJbQ ID NO: 


7o 


_:o 
62 








oylJ. 










— — ; 

Homo sapiens 


Human secretea protein, bbvj ID NU: 


1 1 A 

1 ly 


75 






— — _ 


4440. 






COAQ 


vjUJ /yU 


Homo sapiens 


Human secretea protein, SEQ ID NO: 


166 


CA 

50 








/o71. 






->zuy 


/""•AA^QA 

UUUooy 


Homo sapiens 


Human secretea protein, bbQ ID NO: 


1 1 A 

139 


39 








4770. 






DzlU 


A I?AOAO^ 1 

Ar uyi/y 5 1 


Homo sapiens 


JrKO04oj» 


y4 


OA 

yu 


jZi 1 


/"SAAICO 


Homo sapiens 


Human secretea protein, obvj ID NO: 


1 14 


HA 

74 








4438. 






con 


uUiZ4o 


Homo sapiens 


Human secretea protein, SbQ ID NO: 


1 1 c 

116 


OA 

70 








_>iz/. 








Uj.i_.uy 


Homo sapiens 


transformer-2 alpha 


1 Ov! 

124 


AC 

95 


COI >1 


PAO/I5C 

(jU_;4!>j 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


CO 

58 








6566. 






01 c 


1 Uzo / 1 


.... 

Homo sapiens 


Human secreted protein encoded by 


i < i 
131 


A A 

4y 








gene zz clone HMoJ w 1 0. 








¥ OiCAC _ 

L2oyjJ 


Homo sapiens 


chromosomal protein 


AC 

95 


CI 

52 


con 
DZl / 


oUzJJZ 


Homo sapiens 


Human secretea protein, bby ID NO: 


14y 


66 








6613. 








A TOIAO-Tvl 

At zzUz 64 


Homo sapiens 


MOST-1 


1 C 1 

151 


_"C 

65 


5219 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


53 






... 


7214. 






COO A 


/"■AAiC -O 

G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


65 








4718. 








UUJoU / 


Homo sapiens 


Human secreted protein, obv^ ID NO: 


izy 


DO 








-000 
/ooo. 






COOT 


Ari o/tc _o 
Ar iy4jJ7 


— — : 

Homo sapiens 


XT A 1 "2 


1 OA 

1ZU 


OU 


C002 


(jrUUj /o 


Homo sapiens 


T T ncjn lain CCA Tl~\ "KT/~\. 

Human secretea protein, ocy id in la 


1 1 <c 
1 13 


#CA 
OU 








/l/l<7 






3-1x4 


I 144oZ 


"Tr : 

Homo sapiens 


rragment 01 numan secrerea prorein 


1 AA 
144 


Jo 








encoaea oy gene i /. 








A"CAAAQA< 

Aruuuyyo 


xiomo sapiens 


uDiquiious lrivmoui, i lsoiorm 


104 


/U 




000332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


IOC 
125 


73 








A A 1 1 






5997 




— : 

Homo sapiens 


TJiim An o__h_~»i*__k_ b '-___r4 «»/\^a « n vufi iTv XT/ii 

riuman secrexea protein, oc\i iu rsvj. 


104 


^0 

oy 








/ j iy. 






599 ft 


M_>44j1 


— . 

Caenorhabditis 


collagen (rol-6) 




41 






eiegans 






599 O 


X1639o 


Homo sapiens 


precursor polypeptide (AA -zy to J 1 5) 


iUo 


OO 

OO 




AT _CAOQO 

AJ_Ojy /o2 


Trypanosoma 


possible Qniiv-oJ ui lUz, variant a dna, 


1 12 


oy 






brucei 


complete virion genome. 






5231 


G03798 


Hnmn *_mip?i^ 


Human cprrptf_H nrotein SFO TF> NO* 


157 


75 








7879. 






5232 


D86982 


Homo sapiens 


similar to human ankyrin 1(S08275) 


201 


100 


5233 


AF116712 


Homo sapiens 


PR02738 


135 


49 


5234 


AL049758 


Homo sapiens 


dJ437M2 1 . 1 (novel protein) 


184 


70 



268 



WO 01/64835 



PCT/USO 1/04927 



SEQiD 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


°/o 

Identit 
y 


5235 


AB014527 


Homo sapiens 


KIAA0627 protein 


164 


100 


5236 


AF194537 


Homo sapiens 


NAG13 


183 


64 


5237 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct 


170 


43 


5238 


AF135439 


Mus musculus 


form in binding protein 1 1 


230 


100 


5239 


GO 1246 


Homo saoiens 


Human secreted protein, SEQ ID NO: 
5327. 


137 


69 


5240 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


168 


63 


5241 


AF116661 


Homo sapiens 


PR01438 


152 


58 


5242 


L27428 


Homo sapiens 


reverse transcriptase 


193 


53 


5243 


AF130089 


Homo sapiens 


PRO2550 


141 


60 


5244 


AF1 18086 


Homo sapiens 


PR01992 


135 


71 


5245 


AJ228863 


Mus musculus 


CLIP-115 


195 


97 


5246 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEOIDNO:163. 


148 


81 


5247 


AC007887 


Arabidopsis 
thai i ana 


F1504.29 


76 


93 


5248 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


151 


75 


5249 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEOIDN0163 


176 


66 


5250 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEOIDNO:163. 


136 


70 




Y86248 


Hnmo ^aniens 


Human secreted nrotein HCHPF68. 
SEQ ID NO: 163. 


162 


80 


5252 


Y86248 


Homo ^aniens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


112 


54 


5253 


AF090944 


Homo saoiens 


PRO0663 


177 


62 


5254 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


132 


57 


5255 


Y02749 


Homo saoiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


129 


74 


5256 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


157 


91 


5257 


Y86248 


Homo ^aniens 


Human secreted mote in HCHPF68. 
SEQIDNO:163. 


160 


78 


5258 


U16296 


Homo saoiens 


TIAM1 protein 


133 


100 


5259 


AL 160371 


T.eishmania 
major 


Drobable fhhv-6^ ul 102. variant a 
DNA, complete virion genome 


92 


93 


5260 


U49973 


Homo saoiens 


ORF1; MER37; putative transposase 
similar to pogo element 


324 


78 


5261 


AK024455 


Homo sapiens 


FU00047 protein 


100 


50 


5262 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


217 


100 


5263 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


125 


57 


5264 


U83246 


Homo sapiens 


copine I 


199 


86 


5265 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


150 


72 


5266 


U91329 


Homo sapiens 


kines in-like motor protein KIF1C 


132 


93 


5267 


J00672 


Oryctolagus 
cuni cuius 


beta-myosin 


109 


100 


5268 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


74 


43 


5269 


AF1 19851 


Homo sapiens 


PR01722 


96 


61 


5270 


AF090930 


Homo sapiens 


PRO0478 


113 


91 


5271 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


123 


62 


5272 


B01372 


Homo sapiens 


Neuron-associated protein. 


122 


60 
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SEQ U> 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterraan 


Identit 










Score 


y 


5273 


Y86472 


Homo <ianien<; 


Human pene 57 -encoded nrotein 

1JL.U111U41 gCIJC Ji> biivvuvupvivui 


116 


78 








fraement SEO ID NO*387 






5274 


AF090931 


Homo sapiens 


PRO0483 


144 


67 


5275 


AL 160493 


L»e ish m an ia 


nrnhable fhhv-6^ ul 102 variant a 


1 17 


68 i 






major 


DNA comnlete virion genome 






5276 


G03438 


Homo <5anien<; 


Human serrated nrotein SFO TD NO* 


11 1 
iii 


JO 








7519. 






5277 


G03714 


Homo ^Anipnc 


T-TumaTi QPnv»l¥ fc H r>rnt<*in SFO inNO* 


147 

1*T / 


^7 








7795. 






5278 


AP000616 


Orvza sativa 


similar to RTNO-H2 finder nrotein 

dJllIUIll W 1X11 1\J li« llllt^Xsl UJVlwUl 


120 

liV 


67 








RHA 1 a (AF0786831 






5279 


G03714 


Homo ^flnien*? 


Human serreted nrotein SFO TO MO* 


1S1 


07 








7795. 






5280 


AFO 10144 


Homo saniens 


neuronal thread nrotein AD7c-NTP 


196 


4S 


5281 


G03714 


Homo saniens 


Human secreted nrotein SEO ID NO* 


115 

iw 


71 








7795. 






5282 


A18812 


Brassica nanus 


extensin 


120 


^8 


5283 - 


Z68747 


Homo saniens 

A1V1UV QQUIvllw 


imo^en 38 

UliVgVU mJ \) ^ 


291 


7S 


5284 


X778I6 


Rattus 


PR-Vbetal 


97 


17 






norvefficus 








5285 


AF130089 


Homo saniens 


PRO2550 


181 


71 


5286 


AF130089 


Homo saniens 


PRO2550 


147 


67 


5287 


AF090942 


Wr»mo canipn? 


PR 00^57 


00 


JU 


5288 


G03043 


riAsuiu oauiciii> 


million jcu cicu pivtciii, OE*\£ IXJ IN w. 


1 1 ^ 

1 iO 


74 








7124 






5289 


G03714 


IJAniA canipnc 
numu oafJldio 


XlUIilail oBCTClCU prULClil, OCy 1U INv^. 


14.1 


/^4 
Oh 








7795 






5290 




nomy sapicnb 


similar 10 c.cou nypouieuvdi z^.o inx* 


1 1 1 
ill 


i no 








nrnteinfPI-VTnW RTOT X\ 






5291 


W48351 


Homo wmipnQ 


Wiimnn hrpact r*afi/*Ar rplatpd nrntpin 
JuLUiiiaii UlCaol v^aiiuci iciaicu jJiUlClil 


14^ 


57 








RCRB2 






5292 


Y86248 


Homo saniens 


Human secreted nrotpin HC!HPF68 

1 LUlllUil Jbul WlbU UI VlbUl ll\jl U 1 \J\J y 


131 


6S i 








SEO ID NO* 163 






5293 


Y08061 


Homo saniens 


Human c-mvh nrotein frapnnent- 

x jl nil i cm \* injr u yi uiwui nagiiiwiic 


213 


76 


5294 


AF090895 


xxviuvf oajjicuio 


PR O0 1 1 7 

1 Ivwv 111 


114 




5295 


AFO 1763 5 


Wnmn canipnQ 




14^ 


100 


5296 


G02558 


I lUlllVJ oaLHGlld 


Human CArrpf#»H nrntpin Q"FO TH MO* 

nuillall oCwiCLCU piUvCUl, ODy ILJ Vi\s. 




ou 








6639 






5297 


AP000616 


Orv7_a Qativa 


cirnilflrto RTMfr-H'7 finc»er r»rntein 

OlXllllCU LU rVlLN VJ 1 JuCt lUlgCI U1V/ Iw LU 


118 

no 


67 








RHA 1 a CAF078683 » 

ivii/\i a y/ii v / ouoj y 






5298 


AK024455 


Homo saniens 


FLJ00047 nrotein 


67 


80 1 


5299 


Y86248 


Homo saniens 


Human serreted nrotein HPHPF68 

A mumii Jvwiwiuu yiviwiii liviu i \JsJj 


164 


75 








SEQIDNO:163. 






5300 


Y86248 


Homo saniens 


Human secreted nrotein HCHPF68 


129 


55 








SEQIDNO:163. 






530] 


R14643 


Homo sapiens 


Gamma interferon receptor. 


107 


82 


5302 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


238 


90 


5303 


Y08062 


Homo saniens 


Human PR0245 nrotein fragment 

A JLVU1 1UH A. i/1 VlvllI JUUCJUV4H 


205 


68 








derived from DNA35638. 






5304 


AF130089 


Homo saniens 

nviiiw juyivuv 


PRO2550 


100 


77 


5305 


R13556 


Homo saniens 


Protein encoded downstream of hhc A/f 

A 1\SIA*111 vllWUUVU UwT? 1UUVCUU \J 1 ULUV* 1VJL, 


1 17 
iw 


44 








nn ronrote in 






5306 


X92485 


Plasmodium 


pval 


128 

l—O 


61 






vivav 

V I V OA 








5307 


G02597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


77 


57 








6678. 






5308 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


140 


42 








clone HPMBQ32. 






5309 


Y91577 


Homo sapiens 


Human secreted protein sequence 


489 


78 
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SEQI0 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








encoded by gene 2 SEQ ID NO:250. 






5310 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 


65 


5311 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


129 


65 


5312 


AF1 19851 


Homo sapiens 


PR01722 


97 


65 


5313 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


160 


90 


5314 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


113 


81 


5315 


AF071310 


Mus musculus 


OPA-containing protein 1 


228 


53 


5316 


AF090942 


Homo sapiens 


PRO0657 


162 


70 


5317 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


109 


83 


5318 


AF090894 


Homo saDiens 


PRO0113 


125 


63 


5319 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


129 


66 


5320 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


130 


75 . 


5321 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


138 


81 


5322 


L27428 


Homo sapiens 


reverse transcriptase 


139 


52 


5323 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


109 


83 


5324 


AF 11 8082 


Homo sapiens 


PRO1902 


107 


70 


5325 


M81321 


Macaca 
fascicular is 


proline-rich protein 


114 


37 


5326 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


112 


79 


5327 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


99 


72 


5328 


AF090930 


Homo sapiens 


PRO0478 


152 


73 


5329 


L27428 


Homo sapiens 


reverse transcriptase 


267 


65 


5330 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


140 


82 


5331 


AL160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


164 


54 


5332 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


135 


52 


5333 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


124 


76 


5334 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


97 


56 


5335 


AP000060 


Aeropynim 
pemix 


103aa long hypothetical protein 


71 


50 


5336 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


75 


5337 


Z93784 


Homo sapiens 


dJ398C22.1 (novel protein, ortholog of 
mouse brain protein E46) 


280 


68 


5338 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


132 


64 


5339 


AF210651 


Homo sapiens 


NAG18 


160 


56 


5340 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


133 


80 


5341 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


151 


75 


5342 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


105 


50 


5343 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


103 


75 


5344 


G03021 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


65 j 
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WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

7 








7102. 






5345 


Y87103 


Homo sapiens 


Human secreted nrotein seauence SEO 
ID NO: 142. 


92 


48 


5346 


AL121891 


Homo sapiens 


dJl I87M17.2 (KIAA0552 protein) 


148 


43 


5347 


G00689 


Homo sapiens 


Human secreted nrotein SEO ID NO* 
4770. 


130 


33 


5348 


AF090944 


Homo sapiens 


PRO0663 


205 


72 


5349 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


135 


57 


5350 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
Drotein rha 1 a 


101 


37 


5351 


AF265578 


Homo sapiens 


orphan neurotransmitter transporter 
NTT5 


121 


39 


5352 


AP0006I6 


Oryza sativa 


similar to RING-H2 finger protein 
RHA la (AF0786831 


105 


75 


5353 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


167 


69 


5354 


Y02671 


Homo ^aoien*; 


Human ^ecrpted nrotein enooderf nv 

gene 22 clone HMSJW18. 


97 




5355 


G00590 


Homo ^aoien*: 


Human ^prrptftH nmtpin SPO TO NO* 

I 1 Lull cm oCUvlw plUlClUj uCy XU llv. 

4671. 


1 13 


ou 


5356 


R96800 


Homo ^anien^ 


Human hi<itinrvtp-<;**f*retf*H fartnr HSF 


123 


67 


5357 


W88609 


Homo sapiens 


Secreted protein encoded by gene 76 
clone HKIXL73 


94 


79 


5358 


AC004450 


Arabidopsis 
thaliana 


putative extensin 


110 


37 


5359 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHA la fAF078683"> 


120 


79 


5360 


Y86248 


JL lwLXJLV/ OClLflVllO 


Human cprrpfp-d nrnfpin HPHPPfift 

I lLXllla.ll dbtlCLCU L/l VsU&LU nVUi FUO, 

SEQ ID NO: 163. 




74 

/ fr 


5361 


Y86248 


Homo ^anif*n^ 


Human <5f*fTi»tftH nmtpin HPHPFfift 
nuiutui ovui &i£^i jjiuixvLU xxv^xxjr r uo } 

SEQ ID NO: 163. 


1 f*T 


Q7 


5362 


AB017437 


Cralln<i callus 

VJ CXI 1 no goiiuo 


avpna 


98 
70 


AA 


5363 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID TsIO-163 


155 


93 


5364 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO TD NO* 163 


181 


70 


5365 


G02538 


Homo sanien^ 


Human secreted nmtern 55FO TO NO* 

XXIXJUCUI gwlwlwU UIUH/III) Ulj\^ LLS 

6619. 


159 


68 


5366 


Y73966 


Homo ^aniens 


Human wo^tate tumor EST rrapmpnt 
derived protein #153. 


94 


60 


5367 


X52022 


Homo saoiens 


collagen tvoe VI aloha 3 chain 


150 


33 


5368 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


120 


77 


5369 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


145 


75 


5370 


X97675 


Homo sapiens 


plakophilin 2b 


170 


85 


5371 


AF130079 


Homo sapiens 


PR02852 


103 


95 


5372 


L26953 


Homo sapiens 


chromosomal protein 


117 


54 


5373 


AF210651 


Homo sapiens 


NAG18 


137 


67 


5374 


Y30681 


Homo sapiens 


Splice variant ZAP- IB protein of the 
human tumor sunoressor e*ene 7.AP-1 


106 


76 


5375 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla CAF078683> 


109 


70 


5376 


AL109819 


Arabidopsis 
thaliana 


extensin-like protein 


123 


34 


5377 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


127 


79 


5378 


AF036170 


Dictyostelium 


homeobox-containing protein Wariai 


93 


83 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


iNU: 








Waterman 


Identil 










Score 


V 






discoideum 








5379 


AF130089 


Homo sapiens 


PRO2550 


134 


75 


5380 


AF161384 


Homo sapiens 


HSPC266 


663 


96 


5381 


AF130051 


Homo sapiens 


PRO0898 


147 


77 


5382 


AF090944 


Homo sapiens 


PRO0663 


118 


83 


5383 


G03133 


Homo saniens 


Human secreted nrotein SFO ID NO* 


129 


55 








7214. 






5384 


Y02671 


Homo sapiens 


Human secreted Drotein encoded bv 


no 


68 








gene 22 clone HMSJW18. 






5385 


Y86248 


Homo saoiens 

1WU4V JUl/iVllw 


Human secreted protein HCHPF68, 


165 


53 








SEQIDNO:163. 






5386 


B03628 


Homo saoiens 


Human nhosnholinase 2 HPPI/7 


145 


96 


5387 


LT54788 


Mus musculus 


Wiskott-Aldrich Svndrome Protein 


108 


51 


5388 


GO 1828 


Homo saoiens 


Human secreted nrotein SRO ID NO* 


104 


89 








5909. 






5389 


Y86248 


Homo <*anien < i 


Human secreted nrotein HPHPFfift 


1?5 
i i,j 


70 i 








SEO ID NO* 163 






5390 


GO 1828 


Homo <?anien^ 


Human <*ecreted nrotein SFO ID NO' 


103 

1 UJ 


72 








5909. 






5391 


AL359782 


Trvoan osom a 


tvo^siHIe flihv-6^ 11 1 1 0? variant a dtia 


1 19 








brucei 


conrolete virion penome 






5392 


G00454 




Human QAPr*»t*>d r»rrvtf»in QPO TO MO* 
JXmiiall oCwiCLdl piutciii, OCy JJL/ IiV/. 


VIA} 










4535. 






5393 


G03172 


Homo «*anien«i 


Human «M^Tpted Tirntf-in SPO TD MO* 
xxmuciii o^/icicu uiv*LcuUy ocy \xj itv. 


I J I 


755 








7253. 






5394 


G01552 


Homo saoiens 


Human secreted nrotein SEO ID NO* 


159 


90 








5633. 






5395 


G03043 


Homo sapiens 


Human secreted nrotein. SEO ID NO* 


110 


65 








7124. 






5396 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


85 


59 








encoded by gene 17. 






5397 


X86681 


Homo sapiens 


HNP36 protein 


248 


71 


5398 


L27428 


Homo sapiens 


reverse transcriptase 


133 


62 


5399 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 


132 


54 








oncoprotein. 






5400 


AL080196 


Homo sapiens 


hypothetical protein 


164 


96 


5401 


D38112 


Homo saoiens 


NADH dehvoroiyena«?e "suhunit 5 


119 


75 


5402 


AF090944 


Homo saoiens 


PRO0663 


99 


64 


5403 


G00352 


Unmn cftnipnQ 


lliimon cprrpiWt nrntein QFO TH "WO* 


01 

yj 


JO 








4433. 






5404 


U42471 


Mus musculus 


Wiscott-Aldrich Svndrome nrotein 


103 


37 








homoloe 






5405 


W05278 


Homo sapiens 


Tumour necrosis factor-related gene 


137 


80 








product CL6.5.40. 






5406 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


69 








4484. 






5407 


G00403 


Homo sapiens 


Human secreted nrotein. SEO ID NO* 


102 


69 








4484. 






5408 


G00328 


Homo saoiens 

& ivmv i wi u 


Human secreted nrotein SRO TD NO* 




69 








4409. 






5409 


G03438 


Homo ^aniens 


TTiiman ^efTP-tefi nrotein SEO ID NO* 












7519. 






5410 


Y12950 


Homo sapiens 


Amino acid seuuence of a human 


65 


47 








secreted nentide 






5411 


AF181626 


Drosophila 


BcDNA.GH03694 


171 


53 






melanogaster 








5412 


U93565 


Homo sapiens 


putative pi 50 


195 


70 


5413 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


170 


64 








7253. 
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WO 01/64835 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 


5414 


Y86248 


Homo ^anicriQ 


Unman c*»rrptf»H nmt#»in HPHPFKR 

JTLUlUall oCUlClCU JJlUtClll nV^XXT r UOj 

SEQID NO: 163. 




ai 


5415 


X83572 


Homo sanien*; 




133 


inn 


5416 


D86853 


Catharanthus 
rose us 


extensin 


113 


41 


5417 


Y86248 


Homo ^anien<: 


Human <&rre.teA nmfpin HPHPFfiR 

SEO ID NO* 163 


139 


fin 


5418 


G03472 


Homo saoierK 


Human secreted nrotein SEO FD MO' 
7553. 


175 


n 


5419 


R32010 


Homo <;anien<i 




140 


Oj 


5420 


D67066 


Ros taums 

1-/ Vu 1 Cl 1 II 1 li 1 


N-WASP 


153 


41 


5421 


U42391 


Homo sanien^ 


m vo^i n-Dfh 


703 


100 


5422 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO* 163 


116 


74 


5423 


AF220264 


Homo <iani*»n<; 


MOST-1 


94 


7R 


5424 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


128 


63 


5425 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


131 


57 


5426 


AF130089 


Homo sapiens 


PRO2550 


120 


58 


5427 


Z70204 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
rrUUoZo (JrHJL> linger}, ocore=63.1, 1> 
value=1.9e-15, N=l~cDNA EST 
tMoL. iuudjo comes irom mis 
gcne~cjLiiN/\ noi yivj i + c n / . j uumcs 
from this gene 


131 


55 




VJUJ 1 / X. 


nuiiiu sapiens 


Unman fo/>rofaH ntvifofn CPA m W/V 

11 urn an sccreicxi protein, oc»y lu wu. 
7253. 


I Jo 


C 1 




AF1300RQ 


Hnmn contort c 

numu sapiens 




1 17 


HI 


5430 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene cione ruvioj wio. 


161 


52 






nuxnu sapiens 


Unman co/«rotaH nrnfain QCO 1T\ WY"V 

xiuman secrexeu protein, ony jjj inv/. 
7124. 


1^1 


77 




fr017Q7 


nuniu sapiens 


Unmon corrotM^ nrAioin CCA TT*| XT A. 

iimnon sccrcieu pruiem, oe\£ iu iskj. 
7878. 


\ ^7 


70 


5433 


AF090Q47 


numu sapiens 




i no 


fi7 

o/ 


5434 


AE003588 


Drosophila 

uiciaiiugabicr 


CG2839 gene product 


141 


50 


5435 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

*fOJO. 


154 


79 


5436 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
78 so 


128 


41 


5437 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
#000. 


143 


63 


5438 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253 


117 


84 


5439 


G03793 


Homo Qar>ipn<* 


Human secreted nrotein SEO ID NO* 
7874. 


1 12 


7fi 


5440 


AP000616 

111 WW 1 w 


Orvza <iativa 

v/i jr *»a MU v a 


similar to RTNO-H2 fmo*emrotein 
RHAIa CAF078683^ 


110 


42 


5441 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


124 


64 


5442 


AT 160493 


lvvJolJLlll all la 

major 


TimKsiVil a /T1I1 11 1 1 07 variant a 

^UIIV v/ Ul 1UL-, YOliaXii a 

DNA, complete virion genome 


193 


JO 


5443 


AU/tyJ 


nuiiiu sapiens 


tranclci+AH r**<yinn f A A 1_ 7A4i 
uaOSlalcU tcgiun ^/Vrv 1 Zvvr-t j 


10K 

1 I/O 


41 


5444 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


135 


62 


5445 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


130 


41 


5446 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c) 


136 


71 
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SEQto 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


5447 


Y014G0 


AXVJIltU iwljylvllo 


Secreted protein encoded by gene 1 8 












clone HNHF029 






5448 




Urtmn cariipnc 


Am inn or*iri cpnnptifA /^-f" cq 1 iv/qt*\/ 
xVAiiuiu cAutu dcuucutc kjl oiUlVal y 


101 


4S 








protein CON-2. 






5449 


U13866 


unidentified 


non-functional 1at*7 alnha nenti Hp 

l ivy ii AuiiviivMAai lui^o oAAMJu ugl/iavac 


106 


57 






cloninp" vector 






5450 


X01469 


Plasmodium 

M. ICXOlllVSvlllXlll 


hi^tidinp-rirh nrntpin 

lllOlUJUlt. Aiv>Ai JJi UlClil 




56 






Ioohurae 








5451 


G02480 


Homo saniens 


Human secreted nrotein SEO ID NO* 


130 


88 








6561. 






5452 


Y 14482 


Homo ^aniens 


Fragment of human secreted nrotein 


155 


68 








encoded by gene 17. 






5453 


AF1 19900 


Homo sapiens 


PR02822 


162 


63 


5454 


Y01400 


Homo saniens 


Secreted nrotein encoded hv f?ene 1 8 


120 


51 








clone HNHF029 






5455 


AC007^04 


Plnmn cfloifMic 

AAV 11 IV CKLL/lvllO 


BC273239 1 


1485 


69 


5456 


L27428 


Hnmn *mniens 

AAU111VS dtAAJLliltO 


rpvpHiP transcrintase 


122 

liil> 


41 


5457 


Ml 8094 


A AluOt/UlUd 


h vHroYvnrnKnp-ri^fi O"lvi*onrotpiri 

11 jr UJ. KJA.J pi Ulll.lt/ 11W1A jpljr vUUl UtClll 


141 
j" i 








vulgaris 








5458 


AP000616 


Orw3 oativa 


similar to R TMfr-T-T^ finopr rvmfpin 

dilAlllCU \\J 1XA1 tVJ 1 L£* AAUgVl L/lULdll 


118 


68 








RHAlaCAF078683i 






5459 


AF090930 


Homo aniens 


PRO0478 


144 


64 


5460 


AF 13 0089 


Homo saniMiQ 

AAU1A1U dOAJlWlO 


PRO2550 


124 


60 


5461 


G00377 


Homo saniens 


Human secreted nrotein SFO ID NO* 


135 


69 








4458. 






5462 


X97675 


Homo saniens 


nlakonhilin 2h 


103 


72 


5463 


G03172 


Homo ^anien^ 


Human arreted nrotein SRO ID NO* 

AAU1UCUA Jvvlvl^U plvlwulj UJLA^ 1 JLV liV/ . 


137 


69 








7253. 






5464 


AF229067 


Homo saniens 


PADI-H nrotein 


146 


65 


5465 


AF1 16715 


Homo sapiens 


PR02829 


83 


71 


5466 


AF090930 


Homo saniens 


PRO0478 


146 


52 


5467 


U93574 


Homo sapiens 


DUtative d150 


203 


52 


5468 


W48351 


Homo saniens 


Human breast cancer related nrotein 

mil mm i/&vm«?v vuuvwi iviuivu i/iviwm 


89 


50 








BCRB2. 






5469 


AF191838 


Homo saniens 


TANK binding kinase TBK1 


121 


95 


5470 


AF 130079 


Homo «*aniens 


PR02852 


146 


69 


5471 


AF090944 


Homo Ranien^ 

A AvH.il V ZSOQJlWllO 


PRO0663 

AT IVVJVVW 


on 


66 


5472 


G03801 


Homo «»antpn<; 


Human secreted nrotein SEO ID NO* 

AAU111CU1 JvH V/Lt>Ll y/lKJlXikXlj OUv^ AAV 1 N W- 


106 


63 








7882 






5473 


Y86248 


Homo <?anien<; 

A 1VJ111VJ SClLllvrllA 


Human secreted nrotein HPHPF6X 

AAlAIAlall JbUlwLCU UlUlwlll 1 1V>111 1 V/Oj 


127 


60 








SEOIDN0163 

*U/J^^^ X^X^«XV*/*> 






5474 


AF064553 


Mus musculus 


NSD1 nrotein 

11 KJU X fJl wlVAlA 


269 


100 


5475 


G00382 


Homo *;anien<i 


Human secreted nrotein SEO ID NO* 

XXU1X1CU1 OUVlvVvU UlVfcvUt) KfM^\J 11/ 1 IV/ • 


132 


65 








4463. 






5476 


G03172 


Homo saniens 


Human secreted nrotein SEO ID NO* 


130 


86 








7253. 






5477 


U11271 


Homo sapiens 


thromboxane A2 receptor 


118 


67 


5478 


G02532 


Homo sapiens 


Human secreted nrotein^ SEO ID NO* 


143 


60 








6613. 






5479 


G03172 


Homo saniens 


Human secreted nrotein SEO ID NO* 

llUilltUl JWVl VVVU UlVkvUl] *— ' A-/ \^ JlV/i 


113 


75 








7253. 






5480 


R96800 


Homo sapiens 


Human hi^ocvte-^ecreted factor HSF 


119 


74 


5481 


AP000616 


Orvza sativa 


similar to RING-H2 fineer nrotein 

OUllllfU %,\J 1V11 ~ Vp> A .1 «V * ui p vt iy*Vfc%*AX4 


103 


85 








RHA 1 a ( AF078683 ■ 






5482 


AF238235 


Entamoeba 


diaphanous protein 


108 


35 






histolytica 






5483 


AF090930 


Homo sapiens 


PRO0478 


120 


55 


5484 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


111 


84 








SEQIDNO:163. 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


5485 


M61883 


Sus scrota 


apomucin 


252 


55 


5486 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


77 








7124. 






5487 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


46 








7124. 






5488 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


62 








4414. 






5489 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 


124 


47 






elegans 


binding protein K10 (NID:g8148) 






5490 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


220 


71 








SEQ IDN0.163. 






5491 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


195 


65 








SEQIDNO:163. 






5492 


M76546 


Helianthus 


hydroxyproline-rich protein 


116 


38 






annuus 








5493 


AF 11 8086 


Homo sapiens 


PR01992 


135 


82 


5494 


AF090930 


Homo sapiens 


PRO0478 


105 


74 


5495 


Y94451 


Homo sapiens 


Human inflammation associated 


170 


61 








protein #8. 






5496 


G03465 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


72 


65 








7546. 






5497 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


78 








4409. 






5498 


X56123 


Mus musculus 


taiin 


594 


88 


5499 


AL035652 


Homo sapiens 


dJU6.l ftopoisomerase (DNA) H 


105 


86 


5500 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


81 








4409. 






5501 


Y45389 


Homo sapiens 


Human secreted protein fragment 


161 


93 








encoded from gene 3 1 . 






5502 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


119 


60 








4414. 






5503 


X03145 


Homo sapiens 


pot ORF I 


215 


79 


5504 


AF240630 


Mus musculus 


IO motif containing GTPase activating 


180 


61 








protein 1 






5505 


AL390212 


Homo sapiens 


(U29M10B.1.2 (protein kinase C 


1282 


93 








binding protein 1 (isoform 2)) 






5506 


AF1 18086 


Homo sapiens 


PR01992 


146 


61 


5507 


S58722 


Homo sapiens 


X- linked retinopathy protein {C- 


116 


74 








terminal, clone XEH.8c} 






5508 


U42471 


Mus musculus 


Wiscott-AIdrich Syndrome protein 


165 


50 








homolog 






5509 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


211 


95 


5510 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


114 


69 








SEQK)NO:163. 






5511 


AF090944 


Homo sapiens 


PRO0663 


125 


58 


5512 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


53 








7492. 






5513 


Y36156 


Homo sapiens 


Human secreted protein #28. 


110 


68 


5514 


AF 11 8086 


Homo sapiens 


PR01992 


128 


76 


5515 


Z22866 


Mus musculus 


skelemin 


218 


95 


5516 


Z22866 


Mus musculus 


skelemin 


218 


95 


5517 


G00376 


Homo sapiens I 


Human secreted protein, SEQ ID NO: 


127 


71 








4457. 






5518 


AL133380 


Homo s aniens 


(U862P83 (Similar to MAP3K1 0 


603 


77 








(mitogen-activated protein kinase 












kinase kinase 10)) 






5519 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


187 


78 








7519. 






5520 


AF090901 


Homo sapiens 


PRO0195 


115 


60 
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SEQ ID 

NO: 




Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 


5521 


Y8624S 

• 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


180 


53 


5522 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


155 


73 


5523 


W90847 


Homo sapiens 


Human lymphocyte targeted peptide 
#15. 


149 


100 


5524 


X97675 


Homo sapiens 


plakophiiin 2b 


153 


70 


5525 


D37793 


Mus musculus 


synaptotagminII/IP4BP 


651 


84 


5526 


M32319 


Homo sapiens 


HLA protein allele B5 1 


754 


94 


5527 


AF130051 


Homo sapiens 


PRO0898 


93 


64 


5528 


X92485 


Plasmodium 
vivax 


pval 


88 


66 


5529 


AJ131890 


Homo sapiens 


DNA polymerase lambda 


205 


100 


5530 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


132 


68 


5531 


Y02926 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 10L 


87 


59 


5532 


AF090931 


Homo sapiens 


PRO0483 


133 


75 


5533 


AF090942 


Homo sapiens 


PRO0657 


122 


66 


5534 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


145 


56 


5535 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


569 


91 


5536 


K02401 


Homo sapiens 


chorionic somatomammotropin 


375 


98 


5537 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


136 


68 


5538 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
6561. 


130 


44 


5539 


S62928 


Homo sapiens 


PRBIM protein precursor 


104 


40 


5540 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


127 


70 


5541 


AE000463 


Escherichia coli 
K12 


putative glycosidase 


254 


100 


5542 


J02459 


bacteriophage 
lambda 


H (tail component;853) 


761 


100 


5543 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


119 


58 


5544 


U59185 


Homo sapiens 


putative monocarboxylate transporter 


233 


87 


5545 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


187 


65 


5546 


D87446 


Homo sapiens 


Similar to a C.elegans protein encoded 
in cosmid C27F2 (U404I9) 


556 


84 


5547 


X92485 


Plasmodium 
vivax 


pval 


110 


67 . 


5548 


AB023233 


Homo sapiens 


KIAA1016 protein 


246 


97 


5549 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


95 


64 


5550 


AC002044 


Homo sapiens 


Alpha-fetoprotein enhancer binding 
protein (3* partial) 


111 


38 


5551 


AF026802 


Homo sapiens 


alpha-3 type DC collagen 


117 


47 


5552 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


85 


88 


5553 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


91 


88 


5554 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


104 


71 


5555 


AF130089 


Homo sapiens 


PRO2550 


172 


86 


5556 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


105 


72 


5557 


G0341I 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


160 


61 
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SEQ ED 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


5558 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


131 


68 


5559 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


112 


67 


5560 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


159 


73 


5561 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


141 


67 


5562 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


150 


71 


5563 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


146 


80 


5564 


AF090942 


Homo sapiens 


PRO0657 


157 


73 


5565 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


169 


43 


5566 


AL035526 


Arabidopsis 
thaliana 


extensin-like protein 


103 


40 


5567 


AF130089 


Homo sapiens 


PRO2550 ~ 


132 




5568 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO* 
4493. 


129 




5569 


X67337 


Homo sapiens 


Human pre-mRNA cleavage factor 1 68 
kDa subunh 


153 




5570 


AF068266 


Homo sapiens 


EHT protein 


169 


91 


5571 


W48927 


Homo sapiens 


Schwannomin-binding protein C- 
termina! fragment 


113 


42 


5572 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


1 12 


oo 


5573 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


122 


71 


5574 


L20450 


Mus museums 


DNA-binding protein 


283 


61 


5575 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


114 


95 ; 


5576 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


140 


71 


5577 


Y20648 


Homo sapiens 


Human neurofilament-L wild type 
protein fragment 38. 


66 


37 


5578 


U42471 


Mus museums 


Wiscott-Aldrich Syndrome protein 
homolog 


112 


41 


5579 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


142 


74 


5580 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


142 


11 


5581 


AF153606 


Homo sapiens 


angiopoietin-related protein 


737 


95 


5582 


AF090930 


Homo sapiens 


PRO0478 


149 


77 


5583 


X52493 


Glycine max 


DNA-directed RNA polymerase 


140 


46 . 


5584 


M76546 


Helianthus 
armuus 


hydroxyproline-rich protein 


101 


51 


5585 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


66 


47 


5586 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


130 


72 


5587 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


148 


56 


5588 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


148 


66 


5589 


AC006014 


Homo sapiens 


similar to KIAA0618 and nuclear 
envelope protein POM 121; 
alternatively spliced product; similar to 
PID:g3327050 andP52591 
(PID:gl709213) 


1 I/O 




5590 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 


122 


84 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 








oncoprotein. 






5591 


AB001684 


Chlorella 


ORF74 


61 


47 






vii loans 








5592 


AF090930 


Homo saoiens 


PRO0478 




70 


5593 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


57 








7077. 






5594 


W80406 


Homo sapiens 


A secreted orotein encoded hv clone 


lZf«T 










dh40_3. 






5595 


L26953 


Homo sapiens 


phromoQ/\m5iI fwrrfpiti 


1 tfi 


OA 


5596 


L26953 


Homo saniens 


chromosomal nrotein 

will UIUVJVI UU1 L/l ulviil 


1 1 S 
1 1 j 


fn 
\j § 


5597 


G02532 


Homo <5anipn<; 

IA.KJU.HJ ^CLLJlKiLlj 


Hnman sprrptftd nrotein ^FO ID "MO* 

A LU.llla.LI oCUl ClCU Ul KJIGLLL) Od^ IL/ V% 


1J f 










6613. 






5598 


Y71213 


Homo saoiens 


Human irritahle howel disease related 


111 

i J A 


AA 






Dorvoeotide IMX39 






5599 


G00376 


Homo sapiens 


Human secreted orotein SEO ID NO* 


125 


45 








4457. 






5600 


AF130089 


Homo sapiens 


PRO2550 


142 


62 


5601 


AY014404 


Homo sapiens 


kinesin-like orotein RBKIN2 


297 


98 


5602 


AF210651 


Homo saoiens 


NAG 18 


ill 


71 

/ 1 < 


5603 


AF090931 


Homo sapiens 


PRO0483 


129 


71 


5604 


X01060 


T-Tomo *;aoipn<i 


Tint truncfi^rrin rt^^^ntnr 1 _7^0 1 
jjui. u cuioici l in i cucuLui (da I / j 




RO 

Oil 


5605 


LT49973 


Homo ^anipTi*; 


vyrvr l , iyia-#av_p / , puiu.ii vc Ucui^pvoaoC 




7^ 








similar to nntrn plpmpnt 






5606 


M64793 




C JJ 1 1 V/ Q 71/ F%I l 7>IlTIA-?"T/ > 1l nfTI^Ain 

"Miiivaijr i/i vlliic~iicii lji uiciii 


1 1 0 

117 








norvepiciis 








5607 


A3 1039 


Nicotiana alata 


PRP3 


92 


JU 


5608 


AF090944 


Homo <*anipn<; 


A IvUVUVJ 


147 

1*T f 


Of 


5609 


G00552 


Homo Qftnipn<i 

aauiaivs oapiciio 


Hitman cpprpfpd nrntpin QPO TTi XI fV 




HO 








4633. 






5610 


X01065 


Cairina mo^chata 


histone 


oo 




5611 


U03413 


Oirtvo^tplhinn 

is IV l>Jr \JO lv 11L1111 


calcium V\inH incr nrotein 
V/cLiUiulH UlllUlilg pi UlCLU 


1 V** 


4.0 






discoideum 








5612 


AL 160493 


Leishmania 


rirnl^stKlp fln\\\/— A\ nil 00 variant si 
piuuaL/iG \lillv U y Ul 1 V/X'j Vol lalll a 


1 A A 


17 






major 


TYNA rrvmrvl pfp viriirn ffpnnmp 

UliA, l«UltljJJGl£» VAlJUIl gCUUlAlG 






5613 


G00333 


aawiaivj oaLiiciio 


Hitman co/^T/^orl nrA+pin QCr\ XV\ \T/*^» 

nuiiiaii sccrcicu protein, ocy jllj in w. 


199 


Oj 








4414 






5614 


G01218 


Homo <;anien^ 


Human epprpfpd nrntoin QPO ID MO* 
AAutiiaii dcucicu yivvciiiy os+\j7 ixj t*s\J, 


1Q£ 


07 








5299. 






5615 


G00332 


Homo sapiens 


Human secreted nrotein. SEO ID NO* 


122 


uo 








4413. 






5616 


AB029309 


Homo sapiens 


Now38-bindinff nrotein NowBP 


312 


78 


5617 


X03145 


Homo sapiens 


not ORF III 


101 


80 


5618 


AF090930 


Homo sapiens 


PRO0478 


159 


78 


5619 


D88461 


Rattus rattus 


N-WASP 


107 


40 


5620 


W48927 


Homo sapiens 


Schwannom in-binding protein C- 


104 


38 








tenninal fragment 






5621 


Y36156 


Homo sapiens 


Human secreted protein #28. 


134 


50 


5622 


AF090894 


Homo sapiens 


PRO0113 


119 


65 


5623 


AF130089 


Homo sapiens 


PRO2550 


130 


69 


5624 


AF130089 


Homo sapiens 


PRO2550 


140 


78 


5625 


G03789 


Homo sapiens 


Human secreted nrotein SEO TD NO* 


162 

1 WA 










7870. 






5626 


AF130089 


Homo sapiens 


PRO2550 


152 


80 


5627 


AF151837 


Homo saoiens 

IIVIUV ^(AL/lWAlaJ 


CGT-79 nrntpin 1 


197 


7R 


5628 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


84 








4442. 






5629 


AJ237981 


Vitis vinifera 


putative proline-rich cell wall protein ! 


93 


50 


5630 


G02501 : 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


52 








6582. 
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Qvn in 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Id en tit 

y 


5631 


AF130089 


Homo sapiens 


PRO2550 


151 


62 


5632 


U47661 


Luptnus luteus 


proline-rich protein PRP2 precursor 


67 


28 


5633 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


93 


61 


5634 


AL133215 


Homo sapiens 


bA108L7.6 (semaphorin 4G (sema 
domain, immunoglobulin domain (Ig), 
transmembrane domain (TM) and short 
cytoplasmic domain)) 


323 


100 


5635 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


138 


67 


5636 


AF090942 


Homo sapiens 


PRO0657 


184 


67 1 


5637 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


95 


80 


5638 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


139 


83 


5639 


AF1 19900 


Homo sapiens 


PR02822 


143 


62 


5640 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


151 


61 


5641 


Y36243 


Homo sapiens 


Human secreted protein encoded by 
gene 20. 


118 


72 


5642 


AF090895 


Homo sapiens 


PRO0117 


140 


56 


5643 


Y51611 


Homo sapiens 


Human HSGT1 protein. 


611 


96 


5644 


AFI 62692 


Homo sapiens 


Dutative voltase-eated calcium channel 
gamma-4 subunit 


172 


ID 


5645 


Z98883 


Homo sapiens 


c407A10.1 (GPU (N- 
acetylglucosaminyl transferase 
component)) 


229 


1 VVJ 


5646 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


117 


80 


5647 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


159 


78 


5648 


AF217517 


Homo sapiens 


uncharacterized bone marrow protein 
BM041 


97 


49 


5649 


AF210651 


Homo sapiens 


NAG18 


149 


58 


5650 


AK000385 


Homo sapiens 


unnamed protein product 


125 


36 


5651 


U02313 


Mus musculus 


protein kinase 


143 


96 


5652 


AF090944 


Homo sapiens 


PRO0663 


95 


43 


5653 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


131 


61 


5654 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


100 


58 


5655 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


157 


70 


5656 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


120 


100 


5657 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


123 


47 


5658 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


159 


46 


5659 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


163 


60 


"5660 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


105 


46 


5661 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


125 


71 


"5662 


D87744 


Mus musculus 


DRPLA protein 


89 


40 


5663 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


151 


65 


5664 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


87 


65 
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SEQ if) 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Idkntit 

y 


5665 


AF055985 


Onchocerca 
volvulus 


pyrrolidone-rich antigen 


71 


44 


5666 


A18812 


Brassica napus 


extensin 


98 


41 


5667 


AF1 30089 


Homo sapiens 


PRO2550 


128 


74 


5668 


AF090930 


Homo sapiens 


PRO0478 


136 


54 


5669 


X52634 


Mus musculus 


tlm protein 


169 


59 


5670 


AB001684 


Chlorella 
vulgaris 


ORP49b 


70 


62 


5671 


G03798 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7879. 


153 


66 


5672 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


123 


78 


5673 


AF090942 


Homo sapiens 


PRO0657 


150 


60 


5674 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


123 


78 


5675 


M22332 


Homo sapiens 


unknown protein 


121 


43 


5676 


W40353 


Homo sapiens 


Human unspecified protein from 
US5702907. 


128 


72 


5677 


AL049868 


Homo sapiens 


dJ927M24.2 (KIAA1219) 


403 


95 


5678 


U31785 


Human 

papillomavirus 
type 36 


Dutative 




AA 


5679 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6490. 


101 


7^ 


5680 


AF068229 


Homo sapiens 


lysyl hydroxylase 3 


148 


92 


5681 


Y00358 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


209 


81 


5682 


AB008515 


Homo sapiens 


RanBPM 


142 


100 


5683 


ABO 1 0692 


Arabidopsis 
thaliana 


contains similarity to endo-beta-N- 
acetylglucosaminidase-gene id:K18I2 
3.27 


148 


HJ 


5684 


J02459 


bacteriophage 
lambda 


J (tail:host specificity; 1 132) 


746 


100 


5685 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


194 


69 


5686 


AB049054 


Homo sapiens 


brain iinkprotein-1 


160 


100 


5687 


AF229067 


Homo sapiens 


PADI-H protein 


144 




5688 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


120 


70 


5689 


AP001313 


Arabidopsis 
thaliana 


contains similarity to 
pherophorin~gene_id:T5M7. 14 


106 


47 


5690 


M76976 


Drosophila 
melanogaster 


dorsal-ventral patterning protein 


167 


37 


5691 


L24559 


Homo sapiens 


DNA polymerase alpha 


175 


100 


5692 


AF071777 


Mus musculus 


IRE1 


604 


85 


5693 


AF061741 


Homo sapiens 


retinal short-chain 
dehydrogenase/reductase retSDRl 


704 


99 


5694 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


178 


66 


5695 


AF092092 


Homo sapiens 


AP-3 adaptor complex mu3 A subunit 


653 


92 


5696 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


106 


86 


5697 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


132 


75 


5698 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


116 


40 


5699 


AF161361 


Homo sapiens 


HSPC098 


128 


56 


5700 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


106 


57 
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SEQ JD 


Accession No. 


Species 


Description 


Sraitb- 


% 










Watermnn 


Identit 










Ci« A ra 
oCOIC 


y 


5701 


AF1 19901 


Homo sapiens 


PR0283 1 


113 


76 

/ w 


5702 


K02576 


Homo sapiens 


salivarv nro line-rich nrotein 1 


101 




5703 


G03043 


Homo sapiens 


Human secreted nrotein SEO TO NO* 


105 


/ o 








7124. 






5704 


AF130089 


Homo sapiens 


PRO2550 


172 
ij.ju 




5705 


G00407 


Homo sapiens 


Human secreted Drotein SEO ID NO* 


136 


77 








4488. 






5706 


G02455 


Homo sapiens 


Human secreted orotein SEO ID NO* 


1 13 
i ij 


7S 








6536. 






5707 


AF229067 


Homo sapiens 


PADI-H protein 


134 


O 1 


5708 


W40353 


Homo sapiens 


Human unsoecified nrotein from 


129 


6S 








US5702907 






5709 


R96800 


Homo sapiens 


Human histioevte-secreted factor HSF 


137 


0 1 


5710 


AK024455 


Homo sanien^ 


FLI00047 nrotein 


1 x/O 


J*f 


5711 


Y76381 


Homo ^anien*; 


FrAomf*Tit of* hum^n cpprptf*/! nmfakin 
naguiciii. ui Lituiicui dcvjIclcu piLnciu 












encoded hv p"ene 7'S 






5712 


AF1 16715 


Homo sianien*; 


PRO? 890 


1 J o 


7*\ 


5713 


G00332 


Homo <;anipri<i 


IT 111 I tail bCU-ICLCU. piUlCUl, OEA^ 1U INVJ. 




7 1 








4413 
J. 






5714 


G04078 


Homo ^^nipriQ 
xi win w oapiuud 


Human cf»r"rf*ff=»rl nmf/»in QpA TPi "T\TfV 




70 








8159. 






5715 


GO 1 828 


Homo Sfinien*; 


Human QPiTPfpH r»rnf?»in QT?(^\ TTi WH - 


Q1 


oU 








5909. 






5716 


AF090944 


Homo <vAni#*nc 


JTlNAJAJvvJ 


1 JO 


JO 


5717 


G00382 


Homo •wmipnc 


Human cprrptprt nr^tpin CRO TTi WH' 
rruiiidii bCwicLcu pruieiiij ocy 1JJ inva 












4463. 






5718 


G03807 


Homo sanien^ 


Human QprriM'f'.rf nrntpin ^PO ID NO* 


ivy 


70 








7888. 






5719 


AF 130089 


Homo sanien*; 


PRO2550 




oO 


5720 


G03043 


Homo sapiens 


Human secreted nrotein SFO ID NO* 


144 
i*t*t 


u / 








7124. 






5721 


AF130089 


Homo sapiens 


PRO2550 


146 


KQ 
JO 


5722 


AF1 18086 


Homo sapiens 


PRO 1992 


no 


JO 


5723 


D21230 


Homo sanien*; 


alnha lOfA/^ cnlJa<*/*n pbain 




72 


5724 


AL049569 


Homo sapiens 


dJ37C10 5 i / KIAA04451 


266 


yj 


5725 


X04412 


Homo sanienq 


nlacma pelsolm 


/V 1 


yj 


5726 


A21853 


wnrhetir 

•J J UUJ VII \* 


serine nrnteaQp inhihitnr 

Obi 111b |J>1 \J IUIjC 11UJJUJUJI 




i no 






construct 








5727 


Y16610 


Homo sapiens 


naranlesrin 




yj 


5728 


AF 132021 


Homo sanien*; 

A. 1V4I1V/ tfupiviU 


mvosin X 


613 


X7 


5729 


AL 163491 


Arabidopsis 


outative nrotein 


736 


4/c 






thaliana 








5730 


L19704 


Homo sapiens 


alternative first exon 


561 


86 

*JW 


5731 


D83703 


Homo sapiens 


peroxisome assembly factor-2 


240 


84 


5732 


AB007936 


Homo sapiens 


KIAA0467 protein 


189 


77 


5733 


L38622 


Mus musculus 


mStn3 B f? en e nroduct 


780 


94. 


5734 


AF051944 


Gallus gallus 


Xin 


460 


68 


5735 


B07857 


Homo sanieni 


Amino acid sequence of Smadl 


172 


67 








interacfor nrotein clone S1+12-^ 






5736 


Z14020 


Nicotiana 


Pistil extensin like protein, partial CDS 


105 


44 






tabacum 


only 






5737 


LI 7308 


Gossypium 


nro line-rich cell wall nrotein 


145 


34 






hirsuturn 

UJL1 JUVWlll 






5738 


Y76141 


Homo sapiens 


Human secreted protein encoded by 


109 


72 








gene 18. 






5739 


AF178534 


Homo sapiens 


talin 


583 


82 


5740 


D13435 


Homo sapiens 


PIG-F 


143 


100 


5741 


AC005578 


Homo sapiens 


F20887 J, partial CDS 


584 


86 
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CPA ITk 

ID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


5742 


AC005578 


Homo sapiens 


F20887J, partial CDS 


470 


85 


5743 


AJ010479 


Homo sapiens 


kinesin-like protein 


102 


23 


5744 


M19419 


Mus musculus 


proline-rich salivary protein 


107 


42 


5745 


U29377 


Caenorhabditis 
elegans 


rysine-rich 


97 


44 


5746 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


90 


67 


5747 


D84223 


Homo sapiens 


leucyl tRNA synthetase 


165 


83 


5748 


J04990 


Homo sapiens 


cathepsin G 


165 


100 


5749 


Y36203 


Homo sapiens 


Human secreted protein #75. 


122 


77 


5750 


V00488 


Homo sapiens 


alpha globin 


181 


94 


5751 


Ml 8642 


Homo sapiens 


lactoferrin 


122 


96 


5752 


W15286 


Homo sapiens 


Soluble type I insulin-like growth 
factor receptor. 


481 


87 


5753 


Z98595 


Schizosaccharom 
yces pombe 


coronin-Iike protein 


192 


38 


5754 


AL035683 


Homo sapiens 


dJ1063B2.1 (Beta-1,4- 
galactosyltransferase) 


239 


100 


5755 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


170 


61 


5756 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


105 


61 


5757 


D49490 


Homo sapiens 


protein disulfide isomerase-related 
protein (PDIR) 


669 


96 


5758 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


175 


72 


5759 


X61451 


Mus musculus 


F41 


130 


100 


5760 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


91 


61 


5761 


AL1 60493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


130 


76 


5762 


AF 130079 


Homo sapiens 


PR02852 


126 


61 


5763 


AF143946 


Homo sapiens 


transcriptional activator SRCAP 


288 


40 


5764 


M80627 


Homo sapiens 


helix-loop-helix protein 


127 


88 


5765 


AF038007 


Homo sapiens 


FICI 


290 


47 


5766 


G01881 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5962. 


427 


97 


5767 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


172 


75 


5768 


B12884 


Homo sapiens 


Human alpha-bradeion protein 
sequence. 


465 


70 


5769 


Y66645 


Homo sapiens 


Membrane-bound protein PRO 13 10. 


347 


100 


5770 


AB033168 


Mus musculus 


nuclear protein ZAP 


115 


39 


5771 


X83575 


Cricetulus 
griseus 


CHOI antigen 


113 


57 


5772 


AF135422 


Homo sapiens 


GDP-mannose pyrophosphorylase A 


764 


91 


5773 


L38644 


Rattus 
norvegicus 


karyopherin beta 


121 


68 


5774 


AB020746 


Arabidopsis 
thaliana 


protein kinase-like protein 


107 


33 


5775 


Y17920 


Drosophila 
melanogaster 


CALO protein 


128 


31 


5776 


AK000309 


Homo sapiens 


unnamed protein product 


988 


98 


5777 


AF0 10404 


Homo sapiens 


ALR 


469 


97 


5778 


AB045131 


Homo sapiens 


NADPH-dependent retinol 
dehydrogenase/reductase 


126 


92 


5779 


AF197927 


Homo sapiens 


AF5q3l protein 


530 


95 


5780 


X79198 


Homo sapiens 


host cell factor 


119 


74 


5781 


X69838 


Homo sapiens 


G9a 


157 


41 



283 



NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


5782 


Y94450 


Homo sapiens 


Human inflammation associated 
protein #7. 


227 


44 


5783 


AF216804 


Rattus 
norvegicus 


nuclear matrix transcription factor 


1539 


98 


5784 


R06463 


Homo sapiens 


Derived protein of clone ICA13 
(ATCC 40553). 


638 


91 


5785 


AF000149 


Mus musculus 


ATP-binding cassette transporter 


292 


41 


5786 


D89974 


Homo sapiens 


grycosylphosphatidyl inositol-anchored 
protein GP1-80 


490 


78 


5787 


AF090942 


Homo sapiens 


PRO0657 


132 


63 


5788 


X56932 


Homo sapiens 


23 kD highly basic protein 


286 


100 


5789 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
. 7253. 


114 


63 


5790 


29721 J 


Schizosaccharom 
yces pombe 


kinesin-like protein 


346 


60 


5791 


AF134707 


Homo sapiens 


disintegrin and metalloproteinase 
domain 19 


144 


95 


5792 


Z19550 


Homo sapiens 


N-acetyllactosaminide beta- 1 ,6-N- 
acetylglucosaminyltransferase 


547 


76 


5793 


AF090930 


Homo sapiens 


PRO0478 


103 


80 


5794 . 


Y59879 


Homo sapiens 


Human normal uterus tissue derived 
protein 42. 


407 


97 


5795 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


149 


72 


5796 


AF077208 


Homo sapiens 


HSPC022 


682 


100 


5797 


D42138 


Homo sapiens 


PIG-B 


597 


86 


5798 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


151 


75 


5799 


M29487 


Homo sapiens 


integrin alpha subunit precursor 


160 


80 


5800 


D87438 


Homo sapiens 


Similar to a C.elegans protein in 
cosmid C14H10 


354 


85 


5801 


W54282 


Homo sapiens 


Protein sequence of the di-alpha 
haemoglobin gene contained in pSSl . 


195 


97 


5802 


AB030176 


Homo sapiens 


peptidylarginine deiminase type II 


107 


100 


5803 


D38441 


Homo sapiens 


acylamino acid-releasing enzyme 


156 


71 


5804 


Y55939 


Homo sapiens 


Human GEK2 protein. 


157 


96 


5805 


X89626 


Bos taurus 


240K protein of rod photoreceptor cng- 
channel 


92 


42 


5806 


AC006963 


Homo sapiens 


similar to Kelch proteins; similar to 
BAA77027 (PID:g4650844) 


177 


38 


5807 


Y86275 


Homo sapiens 


Human secreted protein HSRGW16, 
SEQIDNO:190. 


436 


77 


5808 


D50577 


Mesocricetus 
auratus 


carboxylesterase precursor 


328 


55 - 


5809 


AF272148 


Homo sapiens 


KRAB zinc finger protein 


593 


68 


5810 


W57260 


Homo sapiens 


Human semaphorin Y. 


651 


96 


5811 


AF202893 


Mus musculus 


Ki£21b 


491 


82 


5812 


AC006963 


Homo sapiens 


similar to Kelch proteins; similar to 
BAA77027 (PID:g4650844) 


219 


38 


5813 


AB017114 


Homo sapiens 


AD3 


142 


57 


5814 


AC007954 


Homo sapiens 


unknown 


678 


100 


5815 


AF124726 


Homo sapiens 


acinusL 


790 


93 


5816. 


AB015330 


Homo sapiens 


HRIHFB2007 


167 


36 


5817 


AB020684 


Homo sapiens 


KIAA0877 protein 


699 


57 


5818 


Z15025 


Homo sapiens 


Bat2 


172 


49 


5819 


X95761 


Mus musculus 


new:Rhobin 


549 


79 


5820 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


166 


68 


5821 


X55126 


Mus musculus 


Zfp-29 


633 


93 


5822 


Y99366 


Homo sapiens 


Human PR01475 (UNQ746) amino 


452 


98 
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SEQID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Watennan 

Score 


% 

Identit 

y 








acid sequence SEQ ID NO:88. 






5823 






9RFR3 


Oil 


yj 


5824 


M27878 


Homo sapiens 


DNA binding protein 


462 


64 


JO<* J 


I 1JOZ.U 


tiomo Sapiens 






/in 


5826 


AF152510 


Homo sapiens 


protocadherin gamma A3 short form 
pidicin 


525 


92 


5827 


M23451 


Achlya 

omnicAviiQlic 

djnoiscAAidiis 


steroid receptor 


337 


61 


5828 


AJ0 11305 


Homo sapiens 


guanine nucleotide exchange factor, 
delta subunit 


132 


100 




I ojjOj 


Homo sapiens 


Human nomoiogue 01 ujnu-j3 vxis- 
usHKyjo//.) sequence. 


/lU 


A£ 

yo 


JOJU 


TTft3977 


JJTOSOpilUa 
lliVLallVgcLb LCI 


cioned oy auiiiiy to aires 1 me ceu cycie 
wnen CAprcsoCu in me iission yeast 

OL/lllZAJouvAyilalUllljr UCj L/UIUUC 


Zj6 


**o 


5Jttl 

JOJ 1 


t\jr 17JJTA 


A/fiic mucf nine 
IVJLUb UlUol/UlUb 


r?r r NJ9nln)in 
VJV/lNxaipila 


*toy 


OJ 


5ftt2 


AF97996S 


riuiiiu :>apiciio 


puiauve amuii uanspUilGi l 


940 


y 1 


JOJJ 


AXvu& j yyo 


riuiiiu sapiens 


uiuiajjitju proiem pruuiivt 


^R7 


07 


5834 


AK02399R 


T-Trim rs cirnipTic 

AlXJlUKJ Cx3|JlC10 


nun Amp/I r\rAtpir» T\rAHi \rt 
UHllQJAJCtl piLrLCUJ pjUUUlsL 




i»o 


JOJ J 


AF071059 


IVIUo mUSV/UlUo 


/•uiu img vi ixiNrv uinaing proicin 


94^ 




5836 


AF17RS34 

rvr 1 / ojjt 


Wnmn carti^nc 

I LKJl I l\J cxajJICllJ 


tat in 


46^ 

fOj 


69 


5837 


J04569 


Homo sapiens 


glial fibrillary acidic protein 


102 


55 


S818 


AFftfiR/v?*; 
rVj UOOOzJ 


TiAite mtic/*ii1nc 
IvlUS lilUSCuilio 


jl/in/\ cyiosme-j memyiuansierase ja 


1 JO 


so 
oy 




A Rft3773Q 


tiomo sapiens 


ITT A A 1 ^ 1 8 nrn+Mn 

Jvi/v/ii jio protein 




07 

yi 


5840 


R86865 


Homo sapiens 


Human protocadherin pc3. 


660 


96 


JOH- 1 


vjUjlrfo 


Homo sapiens 


xiuman secretea protein, otiv^ wj nu. 

7194 


11 / 




5842 


X51486 


Gailus gallus 


cNFI-Al protein (AA 1-522) 


590 


86 


5843 


U65785 


Homo sapiens 


150 kDa oxygen-regulated protein 

LIKrl jU 


200 


95 


JOTT 


If 1/51 96 
u 10 I/O 


— : 

Homo sapiens 


PA A A 


o/y 


y/ 


5845 


AL035071 


Homo sapiens 


dJ1085F17.13 (ortholog of mouse 
DNMT3B (DNA Cytosine-5 
Methyltransferase 3B3) (isoform 3)) 


717 


100 


jo40 


vi neon 


Homo sapiens 




505 


81 


5847 


AC004812 


Homo sapiens 


similar to human HsGCNl U77700 
(rii>.g22i>zj7oj; similar to yeast 
translation activator GCN1 

^JrlL'.g^O^ lOo J 


316 


100 


5848 


M77246 


Rattus 
norvegicus 


beta-chain clafhrin associated protein 
complex Fix"*. 


611 


89 


SR4Q 

JO*T7 


AFftft7ft33 

j\r\t\M 1 0 j j 


jtiomo sapiens 


Kruppei-reiaieu zmc linger proiein 
hcKrox 


JOi* 


ly 


5850 

jo Jv 


AF0S6617 


XlUIIlU sop 1 Clio 


D W O^IXZ aSSUdolCU zuic-iingcr 

protein BAZ1 


1 / J 


DC 


5851 

JO J 1 


AJJOJtO 


JTLUIllU bapiCud 


inc naU7jo gene proauci is novel. 


4AQ 


OJ 


5852 


G00S98 


TJV\mA compno 
XT L> lllO SapiCUS 


Hi i man C&f*r&tt*r\ nrnfpin CCA TT\ XT A. 

nuuian bc^rcieu proiem, ocy jjj inv/« 
4609. 


1O0 


JJ 


5853 


D87071 


T-tr»m r\ canipnc 


bunuai w k^.cicgdns pruicm encoded in 


1 

LOO 


7A 
/O 


5854 


M64979 


Homo sapiens 


glial fector-1 


557 


83 


585S 

JO J J 


H0 1480 

VJ v lHOv 


riomo sapiens 


xiuman secreted protein, oc»\i iyj inu. 
5561. 


Ann 


oZ 


JO JO 


Yft/;i4< 
AAJ0j*#j 


Xenopus laevis 


lam in a (aa i-Zoj) 


1 AO 


iCO 
DO 


5857 


AB0I7499 


Orvctciln&H q 
cuniculus 


i^i^u|iiuicui ivvvcpiui jxe.iai.wu nuicui J 


10fi 

1"U 


4n 


5858 


D43633 


Oryzias latipes 


G protein-coupled seven- 
transmembrane receptor 


281 


50 


5859 


AF067165 


Homo sapiens 


zinc finger protein 3 


669 


87 
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Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


5860 


AL031118 


Homo sapiens 


dJ153G143 (novel C2H2 type Zinc 
Finger protein) 


218 


38 


5861 


U13897 


Homo sapiens 


homolog of Drosophila discs large 
protein, isoform 1 


130 


35 


5862 


M74002 


Homo sapiens 


arginine-rich nuclear protein 


80 


23 


5863 


X67855 


Oryctolagus 
cuniculus 


brain calcium channel BII-1 


147 


66 


5864 


U35376 


Homo sapiens 


repressor transcriptional factor 


475 


67 


5865 


AF251038 


Homo sapiens 


GAP-like protein 


325 


58 


5866 


L46815 


Mus musculus 


DNA binding protein Rc 


409 


66 


5867 


S68944 


Rattussp. 


Na+/Ci(-)-dependent neurotransmitter 
transporter 


582 


95 


5868 


X97675 


Homo sapiens 


plakophilin 2b 


159 


93 


5869 


AF152478 


Homo sapiens 


protocadherin alpha 13 short form 
protein 


680 


100 


5870 


AF1 18082 


Homo sapiens 


PRO1902 


130 


48 


5871 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


149 


49 


5872 


A61971 


unidentified 


MCSP 


621 


82 


5873 


254206 


Bovine 
herpesvirus 1 


UL51 


104 


33 


5874 


AF069765 


Homo sapiens 


signal recognition particle 72 


674 


97 


5875 


AF179428 


Homo sapiens 


DNA polymerase zeta catalytic subunit 
variant 1 


692 


100 


5876 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


119 


48 


5877 


Y94990 


Homo sapiens 


Human secreted protein vb21 1, SEQ 
IDNO:20. 


446 


98 


5878 


L08811 


Drosophila 
melanogaster 


adherin 


264 


42 


5879 


D86983 


Homo sapiens 


similar to D.melanogaster 
peroxidasin(Ul 1052) 


429 


100 


5880 


AF152508 


Homo sapiens 


protocadherin gamma A2 short form 
protein 


664 


100 


5881 


AF152508 


Homo sapiens 


protocadherin gamma A2 short form 
protein 


660 


98 


5882 


AF190018 


Mus musculus 


K-Cl cotransporter 


199 


88 


5883 


AB023229 


Homo sapiens 


KIAA 10 12 protein 


107 


83 


5884 


U53420 


Rattus 
norvegicus 


sodium-calcium exchanger form 3 


595 


98 


5885 


AF039019 


Homo sapiens 


zinc finger DNA binding protein 89 
kDa 


608 . 


92 


5886 


AF207661 


Homo sapiens 


sodium bicarbonate cotransporter-Iike 
protein 


701 


100 


5887 


AF130051 


Homo sapiens 


PRO0898 


115 


74 


5888 


AF152508 


Homo sapiens 


protocadherin gamma A2 short form 
protein 


720 


100 


5889 


AF251079 


Homo sapiens 


heptacellular carcinoma novel gene-3 
protein 


741 


98 


5890 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


118 


51 


5891 


AF157326 


Homo sapiens 


T1P120 protein 


571 


85 


5892 


AF129170 


Homo sapiens 


apolipoprotein E receptor 2 


630 


99 


5893 


XQ5561 


Homo sapiens 


alpha- 1 chain precursor (AA -27 to 
917) (2953 is 2nd base in codon) 


197 


45 


5894 


AJ238374 


Homo sapiens 


putative protein TH1 


641 


96 


5895 


X63526 


Homo sapiens 


homologue to elongation factor 1- 
gamma from A.salina 


709 


97 


5896 


Y11651 


Homo sapiens 


phosphate cyclase 


521 


80 
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SEQ n> 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


5897 


Y 18448 


Homo saDierm 


Raccoon nrotein 


681 


100 


5898 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


138 


63 


5899 


AF152473 


Homo sapiens 


protocadherin alpha CI short form 
nrotein 


733 


100 


5900 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


133 


66 


5901 


R06463 


Homo saDtens 


Derived protein of clone 1CA13 
(ATCC 40553). 


522 


99 


5902 


X12928 


Triticum 
aestivum 


HMW glutenin subunit 5 (AA 1-848) 


169 


25 


5903 


X61048 


Hydra sp. 


mini-collagen 


88 


33 


5904 


AF 11 8082 


Homo saoiens 


PRO1902 


247 


60 


5905 


A00279 


synthetic 
construct 


Human serum albumin 


482 


84 


5906 


T14722 cdl 


Homo sanien<i 


01 -DEC- 1992 Human CD27 antieen 
cDNA. 


431 


94 


5907 


Y73475 


Homo sapiens 


Human secreted protein clone ye7_l 
protein sequence SEQ ID NO: 172. 


271 


94 


5908 


AF175265 


Homo sapiens 


vacuolar sorting protein 35 


645 


92 


5909 


Y12713 


Mus. musriiliis 


Pro-Pol-dUTPase Dolvorotein 

A A\f^A KM A KAKJ JL X U'3v) r v r 


125 


50 


5910 


G00360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4441. 


115 


58 


5911 


W88762 


Homo sapiens 


Polypeptide fragment encoded by gene 
20. 


152 


50 


5912 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


154 


68 


5913 


G02827 


Homo saniens 


Human secreted nrotein SEO ID NO* 
6908. 


209 


71 


5914 


AB046048 


Macaca 
fascicular is 


unnamed portein product 


143 


49 


5915 


S80644 


Sus scrofa 


glutaminase; GA 


338 


85 


5916 


AF213393 


Mus musculus 


ATP-binding cassette protein 


92 


95 


5917 


X52622 


Mus musculus 


integrase (409 AA) 


425 


45 


5918 


U49973 


Homo saniens 

AlVlllV K\4A±*J 


ORF2: function unknown 


114 


64 


5919 


S71403 


Mus musculus 


m33-A isofonn 


120 


42 


5920 


AF090931 


Homo sapiens 


PRO0483 


94 


89 


5921 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


99 


36 


5922 


AJ006770 


Cicer arietinum 


extensin 


96 


32 


5923 


AJ250193 


Mus musculus 


muscle protein 637 


730 


72 


5924 


AF229067 


Homo sapiens 


PADI-H protein 


278 


63 


5925 


X924S5 


Plasmodium 
vivax 


oval 


99 


50 


5926 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


84 


66 


5927 


AF130089 


Homo sapiens 


PRO2550 


114 


64 


5928 


J00126 


Homo sapiens 


MHC factor B 


237 


100 


5929 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


251 


79 


5930 


AF090895 


Homo sapiens 


PRO0117 


63 


62 


5931 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


147 


37 


5932 


U83857 


Homo sapiens 


Aacll 


209 


80 


5933 


AF068294 


Homo sapiens 


HDCMB45P 


144 


50 


5934 


X92485 


Plasmodium 
vivax 


oval 

Mr 


135 


69 


5935 


L07545 


Leishmania 
tarentolae 


A V was inserted after nt 369 (^t 
10459 in genomic sequence (M10126)) 
to correct -1 firameshift probably due to 


127 


36 
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oxA£ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Wfl tprmn n 
▼ t aiCI Iff 2IH 


Identit 










Score 


y 








gel compression 






5936 


R95913 


Homo sapiens 


Neural thread protein. 


276 


58 


5937 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 


130 


65 








isoform 






5938 


Y16781 


Homo sapiens 


Human secreted protein (clone 


1258 


100 








bhl57J7). 






5939 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


167 


68 








gene 44 clone HTDAD22. 






5940 


U93570 


Homo sapiens 


putative pi 50 


103 


41 


5941 


G03472 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


189 


74 








7553. 






5942 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


110 


47 


5943 


AL3 90114 


Leishmania 


probable (hhv-6) ul 102, variant a 


122 


90 






major 


DNA, complete virion genome 






5944 


AF130089 


Homo sapiens 


PRO2550 


101 


48 


5945 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


86 








6953. 






5946 


X92485 


Plasmodium 


pval 


104 


70 






vivax 








5947 


M29622 


Mus musculus 


open reading frame 2 


132 


64 


5948 


AF130089 


Homo sapiens 


PRO2550 


131 


60 


5949 


AF130089 


Homo sapiens 


PRO2550 


151 


60 


5950 


AF130079 


Homo sapiens 


PR02852 


117 


52 


5951 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


185 


62 








7870. 






5952 


U49973 


Homo sapiens 


ORF2: function unknown 


160 


62 


5953 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


145 


55 








gene No. 107. 






5954 


M22332 


Homo sapiens 


unknown protein 


169 


41 


5955 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


133 


72 








gene 22 clone HMSJW18. 






5956 


U70935 


Peromyscus 


reverse transcriptase 


100 


36 






maniculatus 








5957 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


63 








7871. 






5958 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


112 


48 


5959 


Z49944 


Canis familiaris 


CDC42 GTP-binding protein 


120 


40 


5960 


AF1 18082 


Homo sapiens 


PRO1902 


158 


61 • 


5961 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


65 








7795. 






5962 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


62 








6720. 






5963 


AF1 18082 


Homo sapiens 


PRO1902 


113 


60 


5964 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


65 








7795. 






5965 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


120 


57 








similar to pogo element 






5966 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


306 


77 








4718. 






5967 


G03224 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


41 








7305. 






5968 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


75 








6012. 






5969 


AF1 19855 


Homo sapiens 


PRO 1847 


162 


72 


5970 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


146 


46 


5971 


M19155 


Plasmodium 


S-antigen precursor 


122 


37 






falciparum 






5972 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


121 


66 








gene 22 clone HMSJW1 8. 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


5973 


AL160371 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


107 


60 


5974 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


83 


68 


5975 


G00497 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4578. 


111 


47 


5976 


U93572 


Homo sapiens 


p40 


112 


33 


5977 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


107 


58 


5978 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


120 


70 


5979 


AL160493 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


106 


54 


5980 


M64792 


Rattus 
norvegicus 


salivary pro line-rich protein 


105 


31 


5981 


X66285 


Mus musculus 


HC1 ORF 


86 


36 


5982 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


110 


43 


5983 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


131 


60 


5984 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3 . 


no 


60 


5985 


G02753 


Homo sapiens 


Human secreted Drotein. SEO ID NO* 
6834. 


107 


47 


5986 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


160 


77 


5987 


AF130079 


Homo sapiens 


PR02852 


272 


77 


5988 


U93570 


Homo sapiens 


putative pi 50 


87 


46 


5989 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


202 


89 


5990 


AF130089 


Homo sapiens 


PRO2550 


238 


74 


5991 


AF 118082 


Homo sapiens 


PRO1902 


119 


41 


5992 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


131 


60 


5993 


L27428 


Homo sapiens 


reverse transcriptase 


110 


60 


5994 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


98 


45 


5995 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


189 


90 


5996 


X58236 


Homo sapiens 


36/8-8 fusion protein with epitope for 
anti-lectin antibody 


78 


68 


5997 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


97 


63 


5998 


AP002460 


Arabidopsis 
thaliana 


gene id:FlD9.26~unknown protein 


100 


38 


5999 


U93569 


Homo sapiens 


p40 


103 


52 


6000 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


100 


63 


6001 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


121 


51 


6002 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


115 


64 


6003 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


105 


40 


6004 


AF130089 


Homo sapiens 


PRO2550 


122 


63 


6005 


R95913 


Homo sapiens 


Neural thread protein. 


235 


56 


6006 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


173 


58 


6007 


X92485 


Plasmodium 
vivax 


pval 


96 


53 



289 



WO 01/64835 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


6008 


003790 


nOIDO Sapiens 


Human secreted protein, SEQ ID NO; 

7R71 


97A 


62 


6009 
i 


Y17S32 


Human 
cnuugciivms 

retrovirus K 


env protein 


103 


66 


6010 


AF1 19900 


xiDiLHJ oajJiCUo 


PT?ri9R99 


OI 

yi 


^o 


601 1 

UU 1 X 


G02532 

VJUZ>JJX< 


•l 1UII1U CKlUlCllS 


iiuiiiaJi sccrcieu protein, ocy iu inu. 
6617 






6012 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
cpn rn \jrv 1 67 

Oily IU iNVA lOJ. 


201 


91 


6013 


Y361 S6 

I JUl »/U 




jtiuiDdii soctclcu proicin ttzo. 


7*17 
ZD J 


7A 


6014 


AF 170079 

r\r low 


Hrvmn cstrMAfic 

numu oapiciis 




1 io 
i iy 


7/C 
JO 


6015 


G0037R 

VJviV/ JiO 


UnmA wnipne 


Unman c*»r»rf»tfvl r»mt*>in QPO TT*b Mn- ! 

iiuiiidii scurcicQ pruccuu, ocy 1 u in w. 
44.00 

TTV7. 


1 77 
1 j / 


03 


6016 


AF130089 


Homo sapiens 


PRO2550 


110 


40 




AF010144 


numu ixipiens 


neuronal mreaa protein aju /c-in i r 


1 AO 
14Z 


<7 
J/ 


6018 


AF1 16661 


nuiiiu Sapiens 




1 A7 


oy 


6019 


AF130089 


Homo sapiens 


PRO2550 


169 


71 


6090 


vji/j /yo 


tiomo sapiens 


riunian secretea protein, oc^ iu jnvj. 

7X70 

/o /y. 


zUo 




6021 


AF 1 19851 


Homo sapiens 


PR01722 


147 


65 


6099 


(709^07 


riomo sapiens 


Human secretea protein, oea^ iu nu. 
6588. 


1 on 
139 


4/ 


6097 


API 7AACQ 


Homo sapiens 




142 


66 


6024 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OO 1 A 
12.1**. 


91 


76 


609^ 




"Hu 1 

Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


103 


48 


OUZO 


Uooojj 


Catnarantnus 
roseus 


extensin 


97 


46 


6099 


A 171 7AACQ 
Ar 1 JUUoy 


Homo sapiens 




95 


47 


6028 


X92485 


Plasmodium 
vivax 


pval 


102 


61 


6029 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

0501 

loll. 


149 


53 


6030 


G00643 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


76 


6031 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
oi ->y. 


144 


44 


6079 


AT7A1 14R7 


AraDioopsis 
thaliana 


contains similarity to root cap 
protein-~gene_id:MUF9. 1 6 


1 IO 


IO 

37 


6077 




riasmooium 
vivax 


pval 


9< 
OJ 


>l A 
40 


6034 


AF1 1K0R9 


T-TAmn QflmAnc 
aIUIIHJ tSapiCuS 


PRO1909 


1 19 


JO 


6035 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

*rtJ0. 


121 


59 


6036 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

770^ 


140 


65 


6037 


G00352 


Homo sapiens 


Human secreted protein, SEQ ED NO: 

A All 


107 


71 


607 R 


<7077£Q 

VJVj /07 


nonio sapiens 


Unman conrotoH nrrvtoin Qt«/"\ TT*\ XT/^\. 

nuinan secreicQ protein, ocy iu rsw. 
7870. 


1Aft 

14U 


OJ 


6070 

VVJ7 




Plasmodium 
vivax 


pval 




78 


6040 


AF 130051 


Homo ^aniens 


PRO0898 


107 

IV/ 


69 


6041 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


113 


58 


6042 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


209 


75 
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SEQ n> 
NO: 


Accession No, 


Species 


Description 


Smith- 

Watcrman 

Score 


% 

Identit 

y 


6043 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


98 


56 


6044 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


126 


43 


6045 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


65 


6046 


Y36156 


Homo sapiens 


Human secreted protein #28. 


93 


62 


6047 


AF130079 


Homo sapiens 


PR02852 


205 


83 


6048 


R95913 


Homo sapiens 


Neural thread protein. 


148 


45 


6049 


D38I12 


Homo sapiens 


cytochrome c oxidase subunit 3 


234 


86 


6050 


D38113 


Pan troglodytes 


NADH dehydrogenase subunit 5 


403 


61 


6051 


Ml 1901 


Ratals 
norvegicus 


proline-rich salivary protein 


96 


42 


6052 


AF130089 


Homo sapiens 


PRO2550 


348 


73 


6053 


AF1 19851 


Homo sapiens 


PR01722 


109 


68 


6054 


G03793 


Homo saniens 


Human secreted nrotein SEO ID NO* 
7874. 


168 


66 


6055 


AF161356 


Homo sanien^ 


HSPC093 


83 


58 


6056 


X92485 


Plasmodium 
vivax 


pval 


155 


55 


6057 


G04000 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8081. 


520 


100 


6058 


AF010144 


Homo sanien*; 


neuronal thread nrotein AD7c-NTP 


148 


65 


6059 


W03626 


Homo saniens 


Human thvrotronin GPR N-terminal 
sequence. 


163 


53 


6060 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


300 


68 


6061 


U93563 


Homo sapiens 


putative pi 50 


389 


37 


6062 


AF090895 


Homo sapiens 


PRO0117 


66 


78 


6063 


X92AS5 


Plasmodium 
vivax 


pval 


110 


86 


6064 


AF130089 


Homo sapiens 


PRO2550 


166 


71 


6065 


M22332 


Homo sapiens 


unknown protein 


188 


48 


6066 


Y08062 


Homo saniens 


Human PRG245 nrotein fragment 
derived from DNA35638. 


101 


52 


6067 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


107 


41 


6068 


G00528 


Homo saniens 


Human secreted nrotein SEO ID NO* 
4609. 


99 


75 


6069 


M22332 


Homo saniens 


unknown nrotein 


153 


80 


6070 


AP000616 


Oryza sativa 


similar to RING-H2 fineer nrotein 
RHAla(AF078683) 


109 


82 


6071 


G00403 


Homo sapiens 


Human secreted nrotein. SEO ID NO" 
4484. 


123 


73 


6072 


AF130079 


Homo sapiens 


PR02852 


143 


60 


6073 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


132 


55 


6074 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO) 
7795. 


158 


65 


6075 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


157 


68 


6076 


AF116712 


Homo sapiens 


PR02738 


102 


41 


6077 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


102 


69 


6078 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


109 


60 


6079 


AF1 18082 


Homo sapiens 


PRO1902 


196 


61 


6080 


AF1 18082 


Homo sapiens 


PRO1902 


165 


59 


6081 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


153 


66 



291 



SEQID 
NU: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


6082 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


188 


71 


6083 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


109 


60 


6084 


V00672 


Pan troglodytes 


reading frame protein 4 


185 


85 


6085 


AF130051 


Homo sapiens 


PRO0898 


307 


67 


6086 


AF130079 


Homo sapiens 


PR02852 


96 


72 


6087 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


171 


82 


6088 


AF090895 


Homo sapiens 


PRO0117 


110 


67 


6089 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


101 


46 


6090 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


114 


36 


6091 


Y02925 


Homo sapiens 


Fragment of human secreted protein 
encoded bv eene 101 


111 


46 


6092 


G00366 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4447. 


137 


54 


6093 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


176 


59 


6094 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
&ene 44 clone HTDAD22 


199 


50 


6095 


U70935 


Peromvscus 
manic ulatus 


rftverxe trimmer rnt?i£ft 


216 


44 


6096 


X71442 


Rattus 
norvegicus 


ORF 1 • Dutative 


109 


53 


6097 


AF 119855 


Homo sapiens 


PRO 1847 


88 


71 


6098 


L27428 


Homo sapiens 


reverse transcriDtase 


104 


45 


6099 


AF090895 


Homo sapiens 


PRO0I17 


119 


61 


6100 


U52077 


Homo sapiens 


mariner transposase 


295 


67 


6101 


G03714 


Homo sanien<i 


Human arreted nrotfiin SPO TD NO* 
7795. 




73 


6102 


M22332 


Homo sanien*\ 


unknown nrotein 


132 


44 


6103 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


140 


56 


6104 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


64 


47 


6105 


M64791 


Rattus 
norvegicus 


salivary proline-rich protein 


123 


41 


6106 


Y86248 


Homo saniens 


Human secreted nrotein HCHPF68 
SEQ ID NO: 163. 


139 


69 


6107 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


75 


58 


6108 


AF068294 


Homo sapiens 


HDCMB45P 


141 


43 


6109 


AF1 18086 


Homo sapiens 


PRO 1992 


160 


77 


6110 


M63819 


Plasmodium 
falciparum 


malaria antigen 


62 


46 


6111 


G03093 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7174. 


130 


65 


6112 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


282 


70 


6113 


AF119851 


Homo sapiens 


PR01722 


156 


63 


6114 


AF229067 


Homo sapiens 


PADI-H protein 


134 


48 


6115 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


143 


44 


6116 


D38112 


Homo saoiens 


cytochrome c oxidase subunit 1 


201 


80 


6117 


AF285567 


Microcebus 
griseorufus 


cytochrome b 


185 


58 


6118 


AL138656 


Arabidopsis 
thaliana 


extensin precursor- like protein 


141 


28 


6119 


X92485 


Plasmodium 
vivax 


pval 


165 


61 
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SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO; 






waterman 


ldenut 










Score 


v 


6120 


G03714 


Homo saniens 


Human secreted protein, SEQ ID NO: 


164 


84 








7795. 






6121 


M13100 


Rattus 


unknown protein 


176 


49 






norvegicus 








6122 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


177 


50 








7214. 






6123 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


83 


71 








6566. 






6124 


AF130089 


Homo sapiens 


PRO2550 


109 


85 


6125 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


70 








4718. 






6126 


AF 144054 


Homo sapiens 


apoptosis related protein APR-4 


149 


54 


6127 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


70 








7795. 






6128 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


236 


77 








similar to D020 element 






6129 


U93564 


Homo sapiens 


putative pi 50 


152 


53 


6130 


Y02999 


Homo saniens 


Fragment of human secreted nrotein 


129 


57 








encoded by gene 12 1 . 






6131 


G02538 


Homo saniens 


Human secreted protein, SEQ ID NO: 


127 


56 








6619. 






6132 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


195 


67 








8153. 






6133 


W50192 


Homo sapiens 


Amino acid sequence of salivary 


92 


42 








protein CON-1. 






6134 


AK000496 


Homo sapiens 


unnamed protein product 


158 


59 


6135 


X92485 


Plasmodium 


pval 


146 


55 






vivax 








6136 


Y36203 


Homo sapiens 


Human secreted protein #75. 


133 


57 


6137 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


68 








7867. 






6138 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


106 


36 








gene 44 clone HTDAD22. 






6139 


AF130051 


Homo sapiens 


PRO0898 


77 


65 


6140 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


222 


73 








7874. 






6141 


AF1 19851 


Homo sapiens 


PR01722 


107 


63 


6142 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


57 








7867. 






6143 


R95913 


Homo sapiens 


Neural thread protein. 


114 


70 


6144 


AF068294 


Homo sapiens 


HDCMB45P 


123 


56 


6145 


AF 118082 


Homo sapiens 


PRO 1902 


113 


32 


6146 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


105 


42 


6147 


Y30731 


Homo sapiens 


Amino acid sequence of a human 


938 


100 








secreted protein. 






6148 


U93574 


Homo sapiens 


putative pi 50 


99 


47 


6149 


AF090894 


Homo sapiens 


PRO0113 


145 


55 


6150 


AF130079 


Homo sapiens 


PR02852 


125 


49 


6151 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


48 








7214. 






6152 


G03224 


Homo sapiens 


Human SQCTQted protein, SEQ ID NO: 


89 


60 








7305. 






6153 


Y01398 


Homo sapiens 


Secreted protein encoded by gene 16 


90 


82 








clone HNGFR75. 






6154 


X92485 


Plasmodium 


pval 


110 


42 






vivax 








6155 


S79410 


Mus musculus 


nuclear localization signal binding 


118 


48 








protein 






6156 


GO 1495 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


170 


89 
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SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 








JJ / vl. 






6157 


X92485 


Plasmodium 


p va i 


1 S7 


57 






vivax 








6158 


G02872 


UnmA cantcn<i 


Human <5PrrptpH nrAfpin CPH TTi MO* 
xxuiuaix ovucicu uiuiciii, ocy ul/ liv. 


lUO 


5R 








n95^ 

U7JJ> 






6159 


U36794 


Gonirnix 

V/V l UU 111 A. 


OvtArhrrMTiP ayiHqcp eiihnnif 1 
Vvjr lUwlUUlllv UAlUadv SUUUllll 1 


70 


7^ 






ianonim 








6160 


AF 11 8082 


Hnmo canienc 


PR0 1007 


1 10 

i iy 


00 


6161 


G00637 


Homo <*aniftns 


Human cprrptprl rvrnt*>in CPH 1T> "MO* 
xlUIUail SCUICICU piUlElll, uCy 117 IN W. 


1 S4 

1 J*r 


(LA 








471 X 






6162 


GO 1984 


Homo saniens 

XXU1XLV OOpii'lld 


Human <y»rrf*tf>H nrnti»in QPO TYi "MO* 
XlUtuaix oCUICLCU piULCLll, oJj»v^ JLL7 1> v7« 


1 07 

1U f 


^7 








UuUJ. 






6163 


M22332 


Hnmn canipne 
x xviixiu aa^iwu 


iinVnAwn nrntpin 


17^ 




6164 


W48351 

YY *tOJJ X 


Hrtmn canipnc 
xhjixili oapiciio 


nixiiiatx u i cab i od.uL/ci reiaieu proiein 


1 fSX 
lOJ 










T%n?R7 






6165 


GO? 879 


xxLHllLI oaDlCllb 


xTUuiaQ secreiea protein, ocv^ ID INC. 




04 








U7JJ. 






6166 


G00689 


Homo canipnc 


Human cf-rn»tp*H nrnfr*in QPO TFi"MO* 

XXU1UOI1 jCtlglCU Ul UlCLU, OJJrV£ LL7 lNV^. 


iox 

1VO 










4770. 






6167 


AF010144 




XlCUIUIlai UllCaU piUtCIU /VJL»/C-1N If 


1 ^0 


yy 


6168 


D38112 




Lyu/L/i if L/ii ic c OAiciasc suoumi j 


DOZ 


ft5 


6169 


nun n 


r all LTUgJUUyiCb 


iN/vL7xi uenyarogenase suoutju *+ 




CI 

31 


6170 

V/X l\f 


GO? 51? 


xiuiiiu sapiens 


riuman secreieu protein, ola^ ijj inu. 




QC 
OD 








OO 1 J. 






6171 

VI X / 1 


G017B6 


xiomo Sapiens 


ruirnan secreteo protein, or\i vu in (J. 




to 








7X^7 






6172 


GO? 87? 


XxLHliLI SdpiCilS 


iiuniaD secreieu protein, on>\i id inc 


l JO 
















Ux 1 o 


K"0170/> 


rtomo Sapiens 


salivary proline-rich protein precursor 


1 A>l 

104 


36 


6174 


YRn74R 


nuiiiu sapiens 


num an secreieu protein nv^nrroo, 


1 Jo 


75 








ocy ixv inw. luj. 






617^ 

Vl/J 


ITQ15/;Q 

U7JJQ7 


nomo sapiens 


p^fu 




40 


6176 


X70141 


INieOllaHa 


extensin 


OA 








cv/i ctn c 

syivcsuis 








6177 




xiuiiiu sapiens 


xiuman secreieu protein nLnrroo, 




Ol 














6178 


AF1 19855 


linmn cam'pne 
XiUJllL? oapicilo 


PRO 1X47 


1 lA 




6179 


Y161nY> 

X JUJUO 


ixoino Sapiens 


r ragmen 1 01 numan secreteo proiein 


no 


Oo 








AnroHpH hv or An a ^ 

C11UUUCU I7y cCllC D. 






6180 


X03717 


hattia caniprtc 

nuniu sapiens 


puu uniuenixiiea reaamg name 


117 


*frZ 


6181 


U49971 


Hattia csni'piic 
IXVJ111U aOpiCllo 


v7xvT 1, IVxxjIVj /, pUlaUVe uallSpOSaSe 


7AA 


/o 








ciiYiflor to nAPA plf-tnpnt 

OUAtllCU l\J ^IfgU WivXIX*51XI> 






6182 


AL3597R2 


Trvnan o<iom a 
x x y yaxxKj jkjmxo. 


tviQQfhlp /Tihv-/»^ ill 1 07 i/flrianf a Hno 
pv/ooiuic ^iixiw^ ux tU&> } ValxallL a llila, 


110 
1 111 


71 
to 






hrucei 


cotnnlete virion penome 






6183 


D38112 


Homo Qanipris 


NADH rfphvnVotypnaQp ^nhnnit *\ 

XYZXLxXX Ui. Ugwlaob dULIULUL J 


740 


oft 


6184 


Y27868 


Homo canipnc 


Human cp^rptpH nrotpin pnroHpH Kv 

XXlXiXiaXI Ovvl CICU L/1UIG1XX CXlw\JUCLl uy 


107 
J LIZ 


47 








eeneNo 107 






6185 


X86031 


Su<» scrofa 


homolopue of*nrnlinp7araininp-rirh 


99 

yy 


40 








antibacterial nrotein 






6186 


Y36156 


Homo ^aniens 


Human secreted nrotein #7X 

XIUIUOU OwululuU UlUivlU TT4*0, 


196 


75 


6187 


AF 169387 


T^4ii<; mn^mlnc 

lYXlXD lUUOUulUj 


alr>Ha pnllaopn TV 
axpxia j istsiiagwij i v 


111 
111 


JO 


6188 


AF1 94537 

ill 4 / 


nuJiiu aopjcxio 


liV~\LJ I J 


1 Jo 


17 
O / 


6189 


*\J I OJU 


VlOTIJI 


pYtpncin place 1 nrotpin 

CAlvlXdlll UlCloo 1 pi LI LC ill 


1n7 
10 / 


JJ 






i in oi i i f*i 1 1 sttfi 
uxiguiuuiaux 








6190 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


175 


63 








8153. - 






6191 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


123 


46 








gene No. 107. 






6192 


X92485 


Plasmodium 


pval 


115 


54 



294 



WO 01/64835 



PCT/US01/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Identit 










Score 


y 






vivax 








6193 


U49973 


Homo sapiens 


OKr l; MJbK3 /, putative transposase 




OJ 








similar to pogo element 








Woo627 


Homo sapiens 


secreted protein encoaea oy gene y* 


o/. 


^4 








Clone rarJVlDi^Z- 






£.y AC 

6195 


G0o790 


Homo sapiens 


riuman secretea protem, ocv^ iu 


Ol A 
Z14 


DO 








lot l. 






6196 


Y 19767 | 


Homo sapiens 


orAj JJJ JN\j 4oD rrom wuyyzzz*fj. 


5/ 


7Q 

/y 


6197 


Y08062 


Homo sapiens 


Human PR0245 protem fragment 


IIS 

1 lo 


/1 1 






aenvea trom uw A3 jo jo. 






6198 


X62691 


Homo sapiens 


ribosomal protein homologous to yeast 


Zl J 


ID 








I>Z4 






6199 


G02872 


. - 
Homo sapiens 


riuman secretea protem, ocy id invj. 


inn 


HO 












6200 


G02994 


- 

Homo sapiens 


Human secretea protem, &e\i id inu. 




1 3 












6201 


V007 1 1 


. 

Mus museums 


cytochrome oxidase I 


Hj J 


2rZ 


6202 


U93572 


Homo sapiens 


putative pi 50 


oo 


03 


6203 


A C C~t C 

AF265575 


Homo sapiens 


ubiquitous lrR-moui protein i 


1 AO 


/f ^ 








isoform 






6204 


A T"*/"V 1 /\ AAA 

AF010144 


Homo sapiens 


neuronal thread protein AD /c-w ir 


lOJ 


^y 


6205 


M15530 


Homo sapiens 


B-cell growth factor 


111 

1 13 


OU 


6206 


G02872 


Homo sapiens 


Human secretea protem, SJSQ ID NO. 


1Z5 
















6207 


WoooZ/ 


Homo sapiens 


Secreted protein encoded by gene 94 


lol 


77 
/Z 






* 


Clone rlr IYIdV^jZ. 






ozuo 


ArUi0144 


Homo sapiens 


neuronal tnreaa protem j\u i c-jn i r 




7^ 


6209 


a yA i At /I /t 

AF010144 


Homo sapiens 


neuronal thread protem AD /c-in i r 


nn 
IzU 


OZ 


6210 


Y36203 


Homo sapiens 


Human secreted protein #75 . 


ioy 


01 
ol 


6211 


U03790 


Homo sapiens 


Human secretea protem, oc\i id rsu. 




JZ 








7871. 






6212 


G03295 


Homo sapiens 


Human secretea protein, ofcvj tD ino. 


1 *71 


/I? 








7376. 






6213 


D86854 


Catharanthus 


extensin 


1ZU 


/i /i 
44 






roseus 








6214 


G02754 


Homo sapiens 


Human secretea protem, MiQ id nu. 




inn 
1UU 








ooij. 






6215 


1171 "J £ AO 

W33698 


Homo sapiens 


AL-z-long ^AL-21) protem. 




inn 
lull 


6216 


G0193 I 


Homo sapiens 


Human secretea protein, ocv^ tD in^j. 


lft9 
1UZ 


OZ 








6012. 






6217 


U4Ui42 


Mus museums 


ninein 


/ / 


77 


OZlo 


UUUo J / 


Homo sapiens 


riuman secretea protem, anv^ inw. 


174. 
1 ft 










*f / lo. 






6219 


U49973 


Homo sapiens 


OKr J ; MbKJ / j putative transposase 


7^y4 

ZJ4 


j4 








similar to pogo element 






6220 


VM AO C 

X92485 


i IP 

Plasmodium 


pval 


1 fl7 


AA 






vivax 








6221 


003 135 


Homo sapiens 


Human secretea protem, oe,ki id inu. 


lU*f 


Aft 








701/1 






6222 


X86029 


— ji — 

Vigna 


extens in-like protein 


lfU 








unguiculata 








6223 


G03790 


Homo sapiens 


Human secretea protem, o-bv^ id inu. 


91A 
ZiO 


J7 








7C71 

7o/l. 






6224 


AF068294 


Homo sapiens 


HDCMx>45r 


1 CA 


Aft 






t-frvmrv eonipn c 
nUIilU oapiClio 


Unman mlmrviiilin/KTFP' nrntpin 


135 


50 








binding inhibitor 2. 






6226 


AE003859 


Xylelia 


hypothetical protein 


139 


41 






fastidiosa 








6227 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


58 



295 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Wateritian 

Score 


% 

Identit 

y 








4718. 






6228 


AF130089 


Hnmn cam pnc 




1 z.O 




6229 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


190 


88 


6230 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 

I\Xl/\ 1 a \J\T \I / OOO J ) 


124 


77 


6231 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


174 


79 ; 


6232 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


168 


78 


6233 


I I4QQ7^ 


numu sapiens 


jjkt l , ivLcto / , puiau ve iransposase 
similar to pogo element 


1 £4 
104 


o/ 


6234 


Y91577 


nuiiio Sapiens 


Unman PO/>rAta/i nfAfoin camian/tA 

numan secreiea protein sequence 
encoded by gene 2 SEQ ID NO:250. 


Zoo 




6235 




nuino Sapiens 


r tvAJUy /4 


11/1 


4o 




VJvJ xjj 


Homo sapiens 


xluman secretea protein, obv l*-* NU: 

77 14 


1 Cft 


40 


6237 


AT 096R44 

xTL JLi V y \J 0*t*T 


ou cjj luiuy iatss 

coelicolor A3(2) 


putative integral memorane proiem 


oo 




6238 




numu Sapiens 




1 £7 
10/ 


CA 


6239 




nuuiu sapiens 


x ISXJZDJV 






6240 


L78671 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


212 


69 


6941 


I Z / OOO 


riomo sapiens 


Human secreted protein encoded by 

Ann a XT/-. 1/1 *7 

gene ino. iu /. 


102 


54 


6242 


AF1 09907 


Homo sapiens 


S164 


146 


90 




Yft9fi71 
I uzo / 1 


riomo sapiens 


Human secreted protein encoded by 
gene cione rnvioJ wio. 


132 


CO 

52 


6244 


ATVJ V iff 


nouiu Sapiens 


neuronal uireaa protein alj / c-in x jr 


1 "2!? 

1 Jo 


C7 


6245 




nuiuo sapiens 


numan secreiea protein, oHv£ iu invj. 

7R71 
toil* 


1 1 J 




6246 


AF 130089 


Homo sapiens 


PRO2550 


154 


77 


6247 




UnmA cant Arte 

nomu sapiens 


neuronal inreau proiein ai^/c-in i r 


Ol 

y 1 


/IA 

4U 


6248 


Y36156 


Homo sapiens 


Human secreted protein #28. 


105 


43 




r\_T 1 lOOO 1 


Homo sapiens 


t>t> r\ i >t 1 o 
rKU14io 


111 

131 


60 


6250 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


68 


6251 


fifing fli 


riomo sapiens 


numan secretea protein, oJbv IX' JNU. 
4462. 


*7A 

70 


73 






riomo sapiens 


Jtiuman d co i reiateu poiypepnae otxi 


1 AjC 

lOo 


WO 

42 


6253 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


247 


68 


6254 




nuuiu oapiciLb 


xituiidii scorcLcu protein nunrroo, 


117 
11/ 


/-> 


6255 


V00662 


Homo sapiens 


cytochrome oxidase I 


355 


78 


6256 




numu sapiens 


rragmcnt ui numan secreiea protem 

euwjueu uy ^eiie ix-i. 


111 
I J 1 


Oj 


6257 


D38112 


Homo sapiens ~" 


cytochrome c oxidase subunit 1 


176 


86 


6258 


AF061 140 


ArrihwK 
r\A uueus 

jamaicensis 


L>y tOLIHUlIlC C OXlUaDC SUULUllt 111 




oo 


6259 


AFOIfiHQQ 
viur v i v\jyy 


ivxus inuscuius 


enaonucieaSc/revcrse iranscnptase 


1 14 
1 14 


4J 


6260 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
oyuo. 


179 


78 


6261 


unyy to 


nuino sapiens 


vjivr i , rvLcio /, putative transposase 
similar to pogo element 


\yL 


/o 


6262 


G03714 


Homo sapiens 


Human «5prTPtftH nrnfpin SRO IT) NO* 
7795. 


146 


71 


6263 


U09116 


Homo sapiens 


ORF1, encodes a 40 kDa product 


114 


37 


6264 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


88 


60 



296 



WO 01/64835 



PCT/US01/04927 



SEQID 
iNL>: 


Accession No. 


Species 


Description 


Smith- 

waterman 

Score 


% 

Ideotit 

y 


6265 


U49974 


Homo sapiens 


mariner transposase 


206 


71 


6266 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


71 


72 


6267 


AF097473 


Mus musculus 


ORF1 


86 


40 


6268 


X70343 


Nicotiana 
sylvestris 


extensin 


95 


33 


6269 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


94 


84 


6270 


L27428 


Homo sapiens 


reverse transcriptase 


132 


48 


6271 


AF1 18082 


Homo sapiens 


PRO1902 


153 


53 


6272 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


222 


91 


6273 


X03145 


Homo sapiens 


pot ORF V 


89 


35 


6274 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


90 


71 


6275 


U93572 


Homo sapiens 


p40 


83 


43 


6216 


AF058700 


Bos taunis 


ubiquitin-S27a fusion protein 


148 


70 


6277 


AF016099 


Mus musculus 


endonuclease/reverse transcriptase 


96 


65 


6278 


AF130079 


Homo sapiens 


PR02852 


179 


86 


6279 


AF161356 


Homo sapiens 


HSPC093 


108 


57 


6280 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


109 


51 


6281 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


130 


58 


6282 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


81 


69 


6283 


AF068294 


Homo sapiens 


HDCMB45P 


98 


37 


6284 


AF1 16715 


Homo sapiens 


PR02829 


112 


68 


6285 


X92485 


Plasmodium 
vivax 


pval 


92 


64 


6286 


U93570 


Homo sapiens 


p40 


90 


38 


6287 


Y36203 


Homo sapiens 


Human secreted protein #75. 


225 


62 


6288 


Z11793 


Homo sapiens 


selenoprotein P 


220 


65 


6289 


X92485 


Plasmodium 
vivax 


pval 


125 


70 


6290 


X52634 


Mus musculus 


tim protein 


114 


45 


6291 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


94 


36 


6292 


AF1 16661 


Homo sapiens 


PR01438 


144 


46 


6293 


L27428 


Homo sapiens 


reverse transcriptase 


81 


44 


6294 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


152 


51 


6295 


AF130089 


Homo sapiens 


PRO2550 


137 


35 


6296 


L27428 


Homo sapiens 


reverse transcriptase 


96 


53 


6297 


BO 1372 


Homo sapiens 


Neuron-associated protein. 


134 


69 


6298 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


110 


58 


6299 


AF130089 


Homo sapiens 


PRO2550 


110 


61 


6300 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


136 


63 


6301 


AF220264 


Homo sapiens 


MOST-1 


172 


90 


6302 


AF130079 


Homo sapiens 


PR02852 


311 


71 


6303 


AF068294 


Homo sapiens 


HDCMB45P 


136 


53 


6304 


G03789 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7870. 


110 


62 


6305 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


113 


45 


6306 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


166 


58 



297 



WO 01/64835 PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


6307 


S85655 


Unrna Qanipnc: 






Q1 


6308 


Q037S6 


iJLVJilLv/ oCL^JIV^HO 


Human cprrpfpH nrofpm <?FO m TsIO* 

7867. 


i jj 


Oh- 


6309 


G03714 


Homo saDiens 


Human secreted orotein SEO ID NO* 
7795. 


151 


7^1 


6310 


AF1 19851 


Homo sapiens 


PR01722 


160 


67 ~" 


6311 


L27428 


Homo sanien*; 


rpv/pnift tran<!PTrinta^p 

IvTvlOv U CUlgvl lUUldw 


01 


so 


6312 


AF130089 


Homo sapiens 


PRO2550 


140 


55 ^ 


6313 


AF1 30089 


Unmn cantpno 


PR 07^0 




Oo 


6314 


X15917 


x ai cuucwui j j 
aurel ia 

uUlwlW 


ORP/l nrotpin /'A A 1 .1 


07 


IS 
JO 


6315 


Y08062 


Homo <«inipn<; 


Human PT?07A*i rimtpin frjifnYi pnt 

derived from DNA35638. 


OA 


J 1 


6316 


Y36203 


Homo <%anipTi«; 


Human ^pprptpH nrntpin ii7^ 

IIULIICLU ^<L1C/ICU \J\ W Lt 111 TT/->. 






6317 


L27428 


Homo sapiens 


reverse transcriptase 


117 


56 "j 


6318 


Ml 1900 


A/flK TTlllQfllltlC 


ij-ibXJa proiuie-ncn sauvary proxein 




At 


6319 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AA 


175 


69 


6320 


IJ92673 


Fnrvttipnpo cn 

j^uiyuiciics sp. 

•AUS-3F 


cyiocnrome oxidase i 


jy 


3/ 


6321 


Y02693 


xiuiuu aapicUd 


jriuxndii 2>ccrcieu protein encouea oy 

nana /M rlrmp HTT^ A TY)0 




77 
/Z 


6322 


AK000496 


HnmA canJpnc 


uiiiiaiiicu pruicin pruuuee 


7DA 


7H 


6323 


AB047934 


Macaca 

■focfipiil arte 


hypothetical protein 


173 


68 


6324 


AB023485 


Mus musculus 


transcription factor CA150b 


131 


43 


6325 


AF07076? 


Rap tannic 


generaj prorein iransporT ractor pio 


HA. 
/O 


o/ 


rV*26 




riomo sapiens 


alternatively spliced product using exon 
13A 


OQ 

yo 


3/ 


6327 


1 V600U 


rxomv/ Sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


11C 

i I J 


OU 


6328 


R96800 

IvvOwv 


atuihu b dp lens 


xiuman msuocyie-secTetea lacior rior . 


i no 




6329 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/O/v. 


115 


67 


6330 


U43360 


Peromyscus 

rnan if*nlc*'Hic 
UiaUH*uiaiiu> 


reverse transcriptase 


82 


63 


6331 




Wnmn canipnc 
xiuiiiu oapicilo 


7870 

/ O /V. 




S7 


6332 


K01 664 


T^rnsonliilfl 
melanoffaster 

tuvmiivfegjwi 


RVm-lilfp. HiTOtpin 


1 17 


J / 


6333 


M69297 


Homo sapiens 


ORF3 


70 


43 


6334 


U 15647 


^4ti^ rnii^riilnc 

l'luo iiiuoouiuo 








6335 


AF090894 


Homo sanien^ 


PRO0113 


in7 




6336 


AF130089 


Homo ranienc 


PRO2550 




7S 
t j 


6337 


AJ271872 


Nicotiana 
svlvestris 


extensin 


134 


32 


6338 


AJ132694 


Homo sanien^ 


small GTPase raclh 


99 




6339 


AF1 19851 


Homo sapiens 


PR01722 


123 


60 


6340 


G03789 


Homo ^aniPn^ 


Human QprrpfpH nrotpin SPO ir> MO* 

7870. 


71S 


JO 


6341 




JTCriuo apiVUXUd 


CytOCiiTUulC O/vlUdoC X 


10^ 
iyj 


ou 


6342 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


62 


6343 




1^1 ft c m 0 H it im 

vivax 


pval 




oi 


6344 


G00454 


Homo sapiens 


Human secreted protein, SEQ CD NO: 
4535. 


110 


64 


6345 


U93572 


Homo sapiens 


p40 


79 


48 


6346 


X92485 


Plasmodium 


pval 


156 


55 



298 



WO 01/64835 



PCT/US01/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Idcntit 










Score 


y 






vivax 








6347 


G02485 


Homo ^aniens 


Human secreted Drotein. SEO ID NO: 


104 


85 








6566. 






6348 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


158 


77 








SEQIDNO:163. 






6349 


G03786 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


89 


64 








7867. 






6350 


AF130089 


Homo sapiens 


PRO2550 


187 


69 


6351 


U56732 


Rattus 


KRAB/zinc finger suppressor protein 1 


161 


64 






norvegicus 








6352 


U70932 


Peromvscus 


reverse transcriptase 


126 


61 






leucoDus 








6353 


X55687 


Lvconercicon 


extensin (class II) 


89 


44 






esculentum 








6354 


G02872 


Homo ^anien^ 


Human secreted nrotein SEO ID NO* 


107 


57 








6953. 






6355 


G00357 


Homo saDiens 


Human secreted protein^ SEQ ID NO: 


88 


76 • 








4438. 






6356 


Y86248 

A \J V/4-* * \J 


Homo saoiens 


Human secreted nrotein HCHPF68 


98 


61 








SEO ID NO- 163. 






6357 


AF010144 


Homo saoiens 


neuronal thread protein AD7c-NTP 


165 


51 | 


6358 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


214 


58 








gene 44 clone HTDAD22. 






6359 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


75 


88 








4508. 






6360 


X07882 


Homo sapiens 


Po protein 


151 


47 


6361 


Z11592 


Phytophthora 


cytochrome c oxidase subunit 3 


78 


76 






parasitica 








6362 


AF090944 


Homo sapiens 


PRO0663 


130 


46 


6363 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


117 


39 


6364 


AF229067 


Homo sapiens 


PADI-H protein 


155 


75 


6365 


Y27854 


Homo sapiens 


Human secreted protein encoded by 


100 


55 








gene No. 10 1. 






6366 


AK024501 


Homo saDiens 


FLJ001 10 nrotein 

A Jk_/*r w W A Aw JM* vVWLH 


282 


100 I 


6367 


AF 118082 


Homo saDiens 


PRO 1 902 


157 


57 


6368 


G04072 

WV1W 1 x# 


Homo ^aniens 


Human secreted nrotein SEO ID NO* 


151 


63 








8153. 






6369 


G02755 


Homo saoiens 


Human secreted nrotein SEO ID NO: 


161 


96 








6836. 






6370 


G03043 


Homo saDiens 

AAvlAAw aUUiviu 


Human secreted protein, SEQ ID NO: 


124 


63 








7124. 






6371 


AF130079 


Homo sapiens 


PR02852 


297 


77 


6372 


X92485 


Plasmodium 


oval 


129 


42 






vivax 








6373 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


63 








7339. 






6374 


AF1 19851 


Homo sapiens 


PR01722 


251 


65 


6375 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


73 








7871. 






6376 


X70343 


NicorJana 


extensin 


104 


35 






sylvestris 








6377 


G03224 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


70 


52 








7305. 






6378 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


69 








6613. 






6379 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


109 


51 








gene 22 clone HMSJW18. 






6380 


Z38128 


Mus musculus 


histone HI 


113 


36 


6381 


U93573 


Homo sapiens 


p40 


212 


43 



299 



WO 01/64835 



PCT/US01/04927 



seq n> 


Accession Na 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


6382 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


• 109 


40 


6383 


AL096744 


Homo sapiens 


hypothetical protein 


164 


49 


6384 


AF220264 


Homo sapiens 


MOST-1 


119 


56 


6385 


X92485 


Plasmodium 


pval 


108 


65 






vivax 








6386 


X53581 


Rattus 


ORF4 


148 


36 






norvegicus 








6387 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


52 








6720. 






6388 


M13100 


Rattus 


unknown protein 


119 


48 






norvegicus 








6389 


AB047948 


Macaca 


hypothetical protein 


80 


33 






fascicularis 








6390 


X53581 


Rattus 


ORF4 


139 ' 


55 






norvegicus 








6391 


AK025947 


Homo sapiens 


unnamed protein product 


166 


52 


6392 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


244 


75 








7867. 






6393 


M22332 


Homo sapiens 


unknown protein 


166 


39 


6394 


AF229067 


Homo sapiens 


PADI-H protein 


128 


59 


6395 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


70 








6613. 






6396 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


44 








8144. 






6397 


M26361 


Mus musculus 


LINE/Ig H-chain fusion protein 


97 


61 


6398 


AFI18082 


Homo sapiens 


PRO1902 


93 


48 


6399 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


116 


89 








gene 22 clone HMSJW18. 






6400 


U93570 


Homo sapiens 


putative pl50 


88 


43 


6401 


Y45382 


Homo sapiens 


Human secreted protein fragment 


147 


61 








encoded from gene 28. 






6402 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


77 


56 








8153. 






6403 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


47 








6953. 






6404 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


56 








7344. 






6405 


Y64890 


Homo sapiens 


Human 5* EST related polypeptide SEQ 


88 


39 








IDNO:I051. 






6406 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


68 








5066. 






6407 


G00560 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


244 


90 








4641. 






6408 


L27428 


Homo sapiens 


reverse transcriptase 


108 


42 


6409 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 5 


136 


58 








6613. 






6410 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


44 








7870. 






6411 


U70935 


Peromyscus 


reverse transcriptase 


154 


40 






maniculatus 








6412 


AF090931 


Homo sapiens 


PRO0483 


99 


85 


6413 


AL031186 


Homo sapiens 


bK984Gl.l (supported by FGENES) 


249 


78 


6414 


AF130089 


Homo sapiens 


PRO2550 


100 


55 


6415 


AB047963 


Macaca 


hypothetical protein 


165 


57 






fascicularis 








6416 


M22332 


Homo sapiens 


unknown protein 


HI 


42 


6417 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


50 








6913. 






6418 


AF016099 


Mus musculus 


endonuclease/reverse transcriptase 


147 


60 



300 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

ocure 


% 

Identit 

y 


6419 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


194 


84 


6490 




nuiiiu otxpivu5 


Human Qf^rrptprf nrntpin WCWPFfiR 
riuiHaii oCwicicu i/iuiwiu xjwxxti uo, 

<JFO ID NO* 163 


144 

1 ft 


01 


6491 


TJ49Q71 


Hnmn QJiniPnQ 

JLXV/iUV OwJJIvlia 


OPF9* fiinrtinn imlmnwn 


91 


62 


6422 


G02754 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6835 


135 


96 


6423 


AF072508 


Homo sapiens 


envelope protein 


128 


49 


6494 




XTUlliU bapiCiib 


lUUIalliCU. piUlCul piUUUUl 


161 




6425 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 

SUnilal lO pugO C1CU1CIH 


148 


54 


6426 


X92485 


Plasmodium 
vivax 


pval 


103 


75 




VR694R - 


ilUIIlO SiipiCllb 


"Human CArrpfpfl rnv\tf*in HPHPF6R 
liUiiicUi bccicicu pruicm riv_/Xii r \jo 3 

WO TD N0 161 




ou 


6428 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6on 


97 


46 


6429 


AF016099 


Mus musculus 


endonuclease/reverse transcriptase 


100 


51 






riomo sapiens 


xiuman secreiea proiein it ij. 


1 jy 


JO 


6431 


G03263 ! 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

71 44 
/j*Hf. 


112 


47 


6432 


X92485 


Plasmodium 
vivax 


pval 


S3 


72 


6433 


Y35984 


Homo sapiens 


Extended human secreted protein 

eonnonro Q170 TT*I "MO 9TI 

sequence, ony ljl/ xnvj. z-oj. 


121 


59 


6434 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


66 


43 


6435 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6<K3 


152 


51 


6436 


D89501 


Homo sapiens 


similar to salivary proline-rich protein 


88 


38 


6437 


Y02785 


Homo sapiens 


Human secreted protein encoded by 

gCUC Jl C1UDC xlvJJSJ_*A.OJ. 


113 


58 


6438 


X99467 


Medicago 
uuncaiuia 


ENOD20 


no 


37 




I ©OJ ft 


xiomo sapiens 


xiuman gene y i -encoaeu proiein 
fragment, SEQ IDNO:491. 


190 
ij>y 


JO 






DlOPtM A/111 im 

VlVOA 


pval 




4.6 


6441 


AK024455 


Homo sapiens 


FLJ00047 protein 


173 


57 


6449 


VJUJ /7U 


rtoroo sapiens 


Wnman ce*m*tf*A nmtpin QCA TTi NH" 

li urn mi secreieu proicm, ocy ixj nsj. 
7R7I 

(Of 1. 




ji 


6443 

VrrrJ 




Hnrnn cc»t»I f*n <i 


Human ^f*rTi*tf*l nmtf*in $7*5 


104 


56 


6444 


Y02671 


Hnmn canipns 


Human <*ecret£*d nrotein encoded bv 
gene 22 clone HMSJW18, 


138 


50 ! 


6445 


M24732 


Uomn <wmien^ 


lam in-like nrotein 


71 


45 


6446 


AF119851 


Homo sapiens 


PR01722 


163 


69 


6447 




XlvUlv oaulUld 


HH1 nrotein 


98 


53 


6448 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


251 


58 


6449 


AF200187 


cercopithicine 

hpiripcvmic 1 ^ 


EBNA2-iike protein 


125 


38 


6450 

VrtJV 


YR6(V* 1 
A-OOUj 1 




nuuiuioguc ui pruuiic/oA^iiuHc iiuii 
antihacterial nrotein 






6451 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


114 


54 


6452 


AF090942 


Homo sapiens 


PRO0657 


129 


65 


6453 


M19503 


Homo sapiens 


ORF1; putative 


86 


39 


6455 


X92485 


Plasmodium 
vivax 


pval 


99 


40 
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SEQID 


Accession No. 


Species 


Description 


Smitb- 


% 


NO: 








Waterman 


Identit 










Score 


y 


6456 


X86030 


Viann 
v l&iia 


pvtemoin-likp nrofpin 
CAVCllolll ILIN.C pilHClll 


i j j 








nnou feu lata 








6457 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


113 


61 


6458 


AF055985 


O n ch oeerca 


nvTTolidonp-riph antitrpn 


00 

71/ 


37 






volvulus 

▼ v * T UiU J 








6459 


AF1 19851 


Homo ^anien<; 


PRO 1722 


943 


66 


6460 


J05497 


Rattus 




101 


31 






norvpoirn^ 








6461 


AF130089 


XIVFLUU dapiCllo 


PR 09* 50 


183 


If. 
/0 


6462 


W50192 


Hnmn c^nipnc 
nuiiiv oa^icnj 


Amino nctrl QPnnpnrp of enlivens 
7AJJ J 11117 OVlvl oCHjUCiliA? \JX oaJlVdl j 


03 
















6463 


G03789 


HoiYlA caoif»nC 


Human Qprrptprl tvrofpm 51Ff\ FTk MY"V 


199 


79 








7870 
to tv» 






6464 


Y36156 


Homo c^rsifn c 


T-Tnman Cf»rr**rf»rl r\rot*p*iri HDH 
XI Lull all owviCiCU prOlCLU rrZO. 


8^ 


<?9 
DZ 


646 S 


AT 160403 


jueisninania 


prouaoie ^nnv-oj m variant a 


1Z0 


41) 






lllajUl 


wumpiccc virion genome 






6466 


G03133 


riomo sapiens 


niiman seereiea proiem, oeaj iu invj. 


QA 

V4 


3o 








7914 






6467 




rivjjiiu sapicnb 


nuxuaii bccrcica proiem, ocv^ iu invj. 




JO 








787A 
/o/*f. 






6468 




nomo sapiens 


nuiuaa secreiea protein, oci^ iu inl>. 


121 


0/ 








6008 






6469 


X99485 




pval 


fiO 


TO 

/z 






vivav 








6470 


G03786 


Homo oanipn^ 

11U1UV7 a«ipivll3 


Human cprrptpr? r»rot*»in QPH JTt XIO- 
riLUlldil bCUlCLCU piUlCUly OCy IU liW. 


198 


30 








7867. 






6471 


AF 130089 


T-fom c\ Qflni#*n<; 

ilUUJU XJpiVU*) 


PRO9S^0 


1 


8^ 

UJ 


6472 


U93569 


Hauia Qanipnc 


puiaLivc pi J\J 


131 

13 1 


A7 
4Z 


6473 


AF 130079 


T-Tr»m o cAniVnc 
nuiliu oapiCllo 




1 1 1 
111 


04 


6474 


G03133 


Worn A ennipn c 


Human cA/»r^f»H nrnfAin QPH TT\ XIH' 
xlUillail bCviclcU pro lo 111, oCy ULy fHKs. 


1 1J 


5 1 








7914 






6475 


AF220264 


UrntiA canipnc 




111 
ill 


5£ 
35 


6476 


G04067 


rix/iiiu SKipiciio 


xiuman secreiea protein, o&i^ id isvj. 


1^ / 


43 








8148 






6477 


A F0 16000 




endonuc lease/reverse transcriptase 


lUo 


33 


6478 


X99485 


PlocmoHitim 


nuo 1 

pvdi 


OOO 


<< 
33 






vivav 








6480 


G03714 


Haiti n QanipnQ 


Human QPrrpfpH nrotpin QPO IT") "WO* 


1 19 


61 








7705 






6481 


U49973 


Homo «uinien<i 


ORP2* fimrrinn unknown 


06 




6482 


U 15647 




rpvpr<!p tranQr^rintacp 
icvciot? uaiiowipuuC 


111 
ill 


61 


6483 


X99485 

A7£>*tOJ 


PlaQmodiiim 
it ttoiimu mill 


nval 
pv<ii 


181 


4!^ 






vivax 








6484 


L27428 


Hnmn ^anienc 






50 
3U 


6485 


Y73966 


Homo sanietis 


Human nrostate tiimor fraompnt 


144. 


59 








derived nrotein #1 53 






6486 


G02538 


Homo 5anien<i 


Human cprmfpr! nmfpin SPO ID XIO* 


174 


63 








6619. 






6487 


U93569 


Homo sanienQ 


p40 


176 


53 


6488 


Y64890 


Homo ^aniens 


Human S' RST rplatpd nolvnentidp SFO 


79 


49 








IDNO-1051 

liV LV\S. 1 17 «/ A . 






6489 




Hnmn c^ni^nc 
l kVjLUxj dapivio 


fnromocnmal nrntptn 

i^uiuiuUdUuioi piuicui 


03 


13 


6490 


G04072 


Homo sanien<5 


Human QprrptpH nrotpin ^PO TTj "MO* 

AllULKUl OwlvlCVi pi WtCill, ODy JUL/ 1NV7, 


19S 


57 
3 / 








81^3 
OX JJ. 






6491 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


70 








7795. 






6492 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


251 


75 






• 


similar to pogo element 






6493 


U15647 


Mus musculus 


reverse transcriptase 


100 


40 
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SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


ldcntit 










Score 


y 


fAQA 


AKT)97908 




iiTinampfl rirrifpin nnoalict 


109 






00^714 




Hnm^n secreted nrotein SEO ID NO - 


135 


52 








7795. 






fAQf, 
\rty\j 


AF1 ^0089 1 


Unmn <*anien<i 


PRO2550 


230 


69 






nut ii u wKiLiiciid 


ORF 1 • MFR37* nutative transDosase 


237 


62 








^fmiljirtfi noon element 






\rt70 


AF0G0RQ4 


rxuuiu oauiciu 


PRO01 13 


140 


68 


64QQ 

\tt77 




Urtrnn cunipn <c 
XxUlllU oapiGUa 


Fraompnt" nf* human secreted nrotein 


138 


84 








encoded hv &ene 12 1 

vlivvUvU J & * 






6500 


G00357 


Hnmo saniens 

X iuluv Odplwlu 


Human secreted nrotein. SEO ID NO; 


123 


69 








4438. 






6501 

\fmJ\J 1 


M69997 


Ur»m/\ Qarwpn q 

xlvrJltv/ jaJf/lGlio 


ORF3 

V/XVl J 


87 


58 


OJUx 




xi oino sapiens 


1 1 nlnn rtivn r*rr»+Pi n 
UiUVUVWli J/1 U1CLU 


81 


44 


OJVJJ 




JrlablilUulUlIl 


pval 


111 


52 






VlVCuv 








VJUt 


A FOQORO^ 


nomo oapicuo 


PRO01 17 

X IVUV 1 1 r 


131 


60 






xiomo Sapiens 


Unman nnnctafp nim r%r F^T* frfltTTYiPnt 

riUlliaU U1U3U14.C lUiUUl Lv>l Xiaglllwlll. 


97 


69 








acrivcu pruicjii tti 17. 






0->UO 




xiomo sapienb 


mttafivp nl ^rt 
puuu.lv c uuu 


122 


32 


OX) / 


Aoovzy 


Vigna 


extensin-iiKe proxein 


177 
III 


^9 






UilgUICUlala 








OjUo 




riomo Sapiens 


Unman c<»r»r<»t*v-| rtrrvtoin QFO TH WO* 

xiuman secreieu proicm, ocy iiy nv/. 


?R9 


w 








7X71 
loll. 






OjU? 


WHOjJl 


UT /twist odmAnr 

xioroo sapiens 


Unman Virpact ranrAr rplafpH nrnfpin 
XTUiuau urcodi cau^ci i watcvi jji vjiv/ui 


142 


61 






UPRR7 










xiomo Sapiens 


Unman corrAtmi nrntpin ^FO ID tJO* 
ximnan secreieu piuieiu, ocy i^v/. 




35 








7914 






1 1 


T 1*59077 


Unmr* canipnc 
xxwilyj oapidio 


men juici u au^|/i/oaow 


372 


73 


/W19 
O.MZ 




xiomo sapiens 


Human cecreted nrntein SFO ID NO* 


81 


50 








A790 






AST* 


AF0Q0Q9R 


nomo sapiens 




97 


39 




A 171 "3/f 877 


iVLonoueipQis 


email nit^lAor riKr»niir , l*a/M - vrrk+Ain R' 

small nuciear rioouucieupi ulcui d 


108 

IwO 


47 






uomesuca 








Oj 1 J 




xiomo sapiens 


Unman cprrptpH nrntpin ^FO ID NO* 

XllUHall 5CU1CU3U pi UlCUIj OLy lU l^lV^. 


82 


44 














OJiO 




rioiuo Sapiens 


neat shock laLiur uuiuiug ptuicui i 


118 

1 io 


69 








tlCDDt 
nODr 1 






A517 
OJl / 


AFl^OORQ 


xiomo sapiens 


rivL/ZJJU 


207 


69 


£S1R 


M99H9 


xiomo sapiens 


iinlmnwri nTr\tpin 
UlUxllvrVVll piUU7Ul 


154 


55 


OJ 17 


00940** 


xiomo sapiens 


Unman cem^tpd nmtein SFO ID NO* 

XlUlllall Owv/lCLtAl piVILClll, OJ_»V^ *■ ' l^w. 


95 


39 








^^74 
OJ /*t. 






6590 




xiAJiiiu dapiwio 


Hum an secreted nrotein SEO ID NO" 


65 


37 








7200. 






6521 


AK023563 


Homo saniens 


unnamed nrotein oroduct 


268 


58 


6522 


G00357 

VivvJJ 1 


Hnmn *;anien<i 


Hnnnan secreted nrotein SEO ID NOi 


181 


61 








4438. 






6523 


AF? 10651 


Homo «?anien^ 


NAG 18 


85 


85 


6524 


AFH0079 


Homo QaniPn^ 


PR02852 


127 


51 


£S9s 


vJUO /OO 


xiomo SapiClK> 


Unman oprrpted nrotein ^FO ID NO* 

XlUUlaU oCvJCiCU pi \JL01U) u£ry 1XX li\/. 


113 


51 








7867 






UJZO 




XT.UUIU ZkapLvlio 


Unman cprrptpfl nrotpin fraornpnt 

XxUUiaU jCOl CLULl piVllCUl XI clg^lHVyl 1L 


85 


62 








f*nf*f\Af>f\ from apnp 9R 






6S97 


T 14947 1 


IV/fiic miicniltic 


Wicrntt-AMrirh ^vndrnme nrotein 
wiSkVUii'/viuiiwi ojriiui uujv pvu<iu 


115 


48 








liUlXli/li/g 






6528 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


86 


60 








6913. 






6529 


R59842 


Homo sapiens 


ApoE4Ll protease. 


92 


58 


6530 


R59842 


Homo sapiens 


ApoE4Ll protease. 


135 


77 


6531 


W48351 


Homo sapiens 


Human breast cancer related protein 


88 


60 
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NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Vo 

Identit 

y 








BCRB2. 






6532 


AF1 19851 


Homo sapiens 


PR01722 


121 


61 


6533 


AF063866 


Melanoplus 
sanguinipes 
entomopoxvirus 


ORF MSV233 hypothetical protein 


85 


35 


6534 


G02469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6550. 


123 


59 


6535 


R95913 


Homo sapiens 


Neural thread protein. 


132 


67 


6536 


AF130079 


Homo sapiens 


PR02852 


113 


58 


6537 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


159 


60 


6538 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


279 


68 


6539 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


112 


55 


6540 


AF 130089 


Homo sapiens 


PRO2550 


124 


56 


6541 


U93574 


Homo sapiens 


putative pi 50 


159 


39 


6542 


AF1 18082 


Homo sapiens 


PRO1902 


114 


45 


6543 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


78 


50 


6544 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


118 


44 


6545 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


138 


62 


6546 


AF1 16661 


Homo sapiens 


PR01438 


162 


59 


6547 


S62928 


Homo sapiens 


PRB1M protein precursor 


116 


48 


6548 


AB030192 


Mus musculus 


homoiogue of S70447 FIFo-ATP 
synthase complex Fo membrane 
domain f subunit 


174 


63 


6549 


AK0251 16 


Homo sapiens 


unnamed protein product 


136 


56 


6550 


X92485 


Plasmodium 
vivax 


pval 


105 


56 


6551 


AF130079 


Homo sapiens 


PR02852 


172 


91 


6552 


U70935 


Peromyscus 
maniculatus • 


reverse transcriptase 


84 


42 


6553 


S79410 


Mus' musculus 


nuclear localization signal binding 
protein 


100 


55 


6554 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


110 


39 


6555 


AF130089 


Homo sapiens 


PRO2550 


205 


64 


6556 


AF130079 


Homo sapiens 


PR02852 


165 


48 


6557 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoducL 


160 


74 


6558 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


133 


49 


6559 


X92485 


Plasmodium 
vivax 


pval 


121 


78 


6560 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


76 


45 


6561 


AF217374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


234 


85 


6562 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


61 


6563 


Y95028 


Homo sapiens 


Human clone vp7 1 ORF2, SEQ ID 
NO:128. 


134 


84 


6564 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


66 


47 


6565 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


180 


49 
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SEQID 


Accession No. 


Species 


Description . 


Sraith- 


% 


NO: 








Waterman 


Identit 












y 


6566 


AF265575 


Unmn saniens 


nhimiitoni TPR-motif nrotein Y 




Jf 








isoform 






6567 


X92485 


Plasmodium 


pval 


96 


5R 

JO 














6568 


Y02693 


Homo sapiens 


Human secreted nrotein encoded bv 


103 


73 








gene 44 clone HTDAD22. 






6569 


AF1 19851 


Homo saniens 


PRO 1722 




ou 


6570 


AF289098 


dadrastis 


EN0D2 


i in 








kentnkea 








6571 


AF 118086 


Homo saniens 


PRO 1992 


1 1 *; 
iu 




6572 


G02538 


Homo sapiens 


Human secreted nrotein- SEO ID NO* 

M. AVI lull mM tfVWlvwU fch/A X/ 1 1 1 j * * J X 


142 


57 








6619 






6573 


G04063 


Homo saoiens 


Human secreted nrotein SEO ID NO* 


Oy 


47 








8144. 






6574 


M22332 


Homo sapiens 


unknown nrotein 


95 


34 


6575 


Y86248 


Homo saniens 


Human secreted nrotein HCHPF68 




64 








SEO ID NO- 163 






6576 


AK021848 


Homo saniens 


unnamed nrotein nmdiict 


131 


41 


6577 


AF161356 


Homo sapiens 


HSPC093 


155 


57 


6578 


AF1 16909 


Homo saniens 


unknown 


75 


76 


6579 


G03790 


Homo saniens 


Human secreted nrotein SFO ID NO* 


95 


53 








7S71. 






6580 


AF 130089 


Homo sapiens 


PRO2550 


276 


74 


6581 


K02576 


XJTrvrri (\ CAntPTIC 


ool i\/ art/ nrrtlinp.rirn orf\fr*»in 1 

oaiiYui y pruimc-rivii uiuicm i 


110 


jj 


6582 


U49973 


Homo sanien*; 


ORF7* fiin/*rirtn imknnu/n 




oz. 


6583 


G03411 


Homo oanipnQ 


Human oprrpterl nrotein ^PO TD NO* 


on 


A5 








7497 






6584 


AJ249395 


CiIohoHera 


NA DM-iihi/niinnnp rvYirfrtr*>Hnr*fsic*» 

UUll£UltlUUG UAIUUI vUUV/UwC 


99 
yy 








pallida 


subunit 4 






6585 


Y27907 


Homo saniens 


Human secreted nrotein encoded hv 


75 


67 

VIZ. 








gene No. 119. 






6586 


U49973 


Homo sapiens 


ORF2* function unknown 


134 


58 


6587 


Y08061 


Homo saniens 


T-Ttimsiri mvVi rwnt^Jn fi-a orin*nt 

JLXUU1CU1 V/^lLtjfU {JLVJL^lll lluglXiVUU 


1 1? 


47 


6588 


G03790 


Homo saniens 


Human secreted nrotein ^FO TD NO* 


717 










7871. 






6589 


K02576 


Homo saniens 


saltvarv nrnline-rirh orotpin 1 


l*r / 


dl 


6590 


U43627 


Arafiidnnsis 






3? 
j I 






thaliana 

uuuuuia 








6591 


X62677 




rcLiuviiuo rciaicu reverse uaiiovripiaoc 


71 


34 1 






cuniculus 








6592 


Y91577 


Homo saniens 


PTiimfin sf»rrf»tf»H nrotein ^pnupncp 


177 


oo 








encoded bv »ene 2 SEO ID NO*250 






6593 


G03786 


Homo saniens 


Human secreted nrotein SKO ID NO* 


100 


77 
/ z* 








7867. 






6594 


M69297 


Homo sapiens 


ORF3 


137 


43 


6595 


X68101 


Rattus 




372 


89 






norvegicus 








6596 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NOi 


81 


62 








4435. 






6597 


Y02785 


Homo saoiens 


Human secreted nrotein encoded bv 

ilUUKUI MVWlVlwU L7JV/bWUl V>llWViv\4 J 


99 


43 








eeneSl clone HUKEX85 






6598 


Z99714 


Homo saoiens 


hfT104/?F0 ^ /novel nroteinl 


164? 




6599 


G00637 


Homo saniens 


Human secreted nrotein SEO ID NO* 




50 








4718. 






6600 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


65 








6953. 






6601 


AK000385 


Homo sapiens 


unnamed protein product 


187 


74 


6602 


AF130051 


Homo sapiens 


PRO0898 


137 


54 


6603 


S58722 


Homo sapiens 


X- linked retinopathy protein {C- 


88 


48 



305 



WO 01/64835 



PCT/US01/04927 



SEQ0> 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Idcntit 










Score 


v 

y 








terminal, clone XEH.8c} 






6604 


X61047 


Hydra sp. 


mini-collagen 


116 


45 


6605 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


71 








7888. 






6606 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


212 


66 








6953. 






6607 


AF232673 


Homo sapiens 


B17 long form 


1272 


99 


6608 


AF1 18082 


Homo sapiens 


PRO1902 


77 


52 


6609 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 


132 


74 








oncoprotein. 






6610 


Y08062 


Homo sapiens 


Human PRQ245 protein fragment 


70 


61 








derived from DNA35638. 






6611 


Y36203 


Homo sapiens 


Human secreted protein #75. 


140 


62 


6612 


X92485 


Plasmodium 


pval 


157 


72 






vivax 








6613 


U52077 


Homo sapiens 


mariner transposase 


260 


67 


6614 


AF161392 


Homo sapiens 


HSPC274 


138 


45 


6615 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


65 








7795. 






6616 


AF1 19851 


Homo sapiens 


PRO 1 722 


147 


49 


6617 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


63 








7870. 






6618 


AL390U4 


Leishmania 


extremely cysteine/valine rich protein 


117 


39 






major 








6619 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 


105 


58 








ID NO: 1051. 






6620 


U15647 


Mus musculus 


reverse transcriptase 


106 


38 


6621 


AF130089 


Homo sapiens 


PRO2550 


171 


52 


6622 


X92485 


Plasmodium 


Dval 


139 


44 






vivax 








6623 


AF090895 


Homo sapiens 


PRO01 17 


161 


58 


6624 


M63730 


Homo sapiens 


BPAG2 


107 


32 


6625 


G0325S 


Homo sapiens 


Human secreted orotein SEO ID NO* 


J 83 


61 








7339. 






6626 


AF078860 


Homo sapiens 


PTD007 


69 


73 


6627 


AL080253 


Arabidopsis 


putative snRNP protein 


101 


37 






thaliana 








6628 


AK000385 


Homo sapiens 


unnamed protein product 


140 


57 


6629 


W60652 


Homo sapiens 


Human D Vic- 1 (alternative transcript). 


218 


51 


6630 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


66 








6913. 






6631 


AB046048 


Macaca 


unnamed portein product 


118 


65 






fascicularis 








6632 


X92485 


Plasmodium 


pval 

Mr 


119 


88 






vivax 








6633 


X03145 


Homo sapiens 


pot ORF V 


98 


39 


6634 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


165 


41 


6635 


X59448 


Drosophila' 


dynamin 


111 


39 






melanogaster 








6636 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


50 








7870. 






6637 


AF130089 


Homo sapiens 


PRO2550 


141 


75 


6638 


AF1 18082 


Homo sapiens 


PRO1902 


108 


63 


6639 


W48351 


Homo sapiens 


Human breast cancer related protein 


144 


60 








BCRB2. 






6640 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


104 


47 


6641 


AF 118082 


Homo sapiens 


PRO1902 


121 


53 


6642 


AF118082 


Homo sapiens 


PRO1902 


195 


68 


6643 


R95913 


Homo sapiens 


Neural thread protein. 


212 


65 
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NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 

y 


6644 


AF090928 


Homo sapiens 


PRO0470 1 


81 


65 


6645 


AF130079 


Homo sapiens 


PR02852 


151 


69 


6646 


AF003535 


Homo sapiens 


ORF2-Iike protein 


140 


35 


6647 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


169 


61 


6648 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


100 


42 


6649 


M15530 


Homo sapiens 


B-cell growth factor 


121 


76 


6650 


AF090942 


Homo sapiens 


PRO0657 


119 


53 


6651 


AF1180&2 


Homo sapiens 


PRO1902 


130 


52 


6652 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


143 


87 


6653 


AF130089 


Homo sapiens 


PRO2550 


143 


63 


6654 


AF119851 


Homo sapiens 


PR01722 


285 


70 


6655 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


103 


75 


6656 


AF1 19855 


Homo sapiens 


PR01847 


61 


60 


6657 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


107 


36 


6658 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


84 


61 


6659 


AF1 19851 


Homo sapiens 


PR01722 


177 


54 


6660 


X92485 


Plasmodium 
vivax 


pval 


104 


55 


6661 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


116 


57 


6662 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


68 


6663 


L22029 


Glycine max 


hydroxyproline-rich glycoprotein 


115 


35 


6664 


D00570 


Mus musculus 


open reading frame (251 AA) 


126 


35 


6665 


AF130051 


Homo sapiens 


PRO0898 


129 


77 


6666 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


108 


42 


6667 


G02851 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6932. 


71 


41 


6668 


AF130079 


Homo sapiens 


PR02852 


no 


59 


6669 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


113 


66 


6670 


G00527 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4608. 


98 


73 


6671 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


80 


51 


6672 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


113 


36 


6673 


V01201 


Simian sarcoma 
virus 


coding sequence of pol 


143 


47 


6674 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


159 


78 


6675 


G00648 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4729. 


64 


56 


6676 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


96 


48 


6677 


AF130089 


Homo sapiens 


PRO2550 


100 


75 


6678 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


94 


68 


6679 


G03263 


Homo sapiens 


Human secreted protein/SEQ ID NO: 
7344. 


105 


71 


6680 


G03812 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7893. 


218 


53 


6681 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


53 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
v 








4433. 






6682 


AF16I356 


Homo sapiens 


HSPC093 


163 


47 


6683 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


78 


72 


6684 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


87 


42 


6685 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


98 


43 


66S6 


U93564 


Homo sapiens 


p40 


96 j 


41 


6687 


AF1 19855 


Homo sapiens 


PR01847 


137 


67 


6688 


U52077 


Homo sapiens 


mariner transposase 


268 


81 


6689 


X92485 


Plasmodium 
vivax 


pval 


97 


58 . 


6690 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


133 


56 


6691 


M63274 


Plasmodium 
falciparum 


malaria antigen 


61 


46 


6692 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


147 


80 


6693 


Ml 1901 


Rattus 
norvegicus 


proJine-rich salivary protein 


115 


40 


6694 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6532. 


131 


50 


6695 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


174 


66 


6696 


AF068294 


Homo sapiens 


HDCMB45P 


154 


59 


6697 


X55685 


Lvcooersicon 
esculentum 


extensin ( class X\ 


1 12 


41 


6698 


M83293 


Simian 

immunodeficienc 
y virus 




O J 


HI 


6699 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


169 


88 


6700 


G02455 


Homo sapiens 


Human secreted nrotein SEO ID NO* 
6536. 


121 


58 


6701 


AF1I8080 


Homo sapiens 


PRO 1880 


127 


59 


6702 


X92485 


Plasmodium 
vivax 


pval 


106 


58 


6703 


AF161356 


Homo sapiens 


HSPC093 


155 


53 


6704 


X92485 


Plasmodium 
vivax 


oval 


164 


54 


6705 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


172 


59 


6706 


AK025326 


Homo sapiens 


unnamed protein product 


146 


50 


6707 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


112 


43 


6708 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


119 


71 


6709 


AF 176531 


Mus musculus 


F-box protein FBX16 


324 


60 


6710 


X92485 


Plasmodium 
vivax 


pval 


96 


64 


6711 


AF130089 


Homo sapiens 


PRO2550 


203 


67 


6712 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


150 


59 


6713 


AF090944 


Homo sapiens 


PRO0663 


114 


43 


6714 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


110 


47 


6715 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


133 


60 



308 



WO 01/64835 PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

v 
J 


6716 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
7795. 


103 


65 


6717 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


115 


46 


6718 


G01222 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5303. 


54 


100 


6719 


Y36203 


Homo sapiens 


Human secreted protein #75. 


129 


78 


6720 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


107 


64 


6721 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


130 


61 


6722 


Y36203 , 


Homo sapiens I 


Human secreted protein #75. 


167 


72 


6723 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


116 


46 


6724 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


133 


42 


6725 


U80761 


Homo saDiens 


CTG26 alternate open reading frame 


88 


76 


6726 


G00352 


Homo saDiens 


Human secreted nrotein. SEO ID NO: 
4433. 


218 


61 


6727 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


399 


81 


6728 


U01877 


Homo sapiens 


p300 protein 


119' 


27 


6729 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


107 


51 


6730 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


130 


58 


6731 


X66285 


Mus musculus 


HC1 ORF 


79 


38 


6732 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


114 


38 


6733 


AF057352 


Homo sapiens 


hepatocellular carcinoma autoantigen 


148 


81 


6734 


AF090895 


Homo sapiens 


PRO0U7 


152 


57 


6735 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


164 


61 


6736 


X92485 


Plasmodium 
vivax 


pval 


92 


89 


6737 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


112 


49 


6738 


Y02693 


Homo saDiens 


Human secreted nrotein encoded bv 
gene 44 clone HTDAD22. 


278 


72 


6739 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


191 


47 


6740 


AF1 18082 


Homo sapiens 


PRO 1902 


83 


49 


6741 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


83 


43 


6742 


Y36708 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 62. 


71 


64 


6743 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


177 


70 


6744 


Y91475 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 25 SEQ ID NO: 148. 


159 


50 


6745 


X92485 


Plasmodium 
vivax 


pval 


112 


39 


6746 


Z82268 


Caenorhabditis 
elegans 


predicted using Genefmder-contains 
similarity to Pfam domain: PF01391 
(Collagen triple helix repeat (20 
copies)), Score^.S, Ervalue=12e-18, 
N=2; PF01484 (Nematode cuticle 
collagen N-terminal domain), 
Score=37.5, E-value=le-07, 
N=l~cDNA EST yk92fl.5 comes from 
this gene-cDNA EST yk55f6.5 comes 
from this gene-cDNA EST yk65b7.3 
comes from this gene-cDNA EST 


108 


36 



309 



SEQ1D 


Accession No. 


Species 


Description 


Smith- 


% 


NO; 






* 


Waterman 


Idea tit 










Score 


y 








vk58f5 5 comes from this crene—cDNA 












FST vk7?h5 5 comes from this 












gene— cDNA EST yk55c4.5 conies from 












this gene-cDNA EST yk58hl.5 comes 












from this gene-cDNA EST yk65al2.5 












comes from this gene~cDNA EST 












yk94f4.5 comes from this gene 






6747 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


38 








7214. 






6748 


R95913 


Homo sapiens 


Neural thread protein. 


108 


56 


6749 


U57747 


Mus musculus 


platelet-activating factor 


209 


69 








acetylhydrolase isoform lb beta subunit 






6750 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


65 








7795. 






6751 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


48 








4718. 






6752 


AF090895 


Homo sapiens 


PRO0117 


115 


56 


6753 


AF217973 


Homo sapiens 


unknown 


105 


66 


6754 


AF119851 


Homo sapiens 


PR01722 


156 


55 


6755 


AF 119855 


Homo sapiens 


PRO 1847 


189 


66 


6756 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


163 


55 








7870. 






6757 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


57 








4435. 






6758 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


70 








7534. 






6759 


AF 130051 


Homo sapiens 


PRO0898 


98 


68 


6760 


M36914 


Zea mays 


cell wall protein (put.); putative 


75 


28 


6761 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


85 








6619. 






6762 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


6i 








6834. 






6763 


Ml 9756 


Homo sapiens 


MHC HLA-Bw47 precursor 


580 


92 


6764 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


115 


52 






brucei 


complete virion genome. 






6765 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


321 


51 


6766 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


59 








7867. 






6161 


G03790 


Homo saoiens 


Human secreted DTOtein SEO ID NO* 


303 


64 






7871. 






6768 


AF061340 


Artibeus 


cytochrome c oxidase subunit TJ 


213 


67 






jamaicensis 








6769 


T71265_cdl 


Homo sapiens 


19-JAN-1996 Neuronal apoptosis 


188 


59 








inhibitor protein (NAIP) cDNA. 






6770 


B01390 


Homo sapiens 


Neuron-associated protein. 


92 


55 


6771 


AF017777 


Drosophila 


la costa 


114 


50 






melanogaster 








6772 


Z34955 


Sus scrofa 


protein phosphatase 2A 65 kDa 


331 


93 








regulatory subunit, alpha isoform 






6773 


X61296 


Rattus 


open reading frame 2 


101 


37 






norvegicus 








6774 


ALU7345 


Homo sapiens 


dJ416F21 2 (novel protein) 


639 


91 


6775 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


61 








7867. 






6776 


AJ007798 


Homo sapiens 


stromal antigen 3, (STAG3) 


238 


76 


6777 


AB029065 


Mus musculus 


Ser/Thr kinase KKIAMRE-beta 


165 


66 


6778 


M37400 


Homo sapiens 


aspartate aminotransferase 


208 


68 


6779 


AF1 30089 


Homo sapiens 


PRO2550 


127 


52 


6780 


AF130079 


Homo sapiens 


PR02852 


332 


81 



310 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith* 

Waterman 

Score 


% 

Identit 

y 


6781 


AL049748 


Homo sapiens 


dJ41P2.2 (RNA binding motif protein 9 
(isoform 1)) 


264 


60 


6782 


LI 1007 


Rattus 
norvegicus 


cyclin-dependent kinase 4 


239 


75 


6783 


AF3 19938 


Therm us 
thermophilus 


competence factor ComEA 


115 


36 


6784 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


119 


44 


6785 


L16864 


African swine 
fever virus 


cd2 homologue 


152 


34 


6786 


Y36203 


Homo sapiens 


Human secreted protein #75. 


99 


44 


6787 


AF130087 


Homo sapiens 


PR02411 


172 


65 


6788 


AP208846 


Homo saoiens 


BM-004 


664 


100 


6789 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


90 


83 


6790 


AP000059 


Aeropyrum 
pemix 


120aa long hypothetical protein 


81 


43 


6791 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


53 


6792 


Y 12961 


Homo sapiens 


Amino acid sequence of a human 
secreted neotide 


175 


65 


6793 


Y73442 


Homo *ianien^ 


Human secreted protein clone ya66 1 
protein sequence SEQ ID NO: 106. 


565 


93 


6794 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


136 


50 


6795 


AC004022 




cenim naraoxonasearvlesterase 3 


137 


93 


6796 


AF009668 

ill UV7UUU 


multinle 
sclerosis 
associated 
retrovirus 


nolvorotein 


194 


60 


6797 


R59842 


Homo sapiens 


ApoE4Ll protease. 


121 


92 


6798 


U88167 


Caenorhabditis 
elegans 


contains similarity to C2 domains 


171 


73 


6799 


AF293024 


Homo sapiens 


steroid receptor RNA activator isoform 
1 


767 • 


97 


6800 


AF010144 


Homo sapiens 


neuronal thread protein AD7C-N IF 


123 


65 


6801 


B0 1203 


Homo sapiens 


Human GTPase associated protein-28. 


313 


87 


6802 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ui 102, variant a dna, 
complete virion genome. 


125 


47 


6803 


AF078036 


Homo sapiens 


RelA associated inhibitor, NF-kappaB 
subunit p65 binding protein 


1275 


95 


6804 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


131 


53 


6805 


Y57946 


Homo sapiens 


Human transmembrane protein 
HTMPN-70. 


1128 


100 


6806 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


81 


81 


6807 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


129 


52 


6808 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


273 


63 


6809 


K02401 


Homo sapiens 


chorionic somatomammotropin 


446 


95 


6810 


AF130089 


Homo sapiens 


PRO2550 


150 


56 


6811 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


123 


65 


6812 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


93 


38 


6813 


AF090930 


Homo sapiens 


PRO0478 


103 


65 


6814 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


138 


54 



311 
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NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 


6815 


B24423 


Homo sapiens 


Human PRO 1244 protein sequence 
SEQID NO: 1 93. 


484 


95 


6816 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


105 


43 


6817 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


96 


56 


6818 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


102 


48 


6819 


U82303 


Homo sapiens 


unknown 


105 


57 


6820 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


113 


45 


6821 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


97 


39 


6822 


G01246 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
5327. 


146 


86 


6823 


U52077 


Homo sapiens 


mariner transposase 


192 


78 


6824 


AC004237 


Homo sapiens 


KIF3 


129 


86 


6825 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


138 


66 


6826 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


112 


64 


6827 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


108 


68 


6828 


U49974 


Homo sapiens 


mariner transposase 


103 


86 


6829 


AF1 19851 


Homo sapiens 


PR01722 


115 


53 


6830 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


82 


62 


6831 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


94 


73 


6832 


Y19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


131 


53 


6833 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


74 


50 


6834 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


168 


61 


6835 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


140 


60 


6836 


X92485 


Plasmodium 
vivax 


pval 


99 


58 


6837 


AB046048 


Macaca 
fescicularis 


unnamed portein product 


147 


42 


6838 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


134 


52 


6839 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


141 


60 


6840 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


126 


73 


6841 


AF130089 


Homo sapiens 


PRO2550 


167 


63 


6842 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


140 


56 


6843 


AF090894 


Homo sapiens 


PRO0113 


148 


62 


6844 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478, 


119 


51 


6845 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


75 


61 


6846 


G03295 
• 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


65 


44 


6847 


AF130089 


Homo sapiens 


PRO2550 


160 


76 


6848 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


132 


68 | 


6849 


AF1 16715 


Homo sapiens 


PR02829 


123 


67 


6850 


Y01437 


Homo sapiens 


Secreted protein encoded by gene 55 


142 


96 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


MA. 
liv. 








Waterman 


Id en tit 










Score 


y 








clone HLMBP 18. 






6851 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


76 








7879. 






6852 


AF068294 


Homo sapiens 


HDCMB45P 


161 


56 


6853 


G02532 


Homo sapiens 


Human seqreted protein, SEQ ID NO: 


111 


57 








6613. 






6854 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


165 


72 








7874. 






6855 


AF13005! 


Homo sapiens 


PRO0898 


117 


61 


6856 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


72 








7879. 






6857 


X556S5 


Lycopersicon 


extensin (class I) 


190 


45 






esculentum 








6858 


AL359782 


Trypanosoma 


possible (hhv-6) u 1 1 02, variant a dna, 


164 


69 






brucei 


complete virion genome. 






6859 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


90 


52 


6860 


AF 119851 


Homo sapiens 


PR01722 


331 


73 


6861 


G00418 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


67 








4499. 






6862 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


53 








7870. 






6863 


X92485 


Plasmodium 


pval 


105 


63 






vivax 








6864 


AF068294 


Homo sapiens 


HDCMB45P 


206 


61 


6865 


AF068294 


Homo sapiens 


HDCMB45P 


128 


46 


6866 


L00016 


Homo sapiens 


urf4 


157 


58 


6867 


X92485 


Plasmodium 


pval 


95 


89 






vivax 








6868 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


148 


77 








SEQIDNO:163. 






6869 


AF1 30089 


Homo sapiens 


PRO2550 


162 


67 j 


6870 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


257 


65 








encoded by gene 17. 






6871 


X71972 


Mus museums 


ribosomal protein S24 


211 


85 


6872 


M28016 


Homo sapiens 


cytochrome b 


72 


93 


6873 


M24069 


Homo sapiens 


DNA-binding protein A 


1662 


79 


6874 


AF130051 


Homo sapiens 


PRO0898 


163 


76 


6875 


AF006061 


Homo sapiens 


placental growth hormone isoform 


263 


90 








hGH-V3 






6876 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


80 








5066. 






6877 


AF130079 


Homo sapiens 


PR02852 


130 


76 


6878 


S8O905 


Homo sapiens 


Conl 


148 


38 


6879 


X55683 


Lycopersicon 


extensin (class I) 


70 


35 






esculentum 






6880 


U16359 


Rattus 


nitric oxide synthase 


106 


94 






norvegicus 








6881 


AK025047 


Homo sapiens 


unnamed protein product 


167 


50 


6882 


X92485 


Plasmodium 


pval 

* 


110 


63 






vivax 








6883 


W48351 


Homo sapiens 


Human breast cancer related protein 


185 


46 








BCRB2. 






6884 


AK024455 


Homo sapiens 


FLJ00047 protein 


81 


64 


6885 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


47 








6574. 






6886 


AF014008 


Bostaurus 


myocardial vascular inhibition factor 


158 


96 


6887 


AF265575 


Homo sapiens 


. ubiquitous TPR-motif protein Y 


183 


83 








isoform 






6888 


Y45382 


Homo sapiens 


Human secreted protein fragment 


199 


49 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Water in an 
Score 


% 

Identit 

v 








encoded from gene 28. 






6889 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


140 


67 


6890 


AF130051 


Homo sapiens 


PRO0898 


135 


65 


6891 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


147 


65 


6892 


X72963 


Nicotiana 
tabacum 


pAP8 product 


80 


44 


6893 


AF073957 


Homo sapiens 


CXC chernokine BRAK 


399 


82 


6894 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


160 


80 


6895 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO;163. 


150 


70 


6896 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


178 


85 


6897 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


142 


62 


6898 


AF 118082 


Homo sapiens 


PRO1902 


144 


53 


6899 


AF130051 


Homo sapiens 


PRO0898 


187 


72 


6900 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


121 


85 


6901 


AF130089 


Homo sapiens 


PRO2550 


100 


63 


6902 


Z29446 


Hepatitis C virus 


core region 


60 


36 


6903 


AF107406 


Homo sapiens 


GW128 


71 


63 


6904 


W03627 


Homo sapiens 


Human follicle stimulating hormone 
GPRN-terminal sequence. 


173 


64 


6905 


AF 130089 


Homo sapiens 


PRO2550 


88 


65 


6906 


GQ3789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


78 


84 


6907 


U93569 


Homo sapiens 


p40 


145 


48 


6908 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


186 


50 


6909 


AF130089 


Homo sapiens 


PRO2550 


244 


77 


6910 


Ml 8093 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


119 


38 


6911 


AF130089 


Homo sapiens 


PRO2550 


139 


47 


6912 


G0032S 


Homo sapiens 


Human secreted protoin y SEQ ID NO: 
4409. 


146 


60 


6913 


W03642 


Homo sapiens 


Human cannabinoid GPRN-terminal 
sequence. 


131 


70 


6914 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


106 


74 


6915 


AF161361 J 


Homo sapiens 


HSPC098 


79 


66 


6916 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


207 


60 


6917 


AF130051 


Homo sapiens 


PRO0898 


258 


61 


6918 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


108 


65 


6919 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


131 


58 


6920 


G02635 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6716. 


168 


80 


6921 


AF116661 


Homo sapiens 


PR01438 


71 


48 


6922 


AF090940 


Homo sapiens 


PRO0644 


72 


85 


6923 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


186 


70 


6924 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


89 


55 


6925 


AF229067 


Homo sapiens 


PADI-H protein 


122 


72 



314 
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SEQ© 


Accession No. 


Species 


Description 


Smith- 


% 










Waterman 


Identit 










Score 


V 

j 


6926 


W50192 


Homo sapiens 


Amino acid sequence of salivary 


105 


37 








protein CON-L 






6927 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


173 


60 








7881. 






6928 


X56603 


Mus musculus 


mouse 57-KD Calcium-binding protein 


103 


52 








(MCaBP) 






6929 


AFl 19851 


Homo sapiens 


PR01722 


148 


76 


6930 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


96 


61 








7868. 






6931 


AF077208 


Homo sapiens 


HSPC022 - 


155 


72 


6932 


Y19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


111 


62 


6933 


M22332 


Homo sapiens 


unknown protein 


154 


77 


6934 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


146 


60 








gene 22 clone HMSJW1 8. 






6935 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


168 


63 








6613. 






6936 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


50 








4457. 






6937 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


77 








6613. 






6938 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


ill 


68 








7795. 






6939 


AFl 18082 


Homo sapiens 


PRO1902 


120 


54 


6940 


AF090942 


Homo sapiens 


PRO0657 


149 


72 


6941 


M22332 


Homo sapiens 


unknown protein 


96 


51 


6942 


AF130087 


Homo sapiens 


PR02411 


132 


58 


6943 


M15530 


Homo sapiens 


B-cell growth factor 


97 


52 


6944 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


52 








4433. 






6945 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


73 


66 








7871. 






6946 


Z10561_cdl 


Homo sapiens 


12-MAR-1998 DNA encoding G 


330 


96 








protein-coupled receptor protein 












designated SREB2. 






6947 


W53869 


Homo sapiens 


Gravin polypeptide fragment, residues 


170 


89 








1526-1780. 






6948 


W48351 


Homo sapiens 


Human breast cancer related protein 


164 


86 








BCRB2. 






6949 


AFl 18086 


Homo sapiens 


PR01992 


117 


86 


6950 


AF090928 


Homo sapiens 


PRO0470 


115 


58 


6951 


AF130089 


Homo sapiens 


PRO2550 


124 


69 


6952 


Y38397 


Homo sapiens 


Human secreted protein encoded by 


94 


64 








gene No. 12. 






6953 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


55 








6613. 






6954 


X92485 


Plasmodium 


pval 


108 


64 






vivax 








6955 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


117 


85 








terminal, clone XEtL8c} 






6956 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


53 








6574. 






6957 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


50 








6983. 






6958 


AF130089 


Homo sapiens 


PRO2550 


127 


67 


6959 


Y02749 


Homo sapiens 


Human secreted protein encoded by 


119 


49 








gene 100 clone HNFIU96. 






6960 


J02459 


bacteriophage 


U (tail component; 131) 


492 


92 






lambda 








6961 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


180 


68 



315 
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SEQU> 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Water man 


Id en tit 










Score 


v 

7 








7214. 






6962 


Z66490 


Salmo trutta 


slow myotonia! muscle tropomyosin 


182 


72 


6963 


AF130089 


Homo sapiens 


PRO2550 


108 


73 


6964 


W21733 


Homo sapiens 


MP-1 encoded by clone 59. 


131 


69 


6965 


AF161356 


Homo sapiens 


HSPC093 


58 


48 


6966 


AB047934 


Macaca 


hypothetical protein 


146 


58 






fascicularis 






6967 


X92485 


Plasmodium 


pval 


116 


55 






vivax 








6968 


AF1 18082 


Homo sapiens 


PRO 1902 


85 


65 


6969 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


176 


63 








6908. 






6970 


AF1 16712 


Homo sapiens 


PR02738 


101 


56 


6971 


X96667 


Homo sapiens 


beta-palactoside aInha-2 3- 


172 










sialyltransferase 






6972 


Y86574 


Homo saDiens 


Human gene 91-encoded protein 




oz. 








fragment, SEQ ID NO:49L 






6973 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


176 


58 








6619. 






6974 


AF162149 


Mycoplasma . 


variable surface liDoorotein 


143 


40 






bovis 






6975 


W17971 


Homo sapiens 


RAC protein kinase C-tenninal binding 


135 


81 








protein C-terminal region. 






6976 


AF130089 


Homo sapiens 


PRO2550 1 


345 


78 


6977 


AF130089 


Homo sapiens 


PRO2550 


195 


63 


6978 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


75 








7795. 






6979 


M605I0 


Mus musculus 


beta-D-galactosidase fusion protein 


96 


47 


6980 


AF1 18086 


Homo sapiens 


PR01992 


131 


64 


6981 


AF130089 


Homo sapiens 


PRO2550 


132 


64 


6982 


L27428 


Homo sapiens 


reverse transcriptase 


128 


35 


6983 


U93566 


Homo sapiens 


r TV 


98 


31 


6984 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 


138 


78 








derived from DNA35638. 






6985 j 


L27428 


Homo sapiens 


reverse transcriptase 


133 


67 


6986 


AF090895 


Homo sapiens 


PRO0117 


139 


62 ! 


6987 


G00528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


212 


60 








4609. 






6988 


Y36203 


Homo sapiens 


Human secreted protein #75. 


189 


73 


6989 


AF1 16712 


Homo sapiens 


PR02738 


87 


42 


6990 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


63 








6613. 






6991 


AF090930 


Homo sapiens 


PRO0478 


154 


52 


6992 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 | 


76 








7867. 






6993 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


48 








6953. 






6994 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


47 








6834. 






6995 


J02974 


Acanthamoeba 


myosin IB heavy chain 


120 


43 






castellanii 






6996 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


69 








4433. 






6997 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


60 








7867. 






6998 


Y12961 


Homo sapiens 


Amino acid sequence of a human 


114 


66 








secreted peptide. 






6999 


AF1 13685 


Homo sapiens 


PRO0974 


88 


58 


7000 


AF130089 


Homo sapiens 


PRO2550 


146 


58 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


7001 


U93565 


Homo sapiens 


putative pi 50 


96 


46 


7002 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


146 


57 


7003 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


206 


92 


7004 


AF130089 


Homo sapiens 


PRO2550 


104 


50 


7005 


AL137472 


Homo sapiens 


hypothetical protein 


179 


55 


7006 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. - 


379 


98 


7007 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


121 


38 


7008 


AF119851 


Homo sapiens 


PR01722 


136 


63 


7009 


AF218028 


Homo sapiens 


unknown 


104 


52 


7010 


AF130079 


Homo sapiens 


PR02852 


129 


60 


7011 


Y64890 


Homo sapiens 


Human 5* EST related polypeptide SEQ 
ff)NO:1051. 


110 


54 


7012 


X03145 


Homo sapiens 


potORFVI 


130 


65 


7013 


AF090944 


Homo sapiens 


PRO0663 


171 


75 


7014 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


122 


65 


7015 


M26460 


Homo sapiens 


retinoblastoma 1 


113 


37 


7016 


X92485 


Plasmodium 
vivax 


pval 


112 


56 


7017 


X92485 


Plasmodium 
vivax 


pval 


144 


57 


7018 


AF130079 


Homo sapiens 


PR02852 


224 


74 


7019 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


129 


43 


7020 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


131 


58 


7021 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


106 


69 


7022 


AF090901 


Homo sapiens 


PRO0195 


103 


88 


7023 


AF1 16661 


Homo sapiens 


PR01438 


120 


47 


7024 


AP002460 


Arabidopsis 
thaliana 


gene_i±FlD9.26~unknown protein 


325 


60 


7025 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment j 
derived protein #153. 


126 


63 


7026 


AF1 19900 


Homo sapiens 


PR02822 


111 


70 


7027 


AF130051 


Homo sapiens 


PRO0898 


133 


44 


7028 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


179 


68 


7029 


Y19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


72 


60 


7030 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


130 


50 


7031 


J03941 


Mus museums 


ferritin heavy chain 


124 


57 


7032 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


157 


59 


7033 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


120 


65 


7034 


Y36156 


Homo sapiens 


Human secreted protein #28. 


164 


48 


7035 


U80761 


Homo sapiens 


CTG26 alternate open reading frame 


116 


48 


7036 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


130 


52 


7037 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


113 


55 


7038 


G02538 


Homo sapiens 


Human secreted protein, SEQ CD NO: 
6619. 


78 


68 


7039 


AF1 30089 


Homo sapiens 


PRO2550 


121 


65 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


7040 


API 19855 


Homo saoien*; 


PRO 1847 


158 

I JO 


03 


7041 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


116 


62 


7042 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


117 


54 


7043 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


123 


80 


7044 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


144 


63 


7045 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


86 


44 


7046 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881 


80 


72 


7047 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7563. 


100 


57 


7048 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
o\> i y. , 


152 


66 


7049 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

\jy\JO. 


94 


72 


7050 


AF176323 


Macropus 

pncrpnii 


small nuclear ribonucleoprotein B' 


103 


41 


7051 


AF255446 


Crypthecodinium 

cftlrnii 

\AJ1UHI 


Dipl-associated protein C 


127 


38 


7052 


M64792 


Rattus 


salivary proline-rich protein 


149 


42 


7053 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/ /7J. 


147 


69 


7054 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


109 


54 


7055 


Y91577 




TJiimon PQ/>rotw4 nrnfain ca/iiTanna 

iiuman secreiea protein sequence 
cncuucu uy gene ^ onv^ ixj ihkj.z.dxj. 


1 c< 

lOJ 


50 


7056 


U60269 


Homo sapiens 


putative envelope protein; orf similar to 
cnv ui i ype t\ <uj<j i ype o rccro viruses 
and to class II HERVs 


125 


65 


7057 


AF068294 


JTLvJJUU bO-JJlCUb 




Lxy\ 


Id 


7058 


Y86248 


Wnmrt cnniVnc 

llA/HIV OU-JJlCllO 


Unman <u*rrt*tf*A nmhin HY^WPFYCff 

rxuuioii bcorcieu protein n^rarxoo, 
SEO ID NO* 163 




SO 


7059 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
ffene 22 clone HMXJWl 8 


140 


62 


7060 


AF090894 


Homo saoiens 

A A VUiV/ Jul/] vIUJ 


PRO0113 n 




£9 
o£ 


7061 


AF107406 


Homo sapiens 


GW128 


66 


80 


7062 


W50193 


Homo ^ai>ierK 


Am inn nriH ^pniipnr^ r>-f* ealivrcrrvr 
fllUulU Qvlvl ovvJUGUvC \JL SQ11 V til Y 

protein CON-2. 


OD 




7063 


AF119851 


Homo sar>ien<; 


PRO 1722 


1 17 


/** 


7064 


X92485 


Plasmodium 
vivax 


pval 


123 


uu 


7065 


G02532 


Homo sapiens 


Human secreted nrotein SEO ID NO* 
6613. 


79 


59 


7066 


L23545 


Homo sapiens 


putative 


135 


44 


7067 


AF1 19851 


Homo sapiens 


PR01722 


301 




7068 


AF130079 


Homo sapiens 


PR02852 


118 


71 


7069 


AF119851 


Homo <;anien<; 


PRO 1722 


1 JU 




7070 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


138 


48 


7071 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


94 


39 


7072 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


145 


62 


7073 


G00397 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


155 


44 
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SEQID 


Accession No. 


Species 


Description 


Smitb- 


% 


NO: 






Waterman 


Idcntit 










Score 


y 








4478. 






7074 


AF130089 


Homo sapiens 


PRO2550 


146 


60 


7075 


U49974 


Homo sapiens 


mariner transposase 


198 


83 


7076 


AF236085 


Homo sapiens 


CYP4F11 


427 


79 


7077 


AF130089 


Homo sapiens 


PRO2550 


159 


52 


7078 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


206 


85 








clone HPMBQ32. 






7079 


D32202 


Homo sapiens 


alpha IC adrenergic receptor isoform 2 


152 


71 


7080 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


62 








4718. 






7081 


AB046048 


Macaca 


unnamed portein product 


135 


46 






fascicularis 








7082 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


75 








7795. 






7083 


X53581 


Rattus 


ORF4 


119 


64 






norvegicus 








7084 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


62 






6913. 






7085 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


64 








7795. 






7086 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


71 


66 








7376. 






7087 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


54 








6613. 






7088 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


50 








7214. 






7089 


U08021 


Homo sapiens 


nicotinamide N-methyltransferase 


116 


100 


7090 


U93569 


Homo sapiens 


p40 


199 


82 


7091 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


56 








7795. 






7092 


X92485 


Plasmodium 


pval 


125 


60 






vivax 








7093 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


62 








6536. 






7094 


AF 118082 


Homo sapiens 


PRO1902 


191 


50 


7095 


X92485 


Plasmodium 


pval 


129 


67 






vivax 








7096 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 


80 


66 








derived from DNA35638. 






7097 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


67 








6613. 






7098 


AF1 16715 


Homo sapiens 


PR02829 


123 


63 


7099 


L27428 


Homo sapiens 


reverse transcriptase 


189 


50 


7100 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


48 








6613. 






7101 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 


65 


7102 


D85030 


Schizosaccharom 


ribosomal protein S33 homolog 


139 


85 






yces pombe 








7103 


G0302L 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


177 


77 








7102. 






7104 


U72543 


Sus scrofa 


ubiquitin-like/S30 ribosomal fiision ' 


129 


100 




- 




protein 






7105 


AC007787 


AA 187-502 


NFI-X3=transcription factor 


190 


84 


7106 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


38 








7874. 






7107 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


56 








8153. 






7108 


Z69381 


Saccharomyces 


Nil 14 


450 


91 






cerevisiae 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






w a terra an 


Identic 










Score 


y 


7109 


V49572_cdl 


Homo sapiens 


13-NOV-1996 Human stomach cancer 


353 


70 








clone HP10 122 cDNA#l. 






7110 


W50192 


Homo sapiens 


Amino acid sequence of salivary 


99 


45 








protein CON- 1. 






7111 


AF068294 


Homo sapiens 


HDCMB45P 


132 


55 j 


7112 


AB033020 


Homo sapiens 


KIAA1 194 protein 


850 


100 


7113 


AB019692 


Homo sapiens 


protein kinase PKNbeta 


792 


87 


7114 


M12140 


Homo sapiens 


envelope protein 


154 


79 


7115 


Y27854 


Homo sapiens 


Human secreted protein encoded by 


88. 


62 








gene No. 101. 






7116 


AF124491 


Homo sapiens 


ARF GTPase-activating protein GIT2 


1216 


100 


7117 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


153 


54 








gene 22 clone HMSJW18. 






7118 


AF130089 


Homo sapiens 


PRO2550 


184 


64 


7119 


AF1 30079 


Homo sapiens 


PR02852 


144 


68 


7120 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


112 


51 






brucei 


complete virion genome. 






7121 


AF032384 


Xenopus laevis 


metalloprotease-disintegrin 


304 


76 


7122 


K01664 


Drosophila 


Bkm-like protein 


99 


71 






melanogaster 








7123 


AJ238706 


Drosophila 


monocarboxylate transporter 1 


161 


38 






melanogaster 


homologue 






7124 


X92485 


Plasmodium 


pval 


127 


54 






vivax 








7125 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


65 








7795. 






7126 


AF1 16712 


Homo sapiens 


PR02738 


118 


53 


7127 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


64 








7871. 






7128 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


62 








6566. 






7129 


AF1 18090 


Homo sapiens 


PRO2044 


280 


80 


7130 


AF129085 


Homo sapiens 


carboxy terminus of Hsp70-interacting 


593 


91 








protein 






7131 


AF05484O 


Homo sapiens 


tetraspan TM4SF; Tspan-3 


513 


83 


7132 


L27428 


Homo sapiens 


reverse transcriptase 


156 


51 


7133 


AK001647 


Homo sapiens 


unnamed protein product 


2334 


99 


7134 


AF130051 


Homo sapiens 


PRO0898 


142 


84 


7135 


X53581 


Rattus 


ORF4 


213 


46 






norvegicus 








7136 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


86 








7124. 






7137 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


194 


68 








6953. 






7138 


U21634 


Onychomys 


cytochrome c oxidase subunit ID 


Hi 


84 






torridus 






7139 


X60592 


Homo sapiens 


CDw40 


201 


95 


7140 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


63 








7795. 






7141 


M76979 


Homo sapiens 


pigment epithelial-differentiating factor 


458 


93 


7142 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


54 








7795. 






7143 


Y27677 


Homo sapiens 


Human secreted protein encoded by 


323 


90 








gene No. 111. 






7144 


AF116712 


Homo sapiens 


PR02738 


132 


60 


7145 


AB024057 


Homo sapiens 


vascular Rab-GAP/TBC-containing 


121 


45 








protein 






7146 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


65 








7253. 
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SEQ n> 
NO: 


Accession No. 


Species 


Description 


[ Smith- 
Waterman 
Score 


% 

Identit 

y 


7147 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) u 1 102, variant a dna, 
complete virion genome. 


141 


76 


7148 


R06400 


Homo sapiens 


Lambda gtl0ch2 encoded C-terminal 
alpha amidating enzyme. 


337 


85 


7149 


S87759 


Homo sapiens 


protein phosphatase 2C alpha, 
PP2Calpha 


427 


78 


7150 


M12140 


Homo sapiens 


envelope protein 


541 


64 


7151 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


106 


85 


7152 


X55683 


Lycopersicon 
esculentum 


extensin (class I) 


70 


38 


7153 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


194 


61 


7154 


AF130089 


Homo sapiens 


PRO2550 


89 


60 


7155 


AF 100742 


Homo sapiens 


M-phase phosphoprotein homolog 


714 


94 


7156 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


143 


78 


7157 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


174 


79 


7158 


AL031230 


Homo sapiens 


dJ73M23.3(KIAA0319) 


214 


45 


7159 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


155 


37 


7160 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


129 


46 


7161 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


150 


53 


7162 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


107 


55 


7163 


M13043 


Mus musculus 


alpha- 1 (IV) collagen 


80 


42 


7164 


AB01 1004 


Homo sapiens 


UDP-N-acety Ighicosam ine 
pyrophosphorylase 


257 


90 


7165 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


111 


47 


7166 


AF1 19851 


Homo sapiens 


PR01722 


279 


61 


7167 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


136 


51 


7168 


W73327 


Homo sapiens 


Bipolar affective disorder associated 
gene £sh!6 product 


287 


63 


7169 


Y35946 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 195. 


1231 


99 


7170 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


132 


76 


7171 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


184 


60 


7172 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


141 


64 


7173 


AF129507 


Homo sapiens 


transcription factor ICBP90 


245 


55 


7174 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


131 


56 


7175 


AF161356 


Homo sapiens 


HSPC093 


107 


55 


7176 


AF2 18028 


Homo sapiens 


unknown 


116 


55 


7177 


G02654 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6735. 


117 


52 


7178 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


94 


38 


7179 


AF130089 


Homo sapiens 


PRO2550 


124 


47 


7180 


AP000616 


Oryza sativa 


similar to RENG-H2 finger protein 
RHAla(AF078683) 


109 


66 


7181 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


65 


7182 


Y36156 


Homo sapiens 


Human secreted protein #28. 


127 


62 
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Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Identit 










Score 


y 


7183 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


75 








6688. 






7184 


X92485 


Plasmodium 


pval 


129 


60 






vivax 








7185 


AF130089 


Homo sapiens 


PRO2550 


116 


53 


7186 


AF130089 


Homo sapiens 


PRO2550 


137 


64 


7187 


X51760 


Homo sapiens ~" 


zinc finger protein (583 AA) 


236 


56 


7188 


X92485 


Plasmodium 


pval 


131 


50 






vivax 








7189 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


152 


57 








6619. 






7190 


AF068294 


Homo sapiens 


HDCMB45P 


163 


50 


7191 


API 19855 


Homo sapiens 


PR01847 


55 


50 


7192 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


57 








6619. 






7193 


M15530 


Homo sapiens 


B-cell growth factor 


106 


53 


7194 


G0341 i 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


41 








7492. 






7195 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


60 








7519. 






7196 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


68 








7882. 






7197 


U60315 


MoUuscum 


MC054L 


116 


36 






contagiosum 












virus subtype 1 








7198 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


67 ; 








7881. 






7199 


Ml 5530 


Homo sapiens 


B-celi growth factor 


146 


59 


7200 


AF090894 


Homo sapiens 


PRO0113 


165 


65 


7201 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


57 








6913. 






7202 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


129 


53 


* 






encoded by gene 17. 






7203 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


92 


65 








7868. 






7204 


AF126163 


Homo sapiens 


HHLA3 protein 


131 


78 


7205 


X92485 


Plasmodium 


pval 


116 


42 






vivax 








7206 


AF090931 


Homo sapiens 


PRO0483 


136 


83 


7207 


Y36156 


Homo sapiens 


Human secreted protein #28. 


127 


49 


7208 


Y12961 


Homo sapiens 


Amino acid sequence of a human 


158 


55 








secreted peptide. 






7209 


Y36203 


Homo sapiens 


Human secreted protein #75. 


124 


56 


7210 


M69008 


Homo sapiens 


alpha- 1 type XIII collagen 


90 


34 


7211 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


71 








7795. 






7212 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


60 








7795. 






7213 


AF130089 


Homo sapiens 


PRO2550 


163 


86 


7214 


AF130089 


Homo sapiens 


PRO2550 


111 


52 


7215 


AF1 18082 


Homo sapiens 


PRO1902 


150 


63 


7216 


Y53663 


Homo sapiens 


A human B-cell growth factor related 


83 


66 








protein (BGFRP). 






7217 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


68 








7795. 






7218 


AF220264 


Homo sapiens 


MOST-1 


123 


66 


7219 


AF229067 


Homo sapiens 


PADI-H protein 


121 


52 


7220 


AF1 18086 


Homo sapiens 


PR01992 


130 


61 


7221 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


179 


100 



322 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- - 
w aici man 
Score 


% 

Ideotit 

y 








SEQID NO: 163. 






7222 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


154 


78 


7223 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


200 


95 


7224 


L27428 


Homo sapiens 


reverse transcriptase 


127 


55 


7225 


AB008227 


Adiantum 
capillus-veneris 


Extensin 


121 


43 


7226 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


113 


78 


7227 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


153 


96 


7228 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


112 


74 


7229 


M15530 


Homo sapiens 


B-cell growth factor 


137 


44 


7230 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


140 


87 


7231 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


162 


46 


7232 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


123 


90 


7233 


XI 5769 


Mus muscuius 


UlRNA-associated 70-kDa protein 


105 


29 


7234 


M15530 


Homo sapiens 


B-cell growth factor 


113 


61 


7235 


AF194537 


Homo sapiens 


NAG 13 


249 


65 


7236 


R20313 


Homo sapiens 


Gly(5) beta-globin mutant. 


296 


80 


7237 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


123 


50 


7238 


Y36156 


Homo sapiens 


Human secreted protein #28. 


97 


80 


7239 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


395 


81 


7240 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


122 


70 


7241 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


123 


90 | 


7242 


AF090942 


Homo sapiens 


PRO0657 


118 


47 


7243 


AF1 19851 


Homo sapiens 


PR01722 


112 


62 


7244 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


100 


53 


7245 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


144 


66 


7246 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


129 


78 


7248 


X92485 


Plasmodium 
vivax 


pval 


108 


58 


7249 


G02532 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6613. 


170 


74 


7250 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


153. 


70 


7251 


U01849 


Trypanosoma 
brucei 


ORF2 


93 


39 


7252 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


113 


54 


7253 


G03473 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7554. 


110 


83 


7254 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


167 


66 


7255 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


179 


83 


7256 


AF 130079 


Homo sapiens 


PR02852 


147 


70 


7257 


AFJ 18082 


Homo sapiens 


PRO 1902 


80 


45 


7258 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


137 


55 


7259 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


90 


65 



323 



WO 01/64835 



PCTYUS01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO* 








Waterman 


Identit 










Score 


y 


7260 


X92485 


Plasmodium 


pval 


105 


85 






vivax 








7261 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


76 








7879. 






7262 


AF130089 


Homo sapiens 


PRO2550 


98 


43 


7263 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


114 


60 








gene 22 clone HMSJW1 8. 






7264 


AF1 16661 


Homo sapiens 


PR01438 


139 


67 


7265 


AF104021 


Bovine viral 


poly protein 


161 


40 






diarrhea virus-2 








7266 


AL 160493 


Leishmania 


probable (hhv-6) ul 102, variant a 


116 


43 






major 


DNA, complete virion genome 






7267 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


112 


62 








gene 44 clone HTDAD22. 






7268 


M15530 


Homo sapiens 


B-ceil growth factor 


154 


60 


7269 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


54 








7437. 






7270 


AF I 18082 


Homo sapiens 


PRO 1902 


139 


62 


7271 


S52010 


Mus sp. 


orfl 5' of EpoR 


93 


33 


7272 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


50 








6913. 






7273 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


61 








7795. 






7274 


AF130089 


Homo sapiens 


PRO2550 


124 


43 


7275 


X92485 


Plasmodium 


pval 


147 


44 






vivax 








7276 


G00357 


Homo sapiens 


Human secreted Drotein SEO ID NO* 


81 


59 








4438. 






7277 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


67 








4488. 






7278 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


89 


67 








7253. 






7279 


AF130087 


Homo sapiens 


PR02411 


72 


65 


7280 


Y40039 


Homo sapiens 


Peptide sequence derived from a 


130 


69 








human secreted protein. 






7281 


J03507 


Homo sapiens 


complement protein C7 precursor 


357 


81 


7282 


AF1 16661 


Homo sapiens 


PR01438 


117 


48 


7284 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


61 








6613. 






7285 


AF1 16715 


Homo sapiens 


PR02829 


128 


75 


7286 


AF130079 


Homo sapiens 


PR02852 


122 


52 


7287 


R13556 


Homo sapiens 


Protein encoded downstream of hhc M 


120 


50 








oncoprotein. 






7288 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


338 


75 








7870. 






7289 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


114 


72 








encoded by gene 17. 






7290 


W48351 


Homo sapiens 


Human breast cancer related protein 


146 


59 








BCRB2. 






7291 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


58 








6619, 






7292 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


167 


61 








6613. 






7293 


X92485 


Plasmodium 


pval 


122 


71 






vivax 








7294 


AF130089 


Homo sapiens 


PRO2550 


87 


62 


7295 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


50 








7870. 






7296 


X92485 


Plasmodium 


pval 


152 


75 



324 



SEQ ID 
NO: 


Accession No. 


Species 




omitn- 

Waterraan 

Score 


0/ 

Zdentit 

y 






vivax 








7297 


AF090894 


Homo sapiens 


PRO0I13 


172 


65 


7298 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


121 


61 


7299 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


114 


66 


7300 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


228 


80 


7301 


G03714 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7795. 


111 


73 


7302 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


119 


46 


7303 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


168 


76 


7304 


U58658 


Homo sapiens 


unknown 


102 


48 


7305 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


118 


53 


7306 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


116 


63 


7307 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


149 


58 


7308 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


112 


52 


7309 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


115 


54 


7310 


Ml 5530 


Homo sapiens 


B-ceII growth factor 


131 


54 


7311 


AF1 18082 


Homo sapiens 


PRO! 902 


168 


46 


7312 


L78671 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


412 


92 


7313 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


326 


74 


7314 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


149 


59 


7315 


AF289099 


Maackia 
amurensis 


ENOD2f 


107 


32 


7316 


U17009 


Phytophthora 
infestans 


SecY-independent transporter protein 


135 


41 


7317 


AF220264 


Homo sapiens 


MOST-1 


133 


72 


7318 


U49973 


Homo sapiens 


ORF2: function unknown 


111 


53 


7319 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


274 


80 


7320 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


79 


56 ' 


7321 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


205 ! 


95 


7322 


AC005175 


Homo sapiens 


TA2R HUMAN, BETA ISOFORM; 
TXA2-R; PROSTANOID TP 
RECEPTOR 


127 


57 


7323 


R48296 


Homo sapiens 


Human PGF-2/NT-3. 


89 


93 


7324 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWI 8. 


157 


60 


7325 


AF1 19855 


Homo sapiens 


PR01847 


172 


66 


7326 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


141 


50 


7327 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


156 


56 


7328 


Y40039 


Homo sapiens 


Peptide sequence derived from a 
human secreted protein. 


98 


56 




uUJ /89 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


161 


64 


7330 


G02532 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6613. 


111 


72 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


7331 


AF126163 


Homo sapiens 


HHLA3 protein 


98 


47 


7332 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


122 


61 


7333 


AF130079 


Homo sapiens 


PR02852 


136 


54 


7334 


AK023563 


Homo sapiens 


unnamed protein product 


265 


73 


7335 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


113 


61 


7336 


MI 5530 


Homo sapiens 


B-celi growth factor 


136 


75 


7337 j 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


168 


81 


7338 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


339 


80 


7339 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


170 


74 


7340 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 3 


400 


82 


7341 


L00016 


Homo sapiens 


urf4 


295 


72 


7342 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


350 


86 


7343 


M36647 


Homo sapiens 


mitochondrial hinge protein precursor 


286 


82 


7344 


X15917 


Paramecium 
aurelia 


ND2 protein (AA 1-193) (unusual start 
codon) 


132 


34 


7345 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


136 


82 


7346 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


89 


69 ! 


7347 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


133 


69 


7348 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


169 


71 


7349 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


347 


83 


7350 


V00662 


Homo sapiens 


cytochrome oxidase I 


325 


85 


7351 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


406 


80 


7352 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


224 


79 


7353 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


270 


86 


7354 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


167 


90 


7355 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


120 


87 


7356 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


362 


84 


7357 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


188 


72 


7358 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


335 


72 


7359 


U09500 


Homo sapiens 


cytochrome b 


275 


80 


7360 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


209 


73 


7361 


AF1 18078 


Homo sapiens 


PR01848 


95 


58 


7362 


AB017116 


Homo sapiens 


AD 1 


148 


89 


7363 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


146 


42 


7364 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


203 


82 


7365 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


206 


84 


7366 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


408 


82 


7367 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


456 


86 


7368 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


481 


85 


7369 


X86031 


Sus scrofa 


homologue of protine/arginine-rich 
antibacterial protein j 


150 


53 


7370 


AB017116 


Homo sapiens 


AD 1 


145 


91 


7371 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


173 


85 


7372 


AF 118082 


Homo sapiens 


PRO 1902 


122 


54 


7373 


G00492 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4573. 


163 


62 


7374 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


53 


7375 


AF090894 


Homo sapiens 


PRO0113 


128 


64 


7376 


AK000241 


Homo sapiens 


unnamed protein product 


131 


67 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






tt a ici man 


Id en tit 








Score 


y 


7377 


AF1 19855 


Homo sapiens 


PR01847 


164 


62 


7379 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


114 


37 


7380 


U57362 


Rattus 


collagen XII alpha 1 


148 


47 






norvegicus 








7381 


D38112 


Homo saniens 


ATPase subunit 6 


423 


76 


7382 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


135 


58 








gene 22 clone HMSJW18. 






7383 


AB040938 


Homo sapiens 


KIAA1 505 protein 


983 


76 


7384 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


48 








7871. 






7385 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


122 


75 






SEQIDNO:163. 






7386 


U63295 


Homo sapiens 


seven in absentia homolog 


260 


85 


7387 


AE003499 


Drosophila 


CGI 2706 gene product 


103 


32 






melanogaster 








7388 


X03342 


Homo saniens 


rpL32 (aa 1-135) 


123 


51 


7389 


G02872 


Homo saniens 


Human secreted protein, SEQ ID NO: 


129 


58 






6953. 






7390 


Y86248 


Homo saniens 


Human secreted protein HCHPF68, 


154 


100 






SEQIDNO:163. 






7391 


AK026800 i 


Homo sapiens 


unnamed protein product 


137 


52 


7009 
toy/. 




xiuiuu oopicuo 


Human secreted nrotein SEO ID NO* 


117 


73 








6613. 






7393 


G04072 


Homo saniens 


Human secreted protein, SEQ ID NO: 


89 


70 






8153. 






7394 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


161 


80 








SEQ ID NO: 163. 






7395 


M15530 


Homo sapiens 


B-ceJ] growth factor 


98 


57 


7396 


L10908 


Mus musculus 


Gcapl gene product 


105 


51 


7397 


AF 119851 


Homo sapiens 


PR01722 


127 


55 


7398 


X99256 


Hylobates lar 


cytochrome oxidase subunit HI 


319 


73 


7399 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


497 


77 


7400 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


399 


80 


7401 


L78671 


Homo saniens 


CoxII/D-loop DNA fusion protein 


170 


83 


7402 


D38112 


Homo saniens 


NADH dehydrogenase subunit 3 


183 


91 


7403 


Y86248 


Homo saniens 


Human secreted protein HCHPF68, 


189 


88 








SEO ID NO* 163. 






7404 


(502639 


T-Tomo saniens 


Human secreted nrotein SEO ID NO: 


115 


40 








6720. 






7405 


D83006 


Saccharomvces 


gene required for phosphoylation of 


171 


38 






cerevisiae 


oligosaccharides/ has high homology 












with YJR061W 






7406 


AF144636 


G alius gallus 


osteoblast regulatory factor 3 A 


108 


44 


7407 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


70 








6613. 






7408 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


407 


93 


7409 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


140 


87 


7410 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 


111 


59 






RHAla(AF078683) 






7411 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


138 


82 


7412 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


431 


82 


7413 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


56 








4478. 






7414 


AB036396 


Rana 


cytochrome b 


109 


87 






nigromaculata 








7415 


AF090930 


Homo sapiens 


PRO0478 


121 


76 


7416 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 


171 


96 








1 (NDF-1). 






7417 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 3 


446 


91 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Wa term an 


Idcntit 










Score 


y 


7418 


AB018440 


Echinococcus 


NADH dehydrogenase subunit 2 


112 


33 


• 




multilocularis 








7419 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6S, 


123 


66 








SEQIDNO:163. 






7420 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


116 


50 








SEQ ID NO: 163. 






7421 


AF155232 


Pisum sativum 


extensin 


128 


37 


7422 


AF161356 


Homo sapiens 


HSPC093 


89 


48 


7423 


D38115 


Pongo pygmaeus 


cytochrome c oxidase subunit 3 


162 


81 


7424 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


171 


79 








SEQIDNO:163. 






7425 


AF275953 


synthetic 


Cox8ND6gfp fusion protein 


214 


93 






construct 








7426 


D381I2 


Homo sapiens 


cytochrome c oxidase subunit L 


156 


73 "j 


7427 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


59 








4718. 






7428 


AK024455 


Homo sapiens 


FLJ00047 protein 


82 


66 


7429 


D38112 _j 


Homo sapiens 


NADH dehydrogenase subunit 1 


212 


82 


7430 


S79410 


Mus musculus 


nuclear localization signal binding 


124 


48 








protein 






7431 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


195 


90 


7432 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


217 


84 


7433 


U87145 


Toxoplasma 


ORFD 


89 


36 






gondii 








7434 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


213 


72 








SEQIDNO:163. 






7435 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


135 


57 








gene No. 107. 






7436 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


82 








7888. 






7437 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


190 


88 








SEQIDNO:163. 






7438 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


141 


78 


7439 


AF250284 


Amsacta moorei 


AMV012 


88 


31 






entomopoxvirus 








7440 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


149 


85 


7441 j 


U43360 


Peromyscus 


reverse transcriptase 


114 


52 






maniculatus 








7442 


AF275807 


Homo sapiens 


PNAS-110 


157 


48 


7443 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


126 


56 








SEQIDNO:163. 






7444 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


475 


95 


7445 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


71 








6953. 






7446 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


60 








7867. 






7447 


X92485 


Plasmodium 


pval 


153 


45 






vivax 








7448 


AF1 19851 


Homo sapiens 


PR01722 


96 


55 


7449 


AF130079 


Homo sapiens 


PR02852 


153 


73 


7450 


AF119851 


Homo sapiens 


PR01722 


238 


52 


7451 


AL451017 


Neurospora 


related to Ul SMALL NUCLEAR 


113 


38 






crassa 


RIBONUCLEOPROTEIN C 






7452 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


147 


57 








7867. 






7453 


G03243 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


136 


63 








7324. 






7454 


R14163 


Homo sapiens 


Cellular DN A-binding protein RS 1 . 


270 


42 


7455 


X63526 


Homo sapiens 


homologue to elongation factor 1- 


387 


80 
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Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 








gamma from A.salina 






7456 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


80 








7253. 






7457 


AF1 19851 


Homo sapiens 


PR01722 


102 • 


55 


7458 


G02756 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


83 


55 








6837. 






7459 


AF1 19851 


Homo sapiens 


PR01722 


132 


64 


7460 


L10908 


Mus musculus 


Gcapl gene product 


106 


64 


7461 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


54 








6913. 






7462 


X77816 


Rattus 


PR-Vbetal 


142 


51 






norvegicus 








7463 


Y34068 


Homo sapiens 


Histone HI isoform HI. 5 pANCA- 


96 


35 








reactrve fragment (residues 69-226). 






7464 


AB046048 


Macaca 


unnamed portein product 


302 


62 






fascicularis 








7465 


X92485 


Plasmodium 


pval 


115 


47 






vivax 








7466 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


65 




• 




6619. 






7467 


W50192 


Homo sapiens 


Amino acid sequence of salivary 


91 


35 








protein CON-1. 






7468 


AF161361 


Homo sapiens 


HSPC098 


92 


57 


7469 


AF216389 


Homo sapiens 


semaphorin Rs 


288 


88 


7470 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


153 


73 


7471 


AF130089 


Homo sapiens 


PRO2550 


108 


48 


7472 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


58 








7870. 






7473 


AF217536 


Homo sapiens 


truncated mevalonate kinase 


115 


71 


7474 


X61296 


Rattus 


open reading frame 2 


134 


49 






norvegicus 








7475 


U22376 


Homo sapiens 


alternatively spliced product using exon 


165 


46 








13A 






7476 


AF1 19851 


Homo sapiens 


PR01722 


133 


46 


7477 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


127 


69 






bnicei 


complete virion genome. 






7478 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


533 


87 


7479 


S79410 


Mus musculus 


nuclear localization signal binding 


122 


59 








protein 






7480 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


168 


84 


7481 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


181 


80 








7519. 






7482 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


56 








6292. 






7483 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


132 


37 








7874. 






7484 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


100 








6836. 






7485 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


277 


72 


7486 


M29294 


Rattus 


small nuclear ribonucleoparticle- 


132 


42 






norvegicus 


associated protein 






7487 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


140 


50 








gene 22 clone HMSJW18. 






7488 


AF090901 


Homo sapiens 


PRO0195 


101 


62 


7489 


X97249 


Homo sapiens 


leucine-rich primary response protein 


304 


93 


7490 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


447 


84 


7491 


AK024455 


Homo sapiens 


FLJ00047 protein 


122 


63 


7492 


U87269 


Homo sapiens 


pl20E4F transcription factor 


244 


66 


7493 


A3 1036 


Nicotiana alata 


PRP2 


87 


31 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 
y 


7494 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


206 


97 


7495 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQEDNO:163. 


166 


91 


7496 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


161 


66 


7497 


AK024455 


Homo sapiens 


FLJ00047 protein 


67 


61 


7498 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ED NO: 163. 


174 


84 - 


7499 


X75438 


Sus scrofa 


homologue of prolme/arginine rich 
antibacterial peptides 


116 


44 


7500 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQlDNO:163. 


166 


89 


7501 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


137 


61 


7502 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQEDNO:163. 


213 


60 


7503 


AB042770 


Pteropus j 
dasymallus 


cytochrome c oxidase subunit 3 


169 


89 


7504 


AF126163 


Homo sapiens 


HHLA3 protein 


110 


62 


7505 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


299 


90 


7506 


D3S112 


Homo sapiens 


cytochrome c oxidase subunit 3 


194 


88 


7507 


W48351 


Homo sapiens 


Human breast cancer related protein . 
BCRB2. 


128 


72 


7508 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


162 


86 


7509 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


116 


65 1 


7520 


L38908 


Nicotiana 
tabacum 


extensin 


114 


36 


7512 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


273 


73 


7513 


Y36156 


Homo sapiens 


Human secreted protein #28. 


168 


44 


7514 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-l). 


171 


96 


7515 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQEDNO:163. 


110 


46 


7516 


D3SU5 


Pongopygmaeus 


cytochrome c oxidase subunit 3 


170 


84 


7517 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


105 


43 


7518 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


350 


82 


7519 


D38114 


Gorilla gorilla 


NADH dehydrogenase subunit 3 (ND3) 


141 


84 


7520 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


410 


85 


7521 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


112 


71 


7522 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


96 


46 


7523 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


142 


57 


7524 


AF220264 


Homo sapiens 


MOST-1 


86 


62 


7525 


AF130089 


Homo sapiens 


PRO2550 


109 


35 


7526 


G00381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4462. 


106 


79 


7527 


U49973 


Homo sapiens 


ORF2: function unknown 


105 


68 


7528 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


122 


62 


7529 


X05831 


Rattus 
norvegicus 


fibronectin precursor 


100 


64 


7530 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


175 


60 


7531 


X92485 


Plasmodium 


pval 


120 


59 
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CPA 1I> 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 






vivax 








7532 


D38115 


Pongo pygmaeus 


cytochrome c oxidase subunit 3 


383 


75 


7533 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


321 


67 


7534 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


126 


72 






brucei 


complete virion genome. 






7535 


AF1 19851 


Homo sapiens 


PR01722 


110 


67 


7536 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


531 


81 


7537 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


75 








7795. 






7538 


X92485 


Plasmodium 


pval 


141 


48 






vivax 








7539 


AF1 19900 


Homo sapiens 


PR02822 


86 


42 


7540 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


50 






* 


6613. 






7541 


M15530 


Homo sapiens 


B-cell growth factor 


136 


53 


7542 ( 


X53581 


Rattus 


ORF4 


174 


46 






norvegicus 








7543 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


138 


74 








SEQ ID NO: 163. 






7544 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


116 


86 


7545 


M15530 


Homo sapiens 


B-cell growth factor 


96 


47 


7546 


X92485 


Plasmodium 


pval 


149 


58 






vivax 








7547 


AF1 13685 


Homo sapiens 


PRO0974 


110 


48 


7548 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


226 


73 








similar to pogo element 






7549 


L78671 


Homo sapiens 


CoxIITD-loop DNA fusion protein 


316 


73 


7550 


S79410 


Mus museums 


nuclear localization signal binding 


107 


70 








protein 






7551 


AF130089 


Homo sapiens 


PRO2550 


130 


45 


7552 


R95913 


Homo sapiens 


Neural thread protein. 


99 


56 


7553 


U93569 


Homo sapiens 


putative pi 50 


171 


47 


7554 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


75 








4435. 






7555 


L78671 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


207 


56 


7556 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


59 








7795. 






7557 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


43 








6913. 






7558 


Y86445 


Homo sapiens 


Human gene 42-encoded protein 


95 


71 








fragment, SEQ ID NO:360. 






7559 


X83413 


Human 


U88 


108 


32 






herpesvirus 6 








7560 


A18812 


Brassica napus 


extensin 


125 


38 


7561 


D38112 


Homo sapiens 


ATPase subunit 6 


495 


87 


7562 


AF130089 


Homo sapiens 


PRO2550 


141 


46 


7563 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 


106 


61 








isoform 






7564 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


62 








4442. 






7565 


U50191 


Caenorhabditis 


C. elegans collagen dpy-10 gene (Levy, 


101 


37 






elegans 


A.D., Yang, J. and Kramer, J.M. MoL 












Biol Cell 4, 803-17, 1993) 






7566 


V00672 


Pan troglodytes 


reading frame protein 4 


294 


73 


7567 


A18812 


Brassica napus 


extensin 


145 


48 


7568 


D38U2 


Homo sapiens 


cytochrome c oxidase subunit 3 


241 


69 


7569 


L00016 


Homo sapiens 


urf4 


284 


71 


7570 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


115 


50 


7571 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


65 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smitb- 

Waterman 

Score 


Identit 

y 








4754. 






7572 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


126 


65 


7573 


X92485 


Plasmodium 
vivax 


pval 


157 


49 


7574 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


151 


54 


7575 


AF130089 


Homo sapiens 


PRO2550 


348 


70 


7576 


R95913 


Homo sapiens 


Neural thread protein. 


113 


45 


7577 


L26953 


Homo sapiens 


chromosomal protein 


148 


38 


7578 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


116 


46 


7579 


AB047948 


Macaca 
fascicular is 


hypothetical protein 


173 


67 


7580 


GO3801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


149 


57 


7581 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


95 


51 


7582 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


125 


85 


7583 


AC004079 


Homo sapiens 


human HOXA3; 95% similarity to 
e307530 (PIDrgl 888441) 


127 


100 


7584 


Y86248 . 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDN0:163. 


171 


71 


7585 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


141 


80 


7586 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


222 


100 


7587 


D38112 


Homo sapiens 


cytochrome c oxidase subunit I 


194 


71 


7588 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


115 


80 


7589 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


127 


77 


7590 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


112 


60 


7591 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


133 


72 


7592 


A49807 


Neisseria 
meningitidis 


IGA-LPHA3 


93 


33 


7593 


Z67990 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01484 (Nematode cuticle collagen 
N- terminal domain), Score=27.5, E- 
value=0.0001,N=l 


102 


42 


7594 


U43627 


Arabidopsis 
thaliana 


extensin 


99 


44 


7595 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


103 


74 


7596 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


112 


49 


7597 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


190 


92 


7598 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


148 


96 


7599 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


117 


95 


7600 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


192 


85 


7601 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


91 


7602 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


333 


71 
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SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Identit 










Score 


y 


7603 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


190 


86 


7604 


U09500 


Homo sapiens 


cytochrome b 


279 


93 


7605 


Z14016 


Nicotiana 


pistil extensin like protein, partial CDS 


84 


40 






tabacum 








7606 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


71 








7882. 






7607 


G03636 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


55 


52 








7717. 






7608 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


54 








7795. 






7609 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


97 


62 


7610 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


434 


87 


7611 


Y21418 


Homo sapiens 


Human high mobility group protein 


97 


46 








HMGI-C mutant fragment 2. 






7613 


X92485 


Plasmodium 


pval 


106 


58 






vivax 








7614 


AC003058 


Arabidopsis 


unknown protein 


191 


88 






thaliana 








7615 


S74221 


Homo sapiens 


IK factor=cytokine down-regulating 


153 


65 








HLA class II 






7616 


U28971 


Caenorhabditis 


similar to RD tandem repeat region of 


148 


74 






elegans 


RD protein (nuclear rna-binding 












protein) 






7617 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


169 


56 








8159. 






7618 


L35013 


Homo sapiens 


spliceosomal protein 


95 


55 


7619 


V00662 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 


441 


85 


7620 


AF1 16638 


Homo sapiens 


PR01546 


172 


42 


7621 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


146 


46 


7622 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


72 








7795. 






7623 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


67 








6619. 






7624 


U41031 


Caenorhabditis 


proline-rich 


92 


59 






elegans 








7625 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


55 








7534. 






7626 


L19098 


Choristoneura 


cytochrome oxidase I 


102 


66 






fumiferana 








7627 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


76 








7253. 






7628 


AF126163 


Homo sapiens 


HHLA3 protein 


117 


67 


7629 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


69 


53 








4442. 






7630 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


83 


55 








7870. 






7631 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


64 








4458. 






7632 


Y27907 


Homo sapiens 


Human secreted protein encoded by 


120 


48 








gene No. 119. 






7633 


R95913 


Homo sapiens 


Neural thread protein. 


126 


61 


7634 


V00662 


Homo sapiens 


cytochrome oxidase I 


396 


85 


7635 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


205 


84 








SEQIDNO:163. 






7636 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


464 


85 


7637 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


165 


94 


7638 


AF 118086 


Homo sapiens 


PR01992 


120 


68 


7639 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


176 


84 








SEQIDNO:163. 
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SEQID 
NO: 


Accession No. 


Species 


DfKrrintinn 


omitn- 

Waterman 

Store 


% 

Identit 

y 


7640 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


95 


39 


7641 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


143 


68 


7642 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


192 


91 


7643 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


416 


80 


7644 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


133 


84 


7645 


U93570 


Homo sapiens 


putative pi 50 


206 


42 


7646 


D38112 


Homo sapiens 


ATPase subunit 6 


192 


80 


7647 


AF090942 


Homo sapiens 


PRO0657 


12S 


71 


764S 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


134 


73 


7649 


G60397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


115 


52 


7650 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


130 


61 


7651 


X71442 


Rattus 
. norvegicus 


ORF 1; putative 


105 


52 


7652 


L07545 


Leishmania 
tarentolae 


A V was inserted after nt 369 (=nt 
10459 in genomic sequence (M10126)) 
to correct -1 frameshift probably due to 
gel compression 


111 


32 


7653 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


89 


53 


7654 


G02473 


Homo sapiens 

* 


Human secreted protein, SEQ ID NO: 
6554. 


291 


49 


7655 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


165 


63 


7656 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


470 


88 


7657 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


367 


80 


7658 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0.163. 


108 


95 


7659 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


109 


50 


7660 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


112 


77 


7661 


AP002460 


Arabidopsis 
thaliana 


geneJd:FlD9.26~unknown protein 


121 


40 


7662 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


137 


96 


7663 


AF2 17973 


Homo sapiens 


unknown 


124 


51 


7664 


X92485 


Plasmodium 
vivax 


pval 


118 


59 


7665 


AF1 18082 


Homo sapiens 


PRO1902 


123 


44 


7666 


AB014563 


Homo sapiens 


KIAA0663 protein 


267 


90 


7667 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


114 


49 


7668 


AFI 19855 


Homo sapiens 


PRO 1847 


152 


72 


7669 


AK025116 


Homo sapiens 


unnamed protein product 


127 


65 


7670 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


140 


55 


7671 


R14847 


Homo sapiens 


Protein associated with biochemical 
pathway involving cAMP. 


164 


89 


7672 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


93 


68 


/O/J 


U 3 7426 


Homo sapiens 


kinesin-like spindle protein HKSP 


247 


81 


7674 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


125 


75 


7675 


S67633 


Homo sapiens 


class I major histocompatibility antigen 


103 
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SEQID 




Species 


jjcscnpuou 


oiuitn- 


- „/ ■ - ' 


NO: 








Waterman 


Identit 










Score 


y 


7676 


AF126163 


Homo sapiens 


HHLA3 protein 


146 


72 


7677 


AC005192 


Homo sapiens 


similar to mouse interferon-related 


143 


87 








protein PC4; 96% identical to P19182 












(PID:gl35861) 






7678 


X07882 


Homo sapiens 


Po protein 


126 


39 


7679 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


148 


60 








encoded by gene 17. 






7680 


AJ242540 


Vol vox carteri £ 


hydroxyproline-rich glycoprotein DZ- 


177 


37 






nagariensis 


HRGP 






7681 


AF090930 


Homo sapiens 


PRO0478 


80 


58 


7682 


X55654 


Homo sapiens 


cytochrome C oxidase n subunit 


115 


65 


7683 


AB046048 


Macaca 


unnamed portein product 


145 


67 






fascicularis 








7684 


AF130079 


Homo sapiens 


PR02852 


117 


60 


7685 


M13101 


Rattus 


unknown protein 


114 


53 






norvegicus 








7686 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 


134 


74 








clone HCACJ8L 






7687 


AF126163 


Homo sapiens 


HHLA3 protein 


141 


84 


7688 


Y36156 


Homo sapiens 


Human secreted protein #28. 


120 


61 


7689 


M37197 


Homo sapiens 


CCAAT-box-binding factor 


104 


95 


7690 


AL390114 


Leishmania 


probable (hhv-6) ul 102, variant a \ 


97 


54 






major 


DNA, complete virion genome 






7691 


AF220264 


Homo sapiens 


MOST-1 


117 


76 


7692 


AF063866 


Melanoplus 


ORF MSV233 hypothetical protein 


94 


31 






sanguinipes 












entomopoxvirus 








7693 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


40 








4433. 






7694 


AF1 18082 


Homo sapiens 


PRO1902 


115 


55 


7695 


G00517 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


56 








4598. 






7696 


Y15163 


Mus musculus 


putative splice variant containing 


133 


51 








region C 






7697 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 


132 


52 








IDNO:1051. 






7698 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 


155 


77 








clone HCACJ8L 






7699 


M15530 


Homo sapiens 


B-ceU growth factor 


106 


79 


7700 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


70 








7871. ! 






7701 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


62 








6619. 






7702 


AF090931 


Homo sapiens 


PRO0483 


125 


68 


7703 


G03432 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


79 








7513. 






7704 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


175 


63 








6613. 






7705 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


141 


35 


7706 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


58 








6613. 






7707 


X92485 


Plasmodium 


pval 


126 


42 






vivax 








7708 


AK023563 


Homo sapiens 


unnamed protein product 


217 


72 


7709 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 


124 


58 








RHAla(AF078683) 






7710 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


124 


56 








gene No. 107. 






7711 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


62 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 










lueniii 










Score 


y 








7795. 






7712 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


116 


72 


7713 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


194 


58 


7714 


X92485 


Plasmodium 


pval 


123 


68 






vivax 








7715 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


48 








7878. 






7716 


AF119851 


Homo sapiens 


PR01722 


168 


60 


7717 


U93570 


Homo sapiens 


putative pi 50 


133 


35 


7718 


AF1 16712 


Homo sapiens 


PR02738 . 


94 ■ 


68 


7719 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


107 


55 








terminal, clone XEH.8c} 






7720 


U49973 


Homo sapiens 


ORF2: function unknown 


200 


85 


7721 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


234 


73 








similar to pogo element 






7722 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


76 








4478. 






7723 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


65 








6953. 






7724 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


70 


73 








gene 22 clone HMSJW18. 






7725 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


55 








7870. 






7726 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


130 


75 








terminal, clone XEH.8c} 






7727 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


94 


78 








encoded by gene 90. 






7728 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


163 


65 








7881. 






7729 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


111 


63 








gene 22 clone HMSJW18. 






7730 


W80406 


Homo sapiens 


A secreted protein encoded by clone 


119 


69 








dh40_3. 






7731 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


84 








7075. 






7732 


Y08769 


Ratals 


microvascular endothelial 


94 


38 






norvegicus 


differentiation gene 2 






7733 


U09500 


Homo sapiens 


cytochrome b 


463 


84 


7734 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


334 


92 


7735 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


399 


92 


7736 


Y12950 


Homo sapiens 


Amino acid sequence of a human 


128 


55 








secreted peptide. 






7737 


AC003058 


Arabidopsis 


unknown protein 


167 


66 






thaliana 








7738 


W48353 


Homo sapiens 


Human breast cancer related protein 


86 


70 








BCFLT2. 






7739 


D86854 


Catharanthus 


extensin 


154 


40 






roseus 








7740 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


57 








7324. 






7741 


X86031 


Sus scrofe 


homologue of prolme/arginine-rich 


99 


49 








antibacterial protein 






7742 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


35 








6834. 






7743 


V00662 


Homo sapiens 


cytochrome oxidase I 


568 


89 


7744 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


145 


93 


7745 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


154 


100 


7746 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


133 


61 








RHAla(AF078683) 
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orn irk 

NO: 


Accession No. 


Species 


Description 


omira- 

WfltermaD 

Score 


% 

Identit 

y 


7747 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


126 


71 


7748 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


327 


92 


7749 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


150 


71 


7750 


AF1 30079 


Homo sapiens 


PR02852 


147 


81 


7751 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


104 


49 


7752 


U43627 


Arabidopsis 
thaliana 


extensin 


116 


37 


7753 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


133 


7i 


7754 


D38112 


Homo sapiens 


N ADH dehydrogenase subunit 1 


551 


83 


7755 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


88 


56 


7756 


AF119851 


Homo sapiens 


PR01722 


97 


56 


7757 


U80955 


Caenorhabditis 
elegans 


similar to RE repeat region of 
PHLS27806 D. melanogaster homeotic 
protein BarH2 protein (dual bar 
protein) 


89 


100 


7758 


AC002045 


Homo sapiens 


Unknown protein product CIT987SK- 
A-589H1J splice form 2 


97 


94 


7759 


Y02775 


Homo sapiens 


Human secreted protein encoded by 
gene 12 clone HFTCU19. 


288 


98 


7760 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


94 


80 


7761 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


199 


76 


7762 


AF090931 


Homo sapiens 


PRO0483 


153 


84 


7763 


X92485 


Plasmodium 
vivax 


pval 


114 


62 


7764 


AF090931 


Homo sapiens 


PRO0483 


107 


86 


7765 


G02752 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6833. 


103 


43 


7766 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


168 


73 


7767 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


148 


72 


7768 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


222 


83 


7769 


Y40039 


Homo sapiens 


Peptide sequence derived from a 
human secreted protein. 


68 


72 


7770 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


156 


62 


7771 


U93569 


Homo sapiens 


p40 


188 


43 


7772 


AL049934 


Homo sapiens 


hypothetical protein 


113 


47 


7773 


AF090931 


Homo sapiens 


PRO0483 


86 


65 


7774 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


137 


92 


7775 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


143 


65 


7776 


AF126163 


Homo sapiens 


HHLA3 protein 


141 


71 


7777 


G00423 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4504. 


93 


47 


7778 


AF090942 


Homo sapiens 


PRO0657 


148 


74 


7779 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


194 


83 


7780 


U14966 


Homo sapiens 


ribosomal protein L5 


315 


81 
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SEQ ID 


Accession No. 


Sneries 


i/uu i piiun 






NO: 








Waterman 


ldentit 










Score 


y 


7781 


R59843 


Homo sapiens 


ApoE4Lx2 protease. 


126 


47 


7782 


AF 119855 


Homo sapiens 


PR01847 


131 


51 


7783 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


79 


53 








4425. 






7784 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


405 


82 


7785 


G00331 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


64 


42 








4412. 






7786 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


141 


59 








SEQIDNO:163. 






7787 


AF1 18082 


Homo sapiens 


PRO1902 


131 


48 


7788 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


64 








7795. 






7789 


J03807 


Strongylocentrot 


histone HI -delta 


111 


43 






us purpuratus 








7790 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


139 


85 


7791 


V00662 


Homo sapiens 


cytochrome oxidase I 


522 


80 


7792 


Y36366 


Homo sapiens 


Fragment of human secreted protein 


153 


72 








encoded by gene 3. 






7793 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


53 








6913. 






7794 


W48351 


Homo sapiens 


Human breast cancer related protein 


82 


46 








BCRB2. 






7795 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


156 


76 








SEQ ID NO: 163. 






7796 


D38112 


Homo sapiens 


ATPase subunit 8 


166 


77 


7797 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


159 


48 








SEQ ID NO: 163. 






7798 


AF090931 


Homo sapiens 


PRO0483 


132 


75 


7799 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


57 








7795. 






7800 


D21261 


Homo sapiens 


similar to human 22kDa, SM22 mRNA 


99 


84 








(HUM22SM). 






7801 


X64707 


Homo sapiens 


BBC! 


232 


77 


7802 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


33 








7879. 






7803 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


112 


50 








SEQ ID NO: 163. 






7804 


Y36203 


Homo sapiens 


Human secreted protein #75. 


142 


56 


7805 


AF1 19851 


Homo sapiens 


PR01722 


88 


46 


7806 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


57 








7871. 






7807 


X68600 


Hordeum 


pZE40 


90 


37 






vulgare 








7808 


AK025047 


Homo sapiens 


unnamed protein product 


138 


54 


7809 


AF 130089 


Homo sapiens 


PRO2550 


375 


71 


7810 


G01249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


65 








5330. 






7811 


R59842 


Homo sapiens 


ApoE4Ll protease. 


156 


58 


7812 


AF016446 


Caenorhabditis 


similar to C. elegans cuticulin precursor 


95 


42 







elegans 


CUT-2 (SP-.P34682) 






7813 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


154 


65 


7814 


AK025047 


Homo sapiens 


unnamed protein product 


165 


71 


7815 


W34499 


Homo sapiens 


Obesity receptor C protein. 


309 


89 


7816 


AJ222673 


Hepatitis C virus 


core protein 


95 


42 






type 2c 








7817 


AB011483 


Arabidopsis 


contains similarity to root cap 


104 


61 






thaliana 


protein-gene Jd:MUF9. 1 6 






7818 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


74 








6444. 







338 



WO 01/64835 



PCT/US01/04927 



NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 
y 


7819 


X92485 


Plasmodium 
vivax 


pval 


112 


61 


7820 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


103 


60 


7821 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


174 


65 


7822 


AF130089 


Homo sapiens 


PRO2550 


107 


67 


7823 


AF 118086 


Homo sapiens 


PR01992 


143 


73 


7824 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. . 


132 


63 


7825 


AL390114 


Leishmania 
major 


probable (tahv-6) ul 102, variant a 
DNA, complete virion genome 


142 


59 


7826 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


109 


42 


7827 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


128 


65 


7828 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


104 


58 


7829 


AF132961 


Homo sapiens 


CGI-27 protein 


154 


49 


7830 


X70343 


Nicotiana 
sylvestris 


extensin 


105 


30 


7831 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


100 


62 


7832 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


129 


51 


7833 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


99 


46 


7834 


G02469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6550. 


74 


65 


7835 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


163 


55 


7836 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7563. 


72 


52 


7837 


AF090942 


Homo sapiens 


PRO0657 


126 


59 


7838 


Z33426 


Human 
respiratory 
syncytial virus 


glycoprotein 


110 


34 


7839 


D63480 


Homo sapiens 


The KIAA0146 gene product is novel. 


158 


89 


7840 


W48352 


Homo sapiens 


Human breast cancer related protein 
BCFLTL 


88 


63 


7841 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


156 


60 


7842 


W90847 


Homo sapiens 


Human lymphocyte targeted peptide 
#15. 


145 


100 


7843 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


127 


66 


7844 


M28016 


Homo sapiens 


cytochrome b 


214 


78 


7845 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


283 


82 


7846 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


94 


73 


7847 


D38112 


Homo sapiens 


cytochrome b 


525 


83 


7848 


L27428 


Homo sapiens 


reverse transcriptase 


210 


67 


7849 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


156 


96 


7850 


X92485 


Plasmodium 
vivax 


pval 


121 


54 


7851 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


147 


59 


7852 


D38112 " 


Homo sapiens 


NADH dehydrogenase subunit 2 


191 


93 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


7853 


W48808 


Homo sapiens 


Homo saniens clone CGI 09 1 nrotein 


72 


69 


7854 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


142 


76 


7855 


U43627 


Arabidopsis 
thaliana 


extensin 


124 


35 


7856 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 
RHAla(AF078683) 


133 


95 


7857 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


95 


57 


7858 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


157 


91 


7859 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6834. 


108 


45 


7860 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


159 


54 


7861 


G02872 


Homo saniens 


Human secreted nrotein SEO ID NO* 
6953. 


155 


63 
\i<j 


7862 


D38112 


Homo saniens 


cvtochrome c oxidase suhunit 3 


265 


11 


7863 


AP000616 


Orvza sativa. 


similar to RJNG-H2 finder nrotein 
RHAla(AF078683) 


145 


56 


7864 


Y86248 


Homo saniens 


Human secreted nrotein HCHPF68 
SEQIDNO:163. 


222 


100 


7865 


D38112 


Homo sapiens 


cvtochrome c oxidase suhunit 1 


409 




7866 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla CAF078683^ 


112 


50 


7867 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO* 163 


169 


74 


7868 


AL031004 


Arabidopsis 
thaliana 

u muni ifii 


RSZp22 splicing factor 


127 


35 


7869 


U83771 


Accipiter 
sunerci Iiosui 


cytochrome oxidase I 


168 


73 


7870 


U83771 


Accipiter 
sunercilkreus 


cytochrome oxidase I 


186 


81 


7871 


G04072 


Homo saniens 


Human secreted nrotein SFO TD "NO* 

8153. 


203 


~i \j 


7872 


Y08062 


Homo saniens 


Human PR0245 nrotein fragment 
derived from DN A3 563 8 . 


130 


71 


7873 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


106 


44 


7874 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


138 


69 


7875 


AFl 19855 


Homo saniens 


PR01847 


174 


70 


7876 


AF210651 


Homo sapiens 


NAG 18 


164 


76 


7877 


Z93244 


Homo saniens 


bKl 16F5.2 (PUTATIVE RhoGAP 
fCDC42 GTPAse Activatins Protein') 
LIKE protein) 


111 

AAA 


45 


7878 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


227 


79 


7879 


AK024455 


Homo sapiens 


FLJ00047 protein 


120 


68 


7880 


Y36156 


Homo sapiens 


Human secreted protein #28. 


85 


68 


7881 


AFl 16661 


Homo sapiens 


PR01438 


138 


61 


7882 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


141 


55 


7883 


Y45382 


Homo sapiens 


Human secreted nrotein fra&ment 
encoded from gene 28. 


111 


63 


7884 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


106 


64 


7885 


AK022821 


Homo sapiens 


unnamed protein product 


175 


72 


7886 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


117 


59 | 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

waterman 

Score 


% 

identic 

y 


7887 


L27428 


Homo sapiens 


reverse transcriptase 


165 


63 


7888 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


139 


61 


7889 


MI5530 


Homo sapiens 


B-cell growth factor 


132 


74 


7890 


Y01181 


Homo sapiens 


Polypeptide fragment encoded by gene 
12. 


SI 


64 


7891 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


150 


61 


7892 


G01249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5330. 


154 


73 


7893 


AF130089 


Homo sapiens 


PRO2550 


224 


61 


7894 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


127 


69 


7895 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


142 


62 


7896 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


149 


50 


7897 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


10S 


68 


7898 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


144 


65 


7899 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


105 


83 


7900 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


104 


57 


7901 


X03717 


Homo sapiens 


pot unidentified reading frame 


105 


46 


7902 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


94 


67 


7903 


X6I046 


Hydra sp. 


mini-collagen 


107 


46 


7904 


AF124726 


Homo sapiens 


acinusL 


111 


44 


7905 


U77049 


Ovis aries 


bactinecin 11 1 


116 


38 


7906 


AF 1 18086 


Homo sapiens 


PR01992 


130 


73 


7907 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


121 


48 


7908 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


124 


83 


7909 


AC004450 


Arabidopsis 
thaliana 


putative extensin 


167 


34 


7910 


AF130079 


Homo sapiens 


PR02852 


110 


42 


7911 


AF090942 


Homo sapiens 


PRO0657 


144 


66 


7912 


AF090931 


Homo sapiens 


PRO0483 


96 


80 


7913 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


128 


46 


7914 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


87 


68 


7915 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


105 


75 


7916 


X92485 


Plasmodium 
vivax 


pval 


102 


67 


7917 


AF090942 


Homo sapiens 


PRO0657 


126 


76 


7918 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


108 


47 


7919 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


156 


63 


7920 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


138 


61 


7921 


L26251 


Trypanosoma 
brucei 


CR5 


95 


37 


7922 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


176 


73 
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SEQ ID 


/*v*"Caaiun lit). 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Identit 










Score 


y 


7923 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


174 


61 








6613. 






7924 


L27428 


Homo sapiens 


reverse transcriptase 


121 


33 


7925 


Y36156 


Homo sapiens 


Human secreted protein #28. 


137 


67 


7926 


Y36495 


Homo sapiens 


Fragment of human secreted protein 


119 


73 








encoded by gene 27, 






7927 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


65 








4718. 






792S 


Y02693 . 


Homo sapiens 


Human secreted protein encoded by 


138 


60 








gene 44 clone HTDAD22. 






7929 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


61 








6913. 






7930 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


144 


87 


7931 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


176 


53 








6613. 






7932 


Y91577 


Homo sapiens 


Human secreted protein sequence 


179 


60 








encoded by gene 2 SEQ ID NO:250. 






7933 


V00662 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 


279 


92 


7934 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


132 


63 








SEQIDNO:163. 






7935 


W48351 


Homo sapiens 


Human breast cancer related protein 


101 


64 








BCRB2. 






7936 


AF010144 


Homo sapiens 


neuronal thread protein AD7oNTP 


159 


55 


7937 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


320 


80 








clone HPMBQ32. 






7938 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


71 








4484. 






7939 


AF 144054 


Homo sapiens 


apoptosis related protein APR-4 


127 


65 


7940 


Y91577 


Homo sapiens 


Human secreted protein sequence 


314 


73 








encoded by gene 2 SEQ ID NO:250. 






7941 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


121 


57 








encoded by gene 17. 






7942 


AB04O936 


Homo sapiens 


KIAA 1503 protein 


149 


71 


7943 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


83 








7437. 






7944 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


74 


60 








encoded by gene 90. 






7945 


Y13141 


Bromheadia 


extensin 


68 


35 






finlaysoniana 








7946 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 


126 


50 






derived from DNA35638. 






7947 


AP000735 


Arabidopsis 


extensin protein-like 


214 


40 






thaliana 








7948 


AF161356 


Homo sapiens 


HSPC093 


134 


75 


7949 


AF090942 


Homo sapiens 


PRO0657 


110 


48 


7950 


AJ249395 


Globodera 


cytochrome b 


117 


36 






pallida 








7951 


AF1 16661 


Homo sapiens 


PR01438 


135 


49 


7952 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


189 


92 








SEQIDNO:163. 






7953 


M15530 


Homo sapiens 


B-cell growth factor 


143 


65 


7954 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


68 








787 L 






7955 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


62 









7253. 






7956 


AF1 30089 


Homo sapiens 


PRO2550 


153 


62 


7957 


AL160371 


Leishmania 


probable (hhv-6) ul 102, variant a 


99 


48 






major 


DNA, complete virion genome 






7958 


AF161356 


Homo sapiens 


HSPC093 


154 


56 
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NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Ideatit 

y 


7959 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


131 


57 


7960 


X92485 


Plasmodium - 
vivax 


pval 


107 


53 


7961 


AF210651 


Homo sapiens 


NAG 18 


113 


77 


7962 


AF130051 


Homo sapiens 


PRO0898 


84 


71 


7963 


R59842 


Homo sapiens 


ApoE4Ll protease. 


116 


68 


7964 


AF090931 


Homo sapiens 


PRO0483 


129 


56 


7965 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
IDNO:1051. 


151 


51 


7966 


R95913 


Homo sapiens 


Neural thread protein. 


110 


40 


7967 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


180 


50 


7968 


AF090894 


Homo sapiens 


PRO0113 


104 


58 


7969 


Y21040 


Homo sapiens 


Human glial fibrillary acidic protein 
GFAP mutant fragment 49. 


285 


100 


7970 


AK001452 


Homo sapiens 


unnamed protein product 


346 


56 


7971 


V00672 


Pan troglodytes 


reading frame protein 4 


134 


90 ! 


7972 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


165 


48 


7973 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


104 


84 


7974 


AF116661 


Homo sapiens 


PR01438 


140 


62 


7975 


AF 119882 


Homo sapiens 


PR02492 


112 


48 


7976 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


80 


65 


7977 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.8c} 


124 


89 


7978 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


125 


36 


7979 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


149 


68 


7980 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


289 


84 


7981 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


59 


7982 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


109 


69 


7983 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


149 


73 


7984 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


128 


72 


7985 


R59842 


Homo sapiens 


ApoE4Ll protease. 


130 


82 


7986 


AF090895 


Homo sapiens 


PRO0117 


83 


58 


7987 


X07881 


Homo sapiens 


proline-rich protein Gl 


131 


41 


7988 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


123 


53 


7989 


AF090894 


Homo sapiens 


PRO0113 


104 


61 


7990 


Z93244 


Homo sapiens 


bKI 16F5.2 (PUTATIVE RhoGAP 
(CDC42 GTPAse Activating Protein) 
LIKE protein) 


100 


61 


7991 


AK024455 . 


Homo sapiens 


FLJ00047 protein 


131 


56 


7992 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


136 


61 


7993 


AF210651 


Homo sapiens 


NAG18 


133 


61 


7994 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


92 


42 


7995 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


171 


82 


7996 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


113 


54 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


7997 


AF090928 


Homo sapiens 


PRO0470 


86 


76 


7998 


AJ251579 


Arabidopsis 
thaliana 


cef protein 


113 


37 


7999 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


119 


60 


8000 


R95913 


Homo sapiens 


Neural thread protein. 


105 


70 


8001 


AF229067 


Homo sapiens 


PADI-H protein 


118 


61 


8002 


W88609 


Homo sapiens 


Secreted protein encoded by gene 76 
clone HKDCL73. 


83 


65 


8003 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


85 


61 


8004 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


123 


43 


8005 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 12 1 . 


93 


68 


8006 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


147 


64 


8007 


AF016099 


Mus musculus 


endonuclease/reverse transcriptase 


147 


38 


8008 


G03714 


Homo sapiens - 


Human secreted protein, SEQ ID NO: 
7795. 


150 


72 


8009 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


123 


71 


8010 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


92 


54 


8011 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4670, 


152 


59 


8012 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


123 


66 


8013 


AF161356 


Homo sapiens 


HSPC093 


185 


64 


8014 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


107 


54 


8015 


G00413 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4494, 


71 


65 


8016 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


139 


66 


8017 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


94 


77 


8018 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


108 


46 


8019 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


178 


61 


8020 


G03639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7720. 


85 


65 


8021 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


108 


51 


8022 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


202 


77 


8023 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. j 


195 


68 


8024 


G03786 


Homo sapiens 


Human secreted protein, SEQ. ID NO: 
7867. 


158 


53 


8025 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


133 


58 


8026 


X92485 


Plasmodium 
vivax 


pval 


128 


56 


8027 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


135 


61 


8028 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


105 


66 


8029 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


126 


59 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






rralti lud II 


¥ri unfit 

identic 










Score 


y 


8030 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


40 








7874. 






8031 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


72 








6536. 






8032 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


41 








8153. 






8033 


Y40039 


Homo sapiens 


Peptide sequence derived from a 


113 


69 








human secreted protein. 






8034 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


102 


66 








encoded by gene 17. 






8035 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


221 


95 








SEQ ID NO: 163. 






8036 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


106 


36 


8037 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


113 


55 








6536. 






8038 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 


108 


64 








clone HCACJ81. 






8039 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


110 


68 


8040 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


169 


86 








7519. 






8041 


Y73966 


. Homo sapiens 


Human prostate tumor EST fragment 


170 


56 








derived protein #153. 






8042 


W48351 


Homo sapiens 


Human breast cancer related protein 


95 


58 








BCRB2. 






8043 


AK024455 


Homo sapiens 


FLJ00047 protein 


152 


68 


8044 


AF220264 


Homo sapiens 


MOST-1 


83 


68 


8045 


AK024455 


Homo sapiens 


FLJ00047 protein 


131 


55 


8046 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 


181 


78 








derived protein #153. 






8047 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


147 


61 








gene No. 107. 






8048 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 


102 


43 








sequence. 






8049 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


117 


59 


8050 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


56 








6619. 






8051 


D00570 


Mus musculus 


open reading frame (196 AA) 


127 


59 


8052 


AF040964 


Homo sapiens 


unknown protein IT1 


143 


50 


8053 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68 9 


168 


83 








SEQ IDNO:163. 






8054 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


70 








7868. 






8055 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


62 








6619. 






8056 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


112 


65 








SEQIDNO:163. 






8057 


AF090895 


Homo sapiens 


PRO0117 


110 


77 


8058 


AP000616 


Oryza sati va 


similar to RING-H2 finger protein 


124 


90 








RHAla(AF078683) 






8059 


AF1 18082 


Homo sapiens 


PRO1902 


117 


50 


8060 


AK022217 


Homo sapiens 


unnamed protein product 


145 


53 


8061 


AF130089 


Homo sapiens 


PRO2550 


160 


76 


8062 


AF119851 


Homo sapiens % 


PR01722 


177 


54 


8063 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


362 


73 


8064 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


426 


85 


8065 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


344 


90 


8066 


AF068294 


Homo sapiens 


HDCMB45P 


146 


60 


8067 


V00662 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 


461 


87 


8068 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


504 


90 
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CPA Trk 

NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


8069- 


X86031 


Sus scrofa 


homologue of proline/arginine-rich 
antibacterial protein 


97 


40 


8070 


R59842 


Homo sapiens 


ApoE4Ll protease. 


147 


47 


8071 


AF1 19851 


Homo sapiens 


PR01722 


139 


59 


8072 


X92485 


Plasmodium 
vivax 


pval 


183 


45 


8073 


AF1 16712 


Homo sapiens 


PR02738 


95 


52 


8074 


AF213465 


Homo sapiens 


dual oxidase 


184 


80 


8075 


AF090895 


Homo sapiens 


PRO0117 


153 


57 


8076 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


162 


73 


8077 


T85403_cdl 


Homo sapiens 


19-APR-1993 Human cadherin-11 
coding sequence. 


284 


98 


8078 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
IDNO:1051. 


99 


63 


8079 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


162 


73 


8080 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


142 


52 


8081 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


130 


37 


8082 


X05285 


Drosophila 
melanogaster 


fibrillarin 


100 


34 


8083 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


142 


65 


8084 


G02216 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6297. 


380 


82 


8085 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


146 


64 


8086 


X92485 


Plasmodium 
vivax 


pval 


138 


64 


8087 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


117 


82 


8088 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


180 


79 


8089 


U27143 


Homo sapiens 


protein kinase C inhibitor-I 


119 


41 


8090 


AF1 19851 


Homo sapiens 


PR01722 


85 


47 


8091 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


112 


73 


8092 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


133 


64 


8093 


AF1 19854 


Homo sapiens 


PR01843 


522 


100 


8094 


X92485 


Plasmodium 
vivax 


pval 


115 


82 


8095 


D38112 


Homo sapiens 


NADH dehydrogenase subunit I 


598 


86 


8096 


AF097473 


Mus musculus 


ORFl 


109 


46 


8097 


AF097473 


Mus musculus 


ORF1 


109 


46 


8098 


Y17833 


Human 
endogenous 
retrovirus K 


pol protein 


281 


73 


8099 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


156 


69 


8100 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


126 


42 


8101 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


126 


65 


8102 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


145 


70 


8103 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


163 


71 
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SEQ1D 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Wfltennan 


Identit 










Score 


y 


8104 


AF130089 


Homo sapiens 


PRO2550 


130 


42 


8105 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


57 








7492. 






8106 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 


132 


61 








derived protein #153. 






8107 


AF090942 


Homo sapiens 


PRO0657 


123 


41 


8108 


AF010144 


Homo sapiens 


neuronal thread protein AD7c~NTP 


134 


38 


8109 


AL160371 


Leishmania 


probable (hhv-6) ul 102, variant a 


101 


77 






major 


DNA, complete virion genome 






8110 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


340 


90 


8111 


AF130089 


Homo sapiens 


PRO2550 


123 


86 


8112 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


122 


90 








RHAla(AF078683) 






8113 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


182 


47 








6953. 






8114 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


77 








4435. 






8115 


G00452 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


72 


50 








4533. 






8116 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


459 


88 


8117 


L17318 


Rattus 


proline-rich proteoglycan 


110 


45 






norvegicus 








8118 


S79410 


Mus musculus 


nuclear localization signal binding 


125 


62 








protein 






8119 


G03639 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


106 


78 








7720. 






8120 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


106 


84 


8121 


GO2490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


45 








6571. 






8122 


D86854 


Catharanthus 


extensin 


142 


35 






roseus 








8123 


X92485 


Plasmodium 


pval 


144 


53 






vivax 








8124 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


59 








7870. 






8125 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


163 


67 








6613. 






8126 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


145 


62 








gene 22 clone HMSJW18. 






8127 


X07881 


Homo sapiens 


proline-rich protein Gl 


142 


37 


8128 


X70343 


Ntcotiana 


extensin 


102 


29 






sylvestris 








8129 


W48351 


Homo sapiens 


Human breast cancer related protein 


113 


48 








BCRB2. 






8130 


AF090895 


Homo sapiens 


PRO0117 


150 


71 


8131 


D00570 


Mus musculus 


open reading frame (251 AA) 


130 


41 


8132 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


61 








6613. 






8133 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


128 


61 








encoded by gene 17. 






8134 


AF130079 


Homo sapiens 


PR02852 


181 


77 


8135 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


58 








7795. 






8136 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


64 








7437. 






8137 


W50192 


Homo sapiens 


Amino acid sequence of salivary 


112 


43 








protein CON-1. 






8138 


AF 119900 


Homo sapiens 


PR02822 


113 


57 


8139 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 3 


427 


86 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

1114111 

Score 


% 

Id en tit 

y 


8140 


AF004341 


Homo sapiens 


cytochrome c oxidase subunit III 


155 


88 


8141 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


163 


58 


8142 


D70836 


Homo sapiens 


Zinc-finger protein 


156 


47 


8143 


AC002291 


Arabidopsis 
thaliana 


extensin 


153 


37 


8144 


X55683 


Lycopersicon 
esculentum 


extensin (class I) 


85 


34 


8145 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


100 


76 


8146 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


10S 


63 


8147 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


110 


79 


8148 


M22332 


Homo sapiens 


unknown protein 


156 


40 


8149 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIR:A41726) 


92 


76 


8150 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


131 


66 


8151 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


441 


76 


8152 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


155 


67 


8153 


AF130089 


Homo sapiens 


PRO2550 


129 


42 


8154 


AF130089 


Homo sapiens 


PRO2550 


173 


50 


8155 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


140 


64 


8156 


G02532 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6613. 


171 


82 


8157 


L27428 


Homo sapiens 


reverse transcriptase 


181 


50 


8158 


U11271 


Homo sapiens 


thromboxane A2 receptor 


138 


73 


8159 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


145 


53 


8160 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


139 


58 


8161 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


155 


59 


8162 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


141 


64 


8163 


AF161361 


Homo sapiens 


HSPC098 


89 


66 


8164 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


117 


58 


8165 


AF130089 


Homo sapiens 


[ PRO2550 


121 


63 


8166 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


148 


85 


8167 


X92485 


Plasmodium 
vivax 


pval 


97 


56 


8168 


X92485 


Plasmodium 
vivax 


pval 


115 


57 


8169 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


124 


61 


8170 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


123 


60 


8171 


X92485 


Plasmodium 
vivax 


pval 


105 


44 


8172 


Y19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


75 


64 


8173 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


106 


65 


8174 


X86031 


Sus scrofa 


homologue of proline/arginine-rich 
antibacterial protein 


108 


44 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

W !)t>rmQn 
Yv a ICI Ul ft II 

Score 


% 

laenm 

y 


8175 


Y36203 


Homo sapiens 


Human secreted protein #75. 


150 


51 


8176 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


114 


41 


Sill 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


135 


68 


8178 


AF090928 


Homo sapiens 


PRO0470 


119 


72 


8179 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


141 


75 


8180 


AF220264 


Homo sapiens 


MOST-1 


94 


61 


8181 


AF1 19851 


Homo sapiens 


PR01722 


143 


73 


8182 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


134 


64 


8183 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


143 


59 


8184 


X92485 


Plasmodium 
vivax 


pval 


127 


72 


8185 


X92485 


Plasmodium 
vivax 


pval 


130 


65 


8186 


AF1 19851 


Homo sapiens 


PR01722 


285 


64 


8187 


AF090942 


Homo sapiens 


PRO0657 


137 


38 


8188 


AF090931 


Homo sapiens 


PRO0483 


154 


66 


8189 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


148 


43 


8190 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


268 


79 


8191 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


no 


43 


8192 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


129 


78 


8193 


AF1 13685 


Homo sapiens 


PRO0974 


86 


52 


8194 


Y36203 


Homo sapiens 


Human secreted protein #75. 


132 


61 


8195 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


225 


77 


8196 


S80119 


Rattussp. 


reverse transcriptase homolog 


105 


40 


8197 


X86031 


Sus scrofa 


homologue of prolme/arginine-rich 
antibacterial protein 


124 


39 


8198 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


121 


60 


8199 * 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


91 


68 


8200 


Y36203 


Homo sapiens 


Human secreted protein #75. 


131 


56 


8201 


Y36203 


Homo sapiens 


Human secreted protein #75. 


136 


44 


8202 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


166 


48 


8203 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


125 


50 


8204 


Y36203 


Homo sapiens 


Human secreted protein #75. 


127 


58 


8205 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


139 


61 


8206 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


190 


92 


8207 


AF090931 


Homo sapiens 


PRO0483 


160 


87 


8208 


AF126163 


Homo sapiens 


HHLA3 protein 


107 


67 


8209 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


93 


41 


8210 


Y36156 


Homo sapiens 


Human secreted protein #28. 


100 


62 


8211 


AF161356 


Homo sapiens 


HSPC093 


73 


60 


8212 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


107 


34 


8213 


AB046048 


Macaca 


unnamed portein product 


146 


61 



349 



WO 01/64835 



PCTYUS01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 
v 






fascicularis 








8214 


AF130079 


Homo sapiens 


PR02852 


121 


59 


8215 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 
derived protein #153. 


109 


50 


8216 


AF090930 


Homo sapiens 


PRO0478 


153 


79 


8217 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


105 


60 


8218 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


135 


56 


8219 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


102 


42 


8220 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


95 


32 


8221 


X52235 


Homo sapiens 


ORFII 


174 


39 


8222 


AF130089 


Homo sapiens 


PRO2550 


159 


67 


8223 


AF068294 


Homo sapiens 


HDCMB45P 


161 


58 


8224 


M62387 


Oryctolagus 
cuniculus 


ubiquitin conjugating-protein 


219 


90 


8225 


AF1 13685 


Homo sapiens 


PRO0974 


136 


65 


8226 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


430 


85 


8227 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


115 


58 


8228 


Y36156 


Homo sapiens 


Human secreted protein #28. 


117 


50 


8229 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


130 


44 


8230 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


145 


50 


8231 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


105 


46 


8232 


S62928 


Homo sapiens 


PRB1M protein precursor 


113 


37 


8233 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


142 


62 


8234 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


104 


48 


8235 


U12695 


Pan paniscus 


cytochrome oxidase subunit II 


259 


84 


8236 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


116 


61 


8237 


AF1 16661 


Homo sapiens 


PR01438 


135 


56 


8238 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


107 


53 


8239 


U93572 


Homo sapiens 


p40 


168 


49 


8240 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


100 


72 


8241 


Z70292 


Homo sapiens 

£ 


chemokine COl 


231 


83 


8242 


AL160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


119 


82 


8243 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


97 


83 


8244 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


114 


78 


8245 


G00338 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4419. 


81 


65 


8246 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


74 


60 


8247 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


115 


58 


8248 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


127 


58 


8249 


G03789 


Homo sapiens 


Human sscreted protQin 9 SEQ ID NO: 
7870. 


107 


84 


8250 


AF 118082 


Homo sapiens 


PRO1902 


136 


50 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- n 

Waterman 

Score 


% 

Identit 

y 


8251 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


196 


59 


8252 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


106 


81 


8253 


AF130089 


Homo sapiens 


PRO2550 


143 


53 


8254 


AK024455 


Homo sapiens 


FU00047 protein 


151 


68 


8255 


AK022821 


Homo sapiens 


unnamed protein product 


140 


68 


8256 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


143 


53 


8257 


M24509 


Mus musculus 


ferritin heavy chain 


347 


69 


8258 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


142 


57 


8259 


U93570 


Homo sapiens 


p40 


254 


57 


8260 


AF218028 


Homo sapiens 


unknown 


129 


71 


8261 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


77 


68 


8262 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


604 


90 


8263 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


132 


60 


8264 


AL139298 


Streptomyces 
coelicolorA3(2) 


putative membrane protein 


92 


34 


8265 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


138 


70 


8266 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


100 


64 


8267 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y | 
isoform 


113 


38 


8268 


X92485 


Plasmodium 
vivax 


pval 


102 


63 


8269 


X61048 


Hydra sp. 


mini-collagen 


110 


46 


8270 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


88 


82 


8271 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


121 


75 


8272 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


107 


48 


8273 


AF090944 


Homo sapiens 


PRO0663 


122 


42 


8274 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


81 


85 


8275 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


140 


54 


8276 


AF130089 


Homo sapiens 


PRO2550 


275 


56 


8277 


L27428 


Homo sapiens 


reverse transcriptase 


105 


27 


8278 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


112 


54 


8279 


G01246 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
5327. 


159 


73 


8280 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


109 


71 


8281 


AF104921 


Homo sapiens 


succinyl-CoA synthetase alpha subunit 


188 


42 


8282 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


117 


39 


8283 


X55683 


Lycopersicon 
esculentum 


extensin (class I) 


96 


36 


8284 


AF225910 


Mus musculus 


DAZ-associated protein 1 


117 


33 


8285 


AF130089 


Homo sapiens 


PRO2550 


347 


75 


8286 


AB046100 


Macaca 
fascicularis 


unnamed protein product 


140 


45 


8287 


X55684 


Lycopersicon 


extensin (class I) 


67 


45 
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cert in 
NO: 


Accession iio. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 






esculentum 








8288 


AF029670 


Homo sapiens 


Rad5 1 C truncated protein 


186 


69 


8289 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


149 


50 


8290 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


119 


67 


8291 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


152 


50 


8292 


D89999 


Gallus gallus 


muscle derived protein 


1134 


66 


8293 


LI 1566 


Homo sapiens 


ribosomai protein LI 8 


335 


69 


8294 


W61170 


Homo sapiens 


Partial polypeptide sequence from 
CP140 partial cDNA. 


441 


95 


8295 


S62077 


Homo sapiens 


HPlHs alpha=25 kda chromosomal 
autoantigen 


175 


94 


8296 


AF 119882 


Homo sapiens 


PR02492 


125 


48 


8297 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


156 


54 


8298 


AK025116 


Homo sapiens 


unnamed protein product 


259 


82 


8300 


X92485 


Plasmodium 
vivax 


pval 


162 


47 


8301 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. j 


157 


76 


8302 


AF092176 


Homo sapiens 


MHC class II antigen 


280 


84 


8303 


AF1 18082 


Homo sapiens 


PRO1902 


174 


52 


8304 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


105 


50 


8305 


AK023563 


Homo sapiens 


unnamed protein product 


244 


60 


8306 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


136 


66 


8307 


U15647 


Mus musculus 


reverse transcriptase 


107 


38 


8308 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


110 


40 


8309 


AB005540 


Rattusrattus 


PCTAIRE2 


166 


59 


8310 


M37679 


Mus musculus 


Ig heavy chain precursor 


95 


64 


8311 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


137 


82 


8312 


W29643 


Homo sapiens 


Human secreted protein CW383 1. 


312 


100 


8313 


AFl 13685 


Homo sapiens 


PRO0974 


116 


39 


8314 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


128 


69 


8315 


Z28407 


Homo sapiens 


ribosomai protein L8 


497 


78 


8316 


AE003536 


Drosophila 
melanogaster 


CG6451 gene product 


237 


51 


8317 


M19419 


Mus musculus 


proline-rich salivary protein 


115 


39 


8318 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


92 


53 


8319 


Y29862 


Homo sapiens 


Human secreted protein clone 
du515_21. 


399 


81 


8320 


Z34278 


Homo sapiens 


mucin 


103 


25 


8321 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


142 


91 


8322 


Y 19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


82 


62 


8323 


Z46791 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01391 (Collagen triple helix repeat 
(20 copies)), Score=67.1, E- 
value=1.2e-16, N=3; PF01484 
(Nematode cuticle collagen N-teiminal 
domain), Score=82.0, E-value=3.9e-21, 
N=l 


122 


37 


8324 


AF021232 


Homo sapiens 


TRAIL-R4-A 


113 


100 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 


8325 


AB046085 


Macaca 
fascicularis 


unnamed protein product 


292 


75 


8326 


AF 113944 


Muntiacus 
muntjak 


prion protein 


96 


39 


8327 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


125 


63 


8328 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


123 


62 


8329 


AJ005897 


Homo sapiens 


JM5 


275 


73 


8330 


AF1 19900 


Homo sapiens 


PR02822 


138 


76 


8331 


AF163573 


Homo sapiens 


CARKL 


359 


84 


8332 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


103 


37 


8333 


AF026246 


Homo sapiens 


HERV-E integrase 


354 


78 


8334 


J03756 


Homo sapiens 


hGH-V2 


362 


82 


8335 


AB001517 


Homo sapiens 


KNP-I alpha protein 


126 


76 


8336 


X97490 


Mus musculus 


PNG protein 


469 


98 


8337 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


159 


74 


8338 


V18354_cdl 


Homo sapiens 


17-NOV-1992 Human R20 seven 
transmembrane (7TM) receptor 
genomic DNA. 


379 


92 


8339 


V18354_cdl 


Homo sapiens 


17-NOV-1992 Human R20 seven 
transmembrane (7TM) receptor 
genomic DNA. 


402 


93 


8340 


Z11502 


Homo sapiens 


intestine-specific annexin 


385 


89 


8341 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


110 


54 


8342 


AC0031I3 


Arabidopsis 
thaliana 


F240I.6 


68 


50 


8343 


X12517 


Homo sapiens 


C protein (AA 1-159) 


321 . 


82 


8344 


D86549 


Homo sapiens 


p97 homologous protein 


279 


96 


8345 


Y73384 


Homo sapiens 


HTRM clone 2284580 protein 
sequence. 


1513 


100 


8346 


AP001745 


Homo sapiens 


human cDNA DKFZp586F0422, 
Accession No. AL050173 


118 


40 


8347 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


121 


49 


8348 


AF091090 


Homo sapiens 


unknown 


125 


44 


8349 


AF106542 


Homo sapiens 


pregnancy-specific beta- 1 glycoprotein 
1 


145 


100 


8350 


AF090942 


Homo sapiens 


PRO0657 


252 


65 


8351 


L23116 


Homo sapiens 


galctocerebrosidase 


185 


85 


8352 


AJ243803 


Streptomyces 
coelicolor A3(2) 


hypothetical protein 


97 


36 


8353 


Y11339 


Homo sapiens 


GalNAc alpha-2,6-sialyitransferase I 


306 


100 


8354 


U09477 


Homo sapiens 


p53-binding protein 


388 


100 


8355 


L27584 


Homo sapiens 


Ca channel B3 subunit 


197 


100 


8356 


AF071569 


Homo sapiens 


multifunctional calcium/calmodulin- 
dependent protein kinase II delta2 
isofonn 


574 


100 


8357 


W54282 


Homo sapiens 


Protein sequence of the di-alpha 
haemoglobin gene contained in pSSl. 


321 


81 


8358 


Y12916 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


759 


100 


8359 


AF038961 


Homo sapiens 


SL 15 protein 


166 


100 


8360 


W75093 


Homo sapiens 


Human secreted protein encoded by 
gene 37 clone HFVGS85. 


328 


90 


8361 


R95326 


Homo sapiens 


Tumor necrosis factor receptor 1 death 
domain ligand (clone 2DD). 


386 


97 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith. 

waterman 

Score 


'% 

Identit 

y 


8362 


AF026954 


Bostaurus 


pyruvate dehydrogenase phosphatase 
regulatory subunit precursor, PDPr 


269 


88 


8363 


X07855 


Homo sapiens 


G-Protein 


289 


98 


8364 


AL1 10247 


Homo sapiens 


hypothetical protein 


265 


97 


8365 


AB046774 


Homo sapiens 


K1AA1554 protein 


3619 


99 


8366 


M10050 


Homo sapiens 


fatty acid binding protein 


97 


100 


8367 


M20882 


Homo sapiens 


pregnancy-specific beta-gfycoprotein e 


1097 


99 


8368 


AB051901 


Homo sapiens 


VDUP1 


662 


98 


8369 


U72678 


Mus musculus 


EF-9 


176 


65 


8370 


AL1 17237 


Homo sapiens 


hypothetical protein 


478 


60 


8371 


M10617 


Homo sapiens 


L-FABP 


104 


84 


8372 


AJ249980 


Homo sapiens 


hypothetical protein 


1527 


97 


8373 


AF003136 


Caenorhabditis 
elegans 


contains weak similarity to an AMP- 
binding motif 


242 


55 


8374 


X91638 


Gallusgallus 


BRM protein 


225 


90 


8375 


W75093 


Homo sapiens 


Human secreted protein encoded by 
gene 37 clone HFVGS85. 


369 


100 


8376 


Z36714 


Homo sapiens 


cyclin F 


519 


95 


8377 


X86401 


Homo sapiens 


L-arginine: glycine amidinotransferase 


200 


97 


8378 


M12413 


Homo sapiens 


apolipoprotein B100 


1278 


99 


8379 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


159 


66 


8380 


AF151860 


Homo sapiens 


CGI-1 02 protein 


202 


77 


8381 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


255 


92 


8382 


AF017807 j 


Homo sapiens 


Arp2/3 complex 16kDa subunit 


418 


87 


8383 


U28727 


Homo sapiens 


pregnancy-associated plasma protein-A 
preproform 


196 


97 


8384 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


128 


85 


8385 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


203 


69 


8386 


AF090931 


Homo sapiens 


PRO0483 


145 


77 


8387 


S63758 


Mus sp. 


metallothionein-I gene transcription 
activator 


324 


98 


8388 


W89030 


Homo sapiens 


Polypeptide fragment encoded by gene 
165. 


174 


66 


8389 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4581. 


93 


70 


8390 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


157 


46 


8391 


V29245_cdl 


Homo sapiens 


15-NOV-1996 Nucleotide sequence of 
the human selenium-binding proteia 


421 


82 


8392 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


184 


77 


8393 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


99 


77 


8394 


AK023563 


Homo sapiens 


unnamed protein product 


285 


75 


8395 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


161 


75 


8396 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


121 


61 


8397 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


108 


70 


8398 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


57 


8399 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


108 


63 


8400 


W88607 


Homo sapiens 


Secreted protein encoded by gene 74 
clone HHSEG23. 


146 

( 


55 



354 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


8401 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


69 








4488. 






8402 


AF1 19851 


Homo saoiens 


PRO 1722 


133 


65 


8403 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


67 


76 








4488. 






8404 


L06498 


Homo sapiens 


ribosomal protein S20 


170 


94 


8405 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


58 








7795. 






8406 


AF1 18086 


Homo sapiens 


PRO 1992 


142 


SO 


8407 


G03800 


Homo saoiens 


Human secreted orotein. SEO ID NO* 


153 


81 








7881. 






8408 


AF090919 


Homo s aniens 


PRO0327 


96 


74 


$409 


G0253S 


Homo *5anien<i 


Human secreted orotein SEO ID NO* 


127 


62 








6619. 






8410 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


150 


100 








SEQIDNO:163. 






8411 


G02532 


Homo saoiens 


Human secreted orotein- SEO ID NO* 


103 


75 








6613. 






8412 


AF1 16715 


Homo sapiens 


PR02829 


158 


76 


8413 


AF229067 


Homo sapiens 


PADI-H protein 


137 


56 


8414 


G03793 


Homo aniens 


Human secreted orotein SEO ID NO* 


131 


77 








7874. 






8415 


AF068294 


Homo saoiens 


HDCMB45P 


132 


54 


8416 


W34499 


Homo cariiftn*? 


OHesitv recentor O nrotein 


251 


77 


8417 


G02872 


Homo saoiens 


Human secreted orotein SEO ID NO* 

IIUIUUU gWwl UJ UU/lil} Ui-lv 1.1-/ iivy. 


272 


57 








6953. 






8418 


S79410 


Mus musculus 


nuclear localization signal binding 


113 


54 














8419 


AK025047 


Homo saoiens 


unnamed nrotein oroduct 

UII1IMII IwlJ l/l \J will £JX VUUVi 


128 


58 


8420 


G03786 


Homo saoiens 


Human secreted orotein. SEO ID NO* 


130 


46 








7867. 






8421 


AL132841 


Caenorhabditis 


Y15E3A.3 


135 


56 






ele a ans 








8422 


AF1 30051 


Homo sapiens 


PRO0898 


164 


72 


8423 


G00092 


Homo vir>ipn<5 


Human secreted nrotein SFO TD NO* 


161 










4173. 






8424 




\Sl Y£M oaurtt 


Cf rm liar fo J? JN^t- fin <w»r nr/ite in 


I OR 

wo 


100 








RHAla ( AF078683^ 






8425 


GO0333 


Homo aniens 


Human secreted nrotein SEO ID NO* 


134 


75 








4414. 






8426 


G03798 


Homo saoiens 


Human secreted nrotein. SEO ID NO* 


91 


48 








7879. 






8427 


AF194641 


Homo sapiens 


immunoglobulin lambda light chain 


174 


69 








variable region 






8428 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


68 








7492. 






8429 


M64322 


Homo sapiens 


protein tyrosine phosphatase 


201 


100 


8430 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


135 


55 








RHAla (AF078683) 






8431 


AF090931 


Homo sapiens 


PRO0483 


158 


87 


8432 


J04495 


Macaca mulatta 


alpha-globin 


150 


100 


8433 


YI4482 


Homo sapiens 


Fragment of human secreted Drotein 


150 


78 








encoded by gene 17. 






8434 


Y02671 


Homo sapiens 


Human secreted orotein encoded bv 


137 


48 








gene 22 clone HMSJW18. 






8435 


AJ272324 


Homo sapiens 


adaptor molecule- 1 


333 


100 


8436 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


48 








6834. 






8437 


Ml 8085 


Homo sapiens 


glycoprotein lib 


154 


100 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


843S 


X92485 


Plasmodium 
vivax 


pval 


138 


64 


8439 


G03136 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7217. 


126 


56 


8440 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


105 


72 


8441 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


122 


55 


8442 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


139 


69 


8443 


Y36203 


Homo sapiens 


Human secreted protein #75. 


103 


70 


8444 


AB034633 


Mus musculus 


deubiquitinating enzyme 


129 


100 


8445 


U01849 


Trypanosoma 
brucei 


ORF2 


95 


41 


8446 


J05497 


Rattus 
norvegicus 


snRNP-associated polypeptide N 


115 


37 


8447 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


137 


50 


8448 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


222 


100 


8449 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


168 


94 


8450 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO;163. 


152 


63 


8451 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ1DN0:163. 


162 


91 


8452 


W78226 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1. 


804 


97 


8453 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


109 


100 


8454 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


138 


88 


8455 


AF099505 


Homo sapiens 


colon carcinoma related protein 


100 


63 


8456 


AF2175U 


Homo sapiens 


uncharacterized bone marrow protein 
BM034 


130 


96 


8457 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


155 


70 


8458 


L17007 


Didelphis 
marsupialis 


protamine 1 


101 


80 


8459 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


145 


62 


8460 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


121 


75 


8461 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


110 


81 


8462 


AF016371 


Homo sapiens 


U-snRNP-associated cyclophilin 


108 


70 


8463 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


80 


93 


8464 


AF130089 


Homo sapiens 


PRO2550 


190 


65 


8465 


AF197913 


Helicoverpa 
armigera nuclear 
polyhedrosis 
virus 


basic DNA-binding protein BDBP 


159 


55 


8466 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


183 


70 


8467 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


97 


60 


8468 


AF1 18082 


Homo sapiens 


PRO1902 


72 


42 


8469 


AF217449 


Schistosoma 
mekongi 


NADH dehydrogenase subunit 6 


107 


40 


8470 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


88 


75 
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SEQID 
NO. 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








7339. 






8471 


AF090931 


Homo sapiens 


PRO0483 


161 


79 


8472 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


127 


56 


8473 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


50 


8474 


AF022789 


Homo sapiens 


ubiquitin hydroiyzing enzyme I 


140 


100 


8475 


X03145 


Homo sapiens 


pot. ORF VI 


349 


81 


8476 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


65 


8477 


R59842 


Homo sapiens 


ApoE4Ll protease. 


115 


69 


8478 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


112 


56 


8479 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


140 


44 


8480 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


95 


70 


8481 


AF090931 


Homo sapiens 


PRO0483 


111 


67 


8482 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


122 


92 


8483 


M69297 


Homo sapiens 


ORF 3 


102 


39 


8484 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


148 


72 


8485 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


136 


65 


8486 


AF032906 


Homo sapiens 


cathepsin Z precursor; CTSZ 


299 


100 


8487 


X15917 


Paramecium 
aurelia 


ND2 protein (AA 1-193) (unusual start 
codon) 


94 


35 


8488 


U93574 


Homo sapiens 


putative pi 50 


109 


35 


8489 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


142 


72 


8490 


X92485 


Plasmodium 
vivax 


pval 


120 


85 


8491 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


109 


42 


8492 


Ml 5530 


Homo sapiens 


B-cell growth factor 


126 


64 


8493 


AF090895 


Homo sapiens 


PRO0117 


123 


60 


8494 


G02403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6484. 


166 


100 


8495 


AF1 16661 


Homo sapiens 


PR01438 


126 


63 


8496 


AF130089 


Homo sapiens 


PRO2550 


298 


70 


8497 


AF090895 


Homo sapiens 


PRO0117 


113 


75 


8498 


AF090930 


Homo sapiens 


PRO0478 


142 


68 


8499 


AF126163 


Homo sapiens 


HHLA3 protein 


139 


72 


8500 


G02360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6441. 


109 


75 


8501 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


98 


48 


8502 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


132 


63 


8503 


AF119851 


Homo sapiens 


PR01722 


107 


53 


8504 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


158 


67 


8505 


AF090931 


Homo sapiens 


PRO0483 


128 


Jw 


8506 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


126 


56 


8507 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


154 


82 


8508 


X92485 


Plasmodium 


pval 


83 


70 



357 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

waterman 

Score 


Identit 

y 






vivax 








S509 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


144 


50 


8510 


AL160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


124 


86 


8511 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


141 


51 


8512 


AF229067 


Homo sapiens 


PADI-H protein 


155 


76 


8513 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


170 


68 


8514 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


157 


66 


8515 


AF090930 


Homo sapiens 


PRO0478 


88 


80 


8516 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


134 


59 


8517 


KOI 664 


Drosophila 
melanogaster 


Bkm-like protein 


102 


92 


8518 


M31145 


Homo sapiens 


insulin-like growth factor binding 
protein precursor 


460 


96 


8519 


M18193 


Homo sapiens 


inter-alpha-trypsin inhibitor heavy 
chain old gene name TIT 


118 


41 


8520 


AF1 16689 


Homo sapiens 


PR02168 J 


127 


92 


8521 


AF130051 


Homo sapiens 


PRO0898 


133 


85 


8522 


L27428 


Homo sapiens 


reverse transcriptase 


201 


72 


8523 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XER8c} 


108 


74 


8524 


AF161356 


Homo sapiens 


HSPC093 


127 


60 


8525 


Y02926 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 101. 


95 


83 


8526 


AF 118086 


Homo sapiens 


PR01992 


95 


64 


8527 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


108 


50 


8528 


ABO46048 


Macaca 
fascicularis 


unnamed portein product 


154 


62 


8529 


AF1 30089 


Homo sapiens 


PRO2550 


150 


70 


8530 


Y03000 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


111 


55 


8531 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


108 


65 


8532 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


172 


59 


8533 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


395 


91 


8534 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


100 


45 


8535 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


107 


95 


8536 


G03800 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7881. 


128 


89 


8537 


Z28407 


Homo sapiens 


ribosomal protein L8 


242 


79 


8538 


AF130089 


Homo sapiens 


PRO2550 


139 


71 


8539 


Q55625_cdl 


Homo sapiens 


22-JUN-l 992 Human beta globin 5'- 
UTR-CDS-3'-UTR. 


274 


76 


8540 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


154 


61 


8541 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


69 


8542 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


149 


64 


8543 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


240 


61 



358 



WO 01/64835 



PC17US0 1/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Wattrman 

Score 


% 

Idcntit 

y 








6953. 






8544 


G02490 


Homo sapiens 


Human secreted nrotein SEO ID NO* 
6571. 


1 17 


85 


8545 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


110 


95 


8546 


AF1 19851 


Homo sapiens 


PR01722 


146 


80 


8547 


U49973 


Homo sapiens 


ORF2* function unknown 


105 


86 


8548 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


71 


8549 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


152 


53 


8550 


G02832 


Homo ^anien*i 


Human ^errpted nrofein SFO TD NO* 

6913. 


80 


40 


8551 


G02485 


Hnmo *sanien*5 

A AULllv/ Ou^/iwllO 


Human secreted nrotein SFO TD NO* 

A 1111 1 1 Oil JbUl^lvU yrl KJiXalH) OLiV* 11-/ liv/. 

6566. 


140 


74 


8552 


V00672 


Pan trop'loHvtp^ 


read in 7 frame nrotein 4 
lbctuiug ii axil c *Jiv"(*wiii *r 


142 


92 


8553 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795 

/ / 7J. 


164 


63 


8554 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

Of/ 17. 


148 


71 


8555 


AB046100 


Macaca 

■fiiQrirt 1 1» ric 

lAdwlvUlvU Id 


unnamed protein product 


144 


58 


8556 


G03789 


Homo <iar>if*n«i 


Human secreted nrotein SFO TD NO* 
7870. 


322 


73 


8557 


G02832 


Homo <;anien*i 

A AlSlAlvr OO.lJl\*llS> 


Human secreted nrotein SFO ID NO* 
6913. 


140 


62 


8558 


U3088R 


fTnmn ^anipni 
1 1VJ11HJ o<xpiwLio 


tRTJA-Onsmine TVanQtrlvoncvlftQe 


276 


ou 


8559 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881 


114 


71 


8560 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437 

l*Tj / • 


108 


72 


8561 


G03136 


Homo <*anten<: 


Human secreted nrotein SFO ID NO* 
7217. 


1 10 


57 


8562 


G00328 


Hrtmri *5anif»n<i 


Human secreted nrotein SFO TD NO* 

1. ACA111C111 ovUvtCU J/1V/1CU1, ODy 1A-^ A"*V/. 

4409. 


140 


OJ 


8563 


W34499 


Homo sapiens 


Obesity receptor C protein. 


188 


92 


8564 


Y86248 


Homo cariipn 4 ?. 


Human secreted nrotein HPHPF68 

1 1 LAI 11 Cl 11 Dtvl blVyVi yiXJlXiUl IXKsXXX JT VOj 

SEQIDNO:163. 


124 




8565 


L07946 


Volvov carteri 


Histone HI -I 

HID IV/ll v 11 1 JL 


108 


38 


8566 


Y36366 


Hnmo ^aniens 


Fraomenr of human ^er refer! nrotein 
encoded by gene 3. 


132 


77 


8567 


Ml 3941 


Human 
herpesvirus 4 


nuclear anticen 1 

UUvlvCU CUlvl^*wll I 


100 


43 


8568 


Y36156 


Homo sapiens 


Human secreted protein #28. 


136 


67 


8569 


X70343 


Nicotiana 
sylvestris 


extensin 


92 


34 


8570 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


381 


87 


8573 


V00672 


Pan troglodytes 


reading frame protein 4 


146 


93 


8572 


D38116 


Pan paniscus 


ATPase subunit 6 


116 


96 


8573 


W50193 


Homo saniens 


Amino acid seouence of salivarv 
protein CON-2. 


132 


52 


8574 


Y86248 


Homo samens 


Human secreted nrotein HCHPF68 
SEQIDNO:163. 


217 


100 


8575 


M58009 


Pan troglodytes 


cytochrome c oxidase subunit II 


271 


84 


8576 


D38112 


Homo sapiens 


ATPase subunit 6 


117 


96 


8577 


D38112 


Homo sapiens 


ATPase subunit 6 


180 


93 


8578 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


179 


91 


8579 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


189 


88 



359 



WO 01/64835 
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SEQDD 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


8580 


M15530 


Homo sar»if*n<: 


D-ceii gruwui idcior 




71 


858] 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


127 


77 


8582 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-b2 finger 

nrAfpin nisi St 

prvicui ilia, l a. 


109 


47 


8583 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


131 


71 


8584 


Y91577 


Homo ^AniAnc 


jn.uiu.aii scxjrcitxj protein sequence 
cniuucu uy gene z acv^ UL/ invJ.zju. 


z41 


97 


8585 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


185 


80 


8586 


M28016 


Homo sapiens 


cytochrome b 


129 


92 


8587 


M58009 


Pnn frriorlrwHvfiaQ 
r oil u vgiv/iiyica 


cytoenrorne c oxiuase suounit n 


1 C2 


96 


8588 


Y86248 




tiuman secreted proxein nLnrfoo, 

O-CA^ XLJ ri\J. IOj. 


I/O 


90 


8589 


Y08769 


norveO'ipiiQ 


uiiuruvaSLuiar cnaomeiiai 
uiiiercnLiauuu gene z 


1 AO 

iuy 


51 


8590 


L00016 


Homo sapiens 


urf4 


176 


94 


8591 


AF1501O5 


iTomrt enni^ne 
j LKji i iKj stxyvsim 


smaij zmc linger- liKe protein 


JZJ 


100 


8592 


U15306 


I iajhiu oapicilo 


VJTTV1 


123 


95 


8594 


R44519 


TTonriA canipnc 
AMXJlllKJ StajJlCllo 


A/flili* hitman rrl/*»ktr% \/1 Oi^C 

/Louii n urn an aipna-gjoom v i u/o. 


zzz 


TO 

78 


8595 


Y03000 


VTomrt canipn e 
xivsijiu aapiciib 


r ragmeni 01 numan secretea protein 
encoded by gene 121. 


lzo 


70 


8596 


S75895 


Hr>mn cnnipnc 


iN/vL/ti aenyarogonase suounit z, injjz 


1 < 1 


90 


8597 


X67688 


Homo sapiens 


transketolase 


123 


84 


8598 


L00016 


riomo sapiens 


un < + 


1 OA 

130 


85 


8599 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


109 


54 


8600 


Y86248 


numo Sapiens 


tinman secretea protein nUnrroo, 
SEQIDNO.163. 


159 


89 


8601 


G02872 


numu sapiens 


numan secretea protein, oc\i id mui 

07JJ. 


151 


54 


8602 


R96800 


Homo sapiens 


Human histiocyte-secreted fector HSF. 


115 


43 


8603 




\Jiy£A Saliva 


sirnuar to kjjn vr-nz linger protein 
ivn/via ^/iru/oDoj ^ 


1 OA 

120 


95 


8604 


S79980 


Bos taurus 


ribosomal protein L37 


109 


84 


8605 


G03807 

vjyjov / 


numu sapiens 


numan secretea protein, oJby UJ INL/i 
7888. 


127 


48 


8606 




numu sapiens 


a i rase suDunii o 


202 


100 


8607 


Y86248 


numu sapiens 


numan secretea protein nLrlrroo, 


1 Crt 

150 


42 


8608 


G00333 


Homo oanipnc 


FTiiman Qprr^tf»H nmfpin QPH TT~S XTD* 
ALUiiiau dcwcicu piuicill, ocy XXJ iNv/. 

4414. 


lz / 


JO 


8609 


AF229067 


Homo ^finipn^ 




1 Iz 


to 


8610 


AF130089 


Homo sapiens 


PRO2550 


125 


86 


8611 


J03071 


Homo ^anien<: 


uxiui luiizc buniaiuiiicUiuJiu trupixi 


*7i n 


yi 


8612 


AL390114 


Leishmania 

maior 

ILiOJ VI 


extremely cysteine/valine rich protein 


104 


41 


8613 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68 9 


112 


84 


8614 


AP000616 


\Jij tjsx saliva 


similar to KjjNvj-riz linger protein 

JVrXrV 1 a ^/\JT U / 0003 ) 


1 1 £ 
1 10 


70 


8615 


AP000616 . 


Oryza sativa 


similar to RING-H2 finger protein 
ivn/\ia ^Aru / oOoj^ 


123 


91 


8616 


M28016 


Homo sapiens 


cytochrome b 


148 


93 


8617 


AF1 18082 


Homo sapiens 


PRO1902 


94 


66 


8618 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


182 


82 


8619 


X02585 


Xenopus laevis 


unidentified open reading frame 1 (166 


170 


72 



360 



WO 01/64835 



PCT/US01/04927 



SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








aaj 






8620 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


125 


57 






j\sd.u luupsis 
iiumcuia 


putative cxienstn 


14U 




8622 


Y8674R 


t-Tr\TYl f\ Cdn ion c 


Unmon ct>r~re*tf*A r»rrvf*»in \-tf^ VTPT? A Si 

nuiudii sccreicu proxcm rioxxr roo, 
SEQ ID NO: 163. 


1 


70 


8621 




ITUillU Sapiens 




1 jU 


78 
/o 


8624 


I 00016 


Xlv/luU Sapi CD a 


nrfA 


^SOlr 




862 S 


AT UVUJ 1 


WVyiti/\ csmiAnc 
rxv/iiiu i»auicub 


rtSSJ\Joyo 


1 Al 

101 


7£ 
/O 


8626 




jiumu bapicns 


U„ m « n cjwtMpH nrrvha>in Qpr^ IT) "M/V 

numan secreieu proxein, ocvj uj l\kj. 

1 JOJ. 


iZj 


AO 


8627 


L00016 


Homo sapiens 


urf4 


201 


95 


RA7R 




xionio sapiens 


rsAUri aenyurogenase suounii 


icy 


5*7 


8A7Q 


xvr i l 70J i 


nurau sapiens 


PP01799 

r ts\J i 1 zz 


171 

1 / 1 


Oj 


8630 


X55654 


Homo sapiens 


cytochrome C oxidase II subunit 


120 


92 


OOj 1 


APAAftAI A 
ArvUUOlO 


Oryza saliva 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


lUo 


JO 






Homo sapiens 


cytochrome C oxidase II subunh 


IV/ 




8633 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO. 
l\lA. 


132 


86 


OOJ*f 


A T7 94007 


norvegicus 


huntingtin 


/J 


jy 


oOjj 


VSlA7/lft 


Homo sapiens 


Human secreted protein rlCrirroo, 
SEQ ID NO: 163. 


1 <^ 


OA 

oO 


OOjO 


iv/f ioaai 
lYllyuoi 


Ateles geoffroyi 


delta-glob in 




/i A 


RA17 


VJvjoUU 


Homo sapiens 


Human secreted protein, abvj ID nu: 
7881. 




65 






Homo sapiens 


tinman secreted protein, oriv£ lvwu: 


1 A/1 


CA 


8639 


AF116715 


Homo sapiens 


PR02829 


158 


76 


QA4A 
oOfUr 




Homo sapiens 


Human secreted protein, ofcv^ ID NU: 
7344. 




62 






Mus musculus 


Ser/Thr kinase 




AC 

95 [ 


8642 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
oyUo. 


227 


79 


8643 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


116 


95 


8644 


AP000616 * 


Oryza sativa 


similar to RING-H2 finger protein 

DUAlo t Al?A7QA£1\ 


134 


88 


8645 


V463ll_cdl 


Homo sapiens 


12-JAN-1998 Human secreted protein 
cluna iroro cione uidj. 


197 


100 


8646 


VI 7^70 


nomo sapiens 


in AUH.uDiquinone oxiaoreQuciase o 1- 
kD subunit 


7AA 


07 
y 1 


8647 


y^a^aa 


Homo sapiens 


rragmenc or numan secrete a protein 
encoded by gene 3. 


1/17 


AS? 

Oo 


8648 


/A. J jOO JZ.U 


v^ouis lanuuans 


isiDOSomai protein 




1 AA 


8649 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

717A 


136 


56 


8650 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


80 


XAS1 

OOJ 1 


O"70A 1 A 


rvius muscuius 


nuclear localization signal binding 
protein 


GO 

yy 


JO 


8652 


G00492 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4573. 


136 


64 


8653 


AF229067 


Homo sapiens 


PADI-H protein 


134 


69 


8654 


AF090942 


Homo sapiens 


PRO0657 


103 


42 


8655 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0.163. 


184 


84 



361 



WO 01/64835 



PC17US01/04927 



SEQID 

Na 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


8656 




nonio Sapiens 


MKbi I aomoiogue nMrel l 


238 


97 


86V7 


AT 160^71 


major 


probable (nnv-o) ul 102, variant a 
DNA, complete virion genome 


1 13 


60 


8658 


G00269 


numu Sapiens 


riurnjan secreted protein, ofcivj ID NU: 
4343. 


124 


83 


8659 


AF161TS6 


fiomo sapiens 




149 


64 


8660 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OOl J. 


121 


44 


86 6 1 

OUU 1 




— : 

Homo sapiens 


Human secreted protein encoded by 
gene «w cione n I uaajzi* 


107 


37 


8662 


U19098 


Lycopersicon 
chilense 


proline-rich protein 


72 


48 


8663 


AF1 18082 


Homo sapiens 


PRO1902 


95 


75 


8664 




Homo sapiens 




307 


79 


OOOJ 


AT OQA1 1/1 


Leishmania 
major 


extremely cysteine/valine rich protein 


115 


40 


OvJOD 


AjjOoj 


Lycopersicon 
esculentum 


extensin (class I) 


102 


32 


8667 

OuD f 


a If on 
J\j\SJJ.0o** 1 


Homo sapiens 


unnamed protein product 


276 


100 


Z66S 


AL1 09963 


Homo sapiens 


dJl 1 88 J2 1 . 1 (FSH primary response 
(LRPR1, rat) homolog 1) 


597 


100 


8669 


Y01181 


Homo sapiens 


Polypeptide fragment encoded by gene 
12. 


123 


58 


8670 


W48352 


Homo sapiens 


Human breast cancer related protein 
BCrLIl. 


103 


65 


8671 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


121 


61 


8672 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


82 


8673 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

f»Y7/™* Tr> VTA 1 

SEQ ID NO: 163. 


211 


93 


8674 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


126 


61 


8675 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


114 


66 


8676 


X55691 


Lycopersicon 
esculentum 


grycine-rich protein 


85 


38 


8677 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SbQ ID NO: 163. 


188 


80 


8678 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


183 


75 


8670 




— . 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


71 


8680 


i uo /oy 


Kanus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


96 


82 


8681 

OUO 1 


?r^ft76i 

UOvl / □! 


riomo sapiens 


i vj/o aiiernaie open reading frame 


136 


o o 

88 


8682 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
orJQ UL> NU. lOJ. 


137 


93 


8683 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 


107 


45 


86X4 

OwO*t 


X oOZ*fo 


Homo sapiens 


Human secreted protein HOHrroo, 
SEQ ID NO: 163. 


141 


90 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
/J44. 


127 


55 


8686 


W64486 


Homo sapiens 


Human DR3 n rote in 


no 


mo 

1 uv 


8687 


M18093 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


134 


33 


8688 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


119 


35 


8689 


U36448 


Homo sapiens 


Ca2+-dependent activator protein for 


107 


91 



362 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 








secretion 






8690 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


43 




I jOjOD 


xionjo sapiens 


Fragment of human secreted protein 
encouea uy gene d. 


147 


74 


8692 


APOOOfilfi 
firuvwiu 


KJkjija. Saliva 


similar to tviiN vj-riz nnger protein 

INXLtV la ^ATU / oOOj J 


I/O 


AC 

95 


8693 


AF 1 30089 


Homo sapiens 


PRO2550 


126 


35 


OU/T 




jTaii uogjoayics 


cyiocorome c oxidase suounic u 




nn 
97 


8695 


Ml 1902 


Mus rouscuius 


proline-rich salivary protein 


98 


35 






Homo sapiens 


Obesity receptor C protein. 


263 


83 


8697 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SbQ ID NO: 163. t 


181 


73 


8698 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


125 


48 


8699 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


129 


88 


8700 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


111 


51 


8701 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

npA IT"* VIA 1 

SEQ ID NO: 163. 


165 


73 


o /UZ 


I 30300 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


133 


62 


O /Uj 


i 50Z45 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


92 




rv*21 lO 
DJol iZ 


Homo sapiens 


A 1 rase subunit 6 


117 


96 


o/Uj 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


97 


58 


o/uo 


nm7i4 

vjuj / 1*1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

770C 


115 


79 


8707 


AFi 18086 


Homo sapiens 


PR01992 


167 


81 


o /Ho 




Homo sapiens 


Human secreted protem HCHPF68, 
MiQ IL) NO:lo3. 


114 


51 


ft70Q 


ij 1 V7UO 


— 

Mus musculus 


Gcapl gene product 


179 


38 


8710, 


*j / 2fH 1 U 


Mus musculus 


nuclear localization signal binding 
protein 


135 


69 


£71 1 
o/ll 


API 


Homo sapiens 


nn /-y> C c A 

rKUZjDi) 


399 


78 


8712 


X55654 


Homo sapiens 


cytochrome C oxidase II subunit 


107 


95 


O / ID 


A 171 1 ftflfcl 


Homo sapiens 


PRO1902 


123 


58 


R71a 

O / 1*T 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


173 


81 


R71 ^ 




Homo sapiens 


2 1 -NOV- 1996 Human tumour antigen 
protein (clone 3) encoding cDNA, 


258 


83 


o / i v> 


AFI 'JflnRQ 
/vr outfox 


xiomo sapiens 


do m < cn 
rK.U255U 


i An 

143 


C 1 

51 


8717 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


92 


52 


8718 

Ol JO 




xiorno sapiens 


Human secreted protein encoded by 
gene No. 107. 


147 


71 


8719 




xiomo sapiens 


uu-ncn sequence UNA-omamg xactor 
candidate 


115 


70 


O /Z.V 




Homo sapiens 


FKUU4oi 


145 


87 


8721 


X92485 


Plasmodium 
vivax 


pval 


125 


79 


8722 


U72543 


Sus scrota 


ubiquitin-like/S30 ribosomal fusion 

jpi Ulvlil 


201 


97 


8723 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


223 


85 


8724 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


121 


60 



363 



WO 01/64835 



PCT/US01/04927 



SEQLD 
NO: 


Accession No. 


Species 


Description 


Smith- 
tt mci i nun 
Score 


% 

Ideonr 

y 1 


8725 


AF130051 


Homo sapiens 


PRO0898 


102 


72 


8726 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


185 


78 


8727 


R24750 


Homo sapiens 


hGH variant #38 - 174S 176Y 10F 14S 
18F21L 167S 171D 175T 179T. 


116 


82 


8728 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


92 


72 1 


8729 


X53375 


Helianthus 
animus 


anther-specific protein SF18 


114 


39 


8730 


S74221 


Homo sapiens 


IK factor=cytokine down-regulating 
HLA class II 


159 


42 


8731 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


166 


89 


8732 


AB027508 


Homo sapiens 


hVPSll 


194 


50 


8733 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
KD protein (nuclear ma-binding 
protein) 


144 


93 


8734 


AF266479 


Homo sapiens 


rectachrome 1 


116 


74 


8735 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


125 


60 


8736 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


152 


65 


8737 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


79 


56 


8738 


U93565 


Homo sapiens 


putative pi 50 


190 


52 


8739 


AF026689 


Homo sapiens 


prostate-specific transglutaminase 


117 


57 


8740 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


109 


70 


8741 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


165 


61 


8742 


M58009 


Pan troglodytes 


cytochrome c oxidase subunit II 


213 


97 


8743 


M17887 


Homo sapiens 


acidic ribosomal phosphoprotein (P2) 


106 


74 


8744 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 linger 
protein rhal a. 


116 


49 


8745 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


134 


68 


8746 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


135 


71 


8747 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


94 


40 


8748 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0.163. 


173 


76 


8749 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


166 


67 


8750 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


272 


57 . 


8751 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


131 


71 


8752 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


123 


37 


8753 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


180 


55 


8754 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


105 


44 


8755 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


103 


78 


8756 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


284 


87 


8757 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


190 


89 



364 



WO 01/64835 



PCT/US01/04927 



NO: 


Accession No. 


Species 


Description 


Smitb- 

Waterman 

Score 


% 

Identit 

y 


8758 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


103 


40 


8759 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


120 


61 


8760 


AF132984 


Homo sapiens 


nuclear pore complex interacting 
protein NPIP 


196 


94 


8761 


AF130079 


Homo sapiens 


PR02852 


125 


65 


8762 


AF155232 


Pisum sativum 


extensin 


177 


42 


8763 


AP000616 


Oryzasativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


128 


95 


8764 


X75438 


Sus scrofa 


homologue of proiine/argiriuie rich 
antibacterial peptides 


96 


38 


8765 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


157 


96 


8766 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


137 


75 


8767 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQDDNO:163. 


207 


95 


8768 


AL161543 


Arabidopsis 
thaliana 


extensin like protein 


100 


38 


8769 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


110 • 


47 


8770 


D38112 


Homo sapiens 


NADH dehydrogenase subunh4L 


178 


78 


8771 


D83407 


Homo sapiens 


a thyroid hormone responsive gene in 
human skin fibroblasts 


133 


100 


8772 


V00662 


Homo sapiens 


URF A6L (NADH dehydrogenase 
subunit) 


172 


73 


8773 


W03642 


Homo sapiens 


Human cannabinoid GPR N-tenninal 
sequence. 


107 


66 


8774 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


117 


100 


8775 


AF090931 


Homo sapiens 


PRO0483 


175 


88 


8776 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


125 


58 


8777 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


126 


73 


8778 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


154 


70 


8779 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


149 


50 


8780 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


170 


91 


8781 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


103 


65 


8782 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


184 


70 


8783 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


169 


97 


8784 


U55376 


Caenorhabditis 
elegans 


F16H1 1.2 gene product 


141 


90 


8785 


U55376 


Caenorhabditis 
elegans 


F 1 6H 1 1 2. gene product 


150 


100 


8786 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


127 


96 


8787 


AF266479 


Homo sapiens 


rectachrome 1 


148 


61 


8788 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIR:A41726) 


106 


95 


8789 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


162 | 


96 


8790 


AC006693 


Caenorhabditis 


Hypothetical protein W02H5.e 


180 


94 



365 



WO 01/64835 



PCT/US01/04927 



SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






eiegans 








8791 


U28971 


Caenorhabditis 
eiegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
pro icin ) 


159 


80 


£70? 




T-Tom r\ ennipne 
nouiu bopiCIlS 


Ctviatl ti nr> ftnftAr film *it-/*vf'«^t»"» 

smau zinc linger- nice protein 






o / y j 


at 390935 




pxuuoujc piUUl^J 


OJ 


^1 
^>1 


8794 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


115 


55 


8795 


AF1 18082 


Homo sapiens 


PRO1902 


123 


75 


o fyo 


UjoI 1Z 


Homo sapiens 


ATPase subunit 6 






8797 


AFl 18086 


Homo sapiens 


PR01992 


156 


81 


07/10 

879© 


Y5J037 


Homo sapiens 


Human secreted protein clone 
dx219_J3 protein sequence SEQ ID 

XT f\*Qf\ 


196 


100 


8799 


X92485 


Plasmodium 
vivax 


pval 


132 


63 


00UU 


OU25J2 


Homo sapiens 


Human secreted protein, Sbvi NU: 
6613. 


170 


65 


1?oa 1 




Homo sapiens 


Human secreted protem, SbQ ID NO: 
4478. 


123 


CO 

58 


00UZ 


A 17 1 1 £AAA 

Ar 1 1 t>y i)y 


Homo sapiens 


unknown 


116 


£-** 

63 


OQA1 

00UJ 




Plasmodium 
vivax 


____ 1 
pval 


127 


OO 

82 


8804 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


67 


8805 


Z95114 


Homo sapiens 


bK2l2A2.2 (apolipoprotein L, 2) 


294 


100 


00UO 


Ar 1433oy 


Mus musculus 


msgl -related protein 2 


348 


83 


8807 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4551. 


110 


76 


QQ AO 

00U0 


A DAAfl£ 1 


Oryza sativa 


similar to R1MG-H2 ringer protein 
RHAIa(AF078683) 


129 


91 


QOAQ 

oouy 


Ar /y 


Caenorhabditis 
eiegans 


i> W13-like protein; rbA-1 


126 


37 


8810 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


168 


82 


001 1 


IO024o 


Homo sapiens 


Human secreted protein xiCrirroo, 
SEQIDNO:163. 


122 


54 




DJollZ 


Homo sapiens 


A 1 rase subunit 0 


OAT 

207 . 


1 /\ A 

100 




wyyj /o 


Homo sapiens 


riuman tiDrouiast growtn tactor z JA 
kjj isoiorm w-xerminus. 


yi 


A 1 

41 


00 it 




nomo sapiens 


riuman secreica proiein, ocy iu in w. 
7124. 




D / 


OOlJ 




cos taurus 


i / ,uvu Qaiion myubiu ngni cnaui 


1 JO 




8816 


M58459 


Homo sapiens 


ribosomal protein S4Y boform 


260 


100 


Xfc17 




riomo sapiens 


jN/vL/jn. Qcnyarogenasc suduiui i 


zjO 


on 


OO iO 




jviycop iasma 
pulmonis 


VSaiyl 


1JZ 


Q 1 




nn^ 1 77 


riomo sapiens 


TTiifYian cam- Af*»rl rwrrtt^in QPO TFk MH' 

riuman secreiea protein, oci^ iu inu. 
7253. 


i*»u 




QOOA 


t nnni£ 

LUUU 1 0 


riomo sapiens 


unH 




on 


OOZ1 


AjjOoj 


j^ycopersicon 
esculentum 


exiensin ^ciass ij 


in 


4J 


ooZz 


AT KfMOl 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


O/t 


43 


ft J?") "2 

OOZ_> 


XoOZoD 


Mus musculus 


Hul UKr 


V4 




8824 




A rti V»aiic 
x\l UUClLb 

jamaicensis . 


r i ni rase suounii o 


07 


on 


8825 


M18093 


Phaseolus 
vulgaris 


hydroxyproline-rich glycoprotein 


123 


36 


8826 


AFl 19855 


Homo sapiens 


PRO 1847 


113 


52 



366 
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SEQ ID 


Accession No. 


Species 


Description 


Smiths 


% 


NO; 








Wo f~p-wm a n 

tt itic ruian 












Score 


y 


8827 


M28016 


Homo sapiens 


cytochrome b 


140 


93 


8828 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


142 


56 








SEQ ID NO: 163. 






8829 


AK024455 


Homo sapiens 


FLJ00047 protein 


145 


59 


8830 


D38112 


Homo sapiens 


ATPasesubunit6 


194 


95 


8831 


D38112 


Homo sapiens 


ATPasesubunit6 


184 


90 


8832 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


134 


60 








RHAla(AF078683) 






8833 


AF1 30079 


Homo sapiens 


PR02852 


115 


88 


8834 


AB003784 


Drosophila 


Histone H3 


135 


100 






simulans 








8835 


Y91577 


Homo sapiens 


Human secreted protein sequence 


173 


82 








encoded by gene 2 SEQ ID NO:250. 






8836 


X70343 


Nicotiana 


extensin 


112 


37 






sylvestris 








8837 


L17318 


Rattus 


proline-rich proteoglycan 


142 


38 






norvegicus 








8838 


Y36366 


Homo sapiens 


Fragment of human secreted protein 


148 


74 








encoded by gene 3. 






8839 


W34625 


Homo sapiens 


Human C3 protein mutant FT-3. 


277 


100 


8840 


AF1 16638 


Homo sapiens 


PR01546 


78 


41 


8841 


W48351 


Homo sapiens 


Human breast cancer related protein 


138 


61 








BCRB2. 






8842 


AF 130087 


Homo sapiens 


PR0241I 


131 


67 


8843 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


72 








5327. 






8844 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


60 








4433. 






8845 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


66 








7795. 






8846 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


68 








7795. 






8847 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


146 


70 








gene 22 clone HMSJW18. 






8848 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


131 


68 








SEQ ED NO: 163. 






8849 


Y91577 


Homo sapiens 


Human secreted protein sequence 


273 


79 








encoded by gene 2 SEQ ID NO:250. 






8850 


K02I20 


Bovine leukemia 


gag 


201 


54 






virus 








8851 


AF229067 


Homo sapiens 


PADI-H protein 


162 


57 


8852 


AF090931 


Homo sapiens 


PRO0483 


125 


82 


8853 


R14584 


Homo sapiens 


TGF beta 1 binding protein encoded by 


189 


100 








clone BPA 13. 






8854 


U52197 


Mus musculus 


poly(A) polymerase III 


183 


85 


8855 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


193 


72 








6908. 






8856 


G00549 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


66 








4630. 






8857 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


93 


64 








encoded by gene 17. 






8858 


Q55625_cdl 


Homo sapiens 


22-JUN-1992 Human beta globin 5 f - 


412 


98 








UTR-CDS-3*-UTR. 






8859 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


152 


67 


• 






gene 22 clone HMSJW18. 






8860 


G02828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


72 








6909. 






8861 


AF130079 


Homo sapiens 


PR02852 


160 


64 


8862 


G00407 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


101 


61 



367 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 




Waterman 


Identit 










Score 


y 








A A OO 

4488. 






OO/CO 


AvH / V / 


— : 

Homo sapiens 




Z10 


1 Art 

100 


oo04 




Homo sapiens 


Human secreted protein, ocA} ID NU. 


lOO 

12/ 










44U9- 






ooOj 


OftftftQ? 


— : 

Homo sapiens 


Human secrete a protein, ojc\£ 11^ inv^. 


01ft 
Z IU 


i oft 

1UU 








/t 1 oo 
41 /J. 






ooow 


AT 1^77^1 


— : 

Homo sapiens 


nypotneucai protein 


JZ / 




ooo / 


A FAQ ft 047 


Momo sapiens 


rKJUUo J / 


1 ft^ 
1UD 


Oft 


00O0 




Homo sapiens 


unnamed protein product 




OU 


88o9 


VAOOC/C 


Homo sapiens 


Fragment of human secreted protein 


1 A 1 

101 


65 








encoucu oy gene 7u. 






007A 


A CO 1 iCTC 1 


- — . 
Homo sapiens 


CDA14 


282 


96 


OO / 1 


AT 10 0 O/CO 

Al, 133262 


Homo sapiens 


aJ319D22.1 (CDC5-liKe protein) 


1 CO 

158 


AA 
90 


0872 


V32779_cal 


Homo sapiens 


1 1 -MAR- 1 997 Novel human FO ATP 


267 


81 






synthase subunit encoding DNA 












sequence. 






8873 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


118 


58 








„„ „ VT_ 1 AT 

gene No. 107. 






oo/4 


VI OO 

Y 14482 


Homo sapiens 


Fragment of human secreted protein 


OA 

89 


43 








encoded by gene 17. 






oo / 3 


fl AO con 


Homo sapiens 


Human secreted protein, ofcQ ID NU: 


154 


63 








OO 13. 






ooo/: 
oo /O 


Ai^nni coo 
AJsa/u lo / J 


— - — : 

Homo sapiens 


unnamed protein product 


ooo 
3 /3 


1 A A 

100 


oo / / 


Y^OIOQ 


Homo sapiens 


L7a protein 


OCA 

259 


oo 

82 


oo/o 


A52138 


Homo sapiens 


L7a protein 


273 


87 


8879 


G0O328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


147 


57 








4409. 






OOOA 

8880 


VT1 if jI OO 

Y 14482 


Homo sapiens 


Fragment of human secreted protein 


128 


SI 








encoded by gene 17. 






ooo 1 


003714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


159 


71 








/ Jyj. 






QQQO 

oooz 


/"•AOOiCO 

U03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


67 








7344. 






QQQ7 

ooo3 


Ar 130089 


Homo sapiens 


PRO2550 


134 


79 


8884 


U36898 


Rattus 


pheromone receptor VN6 


143 


53 






norvegicus 








8885 


C»*7A >t 1 A 

S79410 


Mas musculus 


nuclear localization signal binding 


143 


64 








protein 






oooo 


V0 1 coo 
Y915 // 


Homo sapiens 


Human secreted protein sequence 


lOO 

138 


no 

92 








encoaea by gene 2 SEQ ID NO:250. 






ooo/ 


u03 790 




Homo sapiens 


Human secreted protein, bbQ ID MU: 


124 


IO 

72 








7871. 






OCCQ 

oooo 


K48290 


Homo sapiens 


Human fOr-z/N 1-3. 


1 1 o 

118 


66 


8889 


(j030o2 


Homo sapiens 


Human secreted protein, iSJbQ LD NO: 


O 1 

81 


77 








O1/10 
/14j. 






RftQA 
007U 


r; ao coo 


riomo sapiens 


Human secreiea proiein, ocy iu inu. 


10ft 










//in 
ooiy. 






ooVl 


rjAOOl /I 
UUj/ 14 


— — ; 

Homo sapiens 


Human secreted protein, SEQ ED NO: 


io 1 
121 


oo 








oonc 






oo92 


v<ooo< 
X53375 


TT-,1 1 „. a\- - . — 

Heliantnus 


_a.i___ ^ ^_ • r 1 * i. # . on o 

anther-specinc protein br 1 8 


1 A A 

109 


64 






annuus 








£800 


ArOOOolO 


Oryza sativa 


similar to KliN(j-Hz linger protein 


no 
119 


j4 








T>T_IA !.» /A17^00^COO^ 

KHAla (Ar\) /50o3) 






ooy*f 


nnocoo 
VJVZ »Jo 


riomo sapiens 


Human secreted protein, SEQ ED NO: 




Oo 








6619 






8895 


AF090894 


Homo sapiens 


PRO0113 


114 


60 


8896 


AF161356 


Homo sapiens 


HSPC093 


148 


51 


8897 


AF1 16638 


Homo sapiens 


PR01546 


91 


45 


8898 


U22231 


Felis catus 


ribosomal protein S3a 


201 


100 



368 



WO 01/64835 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


8899 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


114 


75 


8900 


AF090894 


Homo sapiens 


PRO0U3 


114 


61 


8901 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


63 


8902 


AP000616 


Oryza sativa 


similar to RENG-H2 finger protein 
RHAla(AF078683) 


126 


95 


8903 


R90288 


Homo sapiens 


Modified pigment epithelium-derived 
factor (rPEDF). 


258 


88 


8904 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


155 


86 


8905 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


118 


66 


8906 


X70343 


Nicotiana 
sylvestris 


extensin 


122 


37 


8907 


G02386 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6467. 


162 


70 


8908 


AFI30089 


Homo sapiens 


PRO2550 


125 


85 


8909 


AF220264 


Homo sapiens 


MOST-1 


113 


47 


8910 


Y25769 


Homo sapiens 


Human secreted protein encoded from 
gene 59. 


119 


77 


8911 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


77 


52 


8912 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO250. 


162 


83 


8913 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


104 


77 


8914 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


138 


64 


8915 


AF090944 


Homo sapiens 


PRO0663 


73 


58 ! 


8916 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


151 


76 


8917 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


84 


47 


8918 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


140 


64 


8919 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


254 


82 


8920 


AJ245416 


Homo sapiens 


G7b protein 


360 


86 


8921 


D90228 


Homo sapiens 


mitochondrial acetoacetyl-CoA thiolase 
precursor 


190 


100 


8922 


AF229067 


Homo sapiens 


PADI-H protein 


157 


65 


8923 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


132 


65 


8924 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


62 


68 


8925 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


116 


63 


8926 


G03714 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7795. 


140 


65 


8927 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


59 


8928 


AF287892 


Homo sapiens 


sialic acid binding immunoglobulin- 
like lectin 8 long splice variant 


994 


76 


8929 


AF030131 


Mus musculus 


Plenty of SH3s; POSH 


181 


77 


8930 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 

* 


814 


98 j 


8931 


U13866 


unidentified 
cloning vector 


non-functional lacZ alpha peptide 


112 


59 



369 



WO 01/64835 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Idea tit 

y 


8932 


U67540 


Methanococcus 
jannaschii 


conserved hypothetical protein 


98 


56 


8933 


274472 


Caenorhabditis 
elegans 


predicted using Genefinder-contains 
similarity to Pfam domain: PF01391 
(Collagen triple helix repeat (20 
copies)), Score=84.3, E-value=8.1e-22, 
N=2; PF01484 (Nematode cuticle 
collagen N-tenninal domain), 
Score=342, E-value=9.9e-07, 
•N=l~cDNAESTykl00dl0.3 comes 
from this gene; cDNA EST 
EMBL:D65979 comes from this 
gene-cDNA EST ykl00d4.3 comes 
from this gene; cDNA EST ykl23g7.3 
comes from this gene-cDNA EST 
yk58e6.3 comes from this gene; cDNA 
ESTyk67a5.3 comes from this 
gene-cDNA EST EMBL:T01590 
comes from this gene; cDNA EST 
yk60cl 1.3 comes from this 
gene-cDNA EST yk89cl0.3 comes 
from this gene; cDNA EST yk85g9.3 
comes from this gene~cDNA EST 
yk83e2.5 comes from this gene; cDNA 
EST yk75f23 comes from this 
gene-cDNA EST CEESK90F comes 
from this gene; cDNA EST yk79f5.5 
comes from this gene-cDNA EST 
yk64M2.3 comes from this gene; 
cDNA EST yk64h9.3 comes from this 
gene-cDNA EST yk74dl.3 comes 
from this gene; cDNA EST yk75e43 
comes from this gene-cDNA EST 
yk83e23 comes from this gene; cDNA 
EST yk91cl2.3 comes from this 
gene-cDNA EST yk98E2.3 comes from 
this gene; cDNA EST yk60h4.5 comes 
from this gene-cDNA EST yk60cll.5 
comes from this gene; cDNA EST 
yk64h9.5 comes from this gene-cDNA 
EST yk64a9.5 comes from this gene; 
cDNA EST yk64bl2.5 comes from this 
gene-cDNA EST yk67a5.5 comes from 
this gene; cDNA ESTyk74dl.5 comes 
from this gene-cDNA EST yk75e4.5 
comes from this gene; cDNA EST 
yk75f2.5 comes from this gene-cDNA 
EST yk79e6.5 comes from this gene; 
cDNA EST yk80d2.5 comes from this 
gene-cDNA EST yk85g9.5 comes 
from this gene; cDNA EST yk89cl0.5 
conies from this gene-cDNA EST 
yk89c4.5 comes from this gene; cDNA 
EST yk90el 1 .5 comes from this 
gene-cDNA EST yk9ic!2.5 comes 
from this ffene* cDNA EST vk96e2 5 
comes from this gene~cDN A EST 
yk98c2.5 comes from this gene; cDNA 
EST yk98f23 comes from this 
gene-cDNA EST yk98hl 1.5 comes 


101 


38 



370 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








from mis gene; cDNA EST ykl07e7.5 
comes from this gene-cDNA EST 
yk313g9.3 comes from this gene; 
cDNA EST yk3 13g9.5 comes from this 
gene-cDNA EST yk3 12gl .3 comes 
rrom en is gene, ciyiNA no i ytCjizgi.-? 
comes from this gene-cDNA EST 
y&z.yjc L T*j eomes uom mis gene, 
cDNA ESTyk293c4.5 comes from this 

A0f|0 

gene 






RQ^t4 
oym- 


VR674R 


nomo sapiens 


Human cprrpfpri nrnfpin WC'T-TPF/^R 

XlUiilOll SCOJCLCli piL/LCIll 1 1 V^lXT J DO, 

SEQIDNO:163. 


197 




07JJ 


VRrV>4R 
I 00Z4O 


nomo sapiens 


VTiimQn Qf»rrpfF»H nrnfpin T4PHPF6R 
xi. tu 11 ail seuieieu. piuicixi xiv^nr ruo, 

ODy ijl/ X^ . iuj. 


1A.J 


4R 


RQ16 


Y01741 


Hnmn csintpnc 

numu sapiens 


mvnctn hpaw f nam IM^ A 
uiyusw ncavjr ^lxaixx x^txj 


1343 


99 


R017 
oy^ 1 


M7Q7QS 
ivL£*yz*yj 


JvallUS 

norvegicus 


email nurlpur rihnniiplpnnflrHf*1p- 
ammj jiui«jccu uwiiuvjcwpaiiivic* 

associated protein 


106 


52 

J£t 


RQ1R 

07JO 




nuino sapiens 


SCI UiXX OLUj 1U1U JT WJXXXpvlXCXXl 


1 oo 




R91Q 


W*4499 


nomo sapiens 


r^K^cifv rprpntnr f nrnfpin 

V/OeSlt>jr IdsGpiAJt w LJXVIiCUl. 


j j 


83 


8940 


L00016 


Homo sapiens 


urf4 


293 


83 


RQ41 


I OOZ*ro 


Homo sapiens 


Unman *»A/*»rat-ASl nrAtoin X-ll^t-I w Ax 

numan secreieo pruiein nx^risr do, 
^FO TD "NO* 1 63 


10*f 


04 
yi 


R947 


AR00714R 


nomo sapiens 


rinncnmal nmfptn £4^3 
I ii/UoVJiiiai jyxvjt&xxi ojcl 


153 


73 


8943 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
^FO TD TSTO-163 


213 


95 


8944 

07t*t 


M10119 

XY1 i v XX ^ 


Y-fritTtn canipnc 

XxUXXJU dOpiCIld 


fVrritin Hohf Qiihiinit 

Id UUIl XlgLXL OUUUIUI 


129 


100 


8945 


M74718 


Homo sapiens 


SEF2-1A protein 


118 


84 


RQ4£ 


AT q^<17ft 


TJTnmsv con! on c 

nomo sapiens 


rlTQ47T R 1 f\ fnnvpl PTTR and ^uchi 
rpriftfif^ Hnnnfltn rtrnfpin^ 


366 




8947 


Y56033 


Homo sapiens 


Polyproline-rich domain from dynamin. 


70 


51 


R04R 




nomo sapiens 


giuiaiuione o*uansicrase ^vjo 1 -xi) \t\t\ 

1 1l\J) 


177 


97 

y r 


8949 


AFi 19851 


Homo sapiens 


PR01722 


154 


78 


07JU 




nomo sapiens 


numan secreieu proiein, ojzki lu vsvj. 
IMA 




RR 


8951 


X54802 


Homo sapiens 


cytochrome-c oxidase subunit IV 


116 


95 


RQ^7 


A TA1 1 
nJUl 1J6U 


Odcicnopnagc 

PS34 




■?"?8 


QR 
yo 


RQS1 




nomo sapiens 


n tun oil uciuij) irogineni itj. 


771 


96 


R9S4 




uai/ienopnogc 
lambda 


D ^caps jo i/t>Jijponeni, jo j) 


R07 


99 

yy 


R9^ 

07JJ 


AT 139R41 


v^aenornaDaius 


I 1 JCJA..J 


177 


94 1 


8956 


W34499 


Homo sapiens 


Obesity receptor C protein. 


166 


89 


8957 


X92485 


x icii>in wiimii 

vivax 


pval 


135 


71 


8958 


ABO 10340 


IMiiq rrvncrnliiQ 

1V1UO II IUiTwHIIIiT 




122 


60 


8959 


AL359782 


Trvnannomrifl 

A & J UQUU3U1UA 

bnicei 


nmhahlp similar to rint*-h2 finder 

LIX CrL/CU^ X\* OUXXXlUl U/ 1 11 1C, Ufa Xll 1 

protein rhala. 


103 


51 


8960 


XI 4898 


{"VioptiiliiQ on 

VlitvlUlUo Ol7. 


ORF TAA 1-28M n is 2nd base in 
fifidon^ 


108 


54 


8961 


AF090931 


Homo sapiens 


PRO0483 


119 


82 


896? 


Y07671 


nAJixxvi oapicus 


T-TiiTricin cpr^rpfp/l nrnfpin pnr*nr1pr1 hv 

XXUUIOU oCW-XCtCHJ. pitJICXIX Olll-UUCLl KJJ 

gene 22 clone HMSJW18. 


130 

X JU 


72 


8963 


AFI 70724 


Homo ^anien^ 


cell cycle checkpoint protein CHFR 


1364 


99 


8964 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


151 


43 


8965 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


250 


74 



371 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 

y 


8966 


Y87233 


Homo sapiens 


Human signal peptide containing 
protein HSPP- 1 0 SEO ID NO* 1 0 


445 


100 


8967 


AF 127085 


Mus musculu^ 


vmanhorin cvtoolasmic domain- 
associated protein 3B 


112 


ro 

ou 


8968 


Y86248 


Homo sapiens 


Human secreted Drotein HCHPF68 
SEQIDNOM63. 


109 


46 


8969 


AF090894 


Homo sapiens 


PRO01 13 


153 


61 

Ul 


8970 


Y19730 


Homo sapiens 


SEQ ID NO 448 from W09922243. 


158 


49 


8971 


U38544 


Mus mus cuius 


alnha-1 tvne 1 colla&en 


102 


52 


8972 


X92485 


Plasmodium 
vivax 


pval 


113 


50 


8973 


AF241228 


Xenopus laevis 


Friend of G ATA 


143 


100 


8974 


AF09G93O 




PRO047R 
ri\v\rt / o 


79 


di 

HI 


8975 


AF23R374 


T*Tf>mr> ^aniPTiQ 

k. 1U Lilly ouLMGlld 


muuuu iJUiuuiaol giuwui iai*iAJi 

recentor 1* FGFR1 


1 1 S 


1 

JU 


8976 


G02532 


Homo sanierK 


Human «^creted nmtein SFO ID NO* 
6613. 


99 


tj 


8977 


AB001838 


Homo sapiens 


recoverin 


179 


80 


8978 


AF1 18082 


Homo <^r>ienQ 


PRO 1902 


71 


7ft 


8979 


AF116715 


Homo sapiens 


PR02829 


102 


76 


8980 


AF090942 


Homo sapiens 


PRO0657 


132 


61 


8981 

070J 




nuiiiu bapiens 


UiiTnan corroto/l nrntoin CCA TT~V XT/^V. 

xiuman secreieu proieui, jcy iu r*\j. 

77Q5 


135 


OJ 


8982 


U28971 


Caenorhabditis 

plpoanc 
tiegau;* 


similar to RD tandem repeat region of 

INJL/ UlULCUl II1UU1C<U iUa-UlllUJXlg 

protein) 


108 


84 


8983 


ADO 127 

A.UU 1Z>/ 




IllCldllUllld abMJllalCu ailLIgCU r 7 / 




QA 


8984 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


158 


88 


8985 


Y59778 


Homo sapiens 


Human normal ovarian tissue derived 


71 


61 


8986 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
^fo rn >jo- 1 fa 


128 


42 


8987 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

tf?Q7 


98 


50 


8988 


W50193 


Homo sapiens 


Amino acid sequence of salivary 

Yirnt#»in 

pruicm v»/v/rN~^. 


80 


51 


8989 


Y74075 


Hnmrt canipnc 
JuLUiuu oapiciio 


Am inn i*f*JH cahiipti fvf f"Hf* h 1 1 mc»n 
/vLUllllS avlU oCUUCUvC VI U1C 11 LU Hall 

MMSC1 protein. 




inn 


8990 


G02538 




6619 

Uwl/» 


1JU 


0 1 


8991 


D38U2 


Homo sapiens 


N ADH dehydrogenase subunit 4L 


165 


82 


8992 


AL359782 


TrvrMinft^om n 

M i jf jjaiiv/ovjiiio. 

brucci 


nmhflhlp similar to rino-h? finfrw 
nrotein rhala. 


11 1 
til 




8993 


M28016 


Homo saniens 


cvtochrome b 


130 


100 


8994 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO* 163 


133 


90 


8995 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


298 


89 


8996 


Y16430 


lVfii*; mti^rAiliii 


T*iWft<iomal nrntpin 1 ISn 


101 




8997 


Y45318 


Homo sapiens 


Human secreted protein fragment 

pnwiHftH from ctptip 1 ft 


96 


76 


8998 


AL049659 


Arabidopsis 
thaliana 


putative protein 


126 


42 


8999 


AE003500 


Drosophila 
m elan og aster 


CG12379 gene product 


285 


67 


9000 


R37991 


Homo sapiens 


Sequence of a new cytokine which 
inhibites induction by gammainterferon 
of expression of Class II 
histocompatibility antigens. 


124 


48 



372 



WO 01/64835 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


ViKJZ 






Waterman 


Identit 










Score 


y 


9001 


AF090930 


Homo sapiens 


PRO0478 


193 


89 


9002 


AP000381 


Arabidopsis 


gb|AAF57656.1~geneJ±K17E12.13~s 


193 


29 






thaliana 


imilar to unknown protein 






9003 


M17885 


Homo sapiens 


acidic ribosomal phosphoprotein (P0) 


364 


66 


9004 


D63424 


Homo sapiens 


glycogen synthase kinase 3aJpha 


126 


81 


9005 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 


113 


75 








RHAla (AF078683) 






9006 


AL1 17637 


Homo sapiens 


hypothetical protein 


257 


92 


9007 


AE001381 


Plasmodium 


hypothetical protein 


141 


26 






falciparum 








9008 


W34499 


Homo sapiens 


Obesity receptor C protein. 


152 


86 


9009 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


199 


80 








6908. 






9010 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


202 


94 








7645. 






9011 


AF090931 


Homo sapiens 


PRO0483 


135 


68 


9012 


M11717 


Homo sapiens 


heat shock protein 


696 


100 


9013 


G03789 


Homo sapiens 


Human secreted Drotein SEO ID NO* 


123 


75 








7870. 






9014 


R59842 


Homo sapiens 


ApoE4Ll protease. 


86 


85 


9015 


AL160371 


Leishmania 


probable (hhv^ u 1 1 02, variant a 


96 


65 






major 


DNA, complete virion genome 






9016 


AF229067 


Homo sapiens 


PADI-H protein 


119 


54 


9017 


AF090944 


Homo sapiens 


PRO0663 


138 


54 


9018 


AL160371 


Leishmania 


orohahle fhti nil 09 variant a 

|/iwt«4i/4C ^iui v ~vi J ui l v/z, ? valiant a 


i \j j 








major 


DNA. comnlete virion ?ennme 






9019 


X92485 


Plasmodium 


pval 


101 


73 






vivax 








9020 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


82 


72 








6613. 






9021 


AF073519 


Homo sapiens' 


small EDRK-rich factor 1, long isoform 


116 


66 


9022 


AF229067 


Homo sapiens 


PADI-H protein 


127 


45 


9023 


M33112 


Homo sapiens 


amyloid-beta protein 


105 


67 


9024 


G03789 


Homo sapiens 


Human secreted Drotein SEO ID NO* 


79 


68 








7870. 






9025 


AF161361 


Homo sapiens 


HSPC098 


130 


55 


9026 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


169 


72 








7795. 






9027 


AF130051 


Homo sapiens 


PRO0898 


138 


77 


9028 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


132 


49 








clone HPMBQ32. 






9029 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


58 








4435. 






9030 


AC002310 


Homo sapiens 


Unknown gene product 


155 


85 


9031 


X92485 


Plasmodium 


pval 


112 


42 






vivax 








9032 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


52 








4478. 






9033 


G00487 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


69 








4568. 






9034 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


61 








4414. 






9035 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


111 


66 






major 








9036 


G00397 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


173 


59 








4478. 






9037 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


54 








4478. 






9038 


AF220264 


Homo sapiens 


MOST-1 


96 


80 



373 



WO 01/64835 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y i 


9039 


AF1 30051 


Homo saniens 


PRO0898 


119 


oo 


9040 


AF116715 


Homo sapiens 


PR02829 


121 


63 


9041 


L76200 


Homo <«anien<i 

XXUIX1U OuL/lVUld 


onanvlatf* Winamp 


1 -7 J 


OO 


9042 


G03415 


Homo <tfmien<; 


Human cfv-rpfpH nrofpin ^PO FH NO* 

XXlXiUCUl KUVlbU JJ1 V/UCill, OCV^ 1U li VJ. 

7496. 






9043 


AF152513 


Homo <»ar> ien<i 

XXV/IXX^/ dCtlVlwIXd 


nrotofaHhprin <7atnma A6 chnrf form 
rvrotein 


334 


inn 


9044 


AF 130079 


Homo saniens 


PR02852 


125 




9045 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2 


128 


64 


9046 


AF090942 


Homo saniens 


PRO0657 


XX>v 




9047 


AF284223 


Homo sapiens 


terra-like protein 


228 


100 


7VMO 


firuuuoiu 


V/l y Z*x ixlLlytl 


oJLUiiiai iu rviiN vjr- ll£. linger protein 
RHA1 a ( AF078683^ 


17R 
iZo 


05 


9049 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


132 


92 


9050 


VJV J*T 1 I 


TJ/^mrt canipnc 
IxUliltl oapiCxl5 


Hitman c/vrpf/vl nmfpin QPD TFi MO* 
xxuillau oCUlClCU pruiclll, OCy IU IiVj. 

7492 




7"» 


9051 


AL034344 i 


Homo QAntPriQ 
xxvrmu da^iwixd 


dJl 1 8B1 8 1 (forkhead hoy PI • 


^7^ 




9052 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SRO ID NO* 163 


106 


62 


9053 


AF225910 


Mus musculus 


DA2-associated protein 1 


100 


45 


90 S4 


AF1 1fl0ft7 


xiUiilu oapicxio 




147 


7fl 
fir 


9055 


W80400 


Homo sapiens 


A secreted protein encoded by clone 


858 


97 


7UJU 




IN ICU U<U la 

sylvestris 


CAtCIolIl 


i ii 
1 1 j 


11 


QOV7 


nrUUUOlO 


\JiyZa oaUVa 


sinuiar to r\xiN vj-nz linger protein 
RHAla(AF078683) 




<*7 
J t 




RQ5Q11 


Unmrk cqtiiptic 

flUluO oapicild 


Mmtral ♦fif**=aH nrntpi'n 

incutoi uiicau proiem. 


147 


35 


9059 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 

11 HA 1 a {A Pfr7ft£ftTi 


131 


56 


9060 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A61Q 


131 


50 


9061 


X70343 


Nicotiana 
syivcsuis ' 


extensin 


103 


42 


9062 


AJ271872 


Nicotiana 
syivesins 


extensin 


129 


33 


9063 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

OHy \jJ fNv_/. IOj. 


151 


73 


9064 

7UO*T 




p nttiic 
xvalLUo 

nnrupoinic 

ALU1 Vv^lwiu 


rlpncin- 1 SO variant Pi 
UGIulIi** 1 OV V<H Idlll xJ 


144 


yo 


9065 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


135 


68 


9066 


M28016 


Homo aniens 


cytochrome b 


178 

I/O 


87 


9067 


M28016 . 


Homo sapiens 


cytochrome b 


193 


95 


9068 


Y86248 


Homo v*nipn*i 

XXVJUXV OCl^-' IvllO 


Human secreted nrotein HPHPFfiR 

xxixxitoxx dwuivivu yiKjixjiii nvruruO) 

SEQIDNO:163. 




UU 


9069 


D38U2 


Homo ^ampnc 
1 JLwtuvj oapi^iio 


WAOH rlpHvoVoopna<ip Qiihnnit 4T 

I^lii-L/l X UwllJrUl VgvllCldW OllUUllli *TXv 


1 1 1 
111 


ion 

lvv 


9070 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


72 


9071 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
ciicuucu oy gene i / • 


147 


64 


9072 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


109 


69 


9073 


G04078 


Homo ^aniens 


Human secreted nrotein SEO ID NO* 
8159. 


171 


85 


9074 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


88 


88 


9075 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


137 


62 



374 
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SCQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Water man 
Score 


% 

Identit 
y 








encoded by gene 17. 






9076 


L38941 


Homo sapiens 


ribosomal protein L34 


139 


88 


9077 


G0I984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


164 


86 


9078 


X55684 


Lycopersicon 
esculentum 


extensin (class I) 


72 


35 


9079 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


176 


82 


9080 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


229 


59 


9081 


AF130089 


Homo sapiens 


PRO2550 


118 


71 


9082 


X92485 


Plasmodium 
vivax 


pval 


125 


71 


9083 


AF216650 


Homo sapiens 


MTAP 


154 


73 


9084 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


124 


78 


9085 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


191 


73 


9086 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


138 


61 


9087 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


130 


75 


9088 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


144 


61 


9089 


AK024455 


Homo sapiens 


FU00047 protein 


145 


82 


9090 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


181 


60 


9091 


AF090S95 


Homo sapiens 


PRO0117 


87 


66 


9092 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


211 


86 


9093 


AF130079 


Homo sapiens 


PR02852 


157 


74 


9094 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


180 


82 


9095 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


97 


42 


9096 


X65551 


Homo sapiens 


antigen of the monoclonal antibody Ki- 
67 


106 


65 


9097 


W34499 


Homo sapiens 


Obesity receptor C protein. 


231 


80 


909$ 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


127 


63 


9099 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


129 


78 


9100 


AF229067 


Homo sapiens 


PADI-H protein 


162 


59 


9101 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4754. 


157 


65 


9102 


Y60010 


Homo sapiens 


Human endometrium tumour EST 
encoded protein 70. 


69 


63 


9103 


R95913 


Homo sapiens 


Neural thread protein. 


124 


72 


9104 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.8c} 


104 


68 


9105 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


156 


88 


9106 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
L 6619. 


170 


57 


9107 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


163 


89 


9108 


G02753 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6834. 


123 


41 


9109 


L41944 


Homo sapiens 


interferon receptor 


162 


66 



375 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


9110 


AK000496 


Homo saDiens 




lift 

l JO 


/o 


9111 


G03797 


Homo sapiens 


Human arreted Arotein ^FO TD 1MO* 
7878 


O 1 




9112 


G02538 


Homo sapiens 


Human secreted Drotein SEO ID NO* 
6619. 


142 




9113 


M26361 


Mus musculus 


LCNE/If* H-chain fusion nrotein 


105 


50 


9114 


AF073519 


Homo sapiens 


<5mall KDl^JC-rich factor 1 Iatio i^Afrtrm 


119 


JO 


9115 


AF1 19900 


Homo sapiens 


PR02822 


140 


58 


9116 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478 


158 


60 


9117 


L00016 


Homo saoiens 


urf5 


171 

in 


97 


9118 


AC006276 


Homo sanien^ 


R28379 3 


138 

l JO 


100 


9119 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRR2 


131 


72 


9120 


AF1 19855 


Homo sapiens 


PR01847 


155 


SI 


9121 


AF010144 


Horn n <jani<»nc 


iicui uitdi uucdu piuicm r\U 1 U~1N l r 


i 

IV J 




9122 


Y87103 


Homo sapiens 


Human secreted protein sequence SEQ 

TD NO- 1 4? 


115 


78 


9123 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


53 


9124 


G00397 


Haiti ri <anif»nc 


Human CA/*rAtoH r»TY\tAin QUO 111 "KJO* 

riunidii bccicLcu proieui, ocy il/ ini-j. 
4478. 


11/ J 




9125 


AL390I14 


XvwLolUllaJlla 

major 


pruudoie ^iinv-o^ ui iuZj variant <i 
DNA, complete virion genome 




Al 


9126 


AF 130089 


Wattia oariif>nc 


PT*fY7SSft 
rivuzjju 




7ft 
/o 


9127 


W48351 


Homo sapiens 


Human breast cancer related protein 

WT*R7 


98 


56 


9128 




l-IVimft coniAnr 
JTOIIIU oapiCUS 


irUman secreieo protein, otv^ ilj xnu. 
7795. 


10/ 


7C 
/J 


9129 


G02455 


Rnmn cnnipnc 


nuiiiaii secrcica protein, ocy 11/ in w. 


l j 1 


AO 


9130 


AK024455 


Homo sapiens 


FLJ00047 protein 


130 


78 


9131 


W88415 


VTomn canipnc 
xLULUU dctpiCIlS 


i/iseasc aoSuciaicu protein Kinase 
DAPK-4 




1 AO 


9132 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


144 


59 


9133 


Y94922 




rxumaii bcureieu protein cionc pvo 1 


1 


OA 


9134 


AF3 17425 


Homo sapiens 


GAC-1 


182 


70 


9135 


G04072 


XIUIHO sapiens 


Human Q/vr#»t#»rl rtrnt^m QPO TO \Tf"V 

nunian beurctea protein, ogv^ llj invA 
8153. 


174 




9136 


AF1I6715 


WniTIA QflTYIfMl^ 
i LkJIIIVJ OdXwIlwUd 


PR02829 


1 1^ 


71 


9137 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


122 


65 


9138 


AF130089 


Homo sapiens 


PRO2550 


140 


62 


9139 


AF161221 

xvi, jwi f»f« x 


Horn a ^flnifnQ 


Ifftllilcrpin-lik'P nrnfpin f\ 
ivau iriA wui lixvw jJi w twill v 


704 


00 


9140 


AL359782 


Trypanosoma 

ATUCfii 

1/1 Uvvl 


possible (hhv-6) ul 102, variant a dna, 

PAmnl^tf 1 virion ffAnnmA 

wunzpiwiw VII H/lt gWlll/UlW. 


110 


48 


9141 


U27486 


Pseudorabies 
virus 


EP0 


84 


44 


9142 


Y9I577 


Homo sapiens 


Human secreted protein sequence 


150 


96 


9143 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


220 


80 


9144 


Y86248 


Haiti a czinipnc 
Jl 1U1UU cxipicub 


Human cpr*r*»t^H nrAf**m T-Tf^T-rPP^Sf 

nunidii ocwreicu proieuj riwrurroo, 
SEQ ID NO: 163. 


10 J 


OA 


9145 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQEDNO:163. 


180 


77 


9146 


U52077 


Homo sapiens 


mariner transposase 


365 


78 


9147 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


143 


67 



376 



WO 01/64835 



PCTAJS01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








clone HPMBQ32. 






9148 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


100 


81 


9149 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


118 


66 


9150 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


105 


54 


9151 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) u 1 102, variant a dna, 
complete virion genome. 


101 


70 


9152 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


163 


66 


9153 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


160 


72 


9154 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


106 


63 


9155 


Y02926 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 101. 


95 


80 


9156 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


133 


67 


9157 


G03924 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8005. 


265 


80 


9158 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


131 


72 


9159 


AF130089 


Homo sapiens 


PRO2550 


180 


61 


9160 


AF1 16715 


Homo sapiens 


PR02829 


184 


82 


9161 


K01664 


Drosophiia 
melanogaster 


Bkm-like protein 


200 


69 


9162 


D88146 


Homo sapiens 


UDP-galactose transporter 2 


101 


100 


9163 


AF107406 


Homo sapiens 


GW128 


155 


60 


9164 


G02902 


Homo sapiens 


Human secreted nrotein SEO ID NO* 
6983. 


97 


51 


9165 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


146 


60 


9166 


AF2 10651 


Homo sapiens 


NAG18 


83 


65 


9167 


V00672 


Pan troglodytes 


reading frame protein 4 


119 


95 


9168 


Y01155 


Homo sapiens 


Secreted orotein encoded bv eene 4 
clone HTBCW32. 


225 


74 


9169 


AF161356 


Homo sapiens 


HSPC093 


100 


75 


9170 


G03714 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7795. 


174 


66 


9171 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


115 


86 


9172 


AF1 19855 


Homo sapiens 


PR01847 


178 


59 


9173 


AF130087 


Homo sapiens 


PR02411 


129 


48 


9174 


AF130079 


Homo sapiens 


PR02852 


149 


59 


9175 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
•7214. 


101 


90 


9176 


AF090894 


Homo sapiens 


PRO0113 


118 


65 


9177 


AL359782 


Trypanosoma 
hrucei 


probable similar to ring-h2 finger 
protein rha la. 


105 


46 


9178 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


100 


52 


9179 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


161 


70 


9180 


AF1 16715 


Homo sapiens 


PR02829 


127 


64 


9181 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


144 


71 


9182 


AF229067 


Homo sapiens 


PADI-H protein 


164 


64 


9183 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


80 



377 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








UO l J. 






9184 


AP000616 


Oryzasativa 


similar to RING-H2 finger protein 

iVXXrVl a \r\Jr\j f OOOJ j 


106 


64 


9185 


G02538 


Homo sanien<z 


Human c/»r»rAtp>/1 «mt*»in QCO m "MO* 
nuiuoii ocucicu pjuiciii, OJCv^ ilj 

6619. 




57 


9186 


G03786 


Homo sanien<; 


Human cffcr»rpl¥»H ni*nt<*m QPH in Mf^l* 

I lUlliaU owLLClCU UIUICUI, OCV/ LU WKJ. 

7867. 


1 1 J 


CO 


9187 


AF090931 


Homo Rflnipnc 


X JS\J\J t rOj 


1 1 T 


08 


9188 


G0T714 


rxuiuu vKipiciib 


Unman earti>afa/l ni<Atai« CPA TT"\ VT/*\« 

riurnan secreiea protein, otxi i\j nu: 
7795. 


13o 


67 


9189 


G02832 




Unman c* a r* r*£» 4-^>sJ nrnfflin PT?A T T"\ X T/^\ » 

xiuman secreceu protein, oiiv£ 11/ in (J: 
6913. 




47 


9190 


D381 12 


Hatha com one 


cytochrome c oxidase subunit 1 


269 


85 


9191 


Ml 8094 


Phaseolus 

Vulgar ib 


hydroxyproline-rich glycoprotein 


150 


40 


9192 


G00552 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


73 


9193 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

770^ 
/ /JO. 


137 


62 


9194 


M18093 


Phaseolus 

vnlcrcrrie 
V Ulgcu IS 


hydroxyproline-rich glycoprotein 


103 


31 


9195 


X77816 


Rattus 
iiorvegiciis 


PR-Vbetat 


98 


47 


9196 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7/177 


107 


48 


9197 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

^777 


111 


68 


9198 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A flft 

H4oo. 


164 


83 


9199 




oacienopnage 

laLUUUa 


E (capsid component;341) 


235 


85 


9200 


G03714 


Homo sapiens 


Human secreted protein, SEQ ED NO: 

770^ 


117 


65 


9201 


X67640 


Homo sapiens 


HS24/P52 


222 


75 


9202 




xiomo sapiens 


xioZ*f/JrjZ 


T> 1 

231 


81 


9203 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7C70 

to /y. 


128 


66 


9204 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

ASIA. 


107 


74 


9206 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


186 


66 


9207 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AAA") 


136 


56 


9208 


G00397 


nuino sapiens 


riurnan secreiea protein, ot»v NvJ. 
4478. 


lot) 


67 , 


9209 


G03714 


nuuiu sapiens 


riurnan secreiea protein, oe\} iu ncj. 

77QS 


1 1 1 


68 


9210 


AF1 16715 


Homo sapiens 


PR02829 


123 


81 


9211 


G03714 


riuiinj sapiens 


riuman secreiea protein, oci^ id wu. 
7795. 




70 


9212 


AF 118086 


rxUllLU SaU LC lib 


i ivL/ 1 y yz 


SLA 


57 


9213 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

717/1 


115 


62 


9214 


J02459 


bacteriophage 
lambda 


E (capsid component;341) 


247 


92 


9215 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


74 


76 


9216 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


134 


74 



378 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 


9217 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


137 


53 


9218 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


130 


44 


9219 


AP000616 


Oiyza saliva 


similar to RING-H2 finger protein 
RHAla(AF078683) 


110 


68 


9220 


L41944 


Homo sapiens 


interferon receptor 


141 


50 


9221 


AF130079 


Homo sapiens 


PR02852 


170 


70 


9222 


G00381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4462. 


100 


85 


9223 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


142 


56 


9224 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear rna-binding 
protein) 


104 


63 


9225 


U80761 


Homo sapiens 


CTG26 alternate open reading frame 


92 


84 


9226 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


157 


65 


9227 


AK027208 


Homo sapiens 


unnamed protein product 


138 


68 


9228 


B25722 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 9 SEQ ID NO:l 11. 


125 


87 


9229 


G00381 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4462. 


127 


88 


9230 


AK025047 


Homo sapiens 


unnamed protein product 


179 


64 


9231 


AF090894 


Homo sapiens 


PRO0113 


195 


57 


9232 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


176 


54 


9233 


AB008227 


Adiantum 
capillus- veneris 


Extensin 


93 


34 


9234 


AF130089 


Homo sapiens 


PRO2550 


98 


73 


9235 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


135 • 


52 


9236 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


162 


62 


9237 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


126 


59 


9238 


G03790 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7871. 


126 


52 


9239 


AE003499 


Drosophila 
melanogaster 


CG 12706 gene product 


166 


37 


9240 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


161 


64 


9241 , 


X92485 


Plasmodium 
vivax 


pval 


112 


61 


9242 


AF055985 


Onchocerca 
volvulus 


pyirolidone-rich antigen 


69 


42 


9243 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


140 


80 


9244 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


187 


92 


9245 


M22332 


Homo sapiens 


unknown protein 


117 


40 


9246 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


109 


66 


9247 


AL136173 


Homo sapiens 


dJ914B9.1 (novel protein (HSPC162) 
similar to Rattus norvegicus 
bithoraxoid-like protein) 


305 


76 


9248 


AF1 16661 


Homo sapiens 


PR01438 


141 


53 


9249 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


144 


76 



379 



WO 01/64835 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
ocore 


% 

Identit 

y 


9250 


J02459 


bacteriophage 

InmHHn 
uuuuua 


E (capsid component;341) 


245 


89 


9251 


J02459 


bacteriophage 

lCUHLrUa 


E (capsid component;341) 


245 


92 


9252 


Y02999 


Homo sapiens 


Fragment of human secreted protein 

pnrn^pH hv opnp 191 
eni*uucu uy gCIIG 


128 


63 


9253 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5197 


124 


72 


9254 


AP0D0616 


nn/7Q cafrii/a 
Saliva 


ctmilar tt\ K tNIf"?— M9 firtOAr nrntpin 

Sim Hoi tu iviiNVj~rLA iiii^ci proiein 
RHAla(AF078683) 


i in 


Oft 
y\9 


9255 


Y97R6R 


nuuiu sauicus 


Unman cfWPtpH nrAfAin aii/*/\/4aH 
xlUIllail aCv^IClCU piUlCJUl C11VUUCU. Ujr 

gene No. 107. 


1^9 


65 




X 5916d 


IViUb UlUSCUlUS 


niftft nr/Stpin f A A 1 «7T\ 


1D6 


69 


9257 


AF217374 


Acanthaster 
pidiici 


cytochrome oxidase subunit I 


259 


98 


9258 


W48351 


Homo sapiens 


Human breast cancer related protein 


94 


64 


9959 




jtiomo sapiens 


Unman co^rA^o rl nrntoin QT7l^l TT\ "NJfV 

Human secreiea protein, invj. 
4718. 


1 ft7 








v/ryza Soiiva 


simiiar to kj/\u ringer protein 
RHAla(AF078683) 


1 90 


oo 


9961 


M9R016 


riomo sapiens 


cytoenrome o 


1 Aft 






I oDZH-o 


rionio sapiens 


xi urn an secreieu proiem ri^txrroo, 
SEQIDNO.163. 


119 
1 1Z 


Oo 


9961 


R557AQ 


xiomo sapiens 


extracellular aomain 01 numan mgn 
auonuy iriN-gainnia receptor. 


1 1 A 
1 lO 


OJ 


9264 


K02403 


Homo sapiens 


complement component C4A 


9025 


99 


0965 




riaSmOaiurn 

vivax 


pval 


loi 




9266 


AK024455 


Homo sapiens 


FU00047 protein 


98 


83 


9969 




nomo sapiens 


Hitman co^rafa^ nr />f A i n QtJA TT*\ XT/^\- 

iiuroan secreteo protein, ocv^ iu invj. 

51179 


1 A1 


CI 


9268 


AF130O89 


Homo sapiens 


PRO2550 


124 


11 


99 60 


J\L>jjy fox. 


I MfHAflAfl AIM A 

i rypanosoma 
brucei 


prooaoie simuar to nng-rtz ringer 
protein rhal a. 


QA 




QO 7 ft 


I lyotw- 


fiomo sapiens 


QDA TD "MO /1ft') fr/\m WftQOO'JOAl 
oJtil^ ID ISkJ hUA ll O ui Wl/77Z/^*fj. 




51 


9971 


VR69il8 

I OOZHo 


riomo sapiens 


Unman cwr^lp^ nrnt^in U/^T-TDPAft 

Jtiuman secreiea protein xii^rLrroo, 
cijA rn wv 1 61 


119 




9272 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795 


155 


72 


9273 


AF229067 


Homo sapiens 


PADI-H protein 


127 


54 


9974 




numu sapiens 


nuuidii secrcieu pri/icm tt/j. 


1 <A 


77 


9275 


X92485 


Plasmodium 

vivav 


pval 


157 


72 


9276 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6951 


132 


60 


9277 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068 


156 


84 


9278 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
^RO TD NO* 1 61 


138 


84 


9279 


Y20772 


Homo sapiens 


Human neurofilament-M mutant 
pruicin ijagzneui dh. 


112 


70 


9280 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


59 


9281 


G02485 


Homo saniens 


Human secreted Drotein SEO ID NO* 
6566. 


103 


63 


9282 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


89 


60 


9283 


X92485 


Plasmodium . 


pval 


124 


79 



380 
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SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Identit 










Score 


y 






vivax 








9284 


G02902 


Homo sapiens 


Human secreted nrotein SFO TD NO* 


95 


75 








6983. 






9285 


AF130087 


Homo sapiens 


PR02411 


138 


78 


9286 


Y27854 


Homo sapiens 


Human secreted nrotein enondpd bv 


140 


54 








eeneNo 101 






9287 


AF004340 


Homo sapiens 


ATPase 6/8 

fill UJv \Jf V 


98 


100 


9288 


S79410 


A/fnc musculns 


nuclear localization signal binding 


102 










nrofpin 






9289 


J02459 


Vi5>r , tprinrvhi»o , A 




*t J / 


07 






lambda 








9290 


AC003058 


ArabidoDsis 


unknown nrotein 


177 


84 






thaliana 






9291 


U23455 


Caenorhabditis 


similar to O melanno-astpr hompotic 


107 


87 






elegants 


Drotein BarH2 (PIR:A4 17261 






9292 


D38112 


Homo sanipns 


WAFifT rlphvHrrKypnasp snhiinit dX 
iinuu Mdijrui ugviiaoG suuiuiu hl< 




01 


9293 


S79410 


Mu<? miiscuhis 


nnclpar lnralizarinn liona! hinHinp' 


139 


61 








protein 






9294 


R28916 


Hfimo sanipns 

X IVJJ 1 1 V/ CXl L/ It/ HO 


Tvnp ITT nrnfAllaopn frvrior arti 
1 jr jjc JJii jyi uiA/iiagGii ally. 


1 17 


33 
jj 


9295 


W48351 


Homo sanipns 

XXV/IX1\/ OdL-rilsliO 


Human hrpast rtanrpr rplatp/1 nrotpin 




77 








BCRB2 






9296 


AF 130089 


Homo sanipns 


PRO2550 

X XWJ£mJ J\J 


137 


jz. 


9297 


G02987 


Homo sanipns 


Uiimon pppr<*tftH nrnfpin QRO fH MO* 
xxixiuaxi ocucivu pjvsicui, ooy il/ xn v^. 




70 








7068. 






9298 


U22376 


Homo sapiens 


alternatively spliced product using exon 


239 


77 








13A 






9299 


AL3 59782 


Trvnanoso rna 


nnssihle fhh v-6 1 11 1 1 09 variant a Hna 


334 


69 






brucei 


pnmnl ptp vtrinn opnnmp 






9300 


AL390114 


Leishmania 


extrpmelv cvstpine/valinp rirh nmtptn 


105 








major 








9301 


D90282 


Homo saniens 


carhamvl nhosnhate synthetase T CF.C 

vui uuiii jr t lulu jyuuiuuuu 1 


276 


96 








6.3.4.16) 






9302 


J02459 


bacterionhape 


E fcansid comDonent*341 1 


231 


93 






lambda 








9303 


M15077 


Photinus nvralis 


Luciferase 


557 


100 


9304 


L00016 


Homo sapiens 


urf4 


303 


93 


9305 


Y14482 


Homo saniens 


Praoment nf* human ^erretpfi nrotpin 


141 


69 








encoded bv gene 17 






9306 


L46721 


Homo saniens 


mucin 

lUUwlU 


204 


80 


9307 


S79410 


A/fnc mnopiiliic 


niif*lf»ar Inf*»H7»tinn cicmal Kinrfinci 
uuV/iuui lUUaxlxAilUll digiiai Uixluxilg 




OH- 








nrotein 






9308 


J02459 


bacteriophage 


E fcaosid comoonent*341 1 


218 


93 






lambda 








9309 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


78 


66 








6613. 






9310 


AF130075 


Homo sapiens 


PR02532 


92 


69 


9311 


G00354 


Homo saniens 


Human secreted nrotein SEO ID NO* 


97 










4435. 






9312 


J02459 


bacteriophage 


E fcaosid comnonent*341 1 


201 


79 






lambda 








9313 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO.* 


133 


95 








5327. 






9314 


AK024455 


Homo sapiens 


FLJ00047 protein 


128 


68 


9315 


D38112 


Homo sapiens 


ATPase subunit 6 


223 


0 1 


9316 


AL160493 


Leishmania 


probable (hhv-6) ul 102, variant a 


113 


85 






major 


DNA, complete virion genome 






9317 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


117 


88 


9318 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


257 


72 








6953. 







381 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Sraitb- 
Wa term an 
Score 


% 

Identit 

y 


9319 


G03172 


Homo sanien^ 


Unman Cf>rrpfr»ri nrnt^in ^PH IH XFfY 

xTuiiidii sewcicu protein, ojca^ iu in v. 
7253. 


mi 
i\j i 


A1 

Ol 


9320 


AK001116 


Homo sapiens 


unnamed protein product 


96 


40 


9321 


G02535 


Homo saoien^ 

A iviUV JUL/ I wJLlO 


Human cpr-rpfpH rtrntoin ^Pfl TD MO* 

6616. 


174 




9322 


AF037081 


Pan troglodytes 


ribonuclease k6 precursor 


109 


100 


9323 


G03800 


Homo sanipn^ 


Unman ♦sprrptpd nrntpin RFfi ID MO' 

XI 111 J I all iCUClCU JJ1UIC1U, OL/y lU LNV/. 

7881. 


140 




9324 


AF1 53056 


Fa^piola hpi-tati***** 

x (uwivtu Lit/jjaLiua. 


icguuiciiiai antigen 


Ou 


44 


9325 


G02827 


Homo *sanipnQ 


xiixiiiait iCL-icicu ui vice ill, oiiy \±j in w. 

6908. 


ZUv 


AQ 


9326 


G02755 


Homo Qfvnipnc 

XI Will vl Od|71v>lio 


Human cpr*rpfpH nrntpin *^T?0 10 MO* 
xxuiuaii secicicu pruiein, ocy llj invj. 

AJH6 


Ovl 


J / 


9327 


AF1 18086 


Homo sapiens 


PR01992 


139 


60 


9328 


O07360 


IxUinu oapicila 


Unman <Sf±/*rat(±A *\rr»t**in QPH TTl MO< 

jtiuman secreieu p rote in, ocy id inw. 
6441. 


1 ill 


i j 


9329 


AF 107406 


xiwlliu aapiCIlo 


VJ W \.£.0 




DO 


9330 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


75 


9331 


X92485 


Plasmodium 

viuav 


pval 


121 


56 


9332 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
wo in wn-iffi 

ODy 11V xNky.lvXJ. 


179 


52 


9333 


G03714 


XXVI 11 vl oaLJlvllo 


Unman c^rrra+prl nrnt<*»in QPH IF) MO- 

numcui secreieu protein, ocy iu inv/. 
7795. 


1 1 J 


o / 


9334 

7JJT 


YRA94R 

I OVx*rO 


xiuinu sapiens 


Unman c-e>r>rair*.A r-vrrtf^irt LI/" 1 [4T>I«£Q 

xiuman secreiea protein civ^rxxroo, 
SEQIDNO:163. 


luo 




7JJJ 


M993^9 


numu sapiens 


unKnown protein 


i JO 


Aft 


9336 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


46 


9337 


Y27868 


Homo sapiens 


Human secreted protein encoded by 

nariP XTrv 1ft7 

gene iNu. iu/» 


160 


76 


9338 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
cione xiiNxir 


150 


68 


9339 




xxuniu sapiens 


Unman ce»rrc*+i>A nrrtfAin CCA \T\ XTO< 

xi in ii dii secreieu protein, jr«y lli invj. 
6619. 


1 AO 




9340 




nuiiiu sapiens 


xxuuian secreieu protein encoucu oy 
gene 22 clone HMSJW1 8. 


1 *^4 


A4 
0*+ 


9341 




xxajiuu sapiens 


FT Iftftft47 ni-rifpin 
xbJuuuH/ protein 


1 17 


70 


9342 




Uattia caniAnc 

XXVllXlV" o&JJlCllo 


Human cprrpfprt* nrotpin QPH TO MO* 
xxuiiiaxi sciyieicu uivhcui, OEs\£ iu inu. 

6613. 


101 

171 


7A 
/O 


9343 


G02987 


Hattia cunipriQ 
xx villi vi acijjiciij 


Human cprrpfpH nmfpfn ^FH TTiMO* 

XXlXlliClXI Ul VJLC11J, uliy 1XX IN v_/. 

7068. 




S9 


9344 


AF032457 


Homo <sanipn<: 
xjLv/ixtvr oajjiciia 


1 11111 1 ,1 j 




1 vvl 


9345 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
443R 


117 


46 


9346 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438 


153 


75 


9347 


X53020 


Plasmodium 

X laolllV/lllUXll 

falciparum 


prvtVirrifx/fp mpmKr*3np-5»ccrkpiat"pH 
cijr uxi viv<jr it* i*jcinuiauw~aoi>vFviaL"vu 

antigen 


vio 


7S 


9348 


G03356 


Homo ^ar>if»nc 


Uiimon CP/*rPtprf T*»7TifPTT> ^Ffl IT) MO' 
XXUlliaXf dClrlClCU t/lUlCUi, JJjy LLf iNvy. 

7437. 


120 


70 


9349 


AF1 30089 

rvx 1JWO7 


Hr»TTio cartipnc 
xwiuv* aopiciio 




116 

1 ID 


V»"7 


9350 


G03714 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7795. 


165 


72 


9351 


G00397 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4478. 


94 


82 


9352 


AL080242 


Homo sapiens 


bA554C12.1 (RBX1 orROCl (ring- 
box or ring finger protein 1)) 


184 


60 



382 



WO 01/64835 



PCTVUS01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 


9353 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


64 




V71 447 


xvaiLiis 
norvegicus 


V/IYT 1, puiallVC 








vjuj /yj 


ilOIBO Sapiens 


Human <zf*rrf>tf±r\ hrnt^in QUA 1T\ XfO' 

tiunidii ocuicicu pioicm, ocv^ lis 
7874. 


yo 




9356 


AF090931 


Homo sapiens 


PRO0483 


148 


76 






Homo sapiens 


Human secreted protein, SEQ ID NOi 

/O/U. 


1 A A 
1*14 




9358 


AC005175 


Homo sapiens 


TA2R_HUMAN, BETA ISOFORM; 
RECEPTOR 


153 


63 




Ar 1 10/ I 3 


Homo sapiens 




175 


oo 


9360 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene cione ruvioJ w io. 


143 


72 


9361 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoaea oy gene z oiiv^ lu jnlj.zju. 


178 


75 




I 14437 


— . 

Homo sapiens 


Human secreted protein encoded by 
gene z / cione iio/v w /\z > . 


on 
yyj 


S7 


9363 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7417 
/4J /. 


86 


76 


9364 


AF229067 


Homo sapiens 


PADI-H protein 


116 


51 




ATTA1 A1 AA 


Homo sapiens 


neuronal uireaa protein ajj/c-in ir 


748. 


o*t 


9366 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


76 


9367 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/ 17 J, 


95 


76 


9368 


G02455 


Homo sapiens 


Human secreted protein, SEQ ED NO: 

OJJO. 


105 


74 




JK.OO4U0 


7Ja*vi *r\. c-i am 

xiomo sapiens 


Human mamx meiaiioprorease 

lVliYlxIIl l cL 


l\JO 




QV7ft 

y j /u 


AT71R1Q77 


nepauus ud 
virus C 


nz. proiein 




JO 




iw vyyjyj i 


numo sapiens 




151 






A FA OA ft Q4 


iiomo sapiens 


pp nn it? 




J f 


9373 


AK0Q2154 


Homo sapiens 


unnamed protein product 


226 


55 




VA7A71 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


14U 


71 




1 1 77404 


Homo sapiens 


P AXTTIDQ 

KAJNrJJro 


JOO 


inn 


9376 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4/170 
44 io. 


155 


52 


9377 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7Q7A 


126 


63 


9378 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7^10 


151 


78 


9379 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


68 


9380 


AL160493 


Leishrnania 
major 


probable (hhv-6) ul 102, variant a 

TVNJ A f^rtmnl^tA \nrtrsn opnniriA 

jl/ina, complete virion genuine 


117 


62 


9381 


G02514 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


104 


47 


9382 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

770S 


139 


57 






jytacaca 
faeciciilaric 


hypothetical protein 


OS 


41 
41 


9385 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


57 


64 


9386 


U04810 


Homo sapiens 


tastin 


151 


100 


9387 


U79260 


Homo sapiens 


unknown 


97 


79 



383 



WO 01/64835 



PCT/US01/04927 



SEQII> 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


(dentit 










Score 


y 


9388 


U 18339 


Variola vim^ 


rviT 

J~" B TJL' 


i \jj 




9389 


G03800 


Homo «?anipnc 


Human ♦wxpfpH nrotpin ^FO II) iNTO* 




7S 








788 1. 






9390 


AL390935 


Leishmania 


nossihle dorian 


99 


88 






major 








9391 


ABO 15633 


Homo sapiens 


tvne 11 membrane nrotein 


108 


100 


9392 


AF 113685 


Homo «;anien<; 


PRO0974 


98 


69 


9393 


AF064819 


Homo sanien<: 


<5prinp nrofpa^p DF.V*l 


227 


69 


9394 


G03786 


Homo <>anipn<* 


Human sprrpfpri nrofpin ^PO TO WO* 

JXlllJ ICXIX oCW CICU 1)1 VlClJIj OlTsW J. J-/ 11V« 


219 


<Q 

oo 








7867 






9395 


G01400 


Homo <ianienQ 


Human <5ArTPted nrofpin ^FO ID MO* 


104 


100 

XX/ tl 








S4R1 

JtO I . 






9396 


AK024455 


Homo ^anipn<; 


Fl 100047 nrotein 




AR 


9397 


AF033260 


nnrnnp 


icvcidc u alloc J iputoc 




S3 






endoo~enou<*; tvne 












C! retrovirus 








9398 


X12517 


Homo <?anipriQ 


C nrotein (AA 1-1 ^9S 


147 

I*T/ 


ss 


9399 


AB09994R 


Unm r» can ipti c 

XIAJllHJ OaJJlCilD 




190^ 

1Z.I/J 


100 


9400 


Y87075 


Horn a wniVnc 


TJi im an cp/**rpf , p | H nmfptn cpnnpnrp Cpn 
XIUIIKUI SCUCICU pil/iCIIl oC^UCll^C uCy 




Q3 














9401 


G02455 


Homo Qanipnc 


Human c&rrr>tpA nrnfpin ^PO m NO' 


1 SO 

Uv 


SR 








6516 






7HUZ. 


M1SS30 

1Y1 1 JJJV 


VTAm/\ conipn c 


D-ccii growui laciur 


1 u 


7A 


7 tVJ 


"VSA93? 


nUlilU oapiCilS 


Lj jssj uigniy oasic proiein 


1 txt 
1 l*t 


US 


9404 


A CO 1874ft 

.TV*-*!/ iO / HO 


A raHiH Ancic 


Pntitoinc oimilurifrx/ tr» D 1 1 rtr'otAin from 

i^uiiutuib ouiiiiaTiiy lur u pruicui lsuiii 


7S 


AS 






fhaliana 

lXI\lHCillCl 


j^i vbv/piiiia ixicicwvgcuivi QV\y\jy\jy l . 






9405 


Y86248 


Homo sanipn<» 


Human <tf*rretpd nrotein HCHPFtSR 

lIUIIIcUI jCA/I ^IV^Vl ytl VJL&1I1 I Lv^>l 11 luOj 


1 14 


oo 














9406 


AF220264 


Homo ^anipn<*. 


MOST-1 


194 

lit 


70 


9407 


/VT 1 170J 1 


nunio sapiens 


PRO 1 799 


1^ / 


AS 
03 


jr*r\JO 


Y99ARS 


* lasnzoui um 


pval 




/o 






vivax 








9409 


Y994RS 


Placmrk/I turn 


. 

pval 




77 






vivav 








9410 


G003S2 


Haiti o oaiMPnc 
xixjuiyj oapiciio 


Human cprrpfpH nrnfpin QPO TO rS/O* 

riUillCUI JCVICICU piUlCUl, OCy ILf 1^1 v/> 


97 
y f 


J J 








4433 






9411 


G03714 


Homo ^aniens 


Human <?ficretprl nrotein SFO ID NO* 


104 

X ITT 


64 








7795. 






9412 


AF1 300R9 


xiajixiu oajjiciid 


PRO? 5^0 


147 

l*T f 




9413 


AF 1 30089 


Homo saniens 


PRO2550 


175 


65 


9414 


I 14848 


Haiti a Q^ni^nc 
AAviiiv zkijjiviio 


^/fH^ ,, plno^ T-rpJafpH nrAtpin 
J-Vuliv^ uiooo i**i viaiv/U piv/iciii 


??*> 


97 


9415 


R95913 


Homo oani pne 


"Mpnral tnrearl nrofp-in 

11 viUI <X1 till WU JJ1UUGUI. 


107 


74 


9416 


AB001684 


Chlorella 


ORF49b 


68 


53 






vulparta 

» UlgUI AO 








9417 


G02532 


Homo *5anien<5 


Human secreted nrotein SFO ID NO* 


111 


so 








6613. 






9418 


W03642 


Homo *sanipn«> 


Human cannahinoid GPR N-terminal 

X XUXXXC1XX ViJiXIX<WXXX\/X\X VJX XX XI twXXXXlXlCXX 


107 

I V / 


73 








sequence. 






9419 


AL050285 


Homo «*anien<; 


hvnothetical nrotein 

xi j Uviuvuvoi r 


194 

I 


100 


9420 


AF090944 

X»-l \J J \J J I I 


Homo <*anipns 


PRO0663 


132 


S9 


9421 


G02872 


Homo mnipni 

X 1U111VJ OCUJICii^ 


Human <ipprptpd nrntpin SFO TO NO* 
x lvjiiicxxi scucivu jji vie ill j ouy x_u/ nv/. 




O 1 








6953. 






9422 


G02532 


Homo Qanipns 


Human secreted nrotein SFO ID NO* 

XXL1XXXCX1X jvV) wlwU L/XV/lwXXXj WUV^ 1JL/ llv/i 


140 


70 








6613. 






9423 


M20030 


Homo sapiens 


small proline rich protein 


62 


41 


9424 


AF1 16718 


Homo sapiens 


PRO2900 


396 


100 


9425 


Y27854 


Homo sapiens 


Human secreted protein encoded by 


102 


35 








gene No. 101. 






9426 


X61047 


Hydra sp. 


mini-collagen 


92 


45 



384 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Wft term & d 
Score 


% 

Identit 
y 


9427 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


105 


71 


9428 


L10908 


Mus rausculus 


Gcapl gene product 


107 


44 


9429 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


79 


87 


9430 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


123 


61 


9431 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ8L 


111 


80 


9432 


X12580 


Medicago sativa 


put nodulin (soybean N-75 homolog.) 


87 


33 


9433 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


106 


48 


9434 


AC003028 


Arabidopsis 
thaliana 


30S ribosomal protein S31 


79 


42 


9435 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


106 


75 


9436 


G02827 


Homo sapiens 

j 


Human secreted protein, SEQ ID NO: 
6908. 


207 


75 


9437 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


163 


85 


9438 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


84 


57 


9439 


AF1 18086 


Homo sapiens 


PR01992 


145 


75 


9440 


AF229067 


Homo sapiens 


PADI-H protein 


140 


64 


9441 


M36913 


Zea mays 


cell wall protein (put); putative 


107 


38 


9442 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


161 


59 


9443 


AF229067 


Homo sapiens 


PADI-H protein 


147 


85 


9444 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


171 


79 


9445 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


155 


57 


9446 


X92485 


Plasmodium 
vivax 


pval 


97 


55 


9447 


AF1 19851 


Homo sapiens 


PR01722 


123 


55 


9448 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


130 


88 


9449 


AL391688 


Homo sapiens 


bA524D16A.2.1 (novel protein similar 
to mouse granuphilin-a) 


258 


97 


9450 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


130 


91 


9451 


X92485 


Plasmodium 
vivax 


pval 


123 


88 


9452 


X83000 


Myocastor 
coypus 


lipase related protein 2 


168 


43 


9453 


M35603 


Mus musculus 


Hoxr3.1 protein 


447 


78 


9454 


M35520 


Canis familiaris 


GTP-binding protein (rab5) 


271 


98 


9455 


D38112 


Homo sapiens 


NADH dehydrogenase summit 4L 


362 


86 


9456 


A13595 


synthetic 
construct 


immunosuppresive protein PP15 


111 


100 


9457 


X67788 


Rattus 
norvegicus 


ezrin, p81 


185 


97 


9458 


U28068 


Mus musculus 


neurogenic differentiation factor 


132 


89 


9459 


AF155140 


Homo sapiens 


gonadotropin-regulated testicular RNA 
helicase; GRTH 


556 


88 


9460 


AF155140 


Homo sapiens 


gonadotropin-regulated testicular RNA 
helicase; GRTH 


379 


94 


9461 


AF169689 


Homo sapiens 


protocadherin alpha 10 alternate 
isoform 


208 


95 


9462 


U73193 


Homo sapiens 


inward rectifier potassium channel 


236 


96 
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SEQID 
NO. 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








Kirl.2 






9463 


U83278 


Bos taurus 


neural specific protein CRMP-2 


157 


61 


9464 


W48352 


Homo sapiens 


Human breast cancer related protein 
BCFLTl. 


74 


57 


9465 


AL031673 


Homo sapiens 


dJ694B14.3 (PUTATIVE novel 
haloacid dehalogenase-like hydrolase 
family protein similar to (archaea) 
bacterial proteins) 


444 


100 


9466 


X67247 


Homo sapiens 


ribosomal protein S8 


519 


90 


(\A £1 

9467 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


115 


51 


9408 


Uo07ol 


Homo sapiens 


CTG26 alternate open reading frame 


95 


84 


94oy 


OUU407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
44oo. 


145 


71 


9470 


AF130056 


Homo sapiens 


PR01367 


98 


76 


5/4/1 


X71354 


Homo sapiens 


vesicular monoamine transporter 


102 


95 


9472 


U66372 


Bos taurus 


ribosomal protein S29 


120 


95 


9473 


D32002 


Homo sapiens 


nuclear cap binding protein 


583 


96 


9474 


W54282 


Homo sapiens 


Protem sequence of the di-alpha 
haemoglobin gene contauied m pSSl . 


469 


91 


947S 


V49572_cdl 


Homo sapiens 


13-NOV-1996 Human stomach cancer 
clone HP10122cDNA #1. 


562 


100 


947o 


D38112 


Homo sapiens 


NADH dehydrogenase subumt 4L 


247 


88 


9477 


AB043820 


Homo sapiens 


platelet glycoprotein VI-2 


95 


47 


9478 


AB020236 


Homo sapiens 


ribosomal protem L27A 


437 


86 


9479 


003714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


149 


52 


9480 


AF205633 


Homo sapiens 


riinjurin2 


364 


100 


9481 


D14886 


Homo sapiens 


TFIIA-37 


196 


100 


9482 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


166 


76 


9483 


AF178842 


Homo sapiens 


Rhesus blood group-associated 
glycoprotein 


128 


75 


n a o a 

9484 


AF 134895 


Homo sapiens 


glyoxylate reductase 


838 


95 


9485 


AC004832 


Homo sapiens 


similar to 45 kDa secretory protein ; 

* • 1 A /"I A A -9 f\ A A -t 

similar to CAA 10644. 1 
(PID:g4164418) 


562 


98 


9486 


Z35761 


Homo sapiens 


TEL/ABL 


230 


71 


9487 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


149 


72 


9488 


AL032631 


Caenorhabditis 
elegans 


predicted using Genefinder 


159 


31 


9489 


AJ271079 


Oenothera elata 
subsp. hookeri 


Ycf2 protein 


113 


36 


9490 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A 1 4T 

4435. 


135 


40 


9491 


Y53005 


Homo sapiens 


Human secreted protein clone pm749_8 
protein sequence SEQ ID NO: 16. 


124 


61 


9492 


Y21091 


Homo sapiens 


Human p53 cellular tumour antigen 
mutant protein fragment 28. 


85 


41 


9493 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


153 


76 


9494 


AFH9851 


Homo sapiens 


PR01722 


94 


54 


9495 


R04932 


Homo sapiens 


Interferon-gamma receptor segment 
from clone 39 responsiblefor binding 
the target 


142 


100 


9496 


U40952 


Caenorhabditis 
elegans 


C03B1.10 gene product 


103 


75 


9497 


M63838 


Homo sapiens 


interferon-gamma induced protein 


128 


92 



386 



WO 01/64835 
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SEQW 


Accession No. 


Species 


Description 


Smith- 


% 


NO. 






Waterman 


Identit 










Score 


y 


QA OR 
ytyo 


/VL/tO I u 1 / 


iNeurospora 


tn III ^MAl I NTIPI FAR 


147 


44 






crass ii 


RmmsJTTPT FOPROTFrM C 

XVJJD v» VI \*^\->JL'\J x JVU J j-.il N 






Q4QQ 


/vr iwooj i 


nomo oapiciii 


iv.v^i tuu aiu5|Jvi ici j 


244 


77 


7JUU 


PA7A*\7 


nomo sapiens 


Qmollpr liAnot/Vf^lliilstr nnrnnrnt^ln 
oliiaiici ucuoiu^ciiuiai v/iicwpiuiwiii 


1S4 
i jt 


68 








ynncni? gene prcuuuuu 






7JVl 






Hum an Qfyretp/I nrotein SFO IT) NO' 


139 


63 








7492 






9502 


AE003499 


Drosophila 


CG 12706 gene product 


180 


36 






mA| on norsictAt* 
UlCUUlUg<U>lC( 












Unmn cunipn c 

xxmiiu oopiciia 


hf>ta~ti i Ki il i fi 
UClM^tUUUllil 


363 


94 


05A4 


n7RA14 


nuiiiu sapiens 


Wth\//lrrin\/rtmiHirmcp rplaffiH 


189 


84 


yj\tj 


T TS9446 


t*ncctf*Q nsinnc 
IM aSSlCa. QapUb 


m , \/mciri3Qp^hinHirio nrAtpin rplstpH 


116 


40 


9506 


A R0 10417 


R affile rsittllQ 


rciHhprin-ft 

vOUllwlUl O 


239 


50 


Q^A7 

7JU/ 


A P770/167 


Homo sapiens 


Jr/VL/l-Xl piuLCUi 


178 

I/O 


74 




A TOOyf oin 

AJzz4o 1 9 


Homo sapiens 


tumor suppressor 


701 

zyi 






AT7A1777^ 


ivius muscuius 


growui laCior juiucpunucnuc-i£> 


747 


40 




A CI 7T)A'3 

Ar l / I/ajd 


Homo sapiens 


cereurai ceu aanesion moiecuic 


JOO 


S7 


a< 1 1 


A VAm 1 £.i\ 

AYOO/loO 


Homo sapiens 


similar to riomo sapiens ovjio / 




QQ 






proiein iniviN/v wiuj vjcux>aiuv 












A rw»ccinn Mumhpr AF1S1S1S 1 






yMZ 


At l/CAITI 


— — r-r : 

Leishmania 


prooaoie ^nnv-o ^ u 1 1 uz, variaui a 


1 A4 

1\/*T 


66 






major 


yiNA, \^IUipiCU7 VII 11/11 gvlimiic 








Y<7 1 £iA 
ADZ 1 OH 


Mus muscuius 


OIAA nrnipin / A A 1 J77\ 

v^juu proxem i-/ / ) 


97 
yi» 


S9 
jy 


yDl*f 




wrycioiagus 


cysteine ncn nair iv.cid.mi ct56uci<u.cu 


96 


40 






cunicuius 


protein 










jvenupus mcvis 


IN IX l 


1777 


76 


9516 


V7A761 


xiomo sapiens 


XlUIOail UCia**allljr li/iu pepuuw j 


IwO 


80 








V\ try r\tnct mvYt^in RPP^ 

umQuig pruicin, DDr J- 






0517 


A KT1745 AA 


JtlUIUO sapiens 


UX faa 1 AO nrntpin 


1714 


100 


Q51R 
yj id 


A FA0R77 A 


DaLallllS ^UUUUa 


I UlvJ 


104 


41 




A T7AQOQ77 


ivius muscuius 


scniaren<t 


1DJ 


15 
j j 


q^7A 
yjzu 


VARA67 


xiomo sapiens 


Unmon pi? 00 A ^ rvmtAm "dm om Ant 

xiuman rivL/z 4 Tj proicm uagiiicui 


167 

1UZ 










aenvcu truui j-/in/vjjojo. 






Q571 

yjzi 


AT A^5A6R 


xiomo sapiens 


HT11 ^ni Q 1 ( nAVAl nmi-pinS 


102 


76 


o<77 
yozz 


"Q7j1^77 
UZ4 3Z / 


xiomo Sapiens 


Uinwian nrAtotn CO/inonAO 

xiuman sccreieu protein octjucsiiw? 


779 


90 
yu 








RnrnHpH Viv optip 1 S ^FO TD NO' 1 St 






Q^77 
yDZj 


A 171707^ 

Ar izy/Do 


xiomo sapiens 


Jvloxl3 


1X7 


100 

lw 


Q^7A 
yjZ*» 


a T7^£7A6 


urosopnna 


monocaiDoxy laic IXOilopUI ICI I 


126 


^2 






meianogasxer 


nomoiogue 






Q^7^ 

y^z.) 


V7A1QC 

z /oiyo 


nomo sapiens 


Unmon P0rrolpr1 nrntpin ^T»r»r»H*»H Y%\l 

xiuman secreieu protem encoaeu oy 


7 I ?A 


97 
y * 








npno 7S 
gciic /J. 






9576 




A^iiq nriiicpiiliic 

lYltlo UlUdwUlUd 




200 


92 


9527 


AR0T2918 




HAnaminp rAcentor D4 

UUUatU Ulv IvvvU Ivl i-^T 


91 


36 






moloch 








9528 


AB007830 


Homo saniens 


CSR2 


995 


99 


9529 


AC004472 


Homo saniens 


PI. 11659 4 


449 


98 


9530 


AF273047 


Homo saniens 


CTCL tumor antigen se20-7 


104 


29 


7 -J J X 


AF1 10079 


TTrmriA <?anif*rK 


PR02852 
x r\.w^«o»iji» 


153 


67 






l-Tr\m n cgnipnc 
nuillU ddpivllb 


nrrvlin A_rir*fi T^rrttAin f»l 
L7l villiic— l lull pi UICIU. VJ 1 


105 


43 


95H 


A1fAA1R45 


nuiiiu Sapiens 


mmaiuoci pruiciii pxuuum. 


1368 


63 


Q514 

yj^** 


o7Q/iin 

o/y'f-lU 




nuClCax lUCall^aUUll bigllal Uliiuill^ 


94 


41 








nro+Am 
piUlwiu 






9535 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NOi 


185 


86 








6613. 






9536 


X99583 


Homo sapiens 


CHL1 protein 


635 


98 


9537 


AL022170 


Homo sapiens 


dJ501N12.1 


111 


37 


9538 


X66179 


Xenopus laevis 


p70 S6 kinase 


120 


75 



387 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 




U68380 


fr^lllllC Online 


csdp 




AO 

yo 




AFO10I44 


nuinu sapiens 


neuronal tnreaa protein au /c-in a r 


Q7 

y I 


yu 


9541 


AB0139I8 


Homo sapiens 


CAD 


369 


78 


Q^A7 




v4cnomauains 
elegans 


similar to adenylate cyclase 




42 








proline-rich protein 9-1 




CI 

51 


9544 


G03435 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

010. 


142 


96 


9545 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


66 | 


9546 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 

A l An a TJf* A TO 1 

clone hcacj s l . 


110 


47 


9547 


AB044546 


Homo sapiens 


mitogen-activated protein kinase kinase 
kinase 


407 


60 




iKoni 

U JZl J 1 


Homo sapiens 


Ca2+/calmoaulin-dependent protein 
kinase I 


660 


100 


O^AO 


VJU3/14 


Homo sapiens 


Human secreted protein, bEQ ID NO: 

77AC 


• Oft 

129 


68 


9550 


Y87852 


Homo sapiens 


Human FGF-5 protein fragment 


146 


100 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


109 


50 




t\az.\ 0.1 
1/4 D I 31 


Homo sapiens 


basigin 


311 


66 




A14o5o 


synthetic 
construct 


protein antigen 


450 


89 




rsA370A. 


Homo sapiens 


Human secreted pro tern, otiv^ iu Nij: 

7271 
foil. 


133 


CO 

52 


9555 


X73458 


Homo sapiens 


protein kinase 


568 


85 


7JJU 


A 11 11 C0A 


Homo sapiens 


DNA polymerase lambda 


1549 


100 


9557 


S56555 


Homo sapiens 


paraoxonase/aryiesterase 


224 


85 






Homo sapiens 


unknown protein 


90 


38 


yDjy 




Homo sapiens 


Human secreted protein sequence 
encoded by gene 47 SEQ ID NO:224. 


66 


44 




VjUJZU3 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


95 


51 


yooi 


VAQA£1 


Homo sapiens 


Human c-myb protein fragment 


77 


76 


Q^£7 


Y^^i*97 
AjjOo/ 


Lycopersicon 
esculentum 


extensin (class II) 


69 


60 


zOoj 


AA^UUz333 


Arabidopsis 
thaliana 


, - A-_r.-j-r.iu M j-l C , l™*1_£* mm mm *m A m Z mm t T T _ 1 _! • — _ lL - - * 

putative SFlo protein {Heliantnus 
annuus} 


111 


31 




A 171 1 RAR7 
AT 1 15U5Z 


Homo sapiens 


DDA1 A AO 


lOo 


C 1 

61 


9565 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


85 


52 


9566 


U10323 


Homo sapiens 


NF45 protein 


480 


82 


2f JO / 


AT?1ftn7^7 


xiomo sapiens 


K^Ury complex SUDUmt 4 


ooo 


90 


9568 


AL035608 


Homo sapiens 


dJ479J7.2 (transmembrane 4 
supenamiiy memoer o ) 


293 


90 


9569 


G00413 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


63 




U4/030 


—? 

Araneus 

diadematus 


fibroin-4 




A A 

44 


QV71 




Homo sapiens 


Human secreted protem, bby ID NU: 
7879. 


83 


54 


0^77 


A fAAiCOOA 


Homo sapiens 


dihydropteridine reductase 


461 


OA 

84 


yo f o 




Homo sapiens 


Human secreted protem, SEQ ID NO: 
4902 


248 


75 


9574 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


90 


36 


9575 


V00488 


Homo sapiens 


alpha globin 


508 


84 


9576 


AFU6719 


Homo sapiens 


PR02987 


483 


91 
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SEQ ID 
NO: 


AWkcuiua IN o. 




l^*c/*i*i ntiAn 
L»CM.I ipilUIl 


Smith- 
Waterman 
Score 


% 

Id en tit 

y 


9577 


L77967 


Ovis aries 


small proline-rich protein with paired 
repeat 


62 


38 


9578 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


474 


92 


9579 


P90387 


Homo sapiens 
(Human) 


N-terminal of human serum albumin 
polypeptide. 


274 


67 


9580 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 
Accession Number P02768; Method: 
conceptual translation supplied by 
author 


271 


88 


9581 


U15779 


Homo sapiens 


p70 


101 


57 


9582 


M19419 


Mus musculus 


proline-rich salivary protein 


102 


36 


9583 


L22030 


Glycine max 


hydroxyproline-rich glycoprotein 


114 


40 


9584 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


105 


32 


9585 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


141 


61 


9586 


U93569 


Homo sapiens 


putative pi 50 


232 


51 


9587 


AF 144054 


Homo sapiens 


apoptosis related protein APR-4 


120 


52 


9588 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
787 L 


128 


58 


9589 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


90 


61 


9590 


L27428 


Homo sapiens 


reverse transcriptase 


120 


46 


9591 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


85 


62 


9592 


AJ133489 


Canis familiaris 


albumin 


204 


60 


9593 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


294 


68 


9594 


M15386 


Homo sapiens 


gamma-globin 


316 


77 


9595 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


134 


68 


9596 


AF1 16719 


Homo sapiens 


PR02987 


309 


89 


9597 


AF1 16719 


Homo sapiens 


PR02987 


485 


94 


9598 


AF068294 


Homo sapiens 


HDCMB45P 


131 


34 


9599 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. | 


152 


70 


9600 


AF1 18082 


Homo sapiens 


PRO1902 


151 


59 


9601 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


186 


68 


9602 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871, 


176 


65 


9603 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQIDNO:220. 


261 


80 


9604 


AF194537 


Homo sapiens 


NAG13 


154 


61 


9605 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


96 


67 


9606 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


100 


33 


9607 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


134 


48 


9608 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


129 


67 


9609 


G04063 


Homo sapiens 


Human secreted protein, SEQ CD NO: 
8144. 


101 


32 


9610 


AF 118080 


Homo sapiens 


PRO1880 


237 


100 


9611 


AF1 19851 


Homo sapiens 


PR01722 


138 


81 


9612 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


178 


55 


9613 


AF090930 


Homo sapiens 


PRO0478 


88 


71 


9614 


AF068294 


Homo sapiens 


HDCMB45P 


121 


49 


9615 


G01657 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


79 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith. 

Waterman 

Score 


% 

Identit 

V 

J 








5738. 






9616 


AF090931 


Homo sapiens 


PRO0483 


143 


63 


9617 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ IDNO:387. 


92 


44 


9618 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


121 


56 


9619 


AB046061 


Macaca 
fascicularis 


unnamed protein product 


93 


44 


9620 


Y87116 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 155. 


122 


71 


9621 


AF1 13685 


Homo sapiens 


PRO0974 


156 


65 


9622 


AF097178 


Equus caballus 


glyceraldehyde-3-phosphate 
dehydrogenase 


119 


75 


9623 


U83303 


Homo sapiens 


line- 1 reverse transcriptase 


134 


62 


9624 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


112 


53 


9625 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


123 


77 


9626 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


164 


73 


9627 


Y86573 


Homo sapiens 


Human eene 91 -encoded orotein 
fragment, SEQ ID NO:490. 


236 


71 


9628 


AF130079 


Homo sapiens 


PR02852 


138 


50 


9629 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


130 


60 


9630 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


167 


61 


9631 


G03102 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7183. 


S7 


50 


9632 


U49973 


Homo sapiens 


ORF1 * MER37* DUtative transoosase 
similar to pogo element 


128 


41 


9633 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


134 


81 


9634 


AF1 18082 


Homo sapiens 


PRO1902 


114 


57 


9635 


G03793 


Homo sapiens 


Human secreted orotein. SEO ID NO* 
7874. 


133 


54 


9636 


L07543 


Leishmania 
tarentolae 


MURF4 


85 


42 


9637 


U49973 


Homo sapiens 


ORF2: function unknown 


163 


44 


9638 


AF090895 


Homo sapiens 


PRO0117 


77 


41 


9639 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


108 


71 


9640 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


90 


46 


9641 


AF1 18078 


Homo sapiens 


PRO! 848 


114 


66 


9642 


.AK024455 


Homo sapiens 


FLJ00047 protein 


131 


79 


9643 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40_3. 


130 


62 


9644 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
787 1. 


141 


42 


9645 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


69 


62 


9646 


AF055985 


Onchocerca 
volvulus 


pyrrolidone-rich antigen 


69 


35 


9647 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


478 


90 


9648 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


155 


50 


9649 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


60 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Ideotit 

y 








UOl7. 






9650 


G03133 


Homo sapiens 


Human secreted protein, SEQ CD NO: 
7714 

/Z. I*t. 


173 


38 


70J 1 


W ooOZ / 




ferreted nrotein encoded bv sene 94 
clone HPMBQ32. 


118 


36 




nfi74.Ro 

VJUZ*toU 


XaUIIIU acLUlGlld 


Human <5ecreted nrotein SEO ID NO 
6561. 


162 


64 




xvUUO/lJ 


h/Iito miic/^nlitc 
IVitiS lUuoCUIub 


nnpn rMrlino' fr^mP f1Q6 AA^ 


182 


51 


9654 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7X70 


138 


54 


9655 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
<7f»ne 44 clone HTDAD22 

H CJ1C i t VlVllV X X X i/AUii*.. 


105 


74 


yOOO 




xiomo sapiens 


Pnlvnpntide irflffmerit encoded hv pene 

i Ul J L/C LIU. vlC lldgtllvUl wLlVv/vtwU L/jr gwilw 

118. 


82 


70 


yOD / 




riomo sapiens 


P/>1\mr»TitiHe frjioment en ended hv pene 

118. 


79 


39 






riomo sapiens 


lomin^l flfp nrrttpin 
ICLLLllil lUvC piUICUl 


76 


31 


9659 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OJOl. 


161 


73 


9660 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 

/TiVtrrn^ oATif* TM*f*nHnefr 

liUidll ) gCUC jpi CUUUl/U 


149 


48 


9661 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 

V/iuiic nr xyixj 


135 • 


62 


9662 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 

terminal rlnnp "5CFH 55c \ 

ICI iUXlial, WUUC y\JL>IX»Ov/ 


126 


71 




T Tvt 007*5 


riomo sapiens 


OPT* 1 • MT<T?'^7* nntative tran^no^a^e 
similar tn dopo element 


141 


63 


9664 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 

similar to nof?o element 


153 


66 


9665 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


142 


93 


yOOD 


AT 1 1 JOoD 


riomo sapiens 




112 


73 


9667 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

£571 

VJ / 1 . 


133 


62 


9668 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7714 


102 


47 


o££o 
yooy 


DIZjIV 


riomo sapiens 


Utimon cArrAtAr) rvrotAin pneoded Hv 

iXUlXluXl o&UlClOU UixJlXsLLt CiUv>v/vlUU i/Jf 

cene 10 clone HDPGP94 


80 


41 


7V /V 


WrOJJ 1 


xTUlxiu oajJiciia 


Human hreast cancer related nrotein 
BCRB2. 


106 


58 


y\j 1 1 




T-Trvmn canipn <i 

lAV/lliv/ dClXJlCllO 


PRO0657 


118 


66 


9672 


AF161356 


Homo sapiens 


HSPC093 


94 


63 


yO fj 


AFAQAQ^n 
i\r\iy\jyj\J 


XlUIIlU Sa[JlGllo 


PRO0478 

X I\Uvt / O 


167 


64 


9674 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214 


98 


50 




0(17640 


Hnmn canien^ 

XXVlXlv 2k^/1VUj 


Human secreted nrotein SEO ID NO: 
672 1. 


87 


70 


9676 




Homo ^aniens 


Human secreted protein, SEQ ID NO: 
4478. 


119 


60 


9677 


X13412 


Rattiis rattiis 


fUr nrotein 


614 


69 


9678 


AJ006770 


C^icer arietinum 

vlvvi tuiviiuuui 


extensin 


163 


48 




Yl Q767 


Homo «5anien^ 


SEO ID NO 485 from W09922243. 


120 


67 


9680 


G00494 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4575. 


106 


58 


9681 


U63542 


Homo sapiens 


FAP protein 


120 


68 


9682 


Ml 5530 


Homo sapiens 


B-cell growth factor 


146 


76 


9683 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


180 


62 


9684 


AF32L379 


Callicebus 
moloch 


gamma2-gIobin 


108 


86 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


9685 


L27428 


Homo sapiens 


reverse transcriptase 


122 


39 


9686 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


77 


51 


9687 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


104 


58 


9688 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


123 


58 


9689 


X81713 


Homo sapiens 


smallest subunit of TF1IA 


210 


91 


9690 


AF193330 


Human 
respiratory 
syncytial virus 


G protein 


100 


25 


9691 


A r* 1 CA1AA 

AF 1501 00 


Homo sapiens 


small zinc linger-like protein 


205 


83 


9692 


AF068294 


Homo sapiens 


HDCMB45P 


122 


54 


9693 


AF1 89307 


Homo sapiens ! 


unknown 


96 


50 


9694 


G03I33 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


165 


50 


9695 


AF130089 


Homo sapiens 


PRO2550 


198 


68 


9696 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


170 


80 


9697 


A VA1 A AC C 

AK024455 


Homo sapiens 


FLJ00047 protein 


137 


63 


A/Aft 

9698 


M24732 


Homo sapiens 


lam in-like protein 


92 


42 


9699 


AF1 16715 


Homo sapiens 


PR02829 


119 


75 


9700 


AF 130089 


Homo sapiens 


xvn *"^T C f A 

PRO2550 


234 


57 


ATA 1 

9701 


V00662 


Homo sapiens 


ATPase 6 


255 


77 


9702 


L10908 


Mus museums 


Gcapl gene product 


93 


35 


9703 


AF 130089 


Homo sapiens 


PRO2550 


208 


78 


9704 


AF194537 


Homo sapiens 


NAG13 


89 


51 


9705 


U79260 


Homo sapiens 


unknown 


98 


77 


9706 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


78 


43 


9707 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


70 


92 


9708 


G00637 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
4718. 


136 


60 


9709 


AF 194537 


Homo sapiens 


NAG13 


99 


48 


9710 


AF 130089 


Homo sapiens 


PRO2550 


87 


58 


9711 


X03145 


Homo sapiens 


pot. ORF III 


135 


41 


9712 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


97 


35 


9713 


AF1 16695 


Homo sapiens 


PR02221 


185 


50 


9714 


AF130051 


Homo sapiens 


nn aaoao 

PRO0898 


108 


68 


9715 


AF130089 


Homo sapiens 


PRO2550 


143 


50 


9716 


G03787 


Homo sapiens 


TT - „ A. 1 * • o f— i if> VIA 

Human secreted protem, SEQ ID NO: 
7868. 


124 


49 


9717 


L27428 


Homo sapiens 


reverse transcriptase 


133 


50 


ATI O 

9718 


AA1 tOO 

G03133 


Homo sapiens 


Human secreted protem, SEQ ID NO: 

TT 1 A 

7214. 


88 


43 


ATI A 


L27428 


Homo sapiens 


reverse transcriptase 


i n 
12/ 


JO 


ATI A 

9720 


A if T jf AAA 

M74009 


Theropithecus 
gelada 


cytochrome c oxidase subunit II 


283 


^T 

67 


QTT 1 

9721 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


TAT 

292 


OA 

84 


9722 


AF130051 


Homo sapiens 


PRO0898 


96 


57 


9723 


U93564 


Homo sapiens 


putative pi 50 


144 


49 


y /Z4 


Y 86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


111 
J /J 


/O 


9725 


AK000496 


Homo sapiens 


unnamed protein product 


155 


55 


9726 


G02640 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6721. 


94 


62 
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SEQ H> 
NO: 


Accession No. 


Species 


Description 


Smith- 

waterman 

Score 


% 

Identit 
y 


9727 


AF130089 


Homo sapiens 


PRO2550 


154 


40 


9728 


AF 130089 


Homo sapiens 


PRO2550 


127 


61 


9729 


M19419 


Mus musculus 


proline-rich salivary protein 


98 


39 


9730 


D90053 


Sus scrofa 


destrin 


119 


43 


9731 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


138 


50 


9732 


AF090942 


Homo sapiens 


PRO0657 


72 


45 


9733 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


110 


44 


9734 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDN0.163. 


159 


94 


9735 


G03797 


Homo saDiens 


Human secreted protein, SEQ ID NO: 
7878. 


150 


85 


9736 


L27428 


Homo saoiens 


reverse transcriptase 


217 


33 


9737 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


89 


65 


9738 


AF007064 


Saccharomyces 
cerevisiae 


basic protein 


108 


31 


9739 


B01372 


Homo sapiens 


Neuron-associated protein. 


128 


60 


9740 


G03140 


Homo saDiens 


Human secreted protein, SEQ ID NO: 
7221. 


109 


50 


9741 


AF194537 


Homo saoiens 


NAG13 


123 


54 


9742 


Y69166 


Homo sapiens 


A mature human N-acetylglycosaminyl 
transferase Drotein. 


101 


77 


9743 


X92485 


Plasmodium 
vivax 


pval 


106 


41 


9744 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


122 


76 


9745 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


128 


50 


9746 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


87 


64 


9747 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


163 


69 


9748 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


124 


74 


9749 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


97 


66 


9750 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


156 


71 


9751 


G03925 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8006. 


157 


71 


9752 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


97 


51 


9753 


X92485 


Plasmodium 
vivax 


pval 
* 


91 


39 


9754 


Y07786 


Vibrio cholerae 


Rfolike protein 


92 


32 


9755 


AF068294 


Homo sapiens 


HDCMB45P 


125 


63 


9756 


L27428 


Homo sapiens 


reverse transcriptase 


91 


34 


9757 


AK024455 


Homo sapiens 


FLJ00047 protein 


143 


59 


9758 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


103 


41 


9759 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


96 


67 


9760 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


179 


58 


9761 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


154 


65 | 


9762 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


91 


78 
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SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








7888. 






9763 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


119 


60 


9764 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


111 


85 


9765 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


116 


36 


9766 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


165 


74 


9767 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


162 


74 


9768 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


117 


52 


9769 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


92 


50 


9770 


AF130089 


Homo sapiens 


PRO2550 


104 


31 


9771 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


154 


62 


9772 


U52077 


Homo sapiens 


mariner transposase 


194 


56 


9773 


Y87297 


Homo sapiens 


Human signal peptide containing 
protein HSPP-74 SEQ ID NO:74. 


345 


100 


9774 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


73 


36 


9775 


M11901 


Rattus 
norvegicus 


proline-rich salivary protein 


96 


32 


9776 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


124 


53 


9777 


L27428 


Homo sapiens 


reverse transcriptase 


251 


53 


9778 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


141 


40 


9779 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


111 


58 


9780 


AJ224997 


Rattus 
norvegicus 


huntingtin 


64 


64 


9781 


AF283769 


Homo sapiens 


similar to GetiBank Accession Number 
AC021163 


119 


53 


9782 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


81 


50 


9783 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAIa(AF078683) 


120 


68 


9784 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


85 


65 


9785 


AF 144054 


Homo sapiens 


apoptosis related protein APR-4 


83 


53 


9786 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


75 


41 




M64792 


Rattus 
norvegicus 


salivary proline-rich protein 


122 


45 


07°. ft 




Sus scrota 


— : . — — 

destrin 


1 AC 

14o 


A1 

43 


Q7RO 


A 1 ftftIO 
AloolZ 


Brassica napus 


extensin 


1U4 


34 


9790 


AF1 30089 


Homo sapiens 


PRO2550 


102 


66 


y iy\ 


at?i i rtnon 

AF 13 0089 


Homo sapiens 


TYTk/"VI CCA 

PRO2550 


326 


70 


9792 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


495 


79 


9793 


M10546 


Homo sapiens 


cytochrome oxidase I 


343 


75 


9794 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


106 


58 


9795 


X92485 


Plasmodium 
vivax 


pval 


108 


48 


9796 


M15530 


Homo sapiens 


B-cell growth factor 


119 


70 


9797 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


148 


65 


9798 


X61046 


Hydra sp. 


mini-collagen 


101 


55 



394 



WO 01/64835 



PC17US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


070Q 


i icy* ^71 


nomo sapiens 


r*40 


82 


36 


9S00 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7X71 
/on. 


170 


52 






r IdSulUUl Uill 

vivax 


iwal 


113 


45 


701/Z 


Y1/L170 


nuiiiu oopiciio 


Praompnt of human <iecreted nrotein 
encoded by gene 14. 


63 


44 




DJol IZ 


Unmn conipnc 

riuinu sapiens 


nvtrirhmmf* c fvxirlacp ^liHiinit "\ 

1* Y IvH^lll Kj\ li C w VJ A 1 vl oj t juuiuiu J 


583 


89 


9804 


G01736 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

JO I / . 


150 


47 


9805 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 


132 


71 


y<5uo 




riasmOulUm 

vivax 


pval 


106 


53 


QOA7 


I oOj /.} 


riomo Sapiens 


"Human apnp 01-#*nrriHed rtrotpin 

fragment, SEQ IDNO:490. 


163 


67 




At 1 iyoDi 


riomo sapiens 




341 


68 


9809 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


77 


57 


9810 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 

dua 1 a / A VCHSL(\9X\ 
xvlXAia \t\r\J iOXJQD) 


133 


72 


9811 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

770^ 


113 


65 


9812 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7714 


118 


37 


9813 


GO3703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/ / OH. 


121 


72 


AO 1 A 

y<>i4 




riomo sapiens 


Human cprj-pfpd nrotfin SFO ID NO* 

4658. 


72 


92 


CUM < 




riomo sapiens 


Hitman cwrpfpH nrotPin *^FO TO NO* 

4770. 


71 


71 


Off 1 £ 

Volt) 


Arl loUox 


riomo sapiens 




93 


45 


9817 


GO3790 


Homo sapiens 


Human secreted protem, SEQ ID NO: 

7R71 
loll. 


207 


59 


AOI O 




— — : 

Homo sapiens 


Unman cprrpfpH nmfpin ^FO ID NO* 

n urn ail secreted pr oicm, ocy u-/ inw. 


99 


46 


Ofil Q 

yoiy 


I 


nomo sapiens 


QAnrpfA/*) nrntpin pnr*/v1f*H hv Pfitie 1 X 

OCWlvlCU Ui v/LCiil CllvA/vtvU UJT gwliv iu 

clnne HNHF029 


130 


69 


9820 


L10908 


Mus musculus 


Gcapl gene product 


99 


37 


OJi71 


AUUnOy 


nomo sapiens 


oT/\wt1i Virtrmnni* 
yuwui ULUlluv/11^ 


254 


81 


9822 


U79260 


Homo sapiens 


unknown 


85 


39 




Jnj/OoUU 


nomo sapiens 


Hitman hicfinrvfp«^ftcrt»ted factor HSF 


125 


62 


9824 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638 


103 


74 




VJVJAJU / 


VJ/\tr»rk c on i f±n c 
nVUllU 2><ipiCLL2» 


Human qprrpted nrotein SEO ID NO: 

6588. 


126 


51 


QR7£ 

yozo 




T-I/vmr» car\if*nc 

nuiiiu odpieuo 


ORF1 • MFR*??* nutative transnosase 

M j 1 v I 1 ■* 1 \ ' 1 J j/mO»l TV Mill ||1J/VJM>IV 

similar to pogo element 


148 


44 


Q877 


VM7RQ 


nuiuu oajJlCLLo 


Human Qftcreted nrotein bf377 1, 


265 


100 


9828 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


72 


60 


9829 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4770 
*♦/ / v. 


118 


45 


yoju 




nomo sapiens 


Human e<»rc»tpri nrntpin SFO 113 MO* 

7344. 


104 


51 


9831 


G00437 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
4518. 


118 


74 


9832 


AF151866 


Homo sapiens 


CGM08 protein 


115 


31 


9833 


U93563 


Homo sapiens 


putative pi 50 


127 


61 



395 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


9834 




Ayieiia 
fastidiosa 


- j : 

hypothetical protein 


97 


30 


9835 


<jU3 1 /Z 


Homo sapiens 


Human secreted protein, ocKi iu NO. 


1 io 
lio 


65 


9836 


B01372 


Homo sapiens 


Neuron-associated protein. 


150 


88 


9837 


G03115 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
/lyo. 


115 


57 


9836 


Ar 068294 


Homo sapiens 


HLKJvlD4jr 


AA 

90 


65 


9839 


S77772 


Homo sapiens 


aspartylglucosaminidase, AGA {C- 
terminal, alternatively spliced} {EC 
3.5.126} 


71 


66 


9840 


Ar 090942 


Homo sapiens 


PRO0657 


1 15 


76 


no a i 

9841 


G03564 


Homo sapiens 


Human secreted protein, 5>bv^ ID NO: 

ItLA C 
/04j. 


75 


54 




UU2363 


Homo sapiens 


Human secreted protein, aty iu Nu: 
6444. 


1D0 


6/ 


Vo43 


ArliOOoy 


Homo sapiens 


rKOZj3U 


zlo 


/J 


yo44 


\A1 CC1A 

M1553U 


Homo sapiens 


b-cell growtn ractor 


111 

1 13 


6/ 


GO A C 

Vo45 


U93567 


Homo sapiens 


p40 


248 


75 


9846 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7000 

/ooo. 


151 


55 


9847 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 


64 


9848 


AF 118082 


Homo sapiens 


PRO 1902 


58 


81 


9849 


Y02671 


TT • _ 

Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


129 


65 


9850 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 oby id no.250. 


329 


77 


9851 


AF090944 


Homo sapiens 


PRO0663 


122 


80 


9852 


X71442 


Rattus 
norvegicus 


ORF 1; putative 


109 


48 


9853 


AF130089 


Homo sapiens 


PRO2550 


126 


60 


nor a 

9854 


G04063 


Homo sapiens 


Human secreted protein, SbvJ ID NO: 
8144. 


AO 

V8 


62 


9855 


U93563 


Homo sapiens 


putative pi 50 


136 


48 


9856 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


276 


74 


Aon 

9857 


Ar 017777 


Drosophila 
melanogaster 


la costa 


AO 

99 


43 


9858 


G03790 


Homo sapiens 


ft — — ^ — a — J ^.A-fcifc Ot?/\ ¥TT\ "VT/*^\- 

Human secreted protein, bbQ ID NO: 

*Tft91 

foil. 


AA 

99 


37 


9859 


K02576 


Homo sapiens 


salivary proline-rich protem 1 


106 


37 


9860 


Y01400 


Homo sapiens 


Secreted protem encoded by gene 18 
cione riNrirozy. 


Uo 


*71 

71 


yooi 


A 171 1 £QAQ 


jioroo sapiens 


uniuiown 


1 ft.** 




7©0Z 


WooOZ/ 


xiomo sapiens 


oecreieu protein encoueci uy gene y*t 
clone HPMBQ32. 


i i j 


JO 




V91B1 1 
IZiOl 1 


jtiomo sapiens 


wr poiypepuue JOr 


i j i 




QQfZA 
yoOH 




norao sapiens 


P*w 




/U 


9865 


AF090895 


Homo sapiens 


PRO0117 


164 


60 ' 


VJ5O0 




Homo sapiens 


Human secretea protein, ot\i id no. 
7214. 


1 AO 

iuy 


4/ 


Voo/ 


Ar0y0930 


Homo sapiens 


rKOU4/o 


1 A£ 
1U0 


<S4 


yoo8 


Y27868 


Homo sapiens 


Human secreted protein encoded by 

msha "NT/-* | AT 

gene jno. iu/. 


16/ 


84 


98fiQ 




UnmA cam An c 

riomv sapiens 


UitmAn <a*rr«tft/i nrntoin QPO TTi NO* 
niumiu o^vicLcvi yisjwui} ocy i iv 1 1 w. 

8144. 




uu 


9870 


AF1 19900 


Homo sapiens 


PR02822 


134 


48 


9871 


AF116715 


Homo sapiens 


PR02829 


111 


81 j 


9872 


U52077 


Homo sapiens 


mariner transposase 


332 


76 



396 



WO 01/64835 



PCT/US01/04927 



cpn its 




Species 


Description 


bmitb- 


% 


NO: 








Waterman 


Idea tit 










Score 


y 


9873 


Y69166 


Homo sapiens 


A mature human N-acetylglycosaminyl 


95 


85 








transferase protein- 






9874 


U93570 


Homo sapiens 


putative pi 50 


243 


44 


9875 


L27428 


Homo sapiens 


reverse transcriptase 


127 


54 


9876 


S80119 


Rattussp. 


reverse transcriptase homolog 


163 


43 


9877 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


64 








4484. 






9878 


AF 187823 


Zea mays 


transposase DOPA 


132 


38 


9879 


W40353 


Homo sapiens 


Human unspecified protein from 


101 


41 








US5702907. 






9880 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


43 








7496. 






9881 


U49973 


Homo sapiens 


ORFI; MER37; putative transposase 


101 


59 








similar to pogo element 






9882 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


52 








4718. 






9883 


G01249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


57 


65 








5330. 






9884 


X92485 


Plasmodium 


pval 


129 


43 






vivax 








9885 


M64792 


Rattus 


salivary proline-rich protein 


98 


37 






norvegicus 








9886 


Y02749 


Homo sapiens 


Human secreted protein encoded by 


89 


59 








gene 1 00 clone HNFIU96. 






9887 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


54 








8153. 






9888 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


40 








8144. 






9889 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


346 


68 


9890 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


38 








7214. 






9891 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


143 


66 








7879. 






9892 


X53581 


Rattus 


ORF7 


81 


56 






norvegicus 








9893 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


53 








7348. 






9894 


AE003859 


Xylella 


hypothetical protein 


99 


27 






fastidiosa 








9895 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


157 


58 


9896 


L26953 


Homo sapiens 


chromosomal protein 


107 


68 


9897 


AL390114 


Leishmania 


probable (hhv-6) ul 102, variant a 


95 


51 






major 


DNA, complete virion genome 






9898 


U93569 


Homo sapiens 


putative pi 50 


106 


52 


9899 


U15647 


Mus musculus 


reverse transcriptase 


118 


47 


9900 


U49973 


Homo sapiens 


ORFI ; MER37; putative transposase 


118 


70 








similar to pogo element 






9901 


U93563 


Homo sapiens 


putative pi 50 


116 


39 


9902 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


81 








4409. 






9903 


X03145 


Homo sapiens 


pot ORFV 


148 


54 


9904 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


65 








7868. 






9905 


AF066073 


Dictyostelium 


SP85; PsB 


113 


52 






discoideum 








9906 


AF090895 


Homo sapiens 


PRO0117 


125 


58 


9907 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


57 








4478. 






9908 


M24732 


Homo sapiens 


lam in-like protein 


95 


35 



397 



S£QH> 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


identit 










Score 


y 


9909 


Y30822 


Homo sapiens 


Human secreted protein encoded from 


96 


55 








gene 12. 






9910 


AF1 18080 


Homo sapiens 


PRO 1880 


120 


66 


9911 


AF116719 


Homo sapiens 


PR02987 


216 


72 


9912 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


150 


75 








gene No. 107. 






9913 


G03703 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


76 








7784. 






9914 


Y14480 


Homo sapiens 


Fragment of human secreted protein 


72 


58 








encoded by gene 14. 






9915 


G00233 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


71 








4314. 






9916 


S79410 


Mus musculus 


nuclear localization signal binding 


1 14 


75 








protein 






9917 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


75 


83 








7888. 






9918 


AF1 16712 


Homo sapiens 


PR02738 


106 


69 ! 


9919 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


77 








6613. 






9920 


AF068294 


Homo sapiens 


HDCMB45P 


296 


59 


9921 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


58 


54 








7077. 






9922 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 


143 


69 








IDNO:103. 






9923 


U38979 


Homo sapiens 


hPMSR3 


135 


46 


9924 


AB002317 


Homo sapiens 


KIAA0319 


294 


75 


9925 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


169 


42 








similar to pogo element 






9926 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase. 


153 


59 








similar to pogo element 






9927 


G00214 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


261 


97 








4295. 






9928 


Y14483 


Homo sapiens 


Fragment of human secreted protein 


863 


76 








encoded by gene 18. 






9929 


AF130051 


Homo sapiens 


PRO0898 


128 


60 


9930 


AF130089 


Homo sapiens 


PRO2550 


123 


41 


9931 


AF064597 


Homo sapiens 


LINE-1 like protein 


101 


48 


9932 


AF027144 


Homo sapiens 


zinc ringer protein 


76 


42 


9933 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


120 


44 








4770. 






9934 


U93569 


Homo sapiens 


putative pi 50 


105 


68 


9935 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


51 








7214. 






9936 


Y73344 


Homo sapiens 


HTRM clone 0258181 protein 


145 


46 








sequence. 






9937 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


-in 
11 








6953. 






9938 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


159 


40 








similar to pogo element 






9939 


D63643 


Homo sapiens 


clathrin coat assembly protein-like 


232 


69 


9940 


Y14480 


Homo sapiens 


Fragment of human secreted protein 


75 


65 








encoded by gene 14. 






9941 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


176 


71 








6566. 








"Vrt A COO 

X04356 


. 

Homo sapiens 


cytoskeletal tropomyosin (AA 1-248) 


j fy 


9 1 1 
61 


9943 


G03683 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


58 


52 








7764. 






9944 


AF130114 


Homo sapiens 


PR02459 


117 


67 


9945 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


297 


64 



398 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








similar to pogo element 






9946 


D00570 


Mus musculus 


open reading frame (25 1 AA) 


1 1 A 

214 




9947 


VA'* Tin 


Homo sapiens 


pot. unidentified reading frame 


lo 


3d 


9948 


000442 


Homo sapiens 


Human secreted protem, SEQ ID NO. 
4523. 


12/ 


CA 

64 


9949 


J02621 


Homo sapiens 


high mobility group protein 14 


90 


CA 


9950 


AF090931 


Homo sapiens 


PRO0483 


1 1 A 

1 14 


/J 


9951 


AF164612 


Homo sapiens 


Gag protein 


132 


63 


9952 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


112 


65 


9953 


Y0S061 


Homo sapiens 


Human c-myb protem fragment 


1 1 o 

118 


59 


9954 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


138 


74 


9955 


G00974 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5055. 


94 


38 


9956 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


124 


61 


9957 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


177 


51 


9958 


X92485 


Plasmodium 
vivax 


pval 


137 


44 


9959 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


142 


53 


9960 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


174 


68 


9961 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


123 


38 


9962 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


124 


56 


9963 


G00577 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4658. 


80 


57 


9964 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


136 


82 


9965 


Y27908 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


92 


53 


9966 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


76 


40 


9967 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


136 


38 


9968 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


101 


67 


9969 


G03797 


Homo sapiens 


Human secreted protein, stQ ID MO: 
7878. 


OA 

yo 




95/70 


YJOlDO 


Homo sapiens 


Human secreted protein #28. 


1 /U 


Oo 


9971 


U79260 


Homo sapiens 


unknown 


142 


67 


9972 


L27428 


Homo sapiens 


reverse transcriptase 


1 AO 
108 


o3 


9973 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


119 


60 


9974 


W48351 


Homo sapiens 


Human breast cancer related protein 
I5CKd2. 


78 


o c 

85 


AATC 

9975 


W88947 


Homo sapiens 


Polypeptide fragment encoded by gene 
118. 


ly 


'JO 

jy 


9976 


AF090944 


Homo sapiens 


PRO0663 


1 AO 

109 


73 


9977 


AF1 16715 


Homo sapiens 


PR02829 


120 


56 


yy/8 


U9Jjo3 


Homo sapiens 


putative pljU 


ill 


CT 
J / 


9979 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566, 


97 


80 


9980 


U42580 


Paramecium 
bursaria 


A658R 


70 


40 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 






Chtorella virus 1 








9981 


G03371 


Horao.sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


87 


61 


9982 


G02363 


Homo sapiens 


Human secreted'protein, SEQ ID NO: 
6444. 


135 


70 


9983 


X61047 


Hydra sp. 


mini-collagen 


90 


36 


9984 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


98 


40 


9985 


Y41552 


Homo sapiens 


Fragment of human secreted protein 1 
encoded by gene 80. 


90 


42 


9986 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


132 


52 


9987 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


115 


41 


9988 


AF042832 


Homo sapiens 


forkhead-related transcription factor 
FREAC-9 


89 


36 


9989 


X03717 


Homo sapiens 


pot. unidentified reading frame 


105 


67 


9990 


AL390935 


Leishmania 
major 


possible dopap 


118 


74 


9991 


AF201949 


Homo sapiens 


60S ribosomal protein L30 isolog 


109 


58 


9992 


AF130114 


Homo sapiens 


PR02459 


115 


68 


9993 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQEDNO:163. 


148 


83 


9994 


AF068294 


Homo sapiens 


HDCMB45P 


196 


43 


9995 


Y95028 


Homo sapiens 


Human clone vp7 1 ORF2, SEQ ID 
NO:128. 


117 


54 


9996 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


74 


53 


9997 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


132 


55 


9998 


AF102771 


Homo sapiens 


cardiac-specific ras association 
(RalGDS/AF-6) domain family 1 
protein isoform ID 


148 


100 


9999 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


213 


88 


10000 


D38112 


Homo sapiens 


ATPase subunit 6 


279 


69 


10001 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


97 


39 


10002 


AK000496 


Homo sapiens 


unnamed protein product 


154 


74 


10003 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


125 


74 


10004; 


B07702 


Homo sapiens 


Protein encoded by the endogenetic 
fragment of HERV-W. 


143 


56 


10005 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


97 


65 


10006 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


154 


72 


10007 


AF021231 


Mus musculus 


acetylcholinesterase-associated 
collagen 


109 


41 


10008 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


69 


78 


10009 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


170 


.78 


10010 


AF130051 


Homo sapiens 


PRO0898 


134 


67 


10011 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


100 


44 


10012 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


95 


78 


10013 


AF073519 


Homo sapiens 


small EDRK-rich factor 1, long isoform 


90 


52 


10014 


AF130052 


Homo sapiens 


PRO0956 


104 


50 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 


10015 


X92485 


Plasmodium 
vivax 


pval 


119 


46 


10016 


AF130089 


Homo sapiens 


PRO2550 


171 


56 


10017 


AF130089 


Homo sapiens 


PRO2550 


172 


58 


10018 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


146 


38 


10019 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


109 


54 


10020 


G02584 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6665. 


109 


43 


10021 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


70 


51 


10022 


AF090942 


Homo saoiens 


PRO0657 


113 


67 


10023 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
nrvcomnotein 


77 


33 


10024 


X55681 


Lycopersicon 
esculentum 


extensin (class I) 


95 


35 


10025 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


93 


61 


10026 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMB032 


153 


42 


10027 


Y73377 


Homo sapiens 


HTRM clone 1645941 protein 


1180 


84 


10028 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


108 


59 


10029 


G00366 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4447. 


112 


65 


10030 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
eeneNo 107 


163 


82 


10031 


R13556 


Homo sapiens 


Protein encoded downstream ofhhc_M 
oncoorotein 


131 


66 


10032 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6532. 


74 


48 


10033 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7001. 


104 


85 


10034 


G03093 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7174. 


192 


60 


10035 


G03790 


Homo saoiens 


Human secreted nrotein SEO ID NO* 
7871. 


153 


55 


10036 


Y14479 


Homo saoiens 


Fragment nf human secreted orotein 
encoded by gene 14. 


128 


76 


10037 


AF1 18086 

1 V-l. & & www 


Homo saoiens 


PRO 1992 


148 


68 


10038 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


125 


47 


10039 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


137 


65 


10040 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


139 


45 


10041 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


128 


75 


10042 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


100 


33 


10043 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


102 


64 


10044 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


135 


71 


10045 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


148 


46 


10046 


X92485 


Plasmodium 
vivax 


pval 


98 


58 
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SEQ 10 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 

y 


1004/ 


YOZO/I 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


102 


87 


10045 • 


O03J56 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


63 


84 






Homo sapiens 


ornithine decarboxylase antizyme 


202 


67 


10050 


X92485 


Plasmodium 
vivax 


pval 


77 


34 


10051 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


87 


40 


10052 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


114 


57 


1003 J 


A PAACOAZ 

AC005396 


Arabidopsis 
thaliana 


putative proline-rich protein 


100 


33 


10054 


G03600 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7681. 


91 


94 


10055 


X73434 


Ovis aries 


KAP5.4 keratin protein 


76 


45 


10056 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


82 


62 


10057 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


115 


66 


10058 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


116 


55 


10059 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


130 


73 


10060 


/~»/\ A/\^*% 

G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


100 


42 


10061 


L27428 


Homo sapiens 


reverse transcriptase 


107 


43 


10062 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


92 


47 


10063 


G03U2 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7193. 


93 


46 


10064 


AF220264 


Homo sapiens 


MOST-1 


135 


48 


10065 


AF130114 


Homo sapiens 


PR02459 


87 


68 


10066 


D86853 


Catharanthus 
roseus 


extensin 


115 


35 


10067 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4581. 


79 


71 


10068 


G04072 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
8153. 


102 


54 


10069 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


93 


77 


IU0 10 


AJ233597 


Mus famulus 


reverse transcriptase 


65 


36 


1 AAH 

1UU /l 


U04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


94 


67 


1U0 IL 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


114 


51 




A V tY\ A AC C 

AK.U24455 


Homo sapiens 


FLJ00047 protem 


146 


66 


10074 


X92485 


Plasmodium 
vivax 


pval 


121 


36 


10075 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


130 


64 


10076 


AF055985 


Onchocerca 
volvulus 


pyrrolidone-rich antigen 


73 


43 


1 AAT7 
10077 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


165 


75 


10078 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


106 


72 


10079 


AF 130050 


Homo sapiens 


PRO0872 


102 


63 


10080 


G04072 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


130 


44 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

waterman 

Score 


% 

Identit 
y 








8153. 






10081 


Y87156 


Homo sapiens 


Human secreted protein sequence SEQ 
IDNO:195. 


285 


78 


10082 


AF068294 


Homo sapiens 


HDCMB45P 


144 


68 


10083 


U41038 


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 


98 


48 


10084 


X92485 


Plasmodium 
vivax 


pval 


108 


53 


10085 


AF 11 8082 


Homo sapiens 


PRO1902 


117 


57 


10086 


AF090942 


Homo sapiens 


PRO0657 


88 


53 


10087 


AF1 18086 


Homo sapiens 


PR01992 


172 


71 


10088 


Y01400 


Homo sapiens * 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


119 


68 


10089 


K02576 


Homo sapiens 


salivary proline-rich protein I 


117 


34 


10090 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


154 


77 


10091 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


138 


56 


10092 


G03I19 


Homo ^aniens 


Human secreted protein, SEQ ID NO: 
7200. 


90 


56 


10093 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


134 


66 


10094 


M15530 


Homo sapiens 


B-celi growth factor 


116 


64 


10095 


G00328 


Homo sanzens 


Human secreted protein, SEQ ID NO: 
4409. 


124 


65 


10096 


M15530 


Homo saoiens 


B-cell growth factor 


87 


68 


10097 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


84 


79 


10098 


G00369 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4450. 


150 


66 


10099 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


157 


70 


10100 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7563. 


103 


60 


10101 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


114 


70 


10102 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


130 


53 


10103 


G03786 " 


Homo saniens 


Human secreted protein, SEQ ID NO: 
7867. 


156 


52 


10104 


G00365 


Homo saniens 


Human secreted protein, SEQ ID NO: 
4446. 


87 


60 


10105 


L27428 


Homo sapiens 


reverse transcriptase 


220 


38 


10106 


W90834 


Homo sapiens 


Human lymphocyte targeted peptide 
#2. 


91 


43 


10107 


AF161356 


Homo sapiens 


HSPC093 


227 


61 


10108 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


123 


64 


10109 


AF130089 


Homo sapiens 


PRO2550 


126 


77 


10110 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


118 


60 


10111 


M63274 


Plasmodium 
falciparum 


malaria antigen 


81 


54 


10112 


AF090944 


Homo sapiens 


PRO0663 


133 


70 


10113 


L27428 


Homo sapiens 


reverse transcriptase 


91 


32 


10114 


Y17221 


Homo sapiens 


Human secreted protein (clone fk3 17- 
3). 


112 


64 


10115 


W88947 


Homo sapiens 


Polypeptide fragment encoded by gene 
118. 


85 


44 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1U1 io 




riomo sapiens 


reverse transcriptase 


112 


44 


1U1 1 f 


owz ju i 


rtomo sapiens 


Human secreted protein, ahQ ID NO: 

OjoZ. 


1 1 c 

1 15 


61 


10118 


G04063 


Homo sapiens 


Human secreted protein, SEQ JD NO: 


79 


34 


10119 


AF194537 


Homo sapiens 


NAG13 


295 


51 


1 I/IZU 


U\J\)j IK) 


Miis museums 


open reading frame (196 AA) 


97 


40 


10121 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


122 


47 


JU1ZZ 


W4oj j 1 


Homo sapiens 


. .. . 

Human breast cancer related protein 
BCRB2. 


1 AO 

148 


55 


1UIZJ 


vjU3 /y3 


Homo sapiens 


Human secreted protein, SbQ ID NO: 
7874. 


AO 

92 


66 


1U1Z4 




Homo sapiens 


rKU065 / 


127 


78 




O0044Z 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


145 


68 


1U1Z0 


O03 807 


Homo sapiens 
___ 


Human secreted protein, SEQ ID NO: 
/ooo. 


113 


49 


JU JZ/ 


uyjj03 


Homo sapiens 


putative pi 50 


A/t 

94 


35 


10128 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone riUALJo 1 . 


146 


79 


1 AIOA 


ATI 3AACO 


Homo sapiens 


T>T> r\*\ C CA 


208 


75 


10130 


M15530 


Homo sapiens 


B-cell growth factor 


102 


73 


1 A1 11 

10131 


A55053 


Lycopersicon 
esculentum 


extensin (class I) 


122 


30 


10132 


L27428 


Homo sapiens 


reverse transcriptase 


197 


40 


1 A1 "11 


VTA 1 A AA 

Y 0 1400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


99 


73 


1 A1 'lA 

11)134 


Ax 194537 


Homo sapiens 


NAG 13 


220 


48 


10135 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


116 


75 


10136 


G034U 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

*7A AO 

7492. 


117 


61 


10137 


Y86573 


Homo sapiens 


Human gene 91-encoded protein 
fragment, SEQ ID NO:490. 


223 


59 


10138 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A TO 

4478. 


178 


75 


1 A1 1C% 

10139 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


170 


52 


1 A1 /f A 

1O140 


VI A A OA 

Y 14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


133 


64 


10141 


AF072164 


Homo sapiens 


HFSE-1 


80 


60 


1 A "MO 


IjOZZl 1 


Homo sapiens 


riuman secreted protein, ofcl^ ID Nu: 
6292. 


1 A*> 

103 


74 


1U1*W 


A/Ti ^ia 


Homo sapiens 


B-cell growth factor 


91 


40 


10144 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/OOO. 


103 


61 


10145 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
oo!3. 


162 


65 


10146 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4460. 


128 


53 


10147 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


112 


61 


10148 


X71975 


Drosophila 

m Pi 5inf\<T5icT'OT 
lUClailUgoaver 


put homologue to S.cerevisiae GAR1 
gene 


118 


39 


10149 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


123 


49 


10150 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


163 


81 
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SKQW 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


lUCQUi 










Score 


y 


10151 


Y08769 


Rattus 


microvascular endothelial 


106 


83 






norvegicus 


differentiation gene 2 






10152 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


117 


52 






major 








10153 


AF 113685 


Homo sapiens 


PRO0974 


133 


62 


10154 


U41038 


Caenorhabditis 


Similar to eadherin-type repeat 


129 


67 






elegans 






10155 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


89 


50 








8144. 






10156 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


131 


41 


10157 


AB011099. 


Homo sapiens 


KIAA0527 protein 


94 


35 


10158 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


78 


72 








7888. 






10159 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


71 








6444. 






10160 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


161 


68 








7871. 






10161 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


88 








6953. 






10162 


Ml 5530 j 


Homo sapiens 


B-cell growth factor 


137 


63 


10163 


AF090895 


Homo sapiens 


PRO0117 


93 


67 


10164 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


60 








7124. 






10165 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


52 








4478. 






10166 


AF 11 8082 


Homo sapiens 


PRO1902 


111 


55 


10167 


AF090895 


Homo sapiens 


PRO0117 


112 


75 


10168 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


69 








4409. 






10169 


AF090942 


Homo sapiens 


PRO0657 


149 


67 


10170 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 


68 








7870. 






10171 


AL160371 


Leishmania 


probable (hhv-6) ul 102, variant a 


85 


SI 






major 


DNA, complete virion genome 






10172 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


136 


62 








terminal, clone XEH.8c} 






10173 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


158 


63 








6613. 






10174 


AF090942 


Homo sapiens 


PRO0657 


75 


44 


10175 


U12390 


Cloning vector 


beta-galactosidase alpha peptide 


97 


47 






pSportl 








10176 


Y01154 


Homo sapiens 


Protein sequence Seq Id 54 from 


108 


75 








WO9901020. 






10177 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


168 


80 








similar to pogo element 






10178 


M22332 


Homo sapiens 


unknown protein 


131 


62 


10179 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


160 


60 




• 




similar to pogo element 






10180 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


65 


68 








4449. 






10181 


AF1 19900 


Homo sapiens 


PR02822 


132 


72 


10182 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


56 








8153. 






10183 


G02490 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


124 


59 








6571. 






10184 


AF210651 


Homo sapiens 


NAG18 


150 


65 


10185 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


46 








4463. 






10186 


X71442 


Rattus 


ORF 1 ; putative 


91 


47 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Sraith- 

Waterroan 

Score 


% 

Identit 

y 






norvegicus 








10187 


AK024455 


Homo sapiens 


FLJ00047 protein 


122 


54 


10188 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


82 


72 


10189 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


78 


61 


10190 


AF220264 


Homo sapiens 


MOST-1 


165 


81 


10191 


Y91617 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 19 SEQ ID NO:290. 


54 


44 


10192 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


119 


80 


10193 


X92485 


Plasmodium 
vivax 


pval 


128 


57 


10194 


X61296 


Rattus 
norvegicus 


open reading frame 2 


79 


72 


10195 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


116 


62 


10196 


AF130089 


Homo sapiens 


PRO2550 


144 


62 


10197 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


124 


56 | 


10198 


AF090942 


Homo sapiens 


PRO0657 


133 


76 


10199 


S79304 


Rattus sp. 


cytochrome oxidase subunit I; COX I 


280 


66 


10200 


S61070 


Homo sapiens 


reverse transcriptase homolog=pol 
{retroviral element} 


188 


49 


10201 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
787 1. 


125 


51 


10202 


AF068294 


Homo sapiens 


HDCMB45P 


174 


65 


10203 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


174 j 


50 


10204 


U93572 


Homo sapiens 


putative pi 50 


77 


40 


10205 


L27428 


Homo sapiens 


reverse transcriptase 


133 


50 


10206 


J02621 


Homo sapiens 


high mobility group protein 14 


94 


57 


10207 


AF130079 


Homo sapiens 


PR02852 


144 


73 


10208 


R95913 


Homo sapiens 


Neural thread protein. 


134 


51 


10209 


U49973 


Homo sapiens 


ORFl ; MER37; putative transposase 
similar to pogo element 


154 


76 


10210 


AB046048 


Macaca 
fascicularis 


unnamed portein product 


162 


60 


10211 


AF090931 


Homo sapiens 


PRO0483 


128 


79 


10212 


AF130051 


Homo sapiens 


PRO0898 


144 


76 


10213 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


95 


54 


10214 


AF130089 


Homo sapiens 


PRO2550 


158 


70 


10215 


G03564 - 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


96 


55 


10216 


M15894 


Homo sapiens 


chorionic somatomammotropin 
precursor 


116 


63 


10217 


AK024372 


Homo sapiens 


unnamed protein product 


85 


56 


10218 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSR 


118 


69 


10219 


M37679 


Mus musculus 


Ig heavy chain precursor 


64 


83 


10220 


G03806 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7887. 


91 


69 


10221 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


127 


47 


10222 


Y36505 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 29. 


102 


48 


10223 


AF09094O 


Homo sapiens 


PRO0644 


105 


76 


10224 


U49974 


Homo sapiens 


mariner transposase 


155 


76 


10225 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


154 


42 
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WO 01/64835 
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SEQ1D 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 


ldentit 










Score 


y 


10,776 


41*701610 
/vrzy.30jy 


liOITlO SapiCUS 


Hi/win tvvprvtAr rpnrf*ccnr 






10777 


T 77478 
JLZ /4Z5 


HOmO bapiCIlS 




137 


44 




f»rt74K^ 
OUZ^fOJ 


riorno bapiciib 


rxuiiidii bcuicLcu piuicixx, ocy %XJ inv_/. 


iov; 

xuo 


7rt 

/V/ 








VJOU. 












p40 


206 


go 




XU5U01 


JtlOCDO oapit/Ilo 


Unman f-mvK rvt-nivMn frncrm*»nt 

riuniaii i»~nijru pruiciij liagiiicui. 


l id. 

1 1*T 




1 A711 
1UZ31 


At 1 1 0O0Z 


Homo sapiens 


"DTJ/^I OA7 
rlvJ iy UZ 


/O 


^7 
3Z 


1 A7^7 


At 1 1 0O00 


Homo sapiens 


DOAI 007 


O0 

yy 


7C 
/J 


10233 


X53375 


Heliantnus 


antner-specinc protem or 1 5 


yj 


t>4 






annuus 








10Z34 


VA1 1 Cft 

YOl IDo 


Homo sapiens 


— : -~ 

Secreted protein encoded by gene 1 8 


1 DO 


77 
/Z 








clone rtw\AJJo 1 . 






t AOIC 

10Z35 


prt? I 11 

Cj03 133 


Homo sapiens 


Human secretea protem, oiiq iu inu- 


1 AH 
100 


3j 








771 a 
/Z14. 






10230 


L2742o 


Homo sapiens 


reverse transcriptase 


Z44 


jU 


10237 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


A1 

V3 


TA 

/o 








terminal, clone XbH-oc} 






10235 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 


I4Z 


/Z 








clone rHMrirvyzy. 






1A710 
lUZJy 


VAl 4 Art 


xiomo sapiens 


O o/»rofoH nrnto > »"\ 1w nana 1 J? 

oecretea protem encoaea oy gene i o 


140 










clone nJNrirwzy. 






1 A^VtA 

10240 


S52010 


Mus sp. 


1st Met is at position 21 


AO 

ys 


4Z 


10241 


Al 1 1 A1 jC7 

AJL110147 


Homo sapiens 


hypothetical protein 


IOC 

IZj 


3j 


10242 


X9d485 


Plasmodium 


putative 


or 

55 


51 






vivax 








10243 


55o722 


Homo sapiens 


r— : : — : — — 

X-linked retinopathy protem {C- 


*7A 

/o 


0J 








farmtnal ««lnnA YT7U 5/»l 

terminal, clone AJDrLoC) 






1 A044 
1UZ44 


I 5 /004 


Momo sapiens 


riuman secreiea protem sequence ocxi 


17^ 

1ZJ 


40 














1UZ43 


I T4Q071 
U4yy/3 


xiomo sapiens 


wrvrZ- nine u on uniuiowij 




O 1 


1UZ40 


VA7 C7A 


Homo sapiens 


sauvary proime-ncn protein i 


1 11 


DO 


1 AO/17 


A F11AAOQ 


tiomo sapiens 


rlNAJZJjU 


110 


71 


10245 


003 /yo 


Homo sapiens 


riuman secretea protem, ocy iu rsw. 


i Art 

100 


^0 

oy 








7C71 
#5/1. 






1024y 




— — ; 

Homo sapiens 


putative pi 50 


1 AO 

iuy 


43 


1 AOCA 

10200 


Ar 1300oy 


Homo sapiens 


do rvi C CA 


1 A/1 

104 


34 


10251 


G04063 


Homo sapiens 


Human secretea protem, SEQ ID NO: 


7l> 


36 








8144. 






10252 


G04063 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


i in 

110 










ol44. 






irt7ci 


/"1AAC&& 

vjOOjoo 


Homo sapiens 


Unmnn na/^alori nrnluin C17r*\ 1T\ ~NTI^ a 

riuman secretea protem, oc^ \±j inu. 


171 
1ZJ 


^4 
04 








4ooy. 






1A7^A 


UUUOj / 


nomo sapiens 


riuman secreiea proiein, ocx^ uj iskj. 




^7 








/171ft 
4/lo. 






107^^ 


Yftl 1 4^; 


rioroo sapiens 


pOu VJlvT V 


7A 


£1 


lUZOO 


VAl AA^ 
I 0 140 J 


Homo sapiens 


oecretea protem encoaea oy gene zj 


1 1 A 
1 10 


00 








cione riiJrrJAHo. 






IvJZj/ 


flrtl41 1 
O034 11 


— : 

Homo sapiens 


Unman oo^af/vl nmfain CCA TP\ XTO* 

numan secretea protem, oov i*-* rNv/- 


111 

J ID 


05 








7407 

/4yz. 






1 A7C£ 


n A A1 10 

U00333 


■— ; 

Homo sapiens 


riuman secretea protein, oxiv£ ll/ inc 


135 


05 








AAl A 
4414. 






lUZDy 


1 1400*77 

U4yy / 3 


— : 

Homo sapiens 


UKr i , MHK3 /, putative tninsposase 


130 


Art 

00 








similar to pogo element 






lOZoU 


/*lAO CI Q 


— -— : 

Homo sapiens 


riuman secretea protem, orA^ id nkj. 


yj 


jy 








MOO 






10261 


M86246 


Homo sapiens 


EHS-2 


96 


62 


10262 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


180 


66 








6953. 






10263 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


102 


62 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Jdentit 

y 








7871. 






10264 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


313 


84 


10265 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


144 


56 


10266 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


150 


61 


10267 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 


408 


89 


10268 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


74 


26 


10269 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


133 


52 


10270 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


105 


61 


10271 


X61048 


Hydra sp. 


mini-collagen 


121 


40 


10272 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


150 


66 


10273 


L24521 


Homo sapiens 


transformation-related protein 


134 


51 


10274 


G03806 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7887. 


96 


47 


10275 


U59446 


Brassica napus 


myrosinase-binding protein related 
protein 


115 


39 


10276 


GO 1480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5561. 


482 


98 


10277 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


143 


81 


10278 


U93564 


Homo sapiens 


putative pi 50 


130 


46 


10279 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


123 


44 


10280 


AF090944 


Homo sapiens 


PRO0663 


156 


73 


10281 


AF130079 


Homo sapiens 


PR02852 


146 


75 


10282 


X89401 


Homo sapiens 


ribosomal protein L21 


108 


48 


10283 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


65 


66 


10284 


Y25821 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 41. 


1602 


98 


10285 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


308 


43 


10286 


AF090931 


Homo sapiens 


PRO0483 


131 


74 


10287 


Y48480 


Homo sapiens 


Human breast tumour-associated 
protein 25. 


203 


90 


10288 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


116 


44 


1 AO OA 

10289 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


160 


ir a 

64 




A67d03 


Mus musculus 


TV) 

T2 


147 


37 


10291 


AF229067 


Homo sapiens 


PADI-H protein 


123 


43 


1 AO AO 

10292 


A 17/\**f Ot> A A 

AF078844 


Homo sapiens 


hqp0376 protein 


125 


59 


10293 



G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


123 


37 


10294 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


103 


64 


10295 


G03807 


■■ ■ - 
Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


162 


70 




v uuooz 


-— ; 

Homo sapiens 


a i r ase O 


zoo 


oo 


10297 


AB007861 


Homo sapiens 


KIAA0401 


424 


92 


10298 


M24732 


Homo sapiens 


lam in-like protein 


183 


40 


10299 


D90053 


Sus scrofa 


destrin 


123 


48 


10300 


X61047 


Hydra sp. 


mini-collagen 


86 


43 



408 



WO 01/64835 POYUS01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


\f\if\t 
10 JO I 


U4y973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


loU 


CA 

50 




API 1 Q(Y7St 


Homo sapiens 


r>"D f~\l QAQ 

rKU154o 


t ^7 


Oj 


lUoOJ 


ADDOol 


Ly cop ers i con 
esculentum 


extensin (class I) 


ft7 
of 


JO 


1UJU4 


DO£?AA 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


IUj 


£7 
02 






Homo sapiens 


rio-associatea spi icing iactor 


SLA 
OH 


77 


10306 


AF068294 


Homo sapiens 


HDCMB45P 


220 


42 


10307 


o79410 


Mus musculus 


nuclear localization signal binding 
protein 


yO 


A 1 

41 


10305 


AJ005559 


Musmusculus 


SPR2A protein 


64 


42 


10309 


AL353995 


Arabidopsis 
thaliana 


AtAGP4 


AO 

98 


38 


10310 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


213 


92 


10311 


M15386 


Homo sapiens 


gamma-giobin 


614 


88 


10312 


V00488 


Homo sapiens 


alpha globin 


574 


90 


10313 


AL049730 


Arabidopsis 
thaliana 


pEARLl 1-like protein 


117 


38 


10314 


AL356299 


Homo sapiens 


bK3216D2.1.1 (S- 
adenosylhomocysteine hydrolase 
(SAHH}, isotonn l ) 


256 


70 


10315 


AF1 94537 


Homo sapiens 


NAG13 


113 


63 


10316 


S58722 


Homo sapiens 


X- linked retinopathy protein {C- 
terminal, clone Abxi.oC) 


AA 

99 


62 


t a*» 1 1 
10j17 


A PAAAA1 1 

AF090931 


Homo sapiens 


PRO0483 


oO 


oo 


103 IS 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


1 AO 

102 


11 


10319 


L29219 


Homo sapiens 


clkl; putative 


341 


100 


10320 


V004oo 


Homo sapiens 


alpha globin 


3 14 


OA 

oV 




AJ223y53 


Homo sapiens 


nr 1 1U 




oy 


10322 




Homo sapiens 


glutathione transferase Tl 


to 


7^ 
fj 


10323 


\TA CIOO 

Y4D382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


101 


i 

OO 


1U324 


A U1 1 £71 < 
AJF 1 10/ 13 


Homo sapiens 




IZo 


71 


10325 


AF1 16619 


Homo sapiens 


PRO1051 


125 


77 


1 AM£ 

1032a 


Y14318 


Homo sapiens 


peroxisomal ABC-transporter 


OAQ 

24y 


1 Aft 
1UU 


i Aon 
10327 


U73844 


Homo sapiens 


ESc-la 


Ioj 


OA 

yo 


10328 


AB051901 


Homo sapiens 


VDUP1 


132 


83 


10329 


AL163300 


Homo sapiens 


human ubiquitin conjugating enzyme 

/"*<-> r?/~i /" o o in « 

02 EC 6-3.2. ly. 


TO A 

720 


1 AA 
100 


10330 


AF1 17237 


Homo sapiens 


prefoldinsubunit2 


729 


100 


1 All 1 

10331 


1242054 


Homo sapiens 


KIAA0092 gene product is distantly 
related to smootn muscie myosin. 


7.^7 

332 


CM 
64 






Homo sapiens 


zoo proteasome a x rase suounu 


*1J 1 


i nn 


lUJ jj 


ly^-yoo 


Homo sapiens 


riuman secreieu protein cione mn l / 1 

pjuiciij sequence ocy u-' inv.iau . 




yo 




AOl/J /O 


DiaSSlCa nap US 


pruLinc-riuii pruieui 


1 14 

1 I** 




10335 


M34427 


Homo sapiens 


T-plastin 


343 


91 


10336 


V00488 


Homo sapiens 


alpha globin 


445 


88 


10337 


Ml 5386 


Homo sapiens 


ganima-giobin 


402 


85 


10338 


X61 123 


Homo sapiens 


BTG1 


367 


A A 

90 


10339 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 
Accession Number P02768; Method: 
conceptual translation suppneu. uy 
author 


278 


87 


10340 


A03992 


synthetic 
construct 


growth hormone 


145 


61 


10341 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


57 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








4518. 






10342 


U02680 


Homo sapiens 


protein tyrosine kinase 


140 


92 


10343 


AF 180681 


Homo sapiens 


guanine nucleotide exchange factor 


1425 


88 


10344 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


123 


69 


10345 


V00488 


Homo sapiens 


alpha globin 


318 


93 


10346 


M17885 


Homo sapiens 


acidic ribosomal phosphoprotein (P0) 


329 


98 


10347 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


100 


66 


10348 


U29953 


Homo sapiens 


pigment epithelium-derived factor 


359 


74 


10349 


G01790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5871. 


85 


66 


10350 


x r/\ a i no 

V00488 


Homo sapiens 


alpha gtobin 


361 


90 


10351 


U93567 


Homo sapiens 


p40 


203 


84 


10352 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQIDNO:220. 


253 


85 


10353 


AF1 94537 


Homo sapiens 


NAG 13 


196 


41 


10354 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


297 


81 


10355 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


144 


65 


10356 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


94 


54 


10357 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


153 


68 


30358 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


136 


53 


10359 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear rna-binding 
protein) 


95 


77 


10360 


X61296 


Rattus 
norvegicus 


open reading frame 2 


87 


40 


10361 


L24521 


Homo sapiens 


transformation-related protein 


103 


32 


10362 


G03790 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
787L 


101 


55 


10363 


Y95028 


Homo sapiens 


Human clone vp7_l ORF2, SEQ ID 
NO: 128. 


89 


80 


10364 


G0O985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


101 


58 


10365 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQ ID NO:220. 


322 


85 


10366 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


101 


59 


10367 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


125 


53 


1036S 


G02363 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6444. 


120 


59 


10369 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


60 


83 


10370 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


169 


81 


10371 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


80 


75 


10372 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7UOO. 


60 


83 


10373 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


89 


77 


10375 


AF130089 


Homo sapiens 


PRO2550 


139 


78 


10376 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


62 



410 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 








7874. 






10377 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


96 


51 


1037S 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


132 


58 


1U3 /y 


Woooz / 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


1 -J *7 

1 17 


71 


1 A*> OA 

lUoXU 


VAl/IAA 1 

Y0140U 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


125 


54 


i mot 




Homo sapiens 


putative pi 50 


160 


60 


10382 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


140 


77 


10383 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. . 


83 


71 


10384 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Di 
protein 


137 


63 


10385 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


67 


50 


10386 


X62691 


Homo sapiens 


ribosomal protein homologous to yeast 
S24 


117 


58 


10387 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


141 


45 


10388 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


153 


70 


11)385* 


I TA1C7A 


Homo sapiens 


putative pi 50 


112 


58 




Ar09094Z 


Homo sapiens 


PRO0657 


126 


68 


10391 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


123 


64 


10392 


AF130089 


Homo sapiens 


PRO2550 


144 


71 


10393 


t t A An r\ 

U40739 


Homo sapiens 


cyclin C 


464 


86 


10394 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


151 


75 


10395 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 10L 


137 


55 


iu3yo 


Wooy47 


Homo sapiens 


Polypeptide fragment encoded by gene 
118. 


165 


66 




A CAOAAOA 


Homo sapiens 


PRO0478 


133 


80 


10398 


AK024455 


Homo sapiens 


FLJ00047 protein 


153 


62 


1 A1 f\f\ 

10399 


T» A£OAA 

R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


142 


69 


10400 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


73 


51 


10401 


AB030033 


Dictyostelium 
discoideum 


AmiB 


82 


33 


1 A/I AO 




Homo sapiens 


Protein encoded downstream of hhcJM 
oncoprotein. 


1 13 


63 






Homo sapiens 


Human secreted protein, ofcv "J NU: 
4409. 


1 A O 

148 


11 


1 A/1 A/1 




Homo sapiens 


IV i Xi\i~\f\ A T - a. _ * ^_ 

FLJ00047 protein 


1 1 o 

138 


71 


1 /"MAC 




Homo sapiens 


Human secreted protein, SbQ ID NO: 
7143. 


100 

128 


59 


1U4UO 


OAO>1 OA 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


1 An 

143 


76 


10407 


AF090942 


Homo sapiens 


PRO0657 


105 


64 


1 A/4 no 
10405 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, cione AUxi.oC) 


93 


54 


10409 


J02621 


Homo sapiens 


high mobility group protein 14 


140 


64 


10410 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


87 


46 


10411 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


79 


56 



411 



WO 01/64835 



PCT/USO 1/04927 



SEQI0 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 








7879. 






10412 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


79 


66 








* 1 1 l/TITT mf% \ 

terminal, clone XEH.8c} 






1 A A 1 1 

10413 


An oaata 

AF 13 0079 


"Tj : 

Homo sapiens 


PR02852 


184 


90 


1 AA 1 A 

10414 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 


237 


77 








SEQ ID NO:220. 






10415 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


122 


64 








A A 1 T 1M T\ a rv^n 

gene 44 clone KYDAD22. 






10416 


AF016099 


Mus musculus 


endonuclease/reverse transcriptase 


89 


78 


10417 


S80864 


Homo sapiens 


cytochrome c-like polypeptide 


184 


52 


10418 


Y3068I 


Homo sapiens 


Splice variant ZAP- IB protein of the 


131 | 


75 








human tumor suppressor gene ZAP-1 . 






i a a 1 rv 

10419 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


90 








7068. 






104zU 


002507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


59 








6588. 






1 AA<"* 1 

10421 


AF090942 


Homo sapiens 


PRO0657 


J 54 


57 


10423 


L78669 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


106 


90 


10424 


AF09577O 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


105 


75 








protein 






10425 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


130 


64 






brucei 


protein rhala. 






10426 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


122 


64 








A A -| W 1WW V A. TV AW 

gene 44 clone HTDAD22. 






10427 


AF068294 


Homo sapiens 


HDCMB45P 


256 


56 


10428 


AF 194537 


Homo sapiens 


NAG13 


182 


54 


10429 


AF090931 


Homo sapiens 


PRO0483 


108 


79 


10430 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


68 








7795. 






10431 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


127 


87 








protein 






10432 


X67920 


Homo sapiens 


tryptophanyl-tRNA synthetase 


138 


84 


10433 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 


294 


78 








SEQ ID NO:220. 






10434 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


127 


87 








protein 






10435 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


75 








6571. 






10436 


S79410 


Mus musculus 


nuclear localization signal binding 


117 


72 








protein 






10437 


G00577 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


120 


67 








4658. 






10438 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


140 


69 








nil A 1 o / A \7t\7QfZQ1\ 

KrlA 1 a (M 0 / oOo J) 






10439 


AF1 30089 


Homo sapiens 


PRO2550 


106 


80 


1 AA" JlA 

10440 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 


331 


83 








SEQIDNO:220. 






1 A/I A 1 


W94294 


Homo sapiens 


Human phosphatidylinositol transfer 


143 


96 








protein gamma. 






1 AjI jf O 

10442 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


121 


87 








RHAla(AF078683) 






1 A A At 

10443 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


127 


87 








protein 






*1 Avt A A 

10444 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


67 








/D ly. 






10445 


U93567 


Homo sapiens 


p40 


262 


69 


10446 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


112 


92 








protein 






10447 


AF09577O 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


120 


83 



412 



WO 01/64835 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Sraith- 

Waterroan 

Score 


% 

Identit 

y 








protein 






10448 


G02597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
oo78. 


104 


68 


10449 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


115 


80 




ArUJO / /[} 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


117 


80 






Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


104 


80 




At i jUUoy 


Homo sapiens 


rKO2550 


315 


68 


10453 


G02530 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6611. 


110 


71 


10454 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


86 


34 


10455 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


113 


85 


10456 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7637. 


180 


83 


1 f\A C7 


Ar 095770 


Homo sapiens 


PTH-responsive osteosarcoma D 1 
protein 


119 


96 




i oo24o 


Homo sapiens 


Human secreted protem HCHPF68, 

oca rr\ \ia. i 

bEQ ID NO: 163. 


133 


58 


10459 


Y41710 


Homo sapiens 


Human PR06I8 protein sequence. 


894 


71 


1 (\AAf\ 
lU40v 


Ovlz4o 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
5327. 


84 


71 


1 CiAAl 


^AO>! A1 


Homo sapiens 


chorionic somatomammotropin 


471 


88 


1 t\A£.~l 
1U40Z 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
6194. 


241 


73 




U034U 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


93 


68 


1 U Z K>4 


I ri *£.A7 

U I jo4/ 


Mus musculus 


reverse transcriptase 


110 


47 


10465 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


97 


57 


10466 


G02754 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/AO c 

6835. 


145 


93 


10467 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


137 


57 


10468 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


62 


55 






Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


93 


86 






Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


119 


63 


104.71 


/vruoozy4 


Homo sapiens 


HJJL,Mr54jr 


117 


43 




ioOj fj 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ EDNO:490. 


335 


72 






Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


121 


62 






rioroo sapiens 


ribosomal protein L30 


126 


58 


10475 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


127 


87 


10476 


G02640 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6721. 


106 


61 


JU477 


AC0U0O93 


Caenorhabditis 
elegans 


Hypothetical protem W02H5.e 


119 


41 


10478 


G02485 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6566. 


87 


66 


10479 


AF130079 


Homo sapiens 


PR02852 


97 


54 


10480 


AK024455 


Homo sapiens 


FLJ00047 protein 


135 


69 
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WO 01/64835 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


1 A A O 1 

10481 


Jv0z401 


Homo sapiens 


chorionic somatomammotropin 


463 


80 


10482 


G00398 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4479. 


69 


83 


10483 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


132 


61 


10484 


nil rr/ 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


1 19 


80 


1 A A OC 

10485 


A T?AAC*7*7A 

Ar 095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


90 


54 


10486 


U79260 


Homo sapiens 


unknown 


105 


90 


10487 


Y87156 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 195. 


243 


53 


10488 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


90 


78 


10489 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


120 


64 


10490 


YOI400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


123 


60 


10491 


Y48576 


Homo sapiens 


Human breast tumour-associated 
protein 37. 


471 


84 


10492 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


176 


77 


10493 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


135 


64 


10494 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


123 


44 


10495 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


86 


46 


10496 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


145 


40 


10497 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


111 


50 


10498 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XER8c} 


96 


73 


10499 


AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


142 


96 


10500 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


120 


49 


10501 


AF220264 


Homo sapiens 


MOST-1 


93 


83 


10502 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


116 


57 


10503 


G03371 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
7452. 


100 


70 


1 f\Cf\A 

10504 


AF090931 


Homo sapiens 


PRO0483 


88 


80 


10505 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


89 


72 


IUjUo 


U03U43 


Homo sapiens 


Human secreted protein, obvj iu NU. 
7124. 


1 O 1 


81 


10507 


G00392 


Homo sapiens 


Human secreted protein, SHQ JUL) NO: 
4473. 


1 oo 
156 


82 


1 AC AO 

10508 


AF 130050 


Homo sapiens 


TVTi AAOT1 

PRO0872 


101 


74 


1 ACAA 

10509 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


176 


68 


1 AC t A 

10510 


A rAAfVTA 

AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


120 


96 






riomo sapiens 


unnamea proxein proauci 


07 


AH 
05 


10512 


X95276 


Plasmodium 
falciparum 


ORF91 


75 


29 


10513 


X17093 


Homo sapiens 


leukocyte antigen F 


397 


91 


10514 


W48351 


Homo sapiens 


Human breast cancer related protein 


115 


66 



414 



WO 01/64835 
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SEQID 
NO. 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 














ins is 




Urtmri caniAnc 
xlUUlU bapiCJlo 


T-liimnn ^ftrn^tfvl nrfttptti ^IPO YD TvJO- 
riujjiau j»/ivicu jsiuic.ui, ocy tu ii\y. 

4478. 


1^4 


Oo 




003 R07 


nuuiu cxipiciib 


Unman CPTTPtpH nrAtPITI RPO ID WO* 

7888. 


Rfi 


47 


10S17 


AF0Q0Q44 


XJ/^tYi r\ con if* nc 
riiJllXU t>ajJiCW> 




no 


4£ 
*rO 


10518 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


131 


77 


10519 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 


230 


58 


10520 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
pruicui 


127 


87 


10521 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

/ ooo. 


130 


40 


10522 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
proicin 


127 


87 






iiomo sapiens 


riuman secreieu proiem, oii\i hj in w. 
4413. 


toy 








Homo sapiens 


PTH-responsive osteosarcoma Dl 
proieui 




I Art 
IUU 


10525 


J04495 


Macaca mulatto 


alpha-globin 


144 


90 






nomo sapiens 


Unman co^ratoH nrntain C ITP\ fvT/A. 

riuman secrecea protein, ocy iu emvj. 
6616. 




Al 1 


10S97 




Unmo contort c 

nomo Sapiens 




137 


/J7 

o/ 


10528 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 

1L/ INU. llD 1. 


90 


53 


10529 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 


112 


75 


10530 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


79 


10531 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

770** 


151 


59 


10532 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


63 


10533 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7407 
/*+yZ. 


112 


43 


10534 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


65 


10535 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/Mil 
*rrJJ ■ 


95 


60 


10536 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


157 


79 


10537 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

447^ 


199 


90 


10538 






Unman QprreH-pH nrntein <sKO TD NO* 

6566. 


in 


/J 


10539 


G00437 


IXXJIAIXJ lyaLJlXZlLo 


Hi i man ^Arrpfr»H nmtpin ^RO TD NO* 

XlUliJall ljCUICICU pi (J l Dili, OJ-A£ liv. 

4518. 






10540 


AF 1300 89 


Homo QanifMiQ 


PRO? S SO 




O / 


10541 


Y60522 


Homo sapiens 


Human normal bladder tissue EST 

pnrr>f^p/^ rawt^Tn 1 04 
CULUUCU pi LrlClil 1 3r*r. 


73 


72 


10542 


GO 1175 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
**;?s£ 


83 


100 


10543 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


81 


54 


10544 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


147 


75 


10545 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


109 


66 



415 



WO 01/64835 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Jdentit 

y 


10546 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


123 


46 


10547 


AF130089 


Homo sapiens 


PRO2550 I 


133 


63 


10548 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEQIDNO.220. 


246 


80 


10549 


G00382 


Homo sapiens 


Human secreted protein,' SEQ ID NO: 
4463. 


122 


51 


10550 


J02459 


bacteriophage 
lambda 


J (taikhost specificity; 1132) 


772 


97 


10551 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


158 


80 


10552 


G00328 


Homo sapiens • 


Human secreted protein, SEQ ID NO: 
4409. 


123 


50 


10553 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


184 


71 


10554 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


107 


60 


10555 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


140 


69 


10556 


U49973 


Homo sapiens 


ORF2: function unknown 


91 


70 


10557 


AF151850 


Homo sapiens 


CGI-92 protein 


459 


63 


10558 


X66285 


Mus musculus 


HC10RF 


92 


30 


10559 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


127 


67 


10560 


AF1 30050 


Homo sapiens 


PRO0872 


100 


71 


10561 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO250. 


335 


80 


10562 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


126 


58 


10563 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


127 


76 


10564 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


158 


62 


10565 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


118 


77 


10566 


V00488 


Homo sapiens 


alpha globin 


464 


89 


10567 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


162 


81 


10568 


S58722 


Homo sapiens . 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


113 


72 


10569 


G03040 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7121. 


127 


47 


10570 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


125 


65 


10571 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


79 


88 


10572 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


158 


80 


10573 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


138 


56 


10574 


AF1 19900 


Homo sapiens 


PR02822 


154 


55 


10575 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


87 


72 


10576 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


131 


66 


10577 


G03112 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7193. 


80 


47 


10578 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


121 


53 


10579 


Y91577 


Homo sapiens 


Human secreted protein sequence 


335 


83 



416 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Idenrit 

y 








encoaeo oy gene z ocv^ iu nu.zju. 






10580 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


47 


10581 


U49973 


nuiiiu bap I cllo 


vJivr i , jvLDtsj / , puiauve iransposase 

cunilar tn rvwc» ffpmpnt 
ollilliai IV/ pv/gV/ vlvillvlll 


A 

324 


68 


10582 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4£70 


129 


66 


10583 


AF090895 


Homo sapiens 


PRO0117 


137 


63 


10584 




nomo sapiens 


reverse transcriptase 


AO 


50 


10585 


G00418 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

44QQ 


101 


68 


10586 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


82 


10587 




nomo sapiens 


riuman secreieo protein, oii^ ID v\\j\ 
7519. 


100 


76 


10588 


AF0QS770 


xxomo Sapiens 


Jr i ii-responsive osteosarcoma u i 
protein 


1 AA 

100 


70 


10589 


X0114^ 

AUJ l*tj 


xxomo sapiens 


«At flRP T 
poL uivr i 


1 OA 

120 


43 


10590 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/ 005. 


139 


70 


10591 


AF144054 


Homo sapiens 


apoptosis related protein APR-4 


92 


50 j 


10592 

1 UJ7/> 




xiomo Sapiens 


rLJUUu^/ proiein 


in 


75 


10593 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
compicie virion genome. 


343 


58 


10594 


G0^616 

VIvJUjU 


nomo sapiens 


riuman secreieo protein, oc\i iu fSU: 
7717. 


1 16 


54 


10595 


W8R627 

VY OOUX / 


noinu sapiens 


occretea protem encoaea oy gene y4 
clone HPMBQ32. 


196 


57 


10596 


AF 1300X9 


nUillU SapicHS 




1 A A 

140 


66 


10597 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AAAQ 


115 


52 


10598 


G00481 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4 SAO 


79 


71 


10599 


Y86948 


nomo sapiens 


numan secretea protein riCHrroo, 
SEQIDNO:163. 


146 


79 


10600 


uvj toy 


nomo sapiens 


numan secretea protein, bbQ ID NO: 

75170. 


1 1 A 

119 


67 


10601 


AF090895 


Homo sapiens 


PRO0117 


106 


68 


10602 


"^58729 


nomo sapiens 


X- linked retinopathy protein {C- 

terminal rinn^ YPT4 ftr\ 

Lcrminai, cioue Acn.oc/ 


83 


72 


10603 


AF090930 


Homo sapiens 


PRO0478 


119 


70 


10604 


AF095770 


nuiiio sapiens 


ir i n-responsive osteosarcoma ui 
protein 


120 


83 


10605 


AF 132949 


Wattia csiriipnc 
mxjiuxj sauiciio 


v-aji-u proxem 


11/! 


96 


10606 


Y95028 


Homo sapiens 


Human clone vp7_l 0RF2, SEQ ID 


176 


77 


10607 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4400 


155 


74 


10608 


G02480 


I-Tatti r\ QiiniATtc 
xnviuu sapiens 


numan secretea protein, oJCrv^ ili inlj. 
6561. 


12/ 


63 j 


10609 


G04063 


TTaitia cnnipnQ 
Jivmu sapiens 


n uni au secreieo protein, oev^ iu invj. 
8144 


QQ 
OO 


Z!A 

60 


10610 


AF130089 


Homo sapiens 


PRO2550 


86 


90 


10611 




riuuio sapiens 


Unman /> a ^ v^+a *«vs\4-Aim CTJA 1 1 V \T/*V # 

numan secretea protein, oJc.i^ wj INU: 
4409. 


1 /in 

149 


72 


10612 


L27428 


Homo sapiens 


reverse transcriptase 


311 


52 


10613 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


83 


84 


10614 


D00570 


Mus rausculus 


open reading frame (196 AA) 


116 


69 



417 



WO 01/64835 



PCT/US01/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 




I OOj / D 


noino sapiens 


- 

Human gene 91 -encoded protein 


AAA 

290 


62 








(VoitmAnt CCA TT\ XT/"V/1AA 

rragment, oc.\i id InU.490. 






1U010 


I z /yu/ 


— : 

Homo sapiens 


Human secreted protein encoded by 


1 A C 

125 


51 








MAMA. XT<\ 1 1 A 

gene Wo. 119. 






iOOi A 


AJT 1 1 005O 


nouio sapiens 


rKUzzz 1 


lit 
131 


56 


10618 


M64793 


Rattus 


salivary proline-rich protein 


101 


48 






norvegicus 








1 f\C 1 A 

10619 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


65 








T7AC 

/ /95. 






lUOZU 


/^Aft/ITT 


— tz : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


58 








AC 1 O 

4518. 






IOOZ1 


I 50305 


— — . 

Homo sapiens 


Human secreted protein HLYES38, 


336 


85 








SEQ ID NO:220. 








AT 1 15052 


Homo sapiens 


PRO 1902 


80 


73 


10623 


Y86573 


TT • 

Homo sapiens 


Human gene 91 -encoded protein 


136 


50 








fragment, SEQ ID NO:490. 






10624 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


109 


67 








clone HPMBQ32. 






10625 


tr/\o AAA 

Y02999 


Homo sapiens 


Fragment of human secreted protein 


92 


75 








encoded by gene 121. 






10626 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


121 


62 








7871. 






10627 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


104 


52 









7868. 






1UOZ5 


G037S9 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


66 








7870. 






loozy 


Y 50245 


Homo sapiens 


Human secreted protein HCHPF68, 


129 


67 








bJtsQ ID NO; 163. 






10030 


T TA/3 C£A 

U93569 


Homo sapiens 


putative pi 50 


104 


52 


1UOJ1 


A T7AAAOAC 


Homo sapiens 


rKO0117 


136 


70 


10032 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


89 


57 








gene 22 clone HMbJ wis. 








API i?jcoc 
AT 1 1 D 05 J 


Homo sapiens 


rKU09 /4 


130 


64 


10034 


AK024455 


Homo sapiens 


FLJ00047 protein 


103 


63 


lOOJj 


M24732 


Homo sapiens 


lam in-like protein 


80 


65 


10636 


U49974 


Homo sapiens 


mariner transposase 


116 


72 


10637 


A V/V1y| ACC 

AK024455 


Homo sapiens 


FLJ00047 protein 


158 


67 


1 A^l O 

10638 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 


121 


64 








clone HNHF029. 






1 A/C3Q 

10639 


00378 / 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


57 








7868. 






1 A/C/fn 
1O040 


U03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


72 








/5o5. 






lVO*f I 


L*Z / 4Zo 


xiomo sapiens 


reverse uansenptase 


1 C A 

150 


54 


1UOHZ 


VJvZjJ o 


riomo sapiens 


xiuman secreteo. protein, oty id fsu. 


68 


O 1 

81 














in/^i 

1 110*0 


U93505 


— r- — ; 

Homo sapiens 


putative pi 50 


A OA 

282 


57 


1 C\fxAA 


A "CAOAA/io 
AJP09094Z 


Homo sapiens 


JrKUOoj / 


15j 


74 




145333 


Homo sapiens 


Human prostate cancer-associated 


196 


90 








protein 30. 






luono 


I 563/3 


Homo sapiens 


Human gene 91 -encoded protein 


242 


73 








Iragment, SEQ ID NO:490. 






1U04/ 


U /9260 


Homo sapiens 


unknown 


95 


77 


1U648 


(303807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


64 








7888 

/ ooo. 






10649 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


158 


87 








4409. 






10650 


Y91577 


Homo sapiens 


Human secreted protein sequence 


339 


89 








encoded by gene 2 SEQ ID NO:250. 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identir 

y 


10651 


U15183 


Mvcobaeterium 
leprae 


nroline-rich antigen 

UlUllllw 1 Iwll U|l HfiVdl 






10652 


AF> 17536 


Homo saniens 


truncated mevalonate kinase 


11K 

1 JO 


7ft 
/u 


10653 


AF 119855 


Homo sapiens 


PR01847 


74 


100 


10654 


M15530 


Homo <*anien<; 


R-tp11 prnwth factor 


04 


*7 


10655 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327 


144 


54 


10656 


M15317 


Plasmodium 

1 r»T-iVi 1 1 f» 


histidine-rich protein 


124 


51 


10657 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

UOl 3. 


143 


66 


10658 


W48351 


Homo sapiens 


Human breast cancer related protein 


114 


77 


10659 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


172 


55 


10660 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A MS 


150 


78 


10661 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

a Ann 
/ rt yiy. 


160 


81 




Oftft19R 


nuino Sapiens 


n urn an sec re icq protein, oea^j iu in Li. 
4409. 


J jj 


fx 


lvUUJ 


VJvvjZ.0 


nuiiiu Sapiens 


Human <?or>r-A<-»r1 nrnfain QDA TT\ "M/~V 

riuman secretcu protein, otivj uj inli. 
4409. 


lou 


St 1 
o 1 


10664 


AK000406 


Hr\tDA caniptic 

xxuinu oapiciio 


uiiuaiiicu pruicin prouuci 




7*7 


10665 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
cione njNJtrrL/zy. 


147 


75 


10666 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


160 


81 


10667 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4400 


149 


78 


1066$ 


Y91577 


Homo sapiens 


Human secreted protein sequence 
entuueu uy gene z ociv^ jul/ invazju. 


303 


78 


10669 


D00526 


Rattus 

nnrvf»cririic 

LLSJl VCglLrUo 


L-gulono-gamma-lactone oxidase 


108 


73 


10670 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 


143 


65 


10671 


Y91577 


Homo sapiens 


Human secreted protein sequence 


409 


84 


10672 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

477ft 


110 


65 


10673 


G00423 




4504. 






10674 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


85 


36 


10675 


U93565 


Homo sapiens 


putative pl50 


115 


45 


10676 


AF109907 


Homo saoiens 


S164 


905 




10677 


G02620 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6701 


118 


73 


10678 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870 


106 


62 


10679 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 

TYt"ntf*in 


127 


87 


10680 


Y87156 


Homo sapiens 


Human secreted protein sequence SEQ 

IXJ I7J. 


248 


54 


10681 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoded from gene 28. 


88 


52 


10682 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


126 


77 


10683 


S58722 


Homo sapiens 


X-linked retinopathy protein (C- 


123 


72 



419 



WO 01/64835 



PCT/US01/04927 



I SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 








terminal clone XEH 8c) 






i 1 flA£4 




VTatyia <sar>ien<; 


X-l inked retinopathy protein {C- 
terminal, clone XER8c} 


123 


72 


1 if^a^ 

I IUOoj 




Unmo ^anien*; 

. i iui 1 1 v/ sautviis 


Svnthetic human tvoe III collagen 
SYN-C3. 


99 


40 


I 1ft£RA 




1-f attia <»anien<5 


HDCMB45P 


182 


54 


10687 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
ffene 22 clone HMSJW18. 


104 


53 


10688 


Y86305 


Homo sapiens 


Human secreted protein HLYES38, 
SEO ID NO220 


277 


76 


10689 


G00233 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4314. 


118 


65 


10690 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492 


143 


71 


1 ia£ai 




nomu ottpiGIIo 


Human secreted nrotein SEO ID NO: 
4425. 


97 


81 


I 1Uo9z 


i Z/oj4 


. lomo Sapiens 


Hitman arreted nrotein encoded bv 

XXUlUClll obwlvlwU pi Ulvlil vuvwuvvi 

gene No. 101. 


160 


68 


I 1 Ail AO 


A T71 1AAQO 


xiomo Sapiens 


PRO2550 


133 


56 


10694 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tprrmnal clone XEH 8cV 


129 


71 


10695 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518 


123 


71 


10696 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7714 


161 


45 


10697 


G00648 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

479 Q 


88 


56 


10698 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


75 


71 


10699 


G00517 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

tJ7D. 


92 


38 


10700 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

DOl7. 


132 


55 


10701 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718 


118 


54 


10702 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492 


66 


73 


10703 


S79410 


Mus musculus 


nuclear localization signal binding 


124 


44 




fIAA3 
OUUJ J / 


rionio bapiciib 


Human secreted protein, SEQ ID NO: 
4438. 


113 


78 


1 1 A7A* 


Q7QA 1 A 


Kyfuc mticpuliic 


nuclear localization signal binding 
protein 


119 


50 


I 1 ATAA 
| 1U/UO 


A \£(Y)*>1 1 A" 
/YIvvZJ 1 lO 


Hnmn oanipiiQ 


unnamed protein product 


160 


84 


10707 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


160 


81 


10708 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


160 


81 


10709 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


230 


76 


1 1 0,7 1 n 




Haiti a qhtiipt>c 


Human secreted protein, SEQ ID NO: 
7645. 


96 


45 


I i mi i 
1 11// 1 1 


nmn9 


Wattia <iaAif*nQ 


Human secreted protein* SEQ ID NO: 
7253. 


174 


70 


10712 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


127 


50 


10713 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


148 


45 


10714 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


151 


80 



420 



WO 01/64835 PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith' 

Waterman 

Score 


% 

Identit 
y 


10715 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


157 


67 


10716 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121 . 


109 


68 


10717 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


151 


52 


10713 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


84 


80 


10719 


M19973 


Rattus 

UUl TvglvUu 


cytochrome PB24 


104 


45 


10790 


AT 3Q01 14 


? .pisrimania. 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


103 


72 


10791 


W40JJ I 


Homo oaniftTis 


Human breast cancer related protein 
BCRB2. 


161 


52 


1 0799 


T 974.98 


Hnmfi Q?*nipn<i 


re verse transcriptase 


159 


42 


1079"* 


OJ o / -tx 


Homo <jar>ien^ 


X- linked retinopathy protein {C- 
terminal, clone XEH.8c} 


127 


75 


10794 


YOO^I 1 

I UW 1 1 


Haiti fi <5jmien<i 


Human secreted protein encoded by 
gene 54. 


84 


50 


1079S 


AF090895 


Homo aniens 


PRO0117 


170 


71 


10726 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


105 


42 


10797 


K 03905 


Homo saniens 


salivary proline-rich protein precursor j 


123 


38 


1079ft 


TT93563 

U7JJUJ 


Homo saniens 


putative pi 50 


162 


64 


10729 


X14576 


Murine leukemia 
virus 


gag fusion protein 


172 


41 


10730 


AF090895 


Homo sapiens 


PRO0117 


151 


60 


107^1 

1 V f D 1 


API 10RS1 


Homo ^winien^ 


PR01722 


127 


54 


10732 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


152 


61 


10733 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


156 


76 


10734 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


135 


73 


10735 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


151 


65 


10736 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


141 


75 


10737 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


74 


l VI JO 


AF9/55575 


Homo «umiens 


ubiquitous TPR-motif protein Y 
isoform 


124 


69 


10739 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


91 


60 


10740 


AF130051 


Homo sapiens 


PRO0898 


90 


63 


10741 


AF090895 


Homo sapiens 


PRO01I7 


129 


55 


10742 


AF1 19900 


Homo sapiens 


PR02822 


164 


85 




AJ005560 


Mus musctihis 


SPR2B protein 


88 


35 


10744 


Z79996 


Homo sapiens 


CB33F2.1 (PUTATIVE novel protein 
similar to-C-terminal parts of APOL 
(apolipoprotein L) and TNF-inducible 
protein CG12-1) 


185 


100 


10745 


G03140 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7221. 


94 


88 ' 


10746 


X82208 


Homo sapiens . 


beta-centractin 


134 


96 


10747 


D49744 


Mus musculus 


farnesyltransferase alpha subunit 


228 


60 


10748 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


127 


50 


10749 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


153 


62 



421 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

v 

j 








4523 










Lfnm r» canipnc 
nOlllU oaplCUo 


PRO2550 


147 


62 


10751 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484 


108 


71 


10752 


D 17652 


Homo sapiens 


HBpl5/L22 


416 


72 


10753 




riomo sapiens 




486 


86 


t at c A 
10754 




riomo sapiens 


Unman v.ereted nrotein SEO ID NO: 
7174. 


238 


73 


10755 


Q7Q/I 1 (\ 


ivius muse ui us 


mirier lA/*alization siffnal bindins 

rsrAfpin 


103 


57 | 


lU/50 


a t?i mnfco 
Ar l JUUo^ 


noiiiu sapiens 




104 


72 


10757 


X75421 


Hypocrea 

io/«Ai*in<) 

jecorind. 


actin 


197 


66 


10758 


AP298594 


Nicotiana alata 


arabinogalactan protein 


107 


30 


t at^a 

10759 


U9483Z 


Homo sapiens 




95 


40 


10760 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


179 


82 


10761 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


271 


80 


10762 


Z81528 


Caenorhabditis 
elegans 


predicted using Genefinder-contains 
similarity to Pfem domain: PF00939 
(oOQiurn.su iiaic symporicr 
uansmemorane region/, ocurc .^.o, cr 
valuA— l 7 7\T=1 • PF(117^7 f Domain of 

nnlrnnum fiinrtl ArA Score = 556 0. E- 

valne=8 le-164 N=l 


103 


27 


10763 


V00662 


Homo sapiens 


ATPase 6 


593 


83 


107o4 


AOO? / / 


riomo sapiens 


OlUUlIllll 


610 


81 


t nine 

10765 


Ao/ooi 


ivlus mus cuius 


T7 
1 1* 


109 


46 


10766 


Ar 13703U 


Homo sapiens 


trancmpmhranp rwAtPlTl 7 
U CUloillCIUUlolIC pi Ulvlll 


620 


96 


10767 


O0379 / 


riomo sapiens 


Human cprrpted nrotein SEO ID NO: 

7878. 


101 


66 


10768 


*V 1 CM/I 

A15JZ4 


nomo sapiens 




529 


89 


10769 




riomo sapiens 


fiVirmAOpn Kptii chain 
xiuruiugGiJ i/cut wit* iii 


604 


87 


10770 


VOOooZ 


nomo sapiens 


TTRP 7 /TvIADH dehvdroffenase subuniO 


381 


83 


10771 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OU 1 J • 


133 


73 


10772 


Mo411u 


riomo sapiens 


UUUwglUUl I 


346 


94 


10773 


AB014566 


Homo sapiens 


KIAA0666 protein 


145 


78 


10774 


ATI lOO 1 A 

AL138810 


Homo sapiens 


H1170r 10 7 Similar to CGI-29 

QJ X / /Jul vX lOULLUCU Ul V^vll 

protein) 


216 


90 


10775 


A F1110C2 

AJ223y53 


riomo sapiens 


nri ivj 


353 


82 


1 anna. 

10/76 


uyzzoo 


xiuman 
herpesvirus 6 


XXI > J 


113 


39 


107 /7 


ArUUUOlO 


Ai»tf^n oatit/o 


similar to RING-H2 fineer protein 
RHAlafAF078683) 


143 


78 


10778 


X12796 


Bostaurus 


HMG1 protein (AA 1-215) 


435 


96 


1U / /9 


UoO/oZ 


MOIUO bap LCI lo 


76S nmteasome-associated padl 
homolot? 


477 


89 


10780 


M60047 


Homosapiens 


heparin binding protein 


514 


81 


1 A.*7Q 1 

1U781 


VI /I A&fi 

I I445U 


Unrnn conionc 

nomo sapieub 


Fra<rment of human secreted protein 

pnrndeH hv $yene 14 


99 


48 


1U /<$z 


Ar l juu /y 


TJfrtm r\ cnr\ipnc 


PR02852 


175 


48 


1U/83 




nOIllU aapiCIla 


Wnman secreted nrotein. SEO ID NO: 
6574. 


115 


64 


10784 


M11147 


Homo sapiens 


ferritin light chain 


439 


83 


10785 


AF090931 


Homo sapiens 


PRO0483 


145 


55 


10786 


AF072441 


Homo sapiens 


calcineurin binding protein cabin 1 


498 


90 


10787 


AF116715 


Homo sapiens 


PR02829 


127 


88 



422 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. j 


Species 


Description 


Smith- 

tt aid Hidll 

Score 


% 

Identit 

y 


10788 


AF201949 

*kl *»V J S"^S 


Homo sapiens 


60S ribosomal protein L30 isolog 


631 


91 


10789 


AC005253 


Homo sapiens 


R26445 1 


403 


84 


10790 


M15386 


Homo saoiens 


gamm a-globin 


617 


82 


10791 


Y95005 


4*nmo saoiens 


Human secreted protein vc57_l, SEQ 
IDNO:50. 


716 


95 


10792 


GO 1285 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
5366. 


165 


52 




X67863 


Miis musculus 


T2 


128 


50 


10794 


Y82326 


Homo sapiens 


Human arginase I SEQ ED NO:17. 


507 


80 


10795 


Y00755 


Homo saoiens 


extracellular matrix protein BM-40 
(AA I - 303) 


559 


87 


10796 


AF1 19851 


Homo saoiens 


PR01722 


117 


71 


1A797 


AF09G930 


Homo saoiens 


PRO0478 


146 


63 


1079R 


V00488 


Homo saoiens 


alpha globin 


509 


84 




00407? 


Homo saoiens 


Human secreted protein, SEQ ID NO: 
8153. 


131 


42 


10x00 


X83618 1 


Homo sapiens 


hydroxymemylglutaryl-CoA synthase 


422 


78 


10X01 


AF 132952 


Homo saoiens 


CGI-1 8 protein 


341 


88 


10802 


M17375 


Gallus gallus 


type XII collagen 


88 


42 






Homo saoiens 


chorionic somatomammotropin 


436 


85 


lUOVrr 




Hftmn saoiens 


chorionic somatomammotropin 


410 


86 


10805 


U61232 


Homo sapiens 


cofector E 


359 


77 ! 


J Uo I/O 




Homo saoiens 


Ibal (ionized calcium binding adapter 
molecule t) 


406 


77 


10807 

1UOU / 


A Fl 18082 


Homo saoiens 


PRO1902 


105 


38 


10808 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


148 


60 


10809 


AF090942 


Homo sapiens 


PRO0657 


163 


52 






Homo saoiens 


super cysteine rich protein; SCRP 


116 


86 


10R1 1 


noo^7 


Homo saoiens 


Human secreted protein, SEQ ID NO: 
4718. 


104 


45 






Rathis 
norveoi ens 


open reading frame 2 


156 


41 


ioxn 




Homo saoiens 


unknown 


103 


64 


10814 


G01478 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5559. 


75 


77 


10815 


U52077 


Homo sapiens 


mariner transposase 


131 


85 


iiloJO 


ty*ri 19 


Homo saoiens 


ATPase summit 6 


251 


74 


10817 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


103 


80 


1081 Jt 




Homo saoiens 


Human secreted protein, SEQ ID NO: 
6913. 


102 


52 


10819 


G00361 

\JwUJvl 


Homo saoiens 


Human secreted protein, SEQ ID NO: 
4442. 


94 


66 




Y21811 


Homo saoiens 


CPF polypeptide 36PRO. 


200 


62 


10821 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


161 


60 


10822 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


163 


70 


10823 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 
Accession Number P02768; Method: 
conceptual translation supplied by 
author 


337 


63 


10824 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


122 


61 


10825 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


122 


68 


10826 


J03634 


Homo sapiens 


erythroid differentiation protein 
precursor 


222 


68 



423 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 


10827 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


452 


78 


10828 


V00488 


Homo sapiens 


alpha globin 


132 


86 


10829 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


131 


70 


10830 


AF090942 


Homo sapiens 


PRO0657 


122 


43 


MR31 

1 \JOJ 1 


AF090931 


Homo sapiens 


PRO0483 


141 


72 




AP000616 


Orvza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


138 


63 




G03806 


Homo saniens 


luman secreted protein, SEQ ID NO: 
7887. 


88 


57 




AFl 18082 


Homo saniens 


PRO 1902 


109 


62 


10835 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


135 


44 


IvOJU 


fi09S32 


Homo saniens 


Human secreted protein, SEQ ID NO: 
6613. 


152 


86 


10837 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


143 


47 


10R3X 

lUOJO 


U52077 


Homo saniens 


mariner transposase 


259 


67 


10839 


M10546 


Homo saniens 


cytochrome oxidase I 


341 


83 


10840 


Y44361 | 


Homo sapiens 


Human cell cycle regulation protein-2. 


144 


60 




009 S 32 


Homo ^aniens 


Human secreted orotein. SEO ID NO: 
6613. 


96 


56 




I/JO 1 1 -c. 


Homo ^antfMtc; 


cytochrome c oxidase subunit 3 


565 


80 






An/Tji sativa 


similar to RING-H2 fmeer D rote in 1 
RHAla(AF078683) 


118 


100 


10844 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


157 


81 






Homo ^aniens 


NADH dehydrogenase subunit 2 


416 


85 


10846 


AF194537 


Homo sapiens 


NAG13 


177 


70 


105M.7 


I JVrr7J 


H rutin oaniMi^ 

i X\J lll\J ilsllO 


Fragment of human secreted nrotein 
encoded by gene 27. 


108 


40 




Ml ^3ft 
1Y11 JJJU 


nuiiiv japiciio 


R-ceII erowth factor 


92 


67 




V14489 


Homo ^anipns 


Fragment of human secreted protein 
encoded by gene 17. 


98 


52 


10850 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


121 


76 


10851 


AFl 16712 


Homo sapiens 


PR02738 


84 


57 




V UvOUZ 


Unmn canipns 


ATPase 6 


278 


84 


mas3 




"Trvnann<Jom a 

bmcei 


ORFl 


93 


37 


IvOJt 


Y02749 


Homo saniens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


93 


61 


10855 


AFl 30089 


Homo saDiens 


PRO2550 


126 


72 


10856 


AF130079 


Homo sapiens 


PR02852 


154 


75 


10857 




Homo saniens 


unnamed protein product 


113 


70 ! 


10858 


G00397 


Homo saniens 


Human secreted protein, SEQ ID NO: 
4478. 


114 


71 


1085Q 

1VOJ7 


G03703 


Homo saniens 


Human secreted protein, SEQ ID NO: 
7784. 


97 


56 


10860 


M36647 


Homo saniens 


mitochondrial hinge protein precursor 


207 


78 


10861 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


83 


50 


10869 




Homo sanit*n^ 


Human secreted protein, SEQ ID NO: 
6588. 


126 


46 


10?63 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


148 


63 


10864 


B01372 


Homo sapiens 


Neuron-associated protein. 


106 


38 


10865 


U38964 


Homo sapiens 


KPMSR2 


152 


68 


10866 


AFl 19855 


Homo sapiens 


PR01847 


142 


67 



424 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


luoO/ 


vJ\J 1 0Z0 


Uattia conipnc 
rlOIuU oaJJlCIlo 


"Human secreted nrotein SEO TD NO* 
5909. 


99 


75 


10868 


U79260 


Homo sapiens 


unknown 


94 


47 


IAS AO 


PA7A<7 


Unmn caniAnc 

rioirjo ixipicni 


^mallpr henatocellular onconrotein 
(hhcra) gene preoduct 


151 


52 


i ab7a 


A IA1 AA7< 


Ustm n com one 
O.OI no bdpiCllb 


unr-intprafifinp rwrntein 


100 


64 


10871 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7914. 


110 


44 


1 AST79 


OAlTOft 
VJUO /70 


riomo salens 


Unman <;prrpted nrotein SFO TD NO* 
7879. 


136 


71 




VA77/10 

xuz 


iiomo sapiens 


Wiimsm ^fvrptpH nrntftrn enrodpd hv 

gene 100 clone HNFIU96. 


126 


61 




A T?l O AAQO 


riomo sapiens 




115 


88 


1U5/J 


VA1 1 **C 


riomo sapiens 


^A^rptpH nrAtpin pnprtdpfi Hv oene 1 & 


114 


59 


10876 


G00500 


Homo sapiens 


Human secreted protein, SEQ ED NO: 

4^R1 


83 


48 


10877 


AF1 18086 


Homo sapiens 


PR01992 


145 


82 


1 A010 
10878 


AF1 16712 


Homo sapiens 


DDA073C 


1 in 


jj 


10879 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
795^ 


123 


77 


10880 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AA19 
*r*f /O. 


111 


61 


10881 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 

rr\ "MO- 1 n-j 


106 


51 


1U55Z 


^TAIQAA 


riomo sapiens 


lT lirn « n cf*r~rf*tt*A nrrrfpin ^FO f D NO* 

riuniaii sccreicu pruicui, ocy vvj v\\j. 
7881. 


107 


67 


lUooi 


hiAA/i 1 a 
VjUU41o 


riomo sapiens 


Unman c^n-Atpd nrntpin ^FO TD "NO* 

riumau sccreicQ pruicm, ocy vxj v\\j* 
4497. 


101 


65 


1 AQQA 

lUoo4 




riomo sapiens 


UILRJIUVVII 


1 1 1 

111 


56 


10885 


W88627 


Homo s^iens 


Secreted protein encoded by. gene 94 


153 


60 


10886 


GO3801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

fooZ. 


107 


57 


10887 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

71 9A 


148 


80 


10888 


Q00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


58 


10889 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

*H)JO. 


156 


67 


10890 


G00328 


Homo sapiens 


Human secreted protein, SEQ JDD NO: 
4409. 


105 


52 


1 AQA1 




Homo sapiens 


T-Tirman c&*rMf*A nrntpin QPO ID MO* 

rium an sccrcicu pruicin, oc«v^ u-' rtvy. 

*rrJJ« 


1X7 
101 


65 




AO/ jiU 


Ayfuo miiOfniiic 
1V1U5 UllibUUlUo 


Mil 1 *: MIJSCULUS GENOMIC DNA 
CONTAINING N ALLELE OF FV1 
GENE. 


120 


37 


10893 


AJ223410 


Homo saniens 


EBIl-ligand chemokine 


174 


77 


10X94 

wo/ • 


D381 1? 


Horn a aniens 


cytochrome c oxidase sub unit 1 


475 


83 


10895 


V00662 


Homo sapiens 


ATPase 6 


316 


95 


10XQA 




WrwnA cattipnQ 

fXlMllV* M^/lvlU 


ATPase 6 

Ail QOw V/ 


427 


82 


10897 


U09500 


Homo sapiens 


cytochrome b 


403 


92 






norno sapiens 


l/TU 


215 


91 


10899 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


69 


77 


10900 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


159 


60 


10901 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


93 


43 



425 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


10902 


M10546 


Homo sapiens 


cytochrome oxidase I 


302 


66 


10901 


G00447 


Homo saoiens 


Human secreted protein, SEQ ID NO: 
4528. 


102 


58 


10904 


AK024455 


Homo sapiens 


FLJ00047 protein 


118 


74 


10905 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


108 


64 


10906 


AF090942 


Homo sapiens 


PRO0657 


149 


74 


10907 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


167 


91 


10908 


D381 12 


Homo sanien^ 


ATPase subunit 6 


355 


68 


10909 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


145 


72 


10910 


003714 

\J\JJ fit 


Homo <mniens 


Human secreted Drotein SEO ID NO: 
7795. 


114 


72 


10911 


G00637 


Homo ^anien^ 


Human secreted Drotein. SEO ID NO: 
4718. 


144 


53 


10912 


V00662 


Homo aniens 


ATPase 6 


289 


69 


10913 


X62677 


Oryctolagus 
cuniculus 


retrovirus related reverse transcriptase 


93 


51 


10914 


V00662 


Homo sapiens 


cytochrome oxidase III 


509 


86 


10915 


J03071 


Homo sanifcTK 


chorionic somatomammotrooin CS-2 


459 


80 


10916 


Z28029 


Saccharomyces 

rprpvteisie 


ORFYKL030W 


96 


32 


10917 


G03102 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

71X1 


94 


56 


10918 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 

nn con rote in 

UllvVt'l V/ tu ill. 


114 


36 


10919 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO* 163 


117 


88 


10920 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal clone XEH 8c 1 

l&l ill 111 OX, WIS 11 ^ AwiLMV^ 


103 


39 


10921 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO* 163 


99 


95 


10922 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


75 


63 


10923 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF07S6&3) 


125 


85 


10924 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla CAF078683) 


140 


73 


10925 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


77 


38 


10926 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


111 


58 


10927 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


139 


75 


10928 


AP000616 


Orvza saliva 


similar to RING-H2 finger protein 
RHAla (AF078683) 


128 


78 


10929 


G02501 


Homo sac i ens 


Human secreted protein, SEQ H) NO: 
6582. 


77 


57 


10930 


K02576 


Homo saoiens 


salivary pro line-rich protein 1 


93 


53 


10931 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


83 


54 


10932 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


262 


84 


10933 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


113 


72 


10934 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


120 


62 


10935 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


116 


69 



426 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO; 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 


10936 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


120 


56 


10937 


AB028893 


Homo sapiens 


ribosomal protein SI 1 


279 


53 


10939 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


127 


55 


10940 


AF119900 


Homo sapiens 


PR02822 


154 


53 


10941 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7637. 


127 


62 


10942 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


109 


41 


10943 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


164 


61 


10944 


AF1 16715 


Homo sapiens 


PR02829 


119 


69 


10945 


M10546 


Homo sapiens 


cytochrome oxidase I 


303 


95 


lV7*rVI 


f 27428 


Homo ^aniens 


reverse transcriptase 


112 


51 


10947 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


118 


90 


I U740 


AF095770 


Homo saoiens 


FTH-responsive osteosarcoma Dl 
protein 


121 


83 


10949 


AF116715 


Homo sapiens 


PR02829 


134 


67 


10950 


S58722 


Homo sapiens 


X-linked retinopathy protein {G- 
terminal, clone XEH.8c} 


139 


83 




AF 130079 


Homo sapiens 


PR02852 


119 


79 


10953 


Z38128 


Mus musculus 


histone HI 


126 


35 


ino^4 




Homo saoiens 


Human secreted protein, SEQ ID NO: 
4718. 


135 


53 


10955 


L22029 


Glycine max 


hydroxyproline-rich glycoprotein 


126 


37 




G00500 


Homo saoiens 


Human secreted protein, SEQ ID NO: 
4581. 


81 


69 




AF11R0R2 


Homo ^aniens 


PRO1902 


90 


51 




G02485 


Homo saoiens 


Human secreted protein, SEQ ID NO: 
6566. 


116 


53 


10959 

1 U7J7 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


82 


51 




1 179260 


Homo saoiens 


unknown 


97 


52 


10961 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


194 


66 


10962 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


124 


48 


10963 


G030S4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7165. 


66 


73 


1 U7wt 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


87 


10965 


Z28029 


Saccharomyces 
cerevisiae 


ORF YKL030w 


99 


70 


10966 


V00672 


Pan troglodytes 


reading frame protein 5 


107 


70 


10967 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


138 


58 


10968 


K02401 


Homo sapiens 


chorionic somatomammotropin 


478 


77 


10969 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


642 


86 


10970 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


156 


57 


10971 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


126 


56 


10972 

X v7 / -C 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


103 


51 


10973 


K02401 


Homo sapiens 


chorionic somatomammotropin 


491 


82 


10974 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


146 


93 


10975 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


236 


85 


10976 


Y02785 


Homo sapiens 


Human secreted protein encoded by 


108 


50 



427 



WO 01/64835 
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SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 
y 








gene 51 clone HUKEX85. 






10977 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-2 


435 


73 


10978 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


119 


74 


10979 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


137 


73 


10980 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


133 


63 


10981 


Y14482 * 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


90 


65 


10982 


X92485 


Plasmodium 
vivax 


pval 


119 


47 


10983 


Y15913 


Homo sapiens 


COL1 Al and PDGFB fusion transcript 


64 


47 


10984 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


463 


77 


10985 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


309 


65 


10986 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


145 


87 


10987 


AF130089 


Homo sapiens 


PRO2550 


88 


31 


10988 , 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


127 


65 


10989 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


137 


75 


10990 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


145 


76 


10991 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


118 


72 


10992 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 


69 


10993 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


120 


92 


10994 


AJ277092 


Oryctolagus 
cuniculus 


translationally controlled tumor protein 
3 


172 


49 


10995 


AF052831 


Trypanosoma 
crazi 


unknown 


92 


37 


10996 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


119 


53 


10997 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


123 


43 


10998 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


165 


52 


10999 


S52010 


Mussp. 


orfl 5' of EpoR 


93 


39 


11000 


AF130079 


Homo sapiens 


PR02852 


131 


81 


11001 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


111 


75 


11002 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


132 


83 


11003 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


151 


60 


11004 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


159 


58 


11005 


K02401 


Homo sapiens 


chorionic somatomammotropin 


579 


89 


11006 


K02401 


Homo sapiens 


chorionic somatomammotropin 


464 


87 


11007 


M15894 


Homo sapiens 


chorionic somatomammotropin 
precursor 


370 


74 


11008 


K02401 


Homo sapiens 


chorionic somatomammotropin 


467 


88 


11009 


K02401 


Homo sapiens 


chorionic somatomammotropin 


537 


91 


11010 


K02401 


Homo sapiens 


chorionic somatomammotropin 


475 


87 


11011 


K02401 


Homo sapiens 


chorionic somatomammotropin 


460 


88 



428 



WO 01/64835 



PCT/US01/04927 



SEQU) 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Idenlit 

y 


11012 [ 


G03639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7720. 


101 


61 


1 Ivl J 




fiillichthv^ 

VJllllWIUljrO 

mirabiiis 


ribosomal protein S13 


223 


54 


1 1014 


G02872 


Homo saniens 


Human secreted protein, SEQ ID NO: 
6953. 


176 


61 


11015 


AF1 16715 


Homo sapiens 


PR02829 


121 


69 




OJO / ifL 


Homo ^aniens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


92 


75 


1 1017 


J\7UOvU 


Homo saniens 


Human histiocyte-secreted factor HSR 


113 


78 


11018 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


114 


40 


1 1A10 




Homo oanipn^ 


X- linked retinooathv orotein (C- 
terminal, clone XEH.8c} 


101 


48 


1 iUZU 




Homo oanien<i 


Human secreted protein, SEQ ID NO: 
7214. 


151 


51 


11021 


X14963 


Homo sapiens 


collagen-like protein (447 AA) 


107 


47 


1 1 (VYJ 




Homo Qftnipri^ 


Human secreted orotein SEO ID NO: 
7795. 


151 


72 




at i 10/ ij 


JULUlllvJ oOlJlGUo 


PR02829 


140 


75 




VX/574.R 
I OQZ*f O 


Homo cpjni/^nQ 


Human secreted orotein HCHPF68, 
SEQIDNO:163. 


125 


70 


1107^ 


AP000001 


Pvrococcus 
horikoshii 


58aa long hypotbetical protein 


99 


52 


1 107A 


TY*81 17 


Homo saniens 


cytochrome c oxidase subunit 3 


601 


81 


11027 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


434 


74 


1 107R 


7*Q7m 


Homo ^anien<? 


RHCE orotein 


149 


51 


1 107Q 


\jUVrTJ*T 


Homo <?anien<i 

iAUlllvi iJOlJlv/lXO 


Human secreted orotein. SEO ID NO: 
4535. 


113 


48 


11030 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


120 


77 


1 Mil 




Homo saniens 


cofactor C 


226 


63 


1 10V? 


M 10546 


Homo saniens 


cytochrome oxidase I 


116 


88 


11033 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


584 


93 


i mid 


T 10910 


Homo saniens 


splicing factor 


108 


58 


1 10^ 




Mil*; musculus 


LENE/Ig H- chain fusion protein 


140 


48 


11036 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminai, clone XEH.8c} 


101 


79 


1 1017 


AK024455 


Homo saniens 


FLJ00047 protein 


122 


48 


1 101R 




Homo saniens 


HHLA3 protein 


96 


68 


11039 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQlDNO:163. 


144 


79 


11040 


AK024455 


Homo sapiens 


FLI00047 protein 


151 


62 


1 1/U1 


JU\AJ*T70 


Homo saniens 


ribosomal protein S20 


321 


73 




AP000616 

ill V/ WW J. V 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


123 


84 


11041 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


125 


68 


11044 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


208 


91 


11045 


G00552 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4633. 


79 


62 




D38112 


Homo saniens 


ATPase subunit 6 


297 


90 


1 104.7 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


416 


73 


11048 


G00491 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4572. 


73 


80 


11049 


AF090944 


Homo sapiens 


PRO0663 


96 


70 


11050 


AF130089 


Homo sapiens 


PRO2550 


152 


71 


11051 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


108 


81 



429 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 
y 








eene 22 clone HMSJW18. 






11052 


W89031 


Homo sapiens 


Polypeptide fragment encoded by gene 
167. 


254 


95 


11053 


AF109907 


Homo sapiens 


S164 


136 


45 


11054 


AF217536 


Homo sapiens 


truncated mevalonate kinase 


117 


58 


11055 


X72004 


Halichoerus 


ATP synthase subunit 6 


160 


78 


11056 


V00662 


Homo sapiens 


ATPase 6 


300 | 


71 


11057 


AF228021 


Ro<5 taums 


cvcloohilin I 

VTvlvUlliull A 


209 


73 


1 1058 


L27428 


Homo ^aniens 


reverse tran scri ntase 


303 


65 


11059 


U09116 


Homo sapiens 


ORF I , encodes a 40 kDa product 


464 


80 


1 iwov 


W OOvl / 


Hnmn canrpn 
ilUlliu oapit-iij 


Sprrpfprl nrotein encoded hv $>ene 94 
clone HPMB032 


145 


68 


x 1VO I 


n^Ri i? 


I Tr»m (~\ cnnipn<! 
JiAJUiv oaL/iwia 


rvtochrnmp ft mcirfji^ snhunit "3 


301 


85 


11062 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
aenftTsIn 101 


119 


64 


11063 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


128 


76 




11501 RR 


vsaVla puiV/CllUa 


PfiP P«rA/»fkntnr fv>mnnnpnt nrntpin 

V^VJAVI 1 vViCULUl IrVJLJILJV/llwllL |J1 ULOlll 


152 


88 


11065 


G03107 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

71 RR 
/ loo. 


104 


44 


11066 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

t*t / o. 


112 


64 


11067 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAIa f AF07R68TI 


113 


71 


11068 
1 luuo 




i^rubopiiiid. 

iUCl<lilUgcL>lCl 


CCl 19706 opnp nrnrlurt 


145 


43 


11069 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SFO ID NO- 163 


160 


65 


11070 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


416 


67 


1 1071 

1 1 V) 1 1 


UjOl \z. 


Univin pom one 

n. on it) Sapiens 


rvtnrhrnmA r* nviHncp ciiHunit ^ 


220 


86 


11072 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

661 3 

DDI 


144 


63 


1 107* 


ooidi i 


jiomo sapiens 


Hnman <w*rrptpr1 nrntein ^FO ID MO' 

7492. 


115 


60 


11074 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


117 


65 


11075 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


589 


87 


1 1076 

x IV f\J 


LfJO 


UTnmn ccinipnc 

JIUIUU dopicus 


rvtnr*hrome r nyirla^f* <nih)init 1 


588 


83 


1 1077 
1 1 v / / 


<s^R799 


xiuiiiu sapiens 


~Y-1inlrpH rpfinnnatfiv nrntpin 
terminal clone XEH 8c) 


106 


86 


11078 

1 X \f 1 %9 


Y86248 


Hnmn <iJinipti<5 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


90 


50 


11079 


AF 132200 


Homo <yanien<; 


PR01751 


97 


89 


11080 


AF1 16661 


Homo saniens 


PR01438 


109 


51 


11081 


AK024455 


Homo saniens 


FLJ00047 protein 


93 


48 


11082 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


84 


73 


11083 


J02963 


Homo saDiens 


platelet glycoprotein lib precursor 


108 


80 


11084 


AF068294 


Homo saniens 


HDCMB45P 


229 


59 


11085 


M10546 


Homo sapiens 


cytochrome oxidase 1 


137 


73 


11086 




Hnmn Qanienc 


Human secreted nrotein HCHPF68. 
SEQIDNO:163. 


105 


49 


1 10X7 




Hnmn Mnipnc 
n.umu dO^l^iK) 


rhrnmo^fttnal nrotein 


122 


75 


1 1088 

* 1 WOO 


VR694R 

I OVJirrO 




Human secreted nrotein HCHPF68 
SEQ ID NO: 163. 


103 


46 


11089 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH. 8c} 


112 


56 


11090 


AF130089 


Homo sapiens 


PRO2550 


172 


60 


11091 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


107 


35 
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PCTAJS01/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Identit 








Score 


y 


1 1092 


AF090930 


Homo ^aniens 


PRO0478 • 


146 


75 


11093 


R96800 


Homo sanieits 


Human histiocyte-secreted factor HSF. 


163 


82 


1 1 094 i 

1 1W7T 


V00669 


Homo sapiens 


ATPase 6 


130 


87 


11095 

1 ivy J 


B0137? 


Hnmn saniens 


Neuron-associated protein. 


122 


82 


1 1096 

1 1V7U 


D38112 


Homo ^aniens 


ATPase subunit 6 


152 


62 


1 1097 
l ivy / 


vl 10JJ7 


V ul iUia Yil uc» 


D4L 


96 


60 


i ioqr 


ATUUUO 1U 


KjlyzJa. Saliva 


similar to RTNG-H2 finser nrotein 


113 


60 






RHA1 a rAF078683^ 








AT 1jUUo7 


nUiUU oOpiCIIb 




128 


75 


in nn 


M0A079 


nomo sapiens 


Hum an secreted nrotein SEO ID NO* 


218 


62 








R153 






1 i lwi 


T TA0073 


iiorno sapiens 


OR PI • MPR37* rnttntive transno*ia*ie 


232 


79 








similar to noffo element 






1 1 X c\o 

1 1 1UZ 


a poono^o 


xl lyi 11 \J oapiciio 


PRO0478 


154 


73 


I 1 1A1 

I I IUj 


fll\1Q.(Y7 
VJ\I35U/ 


jioLno sapiens 


Human <:ftrretE*f1 nrotein SEO ID NO* 


133 


65 








7888 






1 1 1 (\A 




A if no miics*iilnc 

IV J us iiiusvuius 


iilnh?»-l tvnp 1 ororolljifen 

aipLLa l type 1 pi uvrVinagv/it 


105 


43 


111 Ci< 


/Z 


iriorno sapiens 


Hnmnn Qprr^tpd nrotein SEO TD NO* 

Xx 111 11 all bCl/lvlCil plUiCul) OLiy XX-/ x^tv-/. 


142 


59 








8153 






111 OA 
1 1 1 vo 


I 1^190 


TVAne mitc/Milnc 
iVIUS iliUOwUlUO 


cvnf*Yin 


119 


35 


1 1 1IV7 
1 1 1U/ 


AP9R01 1 1 
.rvrzoui i i 


jjLUiiiu sapiens 


rvtorJn rnme P450 suhfamilv ITIA 


122 


52 








nolvneotide 43 






111 no 
i 1 Ivo 


TflfVllzl 


xiumu sapiens 


Hpt?»-ti iVn 1 1 in 
Lfwta "iiiuuiui 


606 


94 


ii mo 


AvJlTO 


XT.UIUU sapicuo 


olrlfklocp 
oiuvriaov 


412 


89 


1 1 1 in 

1 1 1 1U 




ivius niusiAiius 


nnpn rrarfintr frame ^196 AAi 


127 


85 


11111 
1 1 1 1 1 




UnmA CQnipnQ 
Xxlsxllis oapitllo 


macmnhap'fi lectin 2 


165 


65 


11119 




muiiio sapiens 


QfrrefeH nrotein pncoHed Hv pene 1 8 


125 


74 








clone HNHF029 






11113 
111 I j 


Ml S530 

iYll JJD\J 


T-Trrmo cnnipn^ 

XlVlllU OOpiVlld 


B-cell PTowth factor 


163 


82 


I I 1 1/1 

III l*f 


V uuooz 


TTrwirt esmipne 
xXVJilHJ &apiciu> 


ATPa?^e 6 


217 


91 


1111^ 
1 I 1 ID 


VR69AR 


numu sapiens 


Human secreted nrotein HCHPF68 

1 lUXXlUlX Jvwl U/U piUlUlil X XV-/X XX X UUj 


165 


100 








ODy lxJ liv.lUJ. 






1111/; 
i mo 


iSAJZj /o 


riomo Sapiens 


oo1iv5)rv nrnJinp-rifh nrotein 1 
Sail vat jr pi villi GTi lui pi vj WtXix i 


119 


36 


1 1 1 1*7 

Jill/ 




riomo sapiens 


tTiirvi*»n cprrptpH nrntpin SPO TD NO' 
Jrl urn ail secxeicu pjuLeiii, oj&Vc n- / i^v*». 


132 


59 








447 8 






1 1 1 lo 


ntit\A 1 £ 
OUU410 


riorno sapiens 


xxUIIldll SCOICLCU. pi U LCI 11, Jljy 1^V_/. 


103 


67 






4407 






1 1 1 id 
11 1 \y 




noiuO sapiens 


Human cprrpfpH nrnfpin SPO ID 1MO* 
XT Ul 11 ail SC'uiCieLl piWLClll } ODy ixj nv/. 


94 


42 








4460 






111 90 




xiomo Sapiens 


Human cprrptpH nrnfpin SEO NO' 
xxuiilOll scuieieu pi v/cvtii, oijy u-' i^v. 


127 


75 








6613 






1 1 191 

1 1 lx.1 


fY03RA,7 


T-Tr»rn r\ canipnc 
Xxvjiuv sapiciio 


Human secreted nrotein SEO ID NO* 


104 


64 








7888. 






1 1 199 

1 1 \Z*L. 


00341 i 


HV/illU dapiviid 


Human secreted nrotein. SEO ID NO: 


90 


59 








7492. 








AF 161 356 


Homo aniens 


HSPC093 


119 


64 


1 11 94 
1 1 ix*» 




Homo Qanifn^ 


platelet glycoprotein lib precursor 


102 


70 


11 195 

1 1 xL> J 


G00637 


Homo <!fioiens 


Human secreted protein, SEQ ID NO: 


149 


60 








4718. 






111 o/; 

1 1 \L\> 


AF06R994 


Homo Qxiriipn'i 
xjiwixiv dapiwiio 


HDCMB45P 


272 


59 


1 1 197 


Y97854 


Homo c ianien < 5 


Human secreted protein encoded by 


120 


79 








ppnp No 101 






1 1 19R 
1 1 1x5 


A FOQ5770 


Hnmn campnc 
nuuiu oapiciid 


PTH-re<mon^ive osteosarcoma Dl 


101 


78 








protein 






11129 


AF 130089 


Homo sapiens 


PRO2550 


124 


80 


11130 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


109 


51 






brucei 


protein rhal a. 






11131 


U40265 


Trypanosoma 


ATPase subunit 6 


102 


31 






cruzi 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcotit 

y 


11 132 


X01677 


Homo saniens 


glyceraldehy de-3 -phosphate 
dehydrogenase 


555 


88 


11133 


K02576 


Homo sapiens 


salivary proxine-rich protein 1 


108 


37 


11134 


M81757 


Homo sapiens 


SI 9 ribosomal protein 


232 


76 


11135 


U93565 


Homo sapiens 


putative pi 50 


173 


51 


11136 


AF061944 


Homo sapiens 


kinase deficient protein KDP 


369 


100 


11137 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


145 


61 


11138 


AB030255 


Homo sapiens 


serine/threonine phosphatase 1 gamma 


404 


83 




AT 04061 fl 


Vfnmft QsmipnQ 
xxuixlu oapiciij 


d11055C142 ACIAA0026 
ftranscrintion factor-like nrotein 
MRGXV) 


248 


77 


1 1140 

I I 1 T^V 


Y36421 


Homo saniens 


Fragment of human secreted protein 
encoded by gene 8. 


81 


53 


11141 


M13692 


Homo sapiens 


alpha- 1 acid glycoprotein precursor 


112 


41 


1 1 142 


Ml 6961 


Homo saDiens 


aloha-2- HS-el vcoDrotein 


543 


89 


ill 


D3R1 12 


Homo ^antpni 
xxvstixu aa^iwij 


evtochrome c oxidase sub unit 3 


605 


88 


1 1144. 


AF090944 


Homo <ianien*i 


PRO0663 


137 


67 


11145 


X92485 


Plasmodium 
vivax 


pval 


102 


57 


11 146 

1 x l*tO 




Homo <iar>ien<i 


PRO2550 


113 


58 


1 1 147 


1163 542 


Homo *vmien^ 


FAP nrotein 


154 


57 


11148 


W03988 


Homo sapiens 


SH2 domain from human SH-PTP2 
( amino acids 1 -106V 

y^ci-L 1111 ivy uuiui] x l\s\jj* 


162 


73 


1 1 140 


API 16695 


Homo *iani&n<i 


PR02221 


173 


44 


11150 


Ml 5530 


Homo sapiens 


B-cell growth factor 


105 


45 


111^1 

X 1101 


004079 


X1A7LU17 oapiviio 


Human cecreted nrotein SEO ID NO* 

xxtxiiiaxi dwUvivu uiuivJU) Juy xxx ii\/t 

8153. 


160 


64 


11152 


Ml 5530 


Homo sapiens 


B-cell growth factor 


124 


75 


ill <o 
1 1 1 j j 




nomu sapiena 


Human arreted nrotein SEO ID NO' 
7437. 


117 


67 


111 ca 
1 1 X J*1 


ATI 1 RORO 




PR0 1902 

r xvw i7vxr 


79 


52 


11155 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414 


123 


72 


11156 


Y27893 


Homo sapiens 


Human secreted protein encoded by 
p^nf* No 116 


266 


96 


11157 


Y27868 


Homo sapiens 


Human secreted protein encoded by 

pene No 1 07 


148 


84 


ill 

1 1 1 JO 


Y8674R 

I OUx.'rO 


XXVJ111U odJ^/lVlw 


Human secreted nrotein HCHPF68 
SEQIDNO:163. 


154 


72 


1 1 1 


O00365 


Uftmn <; aniens 

XlV7li.lv/ DClLJlVllO 


Human secreted nrotein. SEO ID NO* 
4446. 


105 


58 


11 160 
J x luv 


X60376 


ftra^«iica nanus 


nroline-rich nrotein 

1/1 V1JJ1V 4 1VJM VWUJ 


111 


39 


11161 

X X XV X 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


111 


37 


11162 


G02532 


Homo saoiens 


Human secreted protein, SEQ ID NO: 
6613. 


149 


53 


1 1163 

X A 1UJ 


AF090930 


Homo saniens 


PRO0478 


146 


73 


1 1 164 

ill Ut 


G03798 


Homo saniens 

X 1UJJ 1 JUplVlU 


Human secreted protein, SEQ ID NO: 
7879. 


90 


60 


11165 


AF242772 


Homo sapiens 


mesenchymal stem cell protein 
DSCD28 


141 


51 


11166 


X82629 


Homo sapiens 


Mox-2 


307 


83 




TY*ff 1 17 


Homo sanipn*; 


NADH dehydrogenase subunit 4 


168 


91 


11168 


X55684 


Lycopersicon 
esculentum 


extensin (class I) 


71 


66 


11169 


AC079041 


Arabidopsis 
thaliana 


hypothetical protein 


98 


45 


11170 


AF090930 


Homo sapiens 


PRO0478 


156 


50 



432 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterraan 

Score 


% 

Idcntit 

y 


11171 


Y07R86 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


99 


70 


11 17? 
til i — - 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


124 


54 


11173 
1 1 1 1 j 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


587 


82 


11174 

111 #*T 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


127 i 


66 


11175 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


157 


68 


11176 


AF090931 


Homo sapiens 


PRO0483 


146 


87 


1 1 177 

J XI r f 


M15530 


Homo sapiens 


B-cell growth factor 


153 


75 


11178 


AF090930 


Homo sapiens 


PRO0478 


129 


52 


11179 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


98 


77 


111R0 

111 OV 


AF 194537 


Homo sapiens 


NAG 13 


165 


48 


11181 


AF068294 


Homo sapiens 


HDCMB45P 


146 


53 




nXU7«7J 1 


Homo saniens 


PRO0483 


153 


78 


1 1 1 R3 


VYHOJJ 1 


Homo saniens 

i 


Human breast cancer related protein 
BCRB2. 


141 


65 


1 1 lOH- 


VI 3306 


Homo ^aniens 


Amino acid sequence of protein 
PR0332. 


452 


92 


1 1 


APnono3fl 


Homo ^anien^ 


PRO0478 


132 


51 


11186 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


178 


79 


1 1 187 
L I La / 


IJ40073 


Homo saniens 


ORF1; MER37; putative transposase 
similar to pogo element 


237 


70 


1 1 1RR 
1 1 ioo 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


117 


83 


1 1 1 RO 


AT .080206 


Homo ^aniens 


hypothetical protein 


103 


71 


11190 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


135 


77 


11191 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


112 


65 


11192 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


127 


54 


1 1 1Q1 

1 1 17J 


P7H404 


Homo Rani ens 

X X\J1 llv 0**L/*vli*» 


Sequence of human B-cell growth 
factor (BCGF). 


150 


65 


1 1 104. 


M02046 


Trvnanosoma 
cruzi 


surface antigen 


160 


26 


1 1 1Q< 


Y 10767 


Homo saniens 


SEQ ID NO 485 from W09922243. 


126 


72 


1 1 106 
1 1 iyu 


G03274 


Homo saniens 


Human secreted protein, SEQ ID NO: 
7305. 


98 


75 


11197 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


117 


50 


11198 


D88548 


Homo sapiens 


24-kDa subunit of complex I 


160 


47 


11199 


AF090852 


Antilocapra 
americana 


prion protein 


115 


38 


11200 


AP000616 


Oryza saliva 


similar to RING-H2 finger protein 
RHAla(AF078683) 


136 


62 


11201 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


139 


75 


11202 


U93563 


Homo sapiens 


putative pi 50 


239 


43 


1 1203 


U93570 


Homo sapiens 


putative pi 50 


173 


45 


11204 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


166 


68 


11205 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


155 


68 


11206 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


87 


56 



433 



WO 01/64835 



PCT/USO 1/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


11207 


W48808 


Homo sapiens 


Homo sapiens clone CG109_l protein. 


123 


89 


1 lZUo 


thai 1? 


xiorno sapiens 


cyiuciLruiiic uaiucoc buuunii o 


0 \y 


89 


1 19AQ 


W4ooUo 


riomo sapiens 


rionio sapiens cione v^o iuy_i protein. 


19^ 
IZj 


so 


11210 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


521 


83 


11011 


VJUZ40J 


nomo sapicna 


T-fiiman Gf>r*rf*+oA r\n\tt>tn QT?<^ TT^ MH- 

riuinaii icwcicu pruLcin, ocy iu v\\j. 
6566. 




fO 


1 1 9 19 

1 iziz 


Uio 1 IZ 


noiuo sapiens 


cytoenrome c oxiaase suDunii i 


D4/ 


fit 
Ol 


11213 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/o /o\ 


92 


74 


i i *> i A ( 
1 lzi4 


nni 1 99 
UU3 1 /Z 


jtiomo sapiens 


Human secretea protein, oni^ uj invj. 


IZO 


0^ 


11215 


AF162149 


Mycoplasma 
bovis 


variable surface lipoprotein 


118 


34 


11216 


AF130089 


Homo sapiens 


PRO2550 


147 


96 


inn 
1 1/1 / 




Homo sapiens 


nn C\(\A 91 


1 19 

1 lz 


99 


1 1Z15 


OUZ53Z 


Homo sapiens 


riuman secretea protein, &r\Z iu inlj- 

#?£19 


1 /CI 

103 


9A 


11219 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AAQ7 
445*/. 


111 


55 


11220 


AF130051 


Homo sapiens 


PRO0898 


122 


63 


i ion 
1 Izzl 




Homo sapiens 


rKAJzojz 


1*71 
1 /l 




11222 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AAA A 

4414. 


146 


55 


11223 


Y02671 


Homo sapiens 


Human secreted protein encoded by 

„ fl _ n j%fs> nA TJX /TO TXI/1 Q 

gene zz clone nMoJWio. 


130 


71 


11224 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7o/0. 


128 


69 


1 1225 


G02532 


Homo sapiens 


Human secretea protein, bfcKi iu WU. 
6613. 


1 A 1 

141 


or 
OJ 


11226 


G0Z36U 


Homo sapiens 


Human secretea protein, obQ ID NO: 
6441. 


t A1 


AA 

90 


1 1227 


M155JU 


Homo sapiens 


o-cell growth tactor 


lzz 


55 


11228 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


121 


67 


11229 


M15317 


Plasmodium 
iophurae 


histidine-rich protein 


116 


43 


11230 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

9£9C 

io/o. 


131 


78 


1 1231 


11/1 OQ19 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCS55. 


HA 
/4 


10 


1 1232 


Yo6248 


Homo sapiens 


Human secretea protein riL>rirroo, 
SEQIDNO:163. 


19>4 

1J4 


Of 


1 1999 

1 1ZJ j 


A V/Y79A/K 


Homo sapiens 


unnamea proiem proauct 


inn 

1UU 


AQ 


11234 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 

dua la f A FfttftAC^ 
KxlA. la ^/\rl// OOpJJ 


127 


82 


1 199^ 
1 IZjj 


Ar UUl/0 1 0 


v/iyZa adllva 


hiiniidi iu xviiNvJ~nz linger piuLciii 

RHAla(AF078683) 


19S 

1ZJ 


OJ 


1 i99/£ 

1 lZou 


At UUU01O 


wryza sativa 


buniidi 10 i\iiNvj"*jriz liugcr piuicin 
RHAla(AF078683) 




jj 


1 199*7 
1 lZJ / 




Homo sapiens 


oxy sterol /aipna-nyaroxyiase 


144 


99 
#Z 


11238 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


112 


44 


1 lzjy 


X oOZ4o 


xiomo sapiens 


uuman secretea protein Motir r 05, 
opa m wn- 1 

OE«V< INVy.lOJ. 


101 


yo 


1 174.0 


G0748S 




Wiiman QprrpfpH nrntpin ^FO TO TnJO* 

UmiMUl OVV/ICICU LflWtdalj vJJUiV/ 

6566. 


127 


43 


11241 


L36529 


Homo sapiens 


protein p84 


343 


90 


11242 


M15530 


Homo sapiens 


B-cell growth factor 


122 


55 


11243 


YOU58 


Homo sapiens 


Secreted protein encoded by gene 18 


130 


65 



434 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Water man 


Identit 










Score 


v 
7 








clone HCACJ81 






1 1 744 


1 197 698 


IVflUUj 


riHoiomal nrotein £2 


229 


74 






norvesncus 

AA wa ▼ v&iwwp 








1174S 


Y86248 


Hflmn sanien<* 


Human secreted Drotein HCHPF68 


116 


41 








SEQ ID NO: 163. 






J iZ*fu 


W /JOJ7 


Hnmn otmipnc 


Human Qpfrrp.torv orotein of Honp 


123 


37 








DM340-1. 






1 1247 


G03800 


Homo aniens 

J lUUiV dUUlVIU 


Human secreted protein, SEQ ID NO: 


132 


73 








7881. 






11248 


G03789 


Homo saoiens 


Human secreted protein, SEQ ID NO: 


93 


56 








7870. 






1 1249 


B03678 

JJV JVJ— O 


Homo saoiens 


Human phospholipase 2 HPPL2. 


132 


56 


1 1750 


003789 


Homo ^anien^ 


Human secreted protein, SEQ ID NO: 


128 


72 








7870. 






1 1251 

1 1XJ L 


S79410 


Mus musculus 


nuclear localization signal binding 


115 


46 








rtrotein 






1 1757 


AK021455 


Homo oaoiens 


unnamed protein product 


146 


81 


1 IZjj 


Y4jjoZ 


jtiuino sapiens 


Human op/*rptprl rvrntpin "fTacrrriPnt 
nUlUQU oCL>i CltAJ yX ULG ill ua^lllwtlt 


88 


56 








encoded from &ene 28 






1 17^A 


cr ao i £ 


liUiliu Sapiens 


■nnrnrid ,, r* M nrotPHi Pos^alivarv 


113 


43 








oroline-rich orotein lexon 31 






1 175<> 

1 IZJJ 


O00454 

\jVI/*tJ*t 


Homo ^anipn** 


Human secreted protein, SEQ ID NO: 


136 


45 








4535. 










Homo oanipn*! 


Human secreted Drotein SEO ID NO* 


107 


67 








6463, 








A Fl 04.5^7 
rvT i 7HJJ / 


Hnmo oanipriQ 


NAG 13 


153 


76 


1 1758 

1 IX JO 


Ml S53ft 


Homo aniens 


B-cell growth factor 


130 


70 


1 1750 




Homo oanipriQ 


Human secreted Drotein. SEO ID NO* 


131 


72 








7437. 






1 lxOU 




VTrwnr» ettntpne 


Human secreted orotein SFO ID NO* 


104 


59 








7214. 






1 17£1 
l lxoi 


G07Q77 


Uftmn esmfpne 


Human <*ecreted orotein SEO ID NO* 


316 


92 








7003. 






1 1767 


(100337 


Homo <*anien<; 


Human secreted nrotein. SEO ID NO: 


111 


81 








4413. 






1 1761 


000^97 

VJUUJ7 # 


Homo <ianiens 


Human secreted orotein. SEO ID NO: 


142 


71 








4478. 






1 1264 


AF1 30089 


Homo saoiens 

A .ax/a aa\j wMt/j viiii 


PRO2550 


.111 


66 


1 1265 


G03807 


Homo saoiens 


Human secreted protein, SEQ ID NO: 


118 


71 








7888. 






1 1766 


Ml 5530 


Homo saniens 


B-cell growth factor 

WVU &A vn VA1 A«*WW* 


158 


76 


11267 


AF 130079 


Homo saoiens 


PR02852 


124 


68 




VIAdRO 

I l*WOv 


Hrtmrt our»if*nc 

xLvjlUKJ JU^iwD 


Frapment of human *!ecreted orotein 


104 


50 








encoded by gene 14. 






11269 


G03790 


Homo saoiens 


Human secreted protein, SEQ ID NO: 


125 


57 








7871. 






11270 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


63 








7868. 






11271 


AF1 18082 


Homo sapiens 


PRO 1902 


87 


47 


11272 


M10546 


Homo sapiens 


cytochrome oxidase I 


305 


74 


1 1273 


U 12390 


Cloning vector 


beta-galactosidase alpha peptide 


108 


53 






pSportl 








1 1774 


G03793 


Homo <winien^ 


Human secreted protein, SEQ ID NO: 


235 


81 








7874. 






11275 


U93570 


Homo sapiens 


putative p!50 


182 


54 


11276 


X03145 


Homo sapiens 


pot ORF V 


144 


60 


11277 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


57 








7888. 






11278 


AL035681 


Homo sapiens 


dJ756G23. 1 (novel Leucine Rich 


269 


100 



435 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 














11279 


G00379 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4460. 


103 


61 


11280 


G03356 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7437 


115 


70 


1 17X1 

1 1 -£*0 1 


G02532 


Homo ^aniens 


Human secreted n rote in SFO ID NO* 
6613. 


87 


71 


1 1/oZ 






Hitman VtfT^teH nrotpin pnrnrfpfl hv 
eeneNo 107 


162 




11283 


AF116715 


Homo sapiens 


PR02829 


141 


71 


i U(TT 


AT 3W789 


Trvna n aq Am a 

brucei 


piuuauic oiiiiuai iv/ ting iia luigd 

protein rhala. 


1 1 1 


7r» 


1 

1 iz,o_> 


TT 59077 


Haitia canipnc 

Jd.VJl.AlU OcUJlvild 


marinpr triin<inAcjjQp 


1 70 


78 


11286 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

Uu 1 J. 


120 


62 


11287 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7R79 


103 


45 


11288 


W50193 


Homo sapiens 


Amino acid sequence of salivary 

nrntpin PDN-9 


63 


60 


11289 


AF095770 


Homo sapiens | 


PTH-responsive osteosarcoma Dl 

Ytrr\tf*l~n 


123 


72 


1 1^7U 


V164QS 

I jO*t7J 




P"m trrrtpnf hiimnn cprrpt^H tirAfpiri 
ria^iLidii ui mmiaii bcuicicu. jjiui&ui 

encoded hv ffene 27 






11291 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


106 


46 


11292 


AF1 16715 


Homo sapiens 


PR02829 


130 


63 


1 19Q1 




jnuiuvj oopiCllb 








11294 


AP000616 


Oryzasativa 


similar to RING-H2 finger protein 

ivrxrv i a \i\r \j / ouo j ) 


119 


65 


11295 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
<*FO ID NO- 1 (%\ 


102 


50 


11296 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7R70 


141 


49 


1 1907 


vjvujy / 


xiuino bd.pi cub 


Human cprrptpH nr Attain QPO Wi KTO* 

4478 


Lj'x 




1129& 


R0RQ1 R 


IlUlliU daLJlviXd 


Human oftcreted nrntein cprnipnpp 
encoded bv eene 28 SEO ID NO*75 


126 


54 

i 


11299 


AK025270 


Homo sapiens 


unnamed protein product 


150 


54 


11300 


AF 118078 


Homo sapiens 


PRO 1848 


121 


63 


11301 


AF090942 


Hnm/) oanienc 


PRO0657 


81 


43 


11302 


AF1 16715 


Homo sapiens 


PR02829 


131 


60 


1 1 JV J 




jnuiuu ad^jiuur> 


PRO0483 


121 


71 


11304 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2 


108 


82 


11305 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


122 


59 


11306 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


93 


89 


11307 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


108 


65 


J J JVO 


<"»ft9R79 




Human secreted nrntein 5%EO ID NO* 
6953. 


114 


51 








Human cprrpfpH nrntein SRO ID NO" 
4435. 


67 


55 


11310 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


105 


71 


11311 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


104 


73 


11312 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


88 


73 



436 



WO 01/64835 



PCT/US01/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Identit 










Score 


y 


11111 
1 13 1 j 




1 IjpdDObUIIla 


ATPa<5f» <nihunit 6 


94 


31 






CTUZl 








1 1 J 14 


VOOfiAB 

IAf 605 


tiomo sapiens 


Human cprrptftH nrAtein eti coded Hv 


157 


53 






UPT7P No 1 07 






1 1 J> 1 J 


Al 2CQ7CO 


1 lypaflOSODla 


Tr»tr/\1**»i1-»1f» cimilai* tA Hn<r-h0 finopr 
pXUUaUie suiilJaj iv/ luig itiigvi 


1 13 

1 lv 


54 






brucei 


nrotpin rha 1 R 
piULCiu. ma to. 






11310 


AO A Q1 

OU2493 


Homo sapiens 


Human cprrpff>H nrntf-in *sFO \M MO* 
riuxilau se^-iciraj unjieiiij ola,^ xu ywj. 


16S 

lOJ 
















11117 
1 13 I / 


n ao *\1 a 


tlUlilU Sapiens 


Human Qftcreted nrotein SEO ID NO* 


149 


74 








1 ^. 






1 1315 


atdaaa£1£ 


KJiyZa. Sail Vd 


Sllililai LI/ XVJLlH \J x L£* illlgvi V/lA/Ul 


144 


55 








RHAla CAF078683^ 






1 1110 


APl6115fi 


HA1YIA C^ntPTIQ 
I1UU1U oaiJlvlld 


HSPC093 


112 


56 | 


1 1 lOA 
1 1 32U 


<3<Q700 
OJO/22 


nonio sapiens 


W llllivt/ij ifwiiuupatuy piviicui \ v> 


102 


53 








terminal clone XEH 8cl 






1 1101 


API 1£££1 


nuuii/ Sapiens 


PRO 143 8 


131 


74 


1 1100 




xiumu sapiens 


nnn am pd nrAtein nrfviiirT 


137 


59 


1 1101 
1 1323 


OU3 /U3 


nomo Sapiens 


Hitman «^errpted nrotein SFO ID NO* 


74 


57 








/ / 0*T. 






1 110A 
1 1 324 


TT^I-^AO 
U03J**Z 


riomo Sapiens 


PAP nrotein 


151 


79 


1 110*\ 


X/f1^<1A 

jyll jjju 


Unmn cani enc 
jriuinu sapiens 


T**-cp11 ornwth far tor 


192 


45 


1 HO£ 
1 1 320 


AP1 1 £71 
At 1 lO/ 1 j 


nuuio sapiens 


PR02829 


164 


78 


11327 


i~*A1A/i1 

003043 


xiomo sapiens 


Unman cprr^tpd nrntpln v^Uf ^ f I i Tvjf >• 

riLLiria.ii scrcreieu pr ulclii, ocy jul/ n 




62 








7124. 






1 1 179 
1 1325 




jFiumu sapiens 


PROA 1 1 7 


116 


69 


1 132y 


A PAOAOAO 

Aroyuy4z 


xiomo sapiens 


PROfi/\*?7 

I I\UUvJ / 


132 


62 


11330 


002532 


HOmO sapiens 


jiuiTiaji score ieu pruicin, joy xu nvj. 


100 

i 7U 


72 








A611 






1 1331 


TT1710A 


oioning vecior 


DeEa-gaJaClUSlUdsc dip I la pcpuuc 


110 

X 1Z, 


49 






poponi 








1 1 117 
11332 


OOOooy 


Homo sapiens 


"Human cw>ri»tf»d r»rrktf»in ^Pfi TD TSlO* 

ii urn ail sevreieu piULCin, ocy xu n\j. 


1 1Q 

117 


38 








4770 






1 1 1ll 
1 1 jjj 


VftAOAft 
I oOZfo 


numvj sapiens 


Human <*ecreted nrotein HCHPF68 


131 


52 








^PO ID NO- 1 






1 1 11>l 

1 1334 


/TiA1QA7 

O03o0 / 


nomo sapiens 


Wnman cprrpfv^d nrntpin QPO IA NO* 

ixUJuai) scwreLeu pruiciu, ocy ulf nu. 


145 


53 








/ ooo. 






1 1 11C 

1 1335 


Ay/4o3 


riasmoaium 


pval 


105 

IvJ 


38 






VivaA 








1 1 ii£ 
1 1330 


[yIIjjjU 


Homo sapiens 


orAwfK ■far , tAt* 


148 


73 


1 1 no 

1 1 .53 / 


A PA AAA 1£ 


An/*7Q oafT\/Q 
\JiyzdaL. Saliva 


cimilar tA R INO-H9 fmcpr nrAtein 


126 


100 






RHA1» ^AP078r*^ 






1 111ft 

1 1 JJO 


api i rar^ 

Ar 1 1 BUoO 


noiou sapiens 


PROI 009 

Jrivw i yy£m 


123 


80 


1 1 110 


nAO^io 


nuiuu sapiens 


Human oerreted nrAtein SrtO TO NO* 


79 


66 






6613. 






11340 


000510 


Haiti a oanien*i 


Human secreted Drotein. SEO ID NO: 


162 


73 








6613. 






1 11*41 


AP11fifV7Q 


Unmft canipnQ 
XAUiiivj oapieuo 


PR02852 


145 


80 


1 \ %AT> 
1 UH-£ 


MIOdlQ 


T^4n«5 mn^ciilii^ 

XVI UD UllUvUllU 


nroline-rich salivarv Tarotein 


104 


44 


1 11A1 




T4r\mA canipnQ 

XlL/llivJ *>01/lwJXd 


Human eannahinoid GPR N-terminal 


123 


62 








seouence 






1 HvM 


nA17ftfi 
VJv3 /oO 


lxUlllU SapiCUS 


Human secreted nrotein SEO ID NO - 


120 


91 








7867. 






1 11A*\ 


1 T0AR10 


XJLi/lllU dOpiCild 


KSRP 

IVOXVi 


122 


51 


1 1340 




numu sapiens 


Human lii*5t*APvte-9prTeted factor HSF 


139 


56 


11347 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


188 


87 








4473. 






11348 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


173 


59 








7133. 






11349 


W48351 


Homo sapiens 


Human breast cancer related protein 


111 


68 



437 



WO 01/64835 



PCT/US01/04927 



SEQD> 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 








DVyfvDZ. 






1 13SO 


Ci034l 1 


14 run a cpnif k n<v 


Human cpvrptfvl nrntpin ^FO 1H >JfY 
7492 




67 
uz 


11351 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded bv eene 2 SEO ID NO250 


267 


68 


11352 


AF161356 


Homo sapiens 


HSPC093 


134 


51 






HrtTTIA QSmPflQ 

JTLVJlllV dCUJldid 


npurnnfll tfirpaH nrntpin AD7p-NTP 


139 


47 


11354 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6^74 
oj /h. 


115 


64 


113^ 


fifV7<\3R 

VjUZJJo 


XluiliU SaplCllS 


Human <5f»rTPtprl nrntf*in ^FO TP) NO* 

6619 


176 


67 


11356 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588 


123 


59 


11357 


AF130089 


Homo sapiens 


PRO2550 


114 


62 


1 13^R 

1 1 J JO 




niJIilvJ Sapiens 


UUUaliJCvl UlVJICul LUUUUI<1 


196 


41 
ti 


1 13^0 


fj/Y9^39 


nuiuo sapiens 


Mum an c^r*iv»+*»H nrntpin QPf^ TP) ^Jfl* 
11 UI 11 all SCUiCUJU piUUetJLl, JUy nv. 

6613. 


173 


XI 

0 1 


i i36n 


ofv37 1 4 


nomo biipicub 


Human w^rrptpH nrntcin ^FO TD NO- 
rr urn an scuicicu. piuicui, OCy ll^ IN V7. 

7795. 


1?X 

1ZO 


64 


i i3£i 


AT I JUUDO 


riomo sapiens 


ppni367 

rlsAJl JO/ 


74 


^7 


11362 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


63 


1 1 3A3 

1 1 jOj 


W91^R1 
WziJol 


nomo sapiens 


A 1 tV» i tti <» r* O rlicoocA TM*AfrPin ^YlfMtHpH rv\/ 
jrV.LZ.IlC II I1C I S UlSeoSC pi UlClll C11WUCU Ujr 

DNA from plasmid pGCS 1 1037. 


104 

lVT 


01 


1 ioCKt 


vjtKfU /Z 


riomo sapiens 


Lr l1t v.n n cr*rrf±tf*ri nrntpin CPO rpi NIO* 
rtumdu scLreieu pruiciu, ocy il/ in*c*. 

OIJJ. 


137 


^3 

j j 


11365 


Z30643 


Homo sapiens 


chloride channel (putative) 


409 


98 




vjUUjoo 


riomo sapiens 


Unman cp^r-At<^rt rtrnfv^in CPO ID MH* 

numau sevretcu pruiciu, ony llj in 
4669. 


1H7 
IV/ 


Oo 






riomo sapiens 




AO 1 


^7 


11368 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

661 0 


110 


71 


11369 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

717A 

/ 1Z*+. 


136 


89 


1 137ft 
1 U f u 




nomo sapiens 


Human cf>rrf>tr*r\ rtrr\tf>i-n QPO IPl MO* 

1 1 urn an seoreicu prutcin, oca^ iu in kj. 

4457. 


190 
izy 


70 
iy 


1 1371 


I oOZ^to 


nomo sapiens 


ri tun ail seoretcu pruicui riv^rixrv>o, 
SET) FDNO-163 


137 


S7 


11372 


U18339 


Variola virus 


D4L 


95 


61 


1 1-> /.> 


VAl 1 ^fc 
lUl I jo 


riuiuo sapiens 


^f»rr¥>tf»rl nrr\ff*in An/*/\HpH hv DP'Tif* 151 

clone HCACJ81. 


100 




1 1374 


x/fi <^qn 
iyii 


riuinu sapiens 


P-f pll ornwfh fhrtfir 

D~*"wvlL glvVVLLI la\A\fl 


135 


61 


11375 


X92485 


Plasmodium 

YlvcLv 


pval 


111 


50 


11376 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795 


111 


71 


11377 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2 

JJvIW«n 


111 


68 


11378 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5327 


96 


53 


11379 


G02567 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
664ft 


83 


54 




APflftft61 6 


vji yjja. oauva 


similar tn PTNCt-1-19 finder nrotein 
RHAla(AF078683) 


110 


85 


11381 


GO3790 


Homo saoiens 


Human secreted protein, SEQ ID NO: 
7871. 


156 


64 


11382 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


121 


79 


11383 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


104 


41 



438 



WO 01/64835 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% i 

Identit 

y 


11384 


G00354 


Homo sapiens 


Human secreted orotein SEO ID NO* 
4435. 


86 

Ov 


J J 


11385 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


114 


40 


11386 


AF068294 


Homo sapiens 


HDCMB45P 


158 


48 


11387 


B08976 


Homo sapiens 


Human secreted orotein seauence 
encoded by gene 28 SEQ ID NO: 133. 


136 


46 


11388 


AF090931 


Homo sapiens 


PRO0483 


123 


74 


11389 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


156 


75 


11390 


W4S351 


Homo sapiens 


Human breast cancer related protein 
BCRB2 


102 


48 


11391 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


117 


54 


11392 


W54966 


Homo sapiens 


Synthetic human type III collagen 
SYN-C3. 


134 


43 


11393 


S79410 


Mus muscuius 


nuclear local i7.ati on si on a! hind in v 

protein 


1 1 1 


46 


11394 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


128 


49 


11395 


AF090930 


Homo saoiens 


PRO0478 


148 


70 


11396 


AF210651 


Homo sapiens 


NAG 18 


100 


62 


11397 


G03798 


Homo ^aniens 


Human secreted nrotein SFO iD NO* 

1 LLU Hull Owwlbl&u LIlV/LbUX, uUV^ JLL/ lr* \J . 

7879. 




56 


11398 


AF063243 


Ros taurus 


rihosomal nrntein T .^0 


161 


77 


11399 


AF162149 


Mvcoolasma 

1»1J W Ls 1 EUU4 a 

boYis 


variable surface linonrntein 


104 


37 


11400 


Y02999 


Homo saoiens 

Alvillw gflpiVHw 


Kraement of human secreted nrotein 
encoded by gene 121. 


175 1 


71 


11401 


D38U2 


Homo sapiens 


NADH dehydrogenase subunit 4 


341 


92 


11402 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
nrotein 


137 


90 


11403 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
eene 22 clone HMSJW18 


115 


70 


11404 


M10126 


T.eish mania 

i~i w loin uduiq 

tarentolae 


NH2 terminus uncertain 


101 


41 


11405 


G02485 


Homo saoiens 


Human secreted nrotein SftO TD NO* 
6566. 


126 


60 


11406 


X82385 


Homo saoiens 


RNA oolvmerase 11 snhunit 


296 




11407 


G03714 


Homo saoiens 


Human secreted nrotein SKO ID NO* 
7795. 


151 


68 


11408 


Y14482 


Homo saoiens 


FrapTnent of human secreted nrotein 
encoded by gene 17. 


141 


44 


11409 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


118 


77 


11410 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


169 


77 


11411 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


169 


82 


11412 


AK025116 


Homo sapiens 


unnamed protein product 


133 


44 


11413 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


113 


63 


11414 


AK024455 


Homo sapiens 


FLJ00047 protein 


127 


65 


11415 


AK024455 


Homo sapiens 


FLI00047 nrotein 


87 


63 


11416 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


94 


57 


11417 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


123 


58 


11418 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


148 


69 



439 
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SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 

y 


11419 


S79410 


Mus musculus 


nuclear localization signal binding 
proiein 


113 


53 


11420 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/ 143. 


166 


68 


11421 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AAAA 


124 


51 


11422 


G01249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
ST*n 

33 3U. 


113 


70 


1 14Z3 


VJU3 /*>/ 


fiuiiiu sapiens 


Uumon cafrataA nrntain CCA TT\ "KIO* 

numan secreiea proiein, ocy ul» invj. 
7868. 


oo 


£9 
OZ 


1 149 A 
1 14Z4 


WooOZ / 


nonio sapiens 


oecretea protein encoaea oy gene 74 
clone HPMBQ32. 


130 


/z 


1 1 A95 
1 14Z3 


AV flfi91 ^4 
AJVUUZ 1 


Uaitia coniAnc 

nonio Sapiens 


unnamed proiein proouci. 


909 

zyz 


7^ 
/3 


11426 


Y02887 


Homo sapiens 


Fragment of human secreted protein 

an/«Ai^a/1 \wt nana Gf~l 

encoaea oy gene y\j. 


98 


75 


11427 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


143 


64 


11428 


Y02671 


Homo sapiens 


Human secreted protein encoded by 

nana OO r>1rkno I-IA/1CTU71 Q 

gene zz clone nMoJWio. 


131 


81 


11429 


S58722 


Homo sapiens 


X-linked retinopathy protein {C~ 
terminal, cione AJ&ri.oC) 


130 


81 


11430 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

^AQO 
/47Z. 


172 


61 


11431 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

44U7. 


112 


74 


11432 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

0013. 


I2g 


52 


11433 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene yi). 


128 


58 


1 \A1A 

l 1434 




Homo sapiens 


Human secreted protein, aJ&vi juu jnci: 

4403. 


14/ 


03 




I UZO / 1 


nuuio sdpieiib 


Human c&watotA nrAtoin otipa/Ip/I Vvi/ 

nunidii bccreieu proiein encoueci oy 
gciie zz wune rnvioj wio. 


1 1Q 


53 








IN/VL^n. UCUjCU L>g,CIlaoC 


1Sd 
l J*t 


S7 
3 / 


11437 


AF090895 


Homo sapiens 


PRO0U7 


122 


59 


1 \A*\SL 
1 IHjO 




A o/*qfic ciium 
//VaCdTIS bUUUI 


COllagC 11 


on 
yvj 


40 


1 1 A1Q 




jtionio sapiens 


rlvUvvj / 


1AA 
100 


^9 
3Z 


11440 


AK024455 


Homo sapiens 


FLJ00047 protein 


100 


55 


1 1 AAA 
1 1441 


vjrUU03 / 


jtiomo sapiens 


jtiuman secreiea proxem, otiv^ uj invj. 

471 K 
4/ lO. 


110 


4y 


1 \AA7 
1 l*r*fZ 


mi ^sin 

lYlU J J\J 


riomu sapiens 


o-ceu growin laciur 


IU3 


7^ 


1 l*T**3 


VJI/3V43 


riumu sapiens 


Human c/v^ri^tf^H nrntpin QPO TFl XTPi* 
JTluniall SCvFClCU piUlCUl, i3Dy 1U iiU. 

7124. 


1 33 


t\A 
04 


1 1444 


Y1Q710 




SEO TD NO 44R from 

OJL>V^ IXJ 1 iV»/ *rtO U Villi VT V/77Z.x.Z.*tJ. 




67 


11445 


AF090895 


Homo sapiens 


PRO0117 


79 


61 


1 1446 


API l&ftKA 


iioiuu Sapiens 


JTIvV/ 1 77Z 




03 


11447 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 


160 


80 


11448 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein naguicni jy. 


97 


75 


11450 


Y02671 


Homo sapiens 


Human secreted protein encoded by 

crane* 99 rlnno "LTN/fC TW/1 Q 

gene zz cione xiiyioj wio. 


136 


72 


1 1431 


I VZyy y 


Homo sapiens 


fragment or numan secreiea proiein 
encoded by gene 12L 


RO 

oy 


33 


11452 






Secreted nmtpfn pnraHprf hv ppnp 1 X 

clone HCACJ81. 


144 


71 


11453 


AF090931 


Homo sapiens 


PRO0483 


140 


78 


11454 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


129 


65 



440 



WO 01/64835 
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SEQID 
NO: 


Accession No, 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1 14S*> 


onoiRi 


rlOIIlU Sapiens 


Human oecretRd nrotein SEO TD NO* 

ixtUlluil ObvlblbU piV/lV/JXX, uUy YU XriKJ . 

4464. 


104 


61 
Oj 


1 14S6 


Ar i louoo 


iiOIDtl oapiCllb 


PRO 1992 


151 


71 
/ 1 


1 14 S7 


1 171 1 6R 

\J 1 J 1UO 


T f r\m n caniVnc 
ilUlllvJ ixipiciib 


narrial CDS human nutativf* tiimnr 

paxuax V/L/Uj xxuxxiaxx puiauvc ixxtiiv"x 

suonressor flJ2394t9y 


171 


OS 


11458 


S79410 


Mus muscuius 


nuclear localization signal binding 
nrotein 

L/X VflA^IXX 


96 


73 


11459 


AK000496 


Homo sapiens 


unnamed protein product 


233 


63 


1 1460 




JxC/lTIC/ SapiCHS 


FT To0f)47 nrnfem 

iljJUwVt/ piUlGllx 


90 

.71/ 


71 


1 1461 


ooi7i4 

VJUj / 1*4 


nuuiu sapiens 


Human ofiereted nrotein SFO ID NO* 

xxuiiiuxi oMfiwivu piv/iwxxx, oby xx-* liv. 

7795. 


119 


64 


1 1469 
1 l*rOZ 


Oftn.176 


nuniu sapiens 


Human secreted nrntein SFO TD NO* 
xxuiixaii dixicim piuicuij ucy xx./* iivy. 

4457. 


101 


86 






nomo bapiens 


calivart/ nritlinfv-rir'n Timtpin nrwnrcrw 
SOU VOX jr piv"lxxxC"*xx*-lx piuicxxl pi CVUl aUl 


IIS 


16 


1 1 AAA 


AA9S19 


jLiomo Sapiens 


Unman c/vrpfwl nrrttpm SFO TFi NO* 

n urn on scvrcieu pruicin, ocy wj 
6613 


197 


RR 


11465 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595 

UJ7J. 


104 


77 


11466 


AF1 15549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


141 


48 


1 14A7 

1 1*H>/ 


AF0Q0Q49 


ixonio sapiens 


PT?Oft6-V7 

X lYWVO J / 


R0 


Jo 


11468 


AC005698 


Arabidopsis 
inanana 


T3P18.6 


128 


47 


11469 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7R67 


101 


48 


11470 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7R67 
/OO/. 


136 


81 


11471 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4660 


91 


45 


1 l AT) 


O09*\i9 


noiuo sapiens 


Human cp/*rptf»H nrrrtpin SFO TO NO* 
xx uixx oil :>cviciC\J piiHOiix, ocy ixj 1> v_/. 

6613. 


109 


77 


1 1471 
I l*f / J 


O017R7 
VJUJ tot 


xxvjJiiu sapiens 


Human cporpt^H nmtpin SFO TTi NO* 

XlUxlIOxx jCw>1 CIChJ pii/LClxl, ocy xi_/ mu, 

7868 


14S 


SS 


11474 


AF210651 


Homo sapiens 


NAG18 


117 


57 


1 1 47<\ 


VI 4AR9 


nomo sapiens 


rragmeni oi nuiiiuii scercLcu pruicm 
encoded by gene 17. 


1 17 
11/ 


69 


1 1 ATA 


A pi 1 £71 *\ 


nomo sapiens 


PPO9R90 


194 


77 


1 1 A77 
1. It / / 


I OOZ*+D 


nomo sapiens 


Human csnrrptfvl nrntpin HOPTPF6R 

nuiudu scvwieicu proiem nv>nr roo, 
SEQIDNO:163. 


116 


4S 
*tj 


1 147R 


Ofl9R9R 


nomo Sapiens 


Hitman cprxpfpH nmtpin SFO FH NO- 
jriiiiiicui sevfexeu pruicm, OE/\£ lu nw. 

6909. 


R*> 


7R 
/ o 


1 A*T /-7 


A7l7tU 


xiajhiv/ sapicxia 


WNT-RR nrntrin 

VT 1> 1 OD pi "J 111 


904 


97 


11480 


Y02671 


Homo sapiens 


Human secreted protein encoded by 

apne 99 rlnnp HMSTW1R 

geuw Wl/JIG XXlVlOJ vy io. 


131 


71 


11481 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
ffene 99 clone HM55JW18 


117 


63 


1 1482 




P 1 a cm fMi ii l m 

XT laolUUUl U1U 

vivax 


pvcxx 


98 


52 


11483 


AF217518 

ill *- X / — ' X CJ 


Homo can i ens 


un characterized bone m arrow nrotein 
BM042 


895 


100 


11484 


U79260 


Homo sapiens 


unknown 


104 


80 


11485 


L27428 


Homo sapiens 


reverse trariscriptase 


369 


52 


1 1486 


Ml 5530 


Homo <y?nien*5 

XXVXXXVJ OCtL/XV^XlO 


B-cell prowth fartnr 


122 


67 


11487 


U49974 


Homo sapiens 


mariner transposase 


156 


62 


1 1 4RR 


AFfiQfi049 


I-Jnmr* canipnc 
I luinu 5apiciK» 




1 10 


44 


11489 


G00328 


Homo ^aniens 

A M.\JtXX\J OvU^XwXXO 


Human secreted nrotein SFO ID NO* 

XXIXXIKtll JvCt ^»L^vl L/J \/U^XXX, »JX_»\^ XI-/ llx/. 

4409. 


156 


65 


11490 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


115 


52 


11491 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


51 



441 



WO 01/64835 



PCT/US01/04927 



SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








7214. 






11492 


AF090930 


Homo sapiens 


PRO0478 


145 


58 


11493 


AOJ720 


Phoca vitulina 


cytochrome c oxidase subunit I 


556 


87 


11494 


AL021897 


Mycobacterium 
tuberculosis 


pra 


119 


56 


11495 


X98485 


Plasmodium 
vivax 


putative 


92 


35 


11496 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


156 


73 


11497 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


129 


72 


11498 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


115 


66 


11499 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


58 


11500 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


113 


36 


11501 


AE003568 


Drosophila 
melanogaster 


CGI 25 66 gene product 


105 


32 


11502 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


141 


78 


11503 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


102 


51 


11504 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


138 


60 


11505 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


109 


85 


11506 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


101 


59 


11507 


M81321 


Macaca 
fascicularis 


proline-rich protein 


128 


48 


11508 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


110 


78 


11509 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
788L 


98 


60 


11510 


X07882 


Homo sapiens 


Po protein 


118 


41 


11511 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene22cloneHMSJW18. 


133 


79 


11512 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


112 


68 


11513 


AF000381 


Homo sapiens 


non-functional folate binding protein 


203 


93 


11514 


X92485 


Plasmodium 
vivax 


pval 


115 


44 


1 1 C 1 c 

11515 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 
terminal, clone Xbii.oc j 


109 


51 


11516 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


141 


78 


HOI / 


A CAAAOA/1 

Ar 090594 


Homo, sapiens 


nn 117 

rivUUl 1J 


1 AO 
108 


CO 

5© 


11518 


M15530 


Homo sapiens 


B-cell growth factor 


93 


72 


1 1 cm 

11519 


G03084 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7165. 


66 


73 


1 1 <1A 

1 1520 


Ar 117065 


Homo sapiens 


male-spec in c ietnai-3 nomolog l 


200 


61 


11521 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene z SEQ ID NO:250. 


335 


87 


1 1 ^OO 
1 1 JZZ 


Ml jDjU 


— — : 

Homo sapiens 


D-ceu growm iacior 




HA 


11523 


AF1 16715 


Homo sapiens 


PR02829 


122 


64 


11524 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


145 


68 


11525 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


103 


61 



442 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Specks 


Description 


Smith- 
Waterman 
Score 


Identit 

y 








nana OO Mr»riP> WlV/fQTWl ft 

gene zz ciouc riivioj wio. 






11526 


X92485 


Plasmodium 
vivax 


pval 


111 


42 


1 IdZJ 


Y J 0495 


Homo sapiens 


rragmeni 01 n urn an sct-reieu pruicm 
encoucu. vy gene z. / . 


101 


ftO 
ou 


11528 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 


134 


58 


11529 


AF090931 


Homo sapiens 


PRO0483 


163 


76 


1 t OA 

11530 


ATI 1 Art Art 

AFl 1 9900 


Homo sapiens 


DD fY5 ftOO 




AA 


11531 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
//y5. 


126 


73 


11532 


U41038 


Caen orhabditis 
elegans 


o miliar to caanenn-iyps repeal 


196 


^0 


11533 




Mus musculus 


nuclear locauzauon signal oinamg 
protein 


107 


to 


11534 


A TA A Art OA 

AF090930 


Homo sapiens 




190 


XI 


11535 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

44/5. 


105 


64 


11536 


X92485 


Plasmodium 
vivax 


pval 


123 


56 


11537 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A A A 

4409- 


122 


72 


11538 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


108 


64 [ 


11539 


AE003834 


Drosophila 
melanogaster 


CG8054 gene product 


456 


84 


11540 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AA1Q 
44 jo. 


135 


53 


11541 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
/4i /. 


99 


70 


11542 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

HAA C 


77 


63 


11543 


G04072 


Homo sapiens 


Human secreted protem, ocxi lL * nu. 
8153. 


191 
IZj 




11544 


G03052 


Homo sapiens 


rluman secreted, protein, ont^ id rsw. 
7133. 


7R 
/o 


^ft i 

JO 


11545 


U79260 


Homo sapiens 


unknown 


10ft 




11546 


Y95028 


Homo sapiens 


Human clone vp7_l ORF2, SEQ ID 


122 


56 


11547 


X92485 


Plasmodium 
vivax 


pval 


116 


46 


11548 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


111 
ill 


JJ 


11549 


T T1 0 AT 1 

U28971 


Caenorhabditis 
elegans 


sun liar 10 i\u lanaem repeat region ox 
RD protein (nuclear ma-binding 
proicnij 


SO 


7S 


11550 


U67056 


Acanthamoeba 
casieiianu 


myosin I heavy chain kinase 


111 


44 


11551 


VI A AQ(\ 


xiomo sapiens 


rragmeni ot nuniau secreicu. piuieui 
cumucu uy genu it. 




60 


1 1 

11 JJZ 


OUU4IO 


Unmn caniAnc 

nomu bdpicnb 


Unman c^rrafaA rkrrttpiti ^PO TO 
riUiiiCLLl dCvlClCU UiULwil, ODy It/ J^iW. 

4497. 


127 


50 


1 1 <c« 
11553 


£3497j 


xiomo sapiens 


IHlPn 
lUlv^p 


470 


86 
ou 


1 1 CC/I 

11554 


A "C0 1 A/C< 1 

ArzlOoM 


Homo sapiens 


VTAHI ft 


101 


VJ 




V7Rn77 
/ ou / f 


Homo Allien*; 


ketohexolcin ase 


241 


92 


11556 


G04039 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8120. 


254 


72 


11557 


AB032976 


Homo sapiens 


KIAA1 150 protein 


565 


75 


11558 


U93572 


Homo sapiens 


p40 


127 


37 



443 



WO 01/64835 



PCTAJS01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

IdenUt 

y 


11559 


U27488 


Pseudorabies 
virus 


glycoprotein gX 


88 


38 


1 1560 


R 80005 


XlUlUU ocipiciio 


11 Ui II all oUpcrUAluC^loinUlaSC~ £ f 

polypeptide. 


1 11 
1 1 J 


03 


11561 


G00354 


Unmn <raniens 


Human ^f*pr<*t*»H nrntf»in \T\ XI /'V 

4435. 


OJ 


A5 
OD 


11562 


AF 130089 

ru i JUU07 


Haitia vmien*? 


PRO2550 


1 1 1 
ill 




11563 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: ! 
4409 

TTV7. 


146 


60 


1 1564 


AF090895 


Haitia <^nipn<; 


PRO01 17 


177 
1 /z 


77 


11566 


V01512 


Homo sapiens 


c-fos 


416 


82 


1 1567 

1 i JU / 


<558777 

OJO /XX 


Unmn cant atic 

lAUlliU dCipiClld 


jv-uiik.cu iciuiopduiy proicm 

terminal rlnnp YPH ft** I 


77 
/Z 




1 1568 


XI 6074 




T -14 nrntpin (A A 1 Jl&A\ 

Lt"JjH- piULClIl ^/vr\ i-xO*f^ 


1 17 
11/ 


15 


11569 


Y45381 


Homo sapiens 


Human secreted protein fragment 
einuucu if uiu gene zo„ 


131 


77 


11570 


AK025395 


Homo sapiens 


unnamed protein product 


683 


98 


1 1 571 
113/1 




IVIUS rnus CUIUS 


PAP 1 


1 1A 

1 J4 


OA 

oil 


1 ID /Z 


V77ft54 


xiomo Sapiens 


Human secreted protein encoded by 
gene No. 101. 


1 1A 


*7A 


1 1 571 


VA7A71 
I UZO / 1 


xiomo sapiens 


Human secreted protein encoded by 
gene zz cione aaivioj wio. 






11574 


J01415 


Homo sapiens 


cytochrome oxidase subunit 3 


141 


59 


1 1 575 


Q7Q/11 A 


Mus musculus 


nuclear localization signal binding 
protein 


1 AO 

lOz 


58 


1 1 57/£ 
1 1 J JO 




Homo sapiens 


npio i iQi 1 


409 


86 


11577 


Z22636 


Sinortiizobium 
meiuoti 


Thi 


103 


36 


1157S 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


83 


36 


1 1 57Q 


✓1A17A1 


xiomo sapiens 


riuman secreted protein, c*c\i uj in u. 


tic 
1 15 


oU 


11580 


AF162149 


Mycoplasma 
bovis 


variable surface lipoprotein 


167 


44 


1 1 5R1 
1 1 JOI 


AT?JO A A70 


Homo sapiens 


DDAOff<'7 


1 17 


77 


1 1 5sr> 

1 IjoZ 


A 1777QAA7 


riomo sapiens 


PATM XJ nmfain 

jrAUi-ri protein 


1 1C 

lib 


/5 


11583 


R10755 


Homo sapiens 


Non-A non-B hepatitis specific 
antigenic protein encoded by 
pnagecione iamoaa.riv^ZDjj. 


132 


72 


11584 


Z70292 


Homo sapiens 


chemokineCC-i 


386 


93 


1 1 5515 


P 1 A7C5 


Homo sapiens 


Non-A non-B hepatitis specific 
aiiiigcuic piAjiciu eiicoucu oy 
phageclone lambda HC2533. 


1 AT 

107 


67 


1 1586 


S5877? 


1 1UU1U ooAJlClO 


Y^linlrpH T¥*rinnnathv nrot<»in fC*— 
j\. i ii iacu icuiii^iLjaiAjjf uiuiciu 

terminal clone XF.H 8c^ 

l-wi IAIAIAQIj vlUUV ■/vJL«A JUOvj 


1 j*f 


75 i 


11587 


X05006 


Homo sapiens 


S-protein 


237 


100 


1 1588 

J 1 JOO 


Y21850 

I X l OJU 


T-fom rs osmiPTiQ 

ATAJlAlv/ OOplCUu> 


AAUlllcUi ol^llal pv|/lluC VvUUcUlUIg 

nrotein fSIGPi fclone ID 1880830^ 


^£5 


77 


1 1589 




T-fAmn Qiioipnc 


(/via " allUUwlU 




54 1 


11590 


AF030162 

ill \J-J\J L \J£~ 


A 1. 1/1111/ OOf/iVlO 


innpr mitApliAnflrifll mpmhranp 

Alilld Alill\7A>UUAIUl ICU (iltJIlUI dllt 

tran^locase Trm23 


104 
1 v/*t 


05 


11591 


X03557 


Hnmn <wnipn<: 

1 AvslAlU OuAM\/lAo 


56-KDa nrotein Caa 1-478^ 


491 


74 


] 1592 


AF151074 


Haitia *»flniftns 


HSPC240 

A AC/1 \/*TW 


151 


71 


11593 


AFl 16719 


Homo sapiens 


PR02987 


558 


89 


11594 


AF090942 


Homo sapiens 


PRO0657 


120 


41 


11595 


X55039 


Homo sapiens 


centromere autoantigen B (CENP-B) 


160 


32 


11596 


X98475 


Mus musculus 


vasodilator-stimulated phosphoprotein 


101 


43 


11597 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4670. 


140 


73 



444 



WO 01/64835 



PCT/USO 1/04927 



SEQ1D 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Identit 








Score 


y 


11598 


U07151 


Homo sapiens 


A TO T O 


1 13 


3 j 


11599 


S58722 


Homo sapiens 


X-! inked retinopathy protein {C- 


IZj 










terminal, cione AJcJri.oc/ 






11600 


A t nrnAn a 

AL050084 


. _ 
Homo sapiens 


hypothetical protein 


1 io 


1UU 


11601 


AF090931 


Homo sapiens 


rRU0483 


1Z*1 




11602 


G02532 


Homo sapiens 


Human secreted protein, SbQ ID INU. 


loZ 


ou 








6613. 






11603 


G03807 


Homo sapiens 


Human secreted protein, ixbvj u-> wu. 


1DO 


oZ 








7888. 






11604 


a W^/^ A A 1*\ A 

AF224494 


Mus musculus 


arsenite inducible RNA associated 




/y 








protein 






11605 


M60047 


Homo sapiens 


heparin binding protein 




7J 


11606 


G00933 


Homo sapiens 


Human secreted protein, oJat^ iU XNU. 


701 


Q7 

71 








50 14. 






11607 


W12091 


Homo sapiens 


C57S derivative of antigen non-specific 


j / 3 


AO 








human U1K 






11608 


AF229439 


Mus musculus 


zinc finger protein 289 


j3j 




11609 


M86637 


Brugia pahangi 


unknown 


1 f\A 

IU4 


35 


11610 


G03789 


Homo sapiens • 


Human secreted protein, SEQ ID NO: 


93 


70 








7870. 






11611 


G02224 


Homo sapiens 


Human secreted protein, SEQ ID NO. 


f AO 

jUo 










6305. 






11612 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 AO 

108 


C 1 

Jl 








6613. 






11613 


AK024372 


Homo sapiens 


unnamed protein product 


1 AC 

105 


7/ 


11614 


Y94918 


Homo sapiens 


Human secreted protein clone 


310 


oz 








dd504_18 protein sequence SEQ ID 












NO:42. 






11615 


Y48547 


Homo sapiens 


Human breast tumour-associated 


641 


96 








protein 8. 






11616 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


1 AO 
108 


Q 1 
Ol 








gene No. 107. 






11617 


L34081 


Homo sapiens 


bile acid CoA: Ammo acid N- 


356 


82 








acyltransferase 






11618 


G01399 


Homo sapiens 


Human secreted protem, SEQ ID NO. 


J /u 


Oft 

yo 








GA OA 

5480. 






11619 


AF020O38 


Homo sapiens 


NADP-depenaent isocitrate 


z/z 


53 








dehydrogenase 






11620 


AB051901 


Homo sapiens 


VDUP1 


247 


98 


11621 


Y48547 


Homo sapiens 


Human breast tumour-associated 


691 


92 








protein 8. 






11622 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 


3oZ 


II 






Accession Numoer rt/z /oo, jviemoa. 












concepmai translation suppueu oy 


















1 lozi 


AOoy// 


xiomo sapiens 


aioumin 


S62 

i/Vii 


92 


1 I /TO A 

1 1624 


X7346U 


Homo sapiens 


riDosomai protein 




84 

Or 


11625 


AL035398 


Homo sapiens 


QJ /yoll /.l {irU lAiivii novel piuieuij 


J / o 


yo 


11626 


D5031U 


Homo sapiens 


cyclin I 




1 1/1/ 


11627 


a rnnnncn 

AE000950 


Archaeoglobus 


mexmosome, suDunii aipna ^uisaj 


1^ 1 


£/ 






fulgidus 








11628 


L>o531-> 


ivius muscuius 


tetracycline iianbpurici-iiivc piuiciu 


H7 


81 


11629 


Y36204 


Homo sapiens 


Human secreted protein f?/o. 


*MJO 


inn 


11630 


AF 130089 


Homo sapiens 




1 IV 


QJ 


1 1031 


juzyoz 


n.uiuv/ oiipiciio 




95 


100 


11632 


AF1 16719 


Homo sapiens 


PR02987 


558 


93 


11633 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


92 








6613. 






11634 


U09823 


Oryctolagus 


elongation factor 1 alpha 


483 


85 



445 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 






cuniculus 










G0 1979 


rtomo Sapiens 


Mum an secreiea protein, oe\{ lu inu. 


1 ^7 

1 j / 


15 


11636 


AF1 16719 

IkJl 1 IV// 17 


T-Iotyia canSpnc 




4y l 


on 


11637 


D 14421 


Rattus 


b isotype of B regulatory subunit of 

prOW? ill pnUapflalaac 


208 


86 


11638 


AF1 16719 


Homo sapiens 


PR02987 


544 


93 


11639 




numo Sapiens 


<r.KAJZo.>Z 


177 
1Z/ 


81 


11640 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

HA A A 


150 


59 


11641 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

444Z. 


100 


66 


11642 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


119 


71 


11643 


Y86574 


Homo sapiens 


Human gene 91 -encoded protein 
tragment, otv^ \u NU.^yi. 


64 


100 




VJUUD / / 


— : 

Homo sapiens 


Human secreted protein, SbQ JUL) NO: 

40j5. 


82 


70 


11645 | 


G02360 


Homo sapiens 


Human secreted protein, SEQ ED NO: 

0441. 


109 


74 


11646 


AF130089 


Homo sapiens 


PRO2550 


128 


52 


1 1647 




Homo sapiens 


171 TAAA/f7 nvntain 

rLJUUU4/ protein 


138 


61 


11648 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

001 J. 


73 


84 


11649 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AAAQ 

444y. 


128 


60 


11650 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4318. 


83 


78 


1 1J>^1 
1 lOJl 


i JO** SO 


xiomo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


t AC 

105 


64 




ArUUUOlO 


ciryza sanva 


similar to kiino-Hz linger protein 
RHAla(AF078683) 


103 


62 


116^^ 


A T?1 O AARQ 


Homo sapiens 


purv7<<A 
JrKAJZjjU 


116 


69 


1 \6<A 
1 10>*f 


X l*H*OVJ 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


176 


54 


1 LOJJ 




nomo sapiens 


Human secreted protein, bbvj 1L) NO: 
7214. 


1 OA 

129 


63 




At?imn&o 
at uuuoy 


jtiomo sapiens 




135 


11 


11657 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


146 


93 


11658 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene zz cione timoj wio. 


123 


61 


11659 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A71 R 


117 


47 


11660 


P70494 


Homo sapiens 


Sequence of human B-cell growth 


92 


84 


11661 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

*fOJ6. , 


70 


75 


11662 


U00029 


Saccharomyces 
ccrcvisiae 


Yhr217cp 


101 


50 j 


ii6/>* 




riomo sapiens 


vjivr j , ivLctsj) / , putative transposase 

cirri ilcir tf\ r\r\crr\ ^Ipmpnt 

ouniiai lu pogu cicincui 


1 AT 


CO 

oo 


11664 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
/ i*o. 


110 


90 


11665 


U63542 


Homo saoiens 


FAP orotein 

& 4Vl l/l VlVUi 


119 


69 


11666 


AF130114 


Homo sapiens 


PR02459 


78 


38 


11667 


G03714 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7795. 


106 


64 


11668 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


66 



446 



WO 01/64835 



PCTYUSO 1/04927 



SEQQ) 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








4478. 






11669 


G03787 


Homo sapiens 


Human secreted protein, bEQ ID NO. 
7868. 


77 


6j 


11670 


G034U 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


116 


68 


11671 


AK021618 


Homo sapiens 


unnamed protein product 


97 


43 


11672 


G02482 


T T _ - 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
6563. 


108 


55 


ilo/i 


a75yy7 


Rattus sp. 


nucleoporin p62 homolog 




/I ft. 


11674 


AF229163 


Homo sapiens 


natural resistance-associated 
macrophage protein 1 


104 


54 


11675 


M15530 


Homo sapiens 


B-cell growth factor 


89 


81 


11676 


M92357 


Homo sapiens 


B94 protein 


1 At 

143 


100 


11677 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


132 


74 


11678 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


126 


64 


11679 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


166 


66 


11680 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


1 AO 

108 


85 


1 1681 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


1 A C 

145 


92 


11682 


M15530 


Homo sapiens 


B-cell growth factor 


127 


64 


11683 


AF09093 1 


r T ♦ 

Homo sapiens 


PRO0483 


152 


60 


11684 


AF130087 


Homo sapiens 


PR02411 


150 


68 


11685 


U 18339 


Variola virus 


D4L 


94 


70 


11686 


S58722 


Homo sapiens 


X-Hnked retinopathy protein {C- 
tennmal, clone XER8c} 


104 


80 


11687 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


134 


68 


11688 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


131 


70 


11689 


AF090944 


Homo sapiens 


PRO0663 1 


129 


73 


11690 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


111 


67 


11691 


U79260 


Homo sapiens 


unknown 


98 


44 


11692 


W78226 


Homo sapiens 


Fragment of human secreted protein 
encoded by^ene 1. 


361 


64 


11693 


G00962 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5043. 


269 


100 


11694 


AL3 59782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


132 


83 


11695 


Y94526 


Homo sapiens 


Human lysine-rich statherin protein. 


136 


82 


1 i dL€\C 

11696 


AF1 16637 


Homo sapiens 


T>T> /"N i A Of\ 

PRO 1489 


232 


95 


11697 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


108 


50 


11698 


VA1 1 CO 

Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


120 


O 1 

81 


11699 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


t CA 

150 


OA 

80 


11700 


A T/" A*** A A C C 

AK024455 


Homo sapiens 


FLJ00047 protein 


1 AO 

148 


68 


11701 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


131 


72 


1 1 /U2 




Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


luj 


4j 


11703 


AC005545 


Homo sapiens 


delta-adaptin, partial CDS 


127 


38 


11704 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.8c} 


143 


81 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smtth- 

Waterraan 

Score 


% 

Identit 

y 


11705 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


109 


67 


11706 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


153 


67 


11707 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


129 


88 


11708 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


89 


69 


11709 


AB047600 


Macaca 
fascicularis 


hypothetical protein 


95 


63 


1 1*71 A 
1 1710 


R96800 


Homo sapiens 


Human hisuocyte-secreted factor HSF. 


100 


76 


11711 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


112 


80 


11712 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


117 


58 


11713 


AF008196 


Homo sapiens 


bax epsilon 


140 


71 


11714 


R13556 


Homo sapiens 


Protein encoded downstream of hhcJVf 
oncoprotein. v 


113 


56 


11715 


G00397 


Homo sapiens 


Human secreted protein; SEQ ID NO: 
4478. 


125 


67 


11716 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


121 


74 


11717 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


125 


51 


11718 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


96 


81 


11719 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


106 


85 


11720 


AJ243883 


Periplaneta 
americana 


putative transcription factor 


118 


57 


1 1 H+S 1 

11721 


AF220264 


Homo sapiens 


MOST-1 


121 


74 


11722 


AF 130051 


Homo sapiens 


PRO0898 


169 


72 


11723 


K02401 


Homo sapiens 


chorionic somatomammotropin 


507 


95 


11724 


K02401 


Homo sapiens 


chorionic somatomammotropin 


461 


85 


11725 


U93564 


Homo sapiens 


p40 


340 


90 


11726 


AF130089 


Homo sapiens 


PRO2550 


107 


73 


11727 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO; 
4457. 


128 


79 


11728 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


109 


90 


11729 


Y51824 


Homo sapiens 


Human OSBH protein. 


182 


89 


11730 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


107 


80 


11731 


K02401 


Homo sapiens 


chorionic somatomammotropin 


459 


94 


11732 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


93 


42 


11733 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


111 


67 


1 1 T1 4 

H734 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


103 


43 


H735 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


98 


75 


H736 


AF090931 


Homo sapiens 


PRO0483 


123 


88 


1 1 TT7 

11737 


K02401 


Homo sapiens 


chorionic somatomammotropin 


340 


92 


11738 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


73 


1 1710 

i i toy 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
6705. 


131 


57 


11740 


X55695 


Lycopersicon 
esculentum 


glycine-rich protein 


164 


48 


11741 


AK024455 


Homo sapiens 


FLJ00047 protein 


89 


77 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
oturc 


% 

Idendt 


11742 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded bv eene 2 SEO ID NO* 125 


415 


90 


J 1743 


S58722 


Homo sapiens 


X-liiiked retmopathy protein {C- 
tprminal clone XEH 8c} 


105 


94 


1 17>M 


^R7?7 
OJO /zz 


llLJlllL/ OdjJldXD 


X-linked retinOT»aihv nrotein {C- 
terminaL clone XEH 8c) 


116 


87 


11745 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


131 


69 


11746 


W13831 


Homo sapiens 


Human cyclin E delta 9 mutant 


140 


52 


1 1747 






rnrttnr Homain of TfTFI 4 

ULUIKJL UUlllcUU vl 1V11 1 « 


179 


84 


11748 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7409 


141 


72 


1 1 7AQ 


Vn 1 AAA 


xiomo sapiens 


QpfrpfpH nrAfpin pncArfftrt 1 hv oene 1 R 
clone HNHF029 


99 


63 


11750 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


88 


64 


11751 


AF090931 


Homo sapiens 


PRO0483 


118 


68 


1 1 T?') 

• 1 1 /oz 


ATI CA1 AC 

AT OU1UD 


xiumo sapiens 


email Tinf Tirnypf^lilfi* r\rfvt"f* i n 


194 


68 


11753 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

HjVO. 


127 


69 






riomo Sapiens 


Human cprrf>tp/i nrAtein SFO ID NO* 
6613 


100 


80 


11755 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870 


109 


65 


11756 


K02401 


Homo sapiens 


chorionic somatomammotropin 


427 


87 


1 1 7C7 
1 1 Of 


TTCOA77 


xiomo Sapiens 


m^rinpr tmncnnosKP 
JllOlllivi UuJOpvdcuC 


113 


82 


1 1 TCft 
1 1 / jo 


vjUUj 5Z 


xiomo sapiens 


Hitman cprrptftd nrAtein 5sHO ID NO* 

IlUJ 1 lull 2kGlsiGlGU JJli/LClAl, ULA^ I.JLJ 

4463 


133 


54 


11759 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


111 


80 


1 1 f 01/ 




riuuiu sapiciis 


PRO0663 


143 


76 


1 17/51 
1 1 fOl 


mfti 19 




NADH dehydrogenase subunit 6 


529 


99 


11762 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868 


171 


58 


1 17/\* 


AP1 19900 


riv/iiiu sapiciis 


PRO 1751 


127 


66 


11764 


AK025116 


Homo sapiens 


unnamed protein product 


141 


80 


11 fOJ 




riomo Sapiens 


Human cp-rrpfpH r*rntf»in &FO ID NO - 
IxUIIlalL oCUlCLCU piUlCUl, OC»Vt *'V/. 

4435 


75 


62 


11766 


AF090930 


Homo sapiens 


PRO0478 


106 


59 


I 17A7 

I I fOf 




riuiiiu sapiens 


Human secreted nrotein SEO ID NO* 
4409 


127 


80 


1 17a"K 

1 1 fOO 


AT V/7U7J X 


Haitia *cflnif*nc 


PRO0483 


164 


60 


11769 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH. 8c} 


102 


81 


11770 


AF161356 


Homo sapiens 


HSPC093 


131 


68 


1 1771 


AFi^nnXQ 


Haitia <uiniPTi^ 


PRO2550 


160 


82 


1 1777 


V144R9 

I lH*tOZ 


Haiti a <mnipn^ 


Frapment of human secreted nrotein 
encoded by gene 17. 


140 


79 


1 1773 
1 1 1 / j 




Homo ^aniens 


Human secreted protein, SEQ ID NO: 
4518. 


136 


71 


1 177d 




Homo saoiens 


PRO0657 


153 


78 


11775 


AJ223410 


Homo sapiens 


EBIl-ligand chemokine 


108 


52 


1 177£ 
1 I / /O 


^09401 
IVl/Z'tVl 


Hattia <winipnc 
nuuiu doiJiGud 


cnoriAnic <tAmfltAmflmmotrooin 


395 


91 


11777 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


107 


70 


1177S 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


99 


62 


11779 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


84 


64 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


11780 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


1 16 


68 


11781 


AF 194537 


Homo sapiens 


NAG13 


201 


65 


11782 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


139 


68 


11783 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


149 


59 


11784 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


112 


78 


11785 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


99 


68 


11786 


AF130089 


Homo sapiens 


PRO2550 


10S 


83 


11787 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


124 


88 


11788 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


103 


100 


11789 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


146 


60 


11790 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


163 


73 


11791 


AL035494 


Homo sapiens 


dJ635G19.23 (novel protein 
(PUTATIVE PARTIAL isoform 3)) 


106 


56 


11792 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


100 


66 


11793 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


125 


61 


11794 


A20770 


synthetic 
construct 


(Aspl-Thrl61>EPI 


95 


76 


11795 


S74221 


Homo sapiens 


IK factor=cytokme down-regulating 
HLA class II 


108 


61 


11796 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


96 


77 


11797 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c) 


111 


65 


11798 


Y75995 


Homo sapiens 


Human skin cell protein, SEQ ID 
NO: 173. 


324 


87 


11799 


J03071 


Homo sapiens 


chorionic somatomammotropin CS-5 


120 


75 


11800 


L00693 


Homo sapiens 


carcinoembryonic antigen 


96 


85 


11801 


AL 160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


80 


44 


11802 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


125 


64 


11803 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


142 


82 


11804 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


107 


85 


11805 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


152 


80 


11806 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


122 


61 


11807 


U91985 


Homo sapiens 


DNA fragmentation factor-45 


585 


92 


11808 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


125 


60 


11809 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

oro rr> XT/""\. 1 Hi 

bJbv2 UL»NU:io3. 


170 


89 


11810 


Y87156 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 195. 


191 


76 


11811 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


120 


100 



450 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Sroith- 

Waterroan 

Score 


% 

Identit 

y 


11812 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4738. 


156 


55 


11813 


P92219 


Homo sapiens 
(human) 


CR1 protein. 


121 


84 


,11814 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


107 


75 


11815 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


74 


78 


11816 


M15530 


Homo sapiens 


B-cell growth factor 


148 


68 


11817 


AF090942 


Homo sapiens 


PRO0657 


139 


60 


11818 


W90847 


Homo sapiens 


Human lymphocyte targeted peptide 
#15. 


117 


100 


11819 


AP000616 j 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


133 


73 


11820 


Z72853 


Saccharomyces 
cerevisiae 


ORF YGR069w 


94 


42 


11821 


G03710 


Tlomo sapiens 


Human secreted protem, SEQ ID NO: 
7791. 


176 


65 


11822 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


60 


11823 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1037. 


104 


68 


11824 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


99 


51 


11825 


W90172 


Homo sapiens 


Human heart muscle specific protein. 


256 


34 


11826 


AB017007 


Homo sapiens 


PMS2L16 


248 


89 


11827 


AL1 60371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


127 


86 


11828 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


135 


71 


11829 


AF1 19851 


Homo sapiens 


PRO 1722 


129 


60 


11830 


AF132200 


Homo sapiens 


PR01751 


120 


77 


11831 


Y08061 


Homo sapiens 


Human c-myb protein fragment 


133 


83 


11832 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


128 


64 


11833 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


152 


100 


11834 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


156 


73 


11835 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


155 


88 


11836 


AP000616 


Oryza sativa 


similar to R1NG-H2 ringer protein 
RHAla (AF078683) 


134 


95 


11837 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 


411 


91 


11838 


AK000496 


Homo sapiens 


unnamed protein product 


113 


78 


11839 


AF119855 


Homo sapiens 


PRO 1847 


84 


68 


11840 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


128 


88 


11841 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


113 


69 


11842 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


151 


55 


11843 


AF090942 


Homo sapiens 


PRO0657 


139 


65 


1 1844 


AF220264 


Homo sapiens 


MOST-1 


oo 

88 


69 


11845 


AF184612 


Drosophila • 
melanogaster 


split ends 


95 


47 


11846 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


136 


82 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


11847 


AB017007 


Homo saoiens 


PMS2L16 


310 


ion 


11848 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


100 


79 


11849 


W03642 


Homo saniens 


Human cannabinoid GPR Isl-terminal 
sequence. 


111 


61 


11850 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


110 


72 


11851 


AF090931 


Homo sapiens 


PRO0483 


164 


80 


11852 


AF 116661 


Homo ^aniens 


PRO 143 8 


124 


67 


11853 


G03807 


Homo aniens 


Human secreted orotein SFO ID NO* 
7888. 


1 19 ! 


77 


11854 


G02872 


Homo sianien^ 

xxuxuu oaxyxuxo 


Human secreted orotein SFO ID NO* 
6953. 


155 


SO 

J" 


11855 


UUJJJL 


Homo vinipn^ 


<;impr cv<*tf»inp rich nrotpin* ^PRP 


mo 








xxuiiiv ocxpicixo 


Hitman QprrptpH nrntpin ^PO ID NO* 

XxUJlKUl OCUCICU UIUICUI, <OIZ>\£ XXJ V\\J» 

4414. 






11857 


G03714 

VJV/-/ / It 


I lVJiiivf Dcx^riciio 


l-Tiimjm cpprptpH r»mtf»"n RFO TTi MO* 
rj.Uillu.il dCUClCU LrlVli'wlll, OCV^ \XJ liv. 

7795. 




6R 


1 1858 


AK0944S5 


T-Trimft canipnc 


Fr J00047 nrotpin 

JT L/J UVfLr-r / Ul "JIG 111 




60 


11859 


W48351 


Homo sapiens 


Human breast cancer related protein 

RPRR7 


94 


66 


11860 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHFO70 

VIVsllW IXXfXXX V_/X.7. 


94 


66 


1 1861 

1 l OKI 1 


Y36495 




Frsiompnfr f\r tiiiirmn cprrf*t(*H nrfrftfMn 

1 lagllxwlll VJl lllXUlOll dCWVk(/U piVILCUl 

encoded bv eene 27 


1 I/O 


7S 


11862 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


108 


86 


11863 


AF1 30050 


Homo ^nnienq 

X XXSXJXxJ OCIXSlvXXO 


PRO0872 


si 

O 1 


JO 


11864 


AF1 19900 


Homo sapiens ^ 


PR02822 


152 


90 


1 XO\Jj 




xlUIIIU oapiCUb 


Hum tin nrrttotn QFO TTi MO* 

xTuiiiaii scvieicu pruicin, ocy xl/ inva 
6619. 


7« 
/ o 


**j 


1 1 ouu 




xiVJiixv/ actpiciia 


il/lllV bUlilalUlllallllllULI UUU1 V^O**^ 




0^ 


11867 


AP000616 


Oryza saliva 


similar to RING-H2 finger protein 


123 


80 


11868 


G00591 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

467? 


119 


58 


11869 


Y10830 


Homo sapiens 


Amino acid sequence of a human 

QpprptftH nrotpin 


246 


100 


11870 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7124 
/ lit. 


101 


59 


11871 


Y86248 


Unmn cjinipn^ 


Human secreted nrotpin HCHPF68 

XTXm IIOIl owl VlvU lJIXJI&XXI XXwXXX x UO) 

SEOIDN0163 






11872 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


129 


84 


11873 


D28113 


Homo sapiens 


MOBP 


45S 


80 


11874 


D28114 


Homo saniens 


MOBP 


306 


68 


11875 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


101 


85 


11876 


Y02886 


Homo <winien<: 

Xlvlllv OCIUX^XXO 


Frapmenf cyf human secreted nrotein 

X X PRI * 'Vut vl XlitXXXCIXX dbVlWlvU L/i ULt. Ill 

encoded by gene 90. 


X Iv 


64 

v*r 


11877 


Y86248 


Homo <mnien<i 

XXAJllXVr tXI^JXvXXO 


Human secreted orotein HCHPF68 

xxuxxicux avwi^ivu yji\n&xix iiv/iuri vo, 

SEQIDNO:163. 


1X9 


57 


11878 


Ml 5530 


XXUXXiU DOUlCltO 


R«<"»pll orirtwtfi fuotnr 

U V6U glUVTUl Ldvlvl 






11879 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


130 


100 


11880 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


99 


70 


11881 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


101 


53 



452 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- . 

Waterman 

Score 


% 

ldentit 

y 


1 1 OOjL. 


VJI/UJ7Z 


iTOniO Sip 1C 11a 


Human vrTPtpH nrotPin SPO IF^ Mf"V 

4473. 


710 

^17 


75 


i lain 


vJVVrt-JO 


rioroo sapiens 


Human <f*rrptf^r1 nrntpin QPH TY\ \Tf"V 
iiUlllaU. od/icivu pLvicm, aCy IL/ iiv/. 

4497. 


1^1 
I J 1 


Al 
Ol 


11884 


U93563 


Homo sapiens 


putative pi 50 


139 


75 


1 lOOJ 


Rft7ft57 

ISA/ / VJ / 


xlUIUD boplcllo 


CmallpT" hpnatnr^llnlar rtnr»rtr\mf*»in 
Oiiiaiiw iicuaxuvdiuiai v/uuupi uiciii 

(hhem) gene preoduct 


1 Sd. 
1 J** 


57 


1 

1 looo 


a en 00.044 


MAtnA com an c 

fioiiiu sapiens 






5U 


1 lOOA 


/Irvl/V/OlO 


v/ryza saxiva 


bii ii lieu uj ixiiNVj-iXi uiigcr pruicm 
RHAla(AF078683) 


107 
IV/ 


7*1 j 


i looo 




jrioino sapiens 


Unman rM"^ocf pnnAAr nrnt^in 

nuxnaji urcaM uauccr r cjaicu pruiciii 
BCRB2. 


1^R 


^7 


1 1 RRQ 

1 l007 


T 7£053 


riomo Sapiens 


ClirOJllvJoOIIlal piv) LCI 11 


17^ 


75 


11890 


U16359 : 


Rattus 
norvegicus 


nitric oxide synthase 


105 


69 


1 1 by I 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 




OJ 


1 1 QQO 




Homo sapiens 


Non-A non-B hepatitis specific 
antigenic protein encoded by 
pnagecione jamDua hl*z j j j. 


lzo 


/o 


11893 


G00481 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

45£7 


82 


70 


11894 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
enn in xiyy i k\ 

oJcA^ 1U IN 1 0 J . 


149 


90 


1 1 205 
I 157j 


^^3777 
ojo/Zi 


rionio sapiens 


AriinKjea reunopauiy protein, 
terminal, clone XER8c} 


1 15 


75 


1 Ioto 




nomo sapiens 


ri urn dJi sccieiea protein, ocy i\v/. 
6532. 


1 A7 

lot 


50 


1 1 £07 


UJ)51 1Z 


jiomo sapiens 


riJxjjri uenyajugenase suDunit o 


1J>0 


77 


11898 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AAtA 


126 


58 


11899 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene no. iu 1 . 


158 


78 


11900 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
lermiiiai, cione Ai^n.oc/ 


137 


71 


11901 


GO0613. 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
ivy**. 


85 


62 


11902 


G02558 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OQj7, 


242 


73 


11903 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/oOo. 


127 


59 


11904 


G03472 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7SST 
/ jjj. 


182 


85 






Wnmn canipn c 
XIUIUU oopiCIlo 


j\ 1H1A.OL1 icLuiupouiy piuiciu 
terminal cIotip YPH Rd\ 


115 


75 


11906 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 
terminal clone XFH %c\ 

t^l 11 Illicit, ^IV/li^ /VLtll.Ovj 


128 


79 


11907 


U93574 


Homo saoien^ 

X x\JXXl\J JUUlvllg 


nutative n 1 SO 


169 


64 


11908 


AF090944 


Homo ^aniens 


PRO0663 


128 


75 


11909 

1 4 7V7 


G02485 


Homo ^arnf*n^ 

11VJAUVI tXUJIwtlo 


Human <«creted nrotpin *nFO TD NO* 

mil lion jwwivibu \ji \J Lt/Uij o i— >V7 jll/ i> Vw/ . 

6566. 


100 




1 1910 

117 IV 


AF090930 


Homo <mnif*n^ 


PRO0478 


159 

l«/7 


76 






x lvlilU dOl/lCllo 


r*h ni*inni<* crvm 5*tr>m mmntrnnin f*^Q_7 

krllUl lV/lllls oUl 1 1 <3.L«J 111 all 11 11 VJLL ^JJ^Jlll Vj-ji 


*+7U 


oo 


1 191? 

117 LZ. 




xxuiuu oapicno 


UIUllUUlU bUluQUJUlQlIllllUilUpiU V^vJ z. 


551 
_) J 1 


07 

7^ 


11913 


Y87064 


Homo <?anipns 


Human secreted nrotein ^eouence SKO 
IDNO:103. 


168 


76 


11914 


K02401 


Homo sapiens 


chorionic somatomammotropin 


450 


85 


11915 


K02401 


Homo sapiens 


chorionic somatomammotropin 


478 


87 


11916 


K02401 


Homo sapiens 


chorionic somatomammotropin 


454 


82 



453 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


1 1017 

117 1/ 


VJUZJOZ 


norno sapiens 


Human secreted protein, SEQ ID NO: 

0*tOj. 


121 


73 


1 1918 


X65171 


l\/fiic miicf*iiin<i 

iVlUJ IJJLOV'UiUD 


aipnai conagen 


1 1 o 

llz 




11919 


Y45382 


Homo sapiens 


Human secreted protein fragment 
encoaeo rrom gene zo. 


115 


62 


11920 


AK024455 


Homo sapiens 


FLJ00047 protein 


102 


76 


1 10*71 




iiomo sapiens 


small zinc finger-like protein 


1 O 1 

lol 


52 


11922 


U55376 


Caenorhabditis 
eiegans 


F16H1 1.2 gene product 


122 


78 


j i yzj 




riomo sapiens 


Human secreted protein, SEQ ED NO: 


256 


80 


1 1Q7A 
l l;fz*f 




UnmA c onion c 

xiuiinj sapiens 


jtiuman secretea protein, oc-v tu nu. 
7492. 


1 07 


74 


1 1Q7S 




T-T/MYirt coniAnc 

nouiu hapicnb 


PP Cif\ A#C5 
rKAJUOO;) 


AAA 
144 


75 


11926 


AJ223475 


Escherichia coli 


InsA protein 


477 


100 


1 17 J. 1 




jiomo sapiens 


gamma-globin 


616 


92 


1 10751 




Homo sapiens 


Human secreted protein, bEQ ID NO: 

/o/l. 


105 


72 


1 1090 


ceo 700 
ojo /zz 


— : 

Homo sapiens 


X- linked retinopathy protein {C- 
terminal, clone XEH.8c} 


122 


92 


1 101ft 
i lyovi 




xiomo sapiens 


nn rvft iin 
rKUU 11/ 


80 


71 


11931 


AF130089 


Homo sapiens 


PRO2550 


129 


68 


1 1010 


AKUZDi Id 


Homo sapiens 


unnamed protein product 


113 


53 




iyi4->Z 


Homo sapiens 


Human secreted protein sequence 
encoded by gene z 1U NO: 125. 


114 


70 


1 l7j*f 




— : : 

Homo sapiens 


Human secreted protem, bbQ ID NO: 

A£*.9 
40Do. 


69 


76 


11935 


AFl 16695 


Homo sapiens 


PR02221 


336 


70 






Homo sapiens 


JrKJJU4o3 


133 


77 


11937 


AF130089 


Homo sapiens 


PRO2550 


135 


72 


1 1 Gift 




Homo sapiens 


microtubule-associated protein IB 


143 


36 


11939 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


124 


72 


11940 


000427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/if AO 

4508. 


158 


76 


11941 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


130 


53 


11942 


Y14480 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


111 


65 


1 1 QA1 


fTAO/IGA 
OUz45U 


-— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 

0001. 


106 


59 


11944 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

QT70 m XTTV1 £3 


117 


65 


l lynj 


H7X1 11 

lA£o Ilj 


nomo sapiens 


SYlKJDr 


loo 


oz 


1 104/; 

1 17tU 


TORI 17 

l/JO 1 1Z 


noiuQ sapiens 


iN/vuri aenyorogenase suDunn 1 




/o 


11947 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
proiein nogrneni jy. 


113 


72 


1 1QAX 


tori 17 

Uj&I 1Z 


nomo sapiens 


iN/vLijci aenyorogenase suDunii o 






11949 


AJ006591 


Homo sapiens 


cysteine-rich protein 


148 


76 


1 10-^ft 

l iyjv 


OJO /ZZ 


Homo sapiens 


X- linked retinopathy protein {C- 
wnnuiaiy cione Acxi.oC/ 


11J 


88 


11951 


AFl 16695 


Homo sapiens 


PR02221 


205 


57 


1 1 0<9 


i y juzo 


Homo sapiens 


Human clone vp/_l ORrz, obQ ID 

1NVJ.1Z5. 


ICO 

158 


75 


11953 


AFl 13685 


Hnnift "cranipn^ 

H-\JHl\J j<U/lwllO 


PRO0974 


107 


77 
/ z 


11954 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


107 


80 


11955 


AK026107 


Homo sapiens 


unnamed protein product 


129 


84 


11956 


AF109907 


Homo sapiens 


S164 


140 


96 



454 



WO 01/64835 PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 
y 


11957 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIR;A41726) 


99 


100 


11958 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


124 


41 


11959 


K02401 


Homo sapiens 


chorionic somatomammotropin 


497 


90 


11960 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


155 


69 


11961 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


126 


85 


11962 


M15530 


Homo sapiens 


B-cell growth fector 


114 


86 


11963 


K02401 


Homo sapiens 


chorionic somatomammotropin 


472 


87 


11964 


AK024455 


Homo sapiens 


FU00047 protein 


95 


61 


11965 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


145 


78 


11966 


L2742S 


Homo sapiens 


reverse transcriptase 


144 


50 


11967 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


187 


76 


11968 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCS 1 1 037. 


124 


81 


11969 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


112 


64 


11970 


AK024455 


Homo sapiens 


FLJ00047 protein 


120 


70 


11971 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


105 


61 


11972 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


106 


70 


11973 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


70 


75 


11974 


G00491 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4572. 


86 


60 


11975 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


161 


72 


11976 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


105 


42 


11977 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


121 


58 


11978 


AK024372 


Homo sapiens 


unnamed protein product 


104 


80 


11979 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.8c} 


137 


77 


11980 


S58722 


Homo sapiens 


X-lmked retinopathy protein {C- 
terminal, clone XER8c} 


96 


90 


11981 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


100 


56 


11982 


R59842 


Homo sapiens 


ApoE4Ll protease. 


129 


62 


11983 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


132 


53 


11984 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


105 


90 


11985 


M15530 


Homo sapiens 


B-cell growth factor 


117 


43 


11986 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


99 


65 


11987 


R59843 


Homo sapiens 


ApoE4Lx2 protease. 


117 


84 - 


11988 


L26953 


Homo sapiens 


chromosomal protein 


124 


79 


11989 


LU93564 


Homo sapiens 


p40 


523 


88 


11990 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQID NO: 163. 


155 


82 


11991 


AF090894 


Homo sapiens 


PRO0113 


126 


56 


11992 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


129 


63 


11993 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


144 


67 


11994 


U71363 


Homo sapiens 


zinc finger protein zfp6 


213 


68 



455 



WO 01/64835 



PCT/US01/04927 



SEQ n> 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 

y 


11995 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

471 R 


82 


65 


11996 


AF130089 


Homo sapiens 


PRO2550 


145 


68 


1 1007 


DQCQl? 


xiorno sapiens 


TV! £M » 1 tn#>AAf4 nf/Ntaf m 

iNeurai uxreaa protein. 


1 1 A 

114 


88 


1 100R 
1 \yyo 


VJUZjjZ 


no mo sapiens 


riuman secreteo protein, ocl^ lu nu: 

OO ID. 


lzz 


56 


11999 


X62677 


Oryctolagus 


retrovirus related reverse transcriptase 


147 


81 


12000 


AF090931 


Homo sapiens 


PRO0483 


152 


81 


1ZUU1 




Homo sapiens 


rKUU4oi 


145 


O 1 

81 


12002 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/ooo. 


124 


64 


12003 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XbJtLoc} 


103 


78 




Ut/J /14 


Homo sapiens 


Human secreted protein, obQ ID NO: 

OOOC 
/ /SO. 


133 


51 


lonrK 

ILKJVj 




tiomo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


1 AO 

lOo 


oi 


12006 


L19527 


Homo sapiens 


ribosomal protein L27 


547 


93 


lOAAO 


Y? OA BAA 


Homo sapiens 


Human nistiocyte-secreted factor Hoc . 


127 


oo 

82 


lOAAO 




Homo sapiens 


Human histiocyte-secreted factor HSF. 


11*7 

127 


oo 
8Z 


lOAAQ 


JMUJ34U7 


Homo sapiens 


pancreatic tumor-related protein 


OCT 


oo 
yo 


12010 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


64 


ioni i 

1ZU 1 1 


TV* £ 1 1 9 
UOOl XL 


riomo sapiens 


NADH dehydrogenase subunit 4L 




on 


12012 


X03717 


Homo sapiens 


pot. unidentified reading frame 


126 


43 


lOAlO 


O C OOOO 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XJbrLoc) 


131 


80 


1ZU 1** 


A 13 A 1 1 1/f C 


: : 

Homo sapiens 


jkjlaauj/o protein 


135 


84 


12015 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
lerminaiy cione Ac.ri.oC/ 


114 


87 


izuio 


A VAO^« 
AJSAJZ4455 


— — ; 

Homo sapiens 


T7T TAAAvIO rvrrttain 

JrUUUU4/ protein 


1 AO 


OA 

7U 


1901 7 
1ZU 1 / 


JLZ /4Z5 


jtiomo sapiens 


reverse Transcriptase 


1 A A 

144 


oo 
o/ 


12018 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

443/. 


135 


87 


12019 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


137 


93 




AivUUlioi 


Homo sapiens 


unnamed protein product 


490 


100 


12021 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene INo. 1U/. 


111 


79 


12022 


W00838 


Homo sapiens 


Tumour necrosis factor-related gene 
product 


106 


87 




WolHUO 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


1 CO 

157 


CO 

58 


1ZUZ4 


UU J 1 /Z 


riomo sapiens 


numan secreiea protein, oc,^ jlu vhkj. 
7253. 


90 
5/ 


00 


iZUZ » 


jtooyi J 


riomo sapiens 


iNeurai mreau protein. 


1 CO 
IJf 


45 


izuzo 


IjUU5j4 


riomo sapiens 


riuman secreiea protein, onv^ iu in\j. 
4435. 


Ijo 


<0 


1ZUZ/ 


flf\(\A C/f 

UUU454 


Homo sapiens 


Human secreted protein, oc\l ID tNU. 


llo 


iCA 

oO 


12028 


V00488 


Homo sapiens 


alpha globin 


191 


100 


i9mo 
izuzy 


Y£0£7O 


urycioiagus 
cunicuius 


retrovirus reiatea reverse transenptase 


1 AO 


CO 
Of 


12030 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


114 


79 






WfMTl A COt-%1 c 

jnuiiiu oapiCUb 


lJ.UIIla.ll bCWClCU prUlCLU, OJJ»V^ i-L/ Liu. 

5327. 


id? 


oo 


12032 


X01703 


Homo sapiens 


alpha-tubulin 


534 


93 


12033 


U09823 


Oryctolagus 
cuniculus 


elongation factor 1 alpha 


557 


90 



456 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








W alcruum 


Identit 










Score 


y 


12034 


X01703 


Homo sapiens 


alpha-tubulin 


619 


94 


12035 


AF255556 


Notdthenia 


alpha tubulin 


430 


85 






coriiceps 








12036 


X53618 


Paracentrotus 


alpha-tubulin (AA 1-452) 


437 


87 






lividus 








12037 


M62810 


Homo sapiens 


mitochondrial transcription factor 1 


433 


95 


12038 


Y36156 


Homo sapiens 


Human secreted protein #28. 


157 


75 


12039 


X01703 


Homo sapiens 


alpha-tubulin 


604 


97 


12040 


X01703 


Homo sapiens 


alpha-tubulin 


517 


90 


12041 


S70154 


Homo sapiens 


cytosolic acetoacetyl-coenzyme A 


533 


94 








thiolase, CT {EC 2.3.1.9} 






12042 


X01703 


Homo sapiens 


alpha-tubulin 


472 


85 


12043 


M23613 


Homo sapiens 


nucleophosmin 


428 


80 


12044 


AL031174 


Schizosaccharom 


hypothetical protein 


102 


52 






yces pombe 








12045 


Y94653 


Homo sapiens 


Human netrin-like protein (NEL) 


152 


82 








amino acid sequence. 






12046 


AF016507 


Homosapiens 


C-tenninal binding protein 2 


172 


100 


12047 


X05196 


Homo sapiens 


aldolase C 


454 


93 


12048 


X03796 


Mus musculus 


aldolase C (aa 1-227) 


351 


74 


12049 


X05196 


Homo sapiens 


aldolase C 


408 


88 


12050 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 


147 


70 








clone HNHF029. 






12051 


S58722 


Homosapiens 


X-linked retinopathy protein {C- 


169 


55 








terminal, clone XEH.8c} 






12052 


AF090944 


Homo sapiens 


PRO0663 


119 


59 


12053 


V01 577 


Homo sapiens 


variable region 


223 


93 


12054 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


74 








7492. 






12055 


AF090931 


Homo sapiens 


PRO0483 


121 


74 


12056 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


76 








6983. 






12057 


U18339 


Variola virus 


D4L 


100 


52 


12058 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


135 


62 


12059 


AB001684 


Chlorella 


ORF41c 


81 


75 






vulgaris 








12060 


Y02785 


Homo sapiens 


Human secreted protein encoded by 


128 


74 








gene 51 clone HUKEX85. 






12061 


AK024455 


Homo sapiens 


FU00047 protein 


70 


46 


12062 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


69 








4658. 






12063 


AF090944 


Homo sapiens 


PRO0663 


110 


66 


12064 


Y14480 


Homo sapiens 


Fragment of human secreted protein 


109 


53 








encoded by gene 14. 






12065 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


79 








7881. 






12066 


AF090931 


Homo sapiens 


PRO0483 


123 


70 


12067 


G00365 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


111 


75 








4446. 






12068 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


77 








4658. 






12069 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


145 I 


76 | 








7253. 






12070 


G03639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


81 








7720. 






12071 


AF116715 


Homo sapiens 


PR02829 


138 


63 


12072 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


127 


82 


12073 


U79260 


Homo sapiens 


unknown 


98 


73 


12074 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


132 


80 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Score 


% 

Identic 

y 








clone HPMBQ32. 






12075 


J02459 


bacteriophage 
lambda 


T (tail component; 144) 


555 


98 


12076 


AK024455 


Homo sapiens 


FU00047 protein 


127 


65 


12077 


X00911 


Rattus 
norvegicus 


pot MSA-precursor 


190 


73 


12078 


AF164797 


Homo sapiens 


ribosomal protein L17 isolog 


538 


95 


12079 


A06977 


Homo sapiens 


albumin 


602 


87 


12080 


AF000198 


Caenorhabditis 
elegans 


Similar to cuticular collagen 


118 


38 


12081 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


142 


86 


12082 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 linger 
protein rhala. 


103 


50 


12083 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


142 


86 


12084 


G00413 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4494. 


64 


54 


12085 


G02532 ! 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


132 


53 


12086 


A14829 


Homo sapiens 


preproapolipoprotein 


619 


82 


12087 


AF 119900 


Homo sapiens 


PR02822 


107 


76 


12088 


U52077 


Homo sapiens 


mariner transposase 


277 


74 


12089 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


126 


100 


12090 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


134 


92 


12091 


W05278 


Homo sapiens 


Tumour necrosis factor-related gene 
product CL6.5.40. 


115 


42 


12092 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


112 


81 


12093 


G03343 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7424. 


114 


64 


12094 


AJ237660 


Bacteriophage 
21 


Ren protein 


187 


94 


12095 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


135 


80 


12096 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQEDNO:163. 


161 


49 


12097 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.8c} 


122 


92 


12098 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


148 


59 


12099 


AF090942 


Homo sapiens 


PRO0657 


115 


70 


12100 


AF090931 


Homo sapiens 


PRO0483 


112 


74 


12101 


AF090944 


Homo sapiens 


PRO0663 


103 


37 


12102 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


76 


58 


12103 


J02963 


Homo sapiens 


platelet glycoprotein lib precursor 


130 


80 


12104 


AL160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


106 


84 


12105 


R95913 


Homo sapiens 


Neural thread protein. 


105 


75 


12106 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


149 


75 


12107 


Y09561 


Homo sapiens 


ATP receptor 


188 


85 


12108 


AF090895 


Homo sapiens 


PRO0117 


107 


83 


12109 


G03797 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7878. 


117 


65 


12110 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


65 
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SEQI0 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 

y 








£909 






12111 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


69 


12112 


Y12478 


Homo sapiens 


congenital heart disease 5 protein 


291 


62 


1Z J ID 


A F99ft9£4 
/\rzZUzo*f 


Homo sapiens 


MUS 1-1 


142 


76 


12114 


Y12077 


Homo sapiens 


Human 5* EST secreted protein SEQ ID 

xiw ion 


109 


95 


12115 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
I4yl„ 


171 


63 


12116 


AF064869 


Rattus 
norvegicus 


brain-enriched guanylate kinase- 
associated protein z; dcxjAZ 


294 


98 


191 17 




ivius museums 


nuclear localization signal binding 
protein 


IzS 


36 


191 1Q 

IZ 117 


OH 1 0514 
vju l "oh 


nomo sapiens 


riurnan secretea protein, ocvj iiJ iNU. 
6065. 


lOz 


73 


1Z IZV 


1 1709 An 
U /IrzOv 


Frist ponti>nf> 

xiomo sapiens 


unknown 




03 


12121 


AF161356 


Homo sapiens 


HSPC093 


118 


67 


izlzz 


AT I jOOo/ 


Homo sapiens 


PR02411 


115 


80 


IzlzJ 


vjvUjyz 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473. 


o o n 

280 


75 


19 19v1 
lziz<t 


CM fMr\£ 


Mus sp. 


alpha 1 (XII; collagen {triple-helical 
domain COL2} 


59 


41 


1919^ 
izlZJ 


AoUiUolO 


Homo sapiens 


H-KbV 107 protein-related protein 


420 


79 


12126 


R96418 


Homo sapiens 


Partial human transforming growth 
factor beta receptor type II. 


169 


100 


12127 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

TOO 


91 


46 


19 19ff 
izlzo 


I 1446Z 


— — ; 

Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


109 


83 


19190 

lzizy 


flA19fc9 
VjUJ /o / 


Homo sapiens 


Human secretea protein, obQ ID NO: 

/oOo. 


127 


57 


12130 


AFl 19900 


Homo sapiens 


PR02822 


10S 


79 


19 1*51 
iZi Jl 


I o0z4o 


Homo sapiens 


Human secretea protein HCHPF68, 

ccrv it\ xTfVl 
orAs£ ID rNVJ.lOj. 


127 


70 


191^9 
iz l Da 




~tu? 5 

Mus musculus 


UDc-nxe protein mmoz 


to 1 
131 


43 


12133 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

900Q 
/OOO. 


127 


58 




I T40094 


— ; 

Homo sapiens 


manner transposase 


124 


OA 

80 


IZI JJ 


YC91 £4 
WJZ 1 Dt- 


xvius muscuius 


v£juo protem (aa i-/ /) 


in/ 

106 


94 


IZ 1 DO 


nn9^9 

VJUZJ JZ 


nomo sapiens 


jtiuman secretea protem, ocxi iijmu. 


JDl 


^A 


12137 


AF192913 


HAmn cutmptio 


zinw linger pr uicin z*iNr lou 


Do! 




12138 


X58907 


Homo sapiens 


steroid 2 1 -monooxy genase 


128 


58 


12139 


AF09093 1 




PRrW14)2^ 

riVvAJHOj 


191 
lz J 


/U 


12340 


AK024455 


Homo sapiens 


FLJ00047 protein 


84 


60 


19141 

IZ IHl 


AFnono^i 

/trv7U7j i 


nomo sapiens 


Pi>nfi4si 

JrlvVJU*foJ 


lzz 


*7 1 
/ 1 


12142 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

cat/: 
jUOO. 


153 


60 




A FA0^77ft 


nomo sapiens 


r iti-responsive osteosarcoma ui 
piUieiu 


izJ 


1 AA 

100 


12144 




n villi/ oapicQo 


Humon nt-At^in TT> XTO- 

i ium fin secreicu. proiem, oiiv^ iu nu. 
7003. 


1Q1 
1^1 


6z 


19145 

IZ l*f J 




rn/niQ Sapiens 


ivaoju 


ZD/ 


0*7 

87 


12146 


AF130089 


Homo sapiens 


PRO2550 


129 


74 


12147 


AF090931 




PRO04X1 

X JVV/ w*t O J 


l^R 

1 JO 


fin 

oU 


12148 


G02382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6463. 


135 


60 


12149 


G02532 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6613. 


138 


60 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


12150 


U51723 


Plasmodium ' 
vivax 


V-SERA I 


107 


37 


12151 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/5/U. 


82 


68 


191 ^9 
1Z IJjL 


V9^ftftzt 

I Z3oo4 


— : 

Homo sapiens 


Amino acid sequence of FK506 
Din u ing protein od {t ajdjtOj ). 


1 A A 

124 


69 


12153 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OD / 1. 


93 


70 


191 *s± 


A F990A/C7 
/vrzzyuo / 


U/\m/\ rlr\ latin 

iiomo sapiens 


r AJm-ti protein 


1 £0 

loz 


68 


12155 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


125 


86 


191 <7 
iZl J/ 


VjUjoUU 


Homo sapiens 


Human secreted protein, bbQ ID NO: 
7881. 


1 13 


39 


191 <ft 




Bos taurus 


ubiquitin-like protein 


146 


IS 


12159 


G03725 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7806. 


677 


98 


191 /CA 

1Z lou 


vjUZoJy 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


124 


90 


191 /CI 


nAOI/Ci 
VjOZjCO 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


109 


80 


191/19 


Ar WUyJ 1 


Homo sapiens 


PRO0483 


125 


76 


12163 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


191 


64 


12164 


G00577 


Homo sapiens 


Human secreted protein, SEQ CD NO: 
4658. 


96 


61 


lO 1 


000412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A AM 

449 J. 


115 


68 


12166 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

OTJA TTV VTA. 1 O 

SEQ ID NO: 163. 


219 


93 


12167 


AF1 18086 


Homo sapiens 


PR01992 


155 


78 


IZlOcS 


V1070A 

Alz/oy 


Mus musculus 


cytokerann 8 (AA 1 - 489) 


174 


63 


12169 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


94 


48 


IZl /0 


YooZ48 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


150 


57 


1Z171 


Y 86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


141 


78 


IZl /z 


Ar 065294 


Homo sapiens 


HDCMB45P 


184 


56 


12173 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


116 


50 


19 1*7/1 
IZl /4 




Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


107 


59 


12175 


X92485 


Plasmodium 
vivax 


pval 


108 


44 


12176 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


102 


84 


1Z1 / / 




Homo sapiens 


nuclear protein 


330 


68 


1917K 


I oOZHo 


riomo sapiens 


numan secreiea proiein xicrirroo, 


1 AT 
14/ 


jO 


12179 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


103 


73 


12180 


AF132200 


Homo sapiens 


PR01751 


131 


64 


191 JM 


AUUIOD14 


Homo sapiens 




26 1 


AA 

90 


12182 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
Clone HrMBQiZ. 


156 


75 


12183 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


75 


12184 


P92219 


Homo sapiens 
(human) 


CR1 protein. 


101 


74 


12185 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6688. 


156 


56 
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SEQ ID 
NO. 


Accession No. 


Species 


Description 


Smtth- 

Waterman 

Score 


% 

Identit 

y 




G02213 


Womo sanien<> 


Human secreted nrotein SFO ID NO* 
6294. 


626 


97 


12187 


Y014Q0 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


96 


51 


12188 


AF1 18067 


Homo saniens 


PRO 1578 


89 


73 


12189 


M63154 


Homo sapiens 


mtrinsic factor 


130 


96 


12190 


AF090942 


Homo saoiens 


PRO0657 


144 


72 


12191 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


76 


12192 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


134 


63 


12193 


S58722 


Homo saoiens 


X- linked retinooathv nrotein fC- 
terminal clone XEH 8c^ 


159 


76 


12194 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683} 


117 


76 


12195 


AF090931 


Homo sapiens 


PRO0483 


134 


62 






X XkJImHJ OtU/lvUd 


PRO2550 


117 


74 


12197 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
termtnal clone XFH Rc\ 


113 


57 




fiO?S37 

UUZJJZ 


LTrvmn canipnc 
xxiniiix oopit-ii a 


Human cerrpfed nrotein ^FO TO NO* 

XXLUXloJl 3CL1CICU JJlVJU^IXl, OLy XU iiv^. 

6613. 


126 


70 


12199. 


JQ2459 


hacteri onh ape 
lambda 


Fi CDNA nacka?in2'l 17i 


203 


95 


12200 




Homo saoiens 


Human c-mvh nrotein fVafflnent. 


135 


58 


12201 


R96800 

IV7UOW 


Homo saotens 


Human histioevte-^ecreted factor HSF 


132 


86 


12202 


AF1 19900 


Homo sapiens 


PR02822 


148 


68 






Homo saniens 


Human «icreted nrotein HC!HPF68 
SEOIDN0163 


128 


59 


12204 


R59842 


Homo sapiens 


ApoE4Ll protease. 


118 


91 


12205 


Y27R68 


Homo saoieTis 

XIV/IXIU Ou^lwUd 


Human secreted nrotein encoded bv 
gene No. 107. 


121 


72 




o / jyy / 


R ftffiic en 
rvxiitu a op. 


niicleonorin n67 homolop 


131 


65 


12207 


G00637 


Homo saoiens 


Human secreted nrotein^ SEO ID NO* 
4718. 


145 


65 


12208' 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH. 8c) 


145 


86 


12209 


AJ242956 


Homo saoiens 


El fusion protein 


96 


88 


12210 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


132 


78 


17211 




Homo oanipns 


Human secreted nrotein HPHPF68 

X Hil 1 Kill 9bV/lVl&U JJI vJ LV^Lll 11V/1U X UOj 

SEQ ID NO: 163. 


125 


81 


12212 


1WJ Lis 


Homo saoiens 


nro-alnha-1 tvne-T collagen 


96 


37 


12211 


G0 1895 


Homo saoiens 

XXl/lXXw lKX^/IVUo 


Human secreted nrotein SKO ID NO* 

JL ALII ill 111 duwlviw UIVlvUI) <Jl—i\£ XJLf llVi 

5976. 


310 


57 


12214 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO250. 


493 


82 


12215 


R13144 


Homo sapiens 


Deleted in Colorectal Carcinomas. 


700 


98 


12216 


AF 132200 


Homo sapiens 


PR01751 


109 


61 


12217 


AB013897 


Homo sapiens 


HKR1 


324 


64 


12218 


AF071172 


Homo sapiens 


HERC2 


123 


86 


12219 


L38593 


Homo sapiens 


integral membrane protein 


78 


62 


12220 


AF036233 


Homo sapiens 


cdc25B phosphatase 


204 


51 


12221 


Y14482 


Homo sapiens 


Fragment of human secreted protein 

pnmded Hv jtene 1 7 


107 


60 


12222 


AL049795 


Homo sapiens 


dJ622L5.9 (eukaryotic translation 
initiation factor 3 subunit 2 fnefa. 
36kD) (TRIP-1, TGF-beta receptor 
interacting protein 1)) 


163 


86 


12223 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


120 


76 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 


12224 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


102 


64 


12225 


AF130079 


Homo sapiens 


PR02852 


103 


80 


12226 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


163 


86 


12227 


AF1 19855 


Homo sapiens 


PRO 1847 


96 


78 


12228 


G00392 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4473, 


195 


87 


12229 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


103 


90 


12230 


U49957 


Homo sapiens 


LIM protein 


212 


100 


12231 


AK022759 


Homo sapiens 


unnamed protein product 


646 


100 


12232 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ81. 


110 


70 


12233 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


109 


74 


12234 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


123 


76 


12235 


AF220264 


Homo sapiens 


MOST-1 


104 


85 


12236 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


113 


87 


12237 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


143 


84 


12238 


AF090942 


Homo sapiens 


PRO0657 


103 


66 


12239 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


129 


72 


12240 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


135 


75 


12241 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 
RHAla(AF078683) 


136 


95 


12242 


S58722 


Homo sapiens 


X-Hnked retinopathy protein {C- 
terminal, clone XEH.8c} 


148 


72 


12243 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


118 


39 


12244 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


78 


100 


12245 


AC005200 


Homo sapiens 


plasmalemmal porin 


131 


79 


12246 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
657L 


150 


65 


12247 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


90 


55 


12248 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


77 


64 


12249 


U63542 


Homo sapiens 


FAP protein 


108 


56 


12250 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


101 


75 


12251 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


139 


63 


12252 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


122 


71 


12253 


AFl 18086 


Homo sapiens 


PR01992 


73 


70 


12254 


AF083929 


Mus musculus 


ES18 


108 


44 


12255 


AF130089 


Homo sapiens 


PRO2550 


124 


88 


12256 


AF090931 


Homo sapiens 


PRO0483 


132 


80 


12257 


X95190 


Homo sapiens 


branched chain acyl-CoA oxidase 


166 


86 


12258 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


131 


53 


12259 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


124 


88 


Q2260 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


85 
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WO 01/64835 



PCIYUS01/04927 



SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 














17961 


Oft771 1 


xiomo Sapiens 


riuman secretea protein, bevj lUNU: 
6292. 


133 


72 




rjrtic7ft 

VJAJ 1 0Z0 


Homo Sapiens 


Human secreted protein, bfay ID N(J: 
5909. 


no 


70 






norno sapiens 


Human c-myb protein fragment 


n c 
125 


oc 


19964. 




nomu Sapiens 


xi urn an secreted protein xiurxrroo, 
SEQ ID NO: 163. 


1 m 


65 


19965 
JZZOJ 


Apnnn6i6 

iVrvUUOlO 


Oryza sativa 


similar to KlNO-n2 linger protein 

ivXlA 1 a \Ar U / 000 


1 1 A 

114 


AC 

95 


12266 


AK024455 


Homo sapiens 


FLJ00047 protein 


108 


63 


19967 




iroroo Sapiens 


riuman secreted protein, bey jlu nu. 
7879. 


loo 


73 


1976X 




Homo sapiens 




1 AO 

108 


82 


12269 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone Xfc,H.8c) 


120 


68 


12270 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/o/ 1. 


120 


66 


17771 
1ZZ / 1 


I OOZ*f 0 


— : 

Homo sapiens 


riuman secreted protein HCxirroo, 
SEQ ID NO: 163. 


1 AO 

148 


74 


1ZZ/2 




Homo sapiens 


Human secreted protein, SEQ ID NO: 

'TOO') 
/00Z. 


129 


60 


12273 


AF130089 


Homo sapiens 


PRO2550 


134 


79 


1777/1 


r^A7Aa*\ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


1 18 


80 


1/U.tJ 


a p 1 0/icao 
/vr ly^oj / 


Homo sapiens 


XT A 1 


1 c 1 

151 


52 


12276 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

xoi 1 
091 J. 


75 


68 


12277 


R13556 


Homo sapiens 


Protein encoded downstream of hhcJM 
oncoprotein. 


113 


60 


12278 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone JtIFMdQjz. 


111 


70 


17770 

lZX/7 


Oft 1 en 1 
vjrU i!01 


riomo sapiens 


riuman secreted protein, Sbvj ID NO: 
6012. 1 


1 1 A 

119 


92 


177RH 
1ZZ0U 




nomo sapiens 


riuman secreted protein, ofcQ ID NO: 
7795. 


108 


90 


199K1 


TT66A6A 


xiomo sapiens 


hematopoietic progenitor kinase 


102 


83 


12282 


GD3438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
/J 19. 


118 


65 




wuuoJo 


Homo sapiens 


Tumour necrosis factor-related gene 
product 


162 


OO 

82 




V97R68 


xiomo sapiens 


Human secreted protein encoded by 
gene No. 107. 


130 


77 




AF1 16661 


xiomo sapiens 


pi? r\ 1 a 1 9 1 


1 f\A 

104 


74 


12286 


G00379 


Homo sapiens 


Human secreted protein, SEQ ED NO: 

AA&Ci 


98 


66 


12287 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


105 


74 




00^707 

vJwJ / 7 / 


ttumu sapiens 


riuman secretea protein, &n\i uj in la 
7878. 


129 


78 


199RQ 


AJjJOl 


rvULLUS 

norvegicus 


UKr / 


111 


47 


19907) 




xi ouio Sapiens 




1 ah 

147 


87 


12291 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


106 


69 


12292 


AP000616 


Oryza sativa 


similar to RING-H2 linger protein 
RHAla(AF078683) 


137 


53 


12293 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


142 


56 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 


12294 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

orn m "MO- Mil 
oll^ 1U INC. lO.). 


117 


60 


12295 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
cione rir iviD^/oz. 


112 


66 


12296 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A AIR 


140 


62 


12297 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7K7Q 


144 


70 


12298 


G02455 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 

OOJO. 


95 


51 


12299 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


108 


70 


12300 • 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene zz cione ruvioJ w 1 o. 


122 


85 






rtomo sapiens 


Human secreted protein, ofcivj ID NU: 

fOOO. 


1 17 


73 


12302 






numan secreiea protein, oHv^ iu vakj. 
7409 


IK 


oz 


12303 


AF130051 


Homo sapiens 


PRO0898 J 


116 


65 


12304 




nuuivi oopicns 


piaieiei glycoprotein no precursor 


1 1 Q 


60 


12305 


V00662 


Homo sapiens 


ATPase6 


118 


100 




AT71 rt74.AA 


Homo sapiens 


OW12o 


98 


66 j 


12307 


A05308 


synthetic 
construct 


glioblastoma-derived T-cell suppressor 
factor 


376 


93 


12308 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HrMJfc>v^J2. 


173 


71 


12309 


S58722 


Homo sapiens 


X-linked retinopathy protein {O 
terminal, clone XbrLoc} 


140 


74 


12310 


U12390 


Cloning vector 
poport i 


beta-galactosidase alpha peptide 


98 


47 


12311 


Y27571 


Homo sapiens 


Human secreted protein encoded by 
gene No. 5. 


130 


42 




OV/J / 1H- 


-— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 

770^ 


134 


76 


123 n 




numo sapiens 


A-iuiKea reunopainy protein (c- 
leiuiindi, cione AJjri.oCy 


no 


82 


12314 


U79260 


Homo sapiens 


unknown 


97 


64 


12315 




XiUllHj bap 1 Clio 


reverse uanscripiase 


O 1 A 


A A 

44 


12316 


D00570 


Mus musculus 


open reading frame (196 AA) 


115 


83 


12317 


-tvt v7i/7 d l 


rxorcu sapiens 


rlvUU*toj 


104 


74 


12318 | 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

UJOO. 


123 


84 


12319 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

jyvy. 


107 


70 


12320 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AAQ1 


97 


62 


12321 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


95 


58 


12322 




nomo sapiens 


numan secreiea protein, on\i iu in<j. 
4443. 


1 C\A 


jy 


1239*? 


\J\iD t r 1 1 


riomo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


lie 
113 


75 


1239d 




xiomo sapiens 


retinitis pigmentosa GTPase regulator 


111 


53 


12325 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminaL clone XEH 8c \ 


162 


75 


12326 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.8c} 


129 


75 


12327 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


125 


88 



464 



WO 01/64835 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Sraith- 

Watcrman 

Score 


% 

Identit 

y 


12328 




Unmn ^anipn<i 


PRO0663 ' 


177 i 
lxx 


7A 

/o 


12329 


AF090931 


Homo sapiens 


PRO0483 


156 


81 


12330 


TV VVJO JO 


nuuiv od^J i^iid 


A Ulltvlui llwV/lUoio LuVAv/1 IClcHCvl HCiiC 

product 


1H7 


CA 
OU 


12331 


AF116715 


Homo sapiens 


PR02829 


127 


72 


12332 


Y27868 


Hoiho sapiens 


Human secreted protein encoded by 
gene No. 107. 


145 


49 


12333 


S80862 


Western equine 
encephalomyeliti 
s virus Western 
equine 

encepnaiomyeiiu 

b virui, repuue 
Partial 7 SO aa 


nucleocapsid gene C 


108 


35 


12334 


^8,777 


XxlHUU aopiCUa 


"Y_Hn rAHnrtrtartivf r»rrvt"f»in /f_ 
A'lUlNCU ICLiliUUaUlY UIUICLU S V-^ 

terminal, clone XEH.8c} 


1^1 




173^ 


AKY1744S5 


XIUUIU oapicUo 


ruuuuH/ protein 




/ 1 


12336 


AF090944 


Homo sapiens 


PRO0663 


139 


84 


IX J J / 




XxUtUU sapiens 


piaieiet glycoprotein no precursor 


10/1 
1x4 


71 
/ 1 


12338 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
una irom piasmia pouoi iuj /. 


102 


69 


12339 


AL359782 


Trypanosoma 
urucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


121 


81 


12340 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7X8 ft 
/5o5. 


149 


54 


1XJ41 


V71QAK 


riomo sapiens 


numan prostate rumor co i rragmeni 
derived protein #153. 


134 


00 


17147 


fifl17Q£ 

VJVJ 17 O 


tnuiiiu supwuA 


Unman rorrofcu4 nmtein QCH TF^ Xlf~\. 

numan secreiea protein, ony UJ lskj. 
7879. 


1A7 
lUx 


OU 






X-ifxmt\ conipno 

riuinu bapieiib 


Pi?nioo7 
jrtsAJiyyx 


5U 


OJ 




I X/OUO 


riomo sapiens 


n urn an secreiea protein encoaea oy 
gene No. 107. 


1 A/I 
1U4 


oU 




O0141 5 


flUIDO bopiCUS 


Unman car«mto^ nrAfain OCA TI~\ XT/^\. 

jnuman secreiea protein, oov£ uj invj. 
7496. 


143 


UL 




xvyoov/v 


riomo sapiens 


o urn an nisaocyie-secreiea ractor nor. 


14 J 


7c 
/J 


12347 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

477ft 
4/ / v. 


116 


37 


12348 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

#££11 


176 


76 


12349 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

471 R 


103 


48 


12350 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 

terminal rlanf YptT Str\ 


101 


81 


12351 


Y02926 


Homo sapiens 


Fragment of human secreted protein 

pupoHpH hv optip 101 


108 


70 


12352 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from nlasmid nGCSl 1017 


101 


84 


12353 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


105 I 


42 


12354 


S58722 


AJ-VJ1U.V/ vXJjJlv'H J 


rpHnnnafhv nrot^iTi fC*— 

A JUJAwil I KjllilKJljaLliy JJi UtClil \\^ 

terminal, clone XEH.8c} 


1/17 


oo 




G00197 

\JVUJ7 / 


riULUU SapiCIlb 


XiUxiiciii bc^retcu proLcm, u£y iu VH\J. 
4478. 


175 
1ZJ 


A5 




rorooi/u 


nuiau sapiens 


riuman nisuocyie-secreieu iacxor nor . 


o/: 

yt> 


£.Q 
05 


12357 


G00354 


Homo saniens 


Human secreted d rote in SKO ID NO* 
4435. 


177 




12358 


AL160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


132 


38 


12359 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


90 
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NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








4478. 






12360 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


113 


56 


12361 


AB001684 


Chlorella 
vulgaris 


ORF49b 


92 


44 


12362 


U93564 I 


Homo sapiens 


p40 


257 


84 


12363 


U60269 


Homo sapiens 


putative envelope protein; orf similar to 
env of Type A and Type B retroviruses 
and to class II HERVs 


427 


84 j 


12364 


G03064 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7145. 


102 


73 


12365 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


102 


68 


12366 


AF090944 


Homo sapiens 


PRO0663 


94 


45 


12367 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


150 


55 


12368 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. I 


105 


85 


12369 


AF130089 


Homo sapiens 


PRO2550 


112 


78 


12370 


AF090895 


Homo sapiens 


PRO0117 


125 


64 


12371 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


121 


95 


12372 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


145 


72 


12373 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


144 


72 


12374 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


133 


79 


12375 


AP000001 


Pyrococcus 
horikoshii 


58aa long hypothetical protein 


96 


51 


12376 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


105 


76 


12377 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


123 


78 


12378 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


126 


83 


12379 


Y53643 


Homo sapiens 


A bone marrow secreted protein 
designated BMS6. 


329 


98 


12380 


U64094 


Homo sapiens 


soluble type II interieukin-1 receptor 


114 


100 


12381 


J02459 


bacteriophage 
lambda 


D (head-DNA stabilization;! 10) 


467 


100 


12382 


L10908 


Mus museums 


Gcapl gene product 


100 


89 


12383 


AF220264 


Homo sapiens 


MOST-1 


113 


68 


12384 


U79260 


Homo sapiens 


unknown 


94 


60 


12385 


B01372 


Homo sapiens 


Neuron-associated protein. 


148 


52 


12386 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


116 


63 


12387 


G00952 


Homo sapiens 


Human secreted protein, SEQ CD NO: 
5033. 


125 


92 


12388 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


130 


65 


12389 


S79410 


Mus museums 


nuclear localization signal binding 
protein 


150 


62 


12390 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4535. 


102 


38 


12391 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


113 


56 


12392 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


73 


92 



466 



WO 01/64835 
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SEQ ID 

Mfv. 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


12393 


M69297 


Homo sapiens 


ORF3 


167 


75 


12394 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


142 


43 


12395 


AF220264 


Homo sapiens 


MOST-1 


131 


57 


12396 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


139 


62 


12397 


AF130089 


Homo sapiens 


PRO2550 


120 


62 


12398 


AK023117 


Homo sapiens 


unnamed protein product 


529 


97 


12399 


AF123652 


Homo sapiens 


FEZ1 


157 




12400 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 6 


139 


87 


12401 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to D020 element 


306 


60 


12402 


AC005031 


Homo sapiens 


neuronal apoptosis inhibitory protein 


143 


96 


12403 


J03071 


Homo saniens 


chorionic snmatomammfttronin C*^-9 


5R5 

JO J 


inn 

IVJSJ 


12405 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 

nrotpin 


125 


100 


12406 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 

fhrirm^ opne nrpoHn<*t 

^1111^111,7 gvllv pi CUUUV/U 


150 


63 


12407 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878 


157 


85 


12408 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


120 


75 


12409 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613 


141 


50 


12410 


G03465 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7546. 


103 


51 


12411 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409 


147 


71 


12412 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


159 


70 


12413 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal clone XFH f£c\ 


129 


79 


12414 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


156 


72 


12415 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075 


74 


72 


12416 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 1 02, variant a 

rYW A r*n m nlpfp virion crpn nm #* 


102 


50 


12417 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


171 


75 


12418 


G02902 


Homo saniens 


Human secreted nrotein SRO ID MO* 

JL1U1JJUU *J%/\/l vl\Ai UJ vlvuli tJA~/\/ LKs 1 lV/. 

6983. 


137 


55 


12419 


G00365 


Homo sapiens 


Human secreted nrotein SEO ID NO* 
4446. 


1 1 1 


52 


12420 


R96800 


Homo saniens 


Human histioevte-secreted factor HSF 


129 


51 


12421 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


110 


55 


12422 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


126 


87 


12423 


U93564 


Homo sapiens 


p40 


350 


93 


12424 


M17885 


Homo sapiens 


acidic ribosomal phosphoprotein (P0) 


484 


88 


12425 


G02532 


Homo sapiens 


Human secreted nrotein SEO ID NO* 
6613. 


130 


V7 


12426 


X60376 


Brassier nanus 


nroline-rich nrotpin 




47 


12427 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


83 


88 


12428 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


117 


75 


12429 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


107 


67 



467 



WO 01/64835 PCT/US01/04927 



SEQ1D 
NO: 


Accession No. 


Species 


DrVtrri ntfnn 


Smith- 
Waterman 
Score 


% 

Idcntit 

y 


12430 


AK027208 


Homo sapiens 


unnamed protein product 


108 


64 


12431 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


154 


68 


12432 


AF1 30089 


Homo sapiens 


PRO2550 


118 


88 


12433 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


90 


55 


12434 


AK024455 


Homo sapiens 


FLJ00047 protein 


113 • 


75 


12435 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


107 


80 


12436 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


114 


75 


12437 


AC005498 


Homo sapiens 


R31665 2 


163 


75 


12438 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


132 


100 


12439 


G03116 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7197. 


145 


84 


12440 


AK024455 


Homo sapiens 


FU00047 protein 


136 


73 


12441 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


122 


69 


12442 


AF090942 


Homo sapiens 


PRO0657 


120 


72 


12443 


G02902 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6983. 


159 


53 


12444 


AF130089 


Homo sapiens 


PRO2550 


110 


73 


12445 


AL021395 


Homo sapiens 


dJ269M15.1 (similar to peptidylprolyl 
isomerase (cyclophilin)) 


225 


64 


12446 


AF090931 


Homo sapiens 


PRO0483 


155 


79 


12447 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6596. 


130 


81 


12448 


G00392 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4473. 


122 


74 


12449 


AF1 19855 


Homo sapiens 


PR01847 


88 


72 


12450 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


147 


84 


12451 


AK024455 


Homo sapiens 


FU00047 protein 


78 


68 


12452 


AF1 16712 


Homo sapiens 


PR02738 


118 


62 


12453 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


109 


67 


12454 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


144 


59 


12455 


AK024455 


Homo sapiens 


FLJ00047 protein 


146 


64 


12456 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


166 


72 


12457 


AF090944 


Homo sapiens 


PRO0663 


129 


79 


12458 


G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. ' 


119 


70 


12459 


R22278 


Homo sapiens 


Human gp.-specific component 
phenotype Gel. 


250 


65 


12460 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


143 


58 


12461 


Y87156 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 195. 


111 


61 


12462 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


147 


84 


12463 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7885. 


94 


38 


12464 


S58722 


Homo sapiens 


X-Iinked retinopathy protein {C- 
terminal, clone XEH.8c} 


134 


74 


12465 


AF200715 


Homo sapiens 


FIB domain adaptor protein CED-6 


332 


95 


12466 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


139 


64 
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Waterman 
Score 


% 

Identit 

y 








7867. 






1Z40/ 




V^ryxa Saliva 


cimihr tr\ flTisSCw-W? finopr nrntAin 

SUUliai tv IvllN VJ JIZ. HUgCl UiUlvut 

K* HA 1 a i"AF078683^ 






12468 


AF199023 


Homo sapiens 


phospholipid scramblase 4 


476 


98 


\lPt\yy 


ArZODD /J 


nOIIIO SopiCllS 


liKJmiitrutQ 1 PR-mrttif nrotAin "V 
UOlC|UllUUS irix lUULll piuicui I 

isoform 


1 AA 


07 


19/17A 
1Z4/U 


t\T 1U5o41 


nomo Sapiens 


poi protein 


3A6 

JvO 


87 




i jo4yj 


nomo Sapiens 


Pranrnpnt ftf hitman OA/*t*AtAf( nt*ntAin 

rragmeni oi nuuiaii seuieieu pruieiii 
encoded by gene 27. 


1 I 1 
111 


70 


1x4 IZ 


VjUUj/0 


nomo sapiens 


Human CAArAtArl nrntAin ^PP) TT> "WO* 

niunan secreieu pi u lciii, oev^ xu iskj. 
44^7 




7R 


12473 


U18339 


Variola virus 


D4L 


117 


60 


1Z4/4 


vjUUj / / 


Homo sapiens 


Unman GP>n-rf*+oA r\rr\+r*\n QT70 IT) XJfV 

ii urn an secretea protein, onVd it-' nvj. 
4458. 


1*>A 




J 2475 


Ar liUUoy 


Homo sapiens 




1 A^ 
1U3 




12470 


U040/ o 


riomo sapiens 


Unman pa^rotoH nrntain C TP\ XJO' 

rtuman Sccreteci protem, oexj iu inu. 

01 J7. 


RA 


A7 
Ox 


1Z4// 


X/f017A7 

iviyiz4x 


norvegicus 


calrinm rhannpl *alnnci_l ciitMtnit 

udiLiLuii cnannet aipua-i suuuiui 


1 JO 


oo 


17vl7ft 
1Z4/5 




nomo sapiens 


rivv^zjjo 




70 


12479 


M15530 


Homo sapiens 


B-cell growth factor 


154 


70 


12480 


U39904 


Mus musculus 


citron 


HI 9. 
1 15 


07 


10/101 

12481 


Kyyio4 


Homo sapiens 


riuman KJio l proiein ussjx Dinuing 
domain. 


900 


^7 


12482 


AL 160371 


Leishmania 
major 


prooaoie ^nnv-o jui luz, variant a 

TYNJ A rnmnUfA vinAn o-^nnmA 

i/tin, compieie virion genome 


07 




1.&4 


AT71 £1 
AT 101 JDO 


riomo sapiens 








12484 


Y69166 


Homo sapiens 


A mature human N-acetylglycosaminyl 

trancfpracA »-\T*rvt a i n 
UaiuiciooC ylvLeill. 


97 


75 


17A5K 


A FAQAQA7 


riomo sapiens 




1A1 




IZ450 




\Sr\ty\f\ cant An c 

nuinu sapiens 


tmlrn/>u/n 

UX1&J1U VVi I 


ox 


74 


12487 


AL160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 

T"YKJ A rnmnlpfp virir\n CrAnnmA 
tHJlxipJCLC VlIlUll gCllUiUC 


96 


53 


17/lfifi 


Traoi AA 


riomo Sapiens 


aiuose reuucutse'iitve pepnue 




07 


iz4oy 


uy^Doy 


tlrtmn com An c 

noino sapiens 


nntsit'iVA nl ^A 
piiLaLlvc pi JU 


141 


JU 


1247U 


I J045O 


riomo sapiens 


ITraomAnt nf human CA/^rA^Arl nfntAin 

r ragmen i or numau scvreieu piuicui 
encoded by gene 27. 


1A4 


Ov 


10/401 


U <02A9 

ioyo4/ 


riomo sapiens 


x\pocrrJLri protease. 


oo 




12492 


Y00876 


Homo sapiens 


Human LAPH-1 protein sequence. 


219 


45 


1Z493 


AK024433 


Homo sapiens 


17T TAAAA7 nrMai'n 

rjjuuu4/ protein 


ft^ 

o3 




12494 


W48351 


Homo sapiens 


Human breast cancer related protein 


95 


63 


Jz4yj 




nomo sapiens 


T7rorttriAnt hnmon CA/*i*AtP/^ nrnfpin 

riagmcni oi n urn an scureieo proiem 
encoded by gene 98. 




£A 
oo 


iZ4yo 


vjUj /yo 


numo sapiens 


Unman cAfTAtAH nrntAin ^Pi^ IT) MO - 
numau scui clcu pruieiii, oia^ il/ i>w. 

787Q 




^7 


12497 


Y77551 


Homo sapiens 


C-terminal domain of betalc integrhx 


132 


67 


ix4y& 


Ovv4Z/ 


Vs\msi pontine 

riomo sapiens 


Human cArnofAri nrntAin ^PO TTt "WO* 
rxuiiidii scvreieu proiem, ocy il/ inv-/. 

4508. 




Of 


19AQO 


A 171 All^A 
AT1013DO 


nomo sapiens 


nojr ^viyj> 




-> i 


io^aa 




nomo sapiens 


ynmon r-c»r*f otf»r1 nrrvf"*»in Q t?r^ TT^ ^T/~^ ■ 

numou sccreieo. proiem, ociv^ uj nkj. 
6983. 


I^A 


XI 

o 1 


io<ni 


O004yz 


nomo sapiens 


numan secrexea protein, ocy uj inv^. 
4573. 


07 
' * 


JZ 






XlUItlU oajJICllS 


PR017S1 

l IvU J / Jl 


94 


6Q 


12503 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


76 


65 


12504 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
factor (BCGF). 


99 


75 
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% 
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y 


12505 


M19651 


Rattus 
norvegicus 


fos-related antigen 


132 


92 


12506 


G00354 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
4435. 


113 


48 


12507 


U94832 


Homo sapiens 


KSRP 


117 


48 


12508 


W47029 


Homo sapiens 


Human N-proteinase (70 kDa short 
form). 


121 


62 


12509 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


124 


66 


12510 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


90 


80 


12511 


AF042384 


Homo sapiens 


BC-2 protein 


516 


98 


12512 


Y12950 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


96 


50 


12513 


G03482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7563. 


121 


45 


12514 


AL162044 


Homo sapiens 


hypothetical protein 


174 


48 


12515 


AF130079 


Homo sapiens 


PR02852 


96 


73 


12516 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


113 


77 


12517 


D32202 


Homo sapiens 


alpha 1C adrenergic receptor isoform 2 


164 


86 


12518 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7001. 


99 


85 


12519 


U00694 


Gailus gallus 


vitamin D3 hydroxylase associated 
protein 


187 


45 


12520 


AB030829 


Rattus 
norvegicus 


carbonic anhydrase III 


503 


63 


12521 


J04204 


Bostaurus 


32 kd accessory protein 


618 


95 


12522 


M36341 


Homo sapiens 


ADP-ribosylation factor 4 


374 


93 


12523 


AF090931 


Homo sapiens 


PRO0483 


163 


82 


12524 


AB007925 


Homo sapiens 


KIAA0456 protein 


130 


62 


12525 


G02314 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
6395. 


162 


79 


12526 


AF178948 


Homo sapiens 


TALE homeobox protein Meis2a 


321 


93 


12527 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


156 


85 


12528 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


126 


57 


12529 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


145 


68 


12530 


AK024372 


Homo sapiens 


unnamed protein product 


112 


56 


12531 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


76 


65 


12532 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


132 


66 


12533 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. . 


146 


53 


12534 


AF090942 


Homo sapiens 


PRO0657 


83 


60 


12535 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


136 


65 


12536 


R95913 


Homo sapiens 


Neural thread protein. 


106 


52 


12537 


AF130089 


Homo sapiens 


PRO2550 


148 


84 


12538 


AF102826 


Homo sapiens 


RD1 14/simian type D retrovirus 
receptor 


605 


91 


12539 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQEDNO:163. 


143 


52 


12540 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


87 


65 


12541 


G03600 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7681. 


209 


92 
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Smitb- 


% 


NO: 








Watenoao 


Identit 










Score 


y 


12542 


M31964 


Saimiriine 


collagen-like protein 


92 


52 






herpesvirus 2 








12543 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


76 








7795. 






12544 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


155 


72 








SEQIDNO:163. 






12545 


AC004912 


Homo sapiens 


similar to CR16, SH3 domain binding 


442 


98 








protein; similar to 2205340A 












(PID:gl587070) 






12546 


V00662 


Homo sapiens 


URF A6L (NADH dehydrogenase 


175 


91 








subunit) 






12547 


AF130089 


Homo sapiens 


PRO2550 


112 


40 


12548 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


109 


48 








6613. 






12549 


X55110 


Homo sapiens 


neurite outgrowth-promoting protein 


479 


72 


12550 


X98296 


Homo sapiens 


ubiquitin hydrolase 


500 


88 


12551 


R94317 


Homo sapiens 


Hepatocyte proliferation substance HP- 


390 


92 








041V. 






12552 


G01623 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


90 








5704. 






12553 


D86966 


Homo sapiens 


similarto human ZFY protein. 


161 


56 


12554 


D00570 


Mus museums 


open reading frame (196 AA) 


122 


67 


12555 


AF130089 


Homo sapiens 


PRO2550 


94 


88 


12556 


AF043184 


Homo sapiens 


T cell receptor beta chain 


607 


84 


12557 


U93564 


Homo sapiens 


p40 


154 


100 


12558 


U42026 


Homo sapiens 


plasma membrane Ca2+-ATPase 


200 


97 








variant 4a PMCA4a 






12559 


S79410 


Mus musculus 


nuclear localization signal binding 


103 


50 








protein 






12560 


U90552 


Homo sapiens 


butyrophilin 


356 


98 


12561 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


156 


37 • 


12562 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


138 


41 








clone HPMBQ32. 






12563 


X16454 


Homo sapiens 


carcinoembryonic antigen subdomain B 


353 


98 


12564 


AL031186 


Homo sapiens 


bK984GL4 (Ewing sarcoma brealqwint 


119 


100 








region 1 protein) 






12565 


M68941 


Homo sapiens 


protein-tyrosine phophatase 


223 


93 


12566 


Y14487 


Homo sapiens 


cytosolic serine 


145 


84 








hydroxymethyltransferase 






12567 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


149 


61 








6913. 






12568 


AB005047 


Homo sapiens 


SH3 binding protein 


124 


52 


12569 


G03411 


Homo sapiens 


Human secreted protein, SEQ ©"NO: 


102 


53 








7492. 






12570 


X68142 


Oryctolagus 


elongation factor 1 gamma 


678 


93 






cuniculus 








12571 


L20315 


Mus musculus 


MPS1 protein 


337 


77 


12572 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


114 


50 






brucei 


complete virion genome. 






12573 


AF1 30089 


Homo sapiens 


PRO2550 


124 


71 


12574 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


51 








7492. 






12575 


AF005902 


Monodelphis 


kinesin homolog 


534 


82 






domestica 








12576 


Y56021 


Homo sapiens 


Human CD40 receptor interacting 


711 


96 








protein 4C4. 






12577 


Y99662 


Homo sapiens 


Human GTPase associated protein- 13. 


634 


100 


12578 


Z48008 


Saccharomyces 


Soklp 


225 


100 






cere vis iae 
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y 


12579 


AF1 12207 


Homo sapiens 


translation initiation factor elF*2b delta 


195 


97 








subunit 






12580 


M22538 


Homo sapiens 


NADH-ubiquinone reductase 


224 


82 


12581 


AB034730 


Mus musculus 


This gene is isolated by means of 


112 


44 








differential display method using ttw, 












an excellent mouse model for ectopic 












ossification.; similar to megakaryocyte 












stimulating factor precursor and 












cartilage superficial zone protein 






12582 


U93567 


Homo sapiens 


p40 


364 


93 


12583 


AY007233 


Homo sapiens 


phosphoinositol 3-phosphate binding 


475 


100 








protein- 1 






12584 


L07592 


Homo sapiens 


peroxisome proliferator activated 


113 


81 








receptor 






12585 


AE003682 


Drosophila 


CG8135 gene product 


.191 


38 






melanogaster 








12586 


AF1 19851 


Homo sapiens 


PR01722 


88 


64 


12587 


AF090895 


Homo sapiens 


PRO0U7 


144 


59 


12588 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


57 








7645. 






12589 


U73199 


Mus musculus 


Rho-guanine nucleotide exchange 


111 


32 








factor 






12590 


J04615 


Homo sapiens 


small nuclear ribonucleoprotein 


383 


• 97 


12591 


L77967 


Ovis aries 


small proline-rich protein with paired 


60 


33 








repeat 






12592 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


118 


78 








gene 22 clone HMSJW18. 






12593 


Y36495 | 


Homo sapiens 


Fragment of human secreted protein 


123 


83 








encoded by gene 27. 






12594 


AF090944 


Homo sapiens 


PRO0663 


152 


82 


12595 


Y15243 


Homo sapiens 


ULBP-3 amino acid sequence. 


852 


100 


12596 


X98235 


Drosophila 


type I 


159 


42 






melanogaster 








12597 


AF191309 


Mus musculus 


zinc finger protein 


407 


66 


12598 


W82841 


Homo sapiens 


Human cerebral protein- 1. 


107 


91 


12599 


AF070664 


Homo sapiens 


HSPC008 


128 


89 


12600 


U03688 


Homo sapiens 


cytochrome P450 


257 


96 


12601 


AB015798 


Homo sapiens 


DnaJ homolog 


226 


73 


12602 


AC003058 


Arabidopsis 


unknown protein 


88 


40 






thaliana 








12603 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


35 








7348. 






12604 


AF200187 ! 


cercopithicine 


EBNA2-like protein 


109 


28 






herpesvirus 15 








12605 


AF161532 


Homo sapiens 


HSPC047 


720 


100 


12606 


M21302 


Homo sapiens 


small proline rich protein 


60 


59 


12607 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


107 


47 








gene 22 clone HMSJW18. 






12608 


AJ302650 


Rattus 


RP59 protein 


599 


54 






norvegicus 








12609 


J05401 


Homo sapiens 


sarcomeric mitochondrial creatine 


763 


96 








kinase precursor (EC 2.7.3.2) 






12610 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


216 


87 








6613. 






12611 


W78175 


Homo sapiens 


Human secreted protein encoded by 


337 


100 








gene 50 clone HSTAG52. 






12612 


S79410 


Mus musculus 


nuclear localization signal binding 


145 


66 








protein 






12613 


AF1 19855 


Homo sapiens 


PRO 1847 


157 


61 
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y 


12614 


U82303 


Homo sapiens 


unknown 


107 


62 


12615 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


49 








4718. 






12616 


AF130051 


Homo sapiens 


PRO0898 


254 


56 


12617 


AF037350 


Rattus 


NF-E2-related factor 2 


807 


59 






. norvegicus 








12618 


Y02168 


Homo sapiens 


A facilitative glucose transporter 


452 


100 








protein GLUT8. 






12619 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


158 


84 








4478. 






12620 


G03812 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


246 


60 








7893. 






12621 


X55683 


Lycopersicon 


extensin (class I) 


79 


43 






esculentum 








12622 


AF109907 


Homo sapiens 


S164 


93 


40 


12623 


Y36156 


Homo sapiens 


Human secreted protein #28. 


114 


61 


12624 


AF1 16712 


Homo sapiens 


PR02738 


104 


48 


12625 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


65 








7075. 






12626 


AF1 13685 


Homo sapiens 


PRO0974 


104 


43 


12627 


AL160371 


Leishmania 


probable (hhv-6) ul 102, variant a 


116 


45 






major 


DNA, complete virion genome 






12628 


AF107406 


Homo sapiens 


GW128 


133 


58 


12629 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


57 








7496. 






12630 


AB047600 


Macaca 


hypothetical protein 


172 


66 






fascicularis 








12631 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


63 








6619. 






12632 


AF 118082 


Homo sapiens 


PRO1902 


138 


45 


12633 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


62 








4478. 






12634 


X52164 


Mus museums 


Q300 protein (AA 1-77) 


95 


58 


12635 


AF064597 


Homo sapiens 


LINE- 1 like protein 


108 


46 


12636 


Y02693 


Homo sapiens 


Human secreted protein encoded by 


116 


49 








gene 44 clone HTDAD22. 






12637 


W48351 


Homo sapiens 


Human breast cancer related protein 


110 


52 








BCRB2. 






12638 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 


79 


70 








protein fragment 59. 






12639 


M29622 


Mus musculus 


open reading frame 2 


81 


60 


12640 


M26460 


Homo sapiens 


retinoblastoma 1 


100 


45 


12641 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


110 


63 








terminal, clone XEH.8c} 






12642 


AB047936 


Macaca 


hypothetical protein 


100 


52 






fascicularis 








12643 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


41 








7214. 






12644 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 


143 


48 








encoded by gene 17. 






12645 j 


AF090942 


Homo sapiens 


PRO0657 


157 


54 


12646 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


56 








6953. 






12647 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


267 


65 








6953. 






12648 


AF130079 


Homo sapiens 


PR02852 


109 


57 


12649 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


61 








7868. 






12650 


X55686 


Lycopersicon 


extensin (class H) 


65 


35 
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ol/Q ID 


Access on No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Iden tit 










Score 


y 






esculentum 








12651 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


84 








7795. 






12652 


AF1 19851 


Homo sapiens 


PR01722 


150 


70 


12653 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


133 


50 








gene 22 clone HMSJW1 8. 






12654 


Y45382 


Homo sapiens 


Human secreted protein fragment 


146 


70 








encoded from gene 28. 






12655 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


136 


63 








4435. 






12656 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


52 








7870. 






12657 


U93563 


Homo sapiens 


putative pl50 


1046 


50 


12658 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


346 


70 








7870. 






12659 


Y 12950 


Homo sapiens 


Amino acid sequence of a human 


104 


70 








secreted peptide. 






12660 


AF1 19851 


Homo sapiens 


PR01722 


98 


56 


12661 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 


90 


65 








terminal, clone XEH.8c} 






12662 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


59 


64 








7344. 






12663 


AF1 18082 


Homo sapiens 


PRO1902 


91 


66 


12664 


L10908 


Mus musculus 


Gcapl gene product 


113 


50 


12665 


M18093 


Phaseolus 


hydroxyproiine-rich glycoprotein 


124 


37 






vulgaris 








12666 


AF090895 


Homo sapiens 


PRO0117 


76 


57 


12667 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


43 








7870. 






1266S 


W48351 


Homo sapiens 


Human breast cancer related protein 


90 


61 








BCRB2. 






12669 


M15530 


Homo sapiens 


B-cell growth factor 


90 


80 


12670 


AF198447 


Aspergillus 


60S ribosomal protein L3 


217 


69 






nidulans 








12671 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


51 








7871. 






12672 


Y20717 


Homo sapiens 


Human neurofllament-M wild type 


77 


68 








protein fragment 59. 






12673 


AF055985 


Onchocerca 


pyrrolidone-rich antigen 


67 


40 






volvulus 








12674 


X79417 


Sus scrofa 


40S ribosomal protein S12 


377 


75 


12675 


AF107406 


Homo sapiens 


GW128 


74 


51 


12676 


Y73966 


Homo sapiens 


Human prostate tumor EST fragment 


99 


62 








derived protein #153. 






12677 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


65 








6953. 






12678 


G00590 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


99 


62 








4671. 






12679 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


128 


85 








6065. 






12680 


AF194537 


Homo sapiens 


NAG13 


142 


59 


12681 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


155 


77 


12682 


AF161356 


Homo sapiens 


HSPC093 


104 


50 


12683 


AF 107406 


Homo sapiens 


GW128 


84 


54 


12684 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


90 


39 


12685 


V40883_cdl 


Homo sapiens 


03-DEC-1997 Coding sequence of 


728 


100 






clone BG366_2. 






12686 


M24732 


Homo sapiens 


lam in-like protein 


94 


54 


12687 


X00824 


Gallus gallus 


collagen 


66 


42 



474 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 




G037Q0 
vJvj / y\j 


Hrwnr* campn'i 


Human 9#»TTPt#*H nrnt^in QpO TT^ WO- 


184 


(LA 








7871 
/o/ I. 






12689 


AF1 18086 

fu A 1 OV/OU 


Hnmo **a.rvif*n , s 


PRO 1992 

a Avw 1 7 ✓ ^» 


140 

I "TV 


61 t 
ul 


19600 


AT OOOI 14 


1 AicfirnQTiiQ 
JL^lMlindllld. 


prooaoie ^nnv-o^ ui iuz, vanani a 


1 1 1 

131 








inniAr 
Iliajvri 


A-/1N/V, v^JillplCLC/ VII IVrll gdlViUlC 






12691 


G02872 


1 XU111U OuLJltsIlo 


Human v»rretf*H nrnt<*in ^iFO TTi "MO* 


180 


79 
/ z 








6953 






I960? 


003700 


nuiiiv sapiciio 


lAlJllld.ll SCL<1CICU prUlClil, OCy Lis iNW. 


104 


79 
/ Z 








7871. 






19693 


A Fl 30080 


Hrmiri c 5i »^ ip no 


PRO9S50 


904 


SI 

31 


12694. 


KOI 664 


T"jT/\cn rVh i Is* 


lJfVlll - lllVU piuiciii 


02 
yz 


59 

3Z 






m pi an no*a<«tpr 








12695 


AT 3901 14 


I Aichmania 


TvrnKsiMA /Tihv-6^ ul 109 variant a 
jJiuuaUic "^imvvy ui 1 V/jt-j vaiiaiii a 


1 10 

1 IV 


59 

■JZ 






tnainr 


T*jNA cnmnlAfp virion opnrtmp 

L/Ii/Tl, "kAJAllJJlvLW Vlllx/il J^dlWlllv« 






12696 






olvprvnnfifpin TK alnha 
gi^^wpj iu oji/iia 


61 


45 

*T»7 


12697 


Ml 5530 


Hrrrno earn An c 

AAVslAlV odJJl&Iia 


R»r*Al1 OT*owfh ■for'fnr 
irwu giwvTui icvvvji 


121 
ii>i 


64 


19608 


M36Q 1 3 


7aq mav/c 
Z«Ca lllciya 


L>eu wail pruiem ipuu ) 9 puiaiive 


75 


4.0 


19600 


^58799 
oJO / zz 


nomo sapiens 


-A.-uiiK.eci rcuiiopauiy protein \V-^ 


101 
IU1 


6S 

05 








terminal olAnp *Y PU 51/* \ 

ICillllliaiy U1U11C yVAiii. OV. f 






19700 

JZ 1 W 


M94739 


T-IrvtYi rs ccin i pn c 
jaUajiu oapicii;> 


lamin.liIrA nrntpi n 

Janiin-iiKc pruLcin 


05 


35 
33 


19701 
iz /Ui 


003963 


XlUIIiU sapiens 


Unman c<v^rataH nrAtatn QCH TTi XT/~i. 

o urn an secreiea protein, ocv^ lJL' inw. 


o / 


^O 
3U 








7344. 






19709 
IZ / UZ 


ATZ70J7H 


lNlCOllalld. aiala 


Eraoinogaiacian proxem 


io^ 

lvO 


30 


19703 

1Z /U3 


Y094B5 


r i asm oa i u m 


pval 


07 


3Q 
3o 






VIV3X 








19704. 

1Z / v*f 


A F9 10651 
/vrziu03i 


jrionio supicns 


XIAO 18 


80 


64 
04 


19705 


003700 


AALH11U SaUACAlS 


Uiiman CAr*i*A't > A/1 frMwt'Ai'n *^t7^^ W\ M/^* 

aa uiii ail score icu proiem, jCy lu iwj. 


100 


S3 
33 








fori* 






19706 


G003 59 


T-T/^Tn r\ cartiAnc 

nuiuu sapiens 


Human cATTAtAH nrrktAin Cprv TTf\ MA« 

nuindJi seureieu pruiein, ocy lu 


03 

yj 


7R 

/ O 








44.33 






19707 
iz / u / 


TT59077 

U JZU / / 


nuuiu sapiens 


mariner iiansposasc 


4.51 

*T J 1 


51 
3 1 


1970 J? 


AfOOIOSC 


Araoiaopsis 


unknown protein 


Z44 


/ / 






maligna 








lz/W 


0093A3 


xiomo Sapiens 


UiitMan ra^rala^ nrAlain CCA rr% XJ/^V. 

riUTnan secreieu protein, oty lu jmkj. 


1 Jo 


03 








HA A A 






197 lO 


003ROO 
OU3oUU 


xiomo sapiens 


ITimian po/>rafo/1 nmlain CCA 111 "Kir~\» 

riuman secreieu pro tern, onv^ il* inu. 


QO 


9*7 
OZ 








7881 
rooi. 






1971 1 
1Z / 1 1 


V096Q3 

i uzoy J 


riorno Sapiens 


xiuman secreieu proicm encouea oy 


104 


1/1 








gene hh eiune n 1 jl//\jl/zz. 






12712 


AF249779 


PfAmA caniAnc 

A AVJAAIU DOAJlCllO 


mAOAnohvmal ctpm /^a11 nrAtpin 
mesciiuuyiiiai slciu ecu piutciu 


106 

AUO 


45 














12713 


G03790 


AAU11JLF aOUIClAd 


Human CAn-AtAH nmfpin QRO TTi "MO* 


116 
J lu 


36 








7871 
lot 1. 






12714 


AF1 16715 

iil 1 lUf a«j 


x Ai/iiiir aapiviio 


PRO2820 


130 


68 


12715 


G03790 


Unmn canien'i 

A AVA1A11S oaLyiwiAo 


Human secreted nrntein SFO TO NO* 
nuiiittU acuicicu ljiiawUAj oi_»v^ ajls 


89 


62 








7871 

/Or 1< 






12716 


X55684 


T .vcnner<iinrin 

A_/ V uUpvl aJVUJ 1 


Airfpn <iin ^ pla"i<*; T\ 


86 


54 






escn 1 entiim 

VU XvU L> Ulll 








12717 


Z93891 


HPfTptAr nAHtllQ 


p vtr> r\\ Tr\tt\ a nviHacA 

v<jf UJlslll U111C UA.J LKloC 


72 
/z 


48 

to j 


1971 ft 
lz/ Lo 


004A/%3 
OU4U03 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


10 j 


40 








8144 

O It'r. 






19710 


0006RO 

VJUU0O7 


nomo Sapiens 


numan secreieu proicm, ocy iu invj. 


74 


A£ 

40 








4770 






19790 
1Z / zu 


TT30991 

U 3UZZ1 


\-/i lunula 


XJA TYW HAl^*i//1r > r\cTAnacA mVtiintf' *\ 

iN/vL^n ucnyurogenase suuunii j 


09 

7Z 


30 

J7 






fasciculate 






12721 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


82 








6619. 






12722 


AF107406 


Homo sapiens 


GW128 


72 


54 


12723 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


55 
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WO 01/64835 PCT/US01/04927 



NO: 


Accession No. 


Species 


Description 


Smith- 
, Waterman 
Score 


% 

Identit 
y 








6720. 






12724 


AF116661 


Homo sapiens 


PR01438 


145 


59 


12725 


Y53871 


Homo sapiens 


A human brain-derived signalling 
factor polypeptide. 


584 


98 


12726 


L10908 


Mus musculus 


Gcapl gene product 


95 


37 


12727 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


124 


37 


12728 


X55686 


Lycopersicon 
esculentum 


extensin (class II) 


66 


36 


12729 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065, 


110 


55 


12730 


L23852 


Homo sapiens 


Putative 3 f end of coding region; 
putative 


83 


37 


12731 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


59 


48 


12732 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


88 


69 


12733 


AF090895 


Homo sapiens 


PRO0I17 


130 


55 


12734 


R95913 


Homo sapiens 


Neural thread protein. 


162 


45 


12735 


U72355 


Homo sapiens 


Hsp27 ERE-TATA-binding protein 


972 


57 


12736 


AF247039 


porcine 
adenovirus 3 


163R* 


108 


35 


12737 


M22332 


Homo sapiens 


unknown protein 


148 


45 


12738 


AF090895 


Homo sapiens 


PRO0117 


89 


72 


12739 


B08525 


Homo sapiens 


Protein encoded by a novel gene 
associated with colon disease. 


296 


86 


12740 


X90872 


Homo sapiens 


associated to Golgi apparatus 


116 


54 


12741 


M26460 


Homo sapiens 


retinoblastoma 1 


75 


37 


12742 


S52010 


Mussp. 


orfl 5' of EpoR 


104 


31 


12743 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


88 


50 


12744 


AB047600 


Macaca 
fascicular is 


hypothetical protein 


111 


47 


12745 


AF090944 


Homo sapiens 


PRO0663 


152 


65 


12746 


Y21106 


Homo sapiens 


Human bcl2 proto-oncogene wild type 
protein fragment 3. 


62 


64 


12747 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


97 


76 


12748 


X66861 


Mus musculus 


Hox-1.4 


92 


34 ] 


12749 


AF166125 


Homo sapiens 


selenoprotein N 


978 


98 


12750 


W87504 


Homo sapiens 


Human N-methyl-D-aspartate receptor 
subunit encoded by clone NMDA24. 


98 


29 


12751 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


91 


78 


12752 


X72963 


Nicotiana 
tabacum 


pAP8 product 


80 


45 


12753 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


123 


61 


12754 


AF146191 


Homo sapiens 


FRG1 


208 


85 


12755 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


154 


75 


12756 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


72 


48 


12757 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


111 


51 


12758 


U83280 


Leishmania 
donovani 


39 kDa antigen 


106 


60 


12759 


AF1 16661 


Homo sapiens 


PR01438 


73 


48 


12760 


AF164615 


Homo sapiens 


envelope protein, 


257 


83 
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PCT/US01/04927 



SEQ ID 


Accession No, 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 
y 


12761 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJWI8. 


135 


70 


12762 


AF073519 


Homo sapiens 


small EDRK-rich factor 1, long isoform 


111 


50 


12763 


M29622 


Mus musculus 


open reading frame 2 


73 


68 


12764 


L27428 


Homo sapiens 


reverse transcriptase 


111 


60 


12765 


S79410 ! 


Mus musculus 


nuclear localization signal binding 
protein 


106 


45 


12766 


AK024455 


Homo sapiens 


FLJ00047 protein 


108 


66 


12767 


M36914 


Zea mays 


cell wall protein (put.); putative 


78 


36 


12768 


L27428 


Homo sapiens 


reverse transcriptase 


159 


59 


12769 


AJ005567 ! 


Mus musculus 


SPR2I protein 


55 


39 


12770 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


115 ' 


82 


12771 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


76 


64 


12772 


AF217374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


126 


85 


12773 


AF130114 


Homo sapiens 


PR02459 


129 


50 


12774 


AF010I44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


220 


71 


12775 


AF287482 


ChloFobium 
tepidum 


Orfl22 


166 


68 


12776 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


84 


50 


12777 


AF130079 


Homo sapiens 


PR02852 


258 


58 


12778 


AF090931 


Homo sapiens 


PRO0483 


65 


76 


12779 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


152 


66 


12780 


AB047948 


Macaca 
fascicularis 


hypothetical protein 


59 


37 


12781 


AF1 19900 


Homo sapiens 


PR02822 


132 


59 


12782 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


97 


51 


12783 


G00588 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4669. 


77 


77 


12784 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


93 


84 


12785 


L27428 


Homo sapiens 


reverse transcriptase 


183 


39 


12786 


AF1 19851 


Homo sapiens 


PR01722 


128 


61 


12787 


AF1 16715 


Homo sapiens 


PR02829 


111 


72 


12788 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


147 


77 


12789 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


84 


66 


12790 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


155 


71 


12791 


AF124729 


Mus musculus 


acinusS' 


122 


50 


12792 


AC003058 


Arabidopsis 
thaliana 


unknown protein 


86 


52 


12793 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


108 


74 


12794 


M62415 


Pseudopleuronec 
tes americanus 


HPLC6 


82 


35 


12795 


Y36156 


Homo sapiens 


Human secreted protein #28. 


133 


48 


12796 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


119 


74 


12797 


L22030 


Glycine max 


hydroxyproline-rich glycoprotein 


170 


32 


12798 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


114 


43 


12799 


Y76184 


Homo sapiens 


Human secreted protein encoded by 


90 


60 



477 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterraan 


fdentit 










Score 


y 








gene 61. 






12800 


*v /Co ah 
A02677 


Oryctolagus 


retrovirus related reverse transcriptase 


102 


38 






cuniculus 






1O0A1 


/"^AO'CIO 

OU2^32 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


98 


69 








G.CA1 

6613. 






1 OP AO 

12802 


002532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


177 


69 








6613. 






1 Oft A1 

lZoU3 


A T71 lAAttO 


i 

Homo sapiens 


PKU2550 


f\A 

94 


OA 

74 


12804 


S77772 


Homo sapiens 


aspartylglucosaminidase, AGA {C- 


83 


68 








terminal, alternatively spliced} {EC 












3.5.126} 






12805 


AF063866 


Melanoplus 


ORF MSV233 hypothetical protein 


82 


28 






sanguinipes 












entomopoxvirus 








1 OOA£ 

12806 




Homo sapiens 


prostate-specific transglutaminase 


80 


60 


12807 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


47 








4414. 






12808 


X03145 


Homo sapiens 


pot. ORF V 


150 


34 


12809 


LI 0908 


Mus musculus 


Gcapl gene product 


111 


34 


12810 


Y95028 


Homo sapiens 


Human clone vp7_l ORF2, SEQ ID 


170 


55 








NO: 128. 








O02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


62 








6566. 






lOQ 10 

12812 


AJ233591 


Mus musculus 


reverse transcriptase 


287 


67 


10017 


U 82303 


Homo sapiens 


unknown 


149 


52 


IOO 1 A 

12814 


Ar 107406 


Homo sapiens 


GW128 


103 


58 


12815 


U82303 


Homo sapiens 


unknown 


93 


83 


12816 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


161 


54 








6613. 






12817 


B01390 


Homo sapiens 


Neuron-associated protein. 


81 


38 


12818 


AF287482 


Chlorobium 


Orfl22 


174 


69 






tepidum 








12819 


AE003499 


Drosophila 


CG 12706 gene product 


166 


28 






melanogaster 








12820 


M15530 


Homo sapiens 


B-cell growth factor 


124 


71 


12821 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


161 


53 


12822 


G02493 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


150 


77 








65 74. 






10011 

12823 


G00497 


TT 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


66 






4578. 






12824 


AF064597 


Homo sapiens 


LINE-1 like protein 


64 


45 


12825 


X71442 


Rattus 


ORF 1; putative 


113 


45 






norvegicus 








IOOOIC 

12826 


U62039 


Elephantulus 


reverse transcriptase 


O A 

74 


46 






edwardii 








l^&z / 




Homo sapiens 


B-cell growth factor 


1 oo 
12/ 


CA 

54 


10000 
12525 


n(YJ ci /i 
UU2M4 


Homo sapiens 


Human secreted protein, obQ lU MU. 


QA 

94 


O/l 

84 














10000 


TT>£1110 

U0j332 


_ : 

Homo sapiens 


super cysteine rich protein; SCRP 


J7 


j/ 




A "COO AA/IO 

Ar22y0o/ 


Homo sapiens 


FADl-H protem 


1 St A 

184 


c 1 

61 


1 0oo i 
12531 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


69 


r> A 

34 


12832 


A t? 1 A At\C A 

Ar 144054 


Homo sapiens 


apoptosis related protein APR-4 


8j 


A £ 

46 


12833 


G02532 


Homo sapiens 


Human secreted protem, SEQ ID NO: 


168 


61 








6613. 






1Z5J4 




Homo sapiens 


Human se ere tea protem, acv^ iu in\j. 


1 /IO 

142 


AC\ 

4U 








7214. 






12835 


U19098 


Lycopersicon 


proline-rich protein 


81 


38 






chilense 








12836 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


229 


68 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Cmith. 
OQIIUH 

Waterman 
Score 


Identit 

y 


12837 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


81 


60 


12838 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


88 - 


71 


12839 


X72963 


Nicotiana 
tabacum 


pAP8 product 


94 ' 


40 


12840 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


315 


56 


12841 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


94 


55 


12842 


AK025947 


Homo sapiens 


unnamed protein product 


124 


57 


12843 


AF159055 


Homo sapiens 


leucine zipper-like protein 


69 


55 


12844 


AF040257 


Homo sapiens 


TNF receptor homolog 


98 


50 


12845 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


94 


76 


12846 


X88799 


Oryza sativa 


DNA binding protein 


94 


37 


12847 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


98 


48 


12848 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


164 


56 


12849 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


75 


59 


12850 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


122 


54 


12851 


M31964 


Saimiriine 
herpesvirus 2 


collagen-like protein 


79 


50 


12852 


AF1 18082 


Homo sapiens 


PRO1902 


94 


77 


12853 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


139 


54 


12854 


R59842 


Homo sapiens 


ApoE4Ll protease. 


108 


82 


12855 


G03295 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7376. 


99 


66 


12856 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


89 


56 


12857 


D29833 


Homo sapiens 


proline rich peptide P-B 


64 


52 


12858 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


155 


80 


12859 


G02482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6563. 


88 


61 


12860 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


66 


33 


12861 


Y13141 


Bromheadia 
finlaysoniana 


extensin 


75 


47 


12862 


M29622 


Mus musculus 


open reading frame 2 


74 


62 


12863 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


141 


70 


12864 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


117 


54 


12865 


AF090894 


Homo sapiens 


PRO0113 


108 


63 


12866 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


121 


54 


12867 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.8c} 


103 


44 


12868 


S68106 


Ascaris suum, 
Peptide Partial, 
100 aa 


type IV collagen alpha 2 chain, alpha 2 
(IV) {alternatively spliced} 


75 


35 


lzooV 


wUJzoi 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


97 


56 


12870 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


146 


62 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identtt 

y 


128/1 


000412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A f\*5 

4493. 


TO 

78 


76 


12872 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


105 


56 


12873 


AB030234 


Canis familiaris 


D4 dopamine receptor 


61 


58 


12874 


M22332 


Homo sapiens 


unknown protein 


128 


41 


12875 


U05313 


Trypanosoma 
brucei 


CR3 


67 


48 


12876 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


115 


65 


12877 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


66 


54 


12878 


X55684 


Lycopersicon 
esculentum 


extensin (class I) 


78 


34 


12879 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


51 


12880 


AF130079 


Homo sapiens 


PR02852 


125 


75 


12881 


AF090895 


Homo sapiens 


tin AA1 1 

PROOl 17 


142 


59 


12882 


AF181251 


Rattus 
norvegicus 


lung Kruppel-like factor 


93 ] 


39 


12883 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


167 


74 


12884 


M26460 


Homo sapiens 


retinoblastoma 1 


136 


41 


12885 


AJ277557 


Homo sapiens 


mitochondrial 5'(3> 
deoxynbonucleotidase (dNT-2) 


280 


100 


12886 


G03789 


Homo sapiens 


tt a ff * • eyt-*s~\. ti\ vt/\. 

Human secreted protem, SEQ ID NO: 
7870. 


119 


62 


12887 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


145 


43 


12888 


AF119851 


Homo sapiens 


PR01722 


94 


50 


12889 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


121 


39 


12890 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


95 


68 


12891 


AK024455 


Homo sapiens 


FLJ00047 protein 


77 


51 


12892 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6908. 


115 


83 


12893 


L27428 


Homo sapiens 


reverse transcriptase 


89 


29 


12894 


Y01154 


Homo sapiens 


Protem sequence Seq Id 54 from 
WO9901020. 


112 


72 


12895 


G02597 


Homo sapiens 


TT x_ J x • nn/\ TT*v VIA 

Human secreted protein, SEQ ID NO: 
6678. 


72 


62 


12896 


AK023563 


Homo sapiens 


unnamed protein product 


260 


C A 

64 


12897 


G02832 


Homo sapiens . 


Human secreted protein, SEQ ID NO: 
691;>. 


76 


66 


12898 


AC002304 


Arabidopsis 
thaliana 


F14J16.29 


1 1 1 


Jo 




X00445 


Homo sapiens 


f A T~~A 7 

Human gene 42-encoded protein 
iragmeni, oiiv£ ixj invj. jou. 




KX 


12900 


AF1 16638 


Homo sapiens 


PR01546 


123 


57 


l *">Qni 

i/yui 


ABUlUJol 


Mus musculus 


mszr4/ 


(LA 


A1 


12902 


R59842 


Homo sapiens 


ApoE4Ll protease. 


I J\) 


oo 


12903 


L06498 


Homo sapiens 


nbosomal protein 5>20 




DO 


12904 


LI 3635 


Rattus 
norvegicus 


growth response protein 


1 Jo 


/CO 

oy 


loons 


Plt iDoyoi 


Homo sapiens 


gag 




44 


12906 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


207 


67 


12907 


AF107406 


Homo sapiens 


GW128 


99 


50 


.12908 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7881. 


93 


75 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idenlit 

y 


19Q09 


AF9£1^/tfl 


jciomo sapiens 


lYlLDOOO 


1 9AQ 


100 


12910 


Y48292 


Homo sapiens 


Human prostate cancer-associated 

tvr/vl"Ptri 7x 
piULCUl / O. 


60 


50 


12911 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

711Q 

/ Jj7. 


179 


87 


12912 

lx>7 1Z> 


Y09000 


xiomu sdpienb 


Prafnnpnf Ar human c A f rofo e\ nrAfpin 

riagnicni oi numan secreted protein 

MlCArfPfl h\/ CT#»TIP 191 


1 AQ 


7D 

/z 


12913 


272499 


Homo sapiens 


herpesvirus associated ubiquitin- 

bpciiill* piUlCaoC \rir\\JOx) 


5242 


99 


12914 


AF092136 


Homo sapiens 


PTD015 


297 


100 




AF1 A"11S£ 


xiomo sapiens 


w^pr*AOA 

rlorvA/y.3 


1ZJ 


4Z 


12916 


AF044670 


Homo sapiens 


33 kDa Vamp-associated protein; VAP- 


506 


90 


12917 


W27087 


Homo sapiens 


Human transforming growth factor 
aipna run. 


1243 


100 


19018 




no mo sapiens 


riuman normal ovarian tissue uenvea 
protein 84. 


111 
111 


4J 


19010 

1X717 




v^anis iarnuians 


non-nisione cnromosornai protein 
HMG-17 


1 AC 
1U5 


54 


1909A 

lzyzu 


AJvUZojyZ 


Homo sapiens 


unnamed protein product 


1 1 A 

119 


60 


19091 


I l/ZO / 1 


Homo sapiens 


Human secreted protein encoded by 
gene zz. ciune nivioj wio. 


1 Ait 

104 


87 


12922 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
icrniiflai, cione Axiri.ocj 


124 


81 


19091 


Y75MAA cA\ 


iiomo sapiens 


91 MAV 1007 Unman T IPQP 9 rTYKJA 

zi-ivi/v i-iyy/ numan uv^or-z cujna. 


1 AOA 


1 AA 
1UU 


1909d 




nuino sapiens 






jZ 


12925 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

>Mgg 
*t*H>5. 


82 


80 


12926 


AF1 19855 


Homo sapiens 


PR01847 


120 


63 


19097 




Homo sapiens 


T>T> riAi4C1 

rKUU*l-oJ 


AO 

yz 


AA 

y\) 


1909ft 




nomo sapiens 


xiuman secrexea protein, otiv^ id rsij. 
7881. 


1 1 A 

i iy 


OO 


19090 


i y l**/.y 


xiomo sapiens 


Human secreted protein sequence 
encoded by gene 21 SEQ ID NO: 150. 


1 AO 

lUo 


1 Q 


19010 


I IZOOl 


xiomo sapiens 


neuro-endoenne specific protein VGF 


ZjUO 


AA 

yy 


12931 


Y84546 


Homo sapiens 


Amino acid sequence of a fragment of 
human collagen type 1 protein. 


61 


50 


12932 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OjOO. 


.121 


53 


12933 


G00481 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

*r JOZ. 


104 


52 


12934 


n04A79 


riomo sapiens 


numan secreted protein, oe\i lu iskj. 
8153. . 


1 AA 


/CA 
00 


12935 


G03789 

VJ VJ / O/ 


tTATTIA con ifln c 

nuixiu SapiClIb 


xiuuiaii bcoreicu protein, ocy jlu inv/. 
7870 


jj 1 


77 


12936 


M15530 


Hatha QAnif*n<i 


R_/>pl ] oTAwtli far+rkr 




71 


12937 


G00437 


uuiuu oaptdio 


T-fiiman Qprr^lj^fl ATAfpiri RP/"^ TO ^TO* 
xiujiiau owujclcu piULCiii, uidiy xxy l^v/. 

4518 


1*tO 


AA 


12938 


R59842 


Horn a sanien<5 


AnoK4I.l nrnteasp 


119 


61 


12939 


AF090895 


Homo sapiens 


PRO0117 


94 


80 


12940 


X73459 


Rnmn canipnc 


cicrnstl rpmon ifi ati r*s»r+J^*l#* ciiKnnit 1A 
Mgllal ICuO^lILiOll palliulC bUUUllU l*t 




70 


12941 


L23545 


Homo sapiens 


putative 


141 


45 


1904? 


/ITUJ'tI / o 


xiutuu Sapiens 


i^i-d i *r. ja nomoiog 


Z^o 


O/t 

54 




VR6942 

I 00446 


xiomo Sapiens 


riuman secretea protein riwrrrroo, 
SEOJDN0163 


1 1A 
1 10 


ff 
^5 


12944 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


143 


55 


12945 


D00570 


Mus musculus 


open reading frame (251 AA) 


166 


34 


12946 


U62039 


Elephantulus 


reverse transcriptase 


63 


50 



481 
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SEQ ID 
NO: 


Accession No. 


Spcdcs 


Description 


SmiCb- 

Watcrraan 

Score 


% 

Idcntit 

y 






edwardii 








12947 


AE003499 


Drosophila j 
melanogaster 


CG 12706 gene product 


104 


31 


12948 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
/ /9j. 


65 


100 


12949 


X70343 


Nicotiana 
sylvestris 


extensin 


102 


38 






Homo sapiens 


Human secreted protein, 5bQ ID NO: 
001 j. 


103 


AC 

45 


12951 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
/o/u. 


85 


56 


12952 


AF090895 


Homo sapiens 


PRO0117 


J39 


68 


12953 


AF090896 


Homo sapiens 


tyt> r\i\ 101 

PRO0131 


A*2 

93 


or 

85 


12954 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


101 


75 


12955 


M96982 


Homo sapiens 


U2 snRNP auxiliary factor small * 
subunit 


292 


08 


12956 


X71087 


Homo sapiens 


monocyte chemoattractant protein 


408 


96 


12957 


S80864 


Homo sapiens 


cytochrome c-like polypeptide 


591 


68 


12958 


AF 109907 


Homo sapiens 


O 1 £A 

S164 


1 00 
188 


63 


12959 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


173 


50 


12960 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


168 


62 


12961 


GO j 801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


98 


56 


12962 


S75895 


-77- — ^ • — 

Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


106 


O A 

84 


12963 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


125 


44 


12964 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


107 


57 


12965 


P70494 


Homo sapiens 


Sequence of human B-cell growth 
tactor (dCw j. 


71 


65 [ 


i29oo 


A /"'AAAOCO 

ACUU9o5J 


Arabidopsis 
thaliana 


hypothetical protein 


1 ivy 


A A 

44 


1 1A/C7 


Ar 090942 


Homo sapiens 


rRU0o57 


99 


62 


12968 


M15530 


Homo sapiens 


B-celi growth factor 


90 


65 


12969 


a i"" Art a one 

AF090895 


Homo sapiens 


T»r> aa in 

PRO0117 


83 


54 


12970 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


119 


54 


12971 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


123 


57 


12972 


L32558 


Homo sapiens 


sequence is expressed in human Tera-2 
clone 13 (embryonal carcinoma) cells. 
Hie sequence may contain mismatches 
(one strand sequenced only once). 97% 
identical in 320 bp overlap with human 
54 kDA prot; ORF 


426 


85 


12973 


AF008196 


Homo sapiens 


bax epsilon 


78 


87 


1 "">A*7/I 

12974 


Y20717 


Homo sapiens 


Human neurofilament-M wild type 
protein fragment 59. 


9o 




12975 


AL031848 


Homo sapiens 


OJ202O8.2.2 (novel rodent HES2 (hairy 
and Enhancer of Split 2) LIKE protein 
^lsoionn z.)) 


I/O 


1 AA 
100 


12976 


AJ245905 


Chlorocebus 
aethiops 


HSBPl-like protein 


94 


100 


12977 


AF067519 


Homo sapiens 


PITSLRE protein kinase beta SV1 
isoform 


1923 


93 



482 



WO 01/64835 
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SEQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Idenrtt 










Score 


y 


12978 


G01807 


T-Trunn canipnc 
IJUIIHJ SopiOUS 


Human secreted protein, oty iu ino. 


loo 


73 








/ooo. 






12979 


X52164 


ft/fnc miicriiliic 

lVlilo UlUjtUlUj 




1Uj 


39 


12980 


Y13141 


OrOIIlU Gail I a 


extensin 




36 






fiTilavcnn ism a 
lull a. jr 0 Ulilaild 








19081 


O0/?71/l 

VJVJ / l*f 


xiomo sapiens 


Human secreted protein, SEvj ID NO: 


87 


77 








770 c 






12982 


G099 1 1 


norno Sapiens 


tiuman secreieu protein, obvj id jno: 


83 


89 








6909 

oz?z. 






12983 


G00397 

VJUVJ7/ 


JTiUIIlO sapiens 


rium an secreieu protein, ocy iu jnvj. 


yz 


4o 








A AH St 






17984. 


AF1 136P.5 

r\T 1 1 J DoJ 


norno sapiens 




nt 


67 


19985 


in 108^ 

vJ IvrOO 


VJallUa gaUUS 


neuron^glia adhesion molecule 




50 


19986 
1Z.700 




norno Sapiens 


xiuman secreteo protem, oe\£ id 


132 


68 








/o/U. 






19987 




nt>mt> Sapiens 


Human secreted protem, bhyz lu N\X 


n i 

91 


71 








aaii 






199RR 


x z l oo** 


norno Sapiens 


Human secreted protein encoded by 


135 


81 








gene jno. lui. 






199X9 


ARft9Qft/19 
AOUX7UHZ 


norno Sapiens 


ATPase inhibitor precursor 


J 64 


100 


IZ77V 


nAAAA9 


Homo sapiens 


Human secreted protein, SfcQ ID NO: 


174 


72 














19991 
1Z771 




— 7 

Homo sapiens 


Human secreted protein, bliQ ID NO: 


174 


57 








'J-/ 15. 






19997 


Gno/*09 

vjvi/jyz 


ciomo sapiens 


Human secreted protein, obvj ID NO: 


1 CO 

158 


57 








AATX 






12993 

1Xi/7^ 


AF1 1 8082 


nomo ■sapiens 


rKU 1 y UZ 


/ / 


oo 
78 


12994 


G00637 


norno sapiens 


Human secreieu protein, ocv^ ijj jno. 


/O 


^A 








/171 r 






12995 


Ml 5530 


nomo sapiens 


o-ceu growui ractor 


1 AC 

1U5 


70 


12996 


A T00SS69 

/\ J VvJJ Ox. 


R/fitc tniicoiiliie 

ivjub muscuius 


orrvzD protein 


1 f\A 

1U4 


36 


12997 


Y4R346 


norno sapiens 


Human prostate cancer-associated 


184 


77 








procein *t j. 






12998 


Y02671 


norno sapiens 


Human secreieu protein encoded oy 


CO 

oz 


53 








gene zz cione hjvioJ wis. 






12999 

1*777 


T 1 1645 


xiomo sapiens 


alpha-tubulin 


1 Q£. 

iyo 


iCO 

68 


13000 


AF1 13685 


nuiiio sapiens 




101 
IZJ 






<XR799 

OJO /ZZ 


Homo sapiens 


A-lmKed retmopatny protein |C- 


TO 

78 


53 








tenriinai, cione AJcH-oc) 






13002 




ivius muscuius 


oJrxvzr protem 


/o 


44 


13003 


G00397 

UUUJ7/ 


norno sapiens 


Human secreieu protein, oiii^ iu InLa 


1 Al 
1U1 


/O 














13004 


AT 1S9I7K 


oircptomyccs 


putative secreted protein 


OA 


37 






oriplirrilor A'3f9^ 
vUCuvOlUT r\.D \A j 








13005 


T 77967 


v/VIS al ICS 


email mv\lma n/>n r»**rtf^a»rt ii/tm *%<x\v&/A 

sman proiine-ncn protein wim paireQ 


04 










repeal 






13006 


G03243 


nomo sapiens 


Human secreiea proiein, ocrVc uu vi\J, 


«c. 


7A 
/U 








7194 






13007 




Hrtnin cstnipnc 
xiujluu aapiviio 


XXL/ IVIiJ^t «J Jr 


9<17 
Zj / 


Oz 


13008 


G00389 


xi omo sapiens 


Uiimon ctar"r£i+0(-l ttrAfnin CCA TT\ \TA. 

Human secreieu protein, z>ca^i id jno. 


lOQ 

lzo 


CA i 

50 








4463 






13009 


AFI440S4 


nuuiu sapiens 


apopiosis relaxed protem Ariv-^f 


1 A A 

1UU 


*7C 

/5 


13010 


G03453 


noiiio sapiens 


Human secreieu protem, oty lu jno. 


1 AQ 

14o 


£.1 
Of 














13011 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


152 


66 








clone HPMBQ32. 






13012 


B01390 


Homo sapiens 


Neuron-associated protein. 


99 


32 


13013 


A27282 


Homo sapiens 


TGR-CL3C 


69 


42 


13014 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


46 



483 



WO 01/64835 • PCT/US01/04927 



SCQID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 












y 








4770. 






13015 


AF 130089 


Homo saoien<i 


PRO95S0 


oo 


Aft 1 


13016 


AF266223 


Gillichthys 


riho^omal nir>tfMn 1 97 

l ll/V7ouiliai JJ1 wl v>lil \jg»j 


122 


^O 






mirabilis 








13017 


S79410 


Mus mu < 5ciilu'i 

I'iww vjlaI**> 


niirlftar lncali7atinn ^ional HinHincr 
llUVvllsCU luvaiizxiiiuii oigiiai 17 111 U 111 £ 


116 


AO 








nnofpin 

UlUlwIU 






13018 


U82303 


Homo sanien«> 


unknown 




60 


13019 


G03714 


Homo ^aniens 


Human <y*rrptftrl nrotf*tn *\PO IT) 

IXlUlloll Ovid t/lt/vl yjkxJIXulxlj JOy 1JL/ 1 vv . 


7fl 


^7 








7705 






13020 


G04078 


Homo <ianipTiQ 


Human cprrptpM nrntpin <IP(1 \T\ TvJfV 

JTlUl lldll MsVl CICLl JJlVJlOlil, ODy 1JL7 llVJ. 


160 


Q1 

y l 








8159. 






13021 




uuiiiu Dapiv/iio 


alpila glUUUl 


on 


1 AO 
1UU 


13022 


G02532 




Human QP/»rptpH nrntpin QPH fO TsJfY 
XI. Ui 11 oil bcvlclCU piUlCUl, ODy xxJ v\\J* 


1HO 


^O 








6611 
QUI J. 






13023 


vVVJO 1 




rmm<m sccreicu protein, oc^ LL7 rsv-i. 


100 

iuy 


<50 








4462 






13024 


G03472 


Hrtnrtrt cnnipnc 

XA\7111V7 bCtpiCllO 


Human cprrpfpH nrntpin \Ti MH* 


10 i 


RA 








7^1 






13025 


U82303 


Wftnirt en ntpn c 

A 11711 1VJ 2>0|71Cll0 


UIUSJ1UW11 


£6 




13026 


Y099QO 
i uz.yyy 


nomo Sapiens 


rragruenc or numan secreteu proicin 




/C1 
Ol 








cucoueu oy gene izi- 






13027 


G00689 

VJVVUO/ 


nouio sapiens 


numan secreteu proicui, o&ki id vskj. 




<o 








A770 






13028 


G03714 


U/\m t\ com ah c 

nun iu sapicno 


nunian secreiea proieui, oxiv^ il/ inv/. 












770^ 






13029 


Y84546 


XAU111U bapic-llii 


^vmino acia sccjuenee 01 a uagmeni or 


<;o 


40 








Tinman rrtllsicypn h/np 1 nrotpin 
li ui ii oil wuagvii type i piuiciii. 






13030 


Y07671 


nuiuu oapiciib 


XXntn !)n opfrptpH nrA^Atn pr»r*Arl*irl 

rruiiiaxi secreiea proieui cneoueu oy 


06 
5/0 


05 








eene 22 clone HMSJW18 






13031 


S71494 


A/|i iq en 
iviuo 517. 


OULL7 ilUlUOlUg ^VJiy-/\Tg I cpcaly 




JO 


13032 


AF1 1368/5 


nviiiv sapiens 


X XVVA7jf /*J 


JUU 


A Q 


13033 


D38112 


xiumvi oapiciio 


xHjt\uxx ucnyurogcQase suouni( *♦!-» 






13034 


AL3901 14 


T pidirruifiis* 
J-fClolluiallla 


cALiciiiciy cyoteine/ vaune ncn prucein 


17» 
1 Jo 


41 






major 








13035 


G01129 


X X\J1 11 VJ DCipit/lIO 


Hum an ^erreteH nmfein ID "NO* 

JLJLUUiail OtUtltU piUtOlll, OJL#V^ UL7 




ftl 
o 1 








5210. 






13036 


X66285 


Mil*? musculo^ 


HC1 ORF 

XX.\s 1 V-/1VJL 


103 


A A 
*t r 


13037 


AF026689 


Homo <:anipn^ 

1 1 VlllvF oClLrlG'llO 


nrAQtatp-^np^ifip tranccliitaminaop 

pi UdlOlG "dJLTCV/l 1 1 11 Olid gl U LOU! 11 lOoC 


on 

717 


JJ 


13038 


U47924 


J. -IV/ 11117 DCLjJl^/llO 


RPI 1 

X\X Xj X J t- 




6^ 


13039 


AF090942 


Hatyio cJir\ipTic 

X X\JXXl\J jflUlCUo 


X XS\J\J\lJ f 


111 

•111 




13040 


rvivWiCij 1 1 V7 


r-lATTiA citm^nc 
Xxuiiiu oapiCilo 


uiiiiaiiicu prv/iein pr uuuci 


1 j 1 




13041 


G00403 


Homn oatiipn<: 

11V7111V7 OCljplVllO 


Human QftrrpfpH nrnfpm RFO XTi "WO' 

XlUlliaii SCvlvlvU piUlClllj OLy 11_>* 1NV7. 


09 


79 








4484. 






13042 


Ml 1759 


Lveonercicon 


cpH wall hvHroxvnrnlinp-rich 

vCll VVOJ1 UV KJX V7Ajrpi villit/ 1 lv*ll 


100 


60 






esculentum 


fflvcoorotein 






13043 


AF130089 


Homo santerK 


PRO2550 


109 

1U7 


45 


13044 


G02879 


Homo saniens 

llVUiv OUL/l^fllO 


Human <»p-creteH nrotein SKO ID T*JO* 

llUllluli JVA^IUIUU ^JlV7iVlllj Uijy JUL-' l^lv/. 


116 

X XX) 


59 








6960. 






13045 


Y02671 


Homo saniens 


Human secreted nrotein encoded bv 


162 


70 








pene 22 clone HMSJW1 8 






13046 


L10908 


Mus mu*5f*iilii^ 

iriUd lllUOwUlUi) 


Gi*anl ppnp nrodnrt 

VJ \zCL^J 1 c^Xl^ Ul UUUvl 


162 


40 

*t\7 


13047 


L02321 


Horn a cunipnc 


ohitathionp ^-tranQfpra^e nSTM5-^ 


1079 

1 V7 §4. 




13048 


Y08062 


nomo Sapiens 


Unman DD ^ nrntotn Iraffmpnf 

rtuman nvv/z < r3 proieiu uagnicnt 


»0 


Aft 
OU 








derived from DNA35638 






13049 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO; 


153 


60 








7075. 






13050 


U33547 


Homo sapiens 


MHC class II antigen 


69 


85 


13051 


AC006014 


Homo sapiens 


similar to mismatch repair proteins; 


652 


91 








similar to PIDrg 1304 125 







484 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


13052 


A27282 


Homo sapiens 


TGR-CL3C 


65 


51 


13053 


Y69166 


Homo sapiens 


A mature human N-acetylglycosaminyl 


106 


60 








transferase protein. 






13054 


M20030 


Homo sapiens 


small proline rich protein 


57 


33 


13055 


AF022117 


Balaena 


metallothionein 


119 


62 






mysticetus 








13056 


Ml 1759 


Lycopersicon 


cell wall hydroxyproline-rich 


78 


43 






esculentum 


glycoprotein 






1:>057 


R20305 


Homo sapiens 


Val(l) to Met, His(2) deleted, Ala(76) 


274 


AO 

98 








to Lys beta-globin mutant 






13058 


X55686 


Lycopersicon 


extensin (class II) 


56 


39 






esculentum 








13059 


U 10696 


Zea mays 


Ec metallothionein class II protein 


00 


26 


13060 


Y40417 


Homo sapiens 


A human N-acetylneuraminate lyase 


1183 


OA 

80 








/|_\T A "V TT \ " 

(hNANL) protein. 






13061 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


41 








01 CO 

8153. 






13062 


AF002210 


Homo sapiens 


copper chaperone for superoxide 


702 


61 








dismutase 






13063 


AF 11 8086 


Homo sapiens 


PRO 1 992 


69 


OA 

84 


13064 


M24097 


Homo sapiens 


MHC HLA-C-alpha-2 chain 


1550 


96 


13065 


AC003113 


Arabidopsis 


F2401.6 


51 


40 






maliana 








13066 


AF161531 


Homo sapiens 


HSPC046 


488 


97 


13067 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


141 


50 








66 19. 






1306S 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


116 


46 






brucei 


complete virion genome. 






13069 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


145 


69 








gene No. 107. 






13070 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 


224 


71 








1T\ XTTV 1 A3 

ID NO: 103. 






13071 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 a 

153 


52 








6913. 






13072 


ArZ 16965 


Homo sapiens 


ancient conserved domain protein 3 


0 /o 


1 An 
lUU 


13073 


Y15917 


Homo sapiens 


COL1A1 and PDGFB fusion transcript 


1 AO 

108 


35 


13074 


X77664 


Homo sapiens 


retinoic acid receptor beta isoform 1 


53 


71 


13075 


AF156961 


Homo sapiens 


gag 


136 


37 


13076 


R59842 


Homo sapiens 


ApoE4Ll protease. 


117 


75 


13077 


G03789 


Homo sapiens 


Human secreted proteui, SEQ ID NO: 


121 


65 








7870. 






13078 


U23183 


Caenorhabditis 


gene lies in inverted repeat and exon 1 


100 


79 






elegans 


overlaps tRNA; may be pseudogene 






13079 


003714 


Homo sapiens 


Human secreted protein, obQ ID NO: 


125 


56 








7795. 






1 OAOA 

UOoO 


O0253 O 


Homo sapiens 


Human secreted protein, obQ W NO: 


lio 


65 








6619. 






1 1 no 1 

13081 


Y 12473 


Homo sapiens 


centrin 


*> <A 

-?59 


79 


1JU82 


/^l A K1Q 
O01518 


Homo sapiens 


Human secreted protein, ocQ 1U NO. 


J /5 


V8 








c£aa 

5599. 






13083 


Y57891 


Homo sapiens 


Human transmembrane protein 


1 A^A 

1030 


AA 

99 








H1MPN-15. 






13084 


AB046765 


Homo sapiens 


KIAA1545 protein 


161 


93 


13085 


X55686 


Lycopersicon 


extensm (class II) 


66 


40 






escuieniuin 








13086 


AF130079 


Homo sapiens 


PR02852 


137 


45 


13087 


K01664 


Drosophila 


Bkm-like protein 


86 


33 






melanogaster 








13088 


G03812 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


44 



485 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


Identit 

y 








7893. 






13089 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


114 


63 


1 "7 AO A 


AT 1 O 1 Arte 


Homo sapiens 


dJ534B8.3 (novel protein similar to an 
aspartic protease) 


963 


100 


.13091 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


95 


47 


13092 


AB030236 


Canis familiaris 


D4 dopamine receptor 


65 


38 




Ar 161356 


Homo sapiens 


HSPC093 


119 


48 


13094 


Z14014 


Nicotiana 
tabacum 


Pistil extensin like protein, partial CDS 
only 


67 


50 


13095 


G01249 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5330. 


124 


56 


13096 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


159 


73 


13098 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


62 


13099 


AF130051 


Homo sapiens 


PRO0898 


194 


61 


13100 


YS6445 


Homo sapiens 


Human gene 42-encoded protein 
fragment, SEQ ID NO:360. 


80 


70 


13101 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


111 


43 


13102 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


120 


71 


13103 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


116 


63 


13104 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein (nuclear ma-binding 
protein) 


90 


64 


13105 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


176 


56 


13106 


U23455 


Caenorhabditis 
elegans 


similar to D. melanogaster homeotic 
protein BarH2 (PIR:A41726) 


101 


75 


13107 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


100 


65 


13108 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


137 


77 


13109 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


94 


88 


13110 


AF1 19855 


Homo sapiens 


PR01847 


103 


49 


i<)1li 

13111 


AF1 19851 


Homo sapiens 


PR01722 


142 


80 


13112 


Y00311 


Homo sapiens 


Human secreted protein encoded by 
gene 54. 


107 


51 


13113 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


104 


54 


13114 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


144 


71 


1 1 1 1 C 


G03356 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
7437. 


138 


50 


l J i lo 


UUU397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


110 


56 


1-51 17 


AJv0z5o53 


Homo sapiens 


unnamed protein product 


188 


100 


13118 


AF130089 


Homo sapiens 


PRO2550 


129 


68 


111 f A 

13119 


AF161356 


Homo sapiens 


HSPC093 


131 


59 






Homo sapiens 


j-Juman secreted protem, i>bQ 1U NO: 
7879. 


121 


45 


13121 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


82 


55 


13122 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


87 


45 



486 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 






Waterman 


Zdentit 










Score 


y 








4718. 






13123 


Y38394 


Homo sapiens 


Human secreted protein encoded by 


213 


100 








gene No. 9. 






13124 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


55 








8153. 






13125 


AF1 19855 


Homo sapiens 


PR01847 


109 


45 


13126 


R95913 


Homo sapiens 


Neural thread protein. 


137 


47 


13127 


BO 1390 


Homo sapiens 


Neuron-associated protein. 


101 


47 


13128 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


122 


67 


13129 


G03263 


Homo sapiens 


Human secreted protein, SEQ ED NO; 


105 


52 








7344. 






13130 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


170 


72 








6613. 






13131 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


500 


90 


13132 


AF138957 


Bos taurus 


type II collogen cyanogen bromide 


99 


38 








fragment CB8 






13133 


AF084256 


Homo sapiens 


beta glucuronidase isoform d 


157 


69 


13134 


U92817 


Homo sapiens 


unnamed HERV-H protein 


113 


48 


13135 


Y19743 


Homo sapiens 


SEQ ID NO 461 from W09922243. 


967 


99 


13136 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


52 








7492. 






13137 


AF090928 


Homo sapiens 


PRO0470 


133 


69 


13138 


AF090931 


Homo sapiens 


PRO0483 


150 


60 


13139 


AF1 16661 


Homo sapiens 


PR01438 - 


146 


57 


13140 


AL022318 


Homo sapiens 


bK150C2.2 (Phorbolin3) 


443 


56 


13141 


S79410 


Mii<* Tmi<iCiilii*; 

XVI IIS ItlLLoVUlLto 


nuclear localization dismal binding 


114 


80 








nrotein 






13142 


AF05598^ 

iTJL UJJ70J 


Onchncprca 


nvrrolidonp-rich antio-pn 


86 


43 






volvulus 








13143 


G03714 


Homo satiien*; 


Human secreted nrotein SEO ID NO* 


130 


62 








7795. 






13144 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


66 








6619. 






13145 


G03714 


Homo saniens 


Human secreted nrotein SEO ID NO* 


123 


71 








7795. 






13146 


AF107406 


Homo sapiens 


GW128 


120 


50 


13147 


L10908 


Mus musculus 


Gcapl gene product 


91 


42 


13148 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


113 


66 








RHAla(AF078683) 






13149 


S58722 


ITnmA CAniPT1<£ 


y-Hnlf^H rfttinonathv orotein (C- 


119 


67 








terminal, clone XEH.8c} 






13150 


Y27868 


Homo saoien*; 


Human secreted nrotein encoded bv 


168 


86 








gene No. 107. 






13151 


G00418 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


74 


58 








4499. 






13152 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


61 








4478. 






13153 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 


131 


67 








clone HCACJ81. 






13154 


G02872 


Homo saniens 


Human secreted protein, SEQ ID NO: 


113 


54 








6953. 






13155 


AP000616 


Orvza sativa 


similar to RING-H2 finger protein 


127 


84 








RHAla(AF078683) 






13156 


Y02671 


Homo sapiens 


Human secreted protein encoded by 


167 


80 








gene 22 clone HMSJW1 8. 






13157 


Y14482 


Homo sapiens 


Fragment of human secreted protein 


125 


68 








encoded by gene 17. 






13158 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


514 


71 


13159 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


130 


66 



487 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 














13160 


AF1 16661 


Homo sapiens 


PR01438 


152 


63 


13161 


ABG46048 




unnamea ponem prouucc 


1Uj 


C""> 

5^ 


13162 


AF1 07406 






1 so 




13163 


Y02999 


Homo sapiens 


Fragment of human secreted protein 

f±*\f*t\f\{*r\ \wi (Ton 1*71 

cncuucu uy gene izi. 


132 


59 


13164 


M15530 


Homo sapiens 


B-cell growth factor 


97 , 


80 


13165 




XTUU1U Sapiens 


salivary proiine-ncn proiein l 


0/ 


io 


13166 


Y02671 


Homo sapiens 


Human secreted protein encoded by 

rrana "}") r*\r\na T_TT\/f C 1\1/1 Q 

gene cione ruvioj wis. 


126 


57 


i J 1 v/ 


AT 1^07 


i rypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


95 


53 


13168 




noruo sapiens 


Human secreted protein, SEQ ID NO: 

440O. 


yo 


57 


13169 


AL049795 


Homo sapiens 


(1J622L5.7.2 (novel protein (isoform 2)) 


250 


95 






tionio sapiens 


Human secreted protein, bbv2 LU NO: 
4447. 


1 1 c 

116 


56 


11171 


J-/ 1 V7UO 


Mus musculus 


Gcapl gene product 


75 


53 


13172 




xiomo sapiens 


NF2 == neurofibromatosis type 2 
{alternatively spliced, form A4} 


160 


73 


13173 




vjiyzd. sauva 


similar to kjjNO-Hz ringer protein 
KJnA 1 a { Ar U / o oo3J 


134 


76 


13174 


A52568 




Jnivivji'*^ 


9 A 


44 


13175 


G03807 


nwflo sapiens 


Human secreieo protein, ofcv^ lJJ jnu. 

/ooo. 


Ol 

y i 


/in 
49 


13176 


AK025116 




unnorneu proiem prouuci 


1 JU 


44 


13177 


AF090901 


Homo sapiens 


PRO0195 


103 


84 


13178 




xiomo sapiens 


rKUUl 1 j 


92 


62 


13179 


AF1 30079 


Homo sapiens 


PR02852 


118 


38 


11180 

lJ iOV/ 




Homo sapiens 


unnamed protein product 


141 


67 


111 R1 


V0700O 
I u.£yyy 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


87 


59 




Arrinift^R 

f\\Ai\JJVDo 


Araoioopsis 
uiuii<md 


unknown protein 


220 


74 


13183 


S79980 


Bos taurus 


ribosomal protein L37 


186 


94 


13184 




v/ryza sauva 


similar to KJLrsvj-rLd ringer protein 
RHAla(AF078683) 


113 


90 




AK1 <W*7 


r lsum sativum 


extensm 


95 


36 


13187 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


90 


78 


13188 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/OOO. 


113 


79 


13189 


G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

H^t JO- 


158 


53 


13190 


Y02671 




MiirnQti cf»/'*r/i'f'On nrAfom ^n^AHo/i r\-< r 

numdm secreieu proiem encoueo oy 
gene 22 clone HMSJW18. 




j9 


13191 




\JijLiX Saliva 


similar io ivirio-riz imger protein 
RHAla(AF078683) 


131 


66 


13192 


D181 19 


nomo sapiens 


lN/vL/ri uenyarogenase suounit J 




50 


13193 


Z66499 


Caenorhabditis 

cicgallS 


T01B7.8 


127 


35 


13194 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/o / U. 


126 


61 


13195 


Y13141 


Bromheadia 
finlaysoniana 


ex tens in 


58 


*tj 


33196 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


95 


76 


13197 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


116 


60 



488 



WO 01/64835 



PCT/USO 1/04927 



SEQID 
NO: 

i 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


13196 


X58236 


Homo sapiens 


36/8-8 fusion protein with epitope for 
anti-lectin antibody 


98 


41 


13199 


Y01158 


Homo sapiens 


Secreted protem encoded by gene 1 8 
clone HCACJ81. 


102 


58 


13200 


AF220264 


Homo sapiens 


MOST-1 


141 


45 


13201 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


112 


58 


13202 


Y17221 


Homo sapiens 


Human secreted protein (clone fle3 1 7- 
3). 


115 


57 


13203 


W50192 


Homo sapiens 


Amino acid sequence of salivary j 
protein CON-1. 


91 


39 


13204 


G03714 


Homo sapiens 


Human secreted protem, SEQ ID NO: i 
7795. 


126 


65 


13205 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


98 


68 


13206 


M15530 


Homo sapiens 


B-cell growth factor 


105 


76 


13207 


W48927 


Homo sapiens 


Schwannomin-binding protein C- 
terminal fragment 


84 


45 


13208 


L27428 


Homo sapiens 


reverse transcriptase 


205 


49 


13209 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
tenninal, clone XEH.8c} 


77 


57 


13210 


M81321 


Macaca 
fascicularis 


proline-rich protein 


104 


48 


13211 


X55685 


Lycopersicon 
esculentum 


extensin (class 0 


122 


33 


13212 


Ml 5530 


Homo sapiens 


B-cell growth factor 


159 


61 


13213 


AF130089 


Homo sapiens 


PRO2550 


126 


56 


13214 


AF210651 


Homo sapiens 


NAG18 


125 


68 ! 


13215 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


214 


76 


13216 


GO 1246 


TT * 

Homo sapiens 


Human secreted protem, SEQ ID NO: 
5327. 


157 


70 


13217 


R59842 


Homo sapiens 


ApoE4Ll protease. 


102 


72 


13218 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


123 


37 


13219 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


101 


55 


13220 


D14167 


Gallus gallus 


ribosomal protein L37a 


126 j 


40 


13221 


W88627 


Homo sapiens 


Secreted protem encoded by gene 94 
clone HPMBQ32. 


178 


82 


13222 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

*f> A A 

7344. 


141 


55 




LlUyuo 




Mus musculus 


Gcapl gene product 


114 


53 


13224 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4L 


216 


73 




003259 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
7340. 


104 


76 


13226 


AF130079 


Homo sapiens 


PR02852 


158 


66 


13227 


003714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


125 


60 


13228 


S77772 


Homo sapiens 


asparrylglucosaminidase, AGA {C- 
terminal, alternatively sphced} {EC 
3.5.1.26} 


100 


46 


13229 


G02532 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
6613. 


156 


72 






Homo sapiens 


B-cell growth iactor 


162 


72 


13231 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


161 


66 


13232 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


80 


52 



489 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


1 'lOI'i 


vjU3 /yu 


Homo sapiens 


Human secreted proteui, bbQ ID NO: 


1 0£ 
IZO 


51 








7871. 






1 101A 


Ar 1ZD103 


Homo sapiens 


HHLA3 proteui 


1 1 1 
131 


78 




I UZooo 


Homo sapiens 


Fragment of human secreted protein 


JZ3 


85 








encoaeo oy gene yu. 








AJ5UU lOo4 


cniorelia 


f\T*T?A Al_ 

ORF49b 




45 






vulgaris 








1323/ 


AAt C.C2A 


Homo sapiens 


B-cell growth factor 


1 1 A 
1 14 


66 


13238 


AL3 55929 


Neurospora 


conserved hypothetical protein 


99 


46 






crassa 








13Z39 


ArOOUolo 


Oryza sativa 


similar to RING-H2 finger protein 


114 


CI 

63 








RHAla (AF078683) 






1 1<*iAI\ 

13240 


579410 


Mus musculus 


nuclear localization signal binding 


1 1 A 
1 10 


yf A 

40 








protein 






13241 


AF1 19851 


Homo sapiens 


T%Tl f\ 1 'fit 

PRO 1722 


1 ^O 

167 


63 


13242 


S79410 


Mus musculus 


nuclear localization signal binding 


147 


68 








protein 






13243 


U28971 


Caenorhabditis 


similar to RD tandem repeat region of 


95 


65 






elegans 


RD protein (nuclear ma-binding 












protein) 






13244 


W4835 1 


Homo sapiens 


Human breast cancer related protein 


f\{\ 

99 


84 








BCRB2. 






1324> 


vi /iAn 
Y 1443/ 


- . 

Homo sapiens 


Human secreted protein encoded by 


104 


^0 
62 








gene 27 clone HbA WAZ7. 






1 JZ*H> 


AT 1O1300 


Homo sapiens 


HorC093 


1 lo 


0/ 


1324/ 


X61046 


Hydra sp. 


mini-collagen 


f AO 

108 


/♦A 

49 


13245 


VOOooZ 


Homo sapiens 


URF 3 (NADH dehydrogenase subunit) 


A CO 

458 


OA 

89 


13249 


xxi i t\r\t\ 

Ml 1900 


Mus musculus 


15-kDa proline-rich salivary protein 


92 


37 


13250 


ABO 17362 


Bombyx mori 


fibroin H-chain 


70 


43 


13251 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


58 








6613. 






13252 


G00360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


58 








A A A 1 

4441. 






13253 


AF210651 


Homo sapiens 


NAG18 


151 


73 


13254 


AF224494 


Mus musculus 


arsenite inducible RNA associated 


299 


50 








protein 






13255 


AK026107 


Homo sapiens 


unnamed protein product 


102 


60 


13256 


Y02671 


Homo sapiens 


Human secreted proteui encoded by 


113 


61 








gene 22 clone HMMW18. 






13257 


a r/nn a a c c 

AK024455 


Homo sapiens 


FL J00047 proteui 


1 15 


59 


13258 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


155 


51 








„ MA \r„ 1A*7 

gene No. 107. 






13Z39 


AJ251579 


Arabidopsis 


cef protein 


115 


OA 

39 






thai i an a 








uzou 


T HQ. ATI \ 
JL/oO/1 


Homo sapiens 


— ; ; 

Coxll/D-loop DNA fusion protein 


JOU 


00 
oZ 


1 J ZD I 


VjUZo /z 


Homo sapiens 


numan se ere tea protein, oea^j uu invj: 


1 j1 


ol 














1 jZOZ 


TY1 Olio 

JL>3ollZ 


Homo sapiens 


in AUH oenyarogenase suounit 3 


413 


0/ 


1JZ03 


L/23001 


Homo sapiens 


ribosomal protein L37 


48/ 


94 


13Z64 


iT?n AAOA 

AF 130079 


Homo sapiens 


PR02852 


1 CI 

153 


43 


13Z05 


Y028SO 


Homo sapiens 


Fragment of human secreted protein 


1 AO 

109 


61 








encoded by gene 90. 






13266 


Y0H54 


Homo sapiens 


Protein sequence Seq Id 54 from 


133 


79 








YI Tt~\t\{\i\ 1 /VIA 

WO9901020. 






i jZo/ 


r mono 


— _ 

Mus musculus 


Gcapl gene product 


1U3 


43 


13268 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


151 


70 








6613. 






13269 


_ G04064 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


51 








8145. 







490 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


13270 


AF 130089 


Homo sapiens 


PRO2550 


95 


47 




API 1Aft7< 


Homo sapiens 


rKUZ->JZ 


90 


60 


13777 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


10j 


54 


1 jZ / J 




riomo sapiens 


Human secreteu protein, oto id NO: 
4484. 


77 


A 1 

91 




API mnfco 


riomo sapiens 


t>d/v><:<a 
rKUZjDU 


104 


57 


UZ / j 


r5A7/iio 
vjvzojy 


Homo sapiens 


Human secreted protein, bbQ ID NO: 
6720. 


94 


45 


1->Z /□ 


nn77<\i 


Homo sapiens 


Human secreted protein, bhQ lu NO: 
6834. 


122 


52 




API 13£SK 


Homo sapiens 


JrKUU9 /4 


no 


36 


13278 


W48351 


Homo sapiens 


Human breast cancer related protein 

ovJKJbSZ. 


85 


68 


1 jz/y 


OUZjl J 


Homo sapiens 


Human secreted protein, bEQ ID NO: 
6596. 


139 


OA 

89 


1 37 £A 


vjunzv 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5210. 


243 


79 


1 17111 


A PI 1 #C71 < 
ATI ID/ID 


Homo sapiens 


PR02829 


1 AO 

108 


56 


13282 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


105 


60 


13283 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


194 


78 


1 17 5M 
1 J/64 


AT 1^0751 


Trypanosoma 
brucei 


probable granule cell antiserum 
positive 8 


1 Ail 

104 


47 


now 




Mus musculus 


nuclear localization signal binding 
protein 


104 


72 


1T?Ra" 


ApiinnsQ 


— ; 

Homo sapiens 


rKUZjjU 




58 


13287 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
00 ly. 


133 


58 


13288 


AF1 19855 


Homo sapiens 


PR01847 


115 


45 


1 17 RO 


API £1 I^C 


Homo sapiens 


HSPC093 


111 

131 


50 


13290 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


104 


43 


1 1701 


1VU10O4 


Drosophila 
melanogaster 


Bkm-like protem 


1 12 


49 




nnni'77 
OUU J / / 


Homo sapiens 


Human secreted protein, obO. ID NO. 
4458. 


1 CA 

150 


69 


11701 


N/flAA/17 
iVLjCKKf / 


Homo sapiens 


mitochondrial hinge protein precursor 


284 


76 


13294 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


444 j 


85 


119QC 

1 jzyj 


JJjol 1Z 


Homo sapiens 


cytochrome c oxidase subunit 3 


633 


A*> 

93 


13296 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene zz cione invisj wio. 


164 


76 


13297 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


147 


SO 


1 JZ70 


VI 11A1 


pro m neaciin 

tin 1 Qt/cnn 1 on 0 

iiJiiay5uiijd.ua 


extensin 




38 


1329Q 


Y01400 


TJatyia cnnipnc 

xiuixiu cKipicnb 


oecrecea protein encouea oy gene i o 
clone HNHF029. 




AQ 


J JJvU 


APiinn7Q 


riuiiiu Sapiens 


JrlvVJZoDZ 




ou 


13301 


AF0Q0RQ4 


xxviiiiu bapicns 


riSXJK) I ij 


1 1/1 


3Z 


13302 


AF130079 


Homo sapiens 


PR02852 


161 


83 


1 lim 


V778£°. 
lZ/000 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


154 


76 


13304 


AF1 18086 


Homo *sanipn<; 




1ZU 


7^ 


13305 


P70494 


Homo sapiens 


Sequence of human B-celi growth 
factor (BCGF). 


100 


73 


13306 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


108 


56 



491 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waternian 


Identit 










Score 


y 


13307 


Y74110 


aauiixv ooxjicud 


fluid an prostate tumor to i Tragment 


A AO 

*r*tO 


l nn 








derived nmtpin #797 






13308 


G00588 


Homo saoiens 


Human QpprptpH nrntpin <!Pn 1PI "WrV 

AJ.UX1KUX deweieil JJlULCUJj OAwV^ AiVJ. 


R5 


6*\ i 








tUU7 . 






13309 


G03793 


Homo saoiens 


Human Sfif*rptftd nrntpin ^PO TO NO* 
aauxiicui dev^ietCvi JJ1 uieiu., JDy XLy 1 1 vv. 


104 


47 








7R74 






13310 


AF1 16715 


Homo sanipn*; 




194 

l-fc*T 


76 
/O 


13311 


GO 1246 


Homo sjinipns 


Human QPrrPtpH nrnf^in QPD TD WO* 

JXUlXIaJX oCuiClCU pIULCIU, OCy xU llw. 


149 


61 








5327. 






13312 


G02872 


Hnmn sanipn*: 


Human cpr-rMfvT nrrttpin CPA \Y\ "MO* 


1 *^*> 


61 








6953. 






13313 


AF1 16661 


Hnmo saniens 


PROT43X 


149 


67 
O / 


13314 


AF1 61356 


Hnmrt cjinipnc 
JULvixiivJ oaxjxeuo 


H^PPOO'i 


191 
1Z1 


HO 


13315 


G02532 


Tfnmn enni^nc 

1 L\JILXVJ oatJieilO 


rxuiiictix scuTCicu pruit*iu, ocy IL/ IX vy. 


1 AO 

ioy 


*^9 








uuu. 






13316 


Y86248 


1 auihv cxaplCiio 


xiuiiiaii bcurcLcu proiein ii\_»nr roo, 


Oft 


QO 








cpn TT)*NO*1^ 

OJCrV^ JU-/ INU.JvJ. 






13317 


AF 130079 


lJ.Ulli.vJ odJJJlCLLo 




1AQ 


jo 


13318 


vju i y j x 


XlUillvl octplClib 


(litman rp^rata^l nrAfom CCn Tl^ XFO* 

rtuinaa secreiea proretn, ocy iu iNvj. 




Qf\ 
OKI 














13319 


Y02999 


Hnmn oflnipn*i 

AAUliAv* OdAJXClAo 


rid^juciit ux iiuiiiaii sccicicu pruLcin 


o 1 


61 
01 








encoded hv ffene 121 

vuwuuwu v«jr gvllv lLli 






13320 


AF 130089 


Homo v*anien<i 


PRO2-550 


106 


77 


13321 


AF090931 

*vi \j y\j y ~j a 


Hnmn ^nipn^ 

A AVJiAXVS i3dJJIV^llO 


PRO0483 

JT 1\.vVtOJ 


141 


61 


13322 


D29833 


Hnmn c**mipn«j 

X AUXA1U d<X|JlVylAO 


piAjiuic ricn pepuue r **ir> 


*\0 


11 


13323 


D181 12 


XJ/vfTI r\ caniane 


iN/\L/ti ucnycirogcnase suDunii j 


A1 7 


/O 


13324 


AP000616 




bimiiar lo tsjiN vj*"ix& imgcr protein 


1 1 0 
1 11/ 


oZ 








RHA 1 n /'AF07Rfift^ 






13325 


Y02671 


Hr\mn csmipno 


Tahiti 5kn c ^*r*t*o't"o/H tirntoi n aiip/^HpH T\\/ 

rAuxiiaLi bcuicicti proLcin encoQco oy 


luo 


*a0 








gene uioue nivioj wio. 






13326 


A11018 

-TV-/ I vJO 




pom 


oo 


41 


13327 


AFOQOXQS 

.rvir V7V/07J 


Xi.UlUU bapiCXAo 


PDO01 17 


o / 






Y096Q1 

I VZU;7 J 


xlUluO SaplcIlS 


Ut1tM4*ll PA/«nktA/1 ffVT->AflA!n A«%AA/1n/l • 

jriuinan sccretea protein encoaea oy 


Q9 
OO 










ooriA /I /I r*1/MnA UTTiA Ti99 

gene *+*t cione o l YjcWjaa. 






13329 


G03797 

VJ V/-/ / y f 


Yfninn ciinipnc 
X1U111U bapiCUb 


T-Tiirnon c r» rf* t~<»/T nrntpin QT7/^ TTi ^T^**l* 

nujiian bcLicicu proiein, ocy il/ in w. 


101 


6/? 
OO 








7X7R 






13330 


AL3901 14 


i^Louiiiauia 


rii*r>T*»***T*»T^" /TiTiv^^i till 09 v*»T*ian<* si 


101 


67 
0/ 






mainr 

AAltXJ W 


T^rVjT A rnmnlpfp ym rirvn frpnnm r» 






13331 


AF090930 


T-Ip»rn rk cnniAnc 
a iVJixxu aajjieilo 


A IVvy vr*r / O 


01 


Ifi 
JO 


13332 


AF 159055 

-*VA X-JySJJJ 


AALHXIIJ OtlXJXC-LIJj 


xcucme zapijci — ixxvc pi <j lexxi 


119 

1 lo 


71 


13311 


V09671 

I vXO / 1 


riuiiiu seipiciis 


jtiumau secrcieu protein encoaea oy 


SO 


J/ 








gene "wiiAiie xiavioj w 10. 






13334 


G02532 


Hnmn sanipnt 

XXXJXXXV 0€XIJI\M9 


Human «;prretftd nrntpin SFO ID NO* 
a alu xx on det«ieieu pjvrLvixi 5 oa_*v^ xxj iiv. 


191 


ou 








6613 






13335 


G02532 


Hnmn sfinim*; 

XX\JILX\M OCUvXVsiAD 


Human secreted nrotr**in SFO TD NO* 

AlUlllttXJ Oe^XwlAvU JJX Wl^IXL) Ouy AAV llV/. 


160 

lwV 


71 

ID 








6613. 






13336 


AJ005562 


A/fii« rmisi**ii1ii*£ 




f?6 

OVI 


44 


13337 


Y97K6R 


TTr*mrt coniAnc 
xlUimj SapicUS j 


ITnman co^roton t\r/>tam on/^A/io/i V%i t 

ii ui i bcoretcu proiein encoaea oy 


ioy 


OO 








opnp Nn 1 07 
gene J^l tr. J v / . 






13338 


G00528 


Homo QaniPTiQ 

liULUU OtlLJX&XAo 


Human *?prrptftd nrntpin SRO TH NO* 

AAIAXJIOLXJ. ClCvl CICU JJAULeillj JDy AA_/ 


7X 


S7 








4609 






13339 


Y08769 


JXAltvio 


TTiii'TrtvaQr'tilat* ^ndntTiAlial 
AlliwUVaoCLllal ClIlAiJLilCUai 


109 
1UZ 


01 






norvpp'inic 


dt^FpnpnHaHnn crprip 9 

UMJtVl eiAUCIUv/Il J^CIXC A 






13340 


AF090X95 


Hnmn coniAnc 
JlxULUu oapiCllb 


PPO01 17 

xxWJKr III 


117 


**/ 


13341 


K01664 


Drosophila 


Bkm-like protein 


115 


46 






melanogaster 








13342 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


76 








7871. 






13343 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


134 


65 



492 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Ideotit 

y 








77QS 1 






13344 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
proiein raa i a. 


76 


76 




Va£9A£ 

I OOZ-to 


xiomo sapiens 


xiuman secreiea proiein xi^Mrroo, 
SEQ IDNO:163. 


1 1 1 
111 


77 i 




nnn^7 


xiomo sapiens 


Unman cc*r>rc*tf*A nrntpin QPH TD MO* 

xiuman secreiea proiem, odv^ iu nvj. 
4633. 




£0 

%jy 


111A7 


/vr l lyyuu 


jTomo sapiens 




OI 


JO 


111AJ* 




Homo sapiens 


Unman censor at c±A r\rr\tc*\n CCA JT\ "MTV 

xiuman secreieu proiein, oc-y iu in w. 
7795. 


1£0 

ioy 


7^ 


1 11AQ 

1 JJ*r7 




xiomo sapiens 


xiuman oreasi cancer reidieu pi mem 
BCFLT2. 








API 1£71 ^ 


xiomo sapiens 




11A 

1 JH 


£A 

Ot- 


13351 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121 . 


115 


65 


1 11^7 


nni7i/t 


— — - : 

Homo sapiens 


xiuman secreiea proiein, ^cy iu inlj. 
7795. 


J 1 o 


so 




V07RR7 
I UZoiS / 


xiomo sapiens 


r ragmen i oi numan secreiea protein 

An/^/v/loyi r>i / 4>AnA Oft 

encouea uy gene yv. 


1 10 


77 • 


111SA 


AF1 1£££1 


T-I/"\mr% con? An c 
XIUIIIU bapicIlS 


PPOI AIR 

ST lS\J 1*t JO 


1 19 


7S 






riomo sapiens 


Unman cf*r*rf±te*s\ nmtoin QPO TFi MO* 

xiuman secreiea proiem, ocy n/ inw. 
4488. 


17fl 


71 


111<*£ 


Api 7£1 £1 


xiomo sapiens 


xixil/\j protein 


1 1£ 




13357 


U18339 


Variola virus 


D4L 


101 


58 


1 11^2 


A 171 QOACQ 

Ar i juUoy 


Homo sapiens 


XTOJZjjU 


1 77 




13359 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

0013. 


184 


82 


13360 


Y02926 


Homo sapiens 


Fragment of human secreted protein 
encooeo oy gene iui. 


78 


50 


13361 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO; 

AA7ft 


126 


71 


13362 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
££ii 


174 


76 


13363 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


176 


51 


13364 


U12390 


Cloning vector 
popom 


beta-galactosidase alpha peptide 


99 


41 


13365 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

££11 


118 


56 


111££ 


ttioioh 


— : 

Cloning vector 
pSportl 


beta-galactosidase alpha peptide 




sn 


111£7 

13 JO / 


V1QA£7 


xiomo sapiens 


/irnino aciu sequence ui a nuiudu 
dvUlClCU piuicui. 




£A 


13368 


M15530 


Homo sapiens 


B-cell growth factor 


132 


77 


1JJU7 4 


VIvJ / It 


XIUIIIU oapivuo 


T-Tnman c^rn>tf>H nrntf>m ^FO TD MO* 
XlUniaU SCLrlClCU piUlCUI, ODy 1L/ IN V_/. 

7795. 


117 


6^ 




AF1infY7Q 


XlULUU oapiCUd 




1 vo 


61 


13371 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
ciunc n^AUJoi. 


150 


76 


13372 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


117 


65 




VK£7AR 
I oOx^f o 


T-Irtm f\ CQntAnc 

notnu oapienb 


xiuuian secreieu proiein n^rxrTOo, 
WO Tr> >JO-1£1 

OGy LL7 XiV/.lOj. 


1 ix 

1 Id 


S7 


13374 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO - 163 


121 


63 


13375 


AF1 18086 


Homo sapiens 


PR01992 


147 


74 


13376 


AF130089 


Homo sapiens 


PRO2550 


149 


48 


13377 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


123 


55 



493 
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SEQ10 

MA. 


Accession No. 


Species 


Description 


Smith- 

Watermao 

Score 


% 

Identit 

y 


13378 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


120 


53 


13379 


AF090894 


Homo sapiens 


PRO0113 


101 


52 


13380 


AF 107406 


Homo sapiens 


GW128 






13381 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
eene 22 clone HMSJW18 


156 


84 


13382 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


119 


80 


13383 


Y14437 


Homo sapiens 


Human secreted protein encoded by 
ffene 27 clone HSA WA27 


84 


70 


13384 


S79410 


Mus musculus 


nuclear localization signal binding 

X/l V/lvlll 


107 


53 


13385 


S79410 


Mus musculus 


nuclear localization signal binding 
nmtein 


107 


53 


13386 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870 


141 


65 


13387 


G03800 


11UU4U vXlJJlvHO 


Human ^f*rTf*ti*H rtrnf ^in QPO m XJO* 
ixuiucui j>cuicicu. pruicui, ady 1L/ xNVy. 

7881. 


1 1 J 


Oj 


13388 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
eene 22 clone HMSJW1 8 

gcut* Z.^. UIVJX1C/ X IXVXd J w 1 o. 


155 


77 


13389 


AF 130089 


Homo *5anten^ 


PRO2550 


1 so 


OZ 


13390 


AF1 61356 


Homo sapiens 


HSPC093 


1 54 

1 J*T 




13391 


AC006135 


Arabidoosfc 
thaliana 


nntative vifiilin tforapf* nrotpin 

L/UlClll V V/ V 1V111X1 Ucw UlUlviil 

(globulin-like) 


171 
l / 1 


Zo 


13392 


AF1 13685 


Homo sapiens 


PRO0974 


226 




13393 


AF1 18082 


Homo sapiens 


PRO1902 


69 


68 


13394 


D23661 


Homo sanien*? 


riVwwvnfll twrvtein T ^7 

1 ILfWDvlXlCU |yl VJLvlll 1 i J / 


4QR 


Q6 


13395 


AF1 30079 


Homo ^ani<*n^ 


PRO? 85? 


mo 
ivy 




13396 


AF1 19851 


Homo sapiens 


PR01722 


107 


57 


13397 


G02538 


Homo <iflnipn^ 


Hitman Qf»r*rptprt nrntpin ID MO* 

6619. 




05 


13398 


AF130089 


Homo saniens 


PRO2550 




6& 


13399 


G03789 


Homo ^anten<» 


Human cAprptft/i nrofpin ^PO TH NO* 

XXLUllall dGWlvlCU L/l \JVGHlj OCy lXJ liVy. 

7870. 


114 


S4 


13400 


J03798 


Homo saniens 


small nuclear rihonrotein Sm-D 


HJ 1 


yj 


13401 


D29833 


Homo saniens 


nrolinf* rich oentide P-T*t 

1X1 Villi v llvll L/wUUUv X U 


64 




13402 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


99 


76 


13403 


G03790 


Homo saniens 


Human secreted nrotein RFO TO NO* 

xiiuuaix g^wiviw j/lWwlHy JJJy XL/ i~ V^. 

7871. 


ISO 


76 


13404 


AF090894 


Homo saniens* 


PRO01 13 

1 lwv 11J 


is? 


7A 


13405 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal clone XFTT 8cl 

Ibl UlUlCUj Vll/llW /VLillaOvJ 


101 


45 


13406 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


172 


72 


13407 


G00407 ■ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


135 


84 


13408 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


125 


63 


13409 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


138 


58 


13410 


AF130089 


Homo sapiens 


PRO2550 


101 


70 


13411 


AK024455 


Homo saoiens 


FLJ00047 orotein 


112 


73 


13412 


G02532 


Homo sanipn*; 


Human <5P-r*t**»tf"*r1 nrntpfn **?FO IT) NO* 

XXU1LKU1 JtUl tlCU jJl VJ LClllj OX->%J IX-/ 1 1 v_y . 

6613. 




4R 


13413 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


95 


50 


13414 


AF090942 


Homo sapiens 


PRO0657 


171 


72 


13415 


S58722 


Homo sapiens 


X-linked retinopathy protein {O 
terminal, clone XEH.8c} 


120 


64 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Sraitb- 

Waterman 

Score 


% 

Identit 

y 


13416 


M15530 


Homo sapiens 


B-cell growth factor 


121 


45 


1 1A 1*7 
J 341 / 


nmAan 

uux4yu 


Homo sapiens 


Human secreted protein, SbQ ID NO: 
6571. 


115 




MA 1 9 




Homo sapiens 


prostate-specific transglutaminase 


1 AH 

140 


/o 


13419 


Y01181 


Homo sapiens 


Polypeptide fragment encoded by gene 
12. 


85 


64 


1 1A1(\ 
1 J42U 


\s\}5 1 14 


Homo sapiens 


Human secreted protein, ob(^ lu nu. 
7795. 


11 


73 


13421 




Homo sapiens 


Human secreted protein, oEQ ID NU. 
0619. 


1 A O 

149 


62 


13422 


AF130089 


Homo sapiens 


PRO2550 


111 


39 


13423 


f*i\1A 1 1 

U034H 


Homo sapiens 


Human secreted proteui, 2>EQ ID NO: 

*"M AO 

/492. 




42 


13424 


AF I 19882 


Homo sapiens 


PR02492 


110 


70 


13425 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


104 


59 


13426 


AF 1 18082 


Homo sapiens 


PRO 1 902 


ion 

129 


/I o 

48 


13427 


X70771 


Chironomus 
tentans 


Spl7 


95 


56 


13428 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


164 


66 


13429 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


149 


80 


13430 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:lo3. 


103 


63 


13431 


AF1 16661 


Homo sapiens 


PR01438 


128 


54 


13432 


A VtV^A ACC 

AK024455 


Homo sapiens 


FU00O47 protein 


117 


69 


13433 


VI 1 1 A 1 

Y13141 


Bromheadia 
finlaysoniana 


extensin 


91 


40 


13434 


U037 14 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
//9j. 


118 


£.A 

64 


13435 


AF090944 


Homo sapiens 


PRO0663 


115 


50 


13436 


X5j08/ 


Lycopersicon 
esculentum 


extensin (class II) 


/3 


32 


1343/ 


Ar 090942 


Homo sapiens 


PRO0657 


77 


50 


1 1A1Q 

13438 


ArOOOOol 


Aeropyrum 
pemix 


104aa long hypothetical protein 


82 


A 1 

41 


1 1 A1f\ 

13439 


U00481 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4562. 


Ail 

94 


51 


13440 


AF161356 


Homo sapiens 


HSPC093 


11C 

115 


53 


1 1AA 1 

13441 


V*>*7fl£0 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


1Z5 


44 


13442 


I jy /54 


Homo sapiens 


Human normal ovarian tissue derived 
protem 61. 




1 AA 

1UU 


U44J 


TT17'\/I'7 
U3334/ 


Homo sapiens 


MHC class II antigen 




15 


13444 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
oJEQ ID NU.103. 


106 


71 


13445 


W54966 


Homo sapiens 


Synthetic human type HI collagen 
cvxt r*i 


115 


34 


1 *2/M£ 
13440 


UU2256 


Homo sapiens 


Human secreted protein, i>b,<± lu nu: 
6337. 


486 


1 A A 
100 


1344/ 


UU2j32 


Homo sapiens 


Human secreted proteui, oEQ ID NU: 
ool3. 


Ijj 


60 


13448 


AF130079 


Homo sapiens 


PR02852 


167 


56 






fascicularis 


nypotneticai protein 


T71 
1/1 


02 


13450 


AK025326 


Homo sapiens 


unnamed protein product 


ill 


67 


13451 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


93 


76 
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SEQID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


13452 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

6*74 
03/4. 


109 


91 


13453 


U65650 


AVI aUJUUL/jio 

thaliana 

UICUICUJCI 


oiuc-v-opper uiiiging proiem iu. 




I 4 ? 
jj 


13454 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6611 


123 


69 


13455 


AF090894 


Homo sapiens 


PRO0113 


122 


56 




lyJOJOJ 


nouio sapiens 


TCP 77 


71 A 1 
/ IK) 


QO 

yy 


11457 


Y07Q99 


Wr»m/\ ca ntf>n c 
I lUlllO SilJJiCHIj 


rnigioenL oi nunian sclicicu proiem 
encoded by gene 121. 


\ 1 1 
1 lj 


43 


13458 


AJJUOU 


P^fnlpntiim 
vavUldlllUIl 


Avtpnrin ( c*\ race 1 1 \ 

exiensin vciass ii ) 


OH 


CI 

J 1 


13459 


AF107406 


Homo sapiens 


GW128 


162 


54 




I uzo / 1 


xiomo sapiens 


Human secreieo protein encoded oy 
gene 22 clone HMSJW18. 


14j 


OJ 


1 j*tVj 1 


AOlvHJ 


XiyuTa Sp. 


mini -coiiagen 




7/1 
j4 




Oft7ftl9 


nomo sapiens 


xiurnan secreted protein, ocy iu inij. 
6913. 


1 1A 
1 JU 


40 


11461 


VI 507ft 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 2 1 9. 


loy 




11464 


T ,4766 ft 
L»4 /OOo 


Homo sapiens 


alpha-2 collagen type I 


04 


jy 


13465 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


155 


65 


13466 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene xso. iu/. 


159 


76 


13467 


Y14437 


Homo sapiens 


Human secreted protein encoded by 

rro.no 77 <*lnno TIC A \17 A7T 

gene z/ cione noAWAZf. 


109 


70 


1146ft 
1J40O 


r»fY>^lR 
VJUZjjS 


riomo sapiens 


xiuman secreted protein, ocvj uj inu. 
6619. 


144 


oy 


11460 




Homo sapiens 


Hitman <»As»r-A+£trl r\m+£*ln CCA TT~\ VT/^\» 

Human secreted protein, otivj iJJ wu. 

ft 1 57 






13470 


M15530 


Homo sapiens 


B-cell growth factor 


144 


57 


1 1471 




Homo sapiens 


Jriurnan secreted proiem, ocV£ iv raj. 

7ftft1 


1 1 £ 
1 10 


OJ 


13472 


V00488 


Homo sapiens 


alpha globin 


704 


96 


114T* 


/YJUlZlOO 


Canis familiaris 


brain-specific synapse associated 
pruLcin, Ddbbuuii 


JO 


45 


11474 


AF1 16661 


Xiomo sapiens 


PPfTl 47 ft 
rSWJ it jo 


17ft 


4j 


13475 


G01828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


99 


78 


11476 




Homo sapiens 


alpha globin 


779 


y l 


13477 


W65762 


Homo sapiens 


Human HDCAP protein. 


490 


98 


13478 


AB033767 


Homo sapiens 


brain-selective and closely mapped on 
me counter aneie oi v>ivi/\Jr in cy statin 

nlnct^r 


2184 


97 


13479 


AB011767 


IJfvm r\ csini'pnc 

riuiiiu bapicub 


Kfsif n_cp1/*/*ti\/f* a n rl r* 1 ACf* I v msinnAH r\n 
Uiaiii-bcicuii vc oil vi kyiuodjr uiappcu uii 

tfip r nn n fpr ?i Hp Ip of f^MAP in cv^tsitin 

cluster 


OI OA 

Z 1 OH 


07 

y 1 


13480 


Y42382 




Am in a £if*in ip/iiipupp at ftfl17 1 1 


Ul J 




13481 


D29833 


Homo sapiens 


proline rich peptide P-B 


61 


58 


134X? 


AlCft71?77 


nomo sapiens 


uiuiaincQ proiem prouuci 


*tOO 


Oft 


13483 


X67703 


Drosophila 
me lanogasier 


Mst84Da 


53 


60 


1l4ft4 


I z iy\j I 


nonio sapiens 


tiuman secreted proiem encooeo oy 
eeneNo 119 


1 


34 


13485 


AF210350 


Sceloporus 
siniferus 


NADH dehydrogenase subunit 4 


149 


79 


13486 


Y14544 


Danio rerio 


Hoxc8 protein 


98 


36 


13487 


AP002032 


Arabidopsis 




111 


27 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


MA. 








Waterman 


Ideottt 










Score 


y 






thaliana 








13488 


AB047846 


Homo sapiens 


gammal-COP 


4463 


100 


13489 


M37194 


Rattus 


clathrin- associated protein 17 


739 


100 






norvegicus 








13490 


Y59743 


Homo sapiens 


Human normal ovarian tissue derived 


138 


96 








protein 20. 






13491 


U0I317 


Homo sapiens 


G-gamma globin 


198 


90 


13492 


W04180 


Homo sapiens 


Cellubrevin-1. 


459 


98 


13493 


AL121585 


Homo sapiens 


bA504H3.1 (SNX5 (sorting nexin 5)) 


2058 


100 


13494 


X53777 


Homo sapiens 


putative ribosomal protein (AA 1-184) 


970 


100 


13495 


AJ271881 


Homo sapiens 


bromodomain containing protein 


3378 


99 


13496 


AF132970 


Homo sapiens 


CGI-36 protein 


878 


100 


13497 


V73498 cdl 


Homo sapiens 


21-AUG-1997 Human S100P1 DNA» 


273 


100 


13498 


AL121585 


Homo sapiens 


bA504H3. 1 (SNX5 (sorting nexin 5)) 


2058 


100 


13499 


X82456 


Homo ^anien<i 


L.IM and domain nrotein 


1420 


100 


13500 


AJ277932 


Homo saniens 


RPB1 la Drotein 


588 


98 


13501 


Y53569 


Homo saniens 


Consensus seauence for human RAD1 


633 


84 








binding proteins. 






13502 


W73434 


HnmA cstniAnc 


Hitman cpfrp+Arl nrntp in pnrrirf <=»/"" K\/ 
riuiii.au acticicu jjiuicjui cui*uucu *jy 




ion 








Gene No. 2. 






13503 


AJ005981 


Sus scrofa 


c AMP-regu lated phosphoprotein 


577 


100 


13504 


AF116715 


Homo sapiens 


PR02829 


143 


65 


13505 


U97553 


murid 


unknown 


84 


35 






herpesvirus 4 








13506 


X06323 


Homo sapiens 


put. ribosomal protein L3 (AA 1 - 348) 


1903 


100 


13507 


W82397 


Homo sapiens 


Human UBP protein #3. 


4264 


99 


13508 


M86246 


Homo sapiens 


EHS-2 


110 


71 


13509 


Y02671 


Homo saniens 


Human secreted nrotein encoded Hv 


121 


50 








gene 22 clone HMS J Wl 8. 






13510 


AF073519 


Homo saniens 


small EDRK-rich factor 1 lon« i^oforrn 


91 


44 


13511 


G03812 


Homo saniens 


Human secreted nrotein SEO ID NO* 


591 


100 








7893. 






13512 


Y04369 


Homo saniens 


Human HUCEP-14 nrotein 


1389 


100 


13513 


G03714 


Homo saniens 


Human secreted orotein SEO ID NO* 


136 


60 








7795. 






13514 


AE000232 


Escherichia coli 


orf hvnothetieal nrotein 


1480 


100 






K12 








13515 


X76717 


Homo sapiens 


MT-11 protein 


382 


100 


13516 


Y00919 


Homo saniens 


Human Rab nrotein R AHP-2 nrotein 

IIUIUUU lull/ L/lUlvul] 1VTV1/1 X>) UllilwUl 


1039 


100 








seauence 






13517 


G03714 


Homo sapiens 


Human secreted nrotein SEO ID NO* 


136 


67 








7795. 






13518 


AF090895 


Homo sapiens 


PRO0117 


109 


56 


13519 


G03714 


Homo saniens 


Human secreted nrotein. SEO ID NO* 


109 


65 








7795. 






13520 


G02538 


Homo saoiens 


Human secreted nrotein SEO ID NO* 


145 


71 








6619. 






13521 


G00577 


Homo saoiens 


Human secreted Drotein. SEO ID NO* 


93 


60 








4658. 






13522 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


140 


72 








6619. 






13523 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


130 


73 








8153. 






13524 


M15530 


Homo sapiens 


B-cell erowth factor 


112 


67 


13525 


AF1 19851 


Homo sapiens 


PRO 1722 


135 


57 


13526 


G00541 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


98 


82 








4622. 






13527 


W48351 


Homo sapiens 


Human breast cancer related protein 


117 


41 








BCRB2. 







497 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


13528 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


105 


100 


13529 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene ino. ivi* 


79 


78 


13530 


AJ000536 


Arabidopsis 
thaliana 


COP1 protein 


98 


32 


ljjj i 


UJvJ*frU-5 


ttomo sapiens 


orcast cducer suppressor clement 
Ishmael Upper RP2 


1 Ul 


oz 


1 J D 3JL 




Homo sapiens 


Hitman ct*r+ratoiA nrntoin CCA "XT/"^* 

numan secrete a proiein, ocki ijj inw. 
4433. 




JO 


IJDJj 


LioKj! ly 


Homo sapiens 


nistone rLZr> 


OZo 


1UU 


13534 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/ /Vj. 


151 


62 


13535 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


82 


63 




VA 1 1 ^ R i 
I U 1 I jo 


Homo sapiens 


oecreiea protein encoaea oy gene 1 o 
cione iiL//vL~Joi. 


yj 


JO 


13537 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein ! 

JXtlAln (ArU foOoj) 


139 


81 


13538 


AF130051 


Homo sapiens 


PRO0898 


90 


53 




Y1 iCQIO 


Homo sapiens 


preprocatnepsin H (AA -zz to 3 14) 


loi5 


no 
yy 


13540 


Y02671 


Homo sapiens 


Human secreted protein encoded by i 
gene zz clone HMoJ w lo. 


100 


80 


13541 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
oolJ. 


131 


82 


1 JJH-Z 


uUU4jZ 


Homo sapiens 


Human secretea protein, ofcQ ID NU. 

4jJ3. 


94 




I JJ*fJ 


WooOz/ 


— : 

Homo sapiens 


Secreted protein encoded by gene 94 
cione ririviDi^JZ. 


141 


oO 


13544 


AF130089 


Homo sapiens 


PRO2550 


107 


72 






riomo sapiens 


miiocnonoriai ninge protein precursor 




CO 


13546 


U63810 


Homo sapiens 


WD40 protein Ciao 1 


505 


100 


13547 


AF090931 


Homo sapiens 


PRO0483 


126 


73 


1 1 CAO 

1J545 


V/ 1 A A O'** 

Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


143 


65 


13549 


U28971 


Caenorhabditis 
elegans 


similar to RD tandem repeat region of 
RD protein, (nuclear ma-binding 
protein ) 


82 


62 


13550 


AF1 13685 


Homo sapiens 


PRO0974 


126 


48 


1 jjj l 




Homo sapiens 


Jtiuman secretea protein, ocki ixj inhj. 

JOJH. 






13552 


AFl 16661 


Homo sapiens 


PR01438 


109 


69 




VJUJ l /z 


Homo sapiens 


Unman ca/trotaH nrAtoin Q tj/^ TT^k "KT/"^* 

Human secreted protein, oeAsi ijj invj. 
7253. 


1 Al 
1U 1 


OJ 


1 0 JJH 


AlOoJZ 


riomo sapiens 


preprocainepsin rt ^/\/\ -zz to a i*tj 


1 8^^ 
loJJ 


QO 

yy 


13555 


Y73435 


Homo sapiens 


Human secreted protein clone yd73_l 
protein sequence ocy ili invj.jjz. 


528 


84 


13556 


G00352 


Homo sapiens 


Human secreted protein, SEQ ED NO: 


126 


65 


13557 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoaea oy gene d. 


103 


67 


ijjjfi 




nOmo sapiens 


riuman secretea proiein, ocy iu iNvy. 
7174. 


180 


I J 


1JJJ7 




a f*At%iAnr 

riomo sapiens 


Jtiuman secretea protein, oe.ki uj jnu. 
6619. 


1 J 1 




13560 


AFl 1 8086 


Homo sapiens 


PR01992 


157 


71 


13561 


Y00320 


Homo sapiens 


Human secreted protein encoded by 
gene 64. 


101 


64 


13562 


AFl 59055 


Homo sapiens 


leucine zipper-like protein 


119 


74 



498 



WO 01/64835 



PCT/USO 1/04927 



SEQID 


Accession No. 


Species 


Description 


Smith- 




NO: 








Waterman 


Identit 










Score 


y 


13563 


AF 1 18082 


Homo <5anipn^ 


PRO 1902 


83 


ou 


13564 


GO 1246 


T-Tr\m/"» csniAnC 


Human Qf*rr/*tf*r\ nrntp'tn QPO TO MA* 




07 








5327. 






13565 


Y59778 

I J -r I/O 


iTumo oapieuo 


Hitman nArmal Avonan ficciiA /f^Wi/jarl 

ri urn an noiiiiai ovarian ubsuc ucnvcu 


oo 


IOA 








orofpin S5 






13566 


GO 1828 


Homo aniens 

XXUXXXVr duillvlU 


Human <zp.r.re,t&d nrotein SFO TD NO* 

XXIXIXIOXX OwJ wlWU L/lVlwXXlj UlvV^ 1 1 ■/ 11V. 


103 


vv 








5909 

J7U7. 






13567 


Y86248 


Horrifi Q3nipn< 

Ilvlllv OdL/lCllO 


Hitman <?ecrefen' nrotpin HPHPF6R 

XX Ul 11 OX I OwJ vlCU LSlULGXXl X XV^X XX X l»Oj 












SFO ID NO* 163 






13568 


X83703 


Homo ^anff*n^ 


mirlpar nrotein 

XitXvlwOl LsiULwXXX 


lUJJ 


00 


13569 


G02473 


Hnrnn cjinipnc 

1 11/1111/ OdpiCllO 


XX III Hall dCUClCU piULClil, ODy iU IN 


749 










6SS4 






13570 


G00521 


xlUillvl bapivlio 


Human cprrptpH nrntptn <JPO TO NfV 

ouiiid.li bcuicicu piuicin, ocy iu inv/. 


in*? 


00 








4602 






13571 


AP000616 


Orvza estiva 


Qtmilsn* tf\ RrWfr— H9 fino^r* MY\tf*in 
o 1111 Hal IV* IVlli VJ HZ. llll^Cl piLfLClxl 


1 10 
1 lv 


oU 








RHA 1 a rAF07R68^ 

xvxx/A l a V** ' OVJO j j 






13572 


AP000616 


yjiyiAX ban vd 


biiiiiidi iu r\aii vj~ixz linger proicm 


1 14 


4A 








RHA 1 a TAF078683 1 

xvxxrvx <x \^cik.i \j / ouoj J 






13573 


X92744 


Homo ^anien^ 




990 




13574 


P70494 


HATtirt canipnc 

XXUIXIU OOXJlCllO 


oci|uciicc ui u uiii ou jy-ccu ^i Livvm 


107 

11/ / 


/CO 

05 








fector f RCGFi 

XCl^LV/l ^J_> V^VJ 1 J. 






13575 


AF 107456 


axuiihj injjiciib 


GW19R 

VJ W 1 Z-O 


J JO 


i no 
lull 


13576 


W03642 


T Tom r\ car\if»riQ 
iiuiuu japiciio 


XX U I n oil uaiiiiaL/liluiu VJx xv IN-lciniUlal 


101 


4R 








Qfvnipnpp 

dC%JUvxlV^C 






13577 


AF220264 


Homo *?anipn<i 


i» xv/o i. x 




71 


13578 


Y86248 


Homo ^anipn*; 

XXV/1X1 vf oaj71vsllo 


Human Qprretpo 1 nrnfpin HPHPF/tR 
x xuiiioLi acuiwicu pivii^ixi iiviir ruO| 


199 


oo 








SEO ID NO* 163 






13579 


AKQ24455 


Homo saoiens 


FU00047 nrotein 


81 


76 


13580 


AF130051 


Homo saniens 


PRO0898 


128 


64 


13581 


AF084256 


Homo *iaoien<% 


lipta cltiraitmniflaci* iQirform fl 
ucia giuuiu uuiucuv lovi-wixlx \x 


149 


JO 


13582 


AF1 18082 


Homo <5anien^ 


PRO 1902 


R5 


<i 
j i 


13583 


Y02886 


Homo ^anif»n<: 


Fraompnt Jinmfln cp/"fiptpf) nrnfpin 
x^i agixxGjJi L/i liui li ail dCvicicu piuiciii 


1 14 
1 l*r 


S9 








pnmHpH Kv optip 00 

vllwUCU Ujr gCllC 7V1. 






13584 


G02455 


Horrirt Q^nipnc 
iivJUivj oajJiciio 


Human cp/TPt^H nrntpin QPO TFi NO* 
XX Ui 11 all bCUCLCU pi ULC111, O-CV^ lX-/ 


OO 


^4 








6536. 






13585 


X52164 


iuUo JUJLUjVUILLj 




10Q 

ivy 


^4 
o*f 


13586 


AL 3 59782 

i» UJJ7 / 


x i jrucuiuMJiiici 


pUbblUlc ^IlflV-Uj ill L\34.y VaXlailL a Quag 


mo 


79 






finirpi 


r*/^m r\1 p»t<* vrri/vn opnAmp 

vi/mpicu/ viric»n gcnvnit?. 






13587 


AF090901 


Hom/> Q^nipnc 

I1UU1U JKXpiCllo 


PRO010S 


1 14 
1 l*f 




13538 


G00397 


H/\tyio cflniAnc 
xxvlxxt* oapicixo 


riuniau od/icicci proicm, oek£ iu lskj. 




OO 








4478. 






13589 


G03356 


Homo saoiens 


Human QpcrpffvJ nrotein SFO ID NO* 


117 


sx 

JO 








7437. 






13590 


AK000521 


Homo saoiens 


unnamed protein product 


1313 


100 


13591 


AL357374 


Homo saoiens 


bA353C18^ fnovel nrotein i 


421 


100 


13592 


AL357374 


Homo saniens 


bA^SSClft 2 Aiovel nrotein i 


421 


100 

1 V»Vr 


13593 


G03786 


Homo ^anipns 

xxuxxivs oa^i&iu 


Human Qpprptpri nrotpin SFO TTi NO* 


139 

1J7 


69 








7867 






13594 


Y02749 


Homo ^anipriQ 
xxuxxivi oaxjxcxi;> 


Human cpt rptprl nrntpin pnrnHpH Wu 
XX 1X1 11 Oil bCTwl CICLl Ul UlC 111 CLLIL'LILICU XJy 


1UJ 


SA 
JO 








pene 1 00 clone HNFTTTQ6 

gCliw IV/vf CflUxlTw x XX lX 1U7U. 






13595 


U 16359 


Raltus 


nitrip oviHp cvntHac^ 
IXXlilV* V/A1U.C oYxxixia2»G 


100 
ivy 


7/\ 
/O 






n OTVPfyi riic 

X1V7A TvglwliJ 








13596 


AP000616 


Orvza <rativa 


*?imil/ir to RTNfr-H7 finopr nrotpin 

OXXXIXIOX IVX XVIXN VJ 1 XX 1 1 1 IQtt. pxuixyixi 


197 
iz. / 


oo 








R HA 1 a fAFO/78fiRl\ 






13597 


G00325 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


275 


100 








4406. 






13598 


W03642 


Homo sapiens 


Human cannabinoid GPRN-terminal 


108 


63 








sequence. 






13599 


AF1 16661 


Homo sapiens 


PR01438 


128 


64 



499 



WO 01/64835 



PCT/US01/04927 



SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


13600 


X55682 


Lycopersicon 
escuienxuiii 


extensin (class I) 


58 


43 


13601 


X52164 


Mus musculus 


Q300 protein (AA 1-77) 


106 


45 


11607 




v^aenorn aooins 


similar to RD tandem repeat region of 
jsxj protein ^nuciear rna-Dinaiag 
protein) 


I oo 


92 


n60^ 


AF1d.Qd.1Q 


wrycioiagus 
cuniculus 


eye sodium bicarbonate cotransport 
protein NBC2 


1 OA 
I/O 


75 


11604. 




tvius muscuius 


orKZl protein 


til 


40 


13605 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

to OO. 


150 


96 


13606 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

*71AA 
f J44. 


121 


60 




VJUZoO/ 


nonio sapiens 


riuman secreted protein, msq iu nu. 




60 


13608 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
l*VyJL. 


57 


35 


13609 


AF1 07406 


Homo sapiens 


GW128 


108 


42 


1 JO IV/ 




Homo sapiens 


Nd thymosin beta 


167 


100 


13611 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
/ /SO. 


161 


59 


13612 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

OPA TT\ VT/*V 1 i?"* 


125 


50 


13613 


Y32193 


Homo sapiens 


Human receptor molecule (REC) 
encoded by Incyte clone 044150. 


490 


80 


11614 


I OOZ*K> 


Homo sapiens 


Human secreted protein riCHProo, 
SEQ ID NO: 163. 


lie 

115 


57 


1161 * 

I jO 1 D 


WoU40o 


Homo sapiens 


A secreted protein encoded by clone 
dh40 3. 


124 


58 


1 JOlO 


a ci i oonn 


Homo sapiens 


PR02822 


95 


53 


UOl / 


WoU4U0 


Homo sapiens 


A secreted protein encoded by clone 
dh40J. 


124 


58 


1 l£1fc 




Homo sapiens 


prion protein, PrP {octapeptide 
repeats} 


88 


42 




A POOf*OA/1 


Homo sapiens 


IV /(APT 1 


130 


81 


13620 


U16359 


Rattus 
norvegicus 


nitric oxide synthase 


95 


65 


UOZ1 


AT 1lO*M1 


uaenornaoaius 
elegans 


Y15b3A3 


168 


85 






Drosophila 
meianogaster 


Bkm-like protein 


1 1 o 


65 


13623 


M86246 


Homo sapiens 


EHS-2 


100 


63 


13624 




nomo sapiens 


riii in an secreteu protein xiv^xix roo, 
SEQ ID NO: 163. 




Afi 
HO 


1369S 




xiomo sapiens 


vesicle transport related protein 


1 OQ 


/CO 

oy 


13626 


U39529 


Echinometra 
mauiaei 


bindin 


72 


47 


13627 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


102 


52 


13628 


W8Q4ft6 


I-Tatti r\ ccirxi An c 
JTUJilO SaplCXu 


j\ secreted protein encoded oy uione 
dh40 3. 




34 


13629 


icoi 66d 


L/roaopnLia 
meianogasier 


DI/wm Ills' A v%^/\fain 

jDKm-iuce protein 


111 




136^0 




numo sapiens 


llorLUj 1 


^co 


1UU 


13631 


Y27854 


Homo sapiens 


Human secreted nrotein encoded bv 
gene No. 101. 


158 


87 


13632 . 


AF130089 


Homo sapiens 


PRO2550 


105 


66 


13633 


AF1 16661 


Homo sapiens 


PR01438 


116 


53 


13634 


AF116715 


Homo sapiens 


PR02829 


101 


60 



500 



WO 01/64835 PCT/US01/04927 



SEQID 
INU: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


13635 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


139 

i J y 




13636 


U33547 


Homo sapiens 


MHC class II antigen 


154 


79 


13637 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


107 

Ivf 




13638 


AF090895 


Homo sapiens 


PRO0117 


87 


62 


13639 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


131 


69 


13640 


AF026689 


Homo sapiens 


Drostate-^necific tran^plutanriinasp 


128 


60 


13641 


S58722 


Homo sauiens 


X-1 inked retinonathv nrotein /P- 
terminal, clone XEH.8c} 




O 1 


13642 


AC003058 


Arabidopsis 
thaliana 


unknown nrotein 


210 


67 


13643 


D38112 


Homo sapiens 


NADH dehydrogenase suhunit 4T. 


287 


76 ! 


13644 


Y73483 


Homo sapiens 


Human secreted protein clone yi!8_l 
nrotein seouence SEO ID NO' 1 88 


376 


88 


13645 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEO ID NO - 163 


130 


85 


13646 


Y87212 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO-251 


423 


97 


13647 


U50403 


Homo ^anien^ 


Krpact fsiniv.!' oiinrm*QQAP f*1prnpr»fr 

L/l WdOl V*C*1H^H OU£/£Jl Wodvl vlwlUClll 

Ishmael Upper RP2 


* vJ 




13648 


G03800 


Homo sarjiens 


Human secreted nrotein SEO XT) NO" 
7881. 


128 




13649 


X58521 


Homo sarjiens 


nucfeonnrin rt67 

UUvXwUl/l 111 UvLf 


2610 


70 


13650 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795 


141 


69 


13651 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


111 


81 


13652 


G02538 


Homo sari i ens 


Human secreted nrotein SEO TD NO* 
6619. 


140 




13653 


G03263 


Homo saniens 


Human secreted nrotein SEO TD NO* 

IJLiUJlCUl JbVlVlbU jJl Ulvlilj ULiVyJ VXJ i.\\J * 

7344. 


121 

1 £» 1 


46 


13654 


U13066 


Nicotiana alata 


ciLauiiivrgcuciv^Lcui |/i uiciu jji wi>uioui 






13655 


S79410 


Mus musculus 


nuclear localization signal binding 

•nrotein 

pl U ICUl 


134 


50 


13656 


Y07766 


Homo sapiens 


Human secreted protein fragment 
encoded from pene 23 


156 


100 


13657 


Y02671 


Homo saniens 


Human secreted nrotein encoded hv 

11U1UU11 Jwvl ulwU L/l Ulwill vUvvUwU L/ V 

gene 22 clone HMSJW18. 


174 
i*.t 


4% 

to 


13658 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


171 


so 


13659 


AK000017 


Homo sapiens 


unnamed protein product 


611 


100 


13660 


U90446 


Mus musculus 


RNAse L. inhibitor 


3100 


99 


13661 


AE000882 


Methanothermob 
acter 

thermoautotrophi 
cus 


phosphoenolpyruvate synthase 


88 


38 


13662 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


65 


13663 


K01664 


Drosophila 
melanogaster 


Bkm-like protein 


97 




13664 


G04067 


Homo sapiens 


Human secreted nrotein SEO ID NO* 
8148. 


77 


41 
*t i 


13665 


Y94890 


Homo sapiens 


Human protein clone HP02798. 


325 


98 


13666 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


139 


65 


13667 


AF090930 


Homo sapiens 


PRO0478 


119 


88 


13668 


AF130089 


Homo sapiens 


PRO2550 


120 


77 



501 



WO 01/64835 



PCT/US01/04927 



SEQU) 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


13669 


AC006693 


Caenorhabditis 

plpopinQ 
v i eg cm j 


Hypothetical protein W02H5.e 


165 


85 


13670 


W80293 


Homo sapiens 


Human translocation associated protein 

/■Acicrnsif *>rl fir»9SI— H 
UCo 1 glldLCU VJL/Z. JX-r^O. 


1003 


95 


13671 


U23455 


Caenorhabditis 

plppanQ 

vlCgOUd 


similar to D. melanogaster homeotic 

nrntPin RnrW9 fPIR- A41796^ 
protein Ddxriz ^riiv./\Hi its}) 


113 


91 


H679 

1 JV Mm 


G01714 


Haiti n vmipn^ 


Human cprrpfpH nrrktpin CPH IFi "WO' 

7795. 


160 




11673 


AM32841 


fapnnrhaliH ifiQ 

VrClvlil/1 Hat/UIllJ 

pleaan^ 


Yl^FIA 1 ! 


178 

I/O 


7*> 


13674 


AF1 18086 


Homo ^finipn^ 


PUO10Q9 

JT IVV/i 7/A 


8S 


7^ i 


13675 


Y02671 


Homo sapiens 


Human secreted protein encoded by 

opnp 99 rlrmp HM9TW1 R 


132 


55 


13676 


AF220264 


Homo sapiens 


MOST-1 


133 


85 


11677 


1 I7/JU 


nuuio bopiens 


oty iu inlf *h4o rrom w \jyyLLL i \3 . 


199 
1ZZ 


D 1 


13678 


Y65416 


Homo sapiens 


Human 5' EST related polypeptide SEQ 

tn "MO* 1 ^77 
11./ PiKJAjf /. 


467 


98 




Y16SRQ 

I 1UJ07 


numo bapieni> 


/v proxein cnai interacts wiui 
presenilins. 


99R6 




116X0 

J. JUOu 


at i^n^i 


i^clblUJiania 

mainr 
iiiajvji 


prooaoie ^nnv-o ) u i iuz, variant a 
JL/n/v, complete virion genuine 


1 /VI 

1U4 




13681 


AF090931 


Homo sapiens 


PRO0483 


117 


67 


13682 


X55686 


Lycopersicon 

\*olrLl ICI IlLull 


extensin (class II) 


60 


56 


1JVOJ 




riomo sapiens 


JtIvaJUOOj 


yo 




13684 


AL096770 


Homo sapiens 


bA150A62 (novel 7 transmembrane 
receptor (rhodopsin family) (olfactory 
receptor like) protein (hs6Ml-21)) 


178 


52 


116RS 


AR1 16661 


nuiiiu aopiens 




1 1 j 


^o 


13686 


U62039 


Elephantuius 

CUWaXUIl 


reverse transcriptase 


86 


53 


13687 


Y13141 


Bromheadia 
iiniaysomana 


extensin 


77 


36 


13688 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7R67 


114 


60 


13689 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encouea oy gene j. 


103 


53 


13690 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

770^ 


107 


51 


13691 


AC003058 


Arabidopsis 

Ulallalla 


unknown protein 


178 


85 


13692 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
<ivn rn>JO'i6i 

OCrV^ JLL/ 1NW. luj. 


94 


35 


13693 


G03807 


Hrimfi QsinipTiQ 

XJIU1U.U O&JJlCllO 


Human cperptprl nrntpin ^PO JO MO* 

JULU11KU1 KVlvlSU piUlClllj OJdV^ JUL/ J/NV^. 

7888. 


14T 


7X 


13694 


G00416 


T4r»mn <"i?miprK 

JL1U1UU j>ajJl&lLo 


Hum an ^errpfpri nrntPin ^IPO TD "NO* 

4497. 






13695 


D63163 


"R affile ert, 


rvrlin C 

Ksj wlUl 


111 


86 ! 


13696 


Y02671 


Homo s^iens 


Human secreted protein encoded by 
apnp 99 clone HMSJW1 R 


118 


95 


13697 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 

1*HA la /A Fft7ft6fi1^ 


124 


80 


13698 


Y14482 


Homo sapiens 


Fragment of human secreted protein 

entuueu uy gene 1 /. 


129 


70 


13699 


G03787 


Homo saniens 


Human secreted nratein SEO ID NO* 
7868. 


93 


58 


13700 


W48353 


Homo sapiens 


Human breast cancer related protein 
BCFLT2. 


100 


73 


13701 


M76744 


Homo sapiens 


BGP 


112 


52 



502 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith* 


% 


NO: 








Waterman 


Identit 










Score 


y 


13702 


Y15155 


numu ao^jiciis 


T\fi/\cnli/\f"ir1;ic*» IrinncA Hp+?>-ciihnnit 
LHlUbUllUI Y lOoC JVllluSC UCul'oUUUllll 


206 


ion 


13703 


AF090901 


xiuiiio Sapiens 




100 


7ft 


13704 


009087 


xiomo sapiens 


riuman secreieu proiein, acy i\j invj. 


111 

13 1 


70 

/y 








/uoo. 






13705 


RO^OI! j 


numo sapiens 


iNcJurai inrcau pr uicin. 


106 

1 uu 


16 

3U 


13706 


M1S073 


xiomo sapiens 


JVIXIV^ Xll^/\~13xv'*OCut** 1 CUa III 


74. 
/ « 


77 


11707 

1 3 /U / 


VRA94R 
I 50/4 6 


Homo sapiens 


tiuman secreieo protein xiuxirroo, 


i in 


33 








O-DV^ 1J-/ IN\3. 103. 






1 170R 

1 J /UO 


f*017R0 


Momo Sapiens 


Unman oA^ratayl nrAtam OCA JT\ "hJC\* 

tiuman secreieu protein, ocy jlu invj. 


1 ^o 

1 3U 


30 








/5/U. 






1 17O0 
13 fU7 


\JVZ303 


— — ; ; 

Homo sapiens 


Mum an secreteu protein, ^cxi iu 


on 
oy 


77 
II 














11710 

1 3 / IU 


Y7ft77^ 




Caironomus 


opiz gene nornoiugue 


R^ 
03 


1R 
35 






ClQglUalUS 








1171 1 

13 /II 


v«n96<; 

yvouzoj 


xiorueuin 


Suucuirai protein 


06 
yo 


4.1 






vulgare 








1 1717 
1 J / 1Z 


A Rnn7099 
ADUU /7ZZ 


Homo sapiens 


VTA Ari/f <1 nvrtf^in 

jviaauh33 protein 


147 
l*r / 


/^4 


13 / 1 3 


VJU3UH3 


riomo sapiens 


Ilirmon eo^rAtoH rtr/vhain CPA \T\ MO* 

xiuman secreieu protein, ocy uj invj. 


194. 
lZ*f 


Aft 

oo 








7194 

/ 1Z*T. 






11714. 

13 1 i*+ 


Y09740 


T-tr\rn f\ coniefic 

nun jo sapiens 


nuiiioii SCVICICU pi <J LCI 11 ciicuucu uy 


10R 


6S 

03 








<rAnp 100 rlnnp HNPTT706 






1171S 
u / 13 


API 166A1 
Ar 1 1 OOO 1 


UAm/\ ronton c 

numo sapiens 


pt>ni41R 
rxvvJl**3a 


190 

1ZU 


77 


11716 

13 f l\> 


API 1R0R6 


T-1 Am f\ com an o 

nonio sapieiib 


PR A 1009 
rxvv/ 1 77Z 


197 
1Z / 


64 


13717 


AK"ft944SS 


numu sapiciis 


PT 100047 nrAtf»in 


01 

yo 


S6 

30 


11712 
l 3 I lo 


A PI 1 f>££1 
AT 1 1 DOO 1 


Homo sapiens 


rKU143o 


1 17 
13 / 


33 


11710 
13 i iy 


W4R1^9 


xiomo sapiens 


xiuman Dreasi cancer reiateu protem 


Ol 


^0 








Dpri T1 
HLrLl 1. 






11790 


AP1 1 QR^I 
M.r 1 1 70 3 1 


xiomo sapiens 


PP01799 
JrlvVJl /ZZ 


OA 


^R 
JO 


11791 
1 3 /Zl 


009 ^1R 


Homo sapiens 


Unman eo^rafcu^ nrA^Atn CCA IT\ XT A. 

xiuman secreieu protein^ oJcrV^ ii> inv/. 


194 
1Z^ 


4^ 
H3 








/c/:io 






11799 


V97R6R 

I Z / OOo 


xiomo sapiens 


ITiiman cAArA^a^ nvAtoin anAA/1a/1 

xiuman secreieu protein encoueu uy 


1 17 
11/ 


^R 

3D 








aahp Ma 1 A 7 

gene lit/, iu/. 






11791 
13 /Z3 


AP1611/\1 

/vr 101301 


xiomo sapiens 


XlOr ^/U70 


11^ 
1 13 


^ft 


11794 
13 /Z'f 




xiomo sapiens 


rvative niunan l le receptor signal 


1 0R 


10ft 
1UU- 








pepuue 






1179^ 
13 /Z3 


A PI 1 R0R£ 
At 1 1 oUoO 


xiomo sapiens 


ppmoo9 
rrvUi!7i/z 


100 


7^ 
/3 


11796 

13 /ZO 


API \f&\& 

/vr 1 1 0030 


xiomo sapiens 


pprki^RR 
JrKWl^oo 


1^3 


7ft 
/U 


11797 
13 fZt 


/"1A171 A 
\J\JJ 1 l*f 


Homo sapiens 


tinman ooAfata^ nrnlain Qt?A TT^\ XTA> 

xiuman secreieu protein, ocxi id rsu. 


117 
13 / 


7ft 








770^ 






1179R 
13 /ZO 


nn9 ^19 


— — : : 

Homo sapiens 


Human secreted protein, SEQ LD NOi 


140 

i<*y 


Ov 








6611 






1171ft 
13 / Jw 


O00671 


xiomo sapiens 


Unman c «*r>r*»+*» rl nrAfptn C"C A TTA XT A. 

xiuman secreieu protein, oc\i uj rs w. 


1 1 R 
1 lo 


03 








4754. 






11711 

13/31 


13333*+/ 


MAmA ronton t? 

xioino Sapiens 


ivixiv^ viass ii anugen 


191 
1Z3 


60 

oy 


13732 


R59843 


T-Tatoa cani#*nc 
xiuiiiu oapiciio 


AaaP4.T y9 TMTAt<*flO*» 


IIS 


oo 


11711 

13 / 33 


000157 


xiomo sapiens 


T-Tuman c^/*r*»fpv1 nronain QPO IT) XT A- 

xiuman seereieu protein, osz\i ixj nkj. 


11R 
130 


7^ 
/3 








4438 






11714. 
13 /3*t 


AP096904 
rvr vZOZ VJ*t 


v^aenornaouius 


Vv3uci.i gene proauct 


109 
1 UZ 


31 






ClCgallS 








1171S 


VI 9050 
i izy3u 


xiomo sapiens 


A nA m f\ QAin ca/niM/>a /\t o human 

/vminu aoiu sequence oi a ntunan 


1 17 
11/ 


61 
03 








secreieu pepuue. 






13736 


G0141R 


XXUulO Sapicllb 


rlUUlall SCviClvU prOtCtD, ODy IX/ lNv/. 


1 19 

1 1Z 


R7 
o / 








7^10 

/3 jy. 






11717 

13 1 j I 


at 15Q7R9 


i rypanosoma 


possioie ^nnv-oi ui iuz, variant a una, 


1 13 


35 






brucei 


comnlete virion ppnomp 






13738 


AF130089 


Homo sapiens 


PRO2550 


122 


70 


13739 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


74 








4478. 






13740 


AF130051 


Homo sapiens 


PRO0898 


85 


70 



503 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: 


Accession No. 


Species 


Description 


Smith. 

Waterman 

Score 


% 

Identit 

y 


13741 


AF093748 


Homo sapiens 


KH type splicing regulatory protein; 

rvOlvi 


91 


51 


13742 


Y02671 


Homo sapiens 


Human secreted protein encoded by 

p*»tif» 79 rlnnp HM^TWI 8 
gGUC wuuc rxivioj vv io. 


150 


77 


13743 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

wvIj. 


124 


82 


13744 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871 


91 


48 


13745 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


116 


64 


13746 


AB001684 

/VUvu 1 V/0*T 


Phlorella ' 
vuloari^ 


DP F *\AA 




00 


13747 


Y02886 


Hf>mf> *wnien^ 


Frjicrmpnt rtf human cpf^fpfpH nmfrAin 
i la^iiieuL ui uuxiioiJ oei/i Gieu yi vicm 

encoded by gene 90. 


14.fl 


fin 
ou 


13748 


AF090901 


XXlrlXlvr OOLJlwI2> 


PR Oft 1 0S 




jy 


13749 


AF090895 


Homo sapiens 


PRO01 17 


148 


67 


137*50 


/vr i i70oi 


fiuiuo bapiens 


PP fV>A07 


Q1 
71 




13751 




nuinu sapiens 


numan secreieu proiein, oc-y iu invj. 
6613. 




ou 






numu sapiens 


Human secrexea proiem, oc^ id jnu. 
4478. 


1 OO 


o^ 


13753 


AF1 61356 


noitiu sapiens 








13754 
i j / j*t 


I \)£.\J 1 1 


numu sapiens 


Uiitnati c*£kr^t*A'4'A/4 ^v/vtAm AnAnnAn rvw 

xxuman secreiea proiem encoaeo oy 

geilC AZ dime xXLVloJ WIO. 




/u 


13755 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4478 


120 


51 


13756 




rruuiu sapiens 


PBn7ssft 

rivUZjjv 


1 

l*fZ 


yo 


13757 


AF130089 


Homo sapiens 


PRO2550 


132 


86 




n075^9 


numo sapiens 


numan secreieu proiem, oty ilj ihkj. 
6613. 






13750 


I 17wU7 


numu Sapiens 


cpn frv XJO 397 frnrn WO007774Q 
oily li-/ lNvy JZ / iTOlTI VY\-/yyZZ^nj. 






13760 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

O7IJ. 


154 


52 


13761 




c l epi ion uii us 

pHwnrH i f 
cuwaiuii 


reverse uonscnpcasc 


ID** 


CI 

Jl 


13762 




V^illUieila 

viilaarK 

V UlgCU lO 


HPF40K 

v/xVr'rJ'U 






13763 


AFl 18082 


Homo sapiens 


PRO1902 


129 


50 


13764 


AF220264 

.rvx ^uMn 


Aivjiinj oojjieud 


IVIVJJ 4 " 1 


izy 


oo 


13765 


R59842 


T-Jrunn canl^nc 
i ilmiivj oofj ieuuj 


xvimjj->*tXj i L^i uieaoe* 




sn 


13766 


AF220264 


Wr\mn oorxipnc 

llvlUU MUlbllJ 


MOST-1 


1 1 S 
1 1 J 


71 
/ 1 


13767 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


139 


73 


13768 


AF130089 


Homo sapiens 


PRO2550 


97 


34 


13769 


U80739 


Homo sapiens 


CAGH26 


564 


100 


13770 


\J\ID I l*f 


numu sapiens 


Ultra an c*»r»t*»fA/1 nrrtt^in CCA TT"\ XT A. 

numan secreieu proiem, oc\f iu 
7795. 


198 


JJ 


13771 


R1A755 


numu sapiens 


inoei~/\ nou-o nepaiuis specmc 

allUgCUlU piULClli ClI^AKiCli uy 

nhapeclnne lambda HC753^ 




Aft 

Oo 


13772 


AE004507 


F*iPiTflom on ?vs 

aeruginosa 


hvnAthf»tiral nrnfpin fifhartprinnhacrp 

J^i r faciei lULFUdtc 

Pfl 




44 


13773 




f^ani^ fflmiliariQ 


uv^/aixiiiie i cuepiui jl^t 


/ o 


S7 


13774 


AF026246 


x ivuiv oopivixo 


UpP \/_ p pnvflnnp ryl \rr* r\ r» rn1"£* I n 
I AJ-/IV V ~Cj vilVeiUUC gljrWUL/lULCill 


I V/O 




13775 


S71805 


Homo sapiens 


RNA-binding protein=TLS/FUS-ERG 


126 


100 


13776 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


156 


82 


13777 


AB015727 


Mus musculus 


truncated granzyme M 


86 


37 


13778 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


74 


52 



504 



WO 01/64835 PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








6913. 






13779 


Y14482 


Homo saniens 


Fra<rmpnt of human ^ecrftteH nrntpin 

encoded bv eene 17 


i j/ 


Ol 


13780 


M14123 


Homo sapiens 


neutral protease large subunit 


246 


55 


13781 


Y02671 


Homo saniens 


Human epr-rpfpfJ nrotpin enrnderi hv 

X twl licit I O^wl £/l VLCLLi t>Ui<>VUC>U Uj 

gene 22 clone HMSJW18. 






13782 


AF 132972 


Homo sapiens 


CGI-38 nrotein 


90? 


OQ i 


13783 


G02469 


Homo saniens 


Human ^PfTPtpH nrotpin SPO XT\ "MO* 
6550. 


14.7 
if/ 


\ij 


13784 


G02532 


Homo sanierK 


Human oprrptpr? nmtptn m MO* 

6613. 


17fl 


fSL 
Oo 


13785 


AF130051 


Homo saniens 

IVluv iAiiyiyllJ 




1 JO 


7? 


13786 


AP000616 


Orv73 sativa 


similar tn RlMCr-H*? fincpr nrotpin 

0 1111 Hell LU A\J.lTl VJ A 1>C llll^d L/i ULClll 

RHAla(AF078683) 






13787 


Y64890 


Homo sanipn 1 ; 


Human S' F^T rplatpd nolvnpntiHp **lRn 
IDNO:1051. 




SI 
jj 


13788 


U 12206 


Homo <»anipn<; 


iinlmmx/n 
uu Mmwii 




d.7 


13789 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874 

/Of *t. 


118 


58 


13790 


S79410 


Mus musculus 


nuclear localization signal binding 

t"rtY\fPin 
UlUivUt 


102 


50 


13791 


G0221 1 




6292. 


1 ^7 
1 J / 


/O 


13792 


G00354 


Homo sapiens 


Human secreted nrotein SFO ID NO* 
4435. 


104 


77 


13793 


AB006006 


Bos taurus 


neurocalcin alpha 


1000 


100 ! 


13794 


U63332 


Homo ^antpn^ 

A2XJ1UV OQLflvll9 


oiinpr f v^fpirif* rif*h r\rrttpin* QfUP 






13795 


W34499 


Homo sapiens 


Obesity receptor C protein. 


120 


85 


13796 


U28971 


C\n pn aHi ahrliti q 

elegans 


oil u liar in isjl/ utuuciii repeal region or 
RD protein (nuclear rna-binding 

nrotpin^ 


197 


0/ 


13797 


U41038 


C*2i en n rh a h rl i ri q 

elegans 


Similar tn r"i*rjfi prin«.t\/T\p n»rv M ^tt 
oiiiiiiai wj Vrauiiviiii"*iype icpcai 


1 fc? 
I oz 


OJ 


13798 


Y02671 


± l\JUl\J OdJJlt/llO 


l-Tiimsm c< = »p , r¥^tf a r? Tnrrtt#»ir» *»nrTi/1*»H Vii/ 

xxiiiiiaii bCLricicci pi uiciii encoucu uy 
gene 22 clone HMSJW18. 


1 17 


71 


13799 


U63332 


Haiyia cpr>ipriQ 


oupcr v*j5iciiic nun pruitin, ov^Jvr 


01 




13800 


G02902 


Homo saniem 


Human cprrpff H nrotpin QPO IF) MO* 

6983. 


ins 


O/ 


13801 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


154 


72 


13802 


AF1 19900 


Homo sapiens 


PR02822 


131 


50 


13803 


AK024435 


Homo saniens 


X LsJ \J\J\J£*J ^/lWlwlll 


117 
it/ 


OS 


13804 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


142 


53 


13805 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


109 


48 


13806 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: - 
4478. 


98 


66 


13807 


G00397 


Homo saniens 


Human secreted nrotein .SKO ID NO* 
4478. 


98 


UU 


13808 


AL359782 


Trypanosoma 
brucei 


nn<y?ihlp rhhv-6^nl 1 (Y? vu riant a Hna 

wooio'i^/ "jliii v vf j ui iux., vai join a *»nia > 

complete virion genome. 


150 


SA 


13809 


M37679 


Mns musculiK 


Try hpaw rJisiin nrpniKMr 


70 


ion 

1UU 


13810 


AL451015 


Neurospora 
crassa 


putative protein 


96 


55 


13811 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


88 


77 


13812 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


128 


71 



505 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 


13813 


AF 130089 


Hnmrt cjinipnc 






75 


13814 


KOI 664 


Drosophila 

melanoo-actpr 


Bkm-Iike protein 


119 


60 


13815 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

770 

/ iy j. 


130 


64 


13816 


AJ01 143S 


imperfoliata 


maLurase 


yj 


A Q 


nfti7 




Hditia cnnipno 


A-iinKea reiinopainy protein 
terminal, clone XEH.8c} 


i in 
1 JU 


7o 


IJO 1 o 


AF1 19RS1 


Xj /Af-vi r\ cQnionc 

tlUIHU bttpidlo 


DPfM777 


1 A 1 

141 


50 


13819 


G00985 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

jUOO. 


74 


60 


13820 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OjOO. 


76 


72 


13891 


Y09749 


riomo sapiens 


numan secreted protein encoded oy 
gene 1 00 clone HNFIU96. 


J Jj 


50 


1387? 


KOI fid 

Ivv 1 vUt 


lirnoAnntlq 

j^rusopniia 
melanogaster 


1jl/fwi lily a nrAtain 

DKm-iuce protein 


AC 

95 


52 




L> 1V7UO 


Mus musculus 


Gcapl gene product 


t A*3 

103 


50 




UvZjjo 


riomo sapiens 


numan secreted protein, sbO. ID NO: 
6619. 


146 


^o 

62 


1389S 


Gftni7^ 


nomo Sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


IOC 

125 


83 




F 10Q0& 
Liuyuo 


^yfnc mil c/*i tin p 


Gcapl gene product 


96 


43 


13827 


G02971 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7 A. CO 


102 


64 


13828 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OOl J. 


135 


53 


a oozy 


V978Afc 
1 6/0O0 


nomo sapiens 


Human secreted protein encoded by 
gene rso. iu / . 


142 


70 


13831 


apoooo^o 


/\civ>pyiuni 

npmiY 
lJwlUA 


1 u 1 aa tong nypomeucai protein 


/y 


CA 

50 


13832 


AF1 19900 




PI? 095199 


OA 

y°t 


Ad 
40. 


13833 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


145 


40 


13834 


AF91 R09R 


nomo sapiens 


unjenown 


1 1*7 
I 1 / 


57 


1383S 


1> 1 UJr I/O 


1V1U5 niUoCU J us 


Gcapl gene product 


nts 
79 


A C 

45 


13836 


Z26876 


Homo sapiens 


ribosomal protein 


129 


93 






nomo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


124 


76 




rVT uTtj 1 1 


riomo sapiens 


gamma-synuclein 


603 


AA 

99 


13839 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/OOO. 


100 


82 


13840 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7A71 


154 


46 


13841 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


122 


64 




vJUwJJn 


riomo sapiens 


numan secrexeu protein, orii^ id nu: 
4435. 


92 


^o 

62 




H9ftQ71 
UZ&y / I 


^aenornaDdius 
elegans 


similar to tsxj tanaem repeat region ot 
RD protein (nuclear rna-binding 

pi UUG111 J 


93 


60 


13844 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

770S 


130 


63 


13845 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


145 


79 


13846 


G03790 


Homo sapiens 


Human secreted protein, SEQ ED NO: 
7871. 


102 


50 


13847 


G03469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7550. 


101 


68 



506 



WO 01/64835 



PCT/USO 1/04927 



SEQ1D 
NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identil 

y 


13848 


G03240 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7321. 


107 


43 


13849 


U05313 


bnicei 




Oft 


ac\ 


13850 


U28971 


CaenorhahditK 
elegans 


similar tf\ RT^> tiiriHpm rpnoaf 1 rAomn rvf 
0 1111 UOl LU UU1UC1U 1 C|XTd.l I CtlUII KJl 

RD protein (nuclear ma-binding 
riroteirrt 

III \J LV» 1X1 J 


inn 


OD 


13851 


AF159055 


Homo sapiens 


leucine zipper-like protein 


103 


55 


13852 


R59842 


Homo *MnipnQ 


^vpUJ-itJ_ii jJlULCctaC. 


oo 


71 
/ 1 


13853 


Y01158 


11VUUU 0O£JlvJJd 


ocucicu pruiein encoaeu oy gene 10 
clone HCACJ81. 


14Z 




13854 


G03714 


Homo oanif»ns 


Human QATTPt/vf nmfpin CpO TT> XfO- 
nuiiiaii ^corcteu protein, ocy lis IN 

7795. 


1 1 0 


*70 

/y 


13855 


R59842 


Hattia oaniPTic 

I AlUilV OOplVllO 


/YpunHi^i proteose. 


oi 
yj 


09 


13856 


G03172 


Homo <ranipn<i 


Hiimnn cp/*rpt/»H r>rr\tPin CEO TT> "KT/"V 
JTlUlila.il prOLGUl, ocy IxJ INvJ. 

7253. 




/O 


13857 


W48353 


Homo <&nietrc 


Wirmnn hr*»nc+ rnnrAr rflsitoH nmtoin 

nuiita.il uicaoi uaUL.er rcjaieu p rule in 
BCFLT2 


y*f 


J/ 


13858 


U63332 


Homo sapiens 


super cysteine rich protein; SCRP 


105 


58 


13859 


AF289022 


Wnmrt cnntpnc 


luiixiuninoiiansieiase cycioueaminase 
formC 


Vt£'7 


100 


33860 


AF078844 


Homo cpnipnQ 


Hkjpxjj / o pi u it ill 






13861 


AB032436 


A X\Jlll\J oaiyicuo 


ujraui-opecinc iNa"*uepenaeni m organic 
phosphate cotransporter 


zyott 


1LKJ 


13862 


G02514 




Human Cf>f*rf*ti*f\ nrAt»tn QT7f\ XT\ X1/"V 

xi. ui ii oil scurcicti proicin 3 ocy 1L/ iNVy. 
6595. 


y i 


IZ 


13863 


AF130079 


Homo *Mnipn*? 


PR02R5? 


i^i 


03 


13864 


Z29701_cdl 


Homo sapiens 


29-MAY-1998 Wild-type human c-Src 

fvn*OQitiA lrinacp /^FYKIA 

tjrl VJjlUG MUojC t>L^lN/\. 


2380 


100 


13865 


L77967 


Ovis aries 


Qmal) nrolinA-fich rtivxtpin with t>oi rorl 
oiliail ^lUllllC'llLU piULClll Willi pcUlCU 

repeat 


oU 


11 


13866 


AL050318 


Homo saoien^ 


uj7 / iDk.j ^iiiyuoiii regulatory uj^iu 
chain 1 smooth miio/*!** icofhrm^ 


OA 4 


1UU 


13867 


U37690 


Homo sapiens 


RNA polymerase II subunit 


358 


100 


13868 


X13923 


1JLVSU1U OOXJlCUd 


\,y LLK^iuuiiic c oxidase suounit v id ^aa 
1-86) 


4yi 


1 AA 
1UU 


\3Z69 


L13848 


Homo cflni^nc 


JVIN/T. IlCIltrdOC /V 


oooy 


yy 


13870 


G03172 


Homo iflnif*n<; 


nuiuwi ocujcicu proiein, ocy iu inc. 
7253. 




4o 


13871 ' 


K02064 


Doc tannic 


cyit/viuuiiic u v/Aiu<zse ouounii i v 
precursor EC 1.9.3.1 


yo 


/z 


13872 


D31763 


Homo canienc 


Ilav7 c ri> prUvCin lb ivXUppcl-IwLaU^U. 


*fOUO 


yt> 


13873 


Z14014 


Nicotiana 
tabacum 


Pistil extensin like protein, partial CDS 
only 


83 


43 


13874 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


117 


70 


13875 


Z52203_cdi 


Homo sapiens 


17-SEP-1998 Human PR0217 protein 
encoding cDTvI A T IMOl 01 


2135 


99 


13876 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478 

If / o. 


106 


64 


13877 


G03807 


Homo sanien^ 


xxuuiaii dcli citAi proiein, oxz»v< jll/ in w. 
7888. 




JU 


13878 


G00689 


Homo sapiens 


T-Tiimnn <Zf*rrf*tf>r\ nrnfpin QPO TVS • 
null la U oCvlCLCU prOLcul, ODy IXy nv. 

4770. 


1 (\A 
lU*f 


ou 


13879 


L25404 


Brass ica nap us 


cyclin 


124 


42 


13880 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


139 


59 


13881 


AC003113 


Arabidopsis 
thaliana 


F2401.6 


70 


64 



507 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


13882 


AB000098 


Rattus 
norvegicus 


MIPP65 


905 


50 


13883 


Z95114 


riuiliu bapi cn_> 


uivz iz/vz. 1^1 iNr-inuucioic protein 
CG12-1 (similar to apolipoprotein L)) 




100 


11884 




ilUIIlvJ bdpiciu» 


in u clear pore complex interacting 
protein NPIP 


551 


82 




AF191K6? 


riomo sapiens 


oomng nexin u 


1453 


99 


1 J oou 




riomo sapiens 


Uitmnn n(-w>fn4rtrl _ „ _ A „ I „ OPA TT""\ VIA, 

Human secreted protein, obvj ID NU: 


83 


69 


13887 


KOI 664 

IVV 1 V7V/*-T 


JJiUbupHI la 

m ^1 si n r% era ct^t* 
ALiciaiiV/gcu LCI 


oKm*iuce protein 


luy 


11 


13888 


AF217197 


Hrtmn canton c 


for intcraciuig repressor 




AA 

yy 


13889 


D38112 


TTnmn wm'pnc 


iN/vuri aenyurugenase suouniL o 


15/ 


y4 


13890 


API 1X0X6 




puni009 ~^ 
r s\\j iyyz. 


©J 


oz 


13891 


M58664 


Homo sapiens 


Signal transducer CD24 


371 


97 


1389? 


AKOT34A3 

/VJSAM J 


xiomo sapiens 


Unnamed protein product 


IOC 

125 


45 


13891 


AR04007? 


nomo sapiens 


VIA A TM>/\4-n*n 

JviAA i jjy protein 


ZZ71 


99 


13894 


U03750 


Escherichia coli 


DeaD 


95 


48 






Homo sapiens 


¥ fiLLLJllft ji j-i_n.lTJll-J-L.fi or?/^ TFV Vl/^. 

Human secreted protein, SbQ ID NO: 
6619. 


73 


53 


T*JIQ6 


ArU/OODl 


Homo sapiens 


Oi, , 1 1 il l Tiff 

oecretogranm Ul 


2384 


99 






Elephantulus 
edwardii 


Reverse transcriptase 


109 


48 


1*1 COR 


YA78 1 £ 


Human 
herpesvirus 4 


bpitope C13 (57 AA) 


55 


53 


13899 


AFl 16661 


Homo saoien^ 


PRO 143 8 






13900 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


77 


60 


13901 


Y28643 


Homo sapiens 


Human serine protease inhibitor from 
cDNA clone HETDK50. 


2191 


100 
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TABLE 3 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
0=Cysteine, D=Aspartic Acid, 
E=Glutamic Add, F-Phenylalanine, 
G=Glycine t ENHistidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
Threonine, V=Vafine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibte 
nucleotide deletion, ^possible 
nucleotide insertion 


1 


13902 


A 


I 


114 


434 


AIFKCVEGMFRIAMVWCFVSSGSIiLI * 
PLTY/GVYDEWTHFAYMTIDLLEI PITG 
SHPWIiNAIiFCLEAP\WISPNTGSFAYP 
VYPKSLIAHDFAVEATMPYIRLSST 


2 


13903 


A 


2 


124 


466 


K5NIPNLGDCGWESLFNR\QSWRSSLAV 
\NDTYSSKKSNAETFTFHADLCTLSDKD 
RPITIQTALAELVKHKPKATYEQLIAVL 
DEF/ANFLKKWWKAYDKENLFCEEG*KL 
CAASN 


3 


13904 


A 


4 


1 


427 


EGFLELIiRTRNHSNSQLQLTTGIGLFLN 
EGLKLVDKFLiEDV* K* YHSETFTVNFSD 
TE *AMKHIND YVEKGTQGOVDLVKELD 
RDTVFDIANYIFFKGKWDRPFEVNOTEE 
EDFHVDQVSTVNEPIMKLIiS \MLNIHPC 
FKL 


4 


13905 


A 


5 


1 


464 


KI KSFYAS KDTIKRMRVTDWKKIFAY* I 
SDKELIFTLRTLKT**K^GKQPNLKNGQ 
EI *VPISPQDIQIAHK\HIiEGWSTSLVS 
EIiAPCEAP V/ RHPLTGLTT AGLQGFGEA 
GRLVRGRWGC*W/VHPF*KYI /WQFLSK 
LHISLPYDPTTPLLGTCSR 


5 


13906 


A 


6 


308 


3 


HFVTHSKHDIiAIAHLGI Y/ PREMKT* VH 
TKTCT* IFTVALSVIARAWNQPGRPLtCS 
E WIj\ KYMVHTME * HSAI KRLNYRYKNNC 
VNIiFLG I TLS EKS QTQNVT 


6 


13907 


A 


7 


587 


2 


FLTRETGDPTGRSSSHANTQSRFFPDDP 
PG\ PLNNLGNTHGCGRRAGRCPGTGPDG 
P \ AGCGGPRCWPSGHLAATGD * GP S CGR 
LGANRGEAGPAGFTACSPLSGCRTPYTH 
HFPASRMS CHLNCAS PRTYRSQGNRGCE 
RVAQGSQGAGGERGAKSQVPVPAPARNK 
DPAKCRKPRNRRPGNSGPWRAYRRQR 


7 


13908 


A 


8 


1 


474 


RI LNEEHGKYEGLHE * EVKWHLY IKS PA 
FTDLHLCYQKDMNGISTSASSPAVGTVG 
MDMDEDDDFS KWNFYYS PHS YPDK * LTI 
FKTESRVRESDEVTQ I KVNWDEEVI SGI» 
LTSLKDNVLKATGVLYDYAYK\ YLCEHT 
RSTLKEESLKLERNLQNH 


8 


13909 


A 


9 


3 


539 


SQCSPFISPACSLTALEEETEALRVHPR 
LCLSFNLAPSSGPPRPPEIAPCPPSSQA 
GLRTCHSWVKGLHQPLPVASGMKSTFCN 
KTYTCPYPPP/PPIiCPNHSPNALTLPDS 
VTHAVPFE/L*SPSAPPSSTA*ILGSPS 
\CX5ASPCNHPHSHPGICPTPPGLWPVCP 
CAPRAWQRDGTRQT 


9 


13910 


A 


10 


2 


453 


RL*LGLEYALLVWGTPKV*H*GGFPIYY 
YI VLLLS YALHQVTE YSMYVS IMAFNAK 
VSDPLIVGTYMTLLNTVSNIilGNWPSTV 
SLWLVNPLTVKECV*TSYQNCCTPDAAE 
LCKKLGGSCWALDGYYVESIICVSIAF 
V/W*VFLVHKFK 


10 


13911 


A 


11 


20 


475 


KMGVPPLIWSDPNRFIiFPKNFLREKTIS 
PPKTF*PLKIWVKGQWVLNFLGFPGFKI 

VSQSELTAALTSPGSGDQVIIiPSQPPK* 
ENHLNLGGRGCS EPRLPRAEFLDLRS FS 



509 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny (alanine, 
G=Glycine T H=Histidine, l=Isoleucine, 
K=Lysine, L= Leu cine, M^Methionine, 
N^Asparagine, P°Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion, \-possible 
nucleotide insertion 














IIVIFP 


]] 


13912 


A 


12 


392 


63 


HIJRADPGLEPRPSARTGLGPELGCCTMW 
KLGDAGSATQSLGGSQLSWQLSRREQSL 
EQARWEAQ *QVETLGRVAREKEALAKEH 
AGLAVQLAA\AEHEGRTVSEEATHLQ 


12 


13913 


A 


13 


18 


338 


aphpqylqmppmllpprtgpgqfslpss 
rhgghle/gkehetsvtlcgge pppqtp 
prdpdpgp * arrapcprrpt * ahprals 
raapqepralagprarhpicpgsl 


13 


13914 


A 


14 


2 


371 


ttkq* kdnp iqtgakdlkrhfckedmqm 
asnht\krsltslvsremhiktprmagi 

KKSDNNKHW*GRGEIRT/LLRCWWDCKV 
VQLLWKTVWQFL*GLYQYI PHDPVISRL 
GISLILYIALRTF 


14 


13915 


A 


15 


443 


1 


SRTFLDTDMKSMRGFKAS * NRQTLLLGA 
NAAGD\KLK?UWLTNHSENRKILQNYVKC 
TLPMLYKWNNKAWMTAYLFTTQCTEYFK 
PTVETHCSEKKISF\KIIiLTIYNAPGHI, 
RALMQ I YKE IHWFVP VN I P S I LQP I VQ 
NVISTFKS 


15 


13916 


A 


16 


375 


38 


HQQNGFIjKKTDPTLLCLQBTHFR/CKDT 
KRLKVRRY/ QPNSNQKRAR/VPILIEDK 
IDFKTKKIFMMTKDLTIINI*ASNTRVP 
KSMKQKLAGLKBEMDNSVIMVGGFSYPV 
SIRK 


16 


13917 


A 


17 


448 


189 


NRDRVSLCCPGWS *TPGLKRSFHLGLPK 
CWDYIOlDIY*LL\FARHTQMISTHS*QT 
TNTCSYPAF*KSRP*EPGYNTTHTPHSS 
DML 


17 


13918 


A 


18 


1 


426 


GMSHHARSLI IKQLFKKCSTLFVL/REM 
QIKSLLASSSSSSSRNSVSGQGFETVGT 
HCGQKCKLVQPF\ WJCAVW * YLLKLNVFI 
LFLIYNRNAHLEDTCENVDRALFVIVKN 
WKLSKYPPGVKWRIKIjWYSHSVESSTAV 
LPKV 


18 


13919 


A 


19 


2 


423 


YFETFQPLLRRLGYQGTFFFP\KPWSSC 
LGIEYNSGPDSCA* FFLQNQIRLVNSAN 
IRLMAMTLKTNQVAIAQFTiECKESDQQF 
CIGVTHVKARTGWA* F*SAQGCDIiLQNL 
QNVTQ/GAKI PLW*GDFNAEPTQEA* K 
HF 


19 


13920 


A 


20 


10 


443 


LKVDSGDSEVRYVFILQHITLLMCSAYM 
NQLLNI FVRPSLLAVALHMTPGFTKEDV 
YSCFRFLRDVFADEFIFLPGNTL*DFEE 
SCYLLCKSEAIQVTTKDILFTEKGNTVL 
* FLVGLFKPF/VES YHI ICKSIiLDEK*A 
PFIEEP 


20 


13921 


A 


21 


11 


426 


VLVETNNLRMGQVTM* PELPDM/ S PDAW 
TLSDSPSQKIGHAQQ\KYSIIKWKWYTE 
DWAQACLEDTSKYEQVTQIPMAPNDATIi 
/ PSS/ AHLAFTAS * GAPSDS * LRNTLGL 
ICDGST\KPSSTN*KQTVIALQSHAGLN 
MKEC 


21 








1 14/ 


1 /Oo 


GHECLPKyiiNSNPPPIIKYLALQDI^MLL 
SQYSPSRRQEVFSLSQPGGHPHNWTAIS 
RECLNLLNGMTQKL ILYQEAAATNGRVS 
SSYPVEPKKLNSPEETAFQTPKSSQMPR 



510 



WO 01/64835 
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SEQ ID 
NO. of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
VSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino add 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino add sequence (A=AIanine 
C=Cysteine, D=Aspartic Add, 
£=Glutamic Acid, F=Phenylalanine, 
G=Glydne, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metb ionine, 
N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y~Tyrosine, 
X=Un known, *=Stop codon,/=possibte 
nucleotide deletion, \=possible 
nucleotide insertion 














PSVPPLVKTSLFSSKLSTPDWSPFGTP 
FGSSVMNRMAGI FDVNTCYGS * AT*HSE 
W\LLNI^l,LyHWLCGVFLLTTJVYVSWI 
LFKIYATKAHVFP VQPPFAEGS * VPSKS 
VK*Q3S P \L IKYLALQDLMLLSQYSPSR 
RQEVFSLSQPGGHPHNWTAI SRECLNLL 
NGMTQKLILYQEAAATNGKVSSSYPVEP 
KKLNSPEETAFQTPKSSQMPRPSVPPLV 
KTSLFSSKLSTPDWSPFGTPFGSSVMN 
RMAGIFDVNTCYGSPQSPQLIRRGPRLW 
TSAS 


22 


13923 


A 


23 


421 


3 


KI IFRAFKG* KSPSMPGFKA* KDRLILL 
LGTNADNDFKRKPM/LLYHSKNPRALKN 
YIKSIQPLYE*KKA*MTAHLLKWVTEY 
FKPIVGTYC*KI/SFKIIiIiLNDDAPG\H 
PKALIEMYREINIFMSANTLFILQPMEK 
GVI 


23 


13924 


A 


24 


226 


2 


THERTHSKI IHVI IIKITHPLNPSILRP 
QTTA* I KWRDLGSLQPLP/ PG/LKRFS Y 
LIIiPSSWDYRCPPLRPANFCIF 


24 


13925 


A 


25 


2 


611 


FFFFLLZiGLIiHQIPDVSPTCKYTTIJjPL 
MIILMISGIKEVIY/DHK*HGRQNS/VR 
NTK1L*QDSWDTFKWKEVNVGDTVKASN 
GELLPADTVPMC\YIATSNPDRETN/VK 
TRQALPETASV 


25 


13926 


A 


26 


1 


443 


ATQWRPS LVPASAENVNKARS FAAG IHA 
1GGTNINDAMLMAVQLLDS INQDELVT * 
GSVLLI I LiLTDGE PTAWETNLMS IQNNV 
REAVRGRYSLFCLGLGFDVCYAFLHKLA 
LDNGGIiARRIHEDSDSALHMQDFY\QEV 
AHPLVTAV 


26 


13927 


A 


27 


2 


359 


KAQSKQWLPRGGSGPVK/ARVDEESAKV 
I VRVYWEAQG ILL I DFLKGQRRT I YAYS 
ENILRKPEL* * RNAWGSFSRVLHHDRAP 
AHFSHQ/ TRA1G/REF * * KTIRHPHY/S 
PNLNPLDAFCF 


27 


13928 


A 


28 


2 


427 


WRKTVKGHFTDQCRKHKAM/LGN* IHFL 
EYHIHKHS IQYSGIQATQEKFGGLKTSY 
CSKD*YITVLDLTLKISNKDVSN*HENP 
ISMYKN / 1 PTVIYGSKVI KDRKLKA* I * 
CP/HRNKKF * YFDYFYVLLNTMTFLKLW 
RIITEI 




13929 


A 


29 


47 


412 


HCDVLLASSRYTCILPYSDRDDGPQDQL 
KMS VDFRS RRTGS / WFPQNS WGHMGVGG 
WGGAGRTLDLIHLRIPMRGLRSGGFLCR 
RKLVSESYG*EPSPL*K*KGGWGSEPS / 
LTTVPSQL 


29 


13930 


A 


30 


336 


419 


RLECSGMISAYCNLSLPGLSDPSNSASR 




13931 


A 


32 


1 


408 


IUJUjKLRSN YSKISGY/MANIQKLI IFL 
YTSNEQIEFEIKNTIPFTLAPPKIK*LG 
TNLTKC\ VQELYEKI YKSVMKNI KELNK 
WRS S YGKGKS KI KI I KINK* MD IACS WL 
RRLBCIVKI S VLHNL I YRFNI VP IE I P 


31 




A 






z 


fV^HT A T.T3I7A/y™\T PDTkDDPrnrvnp 

yojn.ooi_rtjr^r ij \i_ur r vuuClr Ar*K(iVj V iRb 
RQDSLS CGGLHPF * ASGLLCL PTQAS AM 
VGTPPPASLLLCSSILDCCASNERGSVG 
VEPPEPGTGHNLLVCRLLRPLEKRS IRW 
KS AVGVARFSRCRPS WLPIiARKKNS P 



511 



WO 01/64835 
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SEQ JD 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A Ian ine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, (^Phenylalanine, 
OGIycine, H=Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=GIutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^ryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possibte 
nucleotide insertion 


32 


13933 


A 


35 


304 


33 


KVWGEKVWYWQKMTQIVQWDRTESPQID 
N*SLTKEIQWRKDSLFNKW*GNNWTAPF 
SS/RSLNIJ$KDLTAVTKIKSKVWTDLNV 
KHKTIKLL 


33 


13934 


A 


36 


2 


424 


SKTYS IGHFTYEGKGTITSLWGKVNVBD 
AVGETLGRLLVEYPWTQRFFDSFGKLSS 
ASAIMGNPFVKAHGIiKDLTSLGDAIiQHL 
DDLKG\TFAQLTELHCDKLiHVDPENFFL 
LG * VLVTVLALHF/SRRFTP/ELQAS/W 
PKM 


34 


13935 


A 


37 


2 


433 


NKRLPGPGF*KRPNPPGEGGPARYFNPL 
GGPNRE I P LGPEV* THPGPQMRP PVFLK 
ILKISGAFWGPPVGPPTWGG *EGLNPRG 
PGYN*P/R/PPAPPPWG*SQAPFPKPAP 
PP/TGIKPNP 


35 


13936 


A 


38 


3 


425 


GAAQLLPVSLPS AQRAIDLQ ILTTPWEN 
WTSIASLQHKSTIGRSWLITGPISATSS 
TPTSCTYASP\TRGHV*RSASWKYDVLQ 
HSSHMWWHTAVYLGEAFHQVHATGSCH 
HRVLSCCPITGRSGRDAL*QIjLPL/HAL 
TEA 


36 


13937 


A 


39 


2 


463 


IiNEVRDI KLSSDHW PSKTNS FLHS PGFIj 
SRFBPQPASVA/PRP*SQQQSLPGKAST 
SLWPPNPVPFVTSSLSLSALPGLFLWLP 
SIPPIiPGSPFPSPE*PEVPLFPGPTYHN 
PSPPD/PTVLEAHPDQAPLP\PGVPTAE 
QRPTPAP*AHRPS/LPLPP 


37 


13938 


A 


40 


223 


408 


RNTV*NIGTDKDFMT *TPQAI ATKAQID 
KWB&IBPMSFC/TYRETIIGVTDRYTRD 
KIFESGR 


38 


13939 


A 


41 


3 


427 


NLKMKSRTSKENIGBHIYDFDIGRFVNT 
*SKPERKRLSFKTSVQEKTP*Ij*GKHK* 
MKLEP\LPYPKINSKWIKDNVRSKAIKL 
LEENRSESSIWQRML/SIMPKAQAIKEK 
VDNLDFFKI KNCVS KDTIKKAIJ3RPSER 
EKI 


39 


13940 


A 


42 


30 


448 


femreii vafhs iliryhglinlrkfq* 
mdrryskevqdlletmklfarivgpl*h 
dkfieshalefeliire i krlqeyrtagi 
tnfcsartydhlkktr*eerlkrtmi>se 
vlqyiqdsnacqqwlrretyMbsgpnp 


40 


13941 


A 


43 


1 


928 


lrqrmppqglqpcpwppryp/ pdsplgc 
pggia\ppnspswpklasiagsrgssap 
pgvwglvatga*plqrpsapppcpr/lc 
pgrepcmcp 


41 


13942 


A 


44 


2 


409 


SKSPDPTQQPLRGGSLTHSAPGPSLSQP 
IiAQLTPPAFGPSTSSLFNLQKSSLSARH 
PQRKRRGGPSEPTPGSRPQDATVHPACQ 
IFPHYTP\VWHILGPQRHTP*SVD/HPG 
LDKRLLPETPGPCYSNSQPVWLCLTP 


42 


13943 


A 


45 


3 


136 


LPMTLFTELE*ATLRFIWNHITIQIAKA 
ILTNQK\NKARGITNIC 


43 


13944 


A 


46 


365 


2 


AWGSLRAPRS AVPE PGTPP VMGNALAHR 
yr oh^UljiAjijoLjror l»Kvaf*NbPPHPGSPG 
LPKHA * APAAVS PWVDPDRTVQPPPPAP 
PST\D* PHCSPPCTPWCPRRLGSAPVMP 
GCPTASYPRR 


44 


13945 


A 


47 


1 


456 


AELSELYEESSDLQMDVMPGEGDLPKW\ 
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C=Cysteine, D=Aspartic Acid, 
E=G!u tannic Acid, F=PhenyIalantne, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leu cine, M=Meth ion ine, 
N=Asparagine, P=ProIine, 
Q^Glutaraine, R=Arginine, S=Serine, 
I -Threonine, V=Valme, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 














EEATGTRAAI PWVPPATGAQQLEEEGPM 
EE\EEAQPMAAPEGKRSLANGPNAGEQP 
GQPSPGRRTSRAEDEA\RBFDDWEDDYD 
F\PREEPVKGARLRFLPPS * KTPPSFWE 
WRNTPLWGGUCI FY 


45 


13946 


A 


48 


116 


442 


PQNMGMERKPGPPKGLSWASGDLEDNIiG 
VMSVGFLIjSSPDDAVIWRGPRICTGLIKQ 
FLRNGTGSKVDYFMGDTQLGTSDEHLSS 
VATWAKAHLGE PGAFTFPQRVS LRE 


46 


13947 


A 


49 


434 


3 


FTVPGPLLTVIPIFISKVPFKFSLPQIF 
NBTPFGFPFQIRV* TI I PISLRFFRKKP 
LFFFF+NKVPLCSPYWNSIARSWVPAAS 
VFQV\KESFYIiNLLSVPPQVPLNVFLTF 
FFFFFLGRDR/SLPLLPRMVWNSWAQAI 
LPLWL 


47 


13948 


A 


50 


1 


482 


EK^yQCSDFGKAFNMKTQ\LVVHQGVHT 
GNNPYQCRECX5KAFGRKEQLTAHLIAHA 
R*KPYGCSECGKAFSSKSYLVIHRRTHT 
GERPYECSSCDRAFCGKSQLI IHQITHS 
TENHYECNECENTYPRKASLKIHQKIDL 
GKKPF/EC2vTEWGKAIiIiK 


48 


13949 


A 


51 


1 


470 


REFLWQEGHRAFATVDEAGEEVLQILDL 
YADVYEELIiAI PVANGTXTDKDKLAGGD 
YTATIEAFICASGRAIQGGTSHLLGQNF 
SAMSEIVLEDPEIPGENQFAYANSWGLT 
TRTTWVMTIGHGDNMGL/LLTPRVA*VQ 
/MVIXPGGIPKAFLKRQKT 


49 


13950 


A 


52 


26 


448 


S PGTEREYRIGQQSVTGVTS VDDSNSYW 
RIRGKSATVCERGTPIK*GQP IRLTHVN 
TGRNLHSHHFTSPLYGN/ QVAL*GDTVI 
I L* RRKQRIiKGFTEEG I KLRFKEVS AFG 
DEGEGDYLDDWTVLCNGPYWVRDGEVRF 
NT 


50 


13951 


A 


53 


3 


495 


AMEVKAFAETHIRGFTLiNDAANSRLIIT 
QVRRHYLKEAATTLKTVLDHQHTPSRLA 
VTRVIQALAMKGDV\DNI *VFHKMLNGL 
EDSIGLSKMDFINNIALAQI *NSNLDAS 
V*HIENMLTS*NNVIBPQYFGSAYLFRQ 
VLEEQLEPTVEKIS IMAERLAKPLQ 


51 


13952 


A 


55 


1 


428 


QERGTKKEMEORMTLEETK\EQILKLEE 
KL*ALQEEKHQIiFLQIjKKVIjHEEERRRR 
KEQSDLTTLTSAAYQQSLTVHTGTHUJS 
VQGSPGGHNRPGTLMAADRAKQMYGPQV 
LTTRRYVGIAAAFAGTPEHGQFQGRPGG 
VYG 


52 


13953 


A 


56 


2 


453 


EDGDLDAFSSDBGLTMAMSYLKDDIFRI 
I X 1.ETQECRR YHRPPCAQEAPCNMVHP I 
VI CYGCYGPEVGTRYTCSVCPDYDLSSS 
CKGKGLHRGHTKLAFPSPFGHLSEGFSH 
SRWLRNVKHGLLRWS + WEMGPPGNWSPR 
TSCA/GEARLGP 


53 


13954 


A 


57 


3 


435 


ELNTS I FRSRP IEGLGLNTVLTTDNSNL 
VI \NRIGIV\PSVTEKEYTDPSSDGTYA 
WKI FSHET I TKAQ I LKLFLS YDYAVNNP 
WIAYPHYKSPEKCPSI ILHDRLYYLNGL 
* CAASAMVMIAI VTYNVALLAYHRWNGL 
TYMID 


54 


13955 


A 


59 


409 




LCCEHRGKTVCVPRGKFTTLNAS INKER 
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C=Cysteine, D=Aspartic Acid, 
fXJIutamic Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, I=Isoleucine, : 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Clutamine, R=Arginine, S=Serine, 
T^hreonine, V=Valine, 
W=*Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
n acleotide deletion, \=possible 
nucleotide insertion 


• 












SKISVLSFRLMKLKKEYVK* K* \EKEKI 
NNRIRIEKINKAKS/WFIi*HNKIE*TIR 
MINHQKRENIQITDFRNERTELPPDSTD 
I KKAIRKY * K* FYPNI F YNPBEMNKF 


55 


13956 


A 


60 


1 


393 


GNVSSCGDHPCEG*LCP\PDKVMS*GI* 
VPEEACTQCIGEDGVHHQFLESWVPDHQ 
PCHICTSLNGRKDNCTTQTCPTGKASTC 
GLCELARLRQNADQCCPENECVCDPES C 
DLPPVPN*ERGLQSTLTNP 


56 


13957 


A 


61 


250 


1 


REDCKRVLYKDKVSLCCAGPCSVA*SCL 
KAASTFQ\VKHFSLQSSWVYRHAPPGLA 
NFFHFLQRQVLTTPPGLVIiNSWVHAI 


57 


13958 


A 


62 


365 


3 


IKKKLIWRLFTFPKPSWGKKTKKGIPFP 
*GPKKIPGIKFPQGIKKAFKGNR*T3jGK 
GN* KKKKKKRBCGPGGDIPCS * IG/RNI 
VKILIIPKAIYSFNTIHIKIPKTFFTEI 
ETTIIiKFL*NH . 


58 


13959 


A 


63 


245 


558 


FLPTQVIS YVKRALAEGAQI * CGEGVDK 
LSLPARNQAGYFMLPTEITDIKDESCCM 
TEE I FGPMTCDVPFDS EEDVI ERANNVK 
YGLAATVW/ SSNVARV 


59 


13960 


A 


64 


106 


532 


ERACQSGTSGQGGVPRRAIGAPR/E/DA 
FTGAVYI YHGDAGGI VPQYSMHPFA*SL 
YPSGQ/SVAARGNLSGDSFNCHN+GLQG 
LLLASSV*RPVMLLliJDPIIYTTKN/ISG 
PKCHE 


60 


13961 


A 


65 


2 


548 


VQKYPQQNRS CVP PVAE WAVPQS SRLKY 
RQLFNSHDKTMSGHLTGPQARTILMQSS 
LPQAQLAS I WNLSDIDQDGKLTAEEFIL 
AMHLIDVAMSGQPLPPVLPPEYI PPSF/ 
RRVRSCSGISVISSTSVDQRLPEEPVFR 
DEQQQLEKKLPGTFEDKKRENS * ROJLE 
LEKRRQALLEQQRQG 


61 


13962 


A 


66 


72 


952 


SSRTYTTSLPLLKKEVTRKIRKYFIU3N 
N*NTTYHNLWDLTKWFSGKIiLF*RSR* 
EYRKDKISDLCFYIQMLE/QRQVIKPKV 
STRKEINKSRTQ*SGKETMERPKLEDKF 
LVNINNINNP * *LNL* RKKEGHHCIFYR 
H*KDNKGYIYANNFNILDQMDKFFERYV 
TKMD\QEQIVSLNIPIADKSNA*LNNFP 
SSSSSSSSSSSS 


62 


13963 


A 


67 


139 


422 


VNGNEPB PL* KGI SRHEHRRQPHNGFRP - "" 
KNKGGAKNQ*ASLGMKTPEAPAHSDKPR 
RRQHAAYSS*AMPFLGICISFSQCNL/C 
PPKLNV 


63 


13964 


A 


68 


274 


1 


NLKNKAVITKTA*H*QKNTDQWNRIKNP 
EIN\HPFYGQLIFDKDAKNTM/WKR/DS 
NFNK*CWEN* ILTCKKMQLHP/SLTTNT 
KLNSQWIKDL 


64 


13965 


A 


69 


22 


419 


KI IGEDGDT/ PLSEMAKSQRQ I FS KDLV 
KLDSTINQVDI IDIYKLHHP\KWTVYTF 
FSSH*GTFSKIDHILLDKTYPNNL*RTQ 
IMQHMLSDI KVFKLEISTRKVTEKS *HT 


65 


13966 


A 


70 


57 


423 


KDKNSQVTSEEEEQEGKIKENLNSWRDT 
R/CH/WIGR\NIVRLSFLPKLIFIFNTV 
PI KIRTQFFMKLNKPVLKFI Y* TKRTRI 
AKSLLKKK/RQEGKISPTNRRTYYEAYS 
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TLESIGASMPSLDLG 


66 


13967 


A 


71 


394 


1 


KHFRCMLQEDCRILLLWFFSETGSPCAV 
SPRIBCSSKIIVHSIiKIiIiGSSDPRASLS 
*VAGTIGVHHHTQP*GI/CNI*GSGRRR 
SEKQWIKEKNVSSEAKRRRTF*NE*SV* 
KGDYNKDQGLRLGTVABAYN 


67 


13968 


A 


72 


485 


1707 


SALGLSQPRPPGASDAAPDSTPAGV*TA 
SSSAPAGEKDANQMEWARRDPGPLHQAA 
PAPH*SHLVRDKRAQERIjRKTSRGPPHS 
HRSGPVDLSPRSSHSRDSTPVHGPYFRS 
APDPRGSAPAFRGP IATRGR/RPRAPRA 
TASHVPLSSSSWKEPEEATAPRF*ASAC 
MKWRLEAGCPGHRGVWPGPPSRRIiFMWA 
TATGEKSTPSPFTGVWRFGVEACT*AGS 
PTAHIQHSQRTWQPVPPCTNGSRSPWTS 
SYPKNQSKVGTVPKRGWACQPIPETAVR 
QATISQ*ARKGAARPHHQARQVEHSTQ/ 
QRPSGDSGKFDQRSLCEASTLLQPERPE 
VQGKI CKFRV/ GERRRTAS PNS AVPEKR 
TRTQTMVGTAFIGCSGKWRRVYSVRR/R 
SSPRIPPPGVGSS\PLKTPPPP 


68 


13969 


A 


73 


1 


462 


QDHRSKSSHSNKRPSIASSLSENFFQRA 
AKPLQVNKWKKLYSTPLLAI PTCMGFGV 
HQDKYRFLVLPSLGRSLQSALDVS PKHV 
LTKRSVLKVACRLLNALKSSLKMSWVYG 
RVT * KYIiL* F/ YRTTNRGLL * SMGFAFR 
YLPHGDNPAHLERGN 


69 


13970 


A 


74 


307 


436 


LP*VGCITEGIPEQDMVRGKHKIiLRQEK 
TRIiIGPNCPGVINPGECKIGIMPGHIHK 
KGRIG IASRSGTLTYEAVHQTTQVGLGQ 
SLCVGIGGDPFNGTEFIDCLEI FVNDSA 
TEG I ILIGENGA/HAEDNAAECL 


70 


13971 


A 


75 


1 


435 


EISDSKAQLAAMALIIDTWBRMNCFSE* 
NHEPLRTHCAIiAASKLtiKKPD\QAE\RB 
HLCTSL\WSGTNTDKNGEELHGGKRVME 
RLKKALI IAHQGMDPSLQVRVFIEILNR 
YI YFYEKENDAVTIQVLNQL IQKIIjEDL 
PNLESWK 


71 


13972 


A 


76 


58 


366 


E FPDLVKDMNLHI QEAQCI PNKINLNKL 
MNRHRI I /RLLNTTTKKRILKAAKQK*N 
I/T/IRGSSICMMMDFSSEITEDRRKWH 
SI*KILK/EKTQNCIiPRVFCPVKI 


72 


13973 


A 


77 


1 


445 


YHETGCFLMGAIVDITLTFNTYVHFQGK 

MKGFSLLADPQEFWVDNSTSMSAPMLSG * 

MGTFQHWSDIH\DNLSVTHVPFTDSACL 

LLIQPHYAFDLDKVEGLTFHQNSLNWMK 

KLSSRTIHLTMPQLALQGSYDL*DLIaDQ 

A*LSDILTP 


73 


13974 


A 


78 


53 


444 


ERGGYGAGPVAWQFIiVPSTAPMLQSPPL 
GFAIDHTPPVPAPAN*APCPI«PYAA*RT 
TGPHHIAHLDTTFGTGDTPHPSSPASPP 
S TPWL PDAFPCPLPTWD/ RPPCLQPLLS 
SIiPTPRLPFLCFLLLAPYAP 


74 


IJ1//J 




TO 


1 


353 


HI RTELD YYGIiT V VT I YSDT * EAYNYI Y 
IMVT *NVYKPQLWNI FDRATMHS QDVRH 
HLLC I RLMLKNP KHHAV* / VLNGHYAFV 

SR/SFKHALVQYVQAFRTHPDBPLYSFC 
IGLTFIHM 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
C=Cyste1ne, D=Aspartic Acid, 
&=G1utamic Acid, ^Phenylalanine, 
G=Glycine, BNHistidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutaraine, R^Arginine, S^Serine, 
T=OThreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


75 


13976 


A 


80 


1 


350 


IiYFYALIiPLSSG/CVAYVATRDNCCILD 
ERFGSYCPTTCGIAYFLSTYHIKVDKDL 
QTLBDIIiHQVEDKTS * FKQLI KAIQLTY 
NPDDSLKPNMIDAATLKSMKML* *IMTY 
EASILT 


76 


J3977 


A 


81 


76 


386 


PA YPQVRGPAS TPAS CIRP TNARVLS TT 
PRGKSVAEAHSVSPSAHRGVTSVIKLWS 
AKRLH* YGA\KVRPNS \GCTP 


77 


13978 


A 


82 


1 


360 


ESILQEDITVLNVGAPNNRASKYMRQ/M 
LVELQRQIDES I I ICEDFNTLI SEMDST 
RPKISKDIVGLNGTIN*LDI I0IYRLL* 
PTTAEKTFFSSSRGAFT*MDHIIiCYETH 
IYKFKLHE 


78 


13979 


A 


83 


39 


344 


WGILEGGEYIWHVGGGKSLRASGGLWSQ 
PPRQPDDSRPLVLVPVCHLLPHWIGPTD 
LGHKRQW/MGPGAVAHVCNPSTLGG*GE 
WIT* GRE FETS LANMVKPC 


79 


13980 


A 


84 


1 


427 


QQAHLAGHHRIHTGEKP*KCEACDKVYR 
GKSSL/MK/HRRIHTGEKAYKCEECHKV 
YSR/N/SQTVKDRRIHTGEKP * KCKS \C 
DKAFGHDSHLAQHARIHTGBKPYQCNGC 
GKAFSRQSTLVYRQAIHGVGKLY*GNDC 
HKVFSNAT 


SO 


13981 


A 


85 


88 


307 


TWTQRRAKLVRRIGWALLPFPRPSGSKP 
PP*TPPALPPCVP/PQSSPWTPPQPPAP 
DSREVSKDWTQMRSFKEN 


81 


13982 


A 


86 


5 


375 


NSLLNTHSRGPGASHCTFWIiHERASSRD 
LTGAESYGICRtiRRLISLSIVFSKSIHG 
LACISILFIiFLFLFLFFVKTGSWLCCPG 
WPQAPELRQSSLLSLLSSWNYRR* PPHQ 
LIF\DFFCRN 


82 


13983 


A 


87 


2 


342 


VIKNEDHYIMIKCLI*QEDQTILNIiHSF 
NKTNINIYKPHMTNLQKAVDKITirV*T 
CGTSLS I IAVCRLKIiVKL/VEDLSNI IN 
KLDLM/DN-IYKTLHLNIRDYTFLKHTWN 
IYKN 


83 


13984 


A 


88 


1 


338 


CNEPRSHHCTPAWRQS KTPSKKKKQKKK 
LTTS/CIKASMKSRVQG*LR/ CWAQVMG 
* *GGVLWLFVK/REFFTIiSINLSKEGHS 
RRVPCLGCLKKKKKKKKKKKKKKKKKKK 
KKFF 


84 


13985 


A 


89 


931 


2 


VASPNALPE*LLPVSGHTHDLERVGVAR 
LGQPHHTELTACRARMESRDTCPGVTLH 
P*PPSFPSSSSPSPGGPRTRLTHQGAGL 
EGSQGPIiQSQNPSAA/PLGACRGGWEWP 
QGPGSGS *GG*LMLRELRE/ WQBAAVQL 
PXPG * PS * ESGRPLPSS ASGPTPPGSP / 
SP*PGTQGLCGCHPSGLCATAAGPQDGS 
GPTPTPH\ PVQPS CRDSGPGQRLS PTPS 
LTSWPNTRPSPPTGPGPGPGGRATW/PG 
S/PSPQRESPSPQLPCCTPGSFAGHPCP 
AP\AAPSSVACPLP/PDGFPRAPASSGI 
TTAPSPGPDP 






TV 


on 


66 


ACA 


Li VV vloo X: ir KijirijxlotjiVCiO fr CvxU J. Ltrt rvjLi 

TLFLSPSPSPIiS\CHSHLSPTCRRPRES 
PRFSLCWRPHPPSKPPPG*VKRDCCPSP 
DPVRLSVSENPSAGPR/VFLRPP 


86 


13987 


A 


91 


197 


2 


NAEIIPAPQKVGPFIKIWVINFFFPFFF 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFF + DRVXiLCHPGWS AWQS * LTVASNF 
*F\K*SSCI.T 


87 


13988 


A 


92 


317 


2 


SRTPDFR* STRIjGIiPKCWDYRCEPPCPA 
TSSF* *HLSLC*QGNL/WGPPDSIRMG 
AG\TRKTKCWEGCHFLFFLRRSL/NSVA 
QAGVQWRDLCPLQSLTPRFKRLS 


88 


13989 


A 


93 


1163 


0 


FAFGFEM* YCS VAQAS\ VQWTOLTLLQP 
PLPGVQSDSPASSLPNKLGLQGTSHHAW 
LNFI» 


89 


13990 


A 


94 


3 


363 


VLWPCRPAGPSLGLNFPLYSWLRLQTFA 
AIRPGSTGRRLCLPNWVFTRNSLPFH+L 
VC/CSSRHNTYLQECTGHREPTYQLNIH 
DIKLLFLRFAMEQSFSADTGGGGRKSWI 
HLIPYIIHT 


90 


13991 


A 


95 


17 


353 


PEPRYVGFISGGKVDIKKTWKTQGRIiER 
TVYYTGMYFINCH\ VLKNTDSS WGTS P * 
* IQQHAGKRDNNYQLL\KLQG\QFSEAY 
TKCYSNPDSTHDVRKVYQDCPLLAPLND 
TS 


91 


13992 


A 


96 


3 


355 


ANYPRDLCKVTDEGGYTKQQIFNADQTA 
LYWKKMP\SRTLMEREEKSVSGVQASKD 
RLTLLLGGS VAGDVK/ LKPMLI YHSENP 
RIC*SYSLPVLCKWN/NKTWMTAHLFTT 
WFTECLNPL 


92 


13993 


A 


97 


367 


2 


SLHRARRGKGVGVRMGERLPSFQSWRLR 
L/RRRKLCRR WGQGPRGN* Q * QKPDGGA 
RASQTDGGGERRRGVQTGSQAERDTSRE 
TYSPRGIQREIVRVSKGGKTGRQWQSQR 
QIQREKRVGR 


93 


13994 


A 


98 


115 


359 


LNFFFYLLNFMLLFHRYYALKVSYFKSS 
LDRKLLELLWNKYWVNTLSSSSLLTVS/ 
DYTCKECI*ILSKLHSRLISETLFHRK 


94 


13995 


A 


99 


1 


490 


CVDPRVRTIFKKDKVGGLPLLNLCIYYK 
ATVT*TVWYWHKD/RTDQWDRIKSLEIN 
PCI YGQL I FFNYYFFSQ*CQDS SMEGNS 
LFNKWC/CRPLTLYTRIGSKWIKYLM*E 
LKLYRLSLHDPEFGKGFLEITSEVEITK 
EEIGKLD\LAKLKTFC 


95 


13996 


A 


100 


195 


3 


SRCCKQRHLVQWNRTEKPE/ YGRLIFNK 
GVKTN*WGKNNLFNKWYWDN* ISTCKRM 
NLVPYIiSPYT 


96 


13997 


A 


101 


1 


371 


FSLIKISMMIJjMKMEK*NLQFIWN/HRR 
LQIAKARLNKKNKTEGITLPDFKIYYKA 
WWYWHKKRHIDHWNRLENSNINRHICS 
QIiILTKVPGANTKDHPFNEWSWEN\VCT 
KMKPDP/YLSSYTK 


97 


13998 


A 


102 


6 


370 


KKGTIPNYF*RE*TDRSKPN*NYATKEN 
YI/PIS*NKKIL*KLANKIQQHIKKKPD 
NSLFYKIQFGSILETFITINQISKPKEK 
NHIIISTDAENA\FDKIQH 


98 


13999 


A 


103 


298 


24 


KCWTS I YLLLFFRDE VSIjCCPGCAQTPR 
LK*SFCLSFPKCWDYRREPLVPGLDQ/L 
FLESNRS VSVLKKNVP * WFSNSSGSKSG 

MTJT>7\ EW"» T C* 

UtiUAt (Jlj-Lo 


99 


14000 


A 


104 


2 


352 


GTIADFTQCCQAADKAACLLPKIiDELP\ 
QNGRLRS AKHRLKCAS LQKSG KRTFRAW 
T*AGLIHRFPIAEFAEVSNLPTELTTVH 
TECCHGDLLECADHRADLAKS ICENHDS 
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peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cystein e, D=Aspa rtic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glu famine, R=Arginine, S=Serine t 
T=Tbreonine, V=*Va!ine, 
W«Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ISSKL 


100 


14001 


A 


105 


139 


361 


KHTEAF * MPCAKY Y \LFGGLNQ I CALP E 
KTPVSDRKTKCCTESLVNRRPCFSALEV 
DETYVP KE LNAET FTFHAK 


101 


14002 


A 


106 


3 


350 


elemimlseevmlkakirqklgll/qpv 
s * wnakekflke i ksatp vntqm i \ kn 
sliad/mewwiddqtshnvplsqsliq 
s/k/altlfssmkaergeeareeklbgs 

RX3WFTRFKER 


102 


14003 


A 


107 


225 


1 


QGI MMDTVEYVGKGE PVRCWWKCKL VAL 
L*KTTWKLLRKLKI*\YDPAVPLLDIHP 
KEIKSIYQRDSCISMFTA 


103 


14004 


A 


108 


390 


2 


LGYSGVRAPLEEAVCRFSDLKLRAGRTT 
TLFKADRQGHLSLQRFLLTFDSLCP/AP 
RGGVYRGRQASLSCSGLHPVGASWPRCL 
PTQASAMAGAPAAASLPPCSLISDCCAS 
NE*GSVSVGPSEPGTGHN 


104 


14005 


A 


109 


304 


33 


KVWGEKVWYWQKMTQIVQWDRTESPQID "' 
N*SLTKEIQWRKDSLFimi*GNNtiTAPF 
SS/R5LNLNKDLTAVTKIKSKWVTDLNV 
KHKTIKIiL 


105 


14006 


A 


110 


135 


1 


KQAI LWPGAVAHA/ WSCNPSTLGGPGGQ 
IT*GQEFEKSLTNMVKP 


106 


14007 


A 


111 


140 


338 


IMSIILYYGSICTCYYWLlD*LILRWTL " 
ALVAQAGVQWRELGSLQPLP /PCLSLSS 
SWDYRPP 


107 


14008 


A 


112 


26 


361 


RSFPWVFFFEKKSLLGIPGGKKGPPLN* 
LEPLAPGPKGFSGLNPLENGNWGPGPPC 
RGDFGFF* KKTPFPF\GPGGVKTPNPGE 
TF/EPKPPKGVGFPGETPGPGPGENLYQ 
WK 


108 


14009 


A 


113 


11 


330 


AS I WLHI FtiFLS YFLEKGPR YVAQAGLN 
LLGSTGPLS ASRAAGTTGALHCTQL* T Y 
LLLPKHGKAVFF\QETUiLRSTHPFPLK 
HVYTPTYELLLVWDRDSLSRPR 


109 


14010 


A 


114 


350 


2 


GCCPCLLFPGS PTVFGSLYS I RLVAFVR 
AVPPVWKAVPG*PPVKM*FFHLFIYLFL 
KQ/VSLWHPGWS*LAQSQVSRFFPRSLP 
SNWDYRGAPPRLTLFFFFFKQSQGIEAM 
VIAL 


no 


14011 


A 


115 


3 


341 


RMVS IS * PRDPPAFASQRAGITGVSHRA 
RPVYSFLLLSNVPVMDVPQDIYPFLVDG 
YLSLPLVCCV*V*VLLCYPTWPPGLKRS 
SCLSLPSSWDYRHEPLKPA/SCCVLLSS 
G 


111 


14012 


A 


116 


1 


384 


PS YPGDLTEIMDEGG YPNQQI FS IDDTT 
FYWKKMPSRTFLTTEKSIS /GF* ASKNK 
LTFLIiEANVAGDFKLKPVFTYHALRNPR 
TLKKYATSTLPVLCKWNNKA* VT /AHVF 
TTWFTE YFKLS VETCCS E 


112 


14013 


A 


117 


43 


347 


CAAGFGLL* TPPRT * TRKPRRNGTYKP I 
SLMNINAKILKK/ILANRIQQHIKTK\L 
LQSDQVGFI ; 


in 

1 13 


14014 


A 


118 


351 


23 


FNK*CWHNWIFTCKGMNLEPYFTPNTKI 
NAIKAIQLIiEENKEVNLC*LGLGNGF/S 
RLPKAQMAKETIDKLDFI KLKALCSK\N 
TIKKVKKSEELEKIFANHIHGTSIQNI 


114 


14015 


A 


119 


123 


340 


AADSSTHSLTKPAYLEKQNFFFFQMEYC 
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peptide 

sequence 


Predict- 
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nucle- 
otide 
location 
correspon 
ding to 
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add 

residue of 

peptide 

sequence 


Amino acid sequence (A= Ala nine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Leucine, {^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V^Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LIPLAEGPWPNLA* MQPPPTCFKQSSCL 
SLPSTWD*R/HWSPRLAN 


135 


14016 


A 


120 


1 


371 


PSYP*DLTEIMDEGGYPNQQI PSIHDTT 
FYWKKMPSTTFLTTEKSIS /GF*ASKNK 
LTFLIi*ANVAGDFKIiKPVFTYHAIjRNPR 
TLTKYATSTLPVLCKWNNKA*VT/AHVF 
TTWLTKYFKLSVET 


116 


14017 


A 


121 


82 


367 


YSKRSNTIVAGD/FTPLSAIiDSSSRHRI 
NKKTSNCTIDPMDVIGIYTVFHTVSTBY 
TFFSSANGPFSKTDHLLGYKTSLKTF*T 
KLK*HQ/CIFSDHN 


117 


14018 


A 


122 


27 


253 


MKTBNILGENIGEVFGVGKDS LDMSPKA 
* TIKEKNDKLFF I KVKNFS YS KHTI * KI 
KNQATVWQ/KLLTVHKSDR 


118 


14019 


A 


123 


3 


305 


GTRQGCPLSPL* FNTVLEILVR* ISQNK 
*INKT/SIjrADP/MVLHIENPKGSIK*V 
LELINEFSQVAGYKINM/ QKTVAFLYTN 
N*LSKKK IKKTIQFI IASKRT 


119 


14020 


A 


124 


1 


327 


qvifclglpkcwdyrhep7rmpgi,rgad 
ffsli*ilwehmf*vmcilphpcgdfwa 
mlnf*eregmff/clkrwkshnvsqa/g 
pqtprfkrfsrltllsswdhrhapphla 

K | 


120 


14021 


A 


125 


74 


334 


I FEFGVLKVITCIiS VSSHEVGKLCPFFF 
FFKNGV* FCCPGLQG YNGS * L \ TLELLK 
QSSCLSL * CTQDHRCLPPCLANFNLINS 
SIH 


121 


14022 


A 


126 


381 


60 


TVSQVLAHTCS PS YLKAKAGGSLE PGS C 
YPGCSEIi*SCHCTPAWIT\SQTPSri*KI 
H * KKI I VKRAI VE CV CVYVYTY I WLL I L 
KFSDS I PFAQI WGFHEEFP YWNQC 


122 


14023 


A 


127 


323 


2 


RGFLGLSDEVSGQQLFS ILSR IENQFKN 
NIiBGCGGSCL/SIPKCWDYRCQPSRIAY 
A*LIFVFLVE\TKNTGFRHVGQAGLELL 
TSSDLP \AIiASQNAWLTGISHHAEPY 


123 


14024 


A 


128 


2 


354 


ENCQINNLRFHLRKPKKEEYTKPTASRR 
K*IVNIRIERNGIEQK\TIEKINEI*KSW 
LFENINKIDSHSGKWMVGEE * CVFVCVC 
VRERQSDRERETRIiI KI *NERGDITADL 
TEIK*LLILQKLNNY 


124 


14025 


A 


129 


74 


327 


GELAMLPRLILNS * \PCNPPA\SASLVT 
GTTDTPLYPAHYYY\II I III I IEMEFH 
S \ VAQAAMQ WCDLGS LE P L\ P PGFRRFS 
CLSL 


125 


14026 


A 


130 


83 


382 


YEFHASDGGSRLSSQHFGRPRWKDYLSL 
EVEGQPGQHSETPSliSLSLSF/CFIjEME 
SCSIAQAGWQWPYLGSIjQPLPPRFRRF 
*CLSLPKCWNYGHEPQIi 


126 


14027 


A 


131 


749 


2 


RQSLALSPRLECSSTISAHHNFCLQGSS 

dspasasrvagitgvhchaklifvflve 
trf\thvgq\aglelltssdlpalasqs 
sg itgmshhvqarcvif/cgptfknqdm 
ky*nkkeknqstflehlqnqqbnrhaah 
tlrhhias/lksseflfairtsflis*k 
rsgttgmf*yn/wtigngvedrfvlgpp 
fglgvqwyhhsllqpqspglkqsslfsl 
pss *dyrhvptpsflkflrrwgi»aillr 

L 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A«Alanine 
OCysteine, D=»Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, f=Isoleutine, 
K=Lysine, Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \=possibie 
nucleotide insertion 


127 


14028 


A 


132 


371 


1 


KKNLNLNLTL * VKINTNWIMDLTIKYKI 
tT* KKQEKVGNLEQGKEFSDLTPKV+ S I 
BQGNTDK*DV*KPFSL*KPM*GNYRQATD 
*KKMSAKHVSNKYP\LSRIYKPNDKAKN 
KPIRKWAKQMN 


128 


14029 


A 


133 


2 


318 


NHKIILKDAKKAFDKM*HPFTMKILNK* 
GIEQYAT/ DI I KAMCDRPTAD 1 I LNGEK 
LKAFPLRSGRRPLL\FH*IW*VIARTVR 
PRKRNKGSQEWWLMALIPSTLGG 


129 


14030 


A 


134 


3 


362 


WSELLGSSDPPALASQIAGI IDMKHYAQ 
PGTDF/STSVYFTLPTQIRL*ALKRKFL 
L/EFK* IFASI *QFAKCSCMSRNFLPAQ 
PYHHWHFPNEETEAQGNFSLLWAC/LRE 
DWIPTD 


130 


14031 


A 


135 


12 


398 


KCSTS I VIRE IQVKTRE I YHYTPTKMAK 
IKKPDNAKC/WSGCGATKTLIHCW*NN\ 
SLEIWLFLMKFNICLPCDPEIVPLDLYL 
REMKTYIHENTYTKVS IAPLFK\QPKS P 
STGES INCS III PRMECI I 


131 


14032 


A 


136 


309 


1 


QSEAPSQKNTKKQKTKVGGLTS * PQTYH 
NATVIKTVWH/W/YIGIDIDQWNSIQSP 
EINFHIHGK/LISNKAAKTNEWGKNSLF 
NK*HKDNRIFTCKRMKVDSQPHI 


132 


14033 


A 


137 


2 


251 


QCGKAFRAASVLRMHGRTHPEDKPYECK 
Q*GKAFRSASHL*MHGRTHNQEK/P/HE 
CKECGKP FRSAQNLRI Q * RTQAHIRMHS 
G 


133 


14034 


A 


138 


396 


3 


IiELLTS * SACLGLPKCWDYRREPPHPAK 
MIL*SKSS\LPFLCSKCLNTIWAIiLLLL 
CSLVPIiLTLPLKEFYAAH*SPDSLKYPV 
LQKLSKLEMLVLHETVQRKEQLTYSLYK 
PIFGFLVGYSFPLFCSYKTS 


134 


14035 


A 


139 


3 


384 


LDFI KMKNCCSSKELCGKYBKP *T/MEK 
MFAKHISNK* FIFRI* K*LSKLTKKK* A 
KDLNRYSLKEDIQMANKCM/KKCSI*LV 
IREKQTKNTMRYY*NGLS*RS /GQTK* * 
QGWGGTGPLIHCWWEYPMI 


135 


14036 


A 


140 


63 


388 


FMYLFIY*LER/CLTLAQTKVQWCDHSS 
LQPQTPGLKH\PPASASGVATTTIMPG* 
FLFFF* KRGPCWVAHARTT / WLNLGSF 


136 


14037 


A 


141 


370 


3 


GPGHFPALFFKGPGLGPWVPPI IPALLE 
G*GGGFPGP/RGSGPPGQPCFFI*KFQNF 
PGFV\GAPWPGFPGNLGRGTALNPEAE 
■ GSINLKGPPGLQPGGKTKLFFQKKKKKY 
QHFQKTQLILCF 


137 


14038 


A 


142 


363 


2 


REKATEENFATS* DWLMMFKEGSHFHS I 
KVLGETPSADVETWSCPEDLAKIMDED 
GYTKQIFNVGET\ PSRTLITRKEKSVPG 
FKASKLQRIG*LIiLGVNAAGDFKLQSVL 
MDYSENHAE 


138 


14039 


A 


143 


99 


375 


KNKNQNISPPLVRYTLLSPLLLPFIINS 
FFFFFFSF/ CFF*KKSRFEPKVAFQGGN 
LS*LNPRPKGSKEFSCLSLQSTWXTGGP 
PPHWADFDF 


139 


14040 


A 


144 


39 


399 


LQIIYNYIWIiYIWLYIFWLVSKIYKELL 
Q INNKETNKNGQKT* I D VSSNNMYMNG* 
* LHEKIHDIVIR*MRIKTTVTYHVMSTR 
MVIIKETENKYW*GYGE\KIIiIYCWWEN 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PhenyIalanrne, 
G=Glycine, H=Histidine, Msoteucine, 
K=Lysine, L=Leucioe, M=Methionine, 
N=Asparagine, P=Pro1ine, 
Q=Glutamme, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FEK/SVWPFL 


140 


14041 


A 


145 


358 


3 


REHTAEDKYEASRSWFPRFKEK\SPLRN 
irVQGEAATTNGEVAVSY/PEDIAKNID 
EGGYTQQI F\NADETF*YWKMLFRTFIA 
REKS \RLPSRDRLTLI*LGANAAGDFKLK 
PMFIYHSDSC 


141 


14042 


A 


146 


3 


376 


AVMGMYKINWFVPTNTS ILQP ID*GVI 
LTFRSY*LINT\FCNTIGAIGSDSSDGS 
GQSKLKTFWKGFTI PKS \ I KNTCDLWEE 
VKJSTLMGVWKNVI PALIDLFERLKTLL 
EEVTAYMVNIGRRL 


142 


14043 


A 


147 


374 


161 


EFHRVSQDGLDLLTS * STCLSLPKCWDY 
SKWQRRI *TQ/DLCDSED*SS IKP1QLC 
QELMS * MAEMAHGQP 


143 


14044 


A 


148 


373 


2 


FFFFFFFCDLLLFPKQPRSHAS FSLPS V 
VPFPBMPSSSLTLFYI»FVYNLLR*YL/N 
SVTQAGVQWHDKGSLQPRP\SliLSSWDH 
WCTTPCPAN*lFFL*RVGLAMIiPRLLI^ 
SWAQAILPLWSPK 


144 


14045 


A 


149 


3 


282 


ilkriihpdqvffipamqgwfni*k*in 
tv\nhiirikgk\nrmiisldtvkafgk 
iqqpfmi\*klgikgnflnlikgtyenp 
qlaghsglcl 


145 


14046 


A 


150 


3 


360 


DI I *BSKFQGHHTTGVQKGLQYGI ILFI 
I*EVFFFAGFFWAFYHSSLAPMPELGGH 
\YPPTGIFPLKRLEVP/LLNTSVLFASG 
VSVTSAHHSLIEGNRKQIILALSITITL 
GIYFTLLQI 


146 


14047 


A 


151 


349 


3 


KKSSGSHLETKSKRLPGPSQGPWCKEIR 
KGSLAWQEKSWPFYKPFLLBVGGGRVIR 
GSRMTRHGHIiIRSSKTIiIKHHVPK*TVQ 
FNFCLFI FLEMESRS \ VAQAGVQWCNIG 
SLQP 


147 


14048 


A 


152 


500 


0 


MPSRLECSGA/ ISDAQCNLQNSGAPSEP 
S SALSNP CG * D YRDTLPLPGYFLKFFVK 
TRSRYVTPAGL*LIASS\IPPTSA/FSK 
HWDYRHE 


148 


14049 


A 


153 


373 


2 


QAGFTLLTS S DP PAS AS \QSAG I TGMNH 
RTWPGNF* PQKS CDS FVTKLMCTCHKNH 
I YAQSLVT * LGHS YIFF* DSLQPS PPRF 
KQFSCHSLPSSWDHRHTPPCPARTQLHF 
GYEYFSRHFLRRR 


149 


14050 


A 


154 


332 


3 


THKFQRLTSKSKKLKRKLIFHQGAQKSN 
SLNPDI KKNLKRSQRNME*EKI FTDHLS 
SRIYRELLKLKAK/INLIKMGKGYFYEG 
* DT*MADKHIKRCSISPIIREMQIKTTIR 


150 


14051 


A 


155 


337 


21 


LGLLPFFQRCPAKRGGI *RGSLAAVALL 
NCGELHPV*ASWWLCL\PVRGKLPTEAS 
VMGDAPPPTKLECPRSTSDCCDGSKNFE 
SVDLSLLGSVGVGSVDLDPVAP 


151 


14052 


A 


156 


21 


491 


HPGSRGCSEPRSGHCTPAWGTKVKTPAS 
TKTKQNKKKK*RTVCLTPWHTLL/RVWC 
DGGDYSSSLL 


152 


14053 


A 


157 


3 


394 


REDLLSPGFQGCCEL*LCHCIPAWVT/S 
ETLSQKKKK 


153 


14054 


A 


158 


36 


354 


LSAAFTSQUjGRLRQENGVN \CSEPRLH~ 
HCTPAWATE * DSVS KXK3CGKPFW I PNPL 
SSLSPAFISGQGPWESHLKNPGRALQGG 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E==Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q^GIutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GFQPL I CPRPGRTHMSLKS P WRP 


154 


14055 


A 


159 


78 


340 


WFHKAQRGILhABLYPSGSAGHSIEPK 
TITLPPLGNKVLLCCPGWSAVA*S*STV 
ASN\FGPK*LLCLSPPSSSDSRYARPHM 
ANFK 


155 


14056 


A 


160. 


2 


362 


HLSPLSIPQNRHCHHGPFSVSCWAHLPD 
GWAGQRGSSLP/M*GRPGRGAPSPPRR 
GGWPGGGLTPHLPSGRGGWPGRGTP \ PS 
Q*GRPGRGAPS PPGRGGWPGGGI,TPPPP 
SQTGRLAGRR 


156 


14057 


A 


161 


293 


2 


PHLAIRPPTTVDKSALYWKKMPSRTYKV 
RQKSMPGFKSS KDRLTLFLGAK\AAANF 
KLKPMLID\HSKNTKALKNYAKSILPVL 
CKWR\HKA*MSPHLF 


157 


14058 


A 


162 


2 


136 


LIVPILXAMAFLRLTERKILGYIQLRTG 
PNVWTPPTGATYRLLS/P*QP+ *LFTKE 
PMjEPVTCTITLYITDPTLALTIALVL* 
TPLPI PNPVGNLNLGLLFI LATCSLAVY 
SIL*SNRNGIPKTYRTKNSRLYTTTHRP 
QRLDAAYWCYVQTP I A 


158 


14059 


A 


163 


1 


464 


RQGL/DSVTQAGVQGRNLSS \LQPL/PP 
GLK* SSRFSVSNSWDYRCAPPHLANFFN 
LFFVQRG\ FTMLPKL* TRS \GPGD 


159 


14060 


A 


164 


1 


353 


FNYSSSLGNKSETPPQKKKKKKKGRGPG 
ENPGTPTFGGAKGGGNPGIKILKTTGPK 
GGNPCFKKKPK* PHRGGRP /HGNPVYRG 
GPGKKNFWPPGGGGSKG*KSPIKPPPGG 
KKGDFF 


160 


14061 


A 


165 


45 


389 


FFVCTFCRERSHjCCRSWSGTPGLK* PS 
CLSLPKFSDYRHKWPCSARNFLPACLPA 
CLPPSLPSFFPPS/SPSSSPRQGVTVIQ 
VRVQWRDYGSLQP*PPKTKRSFTLAFQD 
LGPQ 


161 


14062 


A 


166 


330 


2 


wdyrcapphpvlylkm/ stfni ffl* rg 
vtolsrlvssdppisasqvari idvshw 
aql* rs v/ cyvfetgsgsltqagvqryn 
hgsvqpqhskls +sshpsimnyr ypppc 


162 


14063 


A 


167 


2 


396 


YSNVLLG I YPKEL ktlvhwmfi cv*ntc 
I WMFITALFLA\ES * CSS VGE * IGKTWY 
IPTMKYYSVMKRNAISSHEKTWTRLKCI 
LPSERSRCDRLQTV 


163 


14064 


A 


168 


486 

• 


2 


lifgkiykinnssktdnkrgktqiqilg 
mktgdimty\qqtskeyyahefnnleem 
dqlfkkhkrp/rihqyekhhltgpmtin 
efefinvktpknqspgwddfigkfyqmf 
eeflsenktgelil*s*yysltnsfyea 
sity*pkpdrq*kkqscgpisfm 


164 


14065 


A 


169 


200 


3 


grvdlpptqesrppgpppsdpgvqaprp 
sivrncgiltrgspgpd\psplss*aql 
cgsppppsfs 


165 


14066 


A 


170 


2 


327 


PGGlGCGELR*}iHCTPTCATERDS V/ WN 
KIIHYIIMIASPNELILPLLNICLEIL* 
AGPLTAICISMFMATL\FTIAQMGKQPK 
CPSTNEWIRKMWYIHIMEYYLHFKMK 


166 


14067 


A 


171 


2 


343 


PGWSQTPDFR*STCLGLPKCWDYRR/AA 
TVPGLFLFLTALC 


"167 


14068 


A 


172 


367 


3 


FIRDVQFISALRYLLTPERMA/MIKKSK 

NNRYRCGCGK/RG/ tllhcw* kcklvqp 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
OGIycine, H-Histidine, J=Isoleucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagme, P^Proline, 
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L**SVWRPLKBLPLNSVlPLLGiyp K /D 
KEVIYEKDSCILVFIAVQFAIA/RNIHP 
TYCPINE*IKKMWYIY 


168 


14069 


A 


173 


102 


345 


YQLQNIPRWVYSKLQNLSFAFVLL*RDR 
VFhCHLGVISShL/QLKRSFKLYLtASSm 
YRQVTPRPAN* ILFRD/RGLALIiPR 


169 


14070 


A 


174 


22 


401 


STRLGLPECWDYRCESSCPATYLPF*HS 
IPFLSQLKVQVHHRRYFHLFPEIjSNLPL 
LSKNVS *TY*KS FhS /WPSLRVLFCFVL 
FCFVTGFHSVQARVQWGNHGSLQPQPPS 
tiK*SIHLSLPSS*DCR 


170 


14071 


A 


175 


2 


388 


SDLQLRAGRTTALFKAVRQGELTUQRFL 
LS F \V* LCPAPRGGAYRGRQASLS CGGL 
HPGRAYRLSCLPKQAWAMVGAPTPASLS 
SCSLISDCCASNHRDSVGVGPFEPCAGC 
NLTEHRFIiSPSGAWSC 


171 


14072 


A 


176 


334 


1 


KLLELIN* FNKVAVYKINTQNQIiHCYTL 
IRTTQKGSIiKTILLTIAPKRIKYLGI /Q 
/LTKEVKDLHSES YTIiLKE I K*DINK* K 
DIPCS*MGMVFFGVCLFVYFCFEMAFRS 


172 


14073 


A 


177 


339 


2 


KNS PYNFPKGKNPGNL* SLKSRIF /CWG 
STFAHIGEIiFFFKMGKKFPAI/RPFFFF 
EDRVLLiCRPGCSGWRSWLIAAST /CLG 
WDYRCTPPCLANFL *RQGLTMLPRLLSN 
SGLQ 


173 


14074 


A 


178 


3 


303 


DLRRAACLILPKCWDYRHVPPRPADNSG 
FLHLI 1 1 * VCLPLLCHAQLVFI FFIETG 
FCHVAWAGLKFLGSS SPLTLA/ FSKCWD 
YR 


174 


14075 


A 


179 


342 


1 


INRLNI FTMAI FSTLIYRFNTI P I KI PA 
GFVEMDKLILKLiVWKFKAHG IGKTTLKK 
NRVPNSKAYYNATVS KT\YWNEDRNINQ 
WN*IDNLKINPCVCGQLIFNKDAKTNEE 
R 


175 


14076 


A 


180 


2 


323 


STS YPI TGSHAFL * PQNVVDAETOS *HI 
NNVNLRLKI IKLLEENTEKNCHDLGLAT 
DYY/ SVTPKA*ATTTKIDKLELIiCIKNF 
CTSKD IT* KVKRQLIGENSCKS FM 


176 


14077 


A 


181 


326 


3 


RKRKKKRREREPKKDEERKRLYGKDIIK 
KRRKDTM/DWEKILQKGVRQRMCI *KI * 
RGVTQELKANP IRKGGNNLNKVHQRI *M 
ANKHMKRCPKS* VIREI* IPTIMRYH 


177 


14078 


A 


182 


5 


326 


TKTIEKAAN*IRRSRRKKIKFKPEIKNR 
KTTTKINESKSCFYKKTNKTDKLIiVKLI 
RKKKTKKI ITKDEKNHT I \ * DCTE VKGM 
TECYEQVYANKFDNS \EKMDNFLER 


178 


14079 


A 


183 


3 


378 


TVLCSFIVWGLDFFG* IHTI YS* VFCYK 
WNCVHNFLFRFFSAYI * YLLI FVY*S * I 
MHLCLICY*LLL/ CSCIFFWIFYMLYFS 
FWL/CFLYFYFYILCPFIFF/CI FICLL 
FFVYLFLLFIFFFIFLF 


179 


14080 


A 


184 


365 


3 


MQLKCIMKS ISVQAGVQ WHNFGSMQGPP 
PW\LHHFPASASLVTGTTWARHQGRVIF 
/IFFFFLMRRSL/DSVDQAGAQWHDPGS 

lqpllpgfk* fa/cpcwddrrapqcpan 
fvfla\rhgftilar 


180 


14081 


A 


185 


51 


293 


PYVNQEAS VLTNS FYQAS / I * KKSNYRP 
ISIJ4NIDAKILNKTQQIKLNKVNDSRQT 
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GEDQI PKYCTVSDGRMMRDYFPLFSS 


181 


14082 


A 


186 


326 


3 


TKICQWILNLGKKREFGFKKKKKIKCPN 
KRHRLANGIKK*DPTTGCIiQDICFRSKD 
AQRLQ I KGWKKI FQKTGI * R /GVAVLTS 
DNIEFKIKTVTRDRVANYI IKGSIHQ 


182 


14083 


A 


187 


17 


330 


KDNEM* IKRTMRCYYTSFTMAKINLKIN 
NIKC*QG*GTTGT/L/LYC*WKHKIVQP 
FSKTV* QFLMRSNKYLRYNPAI LLLGI Y 
SN* VKI / C* KT YMRMFI VFL ITKDWKQP 
NYSS 


183 


14084 


A 


18S 


124 


357 


SLWGGDIiEGGGEKKERGBNPGGPKKPGK 
KKKKGGNLGARG/VFKKRGRPQKEKPGK 
KGEKGGP *GAGI *GGGGNGVKKK 


184 


14085 * 


A 


189 


333 


124 


RLRQENRLNP / GSRGCREPRS RHCTPGW 
ATE *DS VSKTNKKMYKS 1 1 ARDCHAENS 
I VSFLS KRGCVFLF 


185 


14086 


A 


190 


1 


327 


KSWLIGDNIPSSQSL.IQKKGVNPSFKSM 
KADRGKEAAEEKSEASRSWFMREKERSH 
LHN/ I K* VQGKAAS YPEDLAS I IDEGGY 
TKQ* IFNEDY/MWKKM*FRSFLTREKL 


186 


14087 


A 


191 


53 


400 


VNILGPFHIiFFTFVQNALSPNIYMTHML 
TSLGPLHMSSLQTGFPKPS YIjKFQPS PH 
S VPVGF/ IPGMQR* FSIHKSVNVIYHVI 
RMNDISHMIIS KDT* KAFDKIQHIGSIiF 
PALSL 


187 


14088 


A 


192 


65 


321 


RARTEIYLIHTLLGVYAGETKTGPYTDC 
M*MFTTAPTTLVKTWI *PKSLS IGERIN 
KL/WILI CTYMBYYSAIKSVLTHLTTWV 
NPH 


188 


14089 


A 


193 


127 


241 


I IKNDSRRICNLTEKLPLRKINS/WPGA 
VAHACWPS TLGG *GGESRITYNESRNKG 
EQTQNNK 


m 


14090 


A 


194 


1 


337 


DLPASASQSAGIIG*ESIRN*NKLVKKT 
LKCSQKDPKEDLNK/WKDIVCSWLGRLS 
LTKVSILPKLIYKFAKIPTKLYYD*KKN 
RPGAVAQAYNSSTLGGRGGQI 


190 


14091 


A 


195 


2 


333 


NFNSLFFFVEIEKLILKFILNCK\VARR 
ILQRKNKVGRLTLPNFNTDYKATVTKTA 
WH*HKDTHMDQRNRTESTKINTYKGAKT 

i*wuknslfnk*cwdk/wittckrkkvd 
p 


191 


14092 


A 


196 


299 


2 


PHPQRDWQPSPPSICRIFSPSSIPGIQ 
SWFNIQKSINVFYHIVILNTQHRSSSSS 
SSSSSSSSSSY*LMIKF/LNKLGIEGNF 
LILIKAIHGKPTVNH 


192 


14093 


A 


197 


2 


199 


LALSPKLECTGAITAYCSLELLGSNDPP 
TLASKVF/GITGMSH*AQPQWFLLLYY 
FIFFFSSSSVL 


193 


14094 


A 


198 


226 


2 


KKFFFPNPNLKNFPLGRVFF/CSPG*KF 
FAPLGFFLVSFLRQKVPHLGSPGKTFFF 
FFFPGDRVWTCHPCWSAVA 


194 


14095 


A 


199 


338 


93 


PQHNGS CL* SQLFERLRAEDPLNIiRVQG 
CNEL*LYLCTPSWVNPVSEKKSIQEKNA 
NLDFREY/RDKRREKWTCV*NS IERSSE 


195 


14096 


A 


200 


1 


361 


PRSHLSQLRVRLSQ/ IKKNQKTIGKGFR 
AIG/MGKKF*GKTLKVPFTRGKVNQWKF 
LKLKSFLPRKKKTEITL*TLGKHFRKWE 
fCFFATYPSTRE* IS KI CRDLNHFTGKKI 
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IIPIFQRAKGL 


196 


14097 


A 


201 


2 


358 


ENKTTMRYH* IPIRMAEN* VLAVGTL/L 
QPWWEYKRV* PLWKTNSIiAVTLLNINLP 
CNPTILLVGIYPRERKTYVYTVACTQMF 
LTVLFTVAPNWKQSQGPSTGE * KNK* WH 
IHIMGHYS 


197 


14098 


A 


202 


110 


357 


IWVFIWCHFLLPEKTSFFPGFPQCLDIS 
* KTLFVHSVLFFFKDRVLLCHPGWSAVA 
Q**FTAASD/FQESSCFSLPGSWDYKRM 


198 


14099 


A 


203 


2 


374 


ILKLHAFFMABTPGACKTPCDLRQVFI F 
IFIYIiF/MRLSLTLVAQAGVQWRNLGSIi 
QPPPPGFKRF*GRCLFYPKDGLVCPGIiA 
GS*TGQIGKGIRTLSPDLCLSDGFCGSK 
PKS/ASASQGHVG 


199 


14100 


A 


204 


375 


3 


XiSS ILIRGSFNLSTLITTQBHL* LLLPS 
W\PLAII*FISPLAETNRTPFDHTBGES 
ELVSGFNIEYAAGPFAILFIAEYTNI 1 1 
INTLTTTI FLGTTYDALSHEL YTTYFVT 
KTIPLTSPALMNS 


200 


14101 


A 


205 


393 


3 


SARLGLPKCWDYRR/ASTVPGL\SLMLL 
LKSYVS KMKKCB * L VKPLKAKL * DS CKE 
L*VGNNLIMPSTYDQENDRVDSSLTWLC 
VLLLI.LTjT.TirjjrjFWRQSFALVARAGMRW 
RDIjSSL*PIiPPGFRQFSCI 


201 


14102 


A 


206 


2 


377 


FRAVLQGRWSLQKFLLPFVQLCPAPKGG 
VYM*RQRS/SLSCGGLRPVRASWLLCLP 
TQASAMVNAPPPARLLPPRS ISDCYTSS 
EQGSVSMGPAEPGVGYDI1LVCCLLRPI1E 
KHSIWVRVSCFSRY 


202 


14103 


A 


207 


327 


1 


I LTGNFKQ I RML I YHS VNPRALKNYAKS 
TLP VNYKWNNKA* MTAHLFI AWPTGCFK 
PTVEP YCSKKKK\ I PFKVLLLIDMAPGH 
PRALMKMPEEIS I VFMPANPASIL 


203 


14104 


A 


208 


310 


2 


FCWWAPGMGFPLGE WGLILGPGS PFFLK 
EGFPGVGQPGPPKRWGLKG* PPGAHWKG 
F/IFFFADRVSLCHPCWSAVAQSQLTAN 
PASQVQAILLPQPPDYRIR 


204 


14105 


A 


209 


374 


3 


GVRDP/LEEWCPF*DLKLVARRT/SLF 
FKAVRQGHLSLQRFLLPFVWLCPAPRGG 
VCRGTQASLSCSGLHPVRASWPLCLPTQ 
ASAMVGAPSPASLPPCSSISDCCASNER 
GSVGMGHSETDVGY 


205 


14106 


A 


210 


3 


196 


LENLKF/LDKFLETYNLTRLNQEETEIL 
KRPMTSSKI * SVIKNLARAMAHAYNPST 
LGGKGRRIA 


206 


14107 


A 


211 


34 


339 


I IKSKRINYMSCELYFNINIDNFFFFET 
VFRSVI \ KLKCYGAFMVHCTLNL PGSTN 
LSTSASQVSGAINRGYPGQF / IGLEK*F 
LVKTGFCHVA*AGLEPLGSG 




14108 


A 


212 


200 


312 


HYGQFHVLFCSLF*MESHSVT/QAG/VQ 
WP/ SLALQTPPPGFKQFSCFCLVSSWDY 


208 


14109 


A 


213 


300 


124 


NHI FFYFQI HRVC VCVCVCVCVCLCVCV 
YRYIYCVSPW/RN* *SL*KECIiSLVIiLT 


209 


14110 


A 


214 


1 


357 


QVDHTSDRKANLNMF* KI KVIQTMFSDY 
NGVICSENNRRKTOKLTOIOKLNST/L* I 
SNETNEKPQE IRKYFEVNINENTTYQDL 
WDEVS IVAQAHNPSTLEAEAGRLLEPRS 
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K=Lysine, ^Leucine, MNVfethionine, 
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LRSAWTTE 


210 


14111 


A 


215 


360 


1 


LTIBTYCSE\KKTPPKILLF IDNLAGHP 
RALMEM * KEMNVFMPAKTTS I ML PL I \S 
S FNS YYLRTR I HRTFWVAQS WLQCPTT 
AISDIQIGPVAMGLLIiIPSAFVLCWLSI 
QQVSKLKH 


211 


14112 


A 


216 


3 


347 


SSRSRADGLFFYQCK*QTLYW*HLWYKT 
QKEKKKTQMKEKKQRINKAR\GDKLPEC 
EAVCGKPKNPANP VQRILVGHIiDAKGS F 
PWQAKMGSHHNLTTGATLINEQWLLTTG 
KNLL 


212 


14113 


A 


217 


75 


347 


PPIFKRTARGKVTPGWBYPAKG/CGGTF 
LGNRGLFPNPGFSGPFPKKQVWEGALCA 
PG*KVCTLKF*FI\SLAIPGNGSKNFFT 
LFEGPSPIL 


213 


14114 


A 


218 


3 


397 


GGQGYSGTGHHGGIWHKMEH*R* PLPPK 
KKKKKKKKKKKKKKKKKKKGGPPKKKPR 
GGQNLKGGGKEKPPPQKGVKKKTLSGRI 
IKKDGREKHTRGELWKKTFIW/EGEKIG 
EKPPKKI * DHEGKKKVLRGKG 


214 


14115 


A 


219 


3 


401 


DSWATLHGNIMK*SAAVI,*ALGLV\FGF 
TVSGPTGI VLSNS * LDIELHDTYYWAH 
FHYVLSIGAVFAIIGGLIH* IPLFSGYT 
IiDQTYAKIHFTIIFIGVNLTFFPQHFLD 
IiSGMPRRYSDYPDAYTT*NlL 


215 


14116 


A 


220 


162 


3 


KPAQRNL*SNP EKED INILKRNQS/ WPG 
TVAHACNPSTLGGQGKQIIRSGV*EQP 


216 


14117 


A 


221 


374 


2 


WCRDRERERERATERERERERERQTDRQ 
TDRRREEA/EGWAAWAIN*GRAPGTSLE 
AAIiECSCPRRPPQPAPPEQGPFPRTTAR 
GQPRPPKLLQPEAPSQTRPHGYPWPLRV 
LPQSGPEVRPRE 


217 


14118 


A 


222 


3 


264 


DHMRPKVRGCSEL*SCHCTPAWAT\SES 
LSQKKKKKKKKRKKIGEKI *GAFFKVAP 
PFFFFKKKMNLKSPVGIAGVAKNTRWIL 
KHRG 


218 


14119 


A 


223 


2 


410 


ATSPI2EELIT*HDHAI*INIIiLM*FLGIi 
HALFVALTTTLTNTNI*HAEETQT**TI 
LPAIILVLIRLPSLR\ILYISDEVNDPS 
ITIESMGHQWY*TYBY\TDYSGLILNCY 
ILPPLFLEPGDLRLLDGDDQVALPI 


219 


14120 


A 


224 


399 


3 


GVGKPGFPWAKKWALPPYSKGYPK*VQN 
LIIiKVKLLIjIiKKNPGENSLEATGTWGLP 
PKARAPKLKPNW/DSPKRKNPGAFKKKK 
KRQPTEWEKIFANHASDKRLTSRIYKEL 
QQ*KSKQPIFLIPIYHLNHFKG 


220 


14121 


A 


225 


360 


71 


NRTTW VP FKTPPLL YH INTKKK * HTGNK 
KHKTKRAKKRKKTKNAKK/EKEKNPHPQ 
RPQKRKK/ EEQKPKRGfCKKRGKKKKP FK 
BYS YL* K\ YIHTNLQ 


221 


14122 


A 


226 


3 


386 


PSTHVS INLAMGI PL*ADAVI IGFHSKI 
KDAIAHFLPQGTPTPLIPILAIIETISL 
L I QPIALS VRLTANI TAGHLLMHL IGS T 
TLTVSTINLSSTLI IFTILILI*TIL*IA 
VALIQAY/VCPLLVGAY 


222 


14123 


A 


227 


3 


372 


YSLDSPSLTRFFTFHFMLPFISAALAAL 
HLLFLHETGSNNPLGITSHSDKSTFHPY 
YTIRDALGLLLFLLSLMTLTLFS PDLLG 
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DPDNYTGSRNAPVKQPRPHI \KPE * YFL 
FA/YTILDPPYXL 


223 


14124 


A 


228 


2 


263 


PRVRSRWEDO J RPGA*DQ/3 J CAP*RDCF 
FKKKKKKKKKKKKKK3CKKKKKKKKKKKK 
KKKKXKKKKKKiOCKXKKXKGGP FKKKPR 
GGQNLKGGKKKKFFPKRGE 


224 


14125 


A 


229 


314 


1 


LIKTIFLYQVSLCCPG*SQTPGLKQSSC 
LSLPKCWDHRHKPPCPAQSFYFTPTERA 
LRVT Y*FI *RQS /LRSVAQAGVRWCGPG 
SLHPPPPRFKQLSCIiSLLSSW 


225 


14126 


A 


230 


86 


405 


VSYMS IQDHBNGHVSGYFELVGQFCLRV 
GIVGGSTFESSNSHSYIYFFSFYLFIFF 
EMGLSVTQAGVQWHKHGLLQPQPPG\liT 
*SSHLSFPRSWDHRPVSPWPA 


226 


14127 


A 


231 


386 


98 


FFPQRR*FPFGNFPPFFPPPKLG\PSQK 
SPIGDFPPPPFFNPGPGPPFFFPPPFGK 
GFSFPPPL*FGPPQGFF*RAPPFFFFFF 
FFFFFFFFIRLLV 


227 


14128 


A 


232 


2 


362 


TYVLFVIPYVETDLVQ*AG*SYSMDYSN 
LTRLQTLNLNVPFNIAALATLRLLYLHE 
TGSNNPIRITCHSDEVTFHPYYTIKDAL 
G*L\LFI»LSLMTLTLFSPDIiLiRDPDNYT 
LANPLNTPG 


228 


14129 


A 


233 


5 


365 


KNVTPPVNSSQHDLTDIYRNLYLMA1EY 
TFFSN/AHGTYTKIDHNLSHKRNLNKF* 
RLKF * RLKLNS KGVLWPKWNQTTNQ * QK 
DNRKLSKHLKTKNTLLNNP * I KEEVS WE 
ILKMYLREG 


229 


14130 


A 


234 


3 


367 


NRI VPNHDIPYFSLCVCVIiFFFDRVS \T 
LSL*GWSAGAGSLLTAAPTSCVQAILCL 
SLLSTWNYRRNPLRPAYL/SYFKSSKSF 
TL/LPKGIL 


230 


14131 


A 


235 


• 107 


2 


QEAYE+DKERSSRIYMSLK/S IQPKPPN 
NRVKKWTKEINR/HFSKEDTQTDKRHMK 
KC*SLICISLVISD*QRNaN*TPNEIRL 
IP 


231 


14132 


A 


236 


349 


0 


NNL* IFLIFLFTSVFUSLISYliFYYIFI 
RSLPICGLFTYFFSFSYIFLKYLMTLAI 
YYSF* \FFSUQVILEALIFI 


232 


14133 


A 


237 


3 


317 


TRMHI SDKKTYNTN I KRI FRIHXiMRQTT 
QIiKIGKRYEQTLYKS I FEWTINTYKEVK 
QLNS KRKPTNTVI KK* SKDLYKHFSEED 
IQITTRY\LKKC*ASLIIKEM 


233 


14134 


A 


238 


346 


1 


AAAQMPAYQELVEEAIAYGRKLGGSQED 
QTMNANDQRFVMSAADILS* IPDWSTE 
VHARLSFDKYAMSARA\RRLIELSKEYG 
hSKGRIVX \KLSSTWEGLQAAKEIiDEQR 
GIGC 


234 


14135 


A 


239 


1 


279 


IiLLPPLPSLLLPPSPSSCLPPSPSS/SP 
PSP/S/PPPSLIiHPPPPSLLLPHLPSLL 
DPPPTSHLLPSIP*LDIPNAPT\NSSNP 
SCPCRPRSTEYFK 


235 


14136 


A 


240 


98 


2 


RPRRPRLVLNS * \LKLSAHISLPKCWDY 
RREPP 


236 


14137 


A 


241 


342 , 


1 


DTASLLQPMDQGLAISTFWVYYL/RNIFH 
EAIAAIDSDSPDGTGHSKLKTF*KGFTI 
LDAI KNI CDS W/EEGYKISTLTGVWKKL 
I LTLMDDFEGFKTI VGGVTADVVETARE 
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SEQID 
NO: of 
nucleotide 
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SEQID 
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sequence 


M 

eth 

od 


SEQID 
NO: in 
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beginning 

nucleotide 

location 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspoo 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
G=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H-HJstidine, I-lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V*=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LEL 


237 


I4I38 


A 


242 


3 


339 


RKDSRKRERKNKEKRKQRKEGRKKPIRM 
TVHFSSiCTMVAK/ARRQ*NPVFRMIiEE* 
KN\FPPRILYSEIIFFGSESQIKKFSGD 
GNPKGS FPELFFLRIPMEVFNLKEAYXiR 
KS 


238 


14139 


A 


243 


216 


3 


NSTGTQTQSQKFMLSLVKKLIS FI YL* D 
GVLLFAQDGVKWRDLGSLQPPPP/GPTL 
FKQFSCLSLPSS*GSRI 


239 


14140 


A 


244 


408 


2 


VLTS*SNIAST\WYGLYQ/TQLR/KILA 
YSSI /THIG/WNNPVLQYNHNITILNLT 
IYI ILTLNA/ FLVI4NFNSSIGTILLSRT 
*NEIT*LPSLIPSTLLSLGGLPPLTGFI» 
PKWAI IEEFTKINSLIIPTI IATITLLN 
LY 


240 


14141 


A 


245 


3 


243 


I IMLKAGQMTVFPNEDVWMAYKHTDRYS 
TSLVMNEM\Q I KTTMGYHYAP I TVAKLL 
N/SNTRYWP *RG/CSRVIHCWRKCR 


241 


14142 


A 


246 


2 


337 


FLIMDLQKYINPKIKEPQGETNHKTVSF 
MNINAKILNKILVIQI**FPKKNNVSKF 
G/FIP*/SQGCFNINQCYSLDERRKYMI 
ISKNIEKAFNKIQHSSFMIKTKNRKKLP 
EL 


242 


14143 


A 


247 


347 


1 


HYTP IRMTKIQNTDNIKCW* GCGNP/GT 
LIiHCW*ECKMVQAL/WL*TVWQLFTKLN 
LLLP YNP ITFFGI YPNEGRTHVHTKTCT 
LIFI IAALFIIARTQKQ/PRCSSVDEWI 
NKLWYIQ 


243 


14144 


A 


248 


319 


2 


KKIILFI ICTERVFLCCPGWSQTPRLKQ 
SSSLGLPKC*DYRRESS\PGLMVIL* CL 
KFHLRVAMLFYVFEV* \ IVIFFFLRQSL 
/ DSVTRAGVQWYNLSSLQLP P PGFK 


244 


14145 


A 


249 


329 


3 


KFPFFHTGEAKNYAVFVINKRIKKKQHI 
HI IYNYGGHQKAECKEIEAHVHCWWEYK 
MMQ/SL*KTAW* FLKK*TMELPYDPVIL 
LLRIYSKEIiKAGTRTDVCTSMCSFLFL 


245 


14146 


A 


250 ; 


316 


3 


KTHKTIMGTNHI PNEKIDST * LKNOJIR 
AITI KLL\KNNRVSLHDLRIGHGLLDMT 
PRAQAAKKKIKR* XDEFIETKNFCASKD 
NI KRV/NRQHKE * DTMFANHI S 


246 


14147 


A 


251 


152 


364 


QLTLSNHINNQI KYK* TRHS \NHHHHHH 
HHHHHHHHQ*KRLPE*SYKSKSQEHYTY 
GMDITGPKNLRTIPI* 


247 


14148 


A 


252 


230 


1 


PCWICEFIVSSNFCWFVLETGSCYVAHA 
RVQWCDNRSL* PQTPGLKRYSCLSFRSG 
WD*RH/ VP IANF * TLPRGGVL 


248 


14149 


A 


253 


3 


345 


DVGIiAGLELLTSGDPPAIiASQIAGITGV 
SHHSWPLLFFFFFFF/ CFFEGGFFFFLP 
V* SQGGDLGSREPWPYGFKGISRLRFPN 
EGKIGPQPLAPDMFCFFDKTWLSTWPG 
WFH 


249 


14150 


A 


254 


194 


1 


GRVDTKWANTHERFSKCS/TTYD*WIFK 
SWYLCTMEYYSTINKKEILSFVTTWMHL 
EDIMLSEVS 


250 


14151 


A 


255 


327 


3 


VKTABFVNKWQKNSTKLWNSQAQIDKKK 
IVNQINDLRQTEIWMGDRIMNLESRIQM 
QCDWNTSDFCVTPQ\YNETEH*WKKVKR 
HLEGREENLTL* IVKLKEQDFEASQ 
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SEQ© 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Ala nine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheay (alanine, 
G=Glycine, H-Histidine, I=IsoIeucine, 
KNLysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 
Q=Glutarnine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


251 


14152 


A 


256 


314 


3 


KNFPSLINFLRQSFAPNGQGGVQRVDLG 
SPQPLDSRVSDRIRLCFKTQKKKRFSP* 

qtilklskv/e/mtervlktarekhlvt 
yk/gkpirltmdfsaetlqarre 


252 


14153 


A 


257 


2 


308 


RWim\EEEEEEVDCGGKV*FFFFCFFF 
FFFCFYVFFFYLFFFCFFFFIFFYFLFF 
LFLFIFFFFFFVFFIFFFFFDCFFFFFF 
FFFFFFFFFFFFFFFFVFF 


253 


14154 


A 


258 


52 


186 


CFGKLGWDDCIiRSGVQGCSET*ACHCTP 
AWAT\NSTLSI*KKRRRG 


254 


14155 


A 


259 


1 


269 


LKRSSHLSLPRSWNYRCMPPCPANF*TF 
FVETGSHHIAQAHLKL\PPASA/FPKCW 
DYRHESLCPALDSPFTNINSATIITILQ 
GKCWHYC 


255 


14156 


A 


260 


387 


3 


PLLQVAKIOTKKILDIiNVKPKTIKCLQE 
NTGENCWDFGSGKDFLDMTPKMQSTK*Q 
ISKLIKIYNFSSKTQ/SFCTTDHENFLE 
DIVKRIKIQSHKLEKRFVNHIPDKRLIS 
RIYQELFRTQHEKPHTIK 


256 


14157 


A 


261 


62 


72 


LR 1 1 KFS EES MLKAEMG * KLGLFHQAVN 
KCNSLTVNEVVNAKKRFLKKMKSATPVK 
TmiRK\DMEKVhEV^IDQISmiSLSQ 
SLIQRKVLILFSSMKPARGEEAV* * KNY 


257 


14158 


A 


262 


2 


193 


GGRGKZSELRSCHCIPAWVTRAKLSLKKK 
KKKNSKF/RELGNKGQFMGP * IiRKGFTC 
YKKRSPLIF 


258 


14159 


A 


263 


3 


330 


QLKKKYYEELYANKLGNPDEMDKFLEIC 
ILQKLTQHKN* KFFKDSRRKS KMNRP IT 
NRLIQ * PKELPKkKTLRPDGFTGELYHK 
HFFFKXTKPLHKFF/AKIKBEEPFPNS 


259 


14160 


A 


264 


326 


43 


TQBTEAGESL\DPGGRGCN*LGSCPCTP 
AWATGMKLPPKKKKKKITSLPELRFLSY 
TCSSSQNFLYLNDDTIIHPYAQAKNLGD 
KDSSPTDPIQ 


260 


14161 


A 


265 


311 


1 


FPPFP/SFPLERKEKFKGRKKRGGTKRG 
PFPPFFPFSSLPLGFP/SPQRA*FLLFR 
FREFLKFFFPFYLP PKPGIjGKFFFFFFF 
FWDGVSLCRPGWSAVARSRLTV 


261 


14162 


A 


266 


5 


323 


DSTKAQKNTWS IEPGEVGTLIHC/WVQ 
PL*KTVWWFLK/DVK/LELRYDPAIAHL 
GI/YPKKNKT*TCTQVNITALFIMTKM* 
K*PKD/PINNEWVT/KLWYMHTMEY*ST 
IN 


262 


14163 


A 


267 


3 


293 


GSLQPLAPRFK*FSCLSLL/SAGEWHEP 
WRRSLQRSCHCTPSSLGYRAGLHLKKNN 
NNNNKSEVSRICTIGIFIiLFIY/CIYBT 
ESHSVT*AGVHWLDF 


263 


14164 


A 


268 


3 


332 


HLS I INLNVNQLNS PLKAYTLPVWI * KK 
KWPNY \ CCLQE IHFAS KNTYKLKVKELK 
KKFQ ANEKEKHADKTGFKS KTVK\ KKNG 
HYIMIKRSVQKENIYIYMFLIADPDIC 


264 


14165 


A 


269 


3 


326 


KVLERHDVLKLTQ / DKICNLNM/ PYTKE 
MESIVWNLPKKKTPGLHSFTGEFYQTFK 
NEMILTSYNFSQKFEAEEILNSFYEASI 
LL\IPKLDKDIIRKEN*R/PNAKILNKI 
L 


265 


14166 


A 


270 


1 


318 


PYPAKLSSLSKGE IGTF\LDKQMLKEFV 
TTRPAIiQEILKGALNVLI * /LERKDHYQ 
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M 
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beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G/ufcamic Acid, (^Phenylalanine, 
G=Glycine, H=Histidine, I^Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y^yrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QMQKH 


266 


14167 


A 


271 


1 


318 


UCLLTSGNLPASASQTAGITGXNHRAQP 
HLSFSL*LWNET/CVPG 


267 


14168 


A 


272 


108 


322 


DSHVI FIiLCGNLTVFVCL FLRWGL /DS I 
THPGVQWHHHNSL* P*TPGIiK* S YHIiSL 
SSIWNYRRTPLRLAFFL 


268 


14169 


A 


273 


341 


3 


GFFFPFPFFRQGFPLLAQVK/VPWGNFR 
SLQPLPPGVKQFS CLNLPKN/WGFSLKG 
FFKKGCHFPF* Y* IFFNGVKKPDIKFSI 
FTNPFFFFFETESCSWQAGVQ*HDLGS 
TRP 


269 


14170 


A 


274 


41 


328 


FWLLTPGQETERERESETERESEREREA 
GEAERGRETDGIDSLIYSKLYSKTIHLK 
ILEYI\ATYFKRLFYNWEL*T*AK/TRI 
I EKPSVRHQCQQRS 


270 


14171 


A 


275 


3 


322 


EAQELL\DPGSGGCSELRSHHCTSAWAT 
KAKLRFKKKKKKKCPGRYLGNIVLQ* FN 
IIAKCDKYHIVKLIF/CFSSAYTASIQK 
LIQVSRTVTHRKNKKTRQYVYILD 


271 


14172 


A 


276 


175 


313 


STLISYFRDRVLLCYPGWSAVGQSQFTA 
A*NFW\VK*SSSLSS\PSSW 


272 


14173 


A 


277 


254 


3 


RPRRQFG I EGSFLNQI KNI YKKS TANVI 
LYVDRIiNAFSLISGTRQGCPIiSRLPFNI 
VL* LPVNI IRQ/EKVT *GMQ I VKEELNL 
SL 


273 


14174 


A 


278 


186 


1 


PETMQARRQ*SEIFKVLKEKN/LQHRIL 
YPEKLS FQSEREI KTF\QRLKKFITSRP 
ALQRMLKE 


274 


14175 


A 


279 


288 


3 


GRLRGKKGFNPEGENSKEFFKPLPSGLG 
AKTQPVFKKKKKKKKTRNPIKK/WAKDP 
NRYLTKEAI *MASRHMKRCSTSCVIRBL 
* IKTTMRYDYVLI 


275 


14176 


A 


280 


3 


329 


LKESSRLSLLCS WDHRFALP PMLSGLVW 
NSCPQ/CDPSASASIiSIGITGMSHHTWL 
*WLF/C*ETOSHPVTQAGMQWYDPSPLH 
PTKKGVYLT/RRFWRLGSPSAW 


276 


14177 


A 


281 


233 


3 


IITKDKEEHFIMIKR3IHQQGITITNIC 
VP YNRGS K* MK* KLTKLMGENR * / HS 1 1 
IAGDFNTTFS IMDRAIRKSAKNG 


277 


14178 


A 


282 


32 


309 


LPD ITPRDHLS PGTMD FIQETGCSKCWR 
GCGTPVCC*WECKL/VIVQSLSRTVWRF 
LKKLKIGO VRWLMP VI SAL WD VEVGGS P 
EARSSRPAWP 


278 


14179 


A 


283 


1 


302 


CKRNNKAWMTVHLFTAWLSEYFTPTVET 
YCS * KTI PFTIL/ LLDDNAPGQPGVLVB 
MHKDMNVAFRPANTASIIjQPMNQGGIST 
FNSYYLRNTFPKAIVAXE 


279 


14180 


A 


284 


155 


1 


PRRFFFFFF*DRVLLCHPG*STAT*SCL 
TTKSASQV\RQFSCLEPPSSWDY 


280 


14181 


A 


285 


131 


15 


DRSNPGRFLWTSNSSLYERPREIRPSSQ 
APPPVNDPI*T 


281 


14182 


A 


286 


1 


359 


FFFFFFSWFWCFSFFFFKKKGGAPGGG 
GPPPPKKTPLFSPQKKNFFLQKKPPGKG 
FKGPGLPPPNFGPG/RNGGPLKPGDPHP 
PDFTPPRWLFKPPF*KRGEPFFPPPGFP 
KNLFLKKVP 


282 


14183 


A 


287 


52 


419 


LEERAGTS W PKKENKKQHFCMKKKKKKK 
KKKKKKKKKKKKKFFKKGAQ IFSF* GGV 
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Predict- 
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nucle- 
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peptide 

sequence 


Amino acid sequence (A-AIanine 
C^Cysteine, D=*Aspartic Acid, 
&=Glutatnic Acid, (^Phenylalanine, 
G=Glycine, ENHistidine, Hsoleucine, 
K=L»ysine, L=Leudne, M=Methionine, 
N~Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
^-Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, V=possibte 
nucleotide insertion 














NL/H/ SGPGPVIiKTGGGGRHSPPPPPSP 
PSSLFFPSSSLPSLPFLSLPLPSFLPSS 
PPLLSS/CPPPLFSS 


283 


14184 


A 


288 


236 


325 


LNIRTPGSWIiXAVAHACNPSTLGGRGRR 
IT 


284 


14185 


A 


289 


218 


409 


KVKGNKADLIRSANGS*N/VKGVLCHQY 
LEKSFCLSTEFHS CLPQLVHWPDLGLLQ 
PMPPGIARF 


285 


14186 


A 


290 


3 


426 


HEETGSPRLECSGMITAHCSLDLPKLK* 
SSHLSLGS/RHVPPCIiAHFLy 


286 


14187 


A 


291 


382 


1 


anfyvtlvq\^£ , tmlrswrpa* pcdpp 
twasqsagitgvshhawpkmstitlgvy 
sfgbsevfsifkfyfletgssflpqaga 

* *CNH3SLYP*TPGLKQSFHLSIiLSTSV 
YRYLPPCPLCPVLV 


287 


14188 


A 


292 . 


3 


309 


HEVPY FTLNQQLEM I KLSKGGLVKAKMG 
\ISQIVNAKEKFLKEIKSVIPVNTRMIK 
QNSLIA\KTQEVSLVWIEDQTSHNIPLS 
*YLIQSKFI/TLFNSPKADM 


288 

• 


14189 


A 


293 


2 


415 


ARDQ YKKSTKK* AKDL YRHFKNEAI /HM 
ASNYK/KRCSIS*VIRKIEIKSSMTSCY 
THLQN/ALKLKSDNFKC*QKCTAMGIIL 
SWWECNLVQSLWTW\QNIiIiMLNTHKPY* 
SAI/PTSGLYPTEMYRNIBQASLRMFWA 
WRGG 


289 


14190 


A 


294 


335 


108 


IPTPK*PLSSPPIHTPYTXIPKP/RPPP 
P*HPPHIHPPPPPIiTP\PLPTHTLPTPT 
PTOLPHIPPLYSIPPSSPKIS 


290 


14191 


A 


295 


266 


2 


GFPKKVFS FPIGPPKLGFPNSVFS/LFT 
FFNPGLCLKFFQFFPFSAFLPLGFSFPF 
FFFFFFFF * DRAVLCHPGWSAVVRSWLT 
ATLV 


291 


14192 


A 


296 


347 


1 


DSIiQP * TRGLN* F FHLS FLTRWE YRCAP 
QCPANFC/VFL*RWGFAMLPKLVSSDLP 
AIASQSAGITGVSHCAWLVFIiLFSLYFI 
HFSKASETPWAQNQSQTPSEYPATYNFK 
IYSC 


292 


14193 


A 


297 


284 


2 


CRQGFVLCRLVSNSCDQVLCVRLCV*VL 
GLQACAT * LGAEGVS I EFS CFHKGRGCL 
CRHRSS/HSMFSAWC*PHTHTHTHTHTH 
THTPSPMQLV 


293 


14194 


A 


298 


1 


96 


GTRIjCLGAITTLFAAVCALTQKDLTKIV 
DFST* SQLGLILVTIGINQPHLS FLHIC 
THALFKAILSMCYITIIHILNNDQDSQT 
IRRIiLNTMHINSTS \LTIDSLS * L /GKP 
FLN/GYY*QKDLTKIVDFST 


294 


14195 


A 


299 


3 


134 


HEGRD* PGNHGDTLSLLKIQKLAGSGGV 
CL\KLRK2^CENHLNP 


295 


14196 


A 


300 


2 


333 


GVGREGGSREGGEEESGWNGGIRERRSG 
KREEEGKEYRDEVDRKERGEEGGRERER 
GRGWEEEQ KGAS * C/EEGRVK 


296 


14197 


A 


301 


354 


1 


TENELLSRIYKNFYQIN/RYKQKNSIKK 

*rk*oaj/ttetiqtvwkhlkkcsisli 
irelqtktr* ilyt*lvekklktngiyc 

WECCRELGSC 


297 


14198 


A 


302 


1 


282 


GTRGMVAGAYNLGLPS SWDYR/ * CLPPC 
IiANFFVSLVERGFTRLTTLVLLS * LYDS 
PASRSETAGI PLIHTSCI VYWSLRNNAD 
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Predict- 
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nucle- 
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location 
correspon 
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last amino 
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peptide 

sequence 


Amino acid sequence (A° Ala nine 
C-Cystemey D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H^Histidine, l=Isoteucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=ProIine, 
Q=Glutamtne, R=*Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, ^possible 
nucleotide deletion, \=possibte 
nucleotide insertion 














IAILCKAQAV 


298 


14199 


A 


303 


361 


3 


LPKWWDYRHEPLCIiATFFFF*AFPPVPL 
CMYVPPN/ RYVAFFFSLSSF* ICCTFTIi 
YIFPQLMDG* AVSS FLFLQP V6VKFKKK 
FFLS FFK \ KRR5LGMLPRLILN'S HAQVI 
HPPWPRPRA 


299 


14200 


A 


304 


2 


352 


ARMVSVS * PCDP PA* ASQSAE ITGMSHH 
AQPI I IS * TIAYVS FSLSSLGT* ISSVR 
S FTKSMSOLLYFYYFVSIj /PS FW 

**J * *wl iw till XV 1 1 J. w LJXJ/ C w S ffv 


300 


14201 


A 


305 


334 


1 


KNSFFFFFKRSLGLLAQI*/VQWGDFKT 
LQPLPPGVKQ I SRLNLLKKWDY* RGPSG 
LGKFW/ 1 FL»* KQGFPQLFQWLNFFFF / 
C FFETES RS VAOAGVOWHDLCS IjOAPPP 

ssc 


301 


14202 


A 


306 


270 


2 


DWCAGGGGDGVPRRQ VI FV I LVETGFWR 
VGQAGFQLIiASSYLPALA/FPKCLDFRH 
* PPHS ALKALFFF *DGVSLLT.Pl?TtFn>Jfi 
ASSPRA 


302 


14203 


A 


307 


306 


2 


HIFQOTCTCTCTCCWWCVVWCVCVCV 
KL VI SOT * IiSPLCSG P * A /CTVT? <3 l/POT* 

VCTCCWCV\CVCTCVCVCKVSDLTNLS 
TLCSGP*A*HFFSVCVCSC 


303 


14204 


A 


308 


117 


332 


SPPVLLRCSLSLSVOI*NGKTIRPl l »K'*T.TC " 
MEL* FHSAIPLLGI YPKGXKFLYQKDT / 
CHSIFITALFIIAK 


304 


14205 


A 


309 


1 


15! 
j j i 


GTRKTNN* KWJ\KDIiSRKPS KT?n T R Kf5r> * 
V* /HMKRCSASLI IRIM* IS ITV\RYNL 
TCIIM 


305 


14206 


A 


310 


1 


352 


PSPPNSPPSPLPSRPPQPPPYPPPPPRP 
PRSPL*YTPPAPPYW/PSPILTPSP/PS 
DPPEPRSSRP 


306 


14207 


A 


311 


1 


405 


FKPSP* PLTGALLGLLMTSGLAM* FHLH 
SIT\LLILC\LLTNT\LTIYQ*WRDVTR 
ERTYOGHHTPPDOKGLRYGIIIiFITSEV 
FFFAGYF*AFYHSSLAPTPQLGGHWPPT 
GMTPIiNPLEVLLLNTSVILAS*VSI 


307 


14208 


A 


312 


3 


176 


HEILGSNF/CX3ETQVSIHCPG*S*TREIi" 
KQSSHLNLPKCWHLRA*TTVPGLTTIFH 
LGKPKHKRVR 


308 


14209 


A 


313 


298 


327 


SLTLSPRL*KIKK*RKK*NKCWQGCKEI 
GAFTHCWWGCKMVQPLWKS\W* LPKKVK 
* KLSYDPAIPLV 


309 


14210 


A 


314 


344 


3 


HSTSLVIREM* IKTRMTYPFTTTKMAI I " 
*KNRS* \WLGCGKTGSLIHSW*KC*MVQ 
SLWNTFGQFFIKLI*EIiTIPLLDLYLRE 
MKTYDHTKTCI *MFI VALFI IAQNVKMS 
LV 


310 


14211 


A 


315 


18 


321 


WALFVYCLEKKEFGQPPRIACFFFWDRV 
SLCLPGWRAVMGSWVT/VRPKLLAFN*S 
SHLSLPSSWDYRLIPPCSAKDSICEKNR 
VSTVDGADKKVRRGKEGD 


311 


14212 


A 


316 


289 


52 


ETSCDVPSKGILHFGKRNTHIMPEKWAK 
GMR*YFH\KEDK*MAHIHVKRC*TSLVI 
MKMQI KSSTPYHFI PPRLTKPLKSANTK 
CW* TCIWAI YLSSSMKILSHALCPFFWH 
DVCISLTEVKNAFTGHITRRFS 


312 


14213 


A 


317 


57 


324 


YCVTFFFFGKQS FVLSPRLNC2MGPFWVN 
CSLSIiLSTWNYGLVPGTQPFF\EFLVQR 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KMLysine, L^Leucine, ^Methionine, 
N=Asparagine, P^Proline, 
Q=G)utaratne, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GFHYVAQVGLKLLTSGDLLP*GSKGAGF 
TGLTHC 


313 


14214 


A 


318 


65 


341 


QWLLKSRSCYFFFL*DRSWAVAQA*VQW 
R/NPPPPGFKQF/SCVSLLSSWDYRCLP 
PHLADFWIFSRDQPDQHSGTSSLIiKKKK 
KREIPNLPGLG 


314 


14215 


A 


319 


2 


182 


ARGDYRHMPPYT\H*LETGSCFVIQAGL 
ELLDSSHPPALTSQCWDCTCEPQSWSAP 
CVSI 


315 


14216 


A 


320 


345 


3 


MPPRQAHF \VFFI EMGFHHVARAGLELI* 
GSINPPTSASQSSWDHR*SQSAWITGVS 
HHAQLGTITSYHILLFLKKGRAHAC*SQ 
HDPITCLHVI^CCSLLLAISRPPTTTNL 
PRA 


316 


14217 


A 


321 


122 


466 


QSFSTYC*KNWVTI/CFIjKSIDSYLVPY 
TKIDSKWI IDLNVKPKTI KLIjGGNMGEN 
LCHLMLS KDSSDVTQKA* SIHKQ INKFN 
SHGGTRLRSQPLKRAEVGRVRLSPGGRS 
YSDL 


317 


14218 


A 


322 


3 


345 


HEDAVSAPCNLHLPGSSNCPA5ASRVAE 
MTVPA\PCPANFFFFFFUEREFGFCCPG 
/ CK*K* VFQKKPKGKKKGRIBGKKGVWG 
PKLKERERKKRKKERRKRGKSRKEVRTE 
MNGG 


318 


14219 


A 


323 


206 


446 


GNLHQCPYKWTHLAKYMCEKQDS I C* IL 
KECYQKPLLEKSHLIAQVENDEKPADW/ 
PIiLRGEYVEWND 


319 


14220 


A 


324 


62 


327 


VERLLRDLRENFCRNPDGSEAPWCFTLR 
PGMRAAFCYQIRRCTDDVRPQGEAQAWG 
LQSRAGSLEPEGRGEVSA* W/ SARTRHR 


320 


14221 


A 


325 


441 


1 


RXKK* ILQGL/EFRLEHWAS PHRDHRV 
WPRQEGKLFSEGKNKATRMTVLYPEE/S 
SKKLGSRS *GSE/CTSVFRITLSVGYVA 
YTGAP VS I S ERP S AAI YHRMNKW * HQHR 
RHLLGFSEQEPVQGVEPLQGSSQGPKEP 
QPSPSNPAPRA 


321 


14222 


A 


326 


2 


358 


ARARTLR I MVNLNYAKSTLPVLCKGKNK 
VWVTAHLFTASFAEYFKPAVETYCLEKI 
FFKILQLIGDAPCHPRPLMDMHKEVNVF 
VPAKTS IQQPMD \ IS P FKS YY* E I TFQA 
GCCGSCL 


322 


14223 


A 


327 


1 


430 


ARENMPGHLHRCIIEQDWRIHMLIPKLN 
TQMIRIV*YWYKDRYVDTE/YNNIESTE 
VNLY I CGEL I FSRS AKI I QWWGKRIVLN 
KWCWDD/WNL/SCKSM*MDSYL7LCT\K 
ITPWI INPTGRAKTMRLLP/EKTGVSFC 
DFG 


323 


14224 


A 


328 


146 


374 


KCLMLTKPYNIMRLT* PYEIR* /CGDTF 
P * SYYA* TGTAVRTWGLTP VT PALWEAE 
AGGSRGQEMETILANTEKPRL 


324 


14225 


A 


329 


2 


322 


ARASRTFI VRKTQCIASKDKIiTLVRG* C 
SCDFQLKSMLIDYSENPRAIiK/imLYKR 
NNKAWMTLGIiFTAWSTEXKPTVETYCSQ 
*\KITIiKIWI, 


325 


14226 


A 


330 


350 


47 


EMGFLHAGQAGLELLTSGDPPASAFQSA 
GTTGVS HRARPAN * KKKF / CLKT / R VFL 
CCPGWS*TPGLK*SSCFGLSSHWNYRHE 
PPRPAQFIiLIDFYLRHYRES 
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dm inn acid 
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Predict- 
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ding to 
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acid 
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nan tide 
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Amino acid sequence (A=Alanine 
OCysteine, D==Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N-Asparagine, P=Proline, 
0=Gliitflftiine- R= A r p i n i n e. 5v=Serin^ 
T=Threonine, V=Vafine, 
W=Tryptophan, Y=<ryrosine f 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion. V=Dossih1e 
nucleotide insertion 


326 


14227 


A 


331 


753 


458 


FFFVE *G VL \HVAQAGLKLLTSRDLPTS 
AS * VAETTSTHHHA* LHLFNFEKYFCKE 
OILLCCPGWSRTPALKRYSHLCTjPNSWD 
YRHEHCNE PEB I SL I 


327 : 


14228 


A 


332 


256 


3 


TLVLMQSCLSLLSSWDYRHKPP/ *LSNI 
FNLPMGLSGHNPI INGGISVSLDRVWL 
FLFC/LCFETGSHSVAQAGVQWHDHGSL 
HSC 


328 


14229 


A 


333 


25 


358 


TPDIiR* STCLSLPKCWDYKHVPPHPANI 
FI*QKLFFISDGQYANVLTWFEGGTFHT 
*A*RPNHHSYKLRKDQTIIFIYIiFF/LR 
WSL/NSVAOAGVO /WHNIiGSIjOQtjPPHF 
K 


329 


14230 


A 


334 


3 


329 


HEDWS P FWPGWSQTPDLR* SACLGLP K 
CWDYRREPPHIiALNAFLT*NIFNL*W/C 

TJZCiRNPT TI?<?TPTP)n - /3R'^V3*T.T.Ji , TT?T T? 

IT J, lAOlUilvr i-JV3X\VjrV7 JjJjF JLC J. A? 

VWFLTQS LTHSVTQAAMQWHDLGS LQ 


330 


1423 1 


A 


335 


26 


330 


SQLLGRPRQENHLNTGGEGCSL/CKIIIi 
VCFWLGHLLPVWFRC/ ILYLF / CWSLG 
FNFCFSLVL/ CWFLLCFFCAFMFCLFY* 


331 


14232 


A 


336 


3 


348 


DEVFKS/YH*HPMIiKC*GDYSLI*FHGES 
DLDTTQILTHPSTTAM YFVHYCQSP* IL 
YGTIDT* PPVIHRNPIHIRTPYPCU3AS 
TAINLQLSHMNCNSKATPHSLGYHQTYP 
PLTV 


332 


14233 


A 


337 


2 


329 


ARAARAARVTGIAWAYHIjIGKGIiEPFFF 
MALIiFLR*GI*/NSVTIiARVHWYNHGSM* 
A*PPGLRQ1SIiCPA/NFLYFLVE/lGFA 
TIiPRLOAICr.GSSDTiPTSTqRQ AP TTCV 

a. uci\Liyn^ vuuuuUiir X »j xovsOAuJL J.VJV 

s 


333' 


14234 


A 


338 


60 


354 


NGKECKKLI YLS I YLS I YLS I YLS I YHL 
C*YL/SCLSI*LSK*LAVYYLSVCLSVS 
VCL\VAS <TL* IS I YLS IHLSFYS TDRTT. 
AHHCVPPTCSIiPHATH 


334 


14235 


A 


339 


] 


351 


RDADVMLLQILVI FQALALGVQTEGMAG 
VTHHDRVLVNWVRFPOCL /HRV* SHGT\ 
P*VCTSASERPGLKQSSHPSLPSSWDYR 
CEPPCPVNF/CFFV 


335 


14236 


A 








FGSWLVFFF / CFLETDSRFAP * AEMOGP 
NFG*LNP PPPGLRGFFGLSLPGTGDYGP 
V 


336 


14237 


A 


341 


306 


119 


LRWENHLS LGN*GCSEP * FRHCTPTWTT . 
E *DSVSKQKQKQNNLELKKPGSS \CWF 
LPETLFT 


337 


14238 


A 


342 


1 


122 


GTRGCX3EPRSRHCTLA\W*QSKTPSKKK 
KKKKKKKTGPFFF 


338 


14239 


A 


343 


92 


316 


VCWDIPHPJCSRTICILFYPLVIFLFETK 
I CSSFS F * KGCM/WLGAVAHACNPSTLG 
GQSGWIT * GHE/FKTSIA 


339 


14240 


A 


344 


OJ 




ATAPAPCYFLILCI cmcvcvcmhtsvct 
/ CVTLCMCMGLCHIL * TCVA/ CTCLGMR 
FLLPAHI I ICVKRFCLMCGSGNSGSLCL 


340 


14241 


A 


345 


127 


357 


YGVLSRGRVLPCWPG* SRTSGF/ SS PFA 
NSWPPCLANILYLVETGFCHVAQAGLEL 
LDSSHPPTSASQSAEITSMN/HRA 


341 


14242 


A 


346 


312 


1 


LMNLNRSVGALLTRGVKCHKAFRSTISL 
LIGALK* *QSFCSDLGISLSKWKDLNND 
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M 
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nucleotide 

location 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D-Aspartic Acid, 
E=Ciutaraic Acid, F=Phenylalanine > 
G=Glycine, H=Htstidine, l=lsoieucine, 
K=Lysine, LNLeurine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














IYRSLLRQSEEDTS\KWKDIPCS*TGTI> 
MI VKMS ILTKAI YRFNAS C 


342 


14243 


A 


347 


350 


243 


FHHVGQDGLNLLTL*ST/ PCIiGLPKCWD 
YRCEPPRLAY 


343 


14244 


A 


348 


165 


338 


HTLDPASHEGPTFLFSL/ PVEQCLK*LT 
LKVAT IVLFCFCF*DRVTLCHPGWSAVM 
QSQ 


344 


14245 


A 


349 


328 


2 


SVCPHGS INLSPAETTGACH/RRIiA*NI 
DSRV*AKTFKGK*TNCSYNNNNNNNNNN 
NNNKPFLIiLRLAGDRHPDQLCPPPRRCH 
FNGATPAGPPSCLRRSHHSSAVCSC 


345 


14246 


A 


350 


3 


338 


HEIEELITFHDRALIGNILICSLVLYAL 
FLTLTTKLTNTNILHAKEIDTVRTILPA 
I ILILIA\LPSLRILYITDEGNDPSLTI 
KS IGHQ WY* TYEYTD YGGL I FNS YILP P 


346 


14247 


A 


351 


124 


382 


NTFKS * WSNKTKNAI RT* AKDTKQHFTG 
EDXQXANKHMRRCLIP£jGKR\SIiKQEIS 
PRVRINKI RPGMVAHTCNPNTLGGWGRQ 
IT 


347 


14248 


A 


352 


3 


368 


RDRASLFCPGWS *HPELK*SSCLGLPKC 
WDYRR/AATAPGLLCL 


348 


14249 


A 


353 


390 


161 


FSRDRIS PRWSGWSP/DLRIiSP/CLSLP 
KCWDYR/R*ATMPGTFFHFNLKNQDARN 
QIITCPNLTNKYIYMTSSFILVHS 


349 


14250 


A 


354 


123 


345 


LAPQHFASIiRlARP / PAGPPCDRPLAPP 
RPS*FSPAPLPPGADRSVPLSPSVPP+S 
LPPPAGAPPR/PPP PRDSPRR 


350 


14251 


A 


355 


37 


393 


TKHFVSTCYVPEFIAGTNQRKRNRISVL 
ANVKDKQI ILLRVRTAI TEGYRNYHVLF 
LVFVCFFS PPKQGL/DSAFQAGVWWHNH 
NSL* P * TPGLKLSSHLDLQSS WDYRRVP 
/RMSSYF 


351 


14252 


A 


356 


2 


335 


arets kdrltlllganaggdf \ klkpvl 
lyhsenpr plknyawst / lp vl * kwnka 
*mt\hvftewlaeyfkptvqtyclen\k 
fsillinnplshpralmdrfketigvlm 

PA 


352 


14253 


A 


357 


199 


377 


VGSSCFLKVCQ/C*AVCVCVCVCVCVCV 
CVCLRCV/CVCL/CV\CVGFFFFLVLCG 
CGCVCVC 


353 


14254 


A 


358 


1 


335 


GTRTTCMYHVPPRSANF* LVLERWGLPM 
LPRLVANS PPQTI LLLQPPTVLPNPERT 
SALY/YRPLLDLSPTSLSPRIPSSPLTS 
SLSIPAPPRSHIAPPPPRPPPHPPIRPS 
A 


354 


14255 


A 


359 


3 


369 


KPSP* PLTGAIjSALLMTSGIjA\M* FGFH 
SITLLILGIiLTNTLTIYQ*WRDVTREST 
YQGHHTP PVQKGLRYGI I LFITS EVFFF 
AGFF*AFYHSSIjAPTPQL*GHLPLIVII 
LIjNSL* FPTLY 


355 


14256 


A 


361 


337 


121 


LWSQLLGRiRQEDCLS PRSQGQNKP * CH 
CTPAWVTE *DSVSKKGKWYIHNKKSSIi 
KKK\NSCKMQQFGGT 


356 


14257 


A 


362 


256 


375 


EKKTIVQYPHTYGINV* KNLPAKKTPGP 
DGFIDKLYNTFRGEITTSPHILFH* FKB 
VAVLLNSFSK\AASITLTPKPNKDIMRK 
ENYSPISSYIWNQCSFFLEQIFPLSFKL 
NCPGLIIPH 
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peptide 

sequence 


Amino acid sequence (A s Alanine 
OCysteine, D=Aspartic Add, 
EXJlutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Mso leucine, 
K=Lysine, L=Leucine, MNVlethionine, 
N=Asparagine, P=Proline, 
Q=Glutamtne, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W«Tryptophan, Y=Tyrosine, 
X=Un known, *=>Stop cod on, /-possible 
nucleotide deletion, ^possible 
nucleotide insertion 


357 


14258 


A 


363 


170 


1 


SQLLRRLRSQGACHHAWLIPVPSVDTGS 
HHIGQAGLKLLS \ +AIHPPCPPKILARA 


358 


14259 


A 


364 


330 


3 


RIMSASASQSAVI SDMSHDVRL IDI *HE 
P * FYQALYS FRQ * ITSSPSYR*SYLLSY 
KWQDBARe/AFVTAWQAEAGGMLEPSRL 
VLDS*PRDLPTSASQSAGITDMSHHSC 


359 


14260 


A 


365 


1 


341 


SQYGLDFHDLSDLPTMASQSAGISGVSH 
RAWP AS G * F YVSRE VI CS YRL * G YTP S L 
LVLESGS S S VTQAGAQWHNHS I LYPHTH 
GLK*SSCLSITTR/WDYRHESP 


360 


14261 


A 


366 


2 


338 


ARMVSIS* PRDLPTMASQSAGITGVSHR 
AWPATGKFYVSRGVI CFI I * KDTHLFF / 
CFLETGS SS VTQAGAQWHNHS1>L*PQTP 
GLK*SSCLS IPKC/ WDYRHESP 


361 


14262 


A 


367 


2 


337 


ARAGILKH*QRKCKS IR\PLWNOTGYYI 
LKLNIGI THDPAI P FLVLYP SEMHTHAY 
* KTSMRMC ITTLFIMAIN*KFKCS *SEE 
E / IGKFWYFT* * LLYSNENKHLQLHTTW 
IN 


362 


14263 


A 


368 


59 


348 


S AFGIQENEMPAGRGGSHL* SQHFGRPR 
RVDHLRSGVPD*LGDR/ GETPSKKKKKK 


363 


14264 


A 


369 


350 


3 


GWSQI PDIiRWSTHLS LPKC* D YRRGPHA 
QL*TLMYKYI»FEHLIjSVIjLGICPEVEWC 
/DHMAILCLISLGTVILSSKALHHYTFS 
PVTCKCSNFSISLSTLVTFHCFLNACYF 
PLFSRA 


364 


14265 


A 


370 


908 


30 


ECSGTILAHCKLPLPGSCHSPASAS*VA 
GTTGVCHRARLIFLYFLVETGFHRGLNL 
LTS *ST/ SASASQSAE ITSVGHHAQL/ C 
LELELK* STCLNLPKCWDYPA*ATVPSQ 
FGSVFLRIF*PMFISNIGQLFSCSAFVW 
P * HQGNAGL I E* VRKYFLLFKFLKEFA\ 
RTGINFR * MSGRTDW *NHPVRG I SLLKG 
F*LVI*CSY*L*VYPYFIXF*EVSt»LSP 
RLECNGVHSESAGITGGSHHAQLVYPYF 
FFVCDS VLVGFVFLEI CHFIWLFNLLVY 
LCLWYYLI ITLIAVKSQ 


365 


14266 


A 


371 


164 


1 


SCRQVBSYGIKRITCISIASSWNYRHAP 
PRPANF\ * F P AEMG \ FLHVDQAS LDS C 


366 


14267 


A 


372 


3 


383 


MVSTPAEDAVNIVDMTTN/D*EYSLNLV 
DKAVAGLERIDSNFE ISSTVGKMLPNS I 
ICYKEI FHERKNQLMQQKSLLP YFNKLP 
QS IQHSAITTLISRQLSTIWQS /PPPTK 
/RFQLTBGSDD 


367 


14268 


A 


373 


1 


349 


TLliGNDHIYNVIVTAHAFWIFWIVIPI 
I IGGSGN* LI PLI IGAPVMAFARINKLR 
LRLLPTS \ I LLL\LAYAI LEAGARTG * T 
V*PALTRNY*NPEAYGHLSMFSLHLTGV 
SSISR 


368 


14269 


A 


374 


443 


2 


SSGSRSECRFLPIiLFNVVLEILAGAIRQ 
/EKE/I KG IHIGKVKVSIKLTEVINKFI 
KVA*YKIN\QKSWCLYTNNEQLEKKFK 
I PFKI AS KR I KYLE I NH * GKDLYNENLK 
TLLKEMKENKWINIPCSWIRRLNNVNIS 
ILSKVTYKF 


369 


14270 


A 


375 


3 


201 


LWIKKLDIIPIIPS*LFFNTELDKIiIIJf" 
LIWKFRGPR/ ITQNSFEKPIVGGLILSD 
FJCTYYKATVLK 
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Amino acid sequence (A^Alanine 
OCysteine, INAspartic Acid, 
E^Glutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methiontne, 
N=Asparagine, P=Proline, 
Q=Glutamme, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 


370 


14271 


A 


376 


11 


324 


dkflkhp pllvlqavagkgs plhclstl 
ptslfi *drvllcspawsavaqtwftat 
aswaq* ssnhspsd* r/ phtttghffcr 
dkyl/tmlprlvsnswtqai 


371 


14272 


A 


377 


85 


364 


YQHNYS FCFISQYVYRLFFFF * TFFKKK 
GFWF *RLNFQGGNMG /SMEP * PP * IRQF 
FOU&PLN/SPDYRGPPRGANF* IFLKKK 
GFQVFPSLSKIP 


372 


14273 


A 


378 


1 


382 


GTSGTSYSTIFAGTLITALSSH* fft* V 
GLEINMLAFIPVLHQKYT\P*YTHAAIM 
HFLTDSGA/SVLLIEILLLYSNSLGE*S 
S I IELQL IS IFLSMMLVT PNKFGAAYYS 
IHALCLDRPQLLHILLI 


373 


14274 


A 


379 


24 


323 


I PGLKRSSRLGLLKCWDYR * DNFQHMCL 
KNFNNWAAYYFVCRI CCAVDPHGCS IKV 
SFFLSF/CFFFTKSRTVPQABVQGGDLG 
♦IiEPLPPGLMPFSGLSL 


374 


14275 


A 


380 


2 


317 


AGWFQTPDLR*SPCLSLPKC*DFRG*PP 
CQKLFLCP I KI F * TRLK/ CYLNTL * QSL 
PLMHFKKNVIYFILYKAALFFFFLRRSL 
/HSVAQAGVQWHDLG/S/LQTPSPGFK 


375 


14276 


A 


381 


1 


323 


VKRQPTEWEKIFA/ TI> YPSGKGL I TRJ Y 
KGIiKQLEGKN/ ksnnlilkwaked iqta 
NR YMKRCSTSLI IRE I \MQIKTTTR YHL 
TPVKMAFIKR* /GNNEC* *GYSEXRTLI 


376 


14277 


A 


382 


2 


248 


TQPLILRCLPPRS IYRFNSIPIKIQVNF 
F\ *EKBKSIiLKFIQNLKGP*ITKTILRK 
KKVDGKTFPDFTMYYKATVIKTVSWYQ 


377 


14278 


A 


383 


178 


1 


HNPPLAISFCFFPFFFFLRQGL/NSFAQ 
AIVKWYDHDLLQP* PLGLKRFSHLSLLS 
SWDH 


378 


14279 


A 


384 


312 


26 


FLRGVFFFFFLRRSL/ DSVAQAGVQWPS 
FGSLQAPPPGFMPFSCFSLL/SFFFFFL 
VE/MGFTMLARKVSIS*PFGPPASASMP 
VEITGMSHCLANMFF 


379 


14280 


A 


385 


1 


849 


FFFFKQTKFIKLSKYKNI IKKS /SAFLY 
ISNYLKMKFKKIPST*L*FEVNI»TKKLK 
HLTFYSKEHYTN* VTHKWNNITHS * TGI 
FNS*IFVLHKMICRYNATSIKIPVTYFI 
DI F/BKAYLKFI WYHKTP * IAKAIKTKE 
GI /liPDFEIHYKTVVTKTVWHLNKNRDI 
GQWSRRKREQKYISVFTAN* F * IQVTFF 
FKGNNSIFNK*CLENFMSTCR\KKK*DP 
HLTPYVKINSK* ISHLNVRPKTLKLL\H 
QKIE* KPHNTGLGSKFFDLT* ISQDTKG 
RTSQSDHF 


380 


14281 


A 


386 


3 


318 


LREMQNALESLSNRTEQVEETTSELKDK 
AFE*TQLNKDK\KKRI *KNE*RLQEVWD 
CVK*PDLRI IGVPEEEGKS KYLENMFEE 
IIEQNFPGLARDLDIQIQVAQR 


381 


14282 


A 


387 


1 


382 


FTPTRTAWKK/ SNDWCWRGOGS IGTLR 
HCS WECKMAQLLW * TV/ WT FPRKIRQ PS 
DVCCDMVIGLPYDPAVLLLGICPREMKT 
YVHTADMSVITSVSLVI I \ADSENNLNV 
PSADEWINEMWYTHTVDY 


382 


14283 


A 


388 


1 


341 


HKLENLENIVKFL*TH/TLPRLNQEKIQ 
TLNRS ITS SKI KSLI KNTPTRKBKKK\N 
PGPKGFPREFFPRA\ KKQGVPTLGNPFQ 
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G=Glycine, H=Histidine, I=Isoleucine, 
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nucleotide deletion, \=possib!e 
nucleotide insertion 














KFRERGSLLNYFYKPGLFRIPKLGKNPR 
GKKK 


383 


14284 


A 


389 


168 


1 


KKFFFFSFFF * GDR VXiLCHPG WNAVMQT 
RLTAAS\TPGLKQSSHFTLPSTAGYTG 


384 


147S5 


A 


390 


337 


1 


TGITPLNPI»E V\ PLINTS VLLASG VS IT 
*AHHSVIESIGNLI IQALLITILLGLYF 
TLVQASE * FESPFT I SDGIYGST* LVAT 
, GFHGIiRVIIGSTFLTICSIRQLIFHFT 


385 


14286 


A 


391 


1 


235 


IiNFSYESSMYFALFTIVF\WVFLNF*KF 
FMN * I YHLCYVL * YFLLV\ FVCLLTW FM 
VFCFFFFFFFFFFFFFFFCFFFS 


386 


14287 


A 


392 


1 


258 


SCDRLFANHLSNKEIiVSRKYIYIF*KSQ 
DSTIRKQTDKK*AQELNRRFSTKDLQMG 
NKHMKRCSTPLAI /REMQIKTMLRYHCI 
PIG 


387 


14288 


A 


393 


2 


317 


IiAYCNLCIiPGSSDPPTSSSRVAGNYRG\ 
HHDSVF*RAEDINMHEIQFISFLFLNRD 
EVSLCCTGWS *TPGLKRASCLDLPKCWN 
YRHEPLCIiAFFFNGSWFQCQI 


388 


14289 


A 


394 


91 


408 


LGAEFDVRAYLTSGRLTGPGVPFLTGFY 
SKDHI I\ETANISYTNA*ALSITLIATS 
bTSAYSTRIlLLThTGQPRFPTLTNXNE 
DNPTLLNP I KRIiAAGS LFAGFL 


389 


14290 


A 


395 


3 


165 


RNKKLKNNRHWRGC*EKGTtiIHCCWWCK 
LVQPLWKAVW*VLR* /LKTELPFDPAIP 


390 


14291 


A 


396 


661 


1 


LCPLSSFYRKCALLGFWFVCLFVFQIRV 
SLCCPRWLQTPGLKGSSHLSLP ISWDYR 
HLPPRLAIGAVFLI FVLMRTCIjR* FADP 
ISLSFE*QQRN*IiHILISLL*MVDICXT 
QFTKWIFL*PKNS* * LKRQCKLPKITEL 
/PKNRIQIYPTD*MSSIPHSLSFYLVCL 
FVCLF*MESCFVT/TMAAVQWHDLGS\L 
QPLPPGFKRFSCLSLLRSWDYRRPSPC 


391 


14292 


A 


397 


320 


1 


PDSKQQIFNVDETVFC/WKKMPSRTFLA 
REEKS MPGLKASKDRLILLVQANPAGGF 
KLKFVLTYH/SENPRAI KNYAKS \TVL* 
KWNSKVWMTGHL FTALNVLS PRRAL 


392 


14293 


A 


398 


1 


158 


CIGPMWENRLILGGRGCSEL*SCHCTPA 
WAT\SKTLSQKEKKK*NMRKVMIQC 


393 


14294 


A 


399 


264 


1 


LINEFSQVAGYKINKQKSLVFLYTKY\Y 
SKLSEK* IKKAIPFTIPAKKKEIKYIjGI 
NIiTKDVKDLYNENYKILKKIEDT\KKWA 
DTPC 


394 


14295 


A 


400 


3 


343 


HEQKRQSKVREVRELSQGLNMDRKRWSQ 
DLNPGCRTVAIiSPYHHTRLALNCPGRQW 
FVGVNFF*RKKIFFFFKFWDRVLLCHLG 
WSAWPSWI,TAALISW\VK*SFRLGIiLS 
SW 


395 


14296 


A 


401 


1 


345 


GTRKNTDNTKCR*VCEETGYFIHCWWGY 
KMVYPLGKLVWHFLKKVS IHLP YGTSAL 
LSmiEKLTFT/CHTKTCTQM/FHVVLF 
P IVKKWKQPKCLPVGEIjLNKLWYIS IHT 
IYSAI 


396 


14297 


A 


402 


102 


355 


DRVIRLANFCIFGRDRVSPCCPSWARTP 
GLKRSTSLSLPKCWDHT*ATAPG/LRAI 
LLYFWDYRCLS PDLVNFACBHFS VELVA 
FF 


397 


14298 


A 


403 


156 


3 


NNKNGFP * SMLFQPSNGLVS I *NI \ PLK 
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*QKWFS FFFEERVSLFHPGWSAW* SWL 
TACSLDLPAQVILPPLSLPNSSC 


398 


14299 


A 


404 


I 


364 


GTRS EVI YKLLLQT PWVI * HMRTRLLLQ 
ACGAR*MWMIL\MI ILGLVTCRLT I YQ* 
WRDVTRKSTYQGHHTPPVQKGIiRYGIML 
/FITS*WFI*SCVX\WVFF 


399 : 


14300 


A 


405 


3 


365 


HEQTSHNI PLSQSP I *SKILMLLNSMKA 
ERGEEAAEEKLEASRGWFMRFKKRSHVH 
S \ KVQGEAANADVEATASYPEDLVRT ID 
EDGDTNKQIF\NVNKTAFCWKNMPSRTS 
TAREEKQCLP 


400 


14301 


A 


406 


3 


356 


HERSDQI.YAN/ KLDNLDKRSKFL\EGYK 
LQKLTQKEIQNTKRPTTRI/EISKKKKK 
KKKKKKKNFSPGGFTGEIPPTFKGQFKK 
IFKKFFKNLGGEKTIiPI * I YGAG/IKLL 
PKIKK/DPSKKK 


401 


14302 


A 


407 


116 


362 


YKYSLTPQKLYMHSYIKKNHNKQNNRNH 
STTTTTKQPPPPGFKRYSCLSFPI*S*DY 
RCTPSCPVNLF/CVFLVBTCFHHVG 


402 


14303 


A 


408 


3 


360 


HEVRLWDFAFERNEGGGENEEKVDWLNY 
RMWFHLI F* ABCYS ICRLHAYYSCILGP 
VLLLLFVLIILLFCCb/SFFDYF/VFMF 
FIFFYSFFFLIiFSLFFSLFL/CFIFLYF 
FIF/CFFLYFFFF 


403 


14304 


A 


409 


388 


1 


allkfpfpkvlkkpp*gsffp*gf*lls 
/lifppyfgretlffff*nkvplcppdw 
ssmarsrvpagsvfpvk\bslslsllsv 
ppqvqvngfltffffffsw*rqglpllp 
rmvwnswaqailplwphas 


404 


14305 


A 


410 


1 


386 


VFNAEESAY/YWEIMPQRKFIR/EEKQA 
P* FRQERIS* YCANAVVFTIRTTHICKP 
ANPQALKKKKKKKHQLTVFWL YTKKS * T 
MRTLFLDWFCQCFVPKVRKYLASKTLPL 
KVLLILDNAPGHLKPHIFN 


405 


14306 


A 


411 


2 


417 


AHHI FTARIDVDALAYFTYVTI I IDIPT 
GDEPFS *LATLR* INMK*SGAVL*ALRL 
IFLFTVRGLTGIALPDL*LDIVIiHDTYY 
VGAHFHYGLSVGAVFAILGGFID* WPLF 
\SCYTLDRPYAKIHFTIIFIG/VDLAFF 
P 


406 


14307 


A 


412 


295 


397 


WQWPGTVAHACNPRTLGS *GGRV/TLRS 
GVKBQP 


407 


14308 


A 


413 


437 


3 


PGFGSLIGNLIPASGNGRKSKSCVCVCV 
OTSVCFKSCT*SLCEHLFTCIiCPQICVR 
LQLMV/ CPQNCVCWCPESGLFLDESVCV 
RLCVCAWMAVWMGGSGSG*VCGCGC\MC 
IWGPIiDSEIX^OTC/LLGQSKCDCSL 
RCRLPAWCV 


408 


14309 


A 


414 


2 


392 


HLQFI FFWILKI FHYL/FLWFWFYLTSM 
ALFYLQ*QKRNCMYEVFNRGLFF/SOGE 
GVSGSPASSSSSPSCSSSTRGGGAVGGG 
GLGFVCFLLLFWGFWFVFLFCWFLVCVG 
WVWCVWWVFFLLGV/CCWCFFF 


409 


14310 


A 


415 


376 


1 


GFQASKDRL/ IiLLG/ ATAAGDLKLKPML 
I YNSKNPRVtiKNFAKCTLPVIjYRLYR* K 
NYAWMTAHLLKSWFTEDVKSTIQ* KISF 
EMLLIINWVPGHPRTPMEMYKELNFF\M 
PANTS IP* PMDQG I VL 
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Amino acid sequence (A- Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=GJutaroine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possib!e 
nucleotide deletion, V=possib!e 
nucleotide insertion 


410 


14311 


A 


416 


409 


2 


FFPIKKKKFAGAIiVFFFFFRKIFFFFPK 
GEGKRGVLVSLNIiLPLC*KDFFSPPPPG 
EGVLWPPPPGPIFFFFFLKKGFCDFCQG 
GFYNPSLYLI PLS PFKI WG* TRGFT/LP 
PPALFFYFFFF* DRVSLCHPGWNA 


41] 


14312 


A 


417 


425 


3 


RELIiAFWQNFKLRPQGPFFS P \GPGGEK 
KR*SLPKTPPKGGPPGP/SPNGRLPSGG 
QLPPPRGG/SPQGPPPLPKGPQGWGPPF 
P*RSPPCPPHRIGVPQVSSSPTPGLVFP 
RGPLNPGNLGGTKKKKKISSKAARDLEIj 
VRTRG 


412 


14313 


A 


418 


1 


346 


LL P DRNLNTT F FD \ P AGGGDL I L YRHLF 
* IFGHPEGYMLMLPGFGI ILHI VTYYSG 
KKEPFGYIGMV*AMISIGFLGFIV*AHH 
IFTV* IDVYTRAYFTSATI I IAIPTGVK 
VFS 


413 


14314 


A 


419 


2 


382 


LFSTNHTDIGTLYLLFGA*AGVLGAALS 
LLIRAELGQPGNLLGNDHIYNVIGTAHA 
FVINFFIVIPI I I \GGSGN* LLPLI IGA 
PDMAI ARINNI SFRLLPPSLLLLLASAI 
VEV*S^TG* I I YSSL 


414 


14315 


A 


420 


261 


378 


KKNFFFF* KLNFF*KFFLIFFPPKKKIF 
FKKKKKFFFYKIFF/I*KNIFFSPQKNI 
*PFLFFFI * YPFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFTLN 
SLYFSGRVGGRV 


415 


14316 


A 


421 


124 


428 


KPAATHACAT I FMCLDQEAI I SN* H* AT 
TQTTEVSlASFKLDYFSIIPIPVALFVT 
WSI IEFSL*YINSDPNINQFFKYLLIFL 
ITILILVTANNLFQLFIG 


416 


14317 


A 


422 


8 


355 


P VFS YNHS TLLTFSLShSFFFFFFAPGK 
KGEKNGKPGPFRGGFFPKKIKKPFPPP / 
GGPKKPPGPGFPQKPHFPFKGGPWPSKT 
LFGRGGKPNFGGPQKPPLV*NPPSQGP/ 
HGPSGGG 


417 


14318 


A 


423 


356 


3 


KPLGIDLTKKVKKLSKKNYKTLMKKIED 
DSN*KDNSCL*IR/MTILPKAIYRFNTI 
SVNIPMRFLPDAWADAW 


418 


14319 


A 


424 


2 


338 


PS VRLGGHWPRSG I TPLNP VE / VGLLNT 
IRLLASGfVSXTGAHHRLIECNRHQIIQA 
LLITIIiLGLY\LTLLQAS*YFQAPFTIS 
DGIYG*TFFGSTGFQGLHVIIGSTFLTI 
CF 


419 


14320 


A 


425 


55 


342 


GPFTPWSLC*GDLQR* P/RAVKFFLKKK 
KKKKKGNP I RN* RDI S S * FLKNLETAVR 
NFWTFFSYFKSKKLIQKSDHPLKNVKKI 
LQNDKRYLKLDWG 


420 


14321 


A 


426 


2 


367 


DRRRFCTYHKDIGALYLLLGA*AGWCT 
AVSLL IRAEi^GQPGN\ LLGNDHVYNVTV 
SAHAFVIMFF\MEYPIINGGFGN*LVPL 
ISGAPDMECSGINNIS FWLLPP YVLLLL 
ASAIVEAGSRT 


421 


14322 


A 


427 


359 


74 


ICADYTRKPPYMGAPIEFSCATCVL*LT 
STQRECVCVCVCVCVCVCVSM*VALIPD 
SN/CIPLSAYVCVCVCVCVCVCVCVSI* 
VALLLDSNSYH 


422 


14323 


A 


428 


52 


361 


NYPQLS E I * * * VS * LHY/ LFS FLFFYFF 
*LVSRFVARLECSGAWH\ I IAHCSLDLL 
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GSS / DSS P ASAS L VAGTTGMCHQNQ 1 1 F 
LFKKVFVGRVQWLTSVIPATWE 


423 


14324 


A 


429 


105 


361 


SRLFFFFFFFKTNFGFVPKVGVK/WAFF 
GLMEPFAFRVK* FSGLTHPRTWNYRNVP 
HCPVNLEF* VKTGFNIiVDKAGPKLLT* K 
DFP 


424 


14325 


A 


430 


1 


379 


HAYHI VKPSP* PLTGALSALLMTOGIiAM 
* *RLHS ITLLILCLLTNTL\S I YQ* WRD 
VSRESTYQC3HHTPPVQKGL*YWIVLFIT 
SEGVFFAGIF*AFYHSSLAPTPQLGGHW 
PTTGITPLNALEGPL 


425 


14326 


A 


431 


1 


390 


KKVKKWKNLI FM I GRLNNVNMS VLHKVT 
YKFRAICLR/TFHFHRKKKNLTWA*KHK 
KALI PKTILKKNGKSGGITLSDYKM* YK 
TTITTTIRFWWKF/DINKYNKITTHEIY 
FHIYGQMYFFKIAKTFQGIK 


426 


14327 


A 


432 


1 


392 


TRTRGRTQWD* T* LRPI/WTKKKKKKKK 
GKKRKKKKGVFFFYFFLGF * FFLWVFFF 
FFFFLGFFFFFWFFFFFFFFFFFWGIFF 
FFFLLIIYNFFCKF*FFFFFFGFPNFFF 
FFFWFFFFLDFFFYFFFPRV 


427 


14328 


A 


433 


382 


1 


RGGKFQNQKQKRKVSPPPPKKGFFPPAA 
PKNIKGGGKK*TPPKKGGVS\PPPPKKR 
KSPPHKK/ IGNFFPPPRGKGGPPKNPKK 
PGPPFFFFKNPPPFFFFPFFFFLFFFFA 
HKDGLLAREQTQAEVKT 


428 


14329 


A 


434 


375 


1 


KYSQLIFDKTAKAIK*SKDCLLKEWFWN 

kwtsthksldl/spftkinsk* ITGLNL 

KCRI ITLLENNIEENLDRLGFGNDFLTT 
LPEAQSKTELISY/DFIQTNFCSANATV 
K*MKRQATEWEKIF 


429 


14330 


A 


435 


1 


459 


PTRPPTRPSTRTLGFTMLAKLGSNARPY 
/DYPSYSASHSAGITCVKHCARAVIHDF 
NGVHRPLLIW*EFLVEISFIFDFFLETN 
PGFITKGELQGHNLGSLQVSPPGLTLFS 
CLSLQKSGFYG/HLAKYLVKF /CFGIFS 
KRGF*GGYPGGAQSPPP 


430 


14331 


A 


436 


14 


382 


MARNTSQKDIRI D PNNKCLWL VKKKKKK 
KKKKKKKKKK3CKKKKKKKKK^ 
KKKKKRKNYFHVI \ LKNKAVNNGS TITS 
TNVMSPGLHIRLIIILSLMIYKKSTTE* 
VEKQTCYYILM 


431 


14332 


A 


437 


3 


192 


TPGLK*SPCLGLPKCWDYRR/AAAVPGL 
♦DILAIFFHHXLSESCLFLIiVFIHCFNA 
NLFLRWSS 


432 


14333 


A 


438 


I 


364 


DRIALGTVDDL PGRPTRP \AHPIILKVS 
LADRDAI *NLWQI PIVAS* YI PLGF/WS 
KAMPCSVDIYSSFEKKKKTVFFDRGQGP 
TPEIQVLGKAEGVKPFEPRGSKPSWRIM 
AKPKFYKRFKL 


433 


14334 


A 


439 


336 


2 


GMIPEMKGNAGP/ SG / PRKPGFFLG*KG ' 

KUKAWLNFKPPFRAFWPKKKILLPLGFS 

GILGPGPKGNLSRPGPPPQAFPLGFCQV 

XiGP/SGPGVGFPKPGQIGWSNNPCIiTLQ 

KNK 


434 


14335 


A 


440 


1 


333 


HKLENLENIVKFL*TH/TLPRLNQEKIQ 
TLNRSITSSKIKSLVKNTPTRKEK/SKK 
PLVPKGFTIEFSQLCKKLRVPVFMKPFQ 
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N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
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KYGERGLFLNGFYEPGLTL I PKFGKNPG 
G 


435 


14336 


A 


441 


320 


1 


WGINWWLSP*EKKGRPFLI,\SHPKINSK 
WI KDLNVKGKI * KPLEGQIG * YLMI FGV 
EKGLF KGGPKYTL YKRKY* NRE LLF I KR 
YFSLTEKS PAEICHIKKLAKKK 


436 


14337 


A 


442 


8 


391 


ERILRHDQVGIIPGM/QGWVRI *ESVDE 
I /QHV/NQLPKITHTITSVS VGKARDKF 
NIYS * FKLGKTEKERNFLHWIKS ICTSP 
AASIIiLEGKESPF\ PFRSGTRKD/CLIi* 
P/LGSGMVLEALGTAERRTRKEK 


437 


14338 


A 


443 


363 


2 


HHVGQACIiELLTSGDPSNIiASQSAEITG 
MSYRAQLSIVTFSAYFLW*KLSHARGL 
ML*QS III / HVLHFHQ VKKAFHVS /PEN 
SQPLHNVEPEDWIF*EHQRKTVPPIHTA 
RKLQDIiEPCLV 


438 


14339 


A 


444 


1 


900 


DSSAGIT/GICHHAQII/LFVFWETGF 
HHVGOAGIiELLTSGDPPASASHTGGDYR 
HEPPLIiASLSFIiNKELCTWPERRRKPIF 
SLPKLDPNKRKFSFPP*LF/ IGS *TLI W 
SLFCFRWVCTQLADPTKSIAYQSSLMKP 
FGQKS ICFGSEKFPDKVYLFSSDRKAKE 
QVLWWWEY* I *DMSACFQ* ALGFCY 
CliSIWN/YELPEKFEVIiACSLPSRNNDL 
ILSLKKKSQNSFFVCVFFFFKTKSNFVP 
QAEGQGPLTO*LKLPLPGFRNISCIiNLP 
GSC\QTGAVPPPPVNFGFLKKNRVSP 


439 


14340 


A 


445 


3 


346 


QIGKYKKIiNK*VPHEPSKNKIi\ FLEAS S 
LILCNWNEPFLSRI VTWDENWI L YDNH * 
QPAQLIiD* EAPKPNLHQKKKKKKKGLAP 
FWGAFSRGNPFYFFNPQQKLHF * KFSSQ 
KRG 


440 


14341 


A 


446 


2 


246 


FKCGKTRALMHCWWEGKMLRQL\RKVWQ 
LLAMLNMELPREPAAP LLS I S PRERK / S 
TTAKTCA* MCIAWCITVKKWKHRLLG 


441 


14342 


A 


447 


38 


393 


VILHRQGliSL/ VTQMVQWYDYNSL* P * 
TPGLKQSSCLSFLKSYCHG*LFWWVF 
KWM\GFTMLPGLILNSWPQ/ CNPPAVDA 
QIAGIRGFHS/VGQAGVQWHDLSSIiQPIi 
PPGFKQFSHLS 


442 


14343 


A 


448 


54 


540 


RIPFEHASGFLQSSHQKPHCLLHPLSGQ 
VSSDGQ\FRKFGISRLGWSGIYKAAFPL 
HDCKFRRQSEDPS CPNERYLLYREWAHP 
RSIY*KQPLDLIMKYYGEKIGIYFAWLG 
YYTQMXiLLAAWGVACFLYGYLNQDNCT 
WSIEVCHPDIGGMIIMSAQRDRL 


443 


14344 


A 


449 


2 


310 


FFFLRQSL/DSVAWAGVQWRDLGSLQPP 
PPGFK* FSCLSLPSSWNYRHAPARPANF 
FLYF* *RQGVTMLTRMVLIS * PRDLPSS 
AS QS AG I TGVSHHAWAKISL 


444 


14345 


A 


450 


2 


466 


KQKI FS VDETAFY WKKMPSKTFHS * REG 
TASKLHTAG* ILLGPNAAGDLKLKPVFG 
DDSGNLRALKNYAESPLPVLYKWNNKAW 
MTAHLFTAWFTEYFKPSLRS \FRKISWK 
I * LFMDNGPHHPRALMEMCKB /KAVFMP 
ANTTSILQPMDKGVILI 


445 


14346 


A 


451 


670 


212 


SSSSP*GS*YQNRTNTIPSPKKKQTKK\ 
KNNHYKS I FFTDAEILSNI LANAIQQS I 
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KRFIRHEQLGFILWLGWVNIQTPTNAT 
CYTNRLKKRNHMTILINABVLYKIQPSV 
I*KI»FPLRGAF 


446 


14347 


A 


452 


407 


219 


PLISLRWENHLGPGVQGCSEP* LCLCTP 
AWMTE * YP IS PSQNKTKKQ \HTRKQNQH 
KNKCVKN 


447 


14348 


A 


453 


2 


395 


WFLWFRERSHLH / RVQNEAAS ADVEAAA 
S YPEDLAKI IDAG\AKQI VSVEETAFHW 
KKVPSRTFITREKTTSAI»KG/RC*LSLV 
DNAAGHF*VEA\MLIYHSDNPRTLi/KN/ 
YAESTLPVLYKWNNKAWMTACLFTS 


448 


14349 


A 


454 


424 


38 


EETEPL/HRPISSAEVELVIKNLPT/KH 
KSPRPDGFIAKFYKMQKE*LVPIUjKLC 
QKIKEEGLLPNSFYEASISLIQ\NSGRD 
TV/RKENFRPIFPMNIFAKILNKILANC 
IQQHINKLIHHYQVLFSSLHSR 


449 


14350 


A 


455 


2 


309 


PRVRSQTPGHKRSTCDGLPKCWDYRHEP 
LHPATASFLVAAVPGM/FADPP/C^JMHIi 
NE*MNE*MNGDDASEILSFEMRSHSVTQ 
TCGQWCSHSSPQPQPPGPPK 


450 


14351 


A 


456 


3 


441 


DAWGLVLDRERPFFIFFFFFFIiGNGGYF 
RGPGGRPGGEGPPMETSVSPVKKILRGP 
GQGGGKARKPHPLGGPRGPNHKTGG * KR 
AYPTGENPVLTEKP KFTGPGGNGPETPV 
IGKAGAGKPLNPG\MGGSRNPKWAHCP* 
TGGKKGK 


451 


14352 


A 


457 


1 


234 


PTRPNHLGU3DRGCREPRSSHCTPAWAT 
ERDWVSKKKTKG/EKF*KGRTK*MS*VE 
QGGSGGKGIFTLAISNSPLPFF 


452 


14353 


A 


458 


42 


470 


KRIPQLKKSPLPLKNPPGEWVGKINFPP 
PRERPKNFF*KKKTLKTPPKQKFLRKKS 
PFKKPHFLLYSKATKKQKRGKOKAPPP 
KKKKKKENP KIVI FRTEITVAS PVLSWA 
VKPIIHIFFPREKKPHA\KPPPP 


453 


14354 


A 


459 


2 


393 


DRPIEQWNKRERPRINLYIDGQMIFSKG 
TTAFE/WK*NLCNKYVWNAWIC\NRRKN 
PYLTSSRKINLR * I IDIKVK/PKAIKLP 
EQNIRCSL*VLGVGRDFLENMNYTRKK\ 
IGKLDFIKISQLGMVADTSNPI 


454 


14355 


A 


460 


338 


33 


GDEKIEKSFFFPISSLCFVP/AYIi*QQI 
RLLIFTRNSRLLFIY/CLFLEMGSHFVT 
QTGVQWCHHSTLQPRPtiGLKQSSHLSLL 
SSWDYRHLPWLKWTEACAV 


455 


14356 


A 


461 


77 


435 


AKEVEASLSTLARP I S IQQQQQQQQNEK 
KNKN* PRIAKAI \LS * KGEITLP /ELQL 
CYRAMITKTAWY*HKNRHII)QWNRRENP 
ETNPHTYSELIFDKGPKSI\SLFNK*CW 
EYWIFICTR 


456 


14357 


A 


462 


157 


3 


NGRVDLKIQKIiARCGGACLQSQIi/RQEN 
HLNPGEKGCS ES * LHHCTPD * VTKQ 


457 


14358 


A 


463 


363 


3 


PSVAQAGVQWCHHSSLQPQPPRLR\HPP 
ASA/SQ*LGLQGQATALYIHKYILFPCN 
VLISFIFPFFFLEMRSHSV/SQARVQWH 
DGSPQPPPPELKRFSWEAELAVSRDRAT 
ALQPRRQ 


458 


14359 


A 


464 


3 


396 


IJCEIAEKVKKIIKFKKQLRL*MKKIINR 
YLKEN/LNQLLEIKGTLRELQNAVESFN 
NRL*QIEEGISKLEDKAF*WTESGKIRK 
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INK*TK*TDYIKEPNIiRIIDVPGGEEKA 
KSLKNLFEVI IEENFPGLAR 


459 


14360 


A 


465 


399 


1 


PPGVLLNGPPFFFFFLILGPGWVFGQPL 
KTPIiVFFPNFQYHI FPFKI SRI F/LPTR 
FPFQVPPLCP*PFSSPWGLKVVFILLSF 
FFFL*DRVSLCHPGWNAVM* SQLTAASN 
TW\VK* SSHLSLLS S * DYRCVLS 


460 


14361 


A 


466 


279 


1 


TTNI FNPRRVGSTDAGPLNMKEKEKENV 
KRRQATGWEKI VAKDTSDKGLLS KI YEE 
LkKLKiraKTNK*/ILCI»KWSKGFNRYEA 
KEYIQMANK 


461 


14362 


A 


467 


2 


436 


RGELPQLDKYLKKTTAI ILNGEEL/E /A 
CPLRSRTRQGCSLSPLL*KKFFPPKSHI 
ESLLFFNI I LEVLYSAVRQENE I KI VE I 
GKEEMKMYLFTDGNHLCKNSERIRN/YS 
KVAGYKVNV*T* ITFLYTSREQVEFEIK 
KTLSMWRFF 


462 


14363 


A 


468 


2 


420 


RTTALF * AVRQGRLS LQRLLLS FC \CLC 
PAARGEAYIG\RQASMSROGLQAVQASC 
LLCLPKQAWAMAGAPPSASLLPCSLISD 
RCASSQPDSVGVGPSEAGVGYNLWRGL 
LSRSEKR\NIRLGVTRFSRCV/LSPLSL 
TRR 


463 


14364 


A 


469 


1 


416 


PSP* PLTGAbSALLKTCGLAM * FHFHS I 
TLGI LCLLTNTLT I YQ * WRDVTRE S TYQ 
GHHT\QAVQKG\LRYGIILFITSEGSFF 
AGLV*AFYDSRLPPTLQLRGHWPSTGIT 
PLN\HLQVAVLGEPVIaIiAS*VSIT*ADH 


464 


14365 


A 


470 


211 


398 


I FFFPKMGGGLS I ILCWRRKRHPQKLK* 
STCLGLPKWWEYRCEPP/VPGQKKNFRP 
*KTKESPPLVSGGASSASNIKL*IALPP 
FLEKKKFKKQGFKDPPFPLFFTQNPGQG 
NFFFGKI * YCPP I FFFFLRRS FTLVAQA 
GVQWHDLGSLQPPPTGFKRF 


465 


14366 


A 


471 


62 


424 


TLMHCWWKWKMVHLLWKTVLNLLI KIN/ 
RTLNHT* PCNPAIPLLGVSLREMITYVH 
KKPCP * MF I VALFMVTKNSKHLKCSLTC 
KWINKLWSLYTMKYYLATKRKELLNHNR 
SCRNYSSDMS 


466 


14367 


A 


472 


427 


20 


LGLRLPFVEQEENKPYYPLAPFSPPEPP 
MSLYKNSPIYPGKGFLFPSKNNKI PPLN 
FLKMSNLF* / FALLLGNKDFHLLLLGGP 
IGIMGDWMATTSQMFYSRVPIFF/CFFE 
TGSHSVAQAGVQWCNLSLLQSLPPRLK 


467 


14368 


A 


473 


424 


98 


NWYIiYYYYYYHLLF * RQGL/DSAAQAGV 
QWHHHCSLQPRRHRLKQI LPPQLL \TSW 
NYRHKPPRLS P YVAQAGL* LLGSSDLPD 
S VSQSAGI IGNEPLHLAS L* FLFAFP 


468 


14369 


A 


474 


415 


75 


NHFLKFFPQTQANGPGGKNKFFFLKTRF 
CFFPPGKRPWAYYKSLQPPNSGGQQI\S 
APTP*IKGAPKGGPPTRVKFFFFFLF/L 
RWSLRSVAQAGVQWHDLGSLQAPPPGFM 
PFS 


469 


14370 


A 


475 


3 


413 


PVQKGLRYGr \ILCITSEV\LFFAGFF* 
AFYHSSLAPTPQLGGHWPPTSITPLDPL 
EGPLLITSVLIiASGVSIT*AHHSLIEN\ 
NRDQIIQALLITILLGLYLTLLQA*EYF 
ESPFTISDGMYGSSFFVATGFHGLHVI 
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470 


14371 


A 


476 


1 


440 


ITLTDRKLSYY*ANRLL*LAYTITFIV* 
IPI,YGLHL*L\PKAHVEAPIAGSIVLAA 
VLUO.GGFGVIRLTLILNPLTKHIAYPL 
LVLSL*GIIITSSICLRHKDLKSLIAYS 
SISHIALWTAILIQ i TP*SFTGAVTLI I 
AHGLTSS 


471 


14372 


A 


477 


2 


397 


LFVYNHIDVGALYLLFGA*TGVLGTGLS 
LLIRAEI/?HPGNLLGNDHIYNVIVTAHA 
FVIILFIEIPIIIGGFG\N*LVPLIIGA 
TDMAWPRINNISL*LLPPYLLLLLASAI 
AEDGAGTGGTD YP PLTGNYS 


472 


14373 


A 


478 


442 


44 


SSSSPFTPPRGRFFFKKXPRKKFFS/SP 
GNKGFFSPIiSP * KFFFFFNPFFFFGGFF 
PNFPPPKKNFFFKNSPGFFFFPPLKKKI 
FFFPPPLNFAPPKVFFKSPPPFFFFFFF 
FFFFFFFFFLRGHPWGREGI 


473 


14374 


A 


479 


1 


439 


PTRSPTRPIiVLDRERPPFFFFFFKKGPP 
SVPPAGGGGANLG*GNPPPL/GVKKFFG 
PKPPKIGE*RPWPPPPGQPECSFFLKKN 
GLPHGGQGR*N\ PPPRP 


474 


14375 


A 


480 


420 


0 


YSPPSPPK/YRAPGKKFF*KKPRKEKF* 
KKKILGFFFPLSPLKFFFFPKAFKFFGG 
VGPNCPPPKKRFFSKNSPGGFIKPPLKG 
KNFFFPAPVKFGPPRGFFKGPPSFFFFF 


475 


14376 


A 


481 


371 


2 


NKI*NKFSKLNSKKSSSSSSSSSSSSSS 
SSSSSSSSSPSVNK*IKKCSTSLTTSSS 
SSSSSSSSSPT* IAKIKNIDSPKCWQGC 
RETCLTV\LVQPLWWQFLLKLSMQII*D 
SANMILSMHARG 


476 


14377 


A 


482 


3 


335 


HASGKDRHTDQRNRIKNPETDT* IYSTF 
*QKC/RLI * WRKDSLVNKWC* SNWA\SP 
MKKIKLDIiSSSSSSSSSSSSSSSSV*NV 
KIiIjSNNVGGNIjQYRGLRVHTVDVKAQHI 


477 


14378 


A 


483 


1 


418 


GVR*FSPLNPPSRWGPKHGPPNL\LNFF 
FFFFVFLVETRFHPVGQAGLELLGSRAP 
PASA/FPK 


478 


14379 


A 


484 


1 


356 


FCANAVRSMIKTVLIDKATKP/RAWKGN 
YKYHLPVFNOCTGRT \DSGNPLNWFYQC 
FVPBIRKYLARVGLPFNVFLILDNAPGH 
PEPHEFNTEGFRWFLTPDMPLIQPLEQ 
GVLRTLTA* YQCFVPE IRKYIiARVGLPF 
NVFLIIiDNAPGHPEPHEFNTEGFRVVFIj 
TPDMPLIQPLEQGVLRTIiTA 


479 


14380 


A 


485 


166 


406 


FIFWGSVSSSAEGGGWSWSSLPRVTVR 
PDETVDVTIHIiIjKE*CRPGAVAHACNPS i 
TLGGRGGRI/TLRSGVQDQPSQHG 


480 


14381 


A 


486 


107 


429 


FWVTQTFGFFWLNPPGGLELWPPPPCPG 
NFGGNFKKKGGFPLWPGGVQTPGPRGIT 
PPGPPRGGK*RGGPP/SPGPGFLGKPKG 
GGP 


481 


14382 


A 


487 


399 


2 


GSPAPGLPKGWGFRGGPPGPGKFVFFKL 
PKGGFPGAEIFLGEFFQRGKKGLGPIFP 
PVFLKTEEGGTLS/NSFFKGRVFLVFKP 
KAGPKKKKKKTNYKPIYLNNTDTKFFNN 
MLANRI *QCI KEVMQYDRTRG 


482 


14383 


A 


488 


419 


2 


PPPPPPPREKKGGLYFNPKIFLGKSFPP 
RNPKDPS * RPRVGI FFEGPKPPKKKKF* 
FGPWGKPPGDG*RKNPFK\PFFFFFFFF 
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* DGVLLCRPGWIAMAQSRPGLQEWNS I S 
KKRSQPCVLQISS KEVPDANADAWADAW 


483 


14384 


A 


489 


629 


0 


SSSSR*SLLCKCFKN*GEKIRNPIEKWA 
KVMHRLFTKRCINI * KDVRSTSLMVRET 
* ITTTLRYHFSPIRI^KT*KLNRTLYG* 
DOGETGIFIHCWWECKMF*VL*RE/VWQ 
YLIKLL /LHIPVDTA 


484 


14385 


A 


490 


2 


351 


KNRPEMDPGM*GY*VNDEGDTTYQQGNE 
ELRS /WCWDDNRLAKVKLDLYFPPHITI 
EINSKWI *DLNIKNPPIHVLEENMN/l * 
F*HVGIGKHYLTI * LKI *NP * K* * I WLH 
ENKMYCL 


485 


14386 


A 


491 


42 


507 


NLAK* IQ* * IQTIMHHDQVSFIPGIQGW 
LNIIiKSINIIYYTNRL*DKTHITISIDA 
EKAFDKI*YIiFMKGK\KPLSQLGIEGN/ 
YLKLIKGIH 


486 


14387 


A 


492 


18 


417 


REGKKSRVHHFNIKQGRVMSTKH*KTQL 
/SHSNAHSNKAQISKSQPHGLHDFFKKK 
KKKKKXKKKKKKRGEKKK30CKRGKKKKK 
KKKRGGGPP*KGVREALFCFFWRKKFFF 
VGGGGGKTPLGCLQADTPliWGA 


487 


14388 


A 


493 


413 


82 


FFSSPPPPLPFFFPHLFPPPPNFFFFFS 
PPPFFFF/PPP*KTKFPPPPPI,FFPPPP 
LFFFSPPPPPFFFFFSPPPPFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFWL 


488 


14389 


A 


494 


I 


413 


PTRPPTRPPTRPPTRPVLDRBHSPSNLP 

GAKKKG/EKKKKKFF*KKGEKKKPPGKF 
GKKKKI WGGEKRAKTPQKKKTP * GKKKI 
LKGEGGKKNPKPRGGKKFFSGE KKKKK 














489 


14390 


A 


495 


274 


1 


IYRIDCAYMKKVERSKISILSFHIRKLE 
N/QLNPK* /RRRREIKIGAEINEIENRK 
* IEKINBTKS YF\WKISKPLAKLIKEfCT 
QITNTRNRAY 


490 


14391 


A 


496 


336 


1 


VFQYTYNKLVSMFYYCFFL*RWGLTVLL 
RLVSNSG \ PSD\LPS SAS * VAGTTDAHR 
HTQLCFTTLSDIjIiEPYFHLGSFIFSLIi* 
FFFLNTESHSWQVGV*WHNLGSLQPLP 


491 


14392 


A 


497 


445 


382 


PRVLIFFFLTPRGPPPPPPPNKVFFPPP 
PPQNFFFPPPPPPSWGGFAPK/ PPPPPK 
SFFPPKPPPVFFSPPPKEKKFSFPPPPH 
FAPPPVFF*PPPPQ 


492 


14393 


A 


498 


430 


2 


S P PPKPTRM * \ KVKFGR* KKGGGGGGGS 
PPLSPPLWGSKPWP*VPKGR/PLPG*A 
GKPPFFLKFQPFPRPGGGPRCPPLFGGL 
GQKNGFTPEWLLL* PKFSPFPSSLVDR 
RIYCLLKMLEYKSVMMILLFASCLNIiYT 
FPIV 


493 


14394 


A 


499 


363 


3 


KXLVTPARVLGDI I PRNRFJkQMPQQREN 
FLCQVWMTKPPTTIFVKTKTGKWYLISL 
KRFCPAKKKT/IKILVNRVNRQPPEWEK 
I FANYASDKGLISSL YKDLKQVYKRKNN 
PITK*VKGMY 


494 


14395 


A 


500 


145 


435 


VFMCINSSFIxLFIPWYEYTTVWPVTCRR 
T*MFIAALFVIARNWW*SRCPSTGNW/L 
KLWYIHTME 


495 


14396 


A 


501 


162 


1 


FYNYTIIFIYLFIFIFLRRNL/DSWHT 
GV*WRHLGSLQPLPPKFKPFPRLSLP 



546 



WO 01/64835 



PCT/US01/04927 



SEQED 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

Am inn §icit\ 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

OCUUCIltC 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
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niirlpotirlp in«tf*i"Hrm 

IllllflVVUUv Itltfvl 1IVII 


496 


14397 


A 


502 


343 


2 


IKKFRWGPKPPLKEPPGV/SPDFPII/N 
PGNPGPVPGKDFKFPNFPKKKKWGGRAP 
P WPTTPGG*MGRFP* PPAFGVPRGGDG 
SPPPGAPKEGPLSKKKKKENEKEKERKE 
KSARL 


497 


14398 


A 


503 


23 


405 


KGRNYLWEKKKKNMLVRGKIGGGPPPRI 
PKF * KVT FKTPPGPPFFFFGPPLNFFFF 
KTDAPLFF*NHPPKSKIWAIiAPPKKKIF 
LNPKKPPPFFFPPHFFKKNFPRAFFKNF 
\AFSPRGGASPSPPP 


498 


14399 


A 


504 


418 


247 


PPKWGF\FPKPPRGFFFPPPKGKKFFFP 
PPG*FGPPQGFF*RPPPLFFFFFFFFFF 


499 


14400 


A 


505 


26 


426 


G<^LLHS*WYAKWCCTDHPIT*VIiSP 
AFISRISYSPTHPQLILFF* IFFLFLVE 
TES CHT AOAGLK /LP ASS S LP VWASOS A 


500 


14401 


A 


506 


337 


2 


IPNLKESA/ CPNLPKGWGFKD* PPPPSQ 
ISLIFKYPKFLKFEI*KKKPPLKKGLFW 
OT*RKKAPVGF*G\EMALGBKFFFFFFF 
FFFFFFFFFPRRSFAFVAQAGVQWRDLG 
S 


DDI 


1 A A A1 

14402 


7V 

A 


307 


434 


Z 


RRGSRAEAIiLTSQTGWPGROTPHFPDDG 
RPGRGAPLFPDGAAGQRRSSLPRRG/VP 
GRGAP \QSQMGCQLGRGAPHFPDGVAGQ 
RCFPFPRWGSQVEALPTSQMGQPDRGTP 
HIPDGVAAGQRR 


502 


14403 


A 


508 


390 


13 


RIPPPKSRWKGKGPF*VS *NPRPQI*KN 
FLPPPP*K/YGDPRGPPPPPIKFLPLKK 
KGA PP T CPGCFE I PAPRES PPLAPPKS * 
NSRGNPPPPPFFKKKNPLFWGGKTKKLK 
IFFFFFEAGSCYVA 


503 


14404 


A 


509 


2 


282 


WQFLTKLNILPYEPVIMLCIYPNESKT* 
I YTKTYA* ILLAALF IMAKTWKQGVLKK 
V\TDKVWHIHMTEY 


504 


14405 


A 


510 


400 


250 


LLSVTQAGVQWHDHSLLQPQTPRIiK\HP 
PTSAS * VAGTIGAYHHAWI IFFF\ IFI Y 
CRDAVLLYWLGWFPKGLLKCWDYWCEP/ 
RMSGLFVFWSDLGFRSEVS* ILSSWDYR 
CVPP CLDNFFF 


505 


14406 


A 


511 


99 


242 


VTKEKEGHFIRIK*LrHQEDTTTINrGV 
PNNRTLKOIKQKLTELKKBI 


506 


14407 


A 


512 


139 


374 


SLWGKRIFFFFGADPHFVPQAGGA\WGN 
HG*LQPTPVGLKKSSLLTFPISWYYRLG 
TPPPANFKIFCKNGVKPCXIPACF 


507 


14408 


A 


513 


23 


401 


STCLRI/PKCWDYRRBSPLPALCMLFL*M 
KGKYVYMGFFI\LFFFLFLHKI *LLQEN 
FVYVCLKKNBCTKLKA*TKKQKKNETO*N 
ILVDLNVLS *KKWKCLLWGFILFYF/NF 
LRQSL /NSVAQAGVQW 


508 


14409 


A 


514 


7 


386 


FFVFSTUXTLLFLFLFFFFLFPRPPGFF 
LAGKKNTPPAPPEKPPPP\PPKKKRGPP 
FFFFGPPQKKPPNSPRGGRGPPPPRPPR 
KMEKGGPPPPREIPPPPFFF*PPKTPPP 
PPGGGGGGKKTPPKR 


509 


14410 


A 


515 


370 


2 


FVFFQWLPE*NSFLFSPF*RGLC*RDL 
AFKRFFPWPPNKKLWPSLF*GFLKNS*K 
FNFSLKSLEVPFFF\CFPKFFPWI*AFP 
RFFPFFPPPP*RG* IFFF\*DGVLLCHP 
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Predict- 
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nucle- 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla1anine, 
G=Glycine, H=Histidine, I=Isoleucjne, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Pro!ine, 
Q=Glutatnine, RpArginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GWSAWQFWLT 


510 


14411 


A 


516 


1 


163 


NTREKLSGGGGTQLS*/LhGRPRQKNHL 
NLGGGGCSEPRS\HCIPAWMTE*DPVSK 


511 


14412 


A 


517 


1 


390 


RIVRVYYKHLC/Y*GSVNPREKIDNFLD 
TPNI^SRLNQEEIETLSTPISSSI IBTVI 
KSPPT\KQKSPGPE 


512 


14413 


A 


518 


47 


312 


EYTGLILRLYIE/LFTTQH*KQKALNQA 
K*IAKGLNKHFIKRGCMNDPKHMKKSSA 
SLAIRKM* iktmmrchyilt*makienk: 

TESTRP 


513 


14414 


A 


519 


320 


1 


QNPINKQ*KKNNKKYRLV*HKKPPPFTT 
. QPHQSNQHQ\PN**PQKHT*TKNHHTPP 
PPP+SPPPPSLSPSPPPPPSPPSSSHPP 
LLSPSPPPPSSPPSSPPPRTSP 


514 


14415 


A 


520 


1 


368 


LKTQQEANNIilLKWAKDLNRYLIQK/VY 
RNVKHTKRCPTS\RELQIKTRYYYISIR 

* RVKYKKRDNTKC * *GRETTGTLVIHYS 

* EYKMV* PLWETV*OFLTKL / E * S Y / P * 

i — ■ A * » WHO A ■ VI- 1 - 4JX JCvIJ/ JO Ul^ C 

DPAITLLGIYSKELK 


515 


14416 


A 


521 


2 


264 


GKKIPMLYFAEIEKCILKFV\KRS*KAK 
* ILPKNNKAGGLKFPDFKTYYKAAASKH 
QVGGLWKELRSTIiQAQQTWVPREELRST 
SPS 


516 


14417 


A 


522 


1 


408 


LE KMSTS LAI / RVMQS KTTMKCHYI P I R 
KAKI*NNDNIKCWQ*CRETGPLIYCC*E 
CT/LVQPL*KTV*QFLIKVNM\DPAFVL 
T FT P ICFC * KHMFTfnCRNrKHTFP A AT ,FVM 

ANTRI IPNIFQFVEWIiNKLSYKHIVDY 


517 


14418 


A 


523 


320 


3 


T YI Y I KTCPRMFT/ AVMLT ITKKQKQPK 
YPPTDE*NV\YIHSNAYYSEMKRNEVLI 
HATTWMjWPKNIMIiWKA5SRJBKPHTI*LH 
i yemsri ioymvlcnctllt.psmf 


518 


14419 


A 


524 


426 | 


9 


AELPASPTPGTCTPQPLGSGRDQVP\GA 
VGGTHPGGSGLAGSPP*GGLGMAGCKS * 
AliPRGEVTEAWRQFKCGERRQASSTGGP 
GAPSAAAGPGAKPLTAWGQRHQPAAPSA 
GPABPLPTQNWCWPASNPGSRTRLSLHT 


519 


14420 


A 






WI 


ICIWRKlWILLPYTKLNSRWIATiLYVKG 
TTIMFLEVSVREHLHDSQKKIF* *DAKL 
MNQKE*LDILDSINIKSFVH*KAPLRBN 
KGKLQSAIHIYEKGLVSRIYEELLQ\TY 
KETLKANKKKIDNP I EKWAKNIiNRHFTR 
GCPSIYKHVK 


520 


14421 


A 


526 


288 


570 


ATGSLCCPGWS IVAIHGFNHSAL* LLTP 
GLKHPPASASQVAGTAGMSDCTQLCKIN 
FFVFVFL/RRQSLAVtQAEVQ 


521 


14422 


A 


527 


•j 




AVSHDCTTGLQPG *QRBILYQRKKKGTF 
SPLPPPSPGIGGKPALPFWGPKTWPPPG 
IPPFLFSPPPPSQSGGGPRPLBTLAGSF 
SQNPWNLRDGAPPPPGGGFFG/ CPPFGP 
PGGLG/ PPGAPPPPUCNFPKKKGGGGGF 
IiFPPQAKG 


522 


14423 


A 


528 


2 


616 


FFFDTEWRSVAQSGVQWRDIiGSJjQAPPP 
GFTPFSCLSLQSTWDYRRPPPRPANFFL 
YF* *RRGFTVIiARMVS IS *PRHLPALAS 
QSAGITDVSHRAIiQVCFITTLL * LSK/H 
QFK3CAGVTL/PHI^CLHEIGLDCTLHKH 
/WSVHHSPP/HKTNVC 
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SEQD) 
NO; of 
nucleotide 
sequence 
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NO: of 
peptide 
sequence 


M 
eth 
od 


SEQD) 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence <A= Ala nine 
OCysteine, I>=Aspartic Acid, 
E=Glutaroic Acid, F=PbenylaIanine, 
G=Glycine, H=Histidine, I=Isoleurine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Tbreonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /-possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


523 


14424 


A 


529 


57 


485 


RHSSLGNKSETQSQMKKKKREKKKGKE I 
SFDPAISLS I *PKENKSLYQKD/TCICM 
FITAIiF I IAKTQNQPKCPSTOE* INKRV 
\IYIYIYIYIYIMKHYSPIKKNEIM/SF 
AATWMELEAI I ISETMQKQKAKYCLFS C 
I 


524 


14425 


A 


530 


399 


3 


FFFFFFFFWENPPPPPK/WRGGGKKKAP 
FFKFFKYKK * LFFF*CGGGGKQQKKKKK 
KSSSFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
F * FKG FLLGTTHASAHAS AHAS 


525 


14426 


A 


531 


491 


4 


SKWIKDLNLRTK/TTKFVEET*VKIFMN 
LGLGNDFLSMTPKA/LKIDKSDFMKIKN 
/FCSSKD/TIKEVKRQHTEWVKIFTSYI 
FDKGSTSRTCKEL / RKQQQNKPIQKCSK 
VLNRFFFREDIQLSNKLMKGCSTSLNH 


526 


14427 


A 


532 


184 


509 


PQWPAHS FLPALGSSGTGP Y* WRQ I FD 
SKDKESSQ/WSHETSDRPKPADHRRRSR 
PSLATSP\PRLEPHPSLPNHSGLPILSS 
I*PWGAVALAPTHFSALAWP*RPIjPCNSQ 
GEKFFFWFGFF\ERESHSATQAGVLGHD 
LGSLKPPPWGSKGFFCPSYSGG/WNQKN 
HLSP 


527 


14428 


A 


533 


401 


3 


VIREMQIKPLHIQ/ PGWL* SKSVKKY*Q 
GCGQS/NVLIHY/WLKT*IMGPFWKIVW 
QFLIKLPYGTVTIiLSGIYILWRMEKKPP 
KFCTQIFIAAFFIIQ*PKSENNPNIHQP 
KNQ*DVHMYIHTMEYYLAIRMNIH 


528 


14429 


A 


534 


419 


93 


SLAIRQM* IKIPPRYTRVAK\ IEKSHNT 
KYWKRHGATGTL I HCWWE CKMA* S * KI W 
* FLIKSNINLPFWPVILL*GI YPREMKT 
CLYKD\ CM*MFMAVSFIBQNGKQPKCL 


529 


14430 


A 


535 


414 


2 


NFLARGYINCGPQFFFFPPRKNLKWGVP 
LSNFPPKL*KGGIFGEGPKKVKKNFFFF 
FLFFFFF* KKPPGS PRLKAK/ WKIFGPL 
KPPPPGLKKFS FLTPKKNGDKRGGPPRR 
GOTFFFFFLWLVKTEFHYVGQAGLK 


530 


14431 


A 


536 


32 


435 


DRATALQLG *KS ETS S YKKKKKKKKKKK 
IX3GAPLKKNPGGAKKKPGEKKKNFSPKR 
GGKKNPPGNFEKKTNFGGGKNGAKPPQK 
KKT/AWGKKKNLKGKRGKKNPKTLGAKK 
FPPRG* KKKKKPPAARPGKASS 


531 


14432 


A 


537 


348 


1 


EYIKSTHQMGKNYPLKNTVSSGQVQWLS 
PVISAQFAIAKSWSQPKCPSLNEWIKKL 
WCV/WCVOTCVCTCVCMMEYHSAIKRNE 
LMAFAVT*MRLETIIISEVTQEWKTKHR 
MFSL 


532 


14433 


A 


538 


523 


66 


TDQTSHNI PLSQSLIQSEVPTLFNSVKA 
E * CEEA/ AEHKFEASRGWPTRLKERSRL 
RNMSV\GEAAGSPEDPATV/ INGGGRTQ 
PQIFSVAEATLNWKKTPCRPFIGREKSM 
PGFRASKDSLTLCRDFKTHLCLVFHYWN 
AKHVQVY I LLKVTAKI 


533 


14434 


A 


539 


413 


2 


RDKV*PCCRSWSQTPELK*STRLOjPKC 
*DYR/R*APAPGPFLL*VHRHVSIFKSG 
PLSCRCSNFQD/HDS IKPS FPTIQYAHG 
NKFQL*TPOTLI FFLKWSFTFVAQTGVQ 
WHNLS SLQLLP PGFKRSDAWADAWADAW 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A»Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Hsoleutine, 
K=Lysine, LpLeucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Gtutaraine, R=Arginine, S^Serine, 
T«Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=po$sible 
nucleotide insertion 


534 


14435 


A 


540 


385 


2 


MASKHIKT/CARQLAIREVQIKTTQYHV 
IPTRMAKVKKT/ DNECWHGC*NTLINCW 
WDCKMMQLLWKK/ svwhssrgkmyvqlp 
CTPAI PIiLGI YTELKTCSHKN /T/CT * T 
FIAALLVIAEKWK\KCPSAEEW 


535 


14436 


A 


541 


44 


398 


RPPFFFFFKKRPLWKKKQGFPPPA*RGE 
IPSAKK\PPLGPVSGPGNPPGP*KP/PQ 
NPPPLGGSPFGKKPPLDIPFPGGPIKGK 
KWGGPPLAGGATTKN/PPPGGNFWKGGK 
APPFPSQKF 


536 


14437 


A 


542 


370 


10 


FFIjRTKVSLFPRVEGXKVQSQLIiAAPPS 
WGQVIPPLQSPE*LRQKDPFSPGG*GCS 
EPCSCPCLPAWMTEPNSVSKS * KKKRKK 
EKN* *KKKKSFINTTGKIRFRI IH 


537 


14438 


A 


543 


1 


370 


FLIiRHILLCHPGFCSVATAYCSRDPGSS 
D/ PLPPQAPLPDQ * PRLQA/ WHRLAPPH 
SANFF* *RQGFTVLARMVSISQPCDPPH 
WASQGATATKADDYQK 


538 


14439 


A 


544 


50 


395 


ipgltrqwlldpcaspstppyt/ p*vqp 
stpqhss pspkthnqkglpmpls ptpkp 

STAWKKAILEHTSSSSSSSSSSSTCRNR 

ngytytvpehrparghtasqtrkqvlaa 

THKP 


539 


14440 


A 


545 


1 


370 


LCSVTQAGVQRRKLNS VQP PP PRFKQVS 
CLSLLFFFFF+KRVFTLFPRLEGRGPI I 
FNGSPTLRG*GDPPV*ASKELRTKGGPH 
QGQLINLF/ CLGATGPTYGAQGGFKSPG 
LRRWAPLGPPRA 


540 


14441 


A 


546 


178 


365 


YKKTDATKTKMDK*DSIKLKSFSTVKET 
INKGNR\QPTEWEKIFASHTSDMGIISQ 
ICKELKQ 


541 


14442 


A 


547 


11 


236 


KGTTKLWKLSDFKS YFKAI VI KTVRN / 
WHKYK\HIDQWNRIQNPEINPHIYGQMI 
FKKRAKNKQWKKDGLHKKR*WSNDFQET 
CQEQTMEKGRSSQEKMM 


542 


14443 


A 


548 


355 


3 


I KKKNLGRKRFC F FQEEKTW P * T P KKKP 
LENPGGVFPNPGFPPFFKAKIPEGPFPG 
V/SFF*RGGFKGYPPPLFFFFFFL*DGV 
SLCHPGCSAVAQSRLTASSASQVHAILL 
RHVDEGR 


543 


14444 


A 


549 


1 


373 


CFLICRHTSHMWRIWFQTTAINQIWQ*K 
ESQKWSQ\LYI * KLHIaL YEVQ * HFV * K 
YTQ*LLLENADTK^AHAVGKMVLIDLLK 
AALSLIFNL*KNK*NLKNWPGMVAHACN 
PSTLGGQGGWLTS 


544 


14445 


A 


550 


446 


2 


NIDKAPTVLGKMVSIFPTSRSPPLGPPK 
CWGHRREPPRPAYFGI* ILFIiTPP*NPL 
NNSPQHLSGKISGIFSDPSLSVFFSSFF 
FAPPPKKFMIjCF*PFFFFFF*DRVSLCH 
PGWGAVERSWLTAAP ISQ\IR* SFHLSL 
LSSWDHRHA 


545 


14446 


.A 


551 


342 


26 


WAPPIFFFPPYKRSPQKPFPPPGENNSR 
GF* IFG/RGGFFKKEGLSQFFFFFLKKV 
FFFSPGGEPQGYFPPPKASFLKRI FFPP 
PP IKKGDPGRGS P PRGNNNPFF 


546 


14447 


A 


552 


3 


167 


QPHLQDCL* QQNPVAVG* AFSSHPRDSL 
NNPW\WPGAVAHACNPSTLGGRGGRII 


[547 _ 


14448 


A 


553 


3 


413 


TLITALSSH*FFT*VGLEINMLAFIPVL 
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E=Glutamic Acid, F-Phenyla lan ine, 
G=Glycine, H=Histidine, I=Isoleucine, 
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T=Tlireonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possibIe 
nucleotide deletion, V=possible 
nucleotide insertion 














TKKINPRSTEAAIKYFLTQATAS I ILL I 
AILFNNILSGQ*TITNTTOQYSSLIIIM 
AlAIKLGIAPFHF*VPEVTQGTPIiTSGI, 
LLLT*QKL\APISFIYQISPSLKKK 


548 


14449 


A 


554 


441 


1 


SSPPPQARGEIPF*KNPGEEISQHNKKN 
EVFPPPPP*KFFFSPKAFIF/RGGGGPK 
RPPPKKKFFPKKTPRVFYKPPQKKKKFF 
FPPPVKFGPPRIFFKSPPPLFFFFFFFF 
FFFFFFFFFFFFFFRMGCDRWHGKFWIL 
RDGF 


549 


14450 


A 


555 


3 


455 


NTNSSMYVCI YVFILRQSL/DSVTQAGV 
QWRDLSSLQPLPP\GSWDHRHLAQQ*YL 
FNVLLLEV\ FVYERGTIMVP IL* DCNVN 
CRVFGTRS LS VYCVMG I FYRC / H IE SML 
CDRCF I FFDIGSHSVTQAGVHWCNLGL \ 
MKPLPPG*RNFSGLN 


550 


14451 


A 


556 


1 


299 


RRMRQENRWNPGGTGCSEPRSHHCTLAW 
ATEHDS VSKKKKKIRSSIiGKKT* FYHNI 
LKAMG/ CITGIHKGPEGAWQREGHFTPP 
QGPKKAPQRGQTYPDP 


551 


14452 


A 


557 


3 


391 


PVWWNSFEASSGWLIGFKKRSCICNIKV 
QGETASASVEAGVSYPEDLAKITDEDGY 
IKQQIFNVDKTT/YY*KKISSRTFIVRE 
KSMSGFKASEDRLM* FLGANAASNF* LK 
PILICHSKIPMTFKSRAKS 


552 


14453 


A 


558 


337 


1 


TRFPPFLGGFSTKFFYRPLVRFPLL/RE 
RKIPPLPPVFVGGPPPALGPFWGFLFFN 
FSKKRPFGPFWGFFF/ cfpkflpfkpfl 
GFFPSFRRGFFFFFETVSLCQPS * SAW 
Q 


553 


14454 


A 


559 


117 


419 


I P PLLLGVGLFFFF F I RKKRGVFS PRWR 
GGGSNFGLLETPPLGIDPFSGLTPPKSW 
ELRAPPPPPIKF*KFFLKKNGF*WVSPG 
GLEISALLVFPA/SASQ * 


554 


14455 


A 


560 


347 


1 


iykklkqlcrkksnylikkwaknpnrhf 
seeiqmanrqmkrcs\ttpvtiremlik 
ttlrypaspvkmafi *dr* * *mf/ wqrc 
gekgt/h/cwwecksvqpl*ravwrflk 

KLKILLP 


555 


14456 


A 


561 


2 


375 


I PPPLQNGEKI FWI I * TGAEKAF YKI P F 
PFPIK/TLNKVGIKGNFPHI IRAL* EKP 
PAYIFHGEAES FPVRSGTRLCPLLLLLF 
HPVLEVLVRAI CPLK/E/ 1 KGTQIGKEE 


556 


14457 


A 


562 


391 


2 


ASGSKGKKKYPPFLRGSPPLPGNPLFFL 
EGGEGKFP* P/RNWGPPPKI FPQKGQGN 
PFFFFFLKGPKGGGFFPPQGEKGVGFPL 
PGEKSQRPF*GEKGGVPPP*RKKPPPFF 
FFFFFSETVLLLPKLECS 


557 


14458 


A 


563 


384 


3 


ISDSGVHPLGLPKRILLCQLSYHVWP*P 
KCFC/PHLCLFMTLFQPP*GLPSVCAPS 
KQEEHGCFPISVARQEC*PLFPKNLDT* 
HFVGNFLKFFFFFF * DGVS LGHPG WS AI 
LTHCNLCLPGSSNSLPQ 


558 


14459 


A 


564 


522 


120 


SLFLPTLECNGTISAHYNIiHIaLGSSDCP 
ASASIiVAGITGMCNLCLPSSIDPSTSAS 
* VAIDLRK* *GARGNCPVS / VLEKGWPE 
AEMGRGESASWWDPRLGRPGVAKRICRR 
HCCPVQQPQVSMEPSGPASPN 
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E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, I ^Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *^=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


559 


14460 


A 


565 


73 


264 


KDRHIDQWNRIESPQIKSMTIHLWSPDF 
FLQGY* ♦KDRHIDQWNRIESPQI\NL*L 
YIYGHLIFFYKGTE/ IQWRKKTL»SSKWF 
WDNWIFKCKQ 


560 


14461 


A 


566 


413 


1 


EKKKKVPSQVLTPKPQVLGNFPRFSEEQ 
* LSLVPVSSLPLQSLSGKNQGP * ARRVA 
LCFGKSPGSQQIWF/RLP/EDIVTTVQA 
SYSKKRLFLSLLDFQYVLQRREGAVNSA 
NLSLAPW 


561 


14462 


A 


567 


397 


1 


FLGQDIjTVAWARVQWCSHGSLKPRLPGL. 
KGSSRSATG/ SASPYLVPMFLSNTjFWHV 
YIL*LYKTMTGI*MEM*FTPSCPMYSW 
RDV* IFFSFETEACPVAQAGLQWRSLGI* 
PQPPPPGPKRFSCLSLLTSWD 


562 


14463 


A 


568 


437 


I 


KFSEDPAKLIGBDGYSKQQVLNAN/ETA 
LYWKNMLSSTFLARKETSMPSKLQGTG* 
LLLGANAAGDFQLKSMLTCHFKNPRALK 
NYATS P LP VbHKWNNKAWMTVHLMTALL 
TEYFKPTI KTYTYHNTTGSLTTPHASAH 
ASAHAS 


563 


14464 


A 


569 


234 


1 


FFPPFPFKASSPPQGTSSSRGVF/PPFF 
PPPKKGFFPKIPPGSSSPPPF*EKTYFR 
FPPFFLAPPGVFFR 


564 


14465 


A 


570 


2 


396 


FS*AFYHFGLALTPQriGGDWSPAGITQV 
KSIjQVPWNT\SGLIiASGEEIT*ADHNV 
I QNNRNRRI QALL ITVLLGL YFTLLQAS 
ENFKIPFTISDGIYGSTFFECAGLHGLH 
VIIGSALLTICFIPQLTFDC 


565 


14466 


A 


571 


3 


403 


HASGLPSSWDYRRPPPRPANFFVFLVEM 
GFHHLNKAIIKSFACNEIQPLSAVSVAG 
LVGCV*VCKCLFPVL*Q*LFQF* /S/HS 
IANWMREWPLRLSLF* LIC* GERMSGFA 
TQSRRDPCSIiPGFLYQVLSLAKFR 


566 


14467 


A 


572 


100 


371 


YKSNDFYVYGILHFLTTI \FFFFFFKKK 
KFPPVFQVGGQGGNLS* LNPLFLGLKGF 
S CliKLPRS WNNRGAPPPPPNFGFFSKNG 
VSPCNPG 


567 


14468 


A 


573 


371 


1 


REGARES /TWSS SHTPVQKGLRYGI ILF 
ITSEVFFFAGFF*AFYHSSLAPTPQLGG 
HWPPTGITPLNPLEVPLLNTSVLLASGV 
SIT*AHHSLIEN 


568 


14469 


A 


574 


353 


103 


SIiQLQPPGLN*SSHliSIiIiSSWIYRHKFP 
CPANVRFFVFFCRDGVLLVAQAGLRA/S 
ASQSAG ITGVSHIiAQLVNPLLFYFSRAG 
D 


569 


14470 


A 


575 


1 


206 


FCIKKIIGFIPGK*GWFNISQPV*SLTN 
RIKEKIHISMTAEPIISIITEKVFDKIQ 
HPLIIiQKPFGQK* E YKFLNLIKGI CEKN 
/PIPNI I YNGEILKQGCLLSPFLFNIVXi 
EDIVTQSVKIKEGINIETQELR*SLTNR 
IKEKIHISMTAEPIISIITEKVFDKIQH 
PLILQKPFGQK 


570 


14471 


A 


576 


157 


255 


YISPTPPFPPDLPF*NYLPNPKLTSPAP 
NFQIY 


571 


14472 


A 


577 


365 


3 


PPHPSVFFF*KSPPPPPPtFFFPPPPFIi 
FRGGPLYYSPP/SPPLFFSRRGKDTNPP 
PYSGLSVAGQTPPPPPRRTPPLVSPPPP 
PFFFF* RGGPKNKKAKKQTITFKSPPGT 
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NKPKHKNDRDI 


572* 


14473 


A 


578 


2 


323 


VGSWEPDEKVSHRELLLS FLFYPFFSHI 
SS*ETLIDFALTSTDIWALWHDAENQTV 
VKYINFE/HVWVT*FCHPTSPQAISC*E 
DERPGAVSHACNPNTLRGRGRQ I T 


573 


14474 


A 


579 


369 


1 


AHLKGTLGGFPDFP F INLE FRGGLTS CP 
NPLNNWE*R/RPPPPPGKIFGAFFFFFF 
FKVETGFHRVDLLVL IS * PCDLPAS ASQ 
SAGITGVSHRARPTPG*FKKKIMTF/CL 
♦KWWSHYVAQIG 


574 


14475 


A 


580 


394 


125 


AAEGEQGREGWRNRPRGRERAS DRERER 
VR/ERGEERERGEERAPQ* SERWR/E/R 
SRERERVRELWSDSDRALEREAIiLR 


575 


14476 


A 


581 


416 


1 


KGQDLYGEV* KVIiLKVIKQVLNKKKDMS 
CS\KINILISFPIRIAliKFFP/EIiDEl/ 
CSFV**NKC*RIAKEVLKKKQ*aCiGGHL 
/TSPHIRRSYNTTFSQSIWCCYRTKQID 
* WTRTQNPEKDLRI *GDLI YDLRCCSSS 
RA 


576 


14477 


A 


582 


3 


403 


NCFSQFNVX 1MB I PAKFFIDINKLILKC 
FCKGRS ILKKKR* EDS / QRRNFLVIKTV 
WYWQRECHIDQWDRINNPEINLHKYSKL 
ILDKGAKAIR*TKES/l/YSK*C*NWKF 
CM*EK/lDPKIiNPIiYK 


577 


14478 


A 


583 


402 


121 


QSLIHSKALTXiFNSMKAEEAAEGKVEAS 
RGWFMRF * ERNHLHNIKVQVEATSAGVE 
AAA£SPQD*GKTTDE/GGYTTQQIFNVD 
ETAFYWKTMPS 


578 


14479 


A 


584 


255 


517 


IYIFGFIFRGRVLI,/CTHPS*SWLQS* 
LTAAIiNFWAQVS /LPSS WDYRCTPPCLA 
FFFFFCRKGGSTIiCPRLVGNSNPLKFPP 
LGTPCF 


579 


14480 


A 


585 


1 


414 


SSHSCC5KA*SSMGPSPAFYKLVPLFLC 
LFIWDGVWLCHPGWSAWQSLLTAASTS 
Q\VHPSS\CLSLRNSW/EYRYVP 


580 


14481 


A 


586 


392 


43 


CWDYRCEPPCLAXTErGSSYISQAGLEL 
LASSNPPMVCHPKCWDYSHETMHSAQNF 
FLKGIS*F*LCCSHFIHNHLLWLGKVIH 
TYTHAHTGLEKYKTQCLDVKCIYSDLLD 
GAIK 


581 


14482 


A 


587 


1 


394 


GTR/YGINLFITSEVFFFAGIF*AFYHS 
SLAPTPQLGGHWPPTGITPIEPIiEGPLI, 
NTSPLLASGVLIT*AHPSLIENNRNQGI 
QALLIT I VLGLYFTLLQASEDFEDPFTI 
CDGIYG * TFFWTGLHGLHV 


582 


14483 


A 


588 


190 


3 


DGINLKIPGGIFFFLKARSHSVTEAGMR 
W/P/NYGSLQPRPPGLK*SSHI»SLLSSW 
DHRHAPPSC 


583 


14484 


A 


589 


157 


387 


TKKRGRVGGWWI KILGINLTKE IKDLY 
SENYKTS LKE I KER/DTNKWKNI PCF*T 
ERVNTVKMSILPKAIYRFNVIP 


584 


14485 


A 


590 


891 


0 


PPPPFFFFLPALIFFFPPPPQ/PKTPKK 
KKNPPPPPPP\PPPFSHIP*QFLSPPPQ 
HHHFIHHPPPQPPFHPFYYNPFKFFFF* 
NINVPPPP * K* I YFF* TKFFFF I PNI FF 
F\TKKKIFLSPPPKXFPPLLTNI*SLPIL 
IKNLLSKPPPPQI 


585 


14486 


A 


591 


1 


496 


GTRAYQI VKPTP* PLTGALSALLMTSGL 



553 



WO 01/64835 



PCT/US01/04927 



SEQ U> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D^Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine 7 P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va1ine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PMRVHFHS ITLL I LRLLTNTI/T I YQRWR 
DVSRESTYQGHHTPPGQKGLPYGIMLFI 
TSKGFFFARFIi*AFYH\SSLTPTPQLGG 
HWPPTGMTPIiNTLKDPIiLNTCVIiIjASG/ 
VSIT*AHHSIiIENNRNRIIEALLMTIV 


586 


14487 


A 


592 


175 


2 


PFFFFFLLRHSLTLAQAGVQWCDLCSLQ 
PPSPGFKRFSRLSLP/H*PGMVANFCIF 
SK 


587 


14488 


A 


593 


107 


733 


AAAAASKVLM*REGQLPGAT \GTGGVQA 
*APGSVA/AEGA5VEGPGFGDTAPAHQG 
LSPTRSHGQGGAGRAS/SSQQGSPGGRG 
DGASEVWSGAIi/ SPGGGKDGASASVPRG 
PYAEAEKGGWALRGLGGVAAPGPPSRAG 
QAPSGS / YTGPNARPAPWP I PGQGGGLR 
RDQAG*VSSWTGSTEPGAHTAHRAPGHG 
GKQGS PQQPHPQGPGQ I PT 


588 


14489 


A 


594 


10 


435 


FKWIiLKSHAICFWTRS* SYCDNVCVPSL 
WAHHLGIRTEIPEFFLSKFLCTSIIPHF 
TYRRQLRLIQGSTE*EA*EDKLEQK*AL 
GAAQFTLPGMD VFVCFVFCF/ CLFEMES 
HSVT * ARVQWCDLGSLQPLPLGFKQFS C 
LGL 


589 


14490 


A 


595 


437 


3 


DEPKKWKTIPCSWIERXNI / VLKMAILP 
KAIYRINA/VPIKLPTSFFTEVGTFSQN 
* KTT ILKFRWNQ\ KRA* I AKATQSRKNK 
ASS IT/ PDVASNYKTTVTKTARHWYNNR 
HVDQWNTIENTE IKLHTYSQ* ILSKAGT 
SKQWGKEHV 


590 


14491 


A 


596 


2 


498 | 


FFPFLGKTKPPTHLFFFPPPPFPIFFFK 
IFFGPLFKKKNPPK/PFFKRPPSFPKNF 
FFFSPPPPFNPLFSKAPPPIFFFFFKKI 
FFFPFPPLFKKSPPKTPL*ILGFFPSFP 
FFPPGFPFFSLF*GGVSLCHPSWNPWL 
SRPSSKLASAFRMPPVEG/SPFPPSP 


591 


14492 


A 


597 


1 


311 


RRVSSESRWRSLESRGRSLEIQGRMERP 
RCQRFRENSCVAPRHCKGPGGKGLQALF 
♦VPVAQIjGEPGAQLGDPGAHGEATVPEV 
QGE / PAALLPGTAK\ PGGEGS PGFI P PR 
HCRGTGEEGSPGFTQGRETSKAQQRTGC 
LLSCRGCLCLSVSGDS 


592 


14493 


A 


598 


302 


2 


FLRFFHHKKFTKALNRHFSKDILILLSS 
NHMKGCSMSIiVIRD/MLLKIT/MKYQFI 
PTRMA/IIKKTDRYWQKN/CEKI*TFIH 
CW* \NEKMVQPS WKAVWQFLRK 


593 


14494 


A 


599 


3 


386 


HTWPPPPRSPTAPPARTQPPSLQSALPA 
PQPGKKKKALRNEKG*NGSKKG/RGQPR 
PPPIiRGPNGRTRS PAGI CKGGGGFCPGV 
SKAQGAPRPGGRETWPGFFGRRPPIiPR 
GPGPPGVLWTSRHCPLR 


594 


14495 


A 


600 


374 


53 


EGFFFFFPRWVGGQRALFWSPHFPPPGV 
KIFFPPPPPGWGGLRGLPPPPFNFFFFL 
KKKGFS PFCPVFSQLPPAGVPPPPPPP/ 
TGLDLRG* PPGPPPFFFFFFFFF 


595 


14496 


A 


601 


393 


1 


VSPPKGCVSENKI PP PHNYYFAKTFVHV 
KLEX3VAPLFIiQIIiF*GPFLTSIRGLIL\ 
WIPYSPGY\TLTLLIAPRHFLRTIIPVF 
RSVLPKTYLGLSGMPRRYSDYPDAYTT* 
NIGAFR IARES WGRKLDL 
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596 


14497 


A 


602 


1 


155 - 


FCFVT* SGVQWRNHDSLQPI*SPGFK* FS 
/CSSWDCRHAPPHLASFCIFRSTRP 


597 


14498 


A 


603 


170 


455 


KPNASLKRVLILCIQLQSSRQQKNNPT / 
VKKWAKDLNI L FS TEDL * TVNR YI KKYL 
TSLVIKXYHFIPTRLiSKMKKTEHSKCWE 
SCGEIETLTCSW 


598 


14499 


A 


604 


3 


449 


TLLSNLEEAKKKKEDALNETRES*TKLK 
ELPGVCNETMMALWEECKPCLKQTCMKI 
YARVCISGSGLVGRQLEEFLNQSSPFYF 
WMNGDRIDSLLENDRCX^THMLDVWDHF 
SRAFS I IDELF\QDRFFTREPQDTYHYL 
PFSIiPHRRP 


599 


J4500 


A 


605 


169 


435 


KYENTKINRR/ VAPN* ISLSPEKKKKKK 
GG 














600 


14501 


A 


606 


453 


29 


TSPPPPPGENFF*KKPP*K/HFFPPPQF 
RFFFPPFPLKIFFFPPSLFFFWGVFPHF 
PPPPKKVFFPKSPRGFFFPPPLKKKFFF 
SPPLFFFPPPGFFLSPPPFFFFFFFFFF 
FFFSPFFFFFFFFTVKNFYLLCYFYKN 


601 


14502 


A 


607 


170 


408 


NEYDHFS I IKERRDFIVCVFFFIFSR*S 
FTWTQAGLSSLQPPLPGYKRFSCLSFL 
CSWDY/RWHPQGSANFFVFLVEMG 


602 


14503 


A 


608 


2 


448 


SLHPVIYSEGIKSRQSPCWR* *KTVKKK 
KKKNPNQKPKKKKN/RPKNPLGGGKKFF 
* PEKKPGPGKRI tiKEKGKKI CPFPPQKK 
NFKILKKKKKGAPLKKNP*GAQNFPGMK 
KINFFP*RBVKKNPIGIFKKKPLFWGGP 
IGANPPPENL 


603 


14504 


A 


609 


441 


28 


GGTFLKGI KSAPKI *KGFPGWGGARFPA 
RDPPQFGG/ PKGGGSLSPRGIiNPPGPPG 
* TLFFFKKPKLTGGGGPPPLI PALRGVR 
P*NSLYPQRGGGAQIPPGPPPPGGKRAP 
CFQKKKKPRRKEKKK 


604 


14505 


A 


610 


445 


1 


lwlkkntgggggspplfpplgepkrd/g" 

fprgrgfgpppppiknppfflknqnnpg 

ggaqpgipgpwgg*ggefplpp\spgfh 

♦prfgpfppprgtkreppfqkkkkianc 

fllsdksllleeawgqwgpspleptva 

pkpnprgkaq 


605 * 


14506 


A 


611 


253 


437 


kkkkvqdmfsenfkml/neikatlnkwk 

DISYS * VRRLCWKMAILPT/ IQSYRLN 
IIPIKI 


606 


14507 


A 


612 


186 


2 


KXKKVQDMFSENFKML/NEIKENLNKWK 
DISYS*VRRLCVVKMAIL\P\KQSYRLN 
IIPIKI QA 


607 


14508 


A 


613 


37 


453 


KTPPPGENFFIiKKPQKKNFFPPGNWGVF 
FPPSPLKFFFFP *TFIFFGGVWPNFPPP 
KKKFFFKNSPGVFF/SPPP*KKKIFFSP 
PRYFWPP/HGFFFKGPPP 


608 


14509 


A 


614 


1234 


0 


KKPPNTPN* * WAKDLNRHFIKDSSIKDD 
QYY*SLGQCKLISQ YNFTCTR IAI I KKT 

nnnksw*gcgetgnlih/wieckmvq 


609 


14510 


A 


615 


327 


188 


FFSSSIiLI ISIPKFNHVTSIiLRNLK*AM 
LGSSHL* LQLLRRLRREDHLSPGVSGCS 
KL* SH\THCTP VWVTE*DGRKGLAGDGS 
HSVTQTGVQ 


610 


14511 


A 


616 


103 


446 


KXKKKKKKKKKKKKKJCKKKXK^ 
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DGGGALKKNPGGAQKNRGEKKKFFFF*G 
G\KKKTPGGFWKKNPFLGGGNPAPPPPK 
KKKPLEKKKNF*GGRGEKPSPLPCGKEK 
FSHKKK 


611 


14512 


A 


617 


3 


429 


GLLSI I YKEIiLQINKISNLVGKWTKDK/ 
NKQFLKKEIHRLAK\YMKRYSTSLVTIB 
M*LKTRYYFHPLNKIKHDNNIHC* *GYK 
EVGILHILLEA* IGKPFWKAITVTVLNA 
PVFCL* FCFEMEFHSCXIPSLKCRAPTSF 
IGAS 


612 


14513 


A 


618 

• 


427 


6 


WGGPPPPPPIFFFFPPPPXPYFPFFFLQ 
ALFS*RVFFFLTPPPPKNFPPP*GPPPP 
PPPPPPFCV\FFFPPPPPPPPPPFFFFP 
PPPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFLVENCVFINS ITSDIEQKISI 
SC 


613 


14514 


A 


619 


68 


434 


YSKD* PINTKKKKKKKKKKKKRGGAFKK 
KPGGGKNKGGKKKKNFF*KGGGKKNPRG 
NFGKKTLFGGGKKGBKPPQK30CSLEGKK 
KI *RGKGGKKT*NP/ CGGKKFS FGGFF* 
KKFPPGGGGKY 


614 


14515 


A 


620 


454 


90 


NFPTPEKFGPPQGNL*KAPPFFFF*KGI 
PFFFPG*KQRGGFKSPQKPPPQGKTIFR 
PHP PKKGGPQGPPPPGGKI FFFLEF\KK 
KGGFPG*PRRFFFPGPGKPPPRPPKKAG 
IQGETPRPGP 


615 


14516 


A 


621 


432 


2 


PEAFLSSLLHPAPGEKFFLKKTPEEKFL 
TAKKYRVFLPPFPLKIFFFP/LRALIFL 
GRFAQI FPPQKKGFFPKIPRGVFFCPP* 
KKKNFFFLSG* IFAPPGIFFKGAPPFFF 
FFFFFLDRVWLCYPGWSAVARSRTS 


616 


14517 


A 


622 


422 


2 


GGGPPLPPPRGGLPPPK* *KA/VGGGKK 
PPPPAPL*NPPPPKKKIGGGGGGKNPPP 
PPLACFFFGFFLPPPPPSFWGGEKFFFF 

KNKKKNQTKKKKKKKRAAARDPRVRPRV 














617 


14518 


A 


623 


176 


401 


KFSMFI ILWKAYYTLCVCVCVCVCVCVC 
VCVIMYLFISSQ/RRLCFLGEGKICSIS 
LTVLRRGSRVCV*SQDLPVC 


618 


14519 


A 


624 




355 


HSSGLDNLTALAHSHLCCGV/ CLCFLVL 
VCVOTCVCVCVCLCTWCL/CLYPGPQV 
PK*SKNSSTSCLLFHFTSARTICP/CC/ 
CWFCFCFFA/CYQTCL*ILTSCL/AIC 
TASGVOj*ANKYSM 


619 


14520 


A 


625 


2 


400 


HTRLI FCRDGVLLCCPGWLQTPELKSLL 
FSPGFLSLFFFFLERDPGSHGP\VGGGG 
PNLG * WKFWPLG* KQ FSFLTLWRRGNT\ 
RGPPPPPPGLVFWF*KKTGFCFVAQAGL 
EQRPFGDQPVWASQGAGITGVP 


620 


14521 


A 


626 


307 


3 


VFFRLPLTQGPGLFACFCPPPPCRFFFS 
FFPGSNSPFFP/L*KLNFWAFBPPPFFF 
FFFFFS I L FKDRVLLCHPGWNAVV * S * * 
P*TPGLN*SSHLTLPSGRV 


621 


14522 


A 


627 


1 


391 


NPPPIX3GAKEGGPPRARGSNPPYPYWET 
PFFF*NPKIFPGGGGHPVTPSFPGG*GR 
KFPLPREGGVPL/RPNFPP 


622 


14523 


A 


628 


191 


375 


LILIFKTIFNFLILNFNF*SGGATQAGG 
QGRNLG* LQPPPPWLKRFSCLNliliS SW\ 
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DYRGWP 


623 


14524 


A 


629 


3 


382 


HHCQIRLFILLITASEVLFCFVLFFFET 
GPCSVTEAGVQ*CW/HriQPIiSPG/PPTS 
AA* VAWTADVHHHAWLFVCLLI *DGSWS 
VTQAGVQ*RDPSSVQPPTPGLK*ASC\P 
AFQKCIiDYR 


624 


14525 1 


A 


630 


396 


2 


CRIENVLKKKXLEMSNLSEEGMLKANIG 
QKLSLIiAPQQWNAKESLLKEI RSALPL 
NTQMIRMQNSIhSADMEDIL/WWIR/DP 
TNFNIPLSQSIIQSNA/L/TLFNFMKP* 
RGEEAA*EKFDAISG* FTTFKERS 


•625 


14526 


A 


631 


47 


392 


LHSFFFLFLGKKIFTPGGGGGGQKPFP* 
NPPPPG*RNFFFLPP*KTWFMGPAPPPR 
KIFGFLKKKGFPIiGGNL/SIGTPPPRGT 
PPPLPPRGGCTGGGPPPPPFIFFFRTFK 
NFSN 


626 


14527 


A 


632 


39 


382 


LFFSPFFFKNFFFFFGRFFFWGGVAPIF 
PPPKKI FFSQFP/ LQVFFFS PP* KKKFF 
FFPP* IFAPPKTFFSIPPPPFFFFFFFF 
FFFFFF 


627 


14528 


A 


633 


2 


213 


LDKPGKHSKISSLQKI/HNGVCLQS*LL 
GWLRQEDCLSBGS *GCSEL* FYHCTPAW 
ATK*DPVSKKKKGFLK 


628 


14529 


A 


634 


2 


206 


QENGMNPGGRACS * PRLRYCTSSWATER 
DSVSKKNE/TNKTTLLREI *HFVGGPNG 
KKGLLKTVKGGLT 


629 


14530 


A 


635 


205 


2 


KRSLGLLAQ I * / VQWGDFKTLQPIaPPGV 
KQISRIiNLLKKWDY*RGPSGLGKFW/lF 
Li*KQGFPQFFRWFN 


630 


14531 


A 


636 


3 


399 


QVQQTPASCPLDSDLSEDEDLQLAMASM 
TFPKKKKKKKK3CKKKKRGGAP* KKTRGG 
PKK/IRGKKK 


631 


14532 


A 


637 


122 


373 


VSNILWTQSLLFFFFFLK3GSSLFVPQPG 
GQGLDLG* TKFP PRGLKEFS CLTLRI SG 
NYGLAP P /HPG* FCFF I KKG VFPCCPGW 
F 


632 


14533 


A 


638 


390 


3 


LLVLFLPQDGWS PFAFPHEQKi PE VS PK 
ANAAMLPVQPAKP*AHPTFFFY* FPSFR 
FFFARMG+SS\YFVPNLTKLSEILKGPI 
NDMREVFFP I FRLL I LFFFGDRVLLCGP 
GWSSWQ*L*LIAASA 


633 


14534 


A 


639 


392 


42 


P SGPGKPGGKTFIiRKP P FQEACKRQGFY 
PLFPPKPLKNPKAPQNRKNLGPICPPPK 
P/QGPLRISPQNSQSGSL*GPNKGNCFP 
A*TWGPPGAP*RGPPKGPPFFFFEKVME 
AMGLA 


634 


14535 


A 


640 


390 


1 


LSFSEEGMSTAEAGQKIjGPLCQ/TSQVI 
NAKETFLKEIKSVIPVNIQMIRK*NSFI 
ADKEKILW*I*DQTSPN/IPL/SQCI*I 
QSNVLTLFSSKKAERGEEAABEKFINFS 
*RLRKEAASADREAAESYPEDL 


635 


14536 


A 


641 


368 


3 


KKGGYGVPPPFSPPPGFLGEGSFLPPKI 
KVQKTL/SWPPPPPPGGKKKTPFFKKKK 
RKNFIiTGFFFFP*KKGPFFFFF*CPW* 
KFWGGGFKPIFFFFFF/RDRLSLCCPGW 
SAW*SQFTPRV 


636 


14537 


A 


642 


134 


413 


QKDQRNRIESLE/IR/PHTHNTLIFDKG 
VTGFQYGKDSLFNKWCWDN*RE / ITQNL 
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SLDPAHTTTNVNLKWTRCLKARP / VKTL 
VENMGENLRD 


637 


14538 


A 


643 


2 


436 


GRVES INLPLGCRLFKKRDNI KCGQ \G* 
RAGGSLIHCW*EC/EWQPLWKRIQQFL 
IKLNIHLPRDSTILLLDIDLDGHPKTSA 
LFI ITHNCCFIHNHP *LEQPKCPSTDVW 
MHKP WP IHSMEHYS AI KSAAVWDNFKG I 
LL 


638 


14539 


A 


644 


446 


75 


IiDLLTS *SACMIiPKCWDYRSEPPRPAF 
NF*/STTK*SLWSSHKQDYICRFLLSYT 
ATQKFS TLTTY * S HLKTF * NAQM PRLYL 
QSIiILGSIF*KSNLPR*FQCAAKAENH* 
*SDLYCQIMLFSFM 


639 


14540 


A 


645 


318 


1 


KGVPRKCKNSYVGPPSGPPIK* ILFP*I 
F*DRISLCSPGWNARVFS*LPV\T*TTG 
VKKWWPLSLLNI*GYKGVPPRPGIFFFF 
FFFRDR/SLTMLPRLVTWVQVTFQ 


640 


14541 


A 


646 


3 


348 


QFSSFFVLIYFTLPFMVSFCF/WCFSNI* 
VLFCEIIFFLIFY\YYSLVLSFFYCLLV 
LFLSKLSFF/C*YLNirVFICSMTSFF* 
RIL/CCI^FIKTSFNFLL**FGMVIiESC 
FFFFFFWP 


641 


14542 


A 


647 


2 


553 


AIIiIPDKIDIjKK\VTGpKE*NVmi:KGS 
IHQEDVTTINIYASNSR/APKYMKQKLK 

gkekqisp*k/wgifntpfsi/drapr 
qkinqe i edlnni irqidlkdi yrtfhp 
tseyiffssayet/fskighklkhrtsl 
nkyk*teimqsmpsd/hhgmklenne/n 
vgkltnmwksdtlvsdwijgavaha 


642 


14543 


A 


648 


388 


1 


gpfr*sppfgkprgavpq/gggglkppg 
pqgenpfffkkpkitlgpgwgp/rnppp 
lgg*kgkip*/ppggggsnkpnfspsgp 
pgkqkqnffsqkkkkkkekkkekpes*c 
pgtspqpvtdgsqainasiss 


643 


14544 


A 


649 


387 


25 


pgmrggfppfplknfffppgp* /msggg 
ggpngpppkkgffpknppgvffsppkkk 
kiffspppenlgpprdflkgpplffffl 
qknpgvffragqgnppqklakifplppg 
ggggavrhi 


644 


14545 


A 


650 


918 


2 


lgucgltiykilhstiadhtffssshgt 

FAMTDHIXiGHKIHIiSTFSKE*EIIPSTS 
FQ/HHSRNLN*K* INNKVNWKIPK\FWR 
LNKTLLNNT * TKB /GLKRHKNI LS * TKN 
I TYQ I LWDAGKAVLRGKFI VLTT * IRKE 
ERSK\TTSFOTRKRNKKLIINIRTEINE 
IENRKSEKILN\TKSWFFEKKKS IKS IK 
TLARLNKRKRKETQIPKHQK*RRGITTG . 
NMANKKI KRNIHEQP YTHKLGNLDNIGQ 
FIiEKRYLPKLKQGEI * SGWAYIN/SKEM 
ES I INTLPKRKAQDLKMFSQSE * YQTFK 
EKK 


645 


14546 


A 


651 


282 


265 


GVFFLKKRPTVF* KKKI *LGPHPQLKGP 
PGVFQ/HFPI *NFGISGRGDLFFFFFFF 
FFFFFFLFLRQGQVSVAQTGAQWHNQGS 
LQPPGLK*FS 


646 


14547 


A 


652 


6 


223 


LYAHKFNNLDEMDQFLQRHNTPKLIQEE 
/HRLTVIK* IKSIVINLPKQKA*GPDGF 
SGEMYQLLKEEIIYNVF 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
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647 


14548 


A 


653 


128 


337 


FXFFFXXFFFXFFXXXFFFXXFFXFXFF 
FXXXFFFFFFFXXFFFFFFFFFXFFXLI 
FFXFFLFFFFFLFF 


648 


14549 


A 


654 


238 


2 


MAAASTTLLALVI ISTTF*LPQVNGYIE 
KSTPEECGFDP\SPARVPFSIKFIiLVGS 
PIQTHTU^IALFLSHTRALPHTHT 


649 


14550 


A 


655 


325 


3 


CS EHKKNVF * LRAKKTNNPNLKWAKDLN 
RHFSRENIQVAEKHI* ICSASLGLREMQ 
IQTTKC/WHPVWKTV/WTILQN/LHTRL 
PYD/ PAILLLGI YQRELKTCSHKNMYTS 
V 


650 


14551 


A 


656 


3 


1655 


FFLGMESPSVAQAGVQRCDLCSLQALPP 
EFK*FSWLTLLSWDYRRIiPPCIANFVFL 
VETGF\TMLSRMVLIS* PCDPPTSASQS 
GGITGVSHRVQPALIiFFLS clcvkmi vh 
APLVKTCRPSTVGSCL/SVPAFWE 


651 


14552 


A 


657 


519 


84 


SRVRCQSANQMGGASHLGYSGVRDPLEE 
AVFPFSDLKLHAGITTTLFKAVRQGHLS 
GFAKDF*RPAEQ/GT*YTSKXA/STKQN 
ILHEE+ITFVLLLPKHEGIWMPSVPPKP 
FLFFS+GMTNSGSAGILDFVPNHPSKDG 
♦HYVLITSH 


652 


14553 


A 


658 


38 


383 


EKCI FFFFFLERGFTFGGPPGGRAQFWF 
NKTPPP/HKKRBFPALPPPGGGNNGLGP 
PARENLVF* KKRGFPLGGKRGLNPRPQG 
NPPP*PPKGG/GNNGGGPPPRQKK 


653 


14554 


A 


659 


98 


394 


KPPGKKRETPSQ/NKKKKKKKKKKKPFF 
PPRGGKKNP FKKKRGG*KNP P PRGGGKG 
NFPPVMG*RGKNLPPPRKPLFPPPQGGG 
KN*KK/RA*KNSPQNFF 


654 


14555 


A 


660 


214 


567 


GFSVINIGLKMMMMIIIILRDSLTVTQI 
GVQWHNLSSLQPPPPGFK* FSCLHLLSS 
RDY*HAPPCPVNFCILS/ERQFHHVGQA 
GliKLSASSDPPTLAPQRAG ITVPSHQ 


655 


14556 


A 


661 


463 


392 


S*HFHPSSSSSSSSSSVINYY*/FGLLL 
C/QVAHFYNSIDQQMIQSQRPMMLQSAL 
AFEQI IKVNGLLILL 


656 


14557 


A 


662 


205 


242 


GTGWQGGGLDG* QMTLGWPGSGVGGSAA 
P*GI/GPGLPGLPAPPAPSP 


657 


14558 


A 


663 


63 


351 


qsetpsqfkkktqieiilybpavsllgiy 
qkerksvlkkhytpmftaalfti\vwdq 
pr/cpkymkr*nvmy/hvyththtmeyy 

SATEKNEILF\FATTW 


658 


14559 


A 


664 


1 


358 


FFIiRQGL / DSAAQAGVQWCNHS SQRAP P 
LGLGRSSH\PASLAGWSAVA*SRLAATS 
TFRVQVILIiSQPPE+LGLQSCSITQVGV 
QQWRDLGSLQPPLPGFKRLSCLSLPSN* 
DYKRLAPPP 


659 


14560 


A 


665 


10 


257 


GMNERGNITKEI KKI I WEYYE * LGTSQL 
DTL/ELNKFPERYKL/PRTNSQSIENMN 
RTITSDYISNFKRLPKKSPGLDGFTGEF 
I 


660 


14561 


A 


666 


360 


2 


ENKKIFNSAPPPGSPFLFPPLWEIRPEG 
FIiKPRVLNPPWPPN*APPFKPPMGPPFF 
PIiFRGVRVGDPLSPPGLRLP* PGLPP\G 
PPG*PWGQNPTFFSKKKKKSWLDSSGFL 
EPNISVNF 


661 


14562 


A 


667 


328 


1 


NLFAENYECPENKKKNFQKKKIHQIWQN 
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peptide 
sequence 
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SE FMKVTE YKLNYQN/ SQLYL YMPEKNT 
* KV/ ILEKHPL* PQITKYFGE ILKKDVL 
NLVTDKYKISLREVKQEQNKWNNVSS 


662 


14563 


A j 


668 


405 


566 


IHCQWECKTEQILWKTVWWFFTKLNIIiL 
PGDSAIMLLGIYPKELKMYIHTETCT 


663 


14564 


A 


669 


218 


2 


LLMLESLMFVPPFPSFEKWAKL*IY/CH 
GAHANFLPSFFPSPPFFSLSFFFFYNRV 
FLCCPGWSAWRSQLT 


664 


14565 


A 


670 


2 


210 


NFGQEENSEMNSLCSYLHNLEKGEQTRP 
KASRRKEIIKI*AEI*K\VQRSNRENK* 
KKWFFEKINKIDKPLARLTKKWRT* ITI 
VRKETGTITKDPADTKRIMKEY/YKL.LY 
MHS F * NLSRNLKRYREVI E KINERSGS L 
KRSIKLTNL 


665 


14566 


A 


671 


200 


3 


SLCHLPHVASKATLET\GLVEHMW*DFF 
CFFETESHCIAQSGVQWCNLGSI1QPI1PP 
RFKLFSCLSL 


666 


14567 


A 


672 


94 


374 


PKFRPQETTEQTSQFLC3QIWCKGRKRQG 
KRTYRLGEIQKQSQPMATFELYLD/H* F 
* LAKETL* DGAVAHTCNPSTLGGQAGWI 
TRSGVQDQPGQ 


667 


14568 


A 


673 


312 


3 


WKMGQLPGRPQLPQLSQEKILSLNSPNV 
PKKKKLK* * *NILPKKKRPDCFTHKFYQ 
TFKEEI IPF/L/HKIiT*EFEKKBILLKS 
F\YEGSW*AAKLNADINKKTTH 


668 


14569 


A 


674 


34 


438 


QLT* PD*HFIKYSTQQQKTFFSSTHRVF 
AKLGHNL VY\ KANIiNKFKWLQ VI SNMFL 
DRDRIALKINNKK/PSSPLKYLQTHLLL 
NDPRIKEGSKREI IKDFALNDNATY*NL 
WCL*NGTLRKVYTTKCLFRKEGPQM 


669 


14570 


A 


675 


3 


349 


QKDRKS VRPSS *NPPIiAVEKI PI /LKQN 
PERFNGPAFFKIAHGLTSSLVFCLANSN 
YERTHSRI I I LS QGLQTGFAI KTFRKLL 
ASIANLALPPTIKLLGELS I 


670 


14571 


A 


676 


346 


1 


KMVKLL*NIVWKLIiK\GLTEVPHGSAIF 
PFFIPIFPKEMETNVHIKTCTWAFPAAF 
FPMAKKKKQSGNNPTTDE*IQKK\W*TH 
IIDYYSAI KRNELLTHATSCNITLSKRS 
QTKK 


671. 


14572 


A 


677 


357 


6 


AAGDSELKPMFI YCFQKPMPL* IMLKST 
LLVLYKWNHKAWMTACIiFTAWFTE/HCK 
PKLETYFSEKRFLSNIT*LMKASGHPRA 
LTEMYKE INVVFKPSNTTCTLHPMDQEV 
ISTFNSY 


672 


14573 


A 


678 


401 


155 


YYSVAQAWQWRDLSSLQAPPPGFE* FS 
C\PASWDYRHKLLTSGDLPASASQSAGI 
TGVSHCAQPLSLFVFFLTNQRFVAALS 


673 


14574 


A 


679 


292 


395 


Q*KISFEMUjIITNVPGHPRTPMEMYKE 
LNFF\MPANTSIP* PMDQGIVLTFKSYY 
A/RNVFCKAIAV/DSDSFDGFGQSKLET 
FWKGFTI *DVIKNIDDL/WRGVKI PILT 
GVWEKLIPTLINNFEVFKASLEEI 


674 


14575 


A 


680 


275 


3 


WPRQASLAIjNT*SIHIYIHTDMRTHAHT 
Y 1 FGMKSVTQAGVQWHNHGSLE P * S PGL 
RRSSHPSLPTSGEHRRTPPR/LADFLHC 
L*R*GFTML 


"675 


14576 


A 


681 


330 


507 


GSV*PVNLIRNCQPFVQSGCA/SLHSHQ 
E \ WYMCS PHPRHYLVAS VT F I LAILFFF 
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LRRSL 


676 


14577 


A 


682 


390 


43 


HHLPIRYICLSISSIIiCLSFIMyLS/IY 
LS I YLS I YLS I YLCQRCQI PLVFLCLLL 
S*LS*IILILNFSCPFVGRTIVLS 


677 


14578 


A 


683 


396 


63 


ADPTEIQRIISDYYDQRHTNKLENL* KM 
DKFLKTYNLPRLN* EEVETLIRFIANHE 
IALIIKSLPIKRSPVLGGFSVGP*HIDK 
EELTPV3^GLFQK\ IKKEEIIiANSFYESR 


678 


14579 


A 


684 


1 


396 


EETIiPIiFADDM I L YI I KKKQKTKKTTKE 
STKRY*N* *M\NWGKVAGYKVNTQKS I 
N*QYTI *KVKLAS\FSSSPQKNKIGINL 
TKEIQNVYSENYKTLKEIKDLNK*ES IP 
CSQIRRFNIVKMTVLLKLIYR 


679 


14580 


A 


685 


283 


3 


NCVESSNTHLWEPSQKPLSSVSLTDVCP 
PMLS AALFAIA\RS YTLPTRSS IDE* IK 
KMWYIHTMEYYS AFKKKRIMS FLMPRME 
bKIFRVNKlR 


680 


14581 


A 


686 


53 


324 


HCFCDRIRAAFCFYILDCPFFISITHKL 
1*1 FFFFFLERES I FVPGVGGGGPNFGS 
LNPLPPRLRRFSCIiTLPRGGDYGIjGPPC 
PT/NFCVF 


681 


14582 


A 


687 


39 


208 


NIFFCREGFAMLARLVSNS*NWPGKVTH 
IHNP/STLGGLGGRITQTQEFETSIiVBA 
GA 


682 


14583 


A 


688 


47 


341 


SSGRVFVLFCFFETGSHS / VLFRLAYSG 
GISAYCSCHLLSSGDSPISATR/GTTGM 
CCSAQLGFCVCVCVCF/I FVF*VKIGFC 
HVAQAGLEUjDSSNPPTS 


683 


14584 


A 


689 


229 


1 


GRVDGRQIACQEFKARLASQSAGITGVS 
HRAQPFFCFC/LLFWFEMGSHSVTQAG 
VQWCAPGSLQPLPLRFK* FS C 


684 


14585 


A 


690 


339 


1 


KKGPPGGPTKPEGGRFPRGPPGGGKEVF 
PPPPPGGKGEKKRGGAPPPFFFLKKKRK 
SGKGG\N* FFPPGGGKKGGQKKKKRVFF 
PKKKKKKKKKALSIiSLSIiSl^IiSLSIaliG 


685 


14586 


A 


691 


372 


0 


YDYLYIKXL*N/LREKIDKFLDTYKTLSR 
LNQEEIENLNTPITSNKIETVIKSPPT\ 
KQKSPGPE 


686 


14587 


A 


692 


117 


297 


APLLESSSASYLHFSN*DIiQMAH/KHTK 
RCSTSLAFREM* IKTTRRHHFTPTKMAT 
SKRHT 


687 


14588 


A 


693 


122 


322 


EQTNSWS I DFFFQRCHSNAMGEKNCLFN 
KW* WDNYVAI \AKKMKLSLFKP YI KTNS 
K/WIKDLNIRAKT 


688 


14589 


A 


694 


21 


342 


RSHFRHLNEYYATQQGNSDVHPLLIHSN 
ISTAI IWQLQRQERRKEKRLKEVKNEKA 
LKEI FFLRGWVLLCHLGWS PWGS * FIV 
TIiN\ FGLRQSSHLSLPSSWDYRPV 


689 


14590 


A 


695 


2 


343 


KFVPLHSGLGNRSQKQTLSQRKKKKGGV 
FVAWAKVQWHNHG * QGP * TGLRQS SCLS 
MGGWVKL/PGGRHHAWGFHHVKKVGLE 
LIiTSDDAPSSASQKAGITGMSHTAPGPK 
NIjV 


690 


14591 


A 


696 


336 


3 


I PPVKS PKFPILLKKKKFSLFCFSGAPF 
PCYSPPRAQKKGW*KIFSPPPK\QKKPG 
ALFFSPKPAFFPPPFKKXKKKKKAKEGH 
YIMIKGLIQQEKIILNIYASNIGARRRG 


691 


14592 


A 


697 


374 


2 


AFY * KKMVS RT F I ARE KSMPRF KAS KDR 
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LTFLLGAGNFKLKPVLVYHSENTRALKN 
YAKSTLSVLYQRTTKAWMTSHLFTEWFT 
EFFEP\TVETYCSEKKIPSKILIiIiTDYA 
PRHPRTLMEMYSC 


692 


14593 


A 


698 


361 


3 


PQMCPPPPR*QGI IFL*L*THCPFCTVK 
ARIiFLFPPF/S*RKGGFFFFE*RKFFKY 
FGWELFQIRFKVFFFCDRVSLCCSGWSA 
GART*LTAASTSPAQASLLPLTLPSSWD 
YSHVLPRLV 


693 


14594 


A 


699 


75 


362 


KDCAYGICSKKQKGFCLTQTGAWQQYEE 
I * GLBT/GFSRSVHSMGQRRYWDYRHAL 
THPANFVFLVETRFLHVRQAG I ELPP / S 
SSQSGGXTGVSQ 


694 


14595 


A 


700 


2 


277 


FFLYTSNEQSEKKI /RETNWFI ITSK* * 
NRNVNKE I * DLYNENS KTFLKE I KEDLN 
KWKGILSS *TEKLNI IK/PTAI * *FNAI 
LIK/IPMVFDK 


695 


14596 


A 


701 


1 


407 


GTRKHFSKEDRQYHKSLRKQIKT/TVSY 
CFTPTKLTTVGKTANS K/CWAGCKVTGT 
LIHCQ * ECEMVWLLWT/SVQQFL /RNVK 
L YETAIS LLGICTREI KTPLHI 1 1 \ I IA 
KKWQQLRCLSTDEWINNM*YIHTMEYYS 
A 


696 


14597 


A 


702 


209 


3 


SLNRRLSFFFY*RQGLALLI/EPGWGAV 
ARSWLTPASN\FELQRSSCI*SLRSSWDY 
RRVPLHRVSLRFSC 


697 


14598 


A 


703 


308 


3 


T TGLKRFARVGP P KCWDYRCE P PGLATS 
SVLKAFQPIGSVPPRIoSRILSFM^QX* 
YRMLI TSTKYFPRMVS ISRPCDLPALAS 
QSAGITGMSHHAELVPLV 


698 


14599 


A 


704 


603 


2 


plpp\glqvesp/ cvslpsswdyrhapp 
rpanfsvf**rrgftmlakm/vlis*pc 
dplasasqsagisgvshhawpkqt*lld 
tdktegifiitnhllripppmfyqyilik 
vlesvketkdkrksvssctlyrcii* ilf 
is *ayikfttffff\etosrsatqagvq 
whglsslqppppgfk* fsclsllsswdy 
rsmpprpa 


699 


14600 


A 


705 


3 


366 


ARVTYYSGKKEPFGYLGMA*AMISIGFL 
GVI V* AHH I FTVGI DADTRAYFTSAT 1 1 
MAI PTGVKI FS *LATLHGSNMK*SAAVL 
* ALRFIFIiFTVGGLTG ICLSNSSLDI VL 
H/DTYYVGAHF 


700 


14601 


A 


706 


358 


3 


GFHHVGQAGLKLLTS * S ARLRLPKCWDY 
KREPPHLAPDRKYFSPLVFARDNFVPQG 
P* \HVWINICIiSQVKAATGIYPGEAGAL 
RNRLHPH*RIFWPQRPRVPIKSPWTIiRV 
ISTLCSC 


701 


14602 


A 


707 


358 


67 


QLSSTIWY* * KN/RIDQWNKIAQNTSVH 
RKLISDKVTKPIQWSKDSLFNKECWNN* 
TSICKK/ LSLDSALTLFTKTNSK* ITEP 
GSVAHTCNPTTLGG 


702 


14603 


A 


708 


34 


363 


RTVFFFFFFFFFFFPKKTPFFFPGPRGG 
*IPPPEKKIFPPVFRGKFKKPPHLDPGG 
GNFPRKPPFFFLGP\SFFFLGKTLGGGG 
WIPFPPPWPNLFFKKIFWGAGGPSPF 


703 


14604 


A 


709 


I 


380 


STCKRMTLDSYFTPYTKINSKWNKDLTV 
RGRTIKLIEENIGVNLHDLGFGSGFSDM 
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TPKAQAKRKIN/ WDF I KYLKFCA* EDS I 
ERV/N/ RRSTEWBKI FANHIQ *GN*NIQ 
RILKLSKL*ARLKWAEDL 


704 


14605 


A 


710 


382 


30 


PGTTGVHHHTRLT FVF/CGRAKI S I CWI* 
VWNS*AQSACLSLPKCWD*EPQHPARFV 
S FNTRSGI PI ISTNPSLSLS AFPSQGFV 
FLFPHFPASHPETALAEFFLGTFSLSPP 
GPFTPAS 


705 


14606 


A 


711 


1 


377 


LLVICGMQMKITVRCN*TLNKIAI IEKK 
TAKCC*HYGGLKTLIHQWCKSNMVQPLC 
KTV*QF*KKLNIQLLYLLK*KKAYIHPS 
NCTQMFIA/ ALFS VLAPNWKQSAY 


706 


14607 


A 


712 


1 


390 


LKYKIMLGMAAWACNPNYSGGPGNITA* 
TW* AYMALTRLE PGRGDHTAALQPGRQS 
TTPF*KKKAVPFRA/RPVKMREREI*KP 
FS PE \RTYS CAQEGPGRTFGSAQDLEAA 
GGRGHHRMGAVWQE PHRLLG 


707 


14608 


A 


713 


2 


393 


KVRRQI INVDTATDS IGIKWI I WKHYV* 
LYANKFNYLAEMDTYVERYKLP\KEIDV 
MNSLVPAKE I IWVKILLTKTPDPHIFT 
NEVYQTFKEYY/ PVLYKLFQKIEDERTF 
LSSFCEAGINLISICKRFFF 


708 


14609 


A 


714 


219 


1 


PRPGAVAHACNPS TLGGQGGRI * TQSS I 
IKACCLIHDLAIQTQKQMT\WPGAVAHA 
CNPSTLGGQGGRITRSGARDQPSQHSKT 
PSLLKIQKKLASWAGACNPGYFENYIQ 
KLLRGPGT 


709 


14610 


A 


715 


381 


126 


I IAGGGGVCT*SQLFKRLRQEDHLSPGV 
RGYKBL*SYPCTLAWVTE*DPVF*KKNY 
/TLNTSPKETNKK*GVRCIKTGKVLIFV 
AAG 


710 


14611 


A 


716 


417 


I 


OTCCWGATNKILLLTDNASGRQIVLMDM 
YKKNNWFVPVNTRSFVQLMDQGIILNF 
KS YLRN/TF/HKCIAVTNSNSS * ESGQS 
KLKALWKGFTVLYAI QNI CDSGQVKMPT 
LTEV/WKEVILPLMGDFVRNRTSEKQNY 
KLS 


711 


14612 


A 


717 


406 


122 


MFSRDAVSLCCLGWS * TPELKLSSSLSL 
PKCWDYRGEPPCSAAKDF*I IRKR/HSC 
RNI IRNSDAWQR * TTLVS YDRNPP * FPS 
PPPNSPLCPAA 


712 


14613 


A 


718 


535 


1 


HNIjNKIV*NLHVENYKMLVKDKKT*ISC 
LWVRR\LVKIPVLPKLFYSVIVFPIRSL 
VRYFIEVCTFTLKCIPKEKDLKQPNRF* 
KKKKKVHLPFIKVYY\KLQ*YIATVIKT 
VWY/W/ YSKIDKAKWYKTEKAEV* PHEY 
DQLILAEVQINH\NLFNKCY*SN* ttig 
KNMNLNLSFTSYT 


713 


14614 


A 


719 


255 


1 


GKLQPNKGLISLIY\NL*KLVRKSNPKE 
KWAKGMNR*LTEKGIQIAFRHMKRCSPR 
FIMKAMHI ITTWSYHFSSKRKTRIQRLT 


714 


14615 


A 


720 


381 


0 


KFIAVNEYIIKEIDLPINNLTLYLKEL/ 
EREELTLCKASRAEVI\KIRAKI irelt 
PIRKTDNQSWFFLKIHKIGQ/PFKIiD*L 
QKKKRVREKVPZTKLRYENRVTTTNFTE 
/ 1 KRI IRE YYDH 


715 


14616 


A 


721 


3 


381 


IYREWRDLTRDGTYQGHHAPPGEKG/VC 
RYGIILFITSEGFFFAGFF*PFNHCSLA 
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PTPQLGGPWPPTGITPLNPLKVPLIiDTS 
VLLASGESIT*AHHS\LIENNRNQLIQA 
LLITILLGLYFTIiLQA 


716 


14617, 


A 


722 


56 


455 


KTI INIKPPAI IQYQTPLFV*SVLITAE 
\LVLLSLPGLAVGITILLTDRNLNTTFL 
DPAGGGDP ILYQHLF*FFGHPEVYILIL 
PGPGI ISHIVAYYSGKKEPFGYIGMV* A 
MISIGPIjGPIG*AHHIFTGGIA 


717 


14618 


A 


723 


398 


3 


HQLPVFWQYNKKAWTTRPLFVDWLH* Cb 
VFE VS KYPAS KGLPFKVVLMLDNAPD/H 
PTQNSMSS/DSKGIEVIYLPPNTMSLI* 
FLDQGVIRTYR/RHYPQYSMQRS IRPMQ 
E I SNKENI I KVWKNSTTDDAI VA 


718 


14619 


A 


724 


318 


14 


TPPFFIiNIPNLFFLKFPSPPPPWFLTKS 
LFFPLKLKPKVFGK*SLSKF/CPFPKFN 
PKPQKKNFFFPPPPFFFFFFFFLRHNL/ 
VCHPAWSALAPSQLTATSAS 


719 


14620 


A 


725 


2 


371 


APRCKRFSCLSLPSGWDYRCMPPRPVNF 
F/SIFSRAGGLES*PPMIPPA*A/FPK/ 
CWDYRHE 


720 


14621 


A 


726 


430 


97 


PKKNPFWKRA*NGGAQKDVPLSLP/SGW 
AGGSPWGPGF* PPLGPKGGPPFSKKPPG 
GVGPFLMPPTPEGGGGKLA* PWPPGFNP 
PVSIFLPRPPRAGGKKGTPFPKKKKKEK 


721 


14622 


A 


727 


406 


1 


GGAPPPPFFFFFFIKKGVFPF*QEGFFF 
PGLKN/I*PP*PPKKVGVKGGSPPPGGFF 
FFFKKKVFFFFPGkRTRAQIKVPSTLVG 
QG*KILPPQVSQEGGVKREGPQAQLFFF 
FFFFEMESHFVTQAGLQWRDIGSP 


722 


14623 


A 


728 


98 


368 


KPHKN AKCP I RVAI VADME NNKYW * G YG 
EMGTL/ I»HY* WEGNTEQP WE /T/VWQLL 
EKLNTELPCDPATPLLGIYPKTLESRDS 
NRFFIFFFSF 


723 


14624 


A 


729 


19 


404 


VCVIS ICLATE ILFFLHTRPCVCIYCVC 
VCVCLAVWGTLCVYVCIESCVCVCFHLT 
GVLC/V*VCVISICLATEII*FFLHTRPC 
VCIYCVCVOTCl^WGTLCVYVCIESCV 
CVCFHLTGV\CVCVCVCIESSVCVCPPT 
CGTVCVCVRPPGVLCVCVCVCQAAWVAV 
SMCLTP/CPC/VCVCVCVCV 


724 


14625 


A 


730 


432 


12 


FFHKPNFPAARKGRIiFFPIYPSKCLISP 
KAL*F\FGGVGPFFPPPKEGFFPKVPR* 
CFFRPLIRKKQILFLPPLNLAPPGVI *M 
/PPPPIVFFFFFFKRDRVSLYCPGWSAV 
A*S *L/TAVRTRGLK*SSCLSLPKCWEY 
KCEP 


725 


14626 


A 


731 


332 


7 


NKTRKEKFLTRSISRF/CFPPFPLKIFF 
FPLRA* FFWGGLPQ I FPPPKKVFF/PKI 
PKVFPKWPPLRKKIFFFFSPLILGPPRV 
LLKGPPLFFFW*VFFF*DRALLCHPY 


726 


14627 


A 


732 


70 


294 


FLRFCVETAINKIAI IIANF*NFL*RWG 
\FTMFPRLVSKL*DSSDPPTSVSQSVGI 
TWSHHARLNFFYYQNSRR 


727 


14628 


A 


733 


1 


355 


FLLRHILLCHPGFCSVATAYCSRDPGSS 
D/PLPPQAPLPDQ* PRIiQA/WHRLAPPH 
SANFF* *RQGFTVLARMVSISQPCDPPH 
SGPQGA 


728 


14629 


A 


734 


50 


395 


IPGIiTRQWLLDPCASPSTPPYT/P*VQP 
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STPQNSSPSPKTHNQKGLPMPLSPTPKP 
STAWKKAILEHTSSSSSSSSSSSTCRNR 
NGYTYWPEHRPARGHTASQTRKQVLAA 
THKP 


729 


14630 


A 


735 


451 


0 


GPSSSPTNIPTPVSWNSFK*SSQVAGAT 
GTCYHAWLI FVFSVEMDFY/HPVA* ASQ 
SAQLCQyFWIPLROGVSLCPPGWSAVAW 
SKLTT/CLCLPGSWDHRCAIiAHPANFLY 
FWWR/HSC 


730 


14631 


A 


736 


2 


477 


PSHVSNKRLISRIYKELIHLNSKKVSNP 
I*K*GQSIiNRDFSKASFLPP*KC\QMVN 
RYIKRCSTSLI IKEMQIKTTMRCHLTPV 
RLAIMKKSKDNKPSLVWDC* IRVLHLF/ 
CK/WICQYVQL/FWLLLSNFLVKQPF\Q 
UAPPNSPTLG 


731 


14632 


A 


737 


3 


2344 


AAGGPTAQSPAQLAGRAIiRIiARWRAYAV 
GACRPGAGSPCSVQGGAASELSPRPQTW 
IGSLKP* TPGAAAG* AHRGCX3GSALLN* 
ATPRPAPGLPASPTSSQALPAPLGAWGH 
SDHQPRAFP * S PQASTAI RKE KKQRAQ P 
GRASVCPASNPFISSRALPVLQHGPPAI 
SGAGSAVASQAPGSS/GSHAESGSPALA 
HTP*GS*EPHSLIVBSTRKS\ELPSSSQ 
GRIiLLPLLTPGVAS / FVGTKLPGATAAT 
AGALHQPGLRLS SIiQGVGGAKNKQTGCC 
CLQLPTTGLPQAPGAIjRPLGRLGPAAAP 
GEHRQRTSPQGTVPYGGIRQGWEQPQRL 
RAYGTALPPPHTPPGSSGP/RQAPGCRG 
SGAGEAAGIRDTGGGGPGRRAPCSQASP 
GRGG WQAQVGCETCRGCAQS S /GGGAVQ 
PGLPRKP YPHS / AR* ENLWPFPCS PRT 
RAQBPQTQGEE/ GVBGPQGSPCAPGAVR 
GRGIQI*SSEPGKLARQG/PASGDGP*EG 
TGQEPSQAFSSARHPWPSEARMPDTAIC 
RNQAQ*LQLTSSQPSMGPQLRKSLPATP 
QPNSYWIXK3NSATLGRTTNTRRHC3GMSN 
FGARGDLLGTVPT*QPLMQRRKEKPRVG 
GEPVQSHTVCG*\PAGVSRGWPLRPMLP 
ERWRPLSASSGSQGRPGLHPPSIiA\CGP 
SSSPHRTCSP/GLDPGLPDAGSIKPPSL 
VGAGQGAGST/GLD/GPGLSLLS pgksl 
LPPSSPATGLSGIjGWAQSAF* SLLTVA* 
WLNPVPNGPSDTADCTPAQAPTAPAMLE 
NQANKSDFFFH 


732 


14633 


A 


738 


37 


450 


NQKKWPPPRSS KPARPLFFFJj *QKYIKD 
P*KTYQGCFFFSPPPKKKNPPPPKGGGF 
FFFF*KKKGGGPPP/QAKKRGGGGPPQK 
GGAKKNP PRGFTRVFKGPFKKRGPP 


733 


14634 


A 


739 


2 


426 


QEFVLGSAPGCGCWVGGEGRVSSPWGDR 
KVGGPSNSRHWNKGAGPSCPPGPSLAAG 
LPGRRAGSWAPVLW\PGPP*GLAQLPSP 
LW/PPSPPQEREPPLPD 


734 


14635 


A 


740 


21 


392 


AQEFKSSLGNMERPLPYQKKKKKKRPGG 
GA/RPLETQPFGGPGGGEKKGGKKKTSR 
P/IRGKPPFL*KKKN\CLAWGGGPAL*P 
QFPGGVGEKKTFNPGGGGLGKPKPPPFT 
PTRGKNQTPQKKKKK 


735 


14636 | A 


741 


1 


40& 


PSSQGG*EAGG*LEPRSSRPAWAI*RDP 
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/GSPEKGKKKKKKKKKKKKKKGGGGFKK 
TPGGARIKGGGEKKNFFFKGGEKKNLGG 
ILEKKLFFGGGKNGEKPPKKKKGLREKK 
KF*RGKGGKKABNPGGKKI FQKKKK 


736 


14637 


.A 


742 


394 


1 


RDPLEEAVCPFS DLQLRAGR TTAL FiCA V 
RQGHLS LQRLLiRLFVCLiCPAPRGGAYRS 
RQASLNRGGFHPVRAYLLLCLPKQASEM 
AGAPSPA*LPPCSLISDCCASNQ*DSVG 
/VGPSEPGEGYNLWRRFLS 


737 


14638 


A 


743 


384 


1 


FTGP YLXNNGYS FLFLYPPPSVLGNWQP 
PHPGRWPP*DVNFNQWIFVSL/ISSLK 
DEPAFSSTLYSF*QHFFFITYLPPP/CF 
FLIES CSVAQGGVQWCNFGSLQPPPPGY 
KRFSCVGLPSSWDYRCV 


738 


14639 


A 


744 


2 


410 


TPLNPLEVPLLNTSVLLASGVSMA*ADH 
SLIENNRNQIIQALVITILLGLYFTLXiQ 
ASEYFESPFTISDGIYGSTFFVATGFHG 
IjHVIIGSTFLTICFIRQLIFHFTSKHHF 
GFEAAV* YWHFVN/ VRGLFLYGS I YW 


739 


14640 


A 


745 


393 


1 


PPPPPRGRGSPPPPFSRKGGPKGKPPPP 
GNFFFFFKKRVFPF\CPGGFKS*V*GVP 
PPFPPKIFKFKGGTPFFGLWSPLKSKKS 
QFFWVPKPGERKKliFPPFFFFFFLRPCL 
SLSPRIiECSGAISAHRNL 


740 


14641 


A 


746 


2 


290 


KNLCRENYKYE * KKLKTMKDLCKE YYKV 
*MKKIKENM*KDKLMNFK\NNIKISTLI 
KVIYKFKAIPIKIPMTFFAE\IiK*ILKC 
V*NYKRP*IVKAI 


741 


14642 


A 


747 | 


59 


364 


CRFFFFFGLGGGGGGGGGGPPPFF*KNP 
PNPPKKI PNWGFFFRGPGFAPKKKKGPF 
LGEPPPWGAPIiYNPPPPFGGGGAKFPPI 
FFPRAGGPQKRVG/RPPPPPPPPPPRPK 
KKKNLH*LECVEGTEEPPTRPPTRP | 


742 


14643 


A 


748 


2 


371 


SSCLDLPKCWDYRHELLHLAHWLVLiGIF 
LLLLFCFVLRW/RS CS WQTGV * WS DLT 
SLQSLLP 


743 


14644 

i 


A 


749 j 


339 


198 


IADMEKV*AF*TEHKTNHISLNQN* I*S 
KVLTLFNSMKAERD*EGPEEKPGASTGW 
FMRFQE ISQLYN/ IDV*GEAASADGEAA 
ATCAEDPAKIPDEGGSPK* YI FNVD * AT 
FIRDLSWIFCTSCCSFSISTCCFTLHVM 
L 


744 


14645 


A 


750 


249 


2 


KrFKPPAFPVSFPPFPL*DSSSPPEP*I 
SGGRVGPI\FPPPKKGFFPKNPPGVFFS 
PPLRKKILLVPPPI*NLGPPKGPLKRPP 


745 


14646 


A 


751 


1 


288 


WNHMTDEGLVSRIY*QPLQL\KRKTEN 
PVQKWTKCPNRRFSEEDEQMARKHKQGW 
PASRAI RGTQSKI IMRCHFTHTRMARIK 
KEKEAGPGGSCL 


746 


14647 


A 


752 


494 


2 


ETGSWFGPQRItEVQWWIKSSLQP* PPW 
APVILPPQ/LSSG*DHRHVPSRPG*LKK 
KFFL*RWGLTMLPRLLLNNLPTSASQNA 
AI TGVSHCARS ASVFL I LRQGL WVWE E / 
GAQW*SALSHLPV* /LSR* WCQP * PLG* 
DSGSGSPLWRRLAYLPLCALWKEVTMCS 


747 


14648 


A 


753 


1 


533 


YLSPRLECSGGITAHCN/LRPPGLK*SS 
QLSLQ\SSWDCRPAPPRPAARLIFFCRF 
EEGRRSHYVAQACLKLPGS\SNPPTVA/ 
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FPKCWGYRCEPPHTGLCS/GLSEFGFCL 
WWGACSRSSHKNPGCGTA/CPAVFLHSV 
GYASHPS STSRTAFTLGWVLVFVFRDKG 
LTVFSQAGSTVGVILG 


748 


14649 


A 


754 


1 


346 


ELSKSTLPALCGGNDKAWMTAHRFIAWF 
TDYLKPTVENYW* EKKIRLKVLLLIDNA 
PGHPKALLEMYTE IHWFMPADTTSIL* 
PTHQGVI S TLKFCYli * NTFHKVQ \CY I D 
SDSS 


749 


14650 


A 


755 


1 


323 


EDQTSHNVLLSLNLIQSKALMLFNSMEA 
E/R/SEEAVEEKFEASRE*FMKLKERSC 
LQNIKVQGEEASADGBPPACDPEDLAEI 
TDEGDC I K* Q I FI VDKTAFYS KKMPS 


750 


14651 


A 


756 


1 


344 


ARQQPGDPARGGGRGRRL\QPRQAASFC 
TGTLARARHTCEGAGVLRRPADKIiASLN 

KKKKDSSYSSSSSRGGSSSSSSSSPPPP 
PPP 


751 


14652 


A 


757 


119 


317 


NRYFPFLRVKKFPWVEGPFFFLR*SLTL 
LTLVAQAGLQWRNLS SLKSPLPGFKPFS 
/PSA/LSRWDYRR 


752 


14653 


A 


758 


2 


363 


CILAIVKSAINRIAN*YI*KCSMSIITK 
RQA/IKRKNKTRRYQLIPVRMTLIKKKK 
RW* RCEEKGRLAHCWF \ECK*RQPL*KT 
K*RFLKKLKLP\FITAIALLDIYPKQI K 
SE/CKKH/CAILFIALFTIAK 


753 


14654 


A 


759 


358 


1 


KADFCFPPTFSPRVKKIFPPPFPKKWG* 
KGP/LPPPGENCVFFFFKKKRGFSPLGG 
GFFYFFPPGFPPPWPFKKLGFKGLTPPP 
GPPQKIIiVFFFFFFFFFFE/DGVLJjCRP 
GWSAVAQS 


754 


14655 


A 


760 


316 


2 


KATRSQDIRRIQVKLKEIEPQKPLQKTN 
KSRSYFF* KH* *NRTLTRVMENKREK/N 
E IDPIQNHKEDITTDPTEIQTIRKYYKH 
LYAHKLKKLEEVDKLLDTNTL 


755 


14656 


A 


761 


3 


321 


FPLGLWGSLGAKPEGQAPLYPPTLSKRV 
VPIiC/DKGRSAVTFNTVYSSGSSRWFP 
PPCM*IYRLCIKGRSSDPEQKKKKKKKK 
KKKKKKASKKKKAPKDSSSPKT 


756 


14657 


A 


762 


315 


2 


KTERW/ VFEKINKMGQS *DGLTKKNREN 
/V/QINKIQNERGGLSTGSTEIQRVIRG 
FCEPL*AQItiDNLEKMD*FLQPFHLPRQ 
NYKKKKI I *KKPISKVIELVIKNL 


757 


14658 


A | 


763 


226 


329 


NPTTLEGQGGRISKGPEF/CSRDEVSPC 
WPGWFQTPDLWRSVRLGLPKWWDF/RR* 
APPPGLNIiFFRMLSTHGSW/CQHPCFKF 
PTSAFQYIYIYIF/SFLSRSFTFVAQAV 
VQWHDLGSPQPPPPGFKQ . 


758 


14659 


A 


764 


335 


I 


QSQERPDS FQVMNLTLPRAPSMFIiVS PR 
TELS KSP *\P PAFLCPCVWPHSTLLSQT 
LKY* IKYP * SPTPFPSFVLF* RDRVLLF 
HPGWSTLAGS * LTAASNS WAQVTLL 


759 


14660 


A 


765 


100 


310 


H FGRPKRENCIjSP * V* DQTLHTHTHTHT 
HTHTHTHS RSENGGMRLTPHI PERVRW\ 
GIIC 


760 


14661 


A 


766 


315 


3 


I FPNPPR I FIQNPHPFLFGFFPKKKEGE 
Km^AMETPRFSPPPLGGKKAPTGFPPPP 
IGF*GPKPLIKGRGP*/PP*KNTKIFFF 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F»Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Argjnine, S=Serine, 
T=Threonine, V=Valine, i 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














FFFFF*DRVLLCHPG*£AVARW 


761 


14662 


A 


767 


1 


309 


KE P YKLTELHNE IRVACGHLS KKKKI TW 
VSDS + ASCPSKTRGNFIGPSAGSRAPSE 
KLBAHFHGCGSVNRVHLT/ CK*HKRRPL 
GVAHLEVSDQKSGRTSLALD 


762 


14663 


A 


769 


1 


316 


GRAPPPPPPPPPTPRVHTSSKSESEPER 
/DGRE*EPTRSLBRL/LFFASYFCLEAR 
QSTSALPPLPLSPRSGPT\VLLCPSPPL 
PGERPPNPTKLSREKQTKQTHRAR 


763 


14664 


A 


770 


2 


366 


AREIIILTF*QMITFLNTKGRT*SLIliA 
SLI I FIATTN\liLGLLPYS FTPTTQLFI 
NLTMAI PL* AGAWIGFRS Kt KNALGHF 
LPQGTPTPLKP ILP IIETI ILLIQPIAL 
AVRLTANITA 


764 


14665 


A 


771 


2 


364 


NAFPVISGAWTEYPLSHFIQRCRQKKE/ 
LNGLQ I GKEEE I FLFKDAML * I ES P KES 
VQKRLDVIYKFSRIAVCKINIQKSNIW 
YVWNTQFENEVKTI *DPQ/DIKHWGIIL 
SQRKKE 


765 


14666 


A 


772 


3 


376 


HE PLGKLKLS LLF I LAT YS LTVYS IL * S 
G*ATNSNYALIGALRAVAQTISYEVTLA 
IILLSTLLRRGCFNIiSTLMTTQEHL*LL 
LPS * PLAII * FI CTLABTNRTPFDLAER 
ESELLSC/FNIEYA 


766 


14667 


A 


773 


3 


350 


HEFFFFILKMYLGQAQYFICFLFFFLGF 
HLK/HKSCSVTQAKVQRRHLGSMQPPPP 
GFMQFSVAEIKDVHHHAQLIFYIFRIFI 
F* F / NFLRHS LALVAQDG VQWRDLGTLQ 
PLPPPG 


767 


14668 


A 


774 


1 


359 


GTRYAAMLSALGFI FLFTARGLTGI VLA 
NSSLDI VLHDTDYVGAHFHYVLS IGAVF 
ALIP\GFIH*FPLFSGYTLDQTYAKIH\ 
FTIIFIGANLTLLPQHFFGLSGMPRQYS 
DYPDAYTTW 


768 


14669 


A 


775 


2 


369 


ARGS ICLRQTELKTVTAYSSIRHIGLW 
TAILIQTP*SLTGAVILIIAHGLTYSLL 
CCLANSNYERTHS\RI I ILSQGLETLLP 
LITF* *LLARLANLALPPTINLLGELS V 
LVTTFS*ANIT 


769 


14670 


A 


776 


2 


353 


ARGTGA* VDS * LTTLHG^NMK* CAA\ LL 
\WTLRCKILFTVRGLTGMAITNSTLDIA 
LHDTYYWAHFHYVLS IGAGFAI IRGFI 
H*FPLFSGYTLDQTYAKIHFTriFIGVK 
ITFFPQ 


770 


14671 


A 


777 


3 


353 


HEGLHL* LPKAHVETPMDG* IALDAGLL 
RLRGYGI IHVTLILNPLRKHILHPFLVL 
SI*GIIITSSICLRQTDLKSLIAYSSIS 
HI \SLEVTAILIRTPL\SFTGADILI 1 3 
HGLTCS 


771 


14672 


A 


778 


367 


2 


FCPI / CPNQKFMGVGAVGPALYPNPFGG 
LGGLV/TPGAGVLNPAGPPG*TPPPPKN 
PNLGGGPALFFPPLKGLGWKIALTPEAK 
GSINPNSPPALPGGGPNQTFSKKKKRKK 
LMLVYS IELTSRA 


772 


14673 


A 


779 


3 


432 


HEPLHWLSSCVCPACVCVCVCQ/CLWLQ 
VGVQ*CMYSSWQHEPLH*VHTRVYVCVC 
LCHS VAQAGVQ * CNYS / CTAACNSW 


773 


14674 


A 


780 


213 


466 


DDILPVWNYSISFLFFFENRAFFLPPGW 
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Amino acid sequence (A=A)anine 
OCysteine, D=Aspartic Acid, 
E=Glu tarnic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoieucine, 
K^Lysine, L=Leucine, M=Metbionrae, 
N»Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GAGTQL* LPAASN* GPKGPSRLTLL * W \ 
DNRHGPPC/LDNFIFLQKKNLTLLPRLA 
LN 


774 


14675 


A 


781 


304 


373 


R*VFFFPFLKKEFHFFPQAGGQNQEMG* 
LHPLPPGAG/RF* CLTPPHSWDNGEGPP 
RPPNFCFFRGNKVS PCGP 


775 


14676 


A 


782 


389 


1 


PFPKN* CRVNSRPNVKHKTI / QLLEGNL 
GAS VDDUEFGDDFLDT I PKYQSMKERS / 
WDFIKI KNVCFAKDNVKRRGPATNWKKI 
SVKDLSNKGLLPKIYEEIiLKI*NNTET*R 
LT* KWSKVLNRQLTRETRA 


776 


14677 


A 


783 


1 


383 


GTS P * PLTGALS YLLMTCGLAM* *HFHS 
ITLLILGIiLANTIiTIYPGRGDVSRQSAY 
QGHHTPPVQKGLLYRIILFITSEAFFFA 
GFF*SF\YHSSLSPTPQIiGGHWSPTGIA 
PLNSL*VPLLNTWRLL 


777 


14678 


A 


784 


1 


389 


GTSIVIPTGVKYFN*LATLHGSNMK*SA 
AEL*ALGFIFLFTGSGLTGIVIANSSLN 
I VLHDTYYGEAHFHYVLS IGAVFAI IGG 
FIH* FPLFSGYTLDQTYAKIHLT\ 1 1 FI 
GVNLTFFPQHFLGIiSGNA 


778 


14679 


A 


785 


374 


1 


ELNAYWNVMNLiQNL I WNAQ PLS IMQ I FQ 
ILIKSQIQNTLWSISDTGYLPGIDKWN 

CI *LKI lcivkvtinrarmpvidwentf 

STYTNDKGLIPKYKELKHS\KQTNNLIK 
KWAKGLHSHSRA 


779 


14680 


A 


786 


.1 


363 


GTRLYHANTN*KKLRVAILISKK\TDFT 
VKKIRNKEGHYIMIKRSIL*EGITILYV 
GTPSNRWNYIRQKLIKLPGEIDESTII 
LRDFNTPLSV1DASTRKKISKNIVESNN 
IISQLDLID 


780 


14681 


A 


787 


1 


361 


GTLFSSERKNPTWVSLNQKLEMIKLIEE 
GMLKAETCHKLRLLQQ/TSQVVMAKEKF 
LKBIKSATPVOTRKTRK*DSLLADTGKV 
LVACIED*TSHNVLFS*SLIQSKAIiTLL 
NCMKPERGE 


781 


14682 


A 


788 


1 


352 


GTRNYAKSTKS KL YRWNYKAWMTAYLFT 
AWCTEYFKPTVETYCSEGL/SLKILLLI 
DNASSH*RALMEMYKQINVVFMLDNRIS 
LLQPVDQ*VILTFKSYLRNTFHKAIiAAR 
DNDSSD 


782 


14683 


A 


789 


365 


128 


PLDQHGETPLLLKIQKLARRNGAHLYSQ 
IJLfGRLRHENIiLHPGGRGC/SHCTPAWVI 
E * DCLKNQTKTKAPRRVS I SHS A 


783 


14684 


A 


790 


217 


257 


WSGGYAHPVIPATQEAEAEELSLGGRDC 
SELRSCHS \CTPA* VTQAGVQ 


784 


14685 


A 


791 


2 


363 


LAILHQTVS * FVHAKEKFWKDLL \KS AT 
PVAI*MIRNLNSLYNLIMETV*W*KED 
QTSHLI PLS*SLTQSKALIFFKAMKTDR 
GKGAVE * KFEATRGWVMRLKEKFCLS /H 
HIKVQG 


785 


14686 


A 


792 


2 


362 


GQKLGIiliHQTVS KFVNAKEKFWKEIiL \ K 
SATPVDI *MIRNRTS\I»ISDMETV*W* 
KEDQTSPLIPIR*SLTQSKAVNLFRAMK 
PVRGKGAVE* KYEASRGWFKQYKEKSC/ 
RMCNIKIQG 


786 


14687 


A 


793 


2 


360 


ARAGSTMAFKNYAKSTLPVLYKWNKKA* 
MTAHLFAARVTKYVKS/YCLNKKIPFKI 
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C^Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, 1-lso leucine, 
K=Lysine, L= Leucine, M=Metbtonine, 
N=Asparagine, P=Proline, 
Q=Glutamine, RpArgtnine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














*VF IDIVPSHPRALIGIYKE/ INVFMPA 
NTTSTLHPTDQEVISTFKCYYFKNRFRK 
AIQIiPYSNSY 


787 


14688 


A 


794 


2 


359 


AREKLYIiS ILTPLQRSLVIAFAATELTR 
YYILFETTLIPSLAISTR*GD\QPERLN 
AGTYLLFYTLVGSLPLLIALIYTHNTLG 
SIjGILLIiTIjTGQELSNS*ANNLI*LAYT 
IAFIVKrP 


788 


14689 


A 


795 


157 


365 


GIIEEKGYLPEQIFNAKCSGMISAHCSP 
RLPGSSYSTSA*TDF*CQVQWYDLSSLQ 
PPPPRFKLFYLSDPSTWDYILAPP/RPS 
* * FVFFVEMGFRQTPE 


789 


14690 


A 


796 


224 


372 


I FF I F I F I YLFTE WRQSFALVA\QAGVQ 
WRNLGSL* PTHPPPRVKRLSCL 


790 


14691 


A 


797 


482 


3 


NSFFFFFFLFPEAKDHKDFPSPPFPVFK 
KLKGFC* APP * VQKPVFYLKSF/OiKDY 
MLSFPESKVNFPSPPFL/SPSPLFMVGD 
PPLPPETYFSFCF*DPHSINFFLPPGPF 
LFHFFFFLRWVLLCHPG*STAV*SLLTA 
TS\VK*FFHLSIiPSSWVYRCTSC 


791 


14692 


A 


798 


1 


415 


NLGGGGCSEIjRS YHCSPAWATE * DS ISK 
QTKTL/CTKDHTRAGWERA 


792 


14693 


A 


799 


2 


401 


VQTGFHHVGQAGL * LLTSGNPPAS AS /Q 
SAGITGMSHRARPQ 


793 


14694 


A 


800 


73 


307 


PMALEHHGCGMCLDFLPTFGKSHCFVLR 
CAEMETRSFLPSWSAGA*Y\CLLQPPPP 
RFS * LRLPSRWD YRHLLPCPAN 


794 


14695 


A 


801 


87 


401 


SLIE I WTLKKSTCHNL, YNWLS * HS LKA 
IVLARHS/VESLTHVLKMCLV*NFVFAF 
S*RSLCFFLKISRPVWAHVCNPSrLRG 
* GGHITRSRDRDHPSQHGE I PS 


795 


14696 


A 


802 


3 


354 


LRHYTP\PG*QSETLPFKKKKKKKKKKK 
KLSFFPPPKFLKKKKGLFKNPFKKKGKI 
FFNPPPPKKKGFFLrVNPPPKKKNPPPL 
GGGGPPKXIYF*KTFFFAPPPKFENPFF 
FFSPRF 


796 


14697 


A 


803 


1 


830 


VETGFLHVRQAGLKLLTSGDLPTLAYQS 
AGITGVSHCAWLFFFF**CLAVTQTEVA 
PS *LTIASN\PGLKLSSFFTI»PHHARLI 
FKIFSRNEVIiliFSR\ psqtpnlmqsscl 
slpkcwdyrceplypaeslsf\fiklsc 
mlkclevkcndvcnlii* ntl i kw inew i 
egwigrltcvkqieknvicri*vmlkat 
qpvgtbps *esrstnsgeamw* skdntp 
fpvfvsvfvlrqs/ lcfvaqagvqwshl 
sslqplsprlk*ssclsllsswdlrara 


797 


14698 


A 


804 


389 


3 


FIWPFKGAPENSFFVFMWIKGHKCPKKG 
FGRKKNQLGPKIPF*KKKKKGAFPLWLK 
RFQGP FFFFFEMEFHS \ I AQAGVQWCDL 
SSLQPLPPRFKRFSCLSLPSSWDYRHAP 
PRPANF\CVFSEDEVSPCW 


798 


14699 


A 


805 


161 


21 


RGGGCLQS Q FR/ RRLRHENRLNPGGGGC 
TEPRSYRCTP AWARE *DSVSK 


799 


14700 


A 


806 


405 


82 


FFLRWTL / DTVTRGG IQWCNLGS PQPP/ 
PPRFKRFSCLSLSSSWDYRRPPPCPANF 
LYF**RRGFTMLARLVUTP*PRDPPASA 
SQSAGITGVGHRAWPMPI FENRFDL 


800 


14701 


A 


807 


938 


2415 


KITFWETFWITTVHPHLCKEREAIAGIL 
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N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=$erine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possibJe 
nucleotide deletion, V=possible 
nucleotide insertion 














RRWS *NNW/MIHTHTKNI^PYLTPE)TKA 
TFK* IIGLT*RAKTMQL/LESFCQKENT 
GENLSDLGVGKDFLRHKKHGS IKGEKIA 
KLDPIQVKNF*SLKDTFKKMKKYAEjGWE 
KIFAERVSDRGCVSRRYKEL*L/IEIiKD 
NP IRKGGNNLNKVHQRI *MANKHMKRCP 
KS*VIRE I *I*TIMRYHCILPRMAVMN/ 
SD*SHGDKNGGSSGTLIH\CR*ECEMDQ 
LLWKTIGQFLSK*M*ROrTDSSIPFLII 
QEK*NCI STKKTCTQTFTAA/ MYLLVXA 
KNWKQfcPYPSSVWI/QQIWCIYT\MEYY 
SAIKRTVDTLNN\QMDLKIIRFWE*SyH 
NLRRVHCMY 


801 


14702 


A 


808 


414 


1 


KPKAKLFVPVRQVVNAKEKFIiKEAKSAA 
PLN\RMIRKQTSLTADTEQV*VIWIEY* 
TSHSIPLSQSLIQSRVLTL\SMKAETGE 
KAAEENLEDSRG *FTRLKEKSHLHNIKV 
QGEAVSADGEAAAGYPEDLAKI IDER 


802 


14703 


A 


809 


399 


1 


TGPPPRNQPFNKPAKGNFWLFLFFFL*F 
/ CNFFFFFFFTGAFLSLFGI \ LV* CLML 
VWGNFSP * PFKYCFTFSFFLSFI FLIFL 
WLLLNTCL YICYTFCS YLTVL * CS FLFS 
S IFFLLYF* FWRLLLIHPPPTRP 


803 


14704 


A 


810 


1 


392 


I IiLAGAIEDAEPGSG * TGYPTLTGS YSH 
PGAYVYLT XLSLHLAGDSSILRAINCI \ 
TTLINIKPPAVARYLTPLFV* PVLDTAM 
LLILSLPVLADGVTILLTDRNLYSTFFD 
PAGRGDPILYQHLL*LLRH 


804 


14705 


A 


811 


66 


411 


RETRAGAQGLSRFS PPWGNPSQKI FLGP 
GFKKNPGPQKKGNFFFFF*KKFPLVSPG 
GRGGGNFRSLQGPPPKVPPFFCFNPPRK 
GG* /MGAPP 


805 


14706 


A 


812 


3 


392 


LSVPQFPPFFliKKTFRNCPPFCFSPPNI 
LGFPLKGSKK*GS FFPFFPPPNYS FFWF 
PSYSF*NQPCLTRGPF*NQKFPRL*KGV 
P ILRL* S PLFFGPPPHSF/ IFFFFDRVS 
LCHPGYSAVAQS/LLTAASS 


806 


14707 


A 


813 


690 


153 


IISIDAKKAFYEIPPPSVISSSSPTDSL 
YYNLGFLKMFKKGREVKYLNI IKYIYEK 
PTANI I F I SGKLKAFSLRSGTRQI CPLS 
PLQFNKVLEVLARATWQEKEIRNIHIEK 
EEVKLSLFVSNILCIENPLKMP * KNSSN 
* * IQRVAGYKTNI *KI S\AFLYTKNKLF 
CCLSHPIYDIPL 


807 


14708 


A 


814 


32 


376 


LFGLARS YITEGGRLPENPTTPHG *REF 
WELCNKCD\TMRPKPSLHCSRCGHCVTR 
MDHHCPWINNCVGEDKH* LFLQLCFYTE 
LLTCYAIiMISFCHYYYFLPLKKRTLVRN 
VYI 


808 


14709 


A 


815 


2 


423 


YPLYPFKIFIFPKGFNFCREVGPICPPP 
KIKVL5KNSQVGFYTAPYKEKNNTLPAR 
VNFGPPKDSLKRPPLFFFFFF*RDGILL 
CYPSWS *TPGLKQSRCLGLLKHWDYIC* 
PLHPASF* KKHYSLRILP / SLLADAW 


809 


14710 


A 


816 


404 


54 


SLAHFFPPPKKGFFPKIPRGVL*PPP*K 
KKFLFS/HPPVNLGPPRDFIiKGPPPSSS 
SSSSSSSSSSPGG*MIFY*/P**FGPAL 
VQPHACYCKASCKVXAPFFSTHSGLMVA 



571 
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N=Asparagine, P=ProIine, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














PPTTAQS 


810 


14711 


A 


817 


3 


410 


DAEVGGSLEPG\SGGCSEP*SCHCTPAW 
VTG*ETVSKKKKKKKKGFFPPFFFKRGP 
FFREIEKKFGKKFFSKKKKKKKKKKRGG 
PFKKNFWGGKNLRGGKKKFFFFLGGEKK 
KPRGFFEKKTFFWGGKNWATTTQK 


811 


14712 


A 


818 


1 


404 


IPINSLTSKINKLLKRHKLP/HLNQEEI 
DNLTRTXPI*NINF*IYKQNTGPDGSIS 
KFY*TIRKT*Q*FYTSFQKIKNEEVLPN 
SFYEASTPLASPKKKKKKKKKK 


812 


14713 


A 


819 


421 


2 


LPPKRRCFSPKPPRGFFFAPS *GKKNI / 
CFPPR*NLAPPGVFIiKGPPLFFFFFFFF 
FRGRVG*SWVPAASPLC2GQVFI,PPQ/SS 
* VGGSPGPPEHARVNFFFFFFFFF* F* * 
RQSLTMLPSLVSNSWA*TILTPQPPKVL 
GLQ 


813 


14714 


A 


820 


84 


418 


ILTCHNARLGBK5QIVTNQASQNNPKLV 
NICEVFLVFFFWKP I FFGTQVGGQYFNL 
SSL* VRLPGYTHFFCLTLPKS WDYRP I P 
PS\P IFFCFFNKKGVSPGYPGGCNILVF 


814 


14715 


A 


821 


290 


3 


FSPPPENLGPPRNFLKGPPPFFFFFFFF 
FFFFFFFMFCVLIY\FIFFCTNUiFLDF 
CIT* *T*NWFISSKAVCLFLFLPLQPAS 
QKTKNRCRKSFF 


815 


14716 


A 


822 


416 


2 


ITKIiTEEKKIWSGKRKNDYVKYAGASQN 
YTAWLRPQSMITLKDSSERKAPHWGEL 
NW\HVHHFIWKER* PEV* IYIES*AID 
GGLASWSGPEKNKIGKLGAKKV*RRDMK 
TDPLECTQNINI FGFVCLFGDRVSIjCW 


816 


14717 


A 


823 


409 


45 


PPPPFLFFFF* KKKKAPGGGSQGPKF* P 
PPPPKGKPPFFKKKKKKPGGGGGPPNPP 
PPKGGGQK/ QFFPPQGGNFNKPKSPPPQ 














PQFLQLESGTY 


817 


14718 


A 


824 


I 


408 


ifsehngikl* intkrnfrnctniweiis 
nmllnnh\*vnken*nkkfiktne/nkn 

ITCQNL*DTTKVVliREIYSKKCIiLQKKK 
KKKKKKKKKKGGGALKNYLGGGQFYGGE 
GNFI FFFWGGAKKPCGWIFRRRPFFW 


818 


14719 


A 


825 


259 


274 


YL* YLLLYRRYSA* I PL/LGY YLKKRKIi 
LYQRDTYTPMFIAALFTIAKMWSRP*SP 
W 


819 


14720 


A 


826 


422 


105 


FFFSKIPWGNFFPPPPKRIFFPPFPPKI 
FFFPPPPFFFWGGFSPFSPPPKKVFFPK 
SPPVFFFPPP*KKKFFFP/HPPLIFPPP 
GFFFKPPPPFFFFFFFFFFFFFFW 


820 


14721 


A 


827 


15 


402 


I KSLGKNIGINLGDLVIJSINGF * YTLPKA 
QAT KB KIDN/ WDLAKAKNI CA* KDTI KB 
VERQPKK*QQTFTOHLFDKGLVSKHV/N 
EL*LINKK™H\HL* * WAKD*NRACSKB 
DVHMTDKLMKDVP S LVI RE I 


821 


14722 


A 


828 


420 


3 


KDAHTCP PGCKGGFSGHRALKAI CRNQG 
FLLTTTEFLTHKCPLPGQE PWGQHQG*R 
DSPRRVSPAPGTWQPPCHRERACLAPSA 
VE/GPS* IQEQEKSLLFFFFSLLRWSL/ 
NS VAQVGVQWRSLGSLQPPP PRFKQFSC 
LS 


822 


14723 


A 


829 


1 


269 


I KIKNLARHGDSCL * S PLLLEMLRLEEN 
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Predict- 
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nucle- 
otide 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, {^Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=^Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possibfe 
nucleotide deletion, Y=possible 
nucleotide insertion 














LSPGSRGCSEL/ CQPARQGETLSQTNTKQ 
TNKKPHQVQLPCFEIINFKEFITCIiVLR 
LPGNSW 


823 


14724 


A 


830 


432 


1 


GYNNQQIFDVDQTAFY* KKI PSRTS I AR 
EEKSMLGFKASKDRLILLLGAKVAGDFK 
LKPMLVDPS^TPRELNN*ARCTLPVL/V 
KWN/ KAWMTAYLFTV* FIQYFKPTVDTY 
C/FKILLLIDNAPGLPRAVMDR*EESNV 
FMPVNTTL 


824 


14725 


A 


831 


1 


233 


KKLQIKYLGINLMKYTIHVVSICX3KLLM 
KEIOTDLNKWRDVIjCLWI*RIiNNMSILH 
KLIN/RINLLQLRI 


825 

• 


14726 


A 


832 


3 


477 


YQTCKEDLTQMLLKIFQKI KGEGI IPNS 
LYDASIAMMPKSDKDRTKKF\NYRPISL 
MSIDAKILNKILPNQIQQHIQKFIFIPE 
MKG*FNIRK* INLIHLITKMKTKTPMII 
SIGTEKVFDKIKY/PFPITTLTKLGIDG 
RSLDAIMVARE 


826 


14727 


A 


S33 


3 


402 


RYQTPLLV\ * TVLITTALLGLSLPVLTG 
GITI PLTDRRLDTTFFDPAGGGDPI LYH 
HLF*FFGRPBAHILILPGWGIISHIGTY 
YSGKKEPFG YTGMR+ AI I S IAFLRVIV* 
AHHIYTV* IDRDTRAYLPSLHN 


827 


14728 


A 


834 


7 


395 


DPQRVSCMALSSNSFFFFFWERKSLFFP 
GREGRGQIWVNGTPPLQGKRNSPASPPG 
GGGITKKAPKLHPPKKGWGGGLEKPP*N 
QRGGFGI *SPGGKKGFAGG* KQGQK5KG 
GGKKKGRA\DP* AYI PLK 


828 


14729 


A 


835 


41 


444 


DPRVRKKERBNLYRSVSIKEIESVI*NF 
PTKKI *GLDGITSEFIQILPKKKPK\IK 
KERTFSNPFDKTNITLl/SKPETTFTKk 
EN/ VRPVSLMNI /DCKILIiKVIiANCTL\ 
HI *REIHHDS /INFIPAIQ IS FNIQKTN 


829 


14730 


A 


836 


3 


415 


HAYY I VKPS P / W I PKGALS ALLKTYGLT 
M*MQFQSIKDLRLGLLTNTLTIYQ*WRD 
VTRKSTYQGRHTPPVQKGL*YGIILFIT 
SEVFFFAGIF *AFYQSSLAPTPQLGGHW 
PPTGBTTPIiNPLEDPILNTSGIiLGIGVS 


830 


14731 


A 


837 


378 


0 


TP PKGPGGKI FLKK/SPGRKI F* PPGNG 
/PFFSPLSPLKFFFFPKAFNFWGGGGPQ 
GPPPKKGVFSQNPPPGKKRPPQKEKKNF 
FP/PPGKMGPPQGFF*RAPPP 


831 


14732 


A 


838 


383 


38 


GPGEKIFLKKPREKKLSPPGEKAKFFPP 
SPLKNFFFPQGVFFLGGGGPKRPPPKKK 
GFFQKTPRGF* IAPQKKKKKNFPPRGKF 
GPPQ/RIF*KGPPPFFFFFFFFFFFFFF 
WSL 


832 


14733 


A 


839 


59 


418 


TKI SS KDQKQY \TTPGW*MAE I STTFED 
AEI IDALIFPFNLPIWLHKKPDCSWKVS 
VYCKLK* WSLISATGLDI INT\LLEEM 
KRVSGTWYTATDLAKTFFYI/PDQKEFA 
FSWYG*KDIF 


833 


14734 


A 


840 


5 


335 ; 


IRHLF* INKFENLDKMDrr*KLTYEGIE 
SLN/RI I +NF/PWKSLGSGS FTSEFYS 
LFKBE X TAI YKP I WRVEKGGI \LPTCFA 
TRITL I P KPETC I MRKKNCGPVLFMNGH 
F 


834 


14735 


A 


841 




38 


RRLKLEDHLSLY*KINSKWIRELNIR/P 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoteucine, 
K-Lysine, L= Leucine, M-Metfatonine, 
N^Asparagine, PHProline, 
Q = Glutamine, R=Arginine, S^Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion \=no<wihle 
nucleotide insertion 














ETI KLVKKDKGNV \QDTDPGKDFMVKTT 
KTKI DKSDYI KLQS FYTANETVNRVKLQ 
PIE*GKIFKNYSSGKGLLSMIYNELKQQ 
HRNNNL I * KLEDHLS L Y 


835 


14736 


A 


842 


2 


474 


GRVGS I PLSOSL IOSKALTIiFfi FMKAET 
GQEAAEKKVEANRGWFTRFKERSGLCNI 
KVQ/GAGGDTAAS S PADPAKINDEGGHS 
/ KQQ I FSGNETAFY*KKMSSGTSIAREE 
KSMP I FKAS KDRVTLIj TG ANA ARNF\ KT, 
KPMP 


836 


14737 


A 


843 


281 


3 


FFFFFFFFFFFFFKTFLFFGYWFSIFFP 
LLFS I SLiLLS FSGS FVTFAFQPFF* VSR 
YHYQHFNSTF*FIt\KL*FIF*IFFLISI 
LFLFHGYNF 


837 


14738 


A 


844 


409 


1 


RCGGTCPWSPVLRRLR*KDHLSIiRG*GC 
SEP * LHHCTPAWATE\DSVSKKNKKKGI 
SCRQHIVGSFFFHLKMCIiLNGLLSFFTL 
NVIIYWEFKFTII^LFLLYSICSLSHLF 
S FPAKS WINSKFBSLFVCFNS FS 


838 


14739 


A 


845 


63 


451 


KNQEQESFJSTLPNIiFYKAS ITLI PKEiDT 
QK* K30RKEKKRKKKK\EN * KP I THRNI Y 
AKILNKILAHQIQQYIGKIIHHDQVGCI 
P/GAFDKIQYRCMIR/TLQKMGIEGTHL 
NI I KAI YIRPTDS I IENRRICP 


839 


14740 


A 


846 


60 


460 


RNNEPFLDQI VTYFEKWILYDNR* * / PT 
QWLDGEEGPKHLPKPNLH /QKKVWTVW 
WSASLTHYSFIiNP \ETITSENYSQEIDE 
MH*KLQSLQLASVNRKGQIPLHDNPRLQ 
VAOPVLOKINELGFEVLPHPP YS PR 


840 


14741 


A 


847 - 


344 


2 


NFLKFHPGGETFLQKGYTGYFSPFTP*K 
FFFSLKPLNFIiGRVGPFFPPPKKGFFSK 
IPQGGLISPPLRGKVFPPPPRLNLGPPR 
VFLKAP/HPFFFFGI \ PHGIiOPRPMI * K 
PTRP 


841 


14742 


A 


848 


439 


132 


RRVAAPPPSKNIFFPPGSYNCGGVWPQK 
SPPPK*GFFPKP\SGVYKNPPQIEKVFF 
FPHARIVPPPGDPIICrPPPIFFFCFNSI 
NFSLRGAQLYVWDFLGSGC 


842 


14743 


A 


849 


94 


470 


LNHFLSFFLSFFLSFFLSSGSLSLSIjSF 
FFFWKGSPPPP*GTK/ qkpclkkkkkkk 
GGGGGQNFFTPPGGKKNPPPQGGGGGGS 
P AQ KKKP / EGGGP PGS KKKNPGGGFKKK 
KPPKKPRGGGGKPFFP 


843 


14744 


A 


850 


2 


396 


FFKKF^IQMANKHVKTYTTIiLVIREMKI 
PTSMRYHFTSIRYAKI1jKTDNTRC*QEG 
RTIRILMLC*LQN1CMVLLL/WKNWQFLT 
KIKI FLF\FDPTISIiLGIYPREMKTYVH 
SSFIHGNS KNNRTGNNWVHN 


844 


14745 


A 


851 


369 


36 


S KGASDILEKPPLF * EVGLPTPS P PLGE 
KMP/ FNYKGGPGKGISLLIFPKKKFIPP 
GGHFPFPFP/ PFFFMDRVLLCHPGWSAV 
AVSQLTTTSASRL\K*FSCLSYSEG*DG 
RIS 


845 


14746 


A 


852 


I 


367 


PPPPKIKSPSGPGPPSKRGSPGAPPRGG 
KFWFLSQKGWGGP* FPPPPGVKPE /NIP 
* PPRGRVPL/HLNLAPAPPPGPPK/PKP 
PSP 


846 


14747 


A 


853 


1 


378 


AMLATLISNS*PQ/ CDPPAXAPQSAGIT 
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Amino acid sequence (A=Alanioe 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
KNLysine, L=Leucine, MhMethionine, 
N=Asparagine, P=Prolinc, 
Q=Glutaniine, R^Arginine, S==Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 










• 




GMSPCARPTSCSFYP* ILLTVRTFIKIR 
NFFFFFDTGSNFVAQPGGEGHDHG* * * P 
QPPGPNKSFNLSLPSRWDYGGAPPCPVI 
LCIFCKDEVSSCGSG 


847 


14748 


A 


854 


369 


1 


SACFGLPKCWDYR/R*ATTPSLEIIIYI 
ER*EKTTS PDNANRFDVTKLHHYDCS VM 
DF*KAL*NGDDFP VLMSKMHMQS LNI I \ 
FVFCFLRQGLALSQ/AGVQWRDHGLPQP 
QIiPRIiQPSSHLSIj 


848 


14749 


A 


855 


380 


335 


NPGGRGCS ELRLCHCTPAWAT \ S ETLSQ 
TKIQPKKQEV* LL 


849 


14750 


A 


856 


373 


1 


LRI KI LNKKGLANL I P * *R * KI IHSQVE 
FIPEMQA*PIIRK/TPSIKWPIVRTKKK 
KKSQMLISIDTEKI FDKIQHLFLVKPVI 
KLGIEGTFINFIKS IY\KKSTTATI ISG 
LNASPLRLKTRQDF 


850 


14751 


A 


857 


369 


44 


KVQNLYFEKKLLNKIKVDLNKW/NSLCT 
RFRKLNIANI IMFKLIHR YN/S KVKI P I 
KIPANCFGEIDKLILKFI * KFKRPQIAK 
I ILKKNKDGYILPDFKTYYI YMQYHLC 


851 


14752 


A 


858 


1 


348 


QWHAPAS/LARPPPPRFK*FSCXSLPNS 
WD YRHAP PRLANFVL FCFVG * GG IHL 


852 


14753 


A 


859 


1 


366 


CTSPTFNQKLEMIKL/EEGMLKAETG*K 
LCLLCQKV/ QWNAKEMFLKE I * S ASLV 
NTTMI\RKQSFTADMEKV*W* IEYQTS 
HNIPLSQSLIQSKALTLYSSMKAERGEE 
AAKEK/LEASR 


853 


14754 


A 


860 


12 


366 


PSTLGIiRRASCLSIiLSRWD*RHMPPHPA 
N*KNFFCRDG/SLTMLLRLVL\HSWPQA 
IHSPWPPQ 


854 


14755 


A 


861 


424 


78 


NPRPLEGQTAS FSLAPNF* TTLG KKVNP 
FF/SIiKNPPPPPPPPKIWAPQGPFGPPS 
PGGLNGGFFLPPRVKAPIIHKGAPPPQF 
PPQKKKGPRVFPKKKKKKERKKEKGMTR 
AFCYP 


855 


14756 


A 


862 


327 


3 


SHWFFAAV\GREISM*AMAPDQTKKICP 
RSAEDAI KYFLTQATGS IIIilRAILPNN 
RLSEQ* S I TNTTNQYSSLI I IMAIAIKV 
GMAPFHF*VPEVTQGSPLTSGLLVti 


856 


14757 


A 


863 


1 


341 


YDRNKWKDIPGS* IERLNIVKLSMLPTV 
IYRFNVILIidPMTFFAETEKSIVKFSR 
DYE* PKQS * KEQNWKTHTP * FQ /QFFTA 
TVIKMVWY*NKDRYIY/DQWNRI*SLGI 
NPCI 


857 


14758 


A 


864 


32 


324 


LVWPFFFFFFFF*KKKFFFFPPF*RKGK 
NFFFFTPPPPGLNLFF/ CPHPLKKWEKR 
APPPPPN* FFFFLKKTGFFFFTQKGLNF 
PPLETPPLTPPKGLN 


858 


14759 


A 


865 


350 


3 


RVKNPRPFWGF*MVLKPLSFFS KKTN* I 
LFPLKIFSPPKTVPWGKIFLGAL*NPFF 
CFKNPLWFFGF*KL/SFFFPPLYFF*KP 
LAPLKRRFS FFFF FFL * DGVSLCRPGWS 
AVARS R 


859 


14760 


A 


866 


342 


118 


GSVTQAEVQWCDHGSLQPETPGIK* S/H 
P PTSS YQ WGTTGV CHPGLAMLSRLVLS 
SWPQAILSPWPPTVQGLQV 


860 


14761 


A 


867 


1 


354 


VKPSP* PLTGALSALLMTFGLTM* IHFH 
SITLLILGLLSNTLTIY\Q*W\RDVTPE 
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Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E^Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histtdine, Msoteucine, 
K=Lysine, L= Leucine, MNMethionine, 
N=Asparagine, P=Proiine, 
Q=G tu tarn ine, R=Arginine, S s =Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X = Unknown, * == Stop codon, / = possible 
nucleotide deletion, possible 
nucleotide insertion 














STYHGHHTPPVQKGLRYGIILFITSEGV 
FWAGSV*TFYRSSLSPTPQLGGHWPPTG 
ITPLN 


361 


14762 


A 


868 


3 


344 


QIKNPDNISC/WIRCX3ATGMFLHCWWEC 
KLLQPLWKTT * HQ * SRR* AVPIiSCaiFP / 
NRYSSTCTPAS I * KTFLS ALFMMTPL / C 
LSRVEWIK*SWCNQKL*Y*SVMKRSBEF 
SLKNA 


862 


14763 


A 


869 


345 


1 


KQAGLKNVKITG/EGASVNQSR*VS/YL 
GTIKKI TEEKGYLPERVFNAGASAVF *G 
KKLPQRTFISK/EEKQAP/GSEVGKDRL 
TLLFCANMSRFM ISTALLYKAADDQSLK 
GKDKHRL 


863 


14764 


A 


870 


340 


45 


FCS FCRDRVSLCCliG WSRAPGLK*SS CL 
SLPKC* DHRC/AA/ VPAWFQRCI LEKKN 
QIY* CAERIL* SERPHTRHLDSSTVNIL 
PCPLSKVIFWKKKNQIY 


864 


14765 


A 


871 


209 


72 


KLWD IHTMEYYYSAI KRNKPLKYTTWMD 
LKDIMLCEKANLKRSHTV 


865 


14766 


A 


872 


3 


353 


SFFLGPP P FFFFFFPPPQPLP I FFWRFF 
FCFSPPKKPPP/PFFL*GGSPPPPF*RG 
FFFF*KKKGGVFPPPFFFFFSPKVSPPP 
PKKGGGG/ PPPPFFF 


866 


14767 


A 


873 


3 


267 


DFTMLARIjVSNS / WPG WAHTCDPSTLG 
G*GGW ITRSGVRDQPGQHGE I 


867 


14768 


A 


874 


1 


346 


PRRPFFFFFFFLIVFIYFKCVYLF*DKV 
LVCHPGWSAVPHHGSLQP*PLRIKQSSH 
LSLS SS / WDHRHAWLVFVS FYRDEVS PC 
LANFCIFIi*RQDFSMLPRQ/VLKAICLP 
WPPSL 


868 


14769 


A 


875 


1 


172 


KLLSSGSPPASASQSAGITGVSHRARPG 
UjNFFC*CAFSVPGPCLGYPDTFTH/LC 
PPSFHQS VIASLDFSCLS *L *MS P 


869 


14770 


A 


876 


1 


34? 


LLFGFNNYSFFHGVKPKIPAFFVFGPGC 
LPHFFPPITPAPFFFIiDRVLLFPPGWRE 
MGPF*APPTFSPQG *GVFPPPPPQ*LGP 
RGPPPQPGVFFFFCI FGRDSA/LAILPR 

Ii 


870 


14771 


A 


877 


J 


203 


GFH YAGQAS LE VLTS *STCLSLPKSWDY 
RRGPP/ * PGLS YFLYPSLRS + FICAMS I 
HIPFIKKKKASD 


871 


14772 


A 


878 


343 


102. 


EWEDCPS PGGRGCSEPRSHHCTPAWVRE 
TLSPKKYLFIX2P*KLKNTKIYLSISIP* 
ELET \ I LKILHFKMMNP LHS YNFFF 


872 


14773 


A 


879 


2 


359 


RDITKGDMQMETKHM* SFSTSLTSGI \H 
NSPIRPSKMKNIDSTRYWRGCRAIRALI 
HCWWRC*MVQPFGE* ISSFSKKLNMYLS 
YDERPTFRYEK* K/ P * VHPKI CV*MFMA 
AFFLISPNW 


873 


14774 


A 


880 


205 


1 


FFRGVTEGL*EPPYVESVI/AGGTTARR 
PLFFFFFFFFFFLRWSFALVAQAGVQWR 
DLGSPRPPPPGFK 


874 


14775 


A 


881 


150 


2 


CRARVDGVPWRNPGSLKPPSP \GSSDPP 
TSASQECGITGAHHHTRLT*VF 


875 


14776 


A 


882 


345 


1 


KGNQPWKTEKRALFC/TLKKKKIFKNSP 
PRGNF* KFFVKKKI PLKKEKGFSPTWGK 
KKEFFFKKKKKKKKK* * AKDLNRNFSRE 
DVQMAKKHMKRCSTSLIIREM* IKTTIR 
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N=Asparagine, P=Proline, 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibie 
nucleotide insertion 














YHF 


876 


14777 


A 


883 


2 


355 


DRLLFS ASHLDLGTLYLLFGA* SGVLGT 
SF\SLLIRAELGQ\SGCLL*NDHIYNGI 
ETAHAFVMDLFIVRPII IGGFGN*LGFP 
NKKGADMMPRINNISF*H,PPSLIiI*LL 
ASAIVEA 


$77 


14778 


A 


884 


262 


2 


PTCEQSERE I KKTIQFTTESK* I KFVG I 
NIAQEAKDLYTETIKYKTLLKEIKDTNK 
* KNI I YV\S WTRRFNNFKMLGWAQ WLMP 
VIPA 


878 


14779 


A 


885 


16 


318 


ILRADCADLFFFFFLGGKKGFCFPRLKG 
RGETLLN*TLIFR\ VKGNF* PKLPKI W/ 
DFKGWPYHL/AENFVFFLKKGGLTLLPG 
LVLNS* IKEFFHLWPPKGVA 


879 


14780 


A 


886 


298 


322 


KRRTP * YPPGEPHFPPPPPQG* KARGAP 
PPPPKKGFF\ PEKXKKPGVMVQGTTQKP 
PKKTKVRVD ' 


880 


14781 


A 


887 


353 


3 


FPFWGFLKTLFGKALWLFKGFFFFAHQK 
FFFLPSLPPPP\LGFFLRGLT* FKIFSP 
LF*KGAPQKGISHPLFFFLISPKPPPPF 
FPFFFFPPPPFFFFFFFFFFS \RDRVLL 
YCPGWGA 


881 


14782 


A 


888 


22 


341 


I PCTCLKLHGKVHDHDQSQL WWPKPQEE 
MRTLENNLAVLQSILQWLDIELPYDRAI 
ICAREIKTYV\QQNCT*TFMLALCITAK 
KW/ KQLKCPSTDEWVSRMWYTCTR 


882 


14783 


A 


889 


51 


338 


ERSQIiQWLMPI IPSLFFFFETQFLFCCP 
GGRAWAIFKI FEFIiA\PK* KQFS CLTLQ 
TI * VYGLNPPPRKNFVFLKERGLFHVGQ 
SGRDLPPSGDPP 


883 


14784 


A 


890 


1 


225 


GRLRPENHLNPGGRGC3GELRLCYCTPAW 
VTERDTIS/RHTHT*NYFY*GSI*QLAN 
SCCCNMLNKFNVICILRWC 


884 


14785 


A 


891 


357 


164 


GKGCSDSRLCHCTPAWAT \TKTLSQKKL 
ILKKQNKFRS * LDD * INMRCTVWPCFHV 
FLF IRAAPLFSDWLYNK* MNRNT 


885 


14786 • 


A 


892 


207 


302 


EPFSGI I INES XHQEGI IVLNVYTSSNR 
PSKYMKQTLIELKGEKVKSTI IVGD/FY 
THLLVIDRTSR* KR 


886 


14787 


A 


893 


86 


332 


VMRVSCCCLKD* ISLCHPTWSAWQS*L 
TVASNSW\VKQSSFLGLPALWEAELGGS 
LEVRSLRPVWAT*TP\FCKNKKL 


887 


14788 


A 


894 


2 


336 


FFFWPPPQTFLKNPGGPGPKREEGPGGS 
PRGQKQGPPGLHLTGFGGQPGG* KNPEP 
GPGGRAPKGETRGEGGGPTRPQIP/QLI 
NGKSGKPPKVTLNGAWGTIKIFIilKTPV 
G 


888 


14789 


A 


895 


1 


214 


ARESLASFLPPSLPPFLLGLRFLPSFLP 
FYFS FLPS FIX3FGS FLPST* VPS FLPSF 
IiLFFPSFLPWLWFLPS/FLPS FLPSFLP 

sw 


889 


14790 


A 


896 


131 


352 


TLHESDSESVPRDFKISDALAVEDDQRS 
PGTLNAAELS/SSVRERKKKEK/KPEPG 
Ii* DQS I KESDS YMVSGGRIQ 


890 


14791 


A 


897 


3 


184 


CSVAPAGVQCHDLTSLQSPPP \SSSNPP 
TSSNPPTSAF+VAGTTGMCHHAWLI FVF 
LVDAA 


891 


14792 


A 


898 


327 


264 


NRAVSLMNIiDAKVL\KIjISASQIYVYIK 
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KMIH/HNQIRFTMRM*GQFI I * TAINIY 
HKLI K*GRKHMI PL I YAVKT PEKT * YHN 
IMIKTLHKL*IKEPPGSFIYSL 


892 


14793 


A 


899 


3 


313 


TKAASHSQ\RANIjQFLVGRIH*HIiKSRT 
ISPGRVGATAAVNSTAILEYLTAEVLEL 
ARNASKDFTVKHTTPHYLQLAI RGDKEL 


893 


14794 


A 


900 


340 


1 


HLCPHKEHLY/ IJjQTSCPLAI I * FIST I 
AEHTKAPFDMAKVESELVSGFNIEYAAG 

P F AT . F F T AFY PNT T T TNTT ,TVT T FT /3PT 
YVAIiSPEIiNTTYFVPRKIVWRVFIIIM 
FV 


894 


14795 


A 


901 


1 


252 


LTFFPQHLLGLSGMPRRYSDYPDAYAT* 
MTT.AQVn/<3 c ?FQT.PAVTT.TTFMT*FAFA 

SKRKVLIVEDPSINLE*LYGCPPSFPSF 


895 


14796 


A 


902 


326 


1 


U3PPPPLKEPPGVFPIFPF\* FWNFGPV 

PPF*GKIPFF*KNTPP^F*GIGGSIPHP 
PLFFFFFFL*DGVLLCHLGWSAVA 


896 


14797 


A 


903 


1 


352 


KG I FP VWSOG* KGTQKKGGLSQGGQGNV 
p phot WFvrn? ft /3Fpnar>T .t. t q * rsrrp t .t 

F*KGPKKGFLP*TVFG/RRAQFEAPR 


897 


14798 


A 


904 


325 


2 


RLEKKLNDYWNEMIG/KKNTEYPLNIjPVE 
DIQKRPDQTWVQCDACLKCRK* PYGMDQ 

WT.FFA WVMHMTJPTiQnFPVr'MVDTrnPWnM 
n ljCiIn. \ w x fv rixso y r k x utiv jf jsx^xrctx/n 

DLVHPNYGKPYKKPSKETYWFSQMP 


070 


1*1 iyy 


& 


y\jj 


l 


SLy 


QKKI KKKKGGPGRE FQNLKWENPKNPGG 
SF*RGLGPQSNFFYLfCQ/RLGFFF*KKP 
PKI* ILAMEPPNKKIF*KNKNKAHFFF 


899 


14800 


A 


906 


148 


2 


DVDKLFLLRSLPTL * RPEYGSYMI EGT \ 
TGO P YGfiTMfl R FWTT ,F AMMP T P 


900 


14801 


A 


907 


384 


1 


ESKRS IFGPPNPPPPGFKGFSRHAPLWI* 
RSREP PHKAGVKKGFPRFRPGS * MPI P * 
SFLQNCLPQRVGFPSLPPPGAPFLKISL 
FTjROESRSVTOAGV* \ YGT.T/1CPPPPT.K' 
SFSCLSLPNSWDYNR 


901 


14802 


A 


908 


3 


297 


TKIKSL* imLIRAKTVKLLEENMGINL 
HDLSSGRQQPFRYDTFQ/ SMINTAI KEK 
IKIKI^FVTCMGSFISQRxTLFJCERQIjTK* 
EKIFADVYLIIGRRG 


902 


14803 


A 


909 


169 


373 


ASNILSATDISNTFGPPGSQGFSGREAY 
VEAGTYYTNFSCLGOVKVF * YWMOLI VS 
IVlX*LL*V7^KL*L/GPRQEKFV*YvP 
AST*ASLPLKPCDPGGPKVTEMSVADS/ 
DV*SLWIiALIGESKFRCliVF*SKFLPSS 
AYTYSPIjEKBCDFILIjGCFVMICFCFCFL 
RRVL 


903 


14804 


A 


910 


132 


368 


GRIFLFVGQEKGARVSFLFLF*DRVSLC 
HPGWSAVAQSQLTTS /TWTQNNSSHLTP 
P*VAENHRCA/PHTPNVLFLC 


904 


14805 


A 


911 


2 


339 


NSWAQE * AG ITGS CHHTQLTLE F * AHDR 
QMIVlFFLFFLKTNFTFCPPA*RPWPGIiGL 
TE\PLLPGLKHFF/ CLTLPSNWDKGHLP 
P/HPYQI /CGFLRKNGASLISG 


905 


14806 


A 


912 


3 


381 


LNFCDTHPLTPRPVSIQRQCLPLVEAGI 
RWRALISPQLHPPT/YSLLSSWDYKHAA 
PHPANFFFFFFKKGSLTMFPRPGGS*IF 



578 



WO 01/64835 



PCTAJS01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
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IYLLIYLERESY/AVAQAGGQGPNFNSG 
QPLS PRFKQFS YLSLLR 


906 


14807 


A 


913 


394 


1 


MSKGKIG* KLG* KFAQKVSQFVNAKEKL 
LKEIKHFTLMNT*MI*KRNS/IiIADLEK 
ILVVWIENKTGYSIP*TQNRIQSKALIL 

fysmmtergeea\kkfearrdyfmrfke 
rsrlqnmkv*deaanadgeaa 


907 


14808 


A 


914 


326 


200 


HNWI^IWKKVKSNCFLMPHTKINSRWIK 
DIKI /R/LHRIKILQENVGNHIYKI *RQ 
KRS * PRHNFLHKNCKCLVWHKTNK*NH* 
KI YL *CI * KTFAVFM 


908 


14809 


A 


915 


3 


334 


LWRVKGPKFKFWLHHQLGLYC * VIS FF 
INKMKEYVLIHRIK*DHKNIF*MGYKKL 
* NVNIW* LVLFGFSSKNHL /WSGAVAHT 
FNPSTLGDRGRRITQGQEFQISLTNMVK 


909 


14810 


A 


916 


14 


299 


yqklpqkkspepnaftdkfykifk*lta 
ilq*lfqki/e/ketplpk*snessnal 

I*KPDKDIT/SKNYKPISVSNIHAKIIiN 
KI *GYQVQQKQLYSK 


910 


14811 


A 


917 


3 


339 


SLQPQSC3iIiKCSSCLSLSSCWDYKHKPP 
CLA/NFF/VFF* EKGICTVKN*NGDYPL 
GRRVTKRDHERGF*NAGKVLLVGW/CGF 
ETQSRFVPQAGGQGRNLGSLQALFPGLM 
PFSC 


911 


14812 


A 


918 


2 


321 


GIISGIYKELSQPKMTDSSIKK\KDLNR 
NFTKEYVQMACKHMKRCSASIiVTREIK\ 
IKSTMR*H* \TPTRMTKI KD\DKNTKCW 
*GYGAVGMLIHCQW\NTKMVQPIiWKN 


912 


14813 


A 


919 


51 


326 


FFVFCLFSLVFISLNCGLF/PTFFRIPS 
L/PYLQLI * IYLFFY* IjKP I FLKFFRDR 
VSLCHPGWSTVA*S*LIEASKYW\VQ*S 
SHLSLLSS*DY 


913 


14814 


A 


920 


239 


2 


DSLILSAVQ*HDLGSLQPPHPGLKQ/FL 
PSNWHYRCMSPCLAYFIiFLVETPFCHV 
TQASLELIiGLSNLPASASQSAGI 


914 


14815 


A 


921 


1 


242 


PRPRRRLLKYFILFNFLRQ/RSRSATQA 
EVQ*HDHSSL*PQTPRIiK\HPPASET 


915 


14816 


A 


922 


249 


1 


ALFCALKKTIjCGTLMFIAALF 1 1 AE \SR 
HNQTQPTCLSTDEE INKMWHRHAMGNYS 
AIKRNEVL I YAKI * MYLENTMLSEI I 


916 


14817 


A 


923 


404 


1 


S RPQKKLGELKCS * RPKGLNKKNQRKPG 
TFFFFETKFHSFCQAGVQWVYGC*SHPY 
LLHIIFP* *VF*MFLII>ITV/PSRNKDL 
IGNSEKQIHN/WSSLYFYF*LFFLRWSL 
TLVPQSGVRWRNIXjSLQPPIjPRFK 


917 


14818 


A 


924 


6 


358 


FFCSLLGHGGETPNKSPLPHPWL*ETKS 
P PKP PHPLKKKKHSL FFKI F I LKKKKNG 
PPFFPPAGPNPRG*NPPPPQ/RPPKGPP 
PPKEKINPPPPPQGKNFPRGKRPPLGPF 
PKKGGGY 


918 


14819 


A 


925 


268 


2 


HM YAFVfCI DLTAHLRFS AGKKE I PAGKL 
YF/L* RKNPKNQIGNQKKKKCIiPRAGRM 

NMGKP 


919 


14820 


A 


926 


74 


338 


IAGITGVSH* ESKNS Y* KQFL/ WPGTVA 
HSCNSSNLGGQGEWIT*GQEFETSLANM 
VKP 


920 


14821 


A 


927 


339 


1 


LRLFACP P L P KWWD YRGE PPRQ I FFS WH 



579 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

-nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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FFKKQGFICPIEKGRELSNCCFQKEVLI 
*KKVILKI * PFFFFFLMBSCSVA* AGVQ 
WHDLQPLPP\NSWDYRCPPSRPAKFCIF 


921 


14822 


A 


928 


3 


416 


RCWWENKIN/PFWK/S/VWPCVTKVNIH 
SPYDLTISLQYLPYRDEYLCSYKNLMFL 
IVLFI ITKNYKQPKCASVG* WLR ILWYL 
/YMENYSAIKMNK 


922 


14823 


A 


929 


344 


69 


YKRSPhh\KSA*DGPAmKSLPRS*YKR 
SPLLIKAKPDQIISGWEASPLII\PRAIi 
LQKLKPAASKF*W/NPKKKRALIS/KAI 
PSKKNKAEGIMFPNFKV*YRDTATKTA* 
WWYKNSSSL 


923 


14824 


A 


930 


378 


1 


HGKKI DPRF I P YTKI /NLKWFIDLNVKP 
KTIKFLE*NIGE/TIFVIDHKK*NP*KK 
KLVN/WDFIKI KSFCFVH* KPAVGK*KR 
/QP*TRRNYVQTHTSDRGLISRICKELF 
*LNSKVINSIRKWAQSLN 


924 


14825 


A 


931 


479 


83 


SRKGLGRGLKRGGFFPPPPPKGLPGGVA 
PPGFEGFFGSL*KG/DRG*PFLKGVKGE 
KGKRI *EGG/EKGGGFLPPPPPQKGFSK 
NFLGFYAGKKENP*GGSPPPPLFGPAQK 
ISLGKGENKVPPPPGNLFYLFIF 


925 


14826 


A 


932 


36 


440 


TTYASLDEAQLPRAKFNAFLTTF*HI IK 
NGPILGKIGRKYMIADQ\GHRMKNHHCK 
LTQVLNTHYVGPKRILLTGTPLQNKLPE 
LWALLNFLLPTIFKSCSTFEQWFNAPFA 
MTGERVHLYEE ETILI IRRVHKVL 


926 


14827 


A 


933 


8 


398 


CLQGKEETIjTTNWHLRYFLPFFFFFWPK 
KGARKNGGRPPGGSPKNQRAPCPKPFEN 
PGEKTPQKRGFFPPKPCPGPGPGPTGGQ 
GKPP/HPQRPQILPFVPQAGPQGRKGG* 
WNPPPPG* KGFPAPTPPRM 


927 


14828 


A 


934 


i 


441 


TRSHPRALKEVYTE INTVFMPANATS T T. 
QPMD*GVILTLKAYYIiRSTFCKAIAAVP 
SDS/SSDGSGQSQLKTFLKGFLILDAI+ 
NIGDSWEEVKISTLTEMWKKLNPIFMND 
FEGFKTSVEEVTADWKITEEVQVESED 
GTEFLQSH 


928 


14829 


A 


935 


3 


383 


PG FKASKDRLT FS S GNNAAGN FKLKPVL 
IYHSENPRVLKNYAKSILSVLYKWNNKA 
WMTAHLFTAWFTE \ P YVETYCS \ EKNI F 
FKILLLIDSVPSHPRALREI * KQMTI VY 
MP/STTTS I LQPMNQG 


929 


14830 


A 


936 


417 


1 


CFFSRDEVSPCGPGFSPSPD/HHDPPPP 
PSQCWDYRRDPLWVPPHICFL IHKKKRS 
SHMGSSMYDP* KPPHKWMKSPPPVSVLY 
GSIPVQVQIAPPPETNPVYFFSPPPPFW 
GGGRVCVCVCVSSEVCDFFGGDENVIV 


930 


14831 


A 


937 


412 


3 


KPTRVKKINPP FFKNTKNKGGRVGGSLY 
SPFFGGLGQKNGFTPEAKGS INLKFPTA 
PKSWGTQQKSLFKKKKKKKS ILKFIWNQ 
KINPKQIKQS*GITLPE/FKLYYKEATV 
TKTAWYRYKNRP 1DQWNGTDRNKATH 


931 


14832 


A 


938 


1 


416 


KNRHIDQCSGIERPEIDSHKYSQLIFDK 
GAKVI * *RNDNLFNKWYFNNWMSTC\RK 
INLDTDLTYLFQNGSQT*P*NIKLLENS 
MGGNLGDL/GMSQ+VSSSSSSRRIHDKK 
IiVS/WDFI KIENI CSEKGI I KRMKSQTT 
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D 


932 


14833 


A 


939 


4 


443 


DFD YNHDHD YAELGTRPGS VGQGS PDPQ 
FTP S RMGREGEGTHS lit* CSS LGMG VIA 
DLSTDPTELE KRALEVAGPDGQASAI S P 
ASPRRKAGDGGHRRALPGCTSLTGETTG 
KSGEAGQDGKPPGD/GPIGPYSLPGSGP 
GSGESMMG 


933 


14834 


A 


940 


3 


404 


LFMFLEFFF * KPRSC* VSQAGMQGCYLK 
ALQAPPPRFTP/SLLSS*DYWSLPPPLA 
NFLYF*ETRGFTGLTRMIS I PQPTEMPG 
LAS QTAKL I FF * K /HRVL VES /HG * S AR 
AVHRGDIiHILEP*TPGLK*YS\CISL 


934 


14835 


A 


941 


397 


1 


FVPNSQVANAKKKFLKEAESTTPVIiSGI 
RKQNTFAADMB* V* MVWIKDQPS YS I PL 
SQSVTQ/ SRALTLFDCTKAERNRRGKLQ 
WEPAV\EGSKGWLMRFKERSHLHNIKVQ 
DEAVS YPEDLDKMDAVNTKHQ I 


935 


14836 


A 


942 


441 


29 


SVTLCKHTVHVPTFIjRRGKRCPLFGQCQ 
PWPSPG/PHPLSMSPV*PRPQ\PIPLHS 

appiialgpfipmlsmaglpfpfqgsglr 
nwkppfpqpsllq* *hfpkplkpirglr 

PFPPSTLS IPLPGTRPRAKLGVMRSRL 


936 


14837 


A 


943 


2 


397 


ARDAAPEPGEHLLQGLSARHGLRPPHDS 
RPGPD\PP\SPPHPLPLPAVPTFVFSFG 
DSWLL*S PFFPAPWASEGGACAPNPHAL 
VPS PSGQASGLRGGAPSWKAGLDTDGQQ 
AGRQSPGPAPPSPPIjPPSQDC 


937 


14838 


A 


944 


3 


449 


LRQVWHEGEMPNKTTLI YHYTPI * ITKI 
KNTDNTKSLWGCAEARSFTHC/ WMRI KM 
VKPLWETVQ* YLI * * KLQLPFNPAVALW 
\SICPRKVKTY/ CHRKNKTRS *MFIV/A 
LFVTAQTKN 


938 


14839 


A 


945 


380 


175 


RAPAVPATQEAE VGGL/H *TWEDRLNPG 
GEV / CSE P KWCHCP PAWATKPNCVSKKK 
KKKE KR INHQHDE K 


939 


14840 


A 


946 


21 


331 


VAPLKYGPSKETI / IQS VRQSTECKKI F 
AYYPSDTGLITRIYEEL*RLNRKEKLNS 
SVYKYAKDIiI *PFSKEDTQMANRR/H*K 
KCSTSLI IREMRFKTTMRVDAV 


940 


14841 


A 


947 


365 


1 


GGGPFWFFFFFFLGGFLKIGKIWAKKKT 
GPSFPQKKKRGPKKKKPPGFFFFFFF*K 
GFVAPGGGQWEGFGSLQPLPPGVKQFFC 
PRFLRKW/DSRFFFFFLIKSNAEQFPTL 
YAGVRMRYIK 


941 


14842 


A 


948 


357 


137 


I SAHCELRLPGSHHS PALLCS PGLSRTP 
NLKQSSLFSLPKCWDYR\IiATVPGQLLL 
F* ARHCIS IDPSLHLNNM 


942 


14843 


A 


949 


53 


352 


REDNHHKCIRNKMINRKWANEVNRCFSE 
EIQMVKKHEKLLASPPIRSMQIKRYY\L 
NSLA* KKEKKSD\NTKLWQGFGETKSV* 
RYTIN/ PYDPTIPLLGIY 


943 


14844 


A 


950 


937 


3 


KVSPYKINIHK*VAFLCTNNIQAEKK/M 
NMVLFTIATNKKX/ SYLVTH* IKEVKYL 
YNKNYK\LLKKNRDNANK*KGTPCS * I * 
RXNI I KMSVLPKATYRLSAISLKL* MQL 
FMELEKE\TILKVIWKNKTAHLAKATLT 
\QSNPGGLTLPGLYYKAIVIHITW/HCY 
KNTHVDQ* /NRRESPEISLHFYTQLFSD 
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KVNRSKQ/ WKNCLSNKWYWESWLI ICRR 
RKLKLYLS PYKKCNSK+ I KNLTVKLKAI 
KIVEHQKSTVLVTGLWKGFMTNT I KVNA 
TKIKVKN*HLIKLKNFCTPKET/T*QIC 
RQQRVWNKLFSNF 


944 


14845 


A 


951 


246 


1 


AASTKTGVQT KTCT * ML I AALV I TVKRE 
KQSKCSSVE\E*TMWYIHTMEYYSVTKR 
NDVLIRTTWMNLENSMLSKKDSHKG 


945 


14846 


A 


952 


2 


255 


QGCGEKGTPLHCWWEYQPLWKTVW*FLK 
DPETDIPFDQAIPLLGIYPKBYQSLYYK 
DT\AKTWNQPKCQLMVDGIKKMWYIVDA 
A 


946 


14847 


A 


953 


343 


3 


GSIYTKEMEPKVNNLQK*KASCTDGL*G 
E FY * TFENE I P I LYNLFQKTEAKGTLTN 
SRDEAS I ILIPKPEKCTRRXENDK/P/ 1 
S LMS I DAKLLNKI LAN * IQQS R I YSRHT 
RLIQ 


947 


14848 


A 


954 


1 


349 


AQPPFILFILiHP\IYCIFSS*S\R*QKT 
LKTT*FQGFIFCFFFFFK/QAKSCSRIK 
GRGPI XAKGTPELPGLRDPPTS PT* VAG 
TKGGGPHTQLKFIFIAK*FYISF*HNGK 
FCSRRGR 


948 


14849 


A 


955 


2 


351 


GLKNYAKSTLP VLYKWNN\ KA* MTTHLF 
TAMSTE * FKPTVETYCS/EKKI PFKILQ 
LMDNAPSHPRALMEMYKEINAVFMPVNT 
ISILQPMDQGVISTFKSYYL/RNTFCKA 
IGAIDSDS 


949 


14850 


A 


956 


336 


3 


PTKENFEPDGF \TG * FYQTFKEL I PPQT . 
FLKTSREYFSSSSSSSSSSSSSSSSSSS 
SSPYRPI /SLMNTD/AKILNKILTNQVR 
LHIKRNTHHDLVGFIPK\WFNTRKTDQS 
NN 


950 


14851 


A 


957 


181 


1 


RWDLTMLPWLVSNS \GLRRSSH/LPKCW 
DYRFBPPCTA*GWFL IGPHWNVSTWGS 
PRISRQ 


951 


14852 


A 


958 


3 


293 


GGLTSPHVKTYYQATVIKTAWNG*RGVC 
MDQ/YNKTENPETDPCKYSQLMFSEVTK 
AIQ*RKDSLVNIENWNN*MSIHKKSSSR 
KHLNQ YLTP YTKL 


952 


14853 


A 


959 


350 


3 


NKKKGQSLVFRSPTLFFFFFL* IEMG\F 
IMIiARLV*NS*RRNMTTSGSQSVGITGV 
SNHARPKRKLFLFS 1 1 TTIGWAGV\WWL 
TLVISALWDYRREPPRPAVLIiPFHIQRS 
RMPLSN 


953 


14854 


A 


960 


349 


1 


GGFPPPLFFFPF/PPPWGKPPPPFFYQ\ 
SFPPKKKGGPKPPFFFGSPWGVHKKQFL 
VKSPLPFFWKKKKPPPPF*KKFFPP\VF 
QKQPGRVKKPPYERNSWLLGPPTKKSPK 
KSTGS 


954 


14855 


A 


961 


379 


160 


PGPQGEPPFFLKNPQKLVGQGGRLF*AP 
LLRGVRQKNS LNPGGGGALKPGS /HLWP 
PS WGKKGDFLF* KKKKKKK 


955 


14856 


A 


962 


184 


2 


TFFSPKKIFPFFFFSKKFFPKGTFFSPR 
ENFFFFFFFF*DGV*L\CHPGWSAWPS 
RLTATS 


956 


14857 


A 


963 


1 


328 


ARLVLKS * PQ / CDLPASASESS GI TGVS 
HCAS ASQS AG ITGMSHH IRPKWI SLYLG 
FWSFNKNVLHFFCASSLEGESMNNELLS 
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KKNWE FLDKKLHGS TKLGGNHVCVNF 


957 


14858 


A 


964 


1 


334 


HPLGGRVGGVPLGP / WLNPPPPQ*GTP 
FFLKKPNNPGRGAKPVIPAPRGG*GGKF 
LLPPRGG/PSMNPNCPPAPPP 


958 


14859 


A 


965 


206 


360 


QKSMSLHQQ*QDIHSSQAHKNSSW\PGA 
VAHTCNPSTLGGQGMQITRSGDQD 


959 


14860 


A 


966 


10 


332 


NFFSPGAPPPPFPFGGGFFLKKGPPFF* 
KKKILGGPPPPKKNPPGVFPFSP/AKNIi 
GFPPPPPPFFFF 


960 


14861 


A 


967 


r 


282 


RKCAKDLNRHSTKEDIWISTECI*HC*S 
LRKSTI,KPRraPHIY*ND*LYKKNDDIK 
CW*GYGVTGAVFHFHWSRMVQPLLNNW 
TVPRNVKFTL 


961 


14862 


A 


968 


339 


195 


FFFCRDGVSPCCPGWSRTPRPKQSTCLS 
IiPKCWDYR/RL*ATVPSLDYLF 


962 


14863 


A 


969 


3 


342 


IKKGPHPQLKKPPGVFPVFQFKKSYFPP 
PPPLFNNPFFFR\DKASLCHPGWSAVML 
SYFTAAYTSW\VRSSSHLSLLSS*EWRH 
TQPCLFF*F*FL/CRSK 


963 


14864 


A 


970 


384 


6 


GRKNVAGFPLSPLYTPPHGHGIiGPPQTF 
GAGP PAHKSHQKVGRQBCRGPGFVP PRPP 
AFLFFFFFKE*SVLKKKENNLYNStiFAY 
KILNKVQGETQCEGRAHIC\VCVCVCVC 
VSVCTCVHVCALAIC 


964 


14865 


A 


971 


427 


109 


LNSQFEIiQEKRISNLEDNEKII*SEKES 
KNRMARN/E/QSLRETWDTFGYTNIGIM 
ECPEGEEK/GKRKRKTIKKKMAANFPHL 
MKNINLHIQKAL*TPTRITCTGPYGG 


965 


14866 


A 


972 


389 


145 


SIiQP*TPRLMIiSSHLSLLSSWDYRCVPP 
HLANFLYF* *R*GFAMLGSSNSSA/SAS 
QSGGITGVSHHAPVVFTFKKKLVADT 


966 


14867 


A 


973 


170 


419 


VSFLFFFFFLERQFYFFSPGGGEGGEFG 
LIEPFP\PGFOTFSCLTLW/RRWD*GAP 
PPLPAYFGFLIK\TGFPLVGRGGLDLRP 
SR 


967 


14868 


A 


974 


212 


3 


QNSMIRYSRSMSRSCLCTTILILSLNRS 
LKSPR* /WPGWAHACNPSTLGGRGGRI 
TRSED* DHPGQHGETP 


968 


14869 


A 


975 


28 


356 


VFETSLFRNKKEIVKGTSIiPDFRTYFKP 
LI I ESI* CLHGIGQNNAWNRVGNLQIDA 
/ECAENLIYEKGGISIQW/SLFNK*CWS 
ITIHKEEKKIRPLHCIKIKGTKDFKKLS 


969 


14870 


A 


976 


350 


1 


LFCKEKVSLLCPGRFEPPGMKKSTLLSI* . 
PKMRGYR/RLAPPPQLEIFFHFFLTSKA 
TPL/CLGRSLKRLNSQMFSPFFSEGVSL 
CSQAGIQWRDFGSLQPAILWFK* FSCPS 
ILSSWN 


970 


14871 


A 


977 


4 


343 


PliHSSLGNIVRPHLLNNSNKNDTTKCW* 
GCEKPGFLIHCIWEYMMV*LLWKTDWQF 
LIKPNIHLPYDSAVAHLGIYPR*MKTFL 
QKL/CP*MFIAAVFLIAKNGKQLRCPSV 
SK 


971 


14872 


A 


978 


3 


424 


KRMRRQATDWE KI FAKG I SDKELLS KIH 
KE Ma KLNKKKS P KQVAC FKMGKQT * T AH 
LH* TI YR/ CRRVTHRKDHSTRS WLLREL 
QILNT 


972 


14873 


A 


979 


114 


325 


QQQQm* IKKLAEDSKRDFSN/BDIQMA 
NK*RCSTSLVIMEMQVKVTVRYYYLTLI 
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SIAI IKQKKKQNPKK 


973 


14874 


A 


980 


339 


2 


GPTGFKFNQGGPPPKGSFFIi*KISPPS* 
NPPFFFGPQKGFKPFFPPPYLNPGKNFF 
SKKPLFFFF*KKGFFKPPPFFFFFFFF* 
GRVSLCCPGWSAV\A*SRLTVTSTSHVQ 
T 


974 


14875 


A 


981 


2 


377 


QMANKHMKSFSTSSAFMKM* IKNKMRSH 
DTCIRMANINTNDT\CW*GCRSNKILMH 
SW/WECKMVQLL*TMV*KFLIKY/DTYH 
DPAISLLGSIKKKSKFCT* /T/CKILYL 
NVLFIMNPNWKQSNWYSMG 


975 


14876 


A 


982 


1 


457 


INE IEARNTTEKFNKTELTSSSSSSSSS 
PLVGLTKRKKSQN\NEGDITTDTIE 11/ 
TIIKDYSE*IYA/NKIEQASSSSSSSKT 
YNLSGPHHEGTESLNRPIMSEEIESLSQ 
NLPTNRSPRADS FTGKFL* TS KDELPPT 
LLK\FQKTEQERTIiPR 


976 


14877 


A 


983 


332 


2 


TPKAGQMQVLFPQYGSKVARAPFSVVAL 
PPPPPFFHFFCFPQPHCSLPPQIPPPFS 
RFFLQVKSSQQGPPPSIiIiGWGEQ/NTLP 
FPFFF* SHSLALLPRLECSGAI STAA 


977 


14878 


A 


984 


365 


2 


KSSKLFNYPPLFFFLKNFGPKKK/SGPF 
FCPFKNFFNPRVPGPNFNFKGFFFLKKG 
PTVF* I KKFWWGPLPPLK* PSGVFRFFH 
FLIWDFRPRPPFFFFFFFLVERGFHHVG 
QASLELLTS 


978 


14879 


A 


985 


1 


153 


GVQWCNYNSL * PRPPG \SGDFPTS AFQV 
AGITGVHHHTRLFFFFVGFFFNF 


979 


14880 


A 


986 


225 


1 


LYIYRERERERDRDRDRDVSHTHLYIHT 
RTHLDNILFCHPGWNAVA* SWFTVALNS 
WV\KSSSRPSIjPSSSDSS 


980 


14881 


A 


987 


2 


375 


GTDEYILIALIWMVSWVCLFFFFEKKA 
PFCPPAGKTGANFGLRAPPPPGIKEIFW 
PP* PSEEG\EPRGPNPSRGNFWFFKKRG 
GSPL*PRLFG/HPRP*GNRPP* /PPQRG 
GNNKGDPPPPPGGIFF 


981 


14882 


A 


988 


55 


335 


HIYIDVFVSGSWLLTV/ ISLLELTVFCY 
NVGAL*ASG\QASENRISVSDFLLPS FY 
LPKAGL* S FPAFMMMGHKILI PEKVI PH 
MLEBB\ TCTERP 


982 


14883 


A 


989 


295 


369 


ILVTRNSEL*VLPEQRTRFLEKAMALRS 
IPLLKR/NGRPGTVAHACNPNTLGGQGG 
RIKRSGV*DQP 


983 


14884 


A 


990 


2 


240 


KKISMNLEHEMIiSERSQS*KTTYYMIIF 
T*KLQNRQIYK\TERIYIYIYIYIYIYL 
FFFFFLERKFLFLFPRGEGGGPF 


984 


14885 


A 


991 


463 


0 


GFRQLSCLS\LPSSWYYRHMSPHLANPF 
\TF*VERGFRHVGQAGIiELLTPNDSPSS 
ASQSAAIH/GMSHCTQP 


985 


14886 


A 


992 


82 


365 


EICHQLYRSFLCSLFC/DHPSNKSH*DT 
MNCVHFI IRLLNFSFFFFEKKFRFVPQV 
EGQGSNLG*LKVLPPRLKPFSCLTLRRS 
WGYRGPPPPPVN 


986 


14887 


A 


993 


3 


317 


GLKQSSCLGLPKCWDYRHKPPCPTHIIF 
NTH* I IKVLNVSFSFCI/PLCWS I ALSD 
HVQ/PV*LYNMLVSSFLLLLLFFEIEW 
ILACLFCPVGSPILFFF*YILTF 


987 


14888 


A 


994 


350 


139 


LFRRIjRWEQHLS \qggqccsep* shhct 
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PAWATE *DPVSKI QQNKNQDTFSHTFIL 
ECRTS KGVS AFLAL. 


988 


14889 


A 


995 


219 


1 


RPRRPLIQKTKAKNHHFTPTRMPVIKDN 

nkc*qgcgev/wn/crmvqplwkt/iqf 
lkmlntelp *drai pltggke 


989 


14890 


A 


996 


3 


498 


CLPWAAVARQPPSAHPFPQPGP * LRTLA 
PDTATAVNQALQRQES *TGC*S PSGWPA 
VPTPVAPG/PPSAG * GWDPRAWPRHNG 
PPGQHPKEKTYQPVPAP* VQLS PTRQTP 
APMACSSLLLPALPPPAPAARPPCPAPP 
FLPSRTSTIiPPRRLSSTIKSKTPPGP 


990 


14891 


A 


997 


2 


419 


ADTEAAASYPDNL\ANDESSYTKQQIFN " 
VDKTALYWKKTPSRTS IAREGKSKPGFK 
TSKDKLTLtiLGANAADDFKLKPMLTYHS 
KNSRAHKNYAKSTLCFRN*NKAWTEHLF 
IAWF / E YFKVTWTNCPEKK / VLLLTHN 
APG 


991 


14892 


A 


998 


388 


405 


G*FFPLSASKFFFCP*ALKLWMEGCHIR 
PPPQVRFPIiQSSQA/GFISASLKGKGFP 
ASMPANIGPPKGYIi* PAPPFFFFSFLRQ 
SHSVTQAGVQWRNSSLQPQPPG/LRRSS 
HLSLPGSWDH/MPPCPANFCIFCTGG 


992 


14893 


A 


999 


1 


423 


WILHD/NLRQPAQWLDQEAAPKHFPKPN 
SH/QKKVMVTVCWSAAGLIHYS FLNPGE 
TSTLEKYVQQIDEMH* KLQCLQPALMR 
RGPI LPQDNARPRVTQPMLQRLNKLGCD 
VLPHPPYS PDFLPTDYHFFKHFDNFLQG 
KHF 


993 


14894 


A 


1000 


2 


406 


SDLFRAI * ETTS VIVYDVSMDYLEIDSE 
MIRDLLNTFPV\HLELQQDSSGVIQGAG 
ITEASTINAKEIMHLLMKGNRQRTQEPT 
AANQTSSRSHAVLQVTVRQRSRVKNILQ 
EVRQGRLFMIDLAGSERASQTQN 


994 


14895 


A 


1001 


290 


3 


VTQLPETVPHFLRDRVSFCCSGS/DHSS " 
LQPQTPGLK\YPPTSAT*VGGTCHHAQL 
I FIFLRDGVS\SVAQAGVQWEDLSLIi* P 
PPPGFKRLSCLRNR 


995 


14896 


A 


1002 


143 


408 


IKKKIKREI*KYLDISKNK/DTYQNLWN 
ATKAVL* GKFIGINVYI FKNRKN*NK* F 
NSTSGKLKKLEKKEQTKLKISRKETIKI 
RAKINE 


996 


14897 


A 


1003 


84 


409 


DKLAALPSSWTLHPGSPLGVTRAPPPPP 
PPPPPPPPPRVLEPVPRSLYPGLAVPW 
PRALH*PPHPGSLPACPAPGLLGGTRQC 
S\QTILPKKKPPPLnADPANEPPPP 


997 


14898 


A 


1004 


392 


2 


EKNAVHFDQKNLVSIWEPLQVPTQVCIQ 
NKTNSWGIAGYPFHFLKRNQRFFNALKV 
PGPFSK*TIHLVNLKNLFFF*DRVSFCC 
PSWSAWKS * FTAASTSlA IKQSSCLNH 
PSS*DCRHTPLLFFVRMRS 


998 


14899 


A 


1005 


2 


421 


FVSLLLLITSAIVAAGAPTCRTVYPPLS 
GNYFHPGAFVHLTI FCLHLEGVSS ILRA 
INFIASIISIIPPAITQYOTPLFG*SGL 
ITAGLLLI^VPVLAAGlTlXLIiTEPSLF 
TTFFDPGGGGDPII, YQHLF* FFGHPEVY 


999 


14900 


A 


1006 


1 


428 


DIHYG* I IRYLHADGAS I FCI CLLLHIG 
RGLYYGSFLYSET* SMGI I LLLATIATA 
IIGYVLP*GQISF*GATEITYLLSAIPY 
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IGTDLVQ* I*GGYSVDSPTLTRCFTCYF 
ILPVI IARLPALHL\LFLLETGSNKPIG 


1000 


14901 


A 


1007 


1 


419 


SPIVPIVMAMACLMLTERIILGYIQLRP 
GPNVGGPYRLLQPFADAIKFLSKEP\LK 
PATSTITLYISGPTIiTLTIGLVL'TPLP 
IPNPLVNLNLGIiLFILATSSLAGySIL* 
SG*ASNSNx XlilGALRAVAQTIS YEVTIi 


1001 


14902 


A 


1008 


411 


27 


GQPGQHRKTPS PLS I F YF/ ELAGGGDAS 
T * FQLLRRLR*EARLSLGI *GCSEL*LP 
PCPPAWVTK*DPVFGGRKKKERLSQLRK 
LRPREVKSLVQHYVAHKGGNWDPNPGPM 
ATQPGRSCTTLCVPLCHR 


1002 


14903 


A 


1009 


21 


377 


MSIiGGQQDS AQTTRS PQFAGF FGQRS I P 
DRTGLGFHLCVCKCVCACVQVSVCKCVH 
MCACACMCTRTPMCECVQVCECVQVCVR 
VSMCTQVC^CVRVCACVCSGACG/CTCV 
T*MSLACG 


1003 


14904 


A 


1010 


1 


412 


HKANTSTYIiLTLVNTFSG*VKACPTTHK 
TAEWASTLIEQI I PRF/GPAFI SfttVK 
QVTTTLDVNWKLHTPYHPQSSGKVERAN 
SLVKQHLIKIiALKTRQSWVTLLPFALAW 
LWAAPQSPTGINP F EJjIiY/ S APLPLSN 


1004 


14905 


A 


1011 


1 


410 


FRAVAGASRQENGA\TVILRD IARAREN 
IHKSLAGSSGPGASSGTSGDHGELWRI 
AS LEVENQS LRG WQELEQAI S KLEARL 
NELENSSPVHRSTAPHTQQVS P IAPVEP 
PAEKPATPT*DDVDDDIDLFGRDND 


1005 


14906 


A 


1012 


269 


32 


NISRIYKEI*LQIiNNKKT\SISKRAQYLN 
RNFTKDI *MAKKPMKRCPTLLVIREMQI 
KTTMRYYFTPARMATIKKTDNNEY 


1006 


14907 


A 


1013 


21 


443 


RIRKKNS YPHYVKS IVAYTFI IRLFPTT 
IFMCLDQEVTISN*H*ATTETTQIiSLSF 
KLDYLS I ILIPVALVATWAI IQFSL* YM 
\NSD PN I KQCLKDLLMFLMT I L VLSTAN 

A 


1007 


14908 


A 


1014 


1 


309 


SSRAAAIHGGACLWLQILHRLRtfEG*LS 
VGGQGCSKP * LHH* TLAWATERDFVSQK 

IvKJVlVlry JSJ\N f \rLuJi 1 bL.lUlUNJOUU^WJLAjP 

PKSQRKEGPKAPQGRTLWP 


1008 


14909 


A 


1015 


3 


256 


EKNQILKWTMDLSRHFSKK\NMQVGNRH 
MKKC*TSLI IREIQIKTTMKYHFTPVKM 
AI IWPGAGITGMSHRTQPSPTLLLTHW 

7i 
A 


1009 


14910 


A 


1016 


397 


3 


LFFSPPPKWGFFFFPPPRFFAPPPFFFF 

SPPFFFFFFFFPPPPPPFFFFK/SPPPP 
PKKK*KKK*KKKKKKTPNTPKKKPKKPK 
KKKKRAAARDLELVGRVGGRV 


1010 


14911 


A 


1017 


2 


164 


EKERRE*KREE*KERK\RKEGBREGGRE 
KJS Kc»Kl\-iVKJ< KKiS KiLKiJKC^ v l^ , I Vrtr KB 


1011 


14912 


A 


1018 


1 
i 


wo 
» 


NSSPPKNKN*NK*KQHT* *EKNFANYIS 
DIELVFRTYNELLQliNKTGNPI KNSF* K 
NQFKNKGK* PNFYITQSF*RNKPI * KF/ 
AKDLNKRFS KEDM* IANKRIKTCITPIA 
IRKRQI KTTVKSSFKA/I YMVWKET 


1012 


14913 


A 


1019 


302 


2 


IVEPLFLGWLCLFLQNPRVKGPGPYFSK 
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GKANPWFPTV\SKQPFKKPAFPVYQKQ 
LPLF/PALVDKKKVL/RFFFFF*DRVLL 


1013 


14914 


A 


1020 


47 


382 


AK IALLHS S LGDRVSHTH IHTHKRSNE I 
ELVT KS S P KKKS S GLNGFP I E FYCS FK* 
ELIPIFLKLFQQ\VEAVGILPNLF*EVS 

TTT.TPJT /PPnFiW<5NTfTTVPPTQT,MNTG 

GK 


1014 


14915 


A 


1021 


273 


I 


QS KERRQRTQEEMGKT FHNNRENGTLFG 
FSQSCLKDE I I I I I FETGFRS I SQAGVQ 

DYMC*LPH 


1015 


14916 


A 


1022 


327 


1 


KFLKGWP I YLKKGFPFVPQGGGQWANYR 
SLVPQIYGVK\YPSAFASKINSTMGMCP 
PAWVTFGMAQIiINGD* I F/ 1 FFLRQS FA 
T^raoTirjvnwwnT.p q pnppppriPin? p^r* 


1016 


14917 


A 


1023 


3 


407 


THLTQR* /HNIQTPP/WV*TGTLSTVIiM 
NFYLWM*FHFQSITLFILRLLTTKLTMY 
rvwT t\\ rr aad C vrwrnu t d pi fn vm .p vp t 

IIiFITINTFFLEGLF*ALYHSSLFPTLH 
*RGDWT/PTGIIPLKPLEDPLLNTFE 


1017 


14918 


A 


1024 


3 


425 


LPPQKKRGFPKIPPREF*KTPPKKKKII 
FPPPGKNWPPTK/RFLKGPPPSRFIPFP 
FFPLPSSPSPP 


1018 


14919 


A 


1025 


430 


1 


r!TT7DDI7DDWIl?l?17DT CDtTt?<7 /OfiVPOV 
r ir Pi? Mr J? rrijor'iru/ KAj v i_±r 

FP PPKKRFFS KNPQRGFNNP PQKKKKIS 
S P PPVNFAPPRDLLKRP PPFFFFFFFFF 
FFFFFLFMHPSFPTSVPQWQITSFLCLL 

JjX1jUV.V>V V»ViiJLuriV Vui/UdCtOD JrOIUlLfU 

PRWL 


1019 


14920 


A 


1026 


463 


494 


RKTGFPRLT*HLDLDFFFFFFFWKGIFV 
APRAGGRGRNLG * LKP P PLGWAPFSGLT 
PPKSWN*GAPPPPPFNF/CVF*RKTGFP 
RLTR 


1020 


14921 


A 


1027 


606 


1 


AGGPGPPKWPGAFGWG/SLNG*S*LKG 
KGSRGAPIPQSGPTVFPGGIPPKESPLR 
pDrsnMnrzppfiPPWPnzv ^nn /vn+riQVT.p 

Q P /GVTGKAPCPGASS PQASQAARRGSM 
GKGGAFHSVWGGCVLSAAWPGWQPPGSH 
G* /PGRRGGQSGCSCPKG* AWWRPCLAG 
SHSSAKHKGMSSEGPAPRGRAISPVSAG 
SIGPIAARGF 


1021 




TV 


ll/Zo 




1 


IKGKGGVLSPRQFPPLGFKKFWGPTPL/ 
SRWGFKEGLKPPGSFWFFKKRWGFSMWP 
RWV*\IPGPKEL\PPPFSQRGGIQGGT/ 
HRPPIIPPSLF/CFFEMETHSVAQAGVR 
WRDLGSLQPL 


1022 


14923 


A 


1029 


1 


436 


PPKKIIFSPKP*IFWGGGGPNFPPPKKS 
FFSQNPPGVFFSPP*KKKKIFPPPPENW 
GPPKIF/ YKRPPPPFFF 


1023 


14924 


A 


1030 


28 


366 


EDHLSPGGWGCNEIi *LCHCTPA\WRQNK 
TLSPKEKDHK 


1024 


14925 


A 


1031 


3 


142 


FDCSALQEFGTRLYCPSWSQIPELKRCT 
HLSL»PKCWDYR/R*ATTPGLRIVLEIiQK 
KLRR* CRELL YTPRS VTPNTND I * HWGG 
TFVTINEIISIH*YIALAGI*RFLSSSDV 
LTSASQNVGITGMSHHTWP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

urn inn arid 

Mill 111 U iSVlu 

residue of 

peptide 

sequence 


Predict 
ed end 
nucle- 
otide 
location 
correspon 

H i nor in 

last amino 
acid 

residue of 

peptide 

Sequence 


Amino acid sequence (A-AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HNHistidine, I=Iso(eucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=ProIine, 

fl Into rn S n t> ft— Aratnina O-— 0„— J ~ 

>< — \jfiuiamine, ix—zviginirie, o^oenne, 
T=»Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


1025 


14926 


A 


1032 


7 


361 


PALRPSARGQQYYDETKNKTLCRNAQND 
SYLPDPNPFSRFSTL/DHSWHQLEAPDA 
* KAP FGLYWNCGARVCR\QG I S AK* TEA 
CGIiGTI KPS FFh I PLKQGEIjLGYPAYNE 
NLKKKKKK 




1 A(Y~I1 
14^2/ 


A 




472 


3 


A KyUMUM VDRjL AFx KKMSSRTFIANAEK 
SMFGPRASBCEG*LLGANTTGNF\LKAML 
IYHSKNPRTPKNEAKSTLLVP*K*NNKA 
LVAAHLFTA*FTEYFKPPIEIQIITADN 
APGHPR\MYKEMNVFMSAN\TSIL*PMY 
QGIISCRIPAARIEVKGA 








1U34 


iOO 

455 


A 


DTDiTT Rr>DCPTUQIUWP+OPt»»HA»*irfi 

FLiK* I LiAyRbUlHbMKI *SGJsAAAE*KS 
EARSSWFLRFKEKSHLHNIKAQDKAAST 
DGKAAESYLED\TD*DGNTK* ICNGDKT 
AFY* KEMPSRMFPAREBLMPGFKASKDM 
LTLLLRANAVGD VNLKPMM I YYTENPRA 


1028 


14929 


A 


1035 


114 


491 


NYFKNFTFSNKIiTL/CLL*IiFLL*KLFN 
F*TF*LLC/ 1 *HLA*NTNTLYSCTKI FS 
FFIFLFYKHFLKFIYFFYFLDFII*GAY 
*SLCYY*WYFSCQFCQVLFHVFWGIiCC 

tjttt TTOT T VUrttT\l?IlDV 

HVJjVKJjJjx ry AUpgrjx 


1029 


14930 


A 


1036 


114 


496 


LLRAILTYLKYKISAMNLVSA* FIYLHL 
TYHCVF/DHPVQGR*LLNK* INELFCYR 
SFGF*WVFSYSHLSED*AIjEEKYIiRERS 
RWVK/DLNVTSETVKILEEILGKTLLDI 
UlA^KJSrnMJN. 1 SQANJ. 1 KP 


1030 


14931 


A 


1037 


381 


45 


KNRGRKKNS*IiGGF\SDKFYKTF*KLIQ 
IIYKLFQKNBKEGTL\*NSRPISLINIB 
ANIQN/RAVLAMI IQKHIKKI IRHNQW 

KSIEF 


1031 


14932 


A 


1038 


134 


394 


SFCDKFTNCISESLCQ/SYSPTYIIKIF 
VLFKI RSGS ITQPGVQWHDHS S PQP * TH 
S^QTIFIiLSSQWGTKGMCHHTWL/ 1 F 
IiFLFVF 


1032 


14933 


A 


1039 


483 


30 


YDGGSE I INYVLESRLIGTEKFHKVTND 
NLLS rkytv KGLKEGDTY eyr vs AVN I V 
GQGKPSFCTKPITCKDEL/AYVSTTIYT 
SETCTFVDL* *DINKNDLPL*LQILAPP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 
£=GIutamic Acid, {^Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Vr=Glutaraine, K— Arginine, S=oerine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrostne, 
A— Unknown, — otop codon, /=possible 
nucleotide deletion, V=possibie 
nucleotide insertion 














AHIISQEKEIKKIQI*YKEEKLPLFTSS 
SLFI+VKNLMEFAKKLIJ3LINEYNKVER 
X KiNX KN1 JjLiAKNTWl JjKr 


1036 


14937 


A 


1043 


50 


457 


TKYQLANKNMKICSTLPI IRE/MQIRPH 
EISLTPIMMALSEEQKNNKYWGGCREVE 
TLVHCWW*CKTVQSLRKTVHRFPK*LKT 
ELP*ELPPK/PGSQRDICTPMFIAALFT 
IAKRFWKHPKCPSTDE*IKEMWYIQS 


1037 


14938 


A 


1044 


2 


469 


LKQSSCIjGLPKCRDYR/R*ATVPG* fil 
♦NTTEDPNLTKN* ILNTNH/WCYSVQLI 
LGE SNSTIKTSIjS FSQAS SHQNTTVQFL 
STSPSFKNCQWLAILPRK*GLTLSFCFA 
FERQGL/NSVAQAGAQWLHHGSLKAQPP 
RLKQ/FLPNCPTLMECYRLQP 


1038 


14939 


A 


1045 


42 


200 


LICT*MriIYALLVIARNWKQIRC/SST/ 
DEWLNKLW* X PMEYY *AIEKBPTIGTC 


1039 


14940 


A 


1046 


475 


36 


LNVN * LiNHP I KRQKLAG/ YVRKHDPTIC 
CLS \QQSRFI FKDTNRLNIKGWRK\TFH 
ANSNQKIA/GVAVYVSNKTN\ FTRNKE * 
NYILIKGSIH**CIIIINICATNS*NLK 
I *N KJj J. EMJUsEIHCSKITVEU/ YNTLF* 
TMDRQKEFHRMQ 


1040 


14941 


A 


1047 


3 


326 


FFFFI IIIIFETERDSVSKII I IIKNKK 
DLF* FLVTQVTPEITNQYI STLPLQSKT 
KWDRPGTVAHSCNPSTLGGRGRRI /TLR 


1041 


14942 


A 


1048 


403 


15 


EVKNSSFSYRLMTEIMPFGYVSKLYNCD 
SGS *LS* SFNVCKMNRY* Y/LEERGNTS 
FKS LLLU4WKLLHS AI FTS A* T PRGL * I 
Y¥I iTfnnjETWSHS VAC3AGMQWHNQGSL 
QPS PPGLKEFHRTDI VYAR 


1042 


14943 


A 


1049 


25 


457 


YMWWNSHHSCMFTIjIiFDSS FPPPSLCFI 
LRSLFLLYAELPLAFLLVKDSFILFV* I 
MSVFHLIIiKLQ *NIHNIYFTILTTFKCT 
VQWH*VYSHFSVAKTAT\ ISRTCLIPSS 
WDYRHMPLCLANFFL *RQGLALIiPRLVL 
TSWJjQ 


1043 


14944 


A 


1050 


I4 


478 


FEP YCRH* TTRWN ^CWWGYELVQSLWKA" 
TEQYVLKPDICVF/LEPGVSLLGMYSKE 
LCTLC\YQKTRMFIATIiFALVKS**LPK 
SS*MVE*ITKL*YFHTMEYYTAM/KNE* 
ITI *TTIWMNI/TI ILNERSQTQK 


1 (\AA 


1 AQA *Z 

14V4J 




1AC1 

IUjI 


l 


4oj 


VKAERGEEAAE / ER / VEASS D * FMRFKQ 
RSHLHNIKVEGEAATADGETAGS \SYPE 
DPAITDEGSHTKQQI FSVDBTACCWKIS 
SRTFAAKE KSMS VSKASKDRLTLLLRAS 
AAvjJJijJVrir fKJjv 


1045 


14946 


A 


1052 


392 


24 


DFAPRRKKRETRRSKINSLTS* * KELEN 
QVKTNQKGSRXQEITKIKVELREIETLK 
TFPKKKVNKSRSWFYEK\INKLDGTLAR 

♦TTIRGIPPLWR 


1046 


14947 


A 


1053 


204 


488 


CAVCYTQISLTBWVDL*SIjDPGFETVSH 
LFDRVNQP * DCRRVTDMLMVF * NLHFVI 
IiFFIiFWF/DFVFVLRQSFPWTQAGVQW 
RDLGSLQPLPPG 


1047 


14945 


A 


1054 


498 


2 


S WVCL FLS PG I TSHTYVPM I FKI GAKK 



589 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=G)ycine, B=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 

Cy = Gltliam\n&. R=Art»ininA ^ — Qnrinr* 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VHWWKS I IiFNftSCWRNT .T QTfT 15 WTK^TTi D 

m/ltpgtkintnwikdltpsaesiiclk 

EN1G /THFHDIRLGNAFWDMTPK"ft*A r PTC 

en*ipwitsk*qt\fcaakntvn*vkr* 
pmd* e /ri fakhi sekrli cvi ykeqle 


1048 


14949 


A 


1055 


28 




I DGRSRGLWWN CWIiASOLI, WRAP ACJfi Q 

leprslrlq*amim\dctppw 


1049 


14950 


A 


1056 


163 


479 


TSPLGSQSAQITSVSHCPSREVPFLKIiI 
HWRQGGQVAIiLVATPHSPCCPQYRLAPI 
PRARHDFACT^SLI FVCILLVHVLLMPRS 
VAGRGVGSGLLG/CQAGR*L*TC 


1050 


14951 


A 


1057 


2 


1258 


ELFPWHFQPSRAASLWAKAIR0KVPGT 
RRWPVPTDGRPEGFALSEPHHSCLHGRH 
QGPERCRHMAAS P * EAGNK* PKDKDGHP 
GERTETAAGVHTGCGGK3GPAAAGGR*AQ 
GGSGEAGAAAGGAGAAVGGAGAAVGGSA 
GGRWPAGLU3QHGAG*GEGPCRQHGPPP 
GVSAGQAASPAKAAGQPGPGT*GAAGQP 
GRG* GPAGPRGGAAAERAGAGAMPAQGP 
AGAAAEPAEGEARPGAGDYGPAANHPGA 
RTGAGGAEGAGAAAGGLPRPAQAHR\AQ 

QRHRSHGEASAGQRHPHPGPTGGER\GL 
QSMLS KI REVAQQGGLKVGLRGRALGDQ 
EEMIQQQVFRIiCPGNLWR/RPPHHMRP 
*AVLL*NIFHISRRREDVMDPMPSSPIC 
PL 


1051 




A 


WJja 




«»/ 


*TGSPVTGSSALEPRSPHPQP\PSLRPR 
SPGPQPLHPljGPRSPSPSPEQTVPSHPA 
RLPSLS PERTQG\ PKAQ* SQLPPPSLAL 
AQPAPAVGGREEVASPACPSWKDKSRLR 
AVPGSA 


1052 


14953 


A 


1059 


47 


456 


TRCTGTLLRLDAPGKLWTRKM*VVASTY 
ARTDRKSSASFRPLLPLPRGFQLSLHFS 
LTSPSCIiAFSANTHRGLRGENYHITK/C 
DMAPKV/HN*TQAWQWHDFGS PRPPPP 
GFGRSSCLGLPSGWDYSPLWHHEQIL 


1053 


14954 


A 


1060 


4R4 


13 


FPTSASONTGTTGVSOHAOPI IF /L.FVE 
MGSPQV/GPASLKLPTSRDLPTPASRAA 
ESVSAHHH*FLNPL*RWASGFVAPSWS* 
TFLL\KQSSCLSLPVC*DCRCYNHWAWP 
HSVFSSQFIiSLFPFNFSFLFFLSFSFFK 
DRISLCRPGWEFHQTVQWSR 


1054 


14955 


A 


1061 


1 


494 


GSPGHPVCGRRRSALENPGQPCSLQPGV 
VSGATGAIVRVQRTSSAWATAAAMGAYV 
E / TTRFRYT* KAGVGCDRLWGAWLKADG 
LGETKAEHTLHDG/ PPEDALYGL I EGDD 
TNFTIQGEVVHCWYTGTIiPDGTVFDTNI 
QTSAQKMM/NAKPLSFKV/GIGKVSH/G 
WDE 


1055 


14956 


A 


1062 


3 


395 


SDPSPCGGIRFDEMEKFLENYNLPKLKE 
/DKINDPTFVK*IGSVMKIFHTVEP/PS 
HKCFTSEFY*TFKE*IAPISHKLF*SRG 
IMGTATPPVMFHLQENVT PK\PTEKK*T 
YRPVSLMNIN\KILNKILTYK 


1056 


14957 


A 


1063 


402 


260 


WYCPPKV/LLRFSVYSSPPEVWAVGSIK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 
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SEQID 
NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence. 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine > 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, V=Tyrosine, 

Vf WT ft A- O A * t ft 

X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














AELYMLRPIiFPRTREVD*IF* ICPV/LK 
GSPKNKDFCP * GAKTFEYFYPATIiGLNL 
KI*IFRQRW 


1057 


14958 


A 


1064 


66 


379 


EFGRGCGEPKLRQCTPAWVTEEDPVSGG 
KKRKRKEREKT*REPSVSRPESH/ PPSQ 
EAFWVVQALSRPVLPNFISGRSASGLGK 
SEAIKSLFLYIiAVCSTTEEQA 


1058 


14959 


A 


1065 


223 


437 


KSCMSCDSNKIFFLRQGLSCSLTQAGVQ 
WHYLGSLQPLPQGPK* *SHLSLLSNWDY 
RR\PPR 


1059 


14960 


A 


1066 


414 


65 


NKS PDRNELAS * KCTQ * THTS IARDFFA 
KTIQWGKNSLFNIWC* DNWISTSKRIKL 
DPYIIPYTK\TNSKWIRDLHVKAKTIKL 
IjEKNGPGAWC PCNP S YSGGLLEARSLR 
PAWAT 


1060 


14961 


A 


1067 


418 


1 


LWVFFFSRQSL/DSVAQAGVQRCDLGSL 
QPPPPRFKRFSCLSLLRSWPYRCRPPHV 
ANFVFLVET\GFTMLARMVLNS * PCDLP 
AAASQSAGITGVSHHARL*FOGF*YIHR 
VAQPSSQS X *DDHP IKKFCIL*QPTPFR 
S 


1061 


14962 


A 


1068 


24 


440 


EV*NLYSENYKMLLKEMRGDLNKWKSI P 
*S/WI*RI*bTIVKMAVLPKLIYRFNLIPI 
RIPDEFFVKSTS*L/OCFimCKRLRlA 
KTIF* K* RTK/ SGDLILTDFKTYYKTMV 
IRTI*Y*QKDRCID*WDRIBS/PEMNPY 
IY 


1062 


14963 


A 


1069 


442 


26 


LRNQVSTPSSKTPRFFFFFFIiKRQTGAQ 
RHNHSSL+ PRTPGLKQPSTSASGVSRTI 
GASHHTPLI FFYFFN*GKEGWGGCSCFV 
AQAGIiQL\RIiQMILPPQPPLKCWN*RHK 
PPSLTYPI FLKRTP * SCFSKEKATRIPP 


1063 


14964 


A 


1070 


294 


46 


KNKLKRKERKENNKKKAKINDIKNKSML 
EKIHKDKN* FFERNNKIDKLLATN/RIT 
K^CTQIVIFMNKIRDITTYPTDIKNVKR 


1064 


14965 


A 


1071 


141 


469 


PKKQGVQLTQNAPLPFRVGGKRGLCPKK 
KKERERDREIIS I I I I I IKLPTKKTPGP 
DGFTDBIiYQTYK* LTS ILLKVFLKKRRE 
CFL/PYSMDPAVPAIAIxNPPP 


1065 


14966 


A 


1072 


49 


472 


EFQIIKK*NSFFADTEKALVWIENQNSH 
NMPLSQSLIQSKALTLFNSMNAERVGKD 
AEEKLEASRGFWFMRFKKPS P *HKVQGE 
AANLDVEATASYPEDLPEI IDEGGHGKQ 
/ QXFNMDE I VFxWKMPSRTFPAREERSM 
PG 


1066 


14967 


A 


1073 


82 


419 


LTVGFRETRSSSWVCSSSPKRALHALRF 
TPAIiKF* FFFWSFSRLFFFFLRDRVLLC 
CPGWSPTLGLK*SSCLGLPKCWN\YRYE 
PP 


1067 


14968 


A 


1074 


1 


407 


LEDGFFKITQSDKKEKKRI \KKCK*NLQ 
EVW\DYVKHSNLQVI E I PE*EVKCLENL 
FEEVIEANI /PHLASYLDIQIQAVQRTQ 
RG x I AKQTo P KHI VVKLS KVNMQEKI UN 
LPKEKHL I TYKGNS IRLTAKPSPKPT 


1068 


14969 


A 


1075 


3 


396 


LHAYHI VKIiSP* PLTGALSE \LLMTSGL 
TM* FEIFHS ITLLXLSLLTNTIiTI YQ*WR 
DVTRESTYQGHHTPPVQKGLRYGI ILFI 
TSEVFLFAGFF*AFYHSSLAPTPKLGGH 
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nucleotide 
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ing to first 

amino acid 

r&siaue oi 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

J' „ In 

cling to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G1utaroic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline; 
Q=Ulntamine, R=Arginine, S=Serine, 
l— l nreomne, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possibie 
nucleotide insertion 














WPPTGITPLNPLEVPLLNTS 


1069 


14970 


A 


1076 


250 


47 


VAACQNAFCTFSCYWL*NLLCRK\MDLD 
ISLLP*TKVNSRWIAGPNVRALTIKILE 


1070 


14971 


A 


1077 


2 


393 


PLTGAIiSNLLLNSGLAMRDDIiHS ITLLI 
LGLLTSTLTIYQG\WREVTQESTYQGHH 
TPPGQEGLRYGTMLFITSEGFLYTGFF* 
AFYHSSLAPTTQLGGHWPPSGITPLYPV 
ESPLLNTSALIiA*GVSIT 


1071 


14972 


A 


1078 


2 


500 


LHVIGVSEGEEN \GMKQNKI FEEIMGPN 
F/ PNLVKYIKA* / IQEMQ*TPNGIHLKK 
TVHRYI IPQLVRTONKERILIVAREKWH 
GIFGGTML*MNDDFSSEFIKARKIEMMF 
LKYKRGISSSAKHC*SRILCVAQISFVS 
KG/E ITTFSDKRKL.RAFI ISRPAHRNAK 
GK 


1072 


14973 


A 


1079 


419 


1 


LRWSFALAAQAI VRWRNLGSLQPLP PS S 
SNS PGS WRLQ * ANTAPLRSS LGNKNETL 
SQKKRK* KDNSEWEKIFANHASDKGLVS 
K*IKNSYNSITKN\DI*KWTKYLNRHFS 
KKDIKMANKHMKRCSTS 


1073 


14974 


A 


1080 


468 


2 


KLPPPRGAGSSAP/ PLFFPPTKKGFFS P 
PPP*KFFFSPNPLIFLGGFFPIFPPPKK 
NFFLKNPKGFFFSPP*KKKKI FFFPP * I 
FAPPKFFFKSPPPLFFFFFFFFFFFFFF 
FFFFFFFLRTRGRTRGRTRGRTRG 


1074 


14975 


A 


1081 


467 


1 


PTYKFNAIPIKIQ* * FFKKIENCILKFL 
/WNLKEPQIAKTVLK*NKVEGFTLPDFK 

\ltvtktvws*hkdrhrde/wk*teslk 
ikpykydqffffff*fsfl/hff*ylty 

SQHHKLIFNRMPKPFWKERLFNK*CWEN 
WIS I CKI I KLDHYLTPYTK 


1075 


14976 


A 


1082 


387 


3 


LKQ P P PHKS FVKNKNGVS LCCPG * F * T P 
GLKRSS \HFDLPKCWDFRCEAPVIjSLLN 
NIKL*L/CGRGLIKPQFP/SVK*K**AP 
SWSAHLAETFFFFRQGLTLLPMPGVQWH 
DPGTVQPQTPGHK* SSHFS L 


1076 


14977 


A 


1083 


2 


435 


FHPPTNWGGFSPPPP*KFFFSPKALNFG 
GGVGPNFPPPKKRFFPKNPRGGFIYPPL 
KRKE9F S> F FPPIiKIiAPPRV/ 5 FKRPPPFF 
FF 


1077 


14978 


A 


1084 


447 


47 


PSRKFFFLAPPKRGFFPPFPLKNFFFPL 
GGLFFG/RGVFQFFPPPKKGFFFKKPRG 
VF*PPPKKKKNFFFPPRENLGPPGVFLK 
RPPPFFFFFFFFFFFFFFGRRFVK 


10731 
iv/O 






1085 


438 


2 


AKWHFKGF5 PFKKFS PPFGGGG I FIRGQ 
L*KSFFFFFKKPKFLV*BCPPFKIFFF/I 
FFFKQRVLAL/ CPPGGNKGVPQGPFPIF 
r bKVoKJPIjb FNFFKNNGKKNPPPTPGKI 
FFFFCSFFL*RQGLTTSHRLVliNSWA*A 
ILPLQPPK 


1079 


14980 


A 


1086 


43 


265 


IKHKLEYI KIKNFCASNSNR/ TERQPME 
WEK\MFANLISDKGLTPRIHK'OIjIj*T 1 N / 
TKQQNSP I * KWVKDQLQWPYL 


1080 


14981 


A 


1087 


11 


412 


iasglhdffnkkkkk^ockkgggplkktp 
ggpkfnrggkkkifsfkggekktpggfl 
eknpflgggkmgqnppkkkkpfgekknf 
*gergekkpkfpggkkmspspqe/nkap 
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RDDPRPPE 


1081 


14982 


A 


1088 


2 


292 


RSLPASADSSSLVAASIiAGVRDRVSTCW 
PGWS/AV/VQTL* SARLGLPKCWDYRCE 
PPCLABAPS FMRSGKASCTLETVWEDKH 
KYEEAERRFYEHEAT 


1082 


14983 


A 


1089 1 


376 


3 


HIGLYIASCKTLLKEIKGS* IDILCSWI 
ERLNTVKKAILLKLIYGFNTIAI/KILS 
GLFIETERMILKITGKOmSQTAOTILK 
PNKVGRLALPNFKTYYKATVT QTVWS WH 
KDRWYGFFCVPTQI 


1083 


14984 


A 


1090 


277 


407 


QIKAERSHHKQITSMRNVKHCSSENYDS 
♦RKPAPGYII IK\F*K*LIKRKILNSGR 
LKNRPIKEE*RYAYQDISETIKVRR*EN 
DIFKVL\KEKKSCQPRLLYPLKIS* IEV 
KYFFDKSKLKKVITNRS PV* EMLNI VLQ 
/INYDSPYEP 


1084 


14985 


A 


1091 


421 


2 


GHPGVLFKG* *A*NEKSL*NCSFFNY*S 
SFSNMQTRIKNV/WPSTVAHACKPSTLG 
GRGGQIT*GQEFETSLANTVKP 


1085 


14986 


A 


1092 


369 


3 


FLKEIRSVTPVNT*ENLIAD/ IGEKVLV 
VWIENQTSHNIPLGQSLIQSKVL/TFNS 
MKTER* /BKLQKFEASRGWFMRFKERSH 
LH/NIKVQGEAESADVEAAGVYPDPAKT 
IVKGGYTQ*Q I FNVDV 


1086 


14987 


A 


1093 


71 


506 


FAEDNGL*LHPCS CKRHDIAVF YGCTS F 
VLTFGL* PWFLTQS / FLNPLEFS 


1087 


14988 


A 


1094 


118 


385 


SDLGKDFMAKTS KAQAAKTK \ I YTWDYI 
KLRSFFSAQ*TKQSTE*RRQSTE/WKEK 
IFADYSSNRGLISSRQETQTTQWLKKIN 
KNALCTL 


10X8 




A 




4.19 


1 


DAKIPGQMVAR/RrPRPIKKIFCPDQVW 
FIPGMQGGEHIGKSINVAPHIQMG*KSF 
NNIQHPFMI * KKXNLSTGTTHEGD ITQH 
/R*MLHEHTTNAILNREKLKAFPLRTGI 
RQGCLL* PLLFS IAVKT* P * AMRQEKE I 
N 


1089 


14990 


A 


1096 


34 


464 


NSSKKEKRKIPHDLGLRFLDMTPKT*ST 
NGMMD/KDFIKIK/FCASKDTINKVKRE 
♦EKLQIIYL/DKRLVPRIYKDSYYPIRQ 
LKKWVNNLNKHCFKEDTQIANKHKKECS 
T\SIRE*QTKTR*RSHFISTRVTKIKMS 
G/N/KKYWQGC 


1090 


14991 


A 


1097 


458 


28 


RIIKVDLKMCTHNFDSLEEMNQFFKKSR 
WPR*NR*FE*SYNH*RN*IHNLKVSKSS 
GPDAFTGEFYQTCE EELAP I LQNLFQKM 
ES IRSNS FYKVRI TLIP*GDNGS / TKKE 
NYSWSFMNLDAKVLIKILAN* I*VFIK 
REFH 


1091 


14992 


A 


1098 


41 


500 


FWIRHFLSRHKQRKNR*MDFIKIKNLYA 
SKDTVKEVKR*ST* WENYFQI /RMFDTG 
LVPRVYKEFI *LSNK\DNHI * QWGKKAV 
SRGFSKGNTQMAKKHMQRCSMFFVIRKM 

* I\KPKMRYHFTPSSMT*EKNKDMTCFK 
SDGEYGII*IYYYWGI 


1092 


14993 


A 


1099 


1 


484 


RRAP F F F F FF KR S F P FWARAP / LQ W AF F 
GLVQNPPPRQIPPPGLHPF/SCPtfPPKK 
WGYKGPPPTPGKILGF**KGGGPRLMKK 
IFISRPCDPPAPTP\QKVGIQGGTPPPR 
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K=Lysine, L= Leucine, M-Metbionine, 
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PFFFFEMVSCSVAQAGVQSCDLRSLQAL 
f irvar oLJIvjI^IjJ^ J.ljbiiFor'flr' 


1093 


14994 


A 


1100 


144 


260 


ILILDSALGHPEPHEFNTKGIEWYLPL 
NTMYIIQPLDQ 


1094 


14995 


A 


1101 


265 


457 


DPAAPLIiVTYTREI\*MYITKRYTAALF 
IKTKNWK\RPKCPSKGE*INKLWYICTM 
EYCSAIKNE 


1095 


14996 


A 


1102 


387 


40 


RLRAD I LRI LVYNNFLQRLKKQRYLFTG 
PQYRFLFLELWLCVCVCVCVCVCVRANF 
* I ISRDKVLLCCPV\ CFCPLNPPCVCVC 
VCVCVCVRACAI INKLLVETRSCSVAQS 
GLR 


1096 


14997 


A 


1103 


465 


208 


QNMTKTFQDYRP I S LMKI YARKQQYRYR 
Y*RMLHNYC2MGFTLGLRGWFNI * K/S 1 1 
IHIN* *EKKIMI II»IESEEAFDKC*QSL 
II 


1097 


14998 


A 


1104 


57 


250 


MTCSRMQ/RNIKLLRYKSEKL*EENSTS 
RNEITILSEEDSFTOVKLEN*MKTVKQK 
KKTSKKTGVYKMAI KKQISEFKGKTSNF 
Y* EQINT * EKKNFKENWSL 


1098 


14999 


A 


1105 


2 


378 


YVDPRQCGGILKGVLKGEFTISNEKANP 
GRGSPSS VSKD/L* CQNIKTIESKKTCL 
IQKLIKVKAEIYERESRKTMEEINETK/ 
SQFFEKI /NKID* / PLARLTKKKKKTQV 
LKSEK\GNITTNHTEIKRV 


1099 


15000 


A 


1106 


291 


1 


VSIYVSYNLDSCKGQIRIKSCDDQYIFK 
RLYFFEITFLKFLN* K* VLKKIRDNVLL 
C/YPGWPQTPGLKQSSCLSLPSSWDYRH 
VPLHLAP INYFLW 


1100 


15001 


A 


1107 


24 


397 


D IDHVWE FETVFYHVGQAGLKLLTSDD P 
TALASQSAGITGVSHRTRPSSGHFNNTP 
EPPEASSLSSHPK1jHKSPVT*NGAGL/Y 
GSSKLLSCVLNGPISLVHSTLRIiRKGGP 
QGNISQISLMAPP 


1101 


15002 


A 


1108 


64 


414 


AQLT PTQ PQGRAALHKGHV * RKTAPTCL 
FMAEKNQAASFFLFPFPSICINKE/HFK 

KQPGGGLLISKILWAPPPGFFLTREGP 
rrlrr 














1102 


15003 


A 


1109 


62 


475 


FEPLFYLMCLLNLFPLQLPRHPFLFLTV 
DIiVNTWGCPIiPS S PQ * EWLLAAPHRSTP 
PPLSSGFPARRQLEPGAGARGP/HHTQA 
LHLS FF FVFLRRS L / DS VAQAGVQ WRGL 

nor/'iDT DDrtT?ty\ tt g otjt or tjot tv 
UoLiy rAjfJrVjf v \ JLliijbr'Lioi-iFSJjj. Y 


1103 


15004 


A 


1110 


76 


477 


EEWKKLYRWRENISNLISDKBLIC* IN/ 
NQTFNIQPLKTKNPTKDVNDLKTFTNED 
TQKYTYIHQ*R\SHMKR*SP*LFI\KMQ 
MSTALRFLYPPITMDRIQIPENIKFWRG 
CGILIHSW*TRKMFHSL* KAFWPF 


1104 


15005 


A 


mi 


1105 


3 


KEE RS ETLAKGKL IAAG A/ YN * KE ERSQ 
INNLTLHLKGGKKTNDKPSKR/QE / 1 *Q 
IRAEISRIRPRKKKEKNNEFVFLKINKN 

ATEMKKITRD*CEQPSSSSSSSSSSSSS 

ssssssssssssssssssssssssssss 

SSSSPPKKKIPGPDNFTGEFYQTFQEEL 
MPALCKI FQ\NVQKPEGD I / PNPFYQVN 
IYLVPEPDE/QPFVIKTLKELVIYGN/Y 
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W=Tryptophan, V=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














L/K* IKr IHEKIiTANIIIiNGEKLKSFPI. 
GSGIR*QHLFIiPLIiFITVIiEFLIiRI ISQ 
EKV I KRMQIGKEKV\ KI IiFTDNRM* WV/ 
ENP/VNSQNIGKIMKFNKVSGYSNTHKS 
VAFP * TNNEQS ANKI LK 


1105 


15006 


A 


1112 


37 


466 


DRAAEFPTENPLELISKASKVEEFNVSI 
QRPIVFIiCISNKQLENEILKIQFH*/PI 
S VAS KNS KYGG INLRKYL* DMH IDNK/ P 
LNKNRGGLCSWSATSVLPKLIDNFSAIS 
GOTSRFFFFX3EIEKIjNLKPIWKYK*VRV 
AKTVSY 


1106 


15007 


A 


1113 


33 


436 


PGLN* * CWKNRISTCKRMKLDPNIAPYT 
KITS KWIQDINI RPVTIKILKENRGKGL 
QH*MWQ * FF* I * PQKQE * TNRTKSDFI K 
TAQQRKQQNERQSVEWEKI /FANQISNK 
ELLSKIKFRLLPLDNNKQLT* KWA 


1107 


15008 


A 


1114 


2 


391 


PLTHS RMAI KKETK \NYKCW Q * SGG I GN 
LGNCW* ECKMF * PLWKrVLQFLEKKKRN 
TELP*NPLIPILGIH/YKRNVCIQMFTE 
ALFIITK/ swkqpr 


1108 


15009 


A 


1115 


3 


458 


I RDPIiEEAVCP FADIiBCLHAGRTTTLFRA 
VRQGPLSL*KLLLPFVQLSHVPRGGVYR 
GSQAS LS CGGLHP VQAS * LLCLPTQAS A 
MAnAP\PPASLPPCSSISDCCASSERGS 
V04GPSEPGMGYNLI*VCRLLRPLEKPS I 
RVGVSLFSRYHM 


1109 


15010 


A 


1116 


204 


13 


PKMKTDSRWILGLFF*FLRQSL/SSVIQ 
AGVEWRNPGSLQPPPPRFKQFSLENQGI 
PPHGLWS 


1110 


15011 


A 


1117 


34 


454 


IHISWEFIiTSKLYGTLTSQYNKKTNIIi 
IK*MGKRFNAHFT*EGI*IVNKHMKRLP 
TSSVIREMQTKTTVGFRSVLTREAGIKQ 
TDS/NQAWVRKWSNSTLLCCWWKHRVLQ 
PTWN\TAWQLLIYKMIWQLHFYAQKDIY 
VN 


nn 


15012 


A 


1118 


83 


451 


CFLKFFLYRELISPFLL*Q\FVRPSPAF 
RRKPPPWVAFFFSDPS/FSFLVFFPKGI 
VFFLGDALKKVLT* KKNFFFFGRD*VLL 
CCPGWSRTPALKHSSCIX3LS RCWNSTTR 
P 


1112 


15013 


A 


1119 


316 


441 


FIFFFFLKTNFNFFAQVGGHNRNLG*LK 
LPIiPGLKQFSCLTL 


1113 


15014 


A 


1120 


67 


312 


RQIDQWTRIES*ETDPREYSQLIFDKGA 
NPIQWRK\LFNKWFWHNWTSTCKQIKKN 
IiDIVLASFTKTSTSHRIiRENLYRIHI 


1114 


15015 


A 


1121 


27 


490 


GTQLHSREKKNSPFNK* YWEN* ILTCKR 
MKIiDSYFIAYTKINSKWIKDLNVSLLGL 
LAKTK*SPKCKIQNY*TSRRK*KGKDLG 
Q * FLG YDTKCKATE * KNK/ LNKWDYI KL 
KNFCIA\NKTINRMKRQPTVWEEI IAN I 
ISDKVLISRICKKTNR 


1 115 


15016 


A 


1122 


490 


48 


EHTNNKRDTLTVD IGKVLVI * TEDQTSC 

XT /TDCWrtCI.TnCWST TT PMC T * a 17D/"2.T?T7 

ATE/EKLEASKGWFMR*KERSHLYNIKV 
QGEAAS ADE H1ATAS YQEDLAE 1 1 DENG \ 
KQQIFNSDKIALHQKKM/ PREKSMPHFK 
VSKDRLTLLREF 


1116 


15017 


A 


1123 


2 


259 


I YGQLI FNKGVKTIQ * RKNS VFNRCYWD 
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C\z=f~* In fa m in o D=A rat ninth C— Q««i_» 
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N* T S I WKRM \ fiUDP VIST PHTK T PfW T Y 
DLNIRTKTLICSSWRKKKRENFLKKKRK 
KL 


1117 


15018 


A 


1124 


206 


24 


CSIFFCIYLFIYLFIYLFIYLRQSIi/DS 
VTQAGVQWHYFS SLQPLP PGT\GLF* RT 


1118 


15019 


A j 


1125 


3 


351 


RRGRGFT I LVWAQTPDLM/HPPALASQS 
AGITGVSHCARP ICLFFKRQGF/NSVAQ 

DYRRM/PAMLGFFVFYAYPGITLLPRWP 
WISGLNELP 


1119 


15020 


A 


1126 


2 


349 


STNHKDIGTLYLLFGA*AGVIiGTDLSIiL 
IRAELGQPGNLIiGNDHTYNVIVTAHAFV 


1120 


15021 


A 


1127 


362 


2 


RVSLHCPGWSQTPELKLSSCLSI*PKC*D 
YRCBP PHPAYLTS F I TEDS KNFEYVTTL 

LSTFF*S F*N* FFVCLFFRQSHS VAQAG 
\VQ*CDLGS 


1121 


15022 


A 


1128 


1 


349 


IYFLTRSHSVSRAGVQWHSSGSIiQP*PP 
RFKRSSPPLSIiLSSWDHRTWG/HMPP/H 

QQQT7QQQ7 .ET .13 <3 ClWt~inVn\ TT HQ 


1122 


15023 


A 


1129 


367 


24 


INCVFYPCKC3VTKKPWYT/HF*RKGF 1 1 
INIFHGPFKLTFVCPEGGKNSETLYPFC 

QGNYGVGWVGFF FFS AFVI F I KKLVLS P 
YYKN 


1123 


15024 


A 


1130 


61 


299 


I PRVDNYCGYVREHLFSFL FFFKRSLV / 

<l\7TY'\an2i*lJDrjT»ClT.T.riDT.DD^2'G , Vrtl?\ CP 
ov XUnvj/i" riKJjjjoljJjy tr JjJr Jtrvrr r \oL 

FNLLKNHC YRDVPLLLNTLCI I FKT 


1124 


15025 


A 


1131 


2 


261 


FFFFLVFSFFLLRIiFLLPSPSSPPPSVF 

DLGSI*QPLPSRFKRFSCLSLPSS*DYGR 
TPLW 


HZ J 


IjUZo 


a 

r» 


1 15Z 


2 


Jot 


fir irrr oo^iEocoiiiyAvjVnnKJJiij/ 
SIAPYASWVQVI ILPQPPPPGFESFSCL 
RFPSTWDNRHAPPRIiAKKRNKI 


1126 


15027 


A 


1133 


2 


221 


PS LLKKLARHSGRCL * SQLLRNLRNENR 
/CVPAWVTK /GET P P <5 KJC1CICKNP P P RGP 
GAQTFETPGPGKPKGKIKV 


1127 


15028 


A 


1134 


331 


2 


RGPPPPLKEPSGVFRIFQFKNLESRGGG 
ERGKQSPPLGPPIRRGLSPKPPPPEKGL 
TVFFFFFFRCKDGVSFCCPG/WISRTPG 
LK*SSCLGLPKCWDYRHEPPHPAPAL 


1128 


15029 


A 


1135 


3 


391 


AVNTQMMRK* KSLIADVEE V* WWIEDQ 
TSHNIPLRQSLVQSKAIi/T*RAKPSMKA 
ERGKEAAKGKLiEASRG* FMRFKENN\ RTj 
HN I KVQGDTAS ADVEAAAS YS * DQAKIT 
' GGCGYTKQQ I FNTDETAFY 


1129 


15030 


A 


1136 


107 


350 


TYSIVAIVMRFYFFI IFYFLRWSLAWI 
O ACCCHPGNWHDGS LO PLPPGFHKFSCTj 
SLLSS*GYRCPPPCQA\IFIFLVEMG 


1130 


U\JJ 1 


A 


1137 


jj 


2Q7 


TLMHYTWRVKEKNII ISIEAEKAFDRIQ 
HPHM/ 1 KYIiNTI KAVYNRPTAS 1 1 LSG* 
NLKAFPLKS VTQQGCPLS PLLFNI I LEV 
LSTRP 


1131 


15032 


A 


1138 


378 


1 


PFTKINLKW I IDLNI KHKTI KLLEDI IG 
ENIiDNliGHDYDILDPTPKA*AMKKILIN 
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VALC*NHESFCSENDTVIRMRK*ATDRK 
i\xvl\\jUnjL IVJJU X ojnA-/ via nAfr VNNMiUum 
NPIYKWAKDLNMIiL 


1132 


15033 


A 


1139 


271 


361 


KW/YWNPIHIISQVCL*GPEiyQHVYGQ 
MTKQRCQGNSM/WAKDSLFNKWC*NKWI 

DC /OT*KTT MPVT fTtryvrrj vr IT *• T7TVT Tflnn 

NITVTTKLLEBNMRFSFAFGIGKD 


1 1 ^7 


1 KfYlA 


> 

-ri 




25 


340 


ljljir* xiSJiCjr lflLioKXjVo 
TSASHTAGIADGSHHARLLFFIiEK\EPA 
FGPPAGRKGAKFG*REPSPPGFRGIPP/ 
LPPPGNWD\YGGPPPPKPNFGFF 


1134 


15035 


A 


1141 


94 


128 


GSQMPRHL VD * MTRHLATLRES \CYSR\ 
VYFRFIEFIjHFDIQSTOQKSHR 


1135 


15036 


A 


1142 


85 


492 


VWVGLLSLEGSPSKFGNFIEFGVLLSSG 
GFSAWRLFFFVYFLRQSL/NSVAQAGVQ 
QWRDLSSLQPLPPGFK*VLKQRGVCLFV 
CFETESHS IAQAGTQWCDXjGS LQPLS PE 
FKRrSCljRNSKSJjRKDXEPSEGNQC 


1136 


15037 


A 


1143 


372 


3 


KMNR*PISIKBIDFIVKNLPK\KFGPDG 
FTGILY\RHFKKEIIQTVCNLFQKTEKE 
GMLS \ I YKAS IAQ I PKPEKDKQTKATD/ 
YKPSGIEAKIINRILANGIQQYI\HDQV 
RFTPGMQGWSNIQI 


i tin 


1 CAOO 
130 JO 


TS. 


1 1 A A 

1144 


56 


482 


TMKTIiijYWQKWG*VDQWNGIESPEID 

K\YIQLVFDRKAKTCNV/RKDSLVNKWC 

TYAKKKCTLNLFFTPFTKITSKTxTiKNW 

N/SVNFLNTPLGKTLGDLGFPRDFKFLN 

TPKAKSliEKn/ MDliiNt JfKGKI CS * KNM 

VKKNKKT 


1138 


15039 


A 


1145 


190 


482 


RQGLALS PRLECSGVI IAYRSLKLVGSS 
DPPTLASQIAGITGIRHCPWPKTLFLAS 
V AMFAfcIN o w Jr o x*a Jf V t* b Ir \ o Jjir SPPxjIjA 
SQSRRSQ 


1139 


15040 


A 


1146 


138 


485 


IFFPSV*TIFLLLALFFS**MYLIFVKS 
SLSIYSVYQF/SFLCFWSPISDPKSQRF 
s loLtor X V IT vxf FKinxCV ljijtJxlFvanaAV V 

*S*LTAVSNS\LLKQSS 


1140 


15041 


A 


1147 


101 


426 


GDWKKFYIYKHSESKSPLILFEKKKGVI, 
EE YSSFD I * * AI KVI YMSRKRK\KNQL 
ANLTDAEKAIiNKI *HPFMRKTTQQFSNR 
SFLHLRKGIYKKPTANMIIKEQIIF 


1 141 










1 c 
I J 


v?7\/f^.VTr , Tfirov^TaTrTriiJvvKn7Mr , f dt v ~ 

V V V V^JJ XLX v»JVHiU\ur XAJNJSJrV X X W IT luva J_>f 1j I 

MRS WDQNGTM * KMT I YAPNIGAP KYI E 
Q ILMGANRE VNSNT I LLGDFS I PLS TL/ 
DKELPELNFT*NKTDLTDIYRTFHPMAA 
KYTYFAGTHGTFS KMCPVGFDIRVILVE 
FRRSTYGLDW 


1 1A1 
1 1*»Z 




a 
j\ 


1 1 AO 


t 
1 


507 


iUjt't'iUr JfF x XiUslLr yCKaPiUiUrPP'SHPF 
10iPRGxU)P/xTCPRx^KPPWAKKKNPPFP 
KKKQRGKGPKNPPPWKVKPBKSF* PSPK 
KTKIWPPPPPGGQKKKKKPPPPKKKKKK 
KSVSSS PKNI / YRS LFCTNLKKS FPT* F 
* NIiMP / TKLPV* EFP PP P YNS F * SAP 


1143 


15044 


A 


1150 


436 


32 


NLCmVYMYLIYVRTYVCIYSIYVSIYL 
IDI^LYHHHYLSICWCVSIYPIYIiSLtlY 
LCISIiIYHLFNLClYISIYLIYLS/YVC 
M/SSI*SMWCIYLIYVCIFSISLYYLY 
IYLSIITIIYLYIiCMYCIiFYLSSIEF 
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1144 


15045 


A 


1151 


3 


657 


QDCKIQNEHKIQNTLFLYTRNAHIKIKF 
HLHSKKILAVNLTKHV*NFYAENYTTLM 
KEKI * IERNKDLHRKQ\NLNPCIWIVKL 

nivkipvi\phr*rsltipiiipak\fp 
gdkekiilkfiwkgkgir*lrf*kirik 
gglni*pn\lrltwtvt kiv* ywwkerd 

RYLNQ\WNKIENS/EKLtKPRKYVQLISP 
VFDTSG 


1145 


15046 


A 


1152 


430 


438 


tfwvkkffllnlpk* inpgfkkkknkrg 
g* krp / sgsqvlkklrgkialtpegkgk 
irtx:pcppppplrickkkppirp*kkkkk 
krk3udppfgnhysnsyrhtscyftllhl 

TLOI FFTSRSPVAMSCRlNIi 


1146 


15047 


A . 


1153 


104 


478 


ELLLEGSPCLRAESKAGPAGRLVPTIiLS 
WRVQSPGVLCAWEEECEEPPSTGEDAPS 
LAIRETQI KFITRNHLS PLSMAI FQTNN 
DN\RTKRK* CWHGCGE IG /T /LLHCWWE 
/CKLTFP 


1147 


15048 


A 


1154 


171 


476 


NSSDYYYYFLRQSFTLVPQAGVQWHDLG 
SLQVTSDDPDLR* FARLSLPKCWDYRRE 
P / RMSDSNYFLK* VPVN* KARA*LLFIP 
SSCQAG/ASAGQTLSVS 


1148 


15049 


A 


1155 


3 


562 


PDYQTARRRQAPADRCQPGPAATICPR* 
PKARSSSMWRSNPTVTI2T7U3P * P<3WfTTT3 
AASKSQTGDGDRKELI PPNRTPNN* VST 
AAIK*QVRPYLTQKVPAG*TFCPHQKDQ 
GVRPLCRGPP\RPFGSPAVTDNRQAKLK 
TPTQ/HPADPPEQKRPPARPPVGRHAEV 
KOPGP PFPAGPRPAPSTDGL 


1149 


15050 


A 


1156 


67 


417 


TLSCETQPGQHGEAPS / PTKNA* IALPL 
S WARPLYFPAPARFTARGGPVFTS KRVP 
SPPNRRGWRPPP\QFCHAWNPSPKFLAP 
KIPPT/GPLPPNKEPSKENVNPG/PLCP 
I, 


1150 


15051 


A 


1157 


446 


87 


PDCINVISCFLTKIYHVFFFL*RQGIiTK 
LPR£iVSNS*AQAILPP/SASQSAGITGV 
SLHTRPIMYFSSYYI IFGDTVLF\ETQS 
HPAAQAGVQ* CNLGSLQPPPPGFKRFSC 
LGLPSRRMA 


1151 


15052 


A 


1158 


] 


938 


FFFFIFATYLraKGK\LTGKRQSD*S IC 
KHLKGYYI KKDILTAS KH/ V/KKCPTSL 
AIREMQ ILTGI IKCC* RCEGNG IFTHYW 
*ARAIAQLLWKNV*QHILKLSMCITYNP 
TILFI/JISNRKVYM/ CCPK/DPCSRMSR 
AAQFVI 


1152 


15053 


A 


1159 


370 


3 


AF KAAAEG AAMS VTGGQ P VLNS WWLGG 
VTTTT/CPTTFIMPDNTV\RGDILELTR 
\PLGAQVAMAVHQWLDXPEK*NKHK/IiV 
VTEEDVELMYQKALNNMVQNNKKAAGIM 
YTFNAHAAIDEFHR 


1153 


15054 


A 


1160 


55 


471 


SPPPGIiPKKIFFWPLSFYWP*PQRFCP 
LFAPFKQESNPLKVWPPFGATjQNKRSKG 
CSPSM/SARQES* PIiFPKNLETPPFGGK 
FLKFFFFFF*DGVSLGHPGWSAI /idsl 
QPLPPGFKQ/SLCLSLPAS 


1154 


15055 


A 


1161 


3 


467 


GYTNQQI FP VDKTAF Y* KMPSRTFMARE 
KSMSGFKPSK/DRAGDFKLRSMLIYHSE 
NPRALKNYPKSTLPVL\KNKAWMKHFCL 
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• 




QPPVETYCSGKKI PFKILLVIDNVPCHP 
GALMBMYEEINVVNTATNTTS IL+ P/VT 
SGVXS 


1155 


15056 


A 


1162 


216 


464 


S SEGPGS PGELP YGAQEFVMKAPQG I LV 
IR*T*FFETESHSVAEAGVQWCDLSIiLQ 
LPPLRFNQFCXjSLP\SWDYRRPERHPAN 


1156 


15057 


A 


1163 


29 


433 


AVEFGGDSCSPQHPLNPPLGSPQHSPPP 
LIGICT/REE*GWGGGLPVPCPPALPYP 
STPSWGGICVCYGRGGVAPPP*HPPLT* 
LGGGSTPTSALPAPGAPRHPGHPLMVLQ 
TLPWGPHTPARKP YINKVLS CVDF 


1 1 V7 


13V JO 




1 1 f\A 
1 lv>f 


too 


1 1 
1 1 


STCLGLPKCWDYRREPPRPALRCPFFLN 
ILLFLKFTLSEINI PTPAFF * LVFAWYI 
FFSLFSF*P/DIiEHYI*SGICVDTM*VG 
SSCFFNPT/WSISSF*LVCLDHLYFFI,F 
FFFLRQSL/DSVAQAGVQGQDLGSLQPP 
PPWFKRFSCLiSRIPAHFEWSRA 


1158 


15059 


A 


1165 


467 


131 


NRSLEW AKMYKT YD E I KLVTNIjP T / RKS 
PGLCKITAKFYHIYKDGLVLLLNXLQEI 
QVGFHPNS* YQ\ I ILIPKYSEGTTKKGN 
CRPIFLVNIETKI IiHE XRAI * VHKQIRT 
LE 




\ JVUW 


A 








EBGVLKAKRGCKSGiljHDTVNOVVN /an 
LKKFLKE IKSDTPVNI *MIRK*NNLTAD 
KEKVA* IQDQTSNNIPLTQSLIQSRALT 
LFNSMKTERGEEAGEEKFEASRG^FMGF 
KEKSHPHKHKSASKAASADVBAVASYLN 
S 


1160 


15061 


A 


1167 


26 


418 


KIWDYVKQTNL*ITGIPERGGEKVNNLE 
NIFEAIVO /ESVPS IFKEVDTOEOEIOR 
ASSSSSS 


1161 


15062 


A 


1168 


64 


313 


KWCKGNSYISKDLKELKYLGV/QLK*VQ 
/DL*SENYQILLKEIKEDLNKWKDIPCS . 
* IRRLNIKMVIFSKLIYRFNAIPIKILA 
A 






A 




*T\yt 




OOAKAAESIA DPGGRGCSEIiRSCHCTPA 
WATEQSINK*SINQSIK*KSKLC/CWEN 
TLVKHIFHKRLTSRIYKKL\QQLNKKTN 
NSL*K*EKDVNRYFI KKI YEDI *MANBN 
IL\IKLVIREIQINLKEWLSWFCCCCCC 
FVLFVF 


1163 


15064 


A 


1170 


290 


487 


GSLPHHTPKR * PFLF/ CVFETGSRS VT* 
AGVQWHNHGSLQP* PPGLKRSSHLSLPV 
LIFLFSVEMGV 


1164 


15065 


A 


1171 


2 


413 


GKVFI * FSTLH/GCNMK* SRS \LL * ALR 
FIFLFTARGLTGrVLANSSLHITLHDTY 
YMGTHFHYVLYIGAEFAI IGGFIH*LPL 
FSGYTLDQTYS KIHYTI ILIDINLTFFP 
HHSLGLSGRPRRY*DYPDAYTTRYILS 


1165 


15066 


A 


1172 


381 


2 


PPKLQKNFFFSSTGKFFLGGGRAFSPPP 
KKGFFSQIPRRFFFPLPKKKKIYFCPPP 
CFGPPPTFFLRAPPFFFFFFLLLFCSFL 
VGVKFILFL/FYLFIFRDTV+LCYPGWS 
AWQSWLTAALTPRP 


1166 


15067 


A 


1173 


9 


405 


NLDKKGRNRTPQSWFQANPMASMTFSKK 
KKKKKK30CKGGGALKKKPXaGAQKKPGKK 
KKNFFLKGG\EKKTPRGILEKKPFFGGG 
K30GPNPPKKKKPLREKKKF*GEKGEKKP 
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♦KFFFKKKFPPPSKKTP 


1167 


15068 


A 


1174 


410 


0 


FSYYPPPTRGGCPSSPPQIFLPPPLLGV 
FFPFSPLKIFFFPRQFKFFGGWPFFSP 
PKKKVFFKNPRSVYKNPPKKEKK*SLQP 
PG*VWAPPGIFKR/PPPPFFFYIFS 


1168 


15069 


A 


1176 


319 


3 


KEARSVFRVEGRG * KNFSASGPPGGGNP 
GGGRPGRGNFVFLIKKGGPPPGPKG/SP 
FFDF/GGPPPPPPPSGGGSGVNPPPPPF 
FFFFEMESRSVSQSGVPDAWADAW 


1169 


15070 


A 


1177 


385 


1 


SAS FGLPKCWDYRHEPTRPASVS VTVTS 
SRIiSGA* ARSGKGTLVFWAQMVFK\R I P 
LTKYSDHSREPSSLQLCMQSTPSKAEFT 
VAKADKRLCCTAKSSTAKS IAQIKCQND 
AGRSPQESLHVSGRV 


1170 


15071 


A 


1178 


147 


808 


KLKEIKKLLEENAGINLYDLRLGSGFLD 
MTPKAKQQKKENLKWDVIRMKNSCASKD 

TITFYN\ yrsdkglv/ s kkykelns ite 

RQPNF* KDLNKDFSK/EQMAKST*KTTE 
RLFIREMKTKTT/ME/YHFLSTRMAKI * 
KDSNR/RLCRKTGTLIHCW*GTAPANLL 
KNCQPLFPP*GTVWQFLKRLNI*ELPDDP 
AIPPLVICPKBMKT/C/CYTEICTQMFT 
AA+III 


1171 


15072 


A 


1 179 


1 


398 


SRSRHCTPAWVRVTtiVLKKKEKEKEKEE 
KKRCPCY IVSHQSL* KPAWQFL/RNVK/ 
LELP YD PAI / PLLGKCLKE I * KYAYTKT 
CM*MFIALFI IAKKYKQLKRLSTDEWIN 
KMWYIHAI 


1172 


15073 


A 


1180 


3 


389 


GYDRVI PNP* PLTGA\LRALIiLTCGLAM 
*FYLQSMWVLILGLLTDTLTIYHRWSDV 
ARKS TYQGHHS PPAQKGLRYG IML YWTS 
EVLLLGGLF*AFYHS\SLCPTPQLGGHW 
APTGMTPLDPVEVPLLNT 


1173 


15074 


A 


1181 


354 


2 


ANSSDACILDGHRALPLQLPPCGYSISP 
AAQSSTVPMPLLLIPPPHCNRTPSLWHY 
SPASNPTNPSPY\*TAPSPPPPIPRNNP 
FS*KFPYYVWVYTSLTTCSQPPVSSPNF 
TQPSVL 


1174 


15075 


A 


1182 


3 


384 


GANVSG/DL/KLKPVLIYYSSNPR/ALK 
NYARSILPMLCKQEKKEKKKK\AWMTAH 
LFTAWFT * YTKSTGET/CAKEKI PFKIL 
VLIDSACSHPRALMEMCKE INWFMPVN 
*HS I /LFCMQPMDQGALSTYKS 


1175 


15076 


A 


1183 


1 


410 


PPLP*KFFSPGGVQS*KGGRFWPPP*VG 
VLPSSSP/IIFFTPGNWGCFSPFSP*KF 
FFSPKGFIFOGGVGPFFPPPKKRFFSKI 
PPLVFFSPPFKKKIFFFPPPVNFGPPRV 
FFKRPPSIFFF 


U76 


15077 


A 


1184 


406 


62 


PPIINMLCSLAPPFFSPPPLRGVLPPFP 
LKNFFFP*GVYFWWGGGPHF/SPPPKKG 
FFPKFPP WVFNPP PLRKK/ YHNFPPPGK 
FGPPRVFFKAPP 


1177 


15078 


A 


1185 


337 


383 


RSYISFQK*VKDLNKHFSKDDTQMAN/K 
HIKRYSTLVIREMKIKPMTRYHFTPTRM 
AVPF 


1178 


15079 


A 


1186 


3 


471 


LAPSDK* ENRLPGNRPL* EVRS PS ARQP 
PHLRSEBPLRPAATPSGK*GASPPSSHA 
VREGGGGQPPPGQPPRPGGEGRLCPAAP 
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TGK*GAPLP/EPPPGLGG 


1179 


15080 


A 


1187 


2 


406 


FLVETEFCYVGQAGLELLTS RD PPAS AS 
KGAGMTGVSHQVQPQ**S*LWT*/PSSV 
EAGTS FGLS FLSS S WALS AQEGCLAVPS 
/SGSRGLLVGALLLWTKPSPQLSFVPAS 
QRLSSLSLMPPLPQPQHLTHTSIET 


1180 


15081 


A 


1188 


160 


459 


NFMT I DMLCS AWIHFCS SGLD FQL * KL 
FKSQ * ENNLINKWAKDLNS FT I FSYG* K 
LAHEKMLNI I RE I YFTAI MS * YTLTRTV 
KVKKTDTKCW * GCTATGNL IHCCKNV * P 
LRNTAWQ\FHKIUTIYLPYHPAIPLSDM 
NSRE+KNI/CHAKMCAWIFI 


1181 


15082 


A 


1189 


232 


2 


KAPPPFFFFFFFFFFFFC* SGSHSVSWA 
GI*W/PGV*WHHHGSLQPQSPRLK*SSC 
LSLSSSWDYRYVPPHLANLKK 


1182 


15083 


A 


1190 


2 


402 


PRVRHASGSPSPPPPPPGL\SHTSPSQ* 
VFSWPSY*TPCLSALTLASVLSLL*QRS 
PRTLFITOKC/DF/PASHSSCRIPAGL* 
ALGRQGIiFSCFFCFFETESHS FAQAEVQ 
WYNLSSAQPSSPEFK*FS /CLSIjPSS 


1183 


15084 


A 


1191 


19 


390 


WCVPAVPATWEAEPRRSR\RS KPLTGRQ 
SKSLSY/NKKKKKKKKKKKKKKKTGGGP 
* KKLLGGPKYKGGKKKKFFFFKGEKKKS 
IjGGILKKKTFFWGGKKWPTPPKKNKALK 
GKKKFLGGRGGNPP 


1184 


15085 


A 


1192 


485 


112 


QRPDPR \SAEAAIKYFLTQATASI ILLI 
AILFNNILSGQ* TITNTTNQYSSLI I IM 
AIAIKLGIAPFHF*VPEVTQGTPLTSGL 
LLLT+QKLAPISIIYQISPSLW 


1185 


15086 


A 


1193 


55 


385 


THAFADAWADAWGLFKG I LTENFPNLQK 
YINIQVQGG *RTSSRFNTKKTNSRDLI I 
ILPKVTQDICCjSKKKKKN\AENKKIIiKYRG 
GPF* * KPGGAQFYGGGRKSFFFFFX3G 


1186 


15087 


A 


1194 


408 


3 


CQSA\LLGGASQLG\SRGSGVRDPLEEA 
VCPFSDIiQLHAERTTALLKAVRQGHLSL 
PRLLLS F\ VCLCPAPRGGACRGRQASLS 
CX3GLHPVRASRLLCLRKRAWAMASVPPP 
ASLPPCSLISDCCVSNQ*DSTGRv 


1187 


15088 


A 


1195 


5 


371 


LFSTNHGDFGPLYliFP\A*AGVLGTDLS 
LL IRAEIiGQPGNLLiGNDHMYNG I VTAHA 
FVIIFFIVIPIIIGGFGN*LVPI J IIGAP 
DVAI PR I JbNl S F * LLP PS LLLLLACA1 T 
BAGAGTG*TVY 


1188 


15089 


A 


1196 


76 


404 


PTPLRTHDQS S KVSRYKVNIQKVGAFLY 
MLSTRTTGI * \ IKSTSFPFAS PKVKYLT 
INLTKCVQ/DLWlUfiKKIj/NEIKiilJQNK* 
/NVPFSWIKGPNIVN 


1189 


15090 


A 


1197 


50 


334 


ILHMVSIPSISIY*HL1 1 PAG*AGTHIG* 
LP PA* FFWVMG/ RDRVLLCHPGWNAVVQ 
S*LVVASNSW\VK*SSHLGLSKY*D*RH 
EPSYPASGTKLN 


1190 


15091 


A 


1198 


250 


1 


QYYLVSSNT* S I IDFLQLPQKC I FTVGF 
S VAQAGVQWQDLSS *QPLPPVFKQFS 


1191 


15092 


A 


1199 


2 


402 


TDARHHTWLIFVFLLBMGFHHVGRAGLG 
LLTSSDPPRPAKIiNFL*R*GHAVM\CPG 
WPHE \TCLGLP KCWDYRCE / HTAPKPHH 
FLFFFFFSEKNFIPVPRRGGG/WKNLN* 



601 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

Amino arid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
Iftct Amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
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MEPPPPG * KGFFCPGPPKYWN* KAP 


1192 


15093 


A 


1200 


50 


437 


STLYLEREKEQISLSVIVSSPLPSTSTG 
PTAAAQS S ISGPS LPLPPHNGAGDAPAG 
I*G /YGOGPSGPPWGPS /SGP^PR * AT. VP 
PTD PSGAARGGRGG\ SRGS CCAPAGPAG 
LGDHRPGGMGEGPAAPPKSS 


1193 


15094 


A 


1201 


86 


313 


PPPPGGYEJHFSFXXXXXXXXXXXXKSLL 
PPGKGPNPPKVGLPPFXXXXVS PKXXXX 
xTnnnc *ktpph c ? ppp aphtspp 

AAAAA Vt C t riO XT X IUUT V3VJIT AT 


1194 


15095 


A 


1202 


105 


485 


VQQTTMARIYVCNKPARSTPRFTKQILL 
DVLK/ YIDTQTI ILRDFNTP/LNSVMSW 
RQKTNKDNLDLNLMTjGQLDIiIDISRXIiH 
VSTR/ YI FFSSLHE I YS KSGHMLSHKAC 


1195 


15096 


A 


1203 


1 


476 


PHFGPPPPPQDPRF*G\GPSTRKGFFPQ 
PPGFFPQAWPRAGQPPPGGT/EPVFPPP 
KHP * RRNP YPKFGS PPLPGKIHQGSPGP 

Qnni?P \ PPPWOTTRT? 2XODTT?W2iUl>DDTl?T? 

FF*GVLLCHPDWGTVARSR\PPQPPPPG 


1196 


15097 


A 


1204 


1 


269 


WEFETYNLGIK/WSKDMKRNFTKEDLM 
MADEHPRRC\ *HHLTPPRTAQVKLELEP 
NE/WQGCGEAGPATPCWGDAAPVQPLLQ 
IGRRFLKK 


1 IQ7 


1 JU70 




tins. 


4DV 


42 


K*KNBAJGLPJ\HLFTAGFTENFKPAVETY 
CSE*E ISLQI *LLI /GHPR/SLMEMYKB 
MNWFMPANTTS ILQSMDQGVALTYEYH 
YISNILYEAIDNTENPLIDLGKANAWVG 
PR 


1198 


15099 


A 


1206 


408 


1 


EMKTLTEMSSPGMPTEKVSBLED/DIN/ 
EMHRKXiQCREKR/ IKRN/BINVQEL* YH 
YKRCNHHVMRMPEKEQRKEE I FKVIMAB 
NFPNLATNNKS KVKEAQRTPER I KTIYI 
YI YI S * LOKAKEKES I T.KETnOK'TSirP'K' 


1199 


15100 


A 


1207 


7 


383 


LDIGCFWWEDKYSSCYFSLATSGKLKYF 
SFSCXIHCFbVCVCVRAHK\CI*SRAS* 

CQCFLVCVCAQV/CI*SSAS*CVCVCTS 
GI * /CQCFLVCVCVCVC 


1200 


15101 


A 




*rvJ 


714 


AEAGRSLCPGGGGCNEL* S PPCP<3AWVT 
\SETLSQKQNPHHHKKKTGQVRGQS3YL 
PLWEVNR 


1201 


15102 


A 


1209 


384 


2 


GVTHH/ ARARFFC/IiLDTGFHS VFQAAV 
Q *A\NHGSLQPQPPGLKPSSCLSLPSSR 
DNK 


1202 


15103 


A 


1210 


403 


2 


VPS IQTNGKSLTYFDFFFKQLFKTKtiKL 
DASFVNVCVNVEPFRTFAJ^KTQSLLCL 
SLFLL I F *T INCNFCFLRON / LRSVAOA 
GVQWRDVGLLQPLPPEFKQFC 


1203 


15104 


A 


1211 


156 


405 


ESKQMLIQCKSQMLYC*RI *TV/ CLCVC 
VCVCVCVCVCLTCGTKRVCIVFLFIWL 
NCK/PLCEPCC* CNROGWAPWIiT.PVTQT 
RW 


1204 


15105 


A 


1212 


70 


408 


KAFSLLPPGVSPGPLCNPRRPNFIGGGP 
KKKPPPPPQRWTQGFFNPPPFFFFLWG 
G*GQD/PPPVAQG*MDPPPRPPK\GRGP 
RHEPPPPAF 


1205 


15106 


A 


1213 


410 


3 


KKSMPGFKAS KDRMTLLLEAQAAGDF\N 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
EXSlutamic Acid, F=Phenylalanine T 
G=GJycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=G!utamine, R=Argimne, S=Senne, 
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W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LKPVFI \YHSENPRILKNYAKYT/PVLY 
KWSNKAWMIANLFIA* FIKYFKPTVETC 
YSDKKIPFKILLLIENAPDHLRALMEMY 
KE IHAVFMP ANTTS ILQPMDQGVTIiT 


1206 


15107 


A 


1214 


3 


340 


KIRVEVNKMYNRKLLEEIKKI KSLFFEN 
INKIDQFLPRLRKTRQKMQ INKIRNEQG 
GISINIMEIKYSYKEML*AIICKKLNNL 
HEMNKFli / EHRVPKLTQVE I ENLNY I YK 
K 


1207 


15108 


A 


1215 


1 


57 


RGLKIQNVNEIKS*CFEKIKIDKFLARQ 
NFKRKRFKFRD\ * KGDIMADITEI *GI I 
RAYY*QL*GNYLENLEEMGKFLVTYNLP 
KLNH/DIENLKKPVTT/REFKSVIKSLP 
IiKKSPLHDSFTAQFYQTFEEE*VSVLKR 


1208 


15109 


A j 


1216 


350 


1 


QSSFFSIYPNFHLLSFLFCFKDFLFFYA 
SVLATNSLPLFFPQISRYFP\PFERBRE 
RERENVVL\CCLGWSAVMQS*LTPALNY 
W\VRQSSLLSIiQAS*SYRCGPPHPANIF 
HFIFCR 


1209 


15110 


A 


1217 


139 


358 


KNTFFVVVEAKRLFVPQAGGQGGNLSFK 
FPAPP/NLRG*SASSPS sgdyrptppcp 
ANFFFFKKNRISPWGSGW 


1210 


15111 


A 


1218 


361 


50 


WGDHGSLKPQPPLRSKNPPPLASLVTGT 
Q* LG/HPVAPFP IQLIFFFFFIiRRSL/D 
SIAQAGAQWCDLSSLQAPPPRFTPIiSCL 
SIiKKKO^PVFVITWMNLEDIMLS 


1211 


15112 


A 


1219 


1 


361 


LKQHVSNVE KTAFYVJKTQS KTFYARE * K 
GEILSSNRGECLLQSFKGQPLLLGANAA 
GD FKVK\ PMLLDHS EN / PKALKNYTTS 


1212 


15113 


A 


1220 


292 


362 


HI PWPAI QQAEVGGS LEPRRSRP * PPR 
LK*SSHLSIJ^SWDYRNVIiP/RLANFCI 
FLCTVR\ FKIiLGSNHQNAS VSQSARITG 
VSHHAWP ILKLKKLFVCliFEMGSHYVAR 
AGLK/PPGLK*SSRLSLQNCWESRREPP 
HPDTIH 


1213 


15114 


A 


1221 


372 


3 


RLFFLLPPRPKGDFFPTUjIW*RHGFSP 
PHVFKPPPIiNLILGALKKKFFLPS/LPY 
VKFYFFKRAPLFFF/lFFFRDRVSIiCWS 
* PPGLKQYTHIiGLPKHTOYRHESACLiAN 
NHNNNNTNFFFBTE 


1214 


15115 


A 


1222 


3 


291 


RSGDQDCPGQHGETLSLLSLTFVNLSLI 
CML*TLSLIFVKIQKLAWHGGVRL*SQL 
/LRRLRQENHLNQEG\DCIPAWATEKDS 
VSKKKKKKKGGPF 


1215 


15116 


A 


1223 


379 


19 


HMQILTIMRYYTPIRMAKMKKIGYTK/C 
W*GCAATGYI\WEO0WQSL/WQNTWM 
S * KLNTHLS YDIAIQFL/ GFYS KE I KAY 
IHTKPCT* MFTATLI I KSQI /MKQYKCL 
STDKWIKQNRGIiI 


1216 


15117 


A 


1224 


387 


40 


KKTLSTP\PEKHPTPPI*FHPPPQKGGKA 
PPSFKPPPPKHPPPPKGISPPPP* IPPP 
PIYPPPPPAKPPPDFFL*SPPPPPPFSP 

IiLV 


1217 


15118 


A 


1225 


21 


477 


IVSRLITVRLQKPRLDPRVRPRVRRKEN 
YSSV/SFINIDRNVLNKIL/SSQIQQYI 
KKLIRHE*FWF/IPGIQGWFNILKSVIV 
TYVNK\QKWKKHT 1 1 S VDTEKAFDKI HY 
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LALILIKKKKKKKRALPQSFIPGDLFKK 
PNGGFPGLKMMGRAPGE 


1218 


15119 


A 


1226 


1 


398 


ERINHTLFFLAEAQKRLPTGISGRGRPA 
I PHNTPQRAPDHAYLPAALAAQHRAGGQ 
QAPPPG\SS PSSPYDEVKDREGDVTASH 
GLRGNGWGS P * ATSLVLNNLMYMTAKYG 
DEVPGPEMENAWNAIANNEKW 


1219 


15120 


A 


1227 


3 


238 


DAWVAGHDG/ RTP*SQLLRKIjRWEHRFS 
SGDRVSDP*SCLCTLAWVAE*DSSSKKK 
KGGPFKGTKFNS RGGGRNYFFYGA 


1220 


15121 


A 


1228 


207 


2 


NRVSPCCPVQWHDHSSLQ/ PRTPGLKGP 
\SASAF*VAGTTGVHHHAQLIFHFFFFY 
♦DRVT QAGVQWRNL 


1221 


15122 


A 


1229 


15 


413 


RKS VNVI IHHS E I LKKKN YMI I S I DTE K 
SFDKI * YPFMTKLSEN*DSLNLINNI Y/ 
AKP/VANTIIiHSKRLNAFPIKQGCLQGC 
\PPFLFNIILEVLASTKGRKEIKSLQIR 
VEEI /KPLSLFADNM/ IVY I ENPKBS 


1222 


15123 


A 


1230 


363 


1 


AADPFACRPVS PHPPLYLVTSSQSLIiTS 
VSPKKQPPSAGMQLGVGDSLSGWGWGRT 
KKTRLFFFKTRSLSAAQTRRQWCEHRSQ 
P* PPGLIKRSSHLSLPSS *NHRHMPPHL 
AN\LCLFV 


1223 


15124 


A 


1231 


3 


402 


QANS CI FSRDFGSVGQAGLE PLTPGDP P 
ASASPRCRDYRC\GPHAQLLI,TFC* ISI 
LILV/CIS*MKDYFITCIYFFNYS*QII 
FYRRASDFFP FLRQGLALS PMHDHGSLQ 
PHPPRLN\HPPTSAKGVTGT 


1224 


15125 


A 


1232 


3 


396 


FLSQHGFLFLFFAGIDKLILKFIWKDN* 
NNYEKED*RGVITLP/QYKAYSVATVIK 
MVWHWQSDKHVDQWNRBPRNRPNYYMSQ 
*FPIa*YVSQRNENLPLHKNPYMNVHNGF 
ICNSRKQSRYPSTGEWVNKL 


1225 


15126 


A 


1233 


3 


395 


IiPKCWDYRR* PPHPALF/LFF* KHPKFP 
KMQVK/WRKH*TENRLVMYFSEKFGD+L 
KDQHKLYYLDMAFQFIFTFTYVMRYS IL 
PKFNIR*LTLSSVKMAVFLVETGF\TML 
ARMVLIS*LRDLPTSASQNAGI 


1226 


15127 


A 


1234 


416 


2 


KIDKLILTFIWKCKRPIjLAKIVIiKKKNK 
I*RLPLPSFKTYGKGMV/TK/TKWGWPK 
NRP I KGTGF* VQKKRTPHI YGQLVFDSG 
ARTITN/WY*DN* IST*KRMKLDLYLTI 
YKINSKWIKDLNVRAKTMTFLEENIWVI 
LD 


1227 


15128 


A 


1235 


384 


22 


FFQE I *NAI PVNTQMVRKQNSHPANIEK 
VL/WWIEGQTSNNIPLSQSLTQSKALT 
LFNSMKA/E/RGDKAAEEKMETCRGWFM 
RFRERRHVHNIKV*GEAARSCGSLAAGS 
PLKHLLLHNTLIC 


1228 


15129 


A 


1236 


379 


2 


SPYTKIHSKCIiKGLNVRPIiTFKPLEENP 
G/VMVPDLGPGKKFIS *APKAWATKTKI 
IQWGYIGLK/AFCPAKEAIPRVKTWPSE 

KKKDLILKLDLLSADIS 


1229 


15130 


A 


1237 


1 


407 


YRVGQAGLKLLTL* S AHLGLP KCWDCGR 
EPPCLTIIAL*SVFSLVLPAVLIKLINF 
F\CRDGGQTMLPRLVSNSWPQMXC/LPW 
PPKVLGLQ 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylaianine, 
G=Glycine, H-Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Prolme, 

lla.llltQmino D A rninino C^Qarma 

v£ vnuLamiiie, jtvr-/\rginine > o^ocrinc, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X= Unknown, *=Stop codon, /^possible 

nurlAntido flalnf Inn \=n<\ociK1^ 
UULIcUUUc UclCUUU) \- 1/US5IU1*; 

nucleotide insertion 


1230 


15131 


A 


1238 


3 


380 


ELSQRTCCMDIMS YYKAIVITMAWY* FK 
DKQRVQQNEVESPKTLLYIYRIiWIYYEG 
DTTDKG*TF\NKWYWDSWI /st/cknm* 

FD YHFTOCT K TM /SOWM 


1231 


15132 


A 


1239 


1 


396 


FADDMI VYLENLKDS S KKLLELVNQFS K 
VSGYKSKV/NVHKSVALLYANSNQAE/Q 
I KNPTPFTIAAKKQK/ ykqknnlgi y*t 
TrR\nrr)TiVrrRTffVTrTQTiTif rt TTYnAwrcwrrvT 

IVO V JVJL/XJ X IyCti.1 XAlOJJIUil X lAL/rtl.NI\jroi\.X X 

PCSWMGRIDIVKMTILPKAI*RFS 


1232 


15133 


A 


1240 


392 


58 


SFSMLARLVSNSR/PSRVLPASASQSAE 
IIGVSHx7VWPSKLSF*LTIDQTHLSCNL 
T7TWPPBPP c ncvmafVznwvrsHQQT^P 

*TPGIiKQSSCFGLPKCWDYRLEALPRLM 


1233 


15134 


A 


1241 


500 


204 


SIiSLSFS *DGVLLCLGWSSTPGLKRFSC 
LSXjRSSWDYRCVPSSQTNF\VFLVEMGF 
HHVGQAGLELLTSSDNARIiGLPKCWDYR 

Xvw Jr Ir JUrivU E LIE E XVkZ> 


1234 


15135 


A 


1242 


2 


397 


NFM INNLPKKKALCPVVFTGE FYLPFKE 
ETIP*VIRISLSL/IFSIIFQKIETEGI 
LPNSFYEGCI I I»I SKPYKD I * \ENYRLT 
SVMHIDAPJFIiDSILANPIQQCIKVIHCS 
xi VAX v/ 3u i uun r in iui\on 


1235 


15136 


A 


1243 


12 


362 


AGFYHVGQDGLKLLTSSDPPASASQSAG 
IKA*ATVPGLSPLNFCEVRFMESWSWKR 
P * PT<5 \ Vf*<3 AT. AKPWTVRW A QOWnifT.P 
FPT/RM*SPCLHPAVSAFCGSASVLPGI 
TVTTSV 


1236 


15137 


A 


1244 


183 


383 


KQAGRGGSCL* / LPRCWDYRRELLHLAF 
MPG* LKK/ FFCKDGGLTMWPRLVIiNFWP 
PVTT.LPPPPKVLG 


1237 


15138 


A 


1245 


410 


2 


FSRDRVSSCWPGWS/PNS*PQSDQLALK 
CWDYRCEPLHPAFC*MNFIiKVSC*HGTS 
/DSKY/CQHVSPKNKDISLC3^YNIIIIP 
KKFNIL*YIWFIiDFFFFETEFCSOCW*D 
T/?QT^iPPPPr;Fir* fqppcit.p^^wvhphv 


19151 


IJIJ7 


A 


1 IV 


Add 


to 


KTEGANnJKNxTYQ/NLWDADKAVFRRlC 
FRALNAYTRK* EPA^LNLSSTLKTRKJCE 
QNKLKADRR/Q IMQI * * KLGKVENKQTK 
TIQQIH*TKSWFFEKISKIYEL*QA* *R 
KKGEKTTITNVRNKRYSGRVG 








1747 


xoo 




RKCWPGAVAHTCNLiXTLGGQGGRIT 


1240 


15141 


A 


1248 


3 


391 


DAWADAWGSRSRAVAlxFFFFWGFLGGGG 
lxKTGFYFIPQVERRGLNFD * WNPPPPGL 
RGSSPPT/LPKQWEPGGGPPAPSNFWFF 
FEKRGFPQVTQAGFKLWN* GDPPAGPS K 
GVGITGGTPSPHPLFLKKR 


1241 


15142 


A 


1249 


3 


323 


MHHHA*LVLVFFCGDKVSLCCPG*S*TP 
DLKRLSHHGLPKHWDYRC/ RATTPGFFS 
LFFFFFFNPPEFLLPWAPPSYSIiKQNKI 
FLKRFLE PLAP P KfJT.PT .TTRATKTR T 


1242 


15143 


A 


1250 


3 


405 


QPDS YS PQPGRLSAP PEQEGGPWLILPH 
ACAPSQICVGPHGAS ILS YQERKGTKVL 
SCF^HCKIiSSPVGLVGQSFCWQQPDAVQ 
WVPFRRRTQTPGTVAHACOTSTLiGG*GR 
RITS/RPGVRDQPEQH 


1243 


15144 


A 


1251 


407 


2 


PFLKKTKKELP*GPAIPALGVPSFFP^F 
FLPYKRKEIKGIRTPPMRIAALFTIAKI 
LNQPKYPSVDI ♦LHKRHTSNTTLIiSRMK 
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K=Lysine, I^=Leucine,M=Methionine, 
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W=Try ptophan, Y=Otyrosine, 
X=Uo known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














H*WIKKT* / HTDMME YYSAI E KNG I LS F 
AATGMS P KD IMLS E I CQAQKNRY 


1244 


15145 


A 


1252 


1 


253 


VADVRESLDPGD* GCSELRLCHCTPAWA 
T\SRTTSQQQLKKQNENTT * KTYGIQLM 
NCQIAOTCSKLLYTRKMSNLQSNLKKKK 
K 


1245 


15146 


A 


1253 


3 


372 


KKKAQKSGTIVLPCNPSYSGG*VGENA\ 
WARSQPGLIS*KQQ*QQQNQQQTTSKTD 
PYiyG/YLIHDRGGK\NTQGWDRLU3RW 
GWGIWLMTQRKIKLELYLTPYTRTOSTW 
I KDLNVAGCGGSCL 


1246 


15147 


A 


1254 


2 


410 


KNXiTPIVGLKAATN*ESIPPQT/SRKKK 
KKKKKKKKRGGALKKKKPKGGGGGKKNP 
. FKGGKKKKLGGGVKKRGBGKKPGGKKKK 
RFGKKSFFSRGGKKKKNRSSSSSSSSSS 
SSSS 


1247 


15148 


A 


1255 




*#■<>/ 


DPNCBMQKWSVNGVYRIGLYALKDMPAG 

GIIGGKSQRVNGLTSSKNSQPMATHKKS 
GRSY/RE* KKNKKKRGGRFKGSQFSSPG 
MQG 


1248 


15149 


A 


1256 


118 




MFIAELFI IA/RRWK* PKCPLTDE WINK 


1249 


15150 


A 


1257 


390 


1 


TQKNTPTPKPKPTSQQHHQKNKPTPPGF 
FFFSPPGK\GGFFPSPLFWVPPGFFPPP 
VFKTRPPEFIFGAP*KKFFFSPPRSLNF 
FFLRGPPS FFFFFFFFS * VGKEGS SPSH 
ENPLFVPTEGW 


1250 


15151 


A 


1258 


378 


3 


GAFFFFFPPPKKGFFFTPFEFFGPGFFF 
SPPFFIPPPQIFFFGPKKKKKFLPPPPQ 
KI FFFLRP P PLFFFFFFFFFFFFFPF P P 
FFFL\DLD*NVLSGTSAPEKNQELLGMV 
AYACNPSTLGGRGG 


1251 


15152 


A 


1259 j 


462 


287 


RDG\FHHVGQDGLHLLTS*STRLSLPEC 
WDYRCDLPCPAIPAVTIiYQIQYRPLGLE 
SKA 


1252 


15153 


A 


1260 


446 


14. 


NVRAET / IKTLKVNTGVMLHDIiS* /GKA 
FLDARPKAQVAKBKSRQPSLHRRLR/FC 
AS KDP IKEVKRQPTE/ WEKINHVSDKGL 
EFKIHKELLEKLSNKPI * KWDKDLNGHF 
SK/ELQIASKHMRRCSASLVIRKCNEIV 
DPD 


1253 


15154 


A 


1261 ! 


2 


383 


GSQRKWFLEMESTPGDNAID I VE VTTKG 
FNYYMNLVDKAVAGFER IDSNFERSS S V 
SKILSNSIACYR* IFCKKKSQLMQ/QTS 
LWFYFKKLPQPPQPSAATTL I SQQLYTL 
RQDSPPAKRL*FTOGL 


1254 


15155 


A 


1262 


476 


50 


FFFFFSDTGSHS W* AAVQWHDHDSLQP 
*PPGP\SDPPTKSSTKEF 


1255 


15156 


A 


1263 


143 


472 


TGAVPIRPSWN/RPPAMIFF*NAQGILF 
AEFLASQRAI LWE FFEKAYHES VLRKS A 
KGIiAEKCPGKLHQRVLVQYDNALAHFSH 
QTRTTKSSTRQFR 


1256 


15157 


A 


1264 


179 


489 


YIFFFLSLFFFLWPRPEYRGAITVHCSL 
NLPGSSDPTASASSVAGTKPH* YI FFFL 
SLF/CFSYAQA*VQGGNHSSLQPQPPGL 
K\YPTASASSVAGTKPHPIjIFFFF*KKK 
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OCysteine, D=Aspartic Add, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, flNHistidine, I=Isoleuci ne, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
QsgQufanninc R~Ar pintnp &=S^Hn^ 

>C wuiaiumcj l\ r*-l gm IIICj O OC1 lire, 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














FL \ CFVPQAGGQGHDFG* L* PPPPSLKQ 
GGGLTLS RIGDYRGPP 


1257 


15158 


A 


1265 


1 


536 


FRGGWGSVRRAPGTASCYLALAGGPPGQ 

DPS^AP/PSW*VPRPGVPRNVPGRGRVG 
QDGCGQAQDIQAAASLMADTAPDSPGSA 
• GSVRALPACVPEISGS/SGLPPGAALP* 
VAA/RPTPGRHVDTHPRQHS FCGGQEGD 
I RAFPTL YIiE VYPGPP 


1258 


15159 


A 


1266 


403 


2 


TSLHPRGYMRLLRQGFISAPCGYMRLPR 
GSSLHPRGYMRLLKTGVHRCTPWVHAPS 
QTOVHLCTPWKHTPS*RFISAPLGAASH 
VT/GSLS IQHIY I FVFCLLRQGIj/NS VT 
OAGVOWCNLGSIiOPKLPOAOV TT.TK 


1259 


15160 


A 


1267 


2 


401 


FVLNPGGRSCSEPRSLHCTPAWATRAYL 
QLGKKKKEKKKERGEENKNQGPPRPLKR 
EVGTPGQKKPL/WGGSNGAGQKQPA/QK 
KGGKKKADHKGQRGKN* KRKREGGGSTS 
KNNS RGTGAKA* NPTT WGGGGinrr 


1260 


15161 


A 


1268 


49 


416 


UIGI^DPRIJ^RECGDLGAPPAPEVALR 
AGTCWTR/CTL*APPRGA/DRSPWPPRS 
PMCKAG/DECQDGIPG\MKAWSCGLRTR 
QCPWP * PKLPGGPGHPAWRT * PLPQTA/ 
CGPG PAAPCAG 


1261 


15162 


A 


1269 


420 


47 


GPPGWASFRLNFPKARR*GEWKTPGESG 
GAFFS PPGKNF\AGQN*GRPPNPPPPGP 
GKGGIQTRGGAGLGKNPFRFWGGFPNPG 
NK*GGGTKKKEGP PPPLFFFFLNTDFCN 


1262 


15163 


A 


1270 


404 


1 


AEILELKNVTDII.KNVSESLNSRIDQAE 
ERISELEDRLFENTQSEESK*KRI*KNE 
ARQQDLENSFKKANLRVIGLKEEVEREM 
GVESLFKWX HENS PNLEKHIHIVQEGY 
RAP\ FNSNKKTSRHS I TNSHTK 

ivrvt null my a omw Jv 


1263 


15164 


A 


1271 


387 


2 


KKEKKKEVI IIFVLLLPIiK* FQNIQVWL 
VRELBKKFTGKYVV\ FA*RKI LPKPTQK 
RCTKNKQKRPRIHAPTAAHVAILEDSVF 
PGEIVGRRIRVKWTQQLTRVHSDKAQQN 
NVERKVQTFSGIDKKLR 


1264 


15165 


A 


1272 


1 


393 


FPJ4GRLPGGGAPHFSDGVAGQRRSSPHR 
RSRGRAEALIjTSQTGRLGRGAPHI SDDG 
RPGRDAP\PS*TGWQPGRGAPHFPD\GQ 
PGRGAPHIPDDGRPGRDAPHFPYGVAAG 
QRIiQSRHFGRPRQAAGRWRL 


1265 


15166 


A 


1273 


3 


317 


S S YETKG VMIAS FSSREADNHTAFIRIK 
TNASDSTEFIILPVEVEVTTG*WKTDEI 
ESCFVFS IAYSGKVKNNC*LFFFP\APG 
IYSSTEMLDFGTLRTQGKKIF 


1266 


15167 


A 


1274 


92 


368 


LCPPGGRAGQGQNliN*GVPSPPGPKGFW 
GPTYGGGGGKRTTRPGGGKPEKS FD/GP 
PGPPPGGTKGNPAPKNFFAQKERGPFFQ 
NGGPGNKLF 


1267 


15168 


A 


3275 


401 


3 


DMCDWFKKEFSDTTPKA*SIKEKQLTSF 
I KI KSS CS PS\ KNT I KGLTRQEKLWGKI 
FANHMSDKGLVSRIYEEItSKIiN\ K* KIjN 
F * KWAKQ * DHIiLKKTH* * QICKERYSLG 
KGKITMRYHSTS I IMCI IKTIHS 


1268 


15169 


A 


1276 


298 


426 


GATMLVLLF*LAFDLRQSL/NCFPQAGV 
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N=Asparagine, P=Pro!ine, 
Q=Qutamine, R^Arginine, S^erine, 
I s3 1 nreonine, v =3 Yanne, 
W=Try P tophan, Y=Tyrosine, 
X= Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QWRDLGSLQPSPPWFK 


1269 


15170 


A 


1277 


421 


1 


VSFKJJQKL* inkenhpikkwkkicnqpl 
PQIRYMDANKHIKI FSTSLAARETNMKI 
TT/RILWKTVWQFFKKVSIYLPLEPA1S 
Si JjFTQEK* KH I DGSCS P S RVATAKLLA 
LQWRRPSWNCMHQRAGSWKQIGAPPFSK 
L 


1270 


15171 


A 


1278 


1 


218 


TRSGVQDQPNQHGETLSLLKIQKLASYG 
GVCV* SQLLRELRQKNCL\NGPRSHHCT 
PAWAT\EQNS I * KKKKG 


1271 


15172 


A 


1279 


88 


396 


TFLFSSSFFFFGGGVPPCPPGLRPG\PN 
FASLHPPPPGFGGFPSLTSPRA*NWGPR 
PPPQATLGAFGLEGE * IFPPGRPPBP *L 
WGPPPPPPQRGGMAWPTPL 


1272 


15173 


A 


1280 


94 


399 


TDFLFIj*TDFLFLCLCSLKNKIWVNEFR 
YGGFSLGVSNTQAIjPPSQEVNDAIKQMK 

khlklak/dkisivrcitkmgm*llgyr 
sswqvkritwqlivlnmlayravfgi 


1273 


15174 


A 


1281 


236 


2 


tqsrlvffymtgpavylnhhalrthqgs 
hlcfffcfemeshsvtqagw\vwwrdls 
ssqpppprlkrfsclsfpss*dy 


1274 


15175 


A 


1282 


37 


410 


kkktlkkikklcppgvkgefnppfyvls 
kvfpkkerglffkvtfvsltpffkkknf 

KIGGGSGFPPPLWFFFFFY/RDRVLIiCH 
PGWNAVTRS * LTTS VDSSDPS / CLRLPS 
SLDY 


1275 


15176 


A 


1283 


404 


232 


LSS WDYRCXiPPH PANPLYF * *RRGFTVL 
ARMV/GPRDS PAS S SQSAGI TGMSHCAQ 
PG 


1276 


15177 


A 


1284 


3 


402 


MQIKITMRNYCTPVRMP*KKK\SRTKPR 
C*GGGGTTKILIPCWGDYKIGE/PLW/K 
SVWQFLIK*NNHIiIjYDPAILPLIFYTRE 
MKTYVHT ITKYCNHNEK/CMQGFTEALF 

/tlpkte/ sqpkhsst 


1277 


15178 


A 


1285 


1 


253 


LiRGKFMTIiHS / S ILiCRG*SQINN*TATL 
I K* KKKGKI KPK/RSRRKVI IMNRNK*D 
* KQNftN K-FNETRS W FFKNMKKI YKSLVR 
VT 


1278 


15179 


A 


1286 


400 


105 


FNF KKKKKKKNRE VS E * Q Y * Q AKYL KRN 
FTKNI*VAEKHMKNAQYVIKEMLI*LTM 

OVWTDT7\ MfttfTVT TTWTLJVt^rsr^T irnpv 

KX I X i Fl Jx\MAK.XKJjX UNHKCQQLiKPSY 
MVGKEFGKFLIKLNM 


1279 


15180 


A 


1287 


400 


119 


TVFHHIGQAGLELLTSSDPPVSASQSAG 
J. l\jljorlW \fc»A&SSr HSNHSCPVPLiEPHQ 
VAQLDS FGYEKVCS FSN* VPGAQDKNDL 
SEN 


1280 


15181 


A 


1288 


157 


386 


NDLQFHPFYCLF/DLFCQSLKKAGMQWH 
VS/PMPG* FCIYTHIYFLVEMGF 


1281 


15182 


A 


1289 


400 


124 


FDAPAKGQGGDFGSPPPPPPG/AQKIFP 
PHFSQ* IG* KEGAPMGQPI FVS FSKTGA 
PPPGQGGFQIPTPCGGPPQNFGIQGGAP 
APGPGGGFF 


1282 


15183 


A 


1290 


419 


3 


KNFFFLERGFFFFPPGGRAGGGF*FPPP 
FFFQGKKISGPPPPKKRGPPGAPQKL/R 
EIFLVLKKKGGPPLWPGGFLNPAPKNFA 
RANFPiCRGGSRVGPPSGGPNLFNIiGGGW 
FFRFFFFFFFFFLVFETEFRFCCPGWSA 
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M 


L283 


15184 


A 


1291 


397 


39 


WGQVWAKDWIHGLGPKTQGCPPPLGVFG 
CP/SLIPPNLFFYGIPGGLKIFVHKNPP 
PPIKKEPANWIATPFPPGVFIiFFL/CFF 
ETRSHS VTQAGVQ * RDLVS L * PTSGFKR 
FSCLSLPSSR 


1284 


15185 


A 


1292 


399 


1 


LNFFFLHRQKLCN'ILGSDDKVPAFS I YP 
EPRICYVFSLLFEIIFSFDFNKKIRIYK 
KHVSRIYTCDLLKFNNKDNPVKKWAKDLN 
RHFSKDGIQKVKWHIRNC*TLAIK/EMQ 
NTTTMR*HLMPNRMAKI RKT I 


1285 


15186 


A 


1293 


412 


1 


ARMVS ISRPLGTP I WRSQKGGNKRVSPP 
TRPNPLFLKNPKKCPKSRD*NKPPLLKK 
KKKWSGPCPAQ/SCPGQNPHP/SPPKQL 
PPSPSTPPSRPSPTLAFPMAS*LAISPA 
HTHCCWRELSGTHTLS I P CLKPLRGP 


1286 


15187 


A 


1294 


229 


1 


FFFETESHSVTQAGVQWCNPGFKRFSCF 
GLS S S WD YRYAP PRP \ ANF \ * FLVETGF 
YYVAQAGLKIiLS PGDLPALAS 


1287 


15188 


A 


1295 


401 


2 


STLPVH* KWTTKAWMTAHL/ FTAWFTE Y 
FKPTVQNYCSEEKI PFKILVLIDNAPGH 
PQALMGMNKEMS WFIPV/NTTS ILQP \ 
ADQGVI FTLKSYC IRNLFQHQCTPAWVT 
QLDSVSKK* INI FQPGMVAHACN 


1288 


15189 


A 


1296 


1 


336 


KKTPRRKTHKEHHNGEK \ LRAFPLRSG I 
KQ/GCAF* P / YI LNIWA/ VLAKEYGKE 
IKASFRKEELKLCL FIDMI I FVE I PENS 
PQKL\ IN* FSKVAG* * VSTQNSVAFLFF 
FFFF 


1289 


15190 


A 


1297 • 


3 


384 


HTNMQKVLV/VKTEDQPSHNIPIjNQIIiI 
QNKSLTLFNS IKAERSEEAA+ EKCEDSR 
GWFMRFKKKK/RIS ITKMQGEAASAWQ 
AGTS YPKDLAK/ DEGS YTKQQIENVNET 
VLY*KMI / PTEDFWRKDKSM 


1290 


15191 


A 


1298 


3 


395 


SRSVTRAGVQWHDLGSLQSPPPGFKQFC 
LSLLRS WDYRLS PPHQQMGGI FL/ VFLF 
FFFPGKGVFLGGPQAGGKGPNLG*WNPP 
PRGLGEFFGLNPPRGWGFQ/HPPPHPQL 
F/CCFFRGKGGS PRRAGRAPNP 


1701 




J\ 




260 


4 


UF IV SSAVFPLLKCliLD ILVS SSVUuYC 
LWNFKIQKSFRFYNLSPLVSSFKNSH*N 
SNI/WPGAVAHACNPSTLGGRGGRITRS 
GD 


1292 


15193 


A 


1300 


91 


219 


LSRLSAWDY*HVPTHVFFVEMGFHRWQ 
a or Z7T.T o#rTit3nii t n 


1293 


15194 


A ; 


1301 


277 


441 


TVEF * GVQSMKEE I \ELCLFEDDMS VYE 
ENJjKKIjTKN \LIjK* I SNYS KVAG YKVN I 
*KAIAVLHTSNEQ*NFEIGNTILF/TIS 
TPKYLGINLTKYAQGLYEENYSNLMNEI 
KEIdtfTW/RDILCS 








IJUZ 


256 


482 


YNVYFKI CIGPGTVAHACNPSTLG \G*G 
GRI/TLRSGVRDQQVQHG 


1295 


15196 


A 




1 oz 


IDU 


VCJRE FIjDMTGTCR I? T VTTRT .V T 13 V TW* C C C 
KLKTFVLPKTLLRD* KAKLQTGRKYL* I 
TYP VKGL VS RLCKKLS KLNS KKTTQLKK 
WAKDMNRHF 


1296 


15197 


A 


1304 


1 


468 


FKQFSSLSLLSSWDYRHPTGNFCELAPY 
KQNAPS CSNVFTDNVPVTST* QGLQALS 
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PNVNEQSPWRETKKTAVDLCGVIWKPCQ 
VFLRKKPKTKTGQ/HEQQIFGF/CPFSE 
TESHS VTQAGVQ * HHLGSL * PLPPPFKR 
NSTTRTSDYDQP/HVPP/HPA 


1297 


15198 


A 


1305 


880 


1026 


EEHGAGLSGSQDAAGGVPAG *GGWAQLW 
VTRRASLFLDKTHWPVDEQNLGSLYTIE 
ATAYGLMQKLELGRYNETHAIAKWIjLEK 
QEIjGGGFRSTQ/SDGDPRETTWALEAL 
TRFREAVPFKG IQDtiHVQIRAPKTALNV 
NWY I DHSNAYQQRSAKFLAQDDLE ikas 
GNGRGTIS IIiTMYHKSPItoREDNCNLYH 
LNATLHSALEENKKGGETFRLRMETRFQ 
NN* EATMT IMEVSLLTGFYPNQDDLKQL 
TSDVERYAFQYKTKTSTSDSTWLYLEK 
LSHEKNTELGFRVHRMTiQAEFLQAALVT 
IYDYYEPSRRCSTFYNLPTEQSSL 


1298 


15199 


A 


1306 


3 


726 


RTDHYQFQSLKHCLTGGEALNPDVREKW 
KRQTGVELYEGYGQSETWTYKWELSYE 
DANTYIVK*KTL*TQKEEG\IICANPKG 

MlvJ. V iaM V KJSJ^IjF x i v HJ. V UDrSviNVliP 
PVEERNVTVRIKPTQLLCILNCYLDKCE 
KTAVS*QGDFYITGDRARMDKDGYFWFM 
GRNDDVINSSSYRIGPVEEESALVEHTA 
VLtEub I VVSSPiPXMGEVyKAYXVIjTRAY 
SSHDT*ALTRVLQEHVIK 


1299 


15200 


A 


1307 


230 


486 


NAI CPNGS KGSLGLGS CSVTQTGVRW * D 
HSSLQPRPPGLK\ YPPTSAS * VA\GPQA 
GVQWHGLDSLQTPPPG/FKRFSCLSLPS 
SWDY 




1 

JDzVJl 








A A A 

440 


t> AX P VJtl i * RKKKQ I S liNADMEKV* WW I 
EDQISHNIPLNKSLIQGKALTLFNGMKA 
KRSEEAGEGKFEASRGQVMKFKERSHLY 
NVKVQGEAAGADGEAAASYPEDLAQSTD 
EGGYTKQQNFSIDV/TTFYWKKMSSRIF 

■UifiC lul XXJ oJJli vjo V sr 


1301 


15202 


A 


1309 


463 


59 


EVISTLTKMGKILKTKNAK/CWQDCGTK 
RTFIHCWWGYKLVQPLWKRV\YTFPYNL 
AIPLIGIYP* /NMKI YSHKRTCTKMF/l 
NSLFIISKNWKHLRYTFTREWIK/M*YS 
HTKDYYS AT KRN1CI.DMF C 5 MMMR ^OTTHYfi 


1302 


15203 


A 


1310 


438 


29 


PGGKGGGGPTPAPFPFPPPPPKVGVFW* 
GFPRPFTjFKPPPPNKAPFPGAPHFF/SL 
PPFPRVFKKTFPTKKKFFPPGGGGKKKP 

QI 


1303 


15204 


A 


1311 


2 


310 


HNQKNEFGPYVTS YVKVKEAK* IAKTRK 
LLEENIGVN IHDI ELGSGFLAMMP\EA* 
AIRLKIEKLDFIKIKNSCASKDTINKVK 
KTGRPGMVAHAYNPSTLGG 


1304 


15205 


A 


1312 


19 


348 


RGQQD VL PG * RGQQD VLPGRGTYS EYGL 
IFSARNPSMEV*SVNHRPFHHHGVP/HQ 
NCSDSGSYFISKECGN/WVSACGIH*CC 
LVPCLPKAADLGE * WNGDFSLSGKEKKK 
EFYLIMTFSQ 


1305 


15206 


A 


1313 


2 


424 


ENKLTNHGKTGNGGAQSQ/PPECEPRTH 
LQRGLEGRGGGEPWGQGQPD\PPSNSSL 
KNPQAGVPPFSSLKGKVKRDRSVSVDSG 
EQREAGTQ*TFVNGKKKKKKKKKKAKKA 
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aVODnQVa /OCC7QQDVCQ 


1306 


15207 


A 


1314 


375 


3 


KKRTLL I PLMNMDAI /MNKILAN* IQQH 
IKRI IHNDQVQFVPGMQGWFNIQKNI NV 
IPYY*LFY*CYINRIKDQTLIIISIDTE 
KNDKIQHSPIVKTLRK\IKBNPLTLIKR 

TV* TTC CMXrKTTKIT'n 
X I w KdO W ri tSjSJH XXJ 


1307 


15208 


A 


1315 


403 


3 


IiEGVSFFFPRVASQGSILGSCMPPLPRF 
HHFS /CPHIiLSKWGYRFPPSPPA\FFFF 
FTjKTGFFFFCKNFAI*CSFP*KQASPPPP 
LTVFFFS/HHPFFFFFFPLFFFLRWSFA 


1308 


15209 


A 


1316 


38 


427 


PEXPARPTRPRPSAWQPPRLRSEEPLRP 
AAAPSEK*GASPPGSHPIWEVRSVSARQ 
PPRPGGRWWGVNPPPGQPPRPGGEGRLC 
PAAPTGK*GAPLPGQPPRPGGRWGGYPP 


1309 


15210 


A 


1317 


43 


399 


IVTFFFFLGEGARPPPPGWGPGAHQGITA 
PLFWGGQGNPPP*PPGEGBPS/QGPPPP 
GGNVFFWKKKGSPRAPGGPWTGGPKGLP 
RPPPPKGCE*RGNPPPPTPKFLGFTTFQ 

KJK3GKTiSGr 


1310 


15211 


A 


1318 


437 


54 


MNEQKGERGLSSLGIiPAR*D*GXGNVKE 
GR/EGVTFSPREPKEESLRWSTPQKEIV 

f*\ rrKTlTPPflAPTTTYP nflaflQfJTlWlT? TUT .P 7LC\ 

TGGPPA*EITRAGEQKAGGGSKGAQTLQ 
RETRPPRGQRGGGRSASL 


3311 


15212 


A 


1319 


351 


133 


GGGWAEVPLTSQVGRSGRGTPQIiPDGAA 

is r\n o T3 n 15 Ti'D D O G tmvT? R iyp e fVPflD W!£D/2 A 
RQRKJr rir jtKKvjo rlAriAxrx o\j c\jK\jJ\ 

PHLPDDGRPGRDAPHLP 


1312 


15213 


A 


1320 


474 


11 


KINSPSQKKKKKKQCSSIRKIiA* /DQNR 
HFIEEETQ IPNKHMKRSSIS / LAI KEMQ 
IQITKSYCIIIRLCK*IiIKNSDSIKCWQ 
GCRE I / GSLIHCWRKLKMNSTAbQMVYK 


1313 


15214 


A 


1321 


485 


33 


CPASRVAJbXTbAHUHAJWJjJ-r Vr liV nmKzE 
HHVGQAGLE/LPVSWGNGGHL/RLCLLF 
GYOGQCCYKPS*RRFFCVSFYIRA*K*G 
FVCLFV*MVSCCVTRLEFVVTHRCNHSK 

VWHVSNSTALTSGLQ 


1314 


15215 


A 


1322 


451 


145 


THPFGRPRGGVFKVRKLNPPWLKKENPL 
FIKKKKKKPGPGGGPFIPTPSEG* /GKK 
KnT?NPPirRASNKPKFAPSNPOTGIjGKKK 
KPPS LKKKKKKKKVYSLI F 


1315 


15216 


A 


1323 


470 


151 


GKAEERHDPGSRACSEPRSCHWTPAWAT 
\SETPFKKKKKKLCISWPGTVESCFSVR 

T* * T * T.PO WT .VT.T?T,l?OPKNVT YTjPTCT. 

X X lxEv^X V JJ JLUC uf UcAli V X L UC X\ ■ 1 1 

HDKANMVMFLDFKYYFCFLFLD 


1316 


15217 


A 


1324 


64 


367 


twegegswltsqdrttaplhpsldnrvk 

ftma/sk/rikyi^itiltkdvkdlynen 
y * tlmknlkggamcs fts l 


1317 


15218 


A 


1325 


193 


381 


AQLFKTSMGNSETLSLQKKKKEKS * LTL 
PSSWDYRHPPPRPVNF\ * FLGEMGFRRV 
AQAGLEL 


1318 


15219 


A 


1326 


410 


7 


GFFPYP*I*PK/SLRC*GKDFYNQGSGQA 
RWLPPVI PPPLEAQAGGS P*GRSLRPTW 
PAWSNPS FLKKPTPliFIJCFS * GRRIALT 
PKAKVSVNWDSPPALQPGGPSKTFFPKK 
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Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G1utamic Acid, F=Phenyialanine, 
G=GIycine, H=Histidine, l=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaroine, R=Arginine, S=Serine, 
T=Ttareonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














KKKLRI IGPGWAHACNPNTLGD 


1319 


15220 


A 


1327 


295 


3 


VCHFGIYFCTCSLYFTCLYFPFLIFL*V 
TLTFLVFYFDFTVLFI ISh* YFNDCSRD 
CNIHM* LLSLPVLI FHCCC\ FVFRDRVS 
LCHPGWSALASSLL 


1320 


15221 


A 


1328 


2 


392 


RQASRPP*SAPPAPAGKBGSGE*PPSPK 
/PPPPPPKWPPRGFPFPPAGPPPPPPP 
KKSPPPKTNPPPPPP/IKPPPPPPPPPP 
PPFFFSPPFP 


1321 


15222 


A 


1329 


378 


7 


TDDLILKFI *ERYKPNQF* KRTKLKDSL 
PDFBNYYKAKWKTRW**YKE\KHIDQW 
NRIBSR*QIFNIASII/HIEKEYLCNKW 
CYRTTKMBEKNMNLKPLP*TIKFNSK* I 
TDLNI KTITI WLKL 


1322 


15223 


A 


1330 


403 


2 


VETGFRHVGQAGLELLTSGDLPTLASQC 
WDYKHE PLRLGELRS LTAAWAMQQDP VS 
PNK*IKIK*I/P*SEQFSGI*SIHNWQ 
NRHFYPDPTHFHHS KVKPLTHYWS PHS 
FLSPTPGNHQCLLS VSMDFS IL 


1323 


15224 


A 


1331 


1 


389 


KMKSQAIDLEKIFTKHISDKGLGYGLYK 
ELL* FN/R/RCQTTQLKMGER*TNSSTM 
GMS FEWMVSKHMKRCS CAFPSLVTREMQ 
VH/TTSLFVERYHYTPTRVAVIKQSDQV 
\W*BCGIRTLIHC*WECKMIAT 


1324 


15225 


A 


1332 


392 


3 


NNFMPSSA/PPPPPPFFFGGPRVFSPPP 
FFKPPPPFFFFGPQKKKIFSPPPPLKFF 
FFLRPPPPFFFFFFFFFFFQKNLGKGFF 
F*PRPEKKKKPPPGGVFFFSLFFLRCSV 
ALSPRLECSGAISAHCNLCI 


1325 


15226 


A 


1333 


394 


1 


S QWG PTAIHYCAWLVFKFLGHFSKTT I 
S APLLK* TSTRAETS KSFS \CPRKMERA 
EGRRMFLGKSLLKQIVLLF/LISGSCVT 
AS*FL/CFFQTESHSVAQARVQ*HDLGS 
LQPLSSRLKRFSCFSLPSRTRG 


1326 


15227 


A 


1334 


416 


1 


LTLLPKPTPYNTRKENYRPISLINIDAK 
IL/NILAGKIPQYIKVHN\NMGLTPEMQ 
GLFNI * K/RKSVNVI CHINRKEEKKILL 
INA* I VFDKNPTM I * KNSWQ\GE YKE I Y 
SNILLNGKMLKACHL\KTRIN*GCQLSP 
DAW 


1327 


15228 


A 


1335 


391 


57 


WQRVCNWDKNLYQKEKRQDNVREK/WAT 
DLIINFTHEEMQSVNNLQKDTHPHSLVI 
REMQ I KTTS Y/HLLAKILNSDNSS YW* G 
CGGMGTL* CCWWVSKE IHALWE IVWF I Y 
SK 


1328 


15229 


A 


1336 


285 


2 


I FFFPWRGE 1 1 * HLSLINKGSRQNERAH 
\KDNGDFSQIISKLKLMYKYKILNQTTI 
KYMFFPKKHRTSIKIEHILGSKGSLNNC 
QRISVLQTGRV 


1329 


15230 


A 


1337 


34 


391 


AKIASLYS SLGNRANSVSKQNKKTKNQN 
LKN/IKTAGGLGWAHSCSPSTLGGLH\ 
DPGV*GCREL*WCRCTPAWASG\ETLSQ 
K* INKGKERKKKMKNCWWMNNSLIQLTS 
FSFTKTDWSS 


1330 


15231 


A 


1338 


292 


1 


GGG ISKT PGGEG AINP KKP PAL PPGGKK 
EAFSPPQKKKKGBRA*RE>ISSEDI *MAN 
KHMKSCPPLLMIREMQIKSTMRYRFIPI 

RME\ ilkkqkns 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
E=Glutamic Acid, F=Phenylalanine* 
G=GIycine, H=Histidine, I=Isoleucine, 
K=JLysine, L=Leucioe, M=Methionine, 
N=Asparagine, P^Proline, 
0 = Glutamine. Ft— Arsnnine S^Serine. 

^£ ****>***** ■*»*>, *W *1J CUllllWt w U*>* lllw, 

Threonine, V^Valine, 
W=*Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, Impossible 
nucleotide insertion 


1331 


15232 


A 


1339 


369 


16 


DPWQNEQLSRVTCPGPRRASMFRIT*YK 
VIIAAI^YSVGPRAVSCIRALWTTYG\ 
IMDNYKRHERRFIJCPL*W1^MVAHACNP 
S TTiGGHGRHERVGRVRS I PDCWRVAAG 
VKASI 


1332 


15233 


A 


1340 


394 


1 


TQSGVQRHDAGSLQPQPPRLKQSPHPSL 
PSSQDHRRVPPCPARPCSSHDSYVLREG 
PCGRW /HESRGRVFLVIiFS * * * LSLMRS 
GGFKKGEFPCTSSIi/SCLPPST*DVTCS 
S LPSAMNVGPLQPH I FMHGF 


1333 


15234 


A 


1341 


400 


161 


KIGPNLPCFSTKRPWKIDKIQFWPPGKV 
F*/SPPPPKKILFFPPPPSGPFF\PPLP 
PLPKSFFPPKNPNLPPSFGFDIPPP 


1334 


15235 


A 


1342 


377 


3 


NRHFPKAETLKASKHLKRHLPLLVTREV 
QIKTTNICHNIATRLAKM*KADNTKCWE 
SCEPVLTFY\C* WEYKLVQPIi* KMVWQK 
PKVCVPYYLAILLLQHIPEGTCAS/CY* 
DIHKRLLTAMLFAM 


1335 


15236 


A 


1343 


170 


369 


RCNBNKI TKEVYFGE I FVRLDNKBKNTS 
FFYFYFLFY/ CWR*GIALLPRIjVLNS*A 
QWLLPWP P KML 


1336 


15237 


A 


1344 


385 


2 


SDLRRSTHLGLPKCWDYRC/RATVPGRL 
FIFLMVSLKHKSFKF* *RPI YLFFLSSL 
M1^VSYLRNH*LIQGHKDLLPMFSSKSF 
MILVLTSRS ILSLLFCFIiFFVLRQGL/N 
SVTQAGVQRHNHGSLQPQP 


1337 


15238 


A 


1345 


1 


281 


HMATKHMKRC* / SEMCTLKPQLNTTTPD 
HQNG *N* K* LVNFKGWQ* FGAVWNLIHC 
W*DCKLAQPFWKIDLSIKAKISMSYSHV 
VRKIFKLKNK 


1338 


15239 


A 


1346 


176 


1 


SVIWNIKEP * IAKI ILKKKTI VGGLTVP 
DFKTYYKAMVI/KTV*SWI*KDRQRNQWY 
RED 


1339 


15240 


A 


1347 


398 


2 


PRPPGPVRRRCPS/LTATSGSSSPPSSP 
FYLGYPQGFPLPAVLNRGPGILFWGPHK 
KNITLPARGR*IGSS*TAPPFFF/SLFS 
Fl^FSFLLFDTGSYSVPQAAVQWFNHGS 
LKPQPPGLKLSSQLLRRLGW 


1340 


15241 


A 


1348 


9 


395 


GLQNPCVGFLVSGFFFFFFFPPKKGLGV 
PPKKK/RGPNPNGPVGEFLATGPFFWTG 
PIiKKNPAPPRALFWGPPPPPPWGG*NPP 
PFLARGSPFFKKFFGGPDPFFYRTNPRG 
PNJCRGPPWPLKWGNPPPK 


1341 


15242 


A 


1349 


119 


1 


KNVFRNM/WPGMMAHTCNTSTLGGQGEW 
ITRSGV*DQPGQ 


1342 


15243 


A 


1350 


58 


396 


GIRVGKVCFIIFFFGFFFFFGKGVSLPP 
GRETGGPPL\LIKPPPFGVKGIFLPPPP 
GGGG\CGPPPPPRVTFVF*GRGFPP*GP 
GGF*PPAPKGSAPPPPPLWKNWGNRIPG 
GR 


1343 


15244 


A 


1351 


396 


1 


GPTKGPGP FIjDGQGFPPPTIiKQNFFFPA 
LFFPPLGKRLILGGF\PKPGGLLQPPT* 
KPLRFKPGGESDCFPKFF*GRIVCFQQS 
LCFFPPPKKKKTAFGKRFKKK'fLLFFFF 
SDGVSL CHPGWSTVAQCRL 


1344 


15245 


A 


1352 


1 


250 


RRL * SQLLGRLRRQNGVNLGSGACS ERS 
\CSERRSRHCTPAWATKRDSVSKKTKCR 
RHQHVGS LTVRLRPG YW I RHCGCHWWP 
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nucle- 
otide 
location 
correspon 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S^Serine, 
T=^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1345 


15246 


A 


1353 


249 


3 


CGATLCPKKYMKITPSFFVEIDN/AILK 
FIWKFKGPRIAKTTIiK*KQG*RTHTFHF 
HNILQKAI IVKTMWY* YKHRQS PGRVAH 


1346 


15247 


A 


1354 


15 


416 


LLTLSSSLKITIHTNKGRT*SFIRENII 
IFIATTNLLGLLPH* FTPTTQLFINLAM 
AIPL* AGAVI IGFRSKIKNALAHSLPQG 
TPTPLIPILVIIETISLLIQPIALA\AR 
IiTANITAGHLLMPLIGSATLTI 


1347 


15248 


A 


1355 


2 


416 


IKYLNVRPETLKLL* /ENIE *NPHNIGL 
GSDFFNLTSNSQGIKEKIDESDYFKLKS 
CCTE S DTTNR I RQLKNERKCLQ I TCDKG 
LI F* KKLK*LY/KQKTNNH* K/WSKRLK 
YFSRKD IHMAKRY /MKKCSTS X IRENKP 
K 


1348 


15249 


A 


1356 


426 


85 


HARLVL * FVF/LFETAYLSVAQA/GGAM 
AQSAHFSLELPGSSDTATSTSQVCYHRN 
T/RLLF* IFCGYG/RLCCLGWSRTPRLK 
QSSHLSLPKCWDYRWEPLYARPYLSCFP 
ENARLG 


1349 


15250 


A 


1357 


2 


301 


GGLLEVKVQDQLGQHNKTPSL/ IKIYII* 
PI YTQKN *" KNLTRHGGMCL * SQLLARLR 
*ED*SSPGI*GCREP*WRHWTPVWTIQQ 
DSKSRGEKKQVI FTHYT 


1350 


15251 


A 


1358 


2 


389 


FLHVGQSGCELPTSVDLPASASQNAGIT 
GVSHRSWLKFSLUDVPSPLTP/VSSAVP 
LISYLATGWRQAAIAASPIFLHQLARPA 
QPAREAAADS * LPADSAFPPKQWTC* *M 
FTGALFKKKKKPCFVAKK 


1351 


15252 


A 


1359 


309 


3 


KWDH I KLKNFCTAK* TINKVQRQPME W * 
K/ 1 FVNYP VDKGL I TRI YKELKQLYRKK 
KSNNLIFKMSRS\SSLAIREMQIKTTMR 
YHLTPVRILVYVLPKRARS 


1352 


15253 


A 


1360 


300 


2 


KRAFH* KKMPSRTFTAREGKSMPCFK/A 
SGWATPVIPAL*EAKVGESLEPRSSRPA 
WATRRDS CLF * K* * INK* KLTLLLGANA 
AGNLTLKLMLICHSKS 


1353 


15254 


A 


1361 


396 


60 


HKVTRKSDGMLCHSGFCSINQLAFCCLF 
FQPGGPRGTPLP/ PYKPPPQKTPKKTGP 
GGGGLYSPPFGGGGRGNPFFPGAQGFF* 
PPSPPPPPPPGGRKKIFLPKKKKKSLLF 
FF 


1354 


15255 


A 


1362 


37 


385 


ALFS FS FFBT3FFWGKKI FFFGQGGRAGG 
HSNLPEPPPPGGGAFPRPNLSGGGE*RG 
PPIS \GE I FGFLKKTGVPPGGRGWFKPP 
APKEPPPPAPPRGGISGQDPLPPPVLPG 
WGKKN 


1355 


15256 


A 


1363 


12 


421 


EPWEPQTIiGICTHLQTLFHDYQVLMKML 
WTVRYHLTPVKM P F I RKTFDTAGM * KK 
KKGCYQGSKKGGMLI/KPPWGAL*MSIV 
KKKTWGTFFKKIQKELP WDPAI P WLGMY 
PKEGNSVFQRGMGIjPSVIGTLFPIAG 


1356 


15257 


A 


1364 


278 


3 


CCTGEKLETFLLR * ATRQGCPHLFDNVM 

FPGDE IV/ YAENPHKS * PKNSL / KLKSD 
CSKVNIQKPIAF 


1357 


15258 


A 


1365 


265 


330 


WPGAXAHACNPSTLGGRGERIT 


1358 


15259 


A 


1366 


379 


1 


KRKIiMYCWWECK* IQPGWKTVWHFLKKL 
KLELP YDSVILLLYTFLKES / PVYARDI 
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N=Asparagine, P=ProIine, 
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X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 














CTPKFIATLFIIA\RCSLTHD*TICKMWY 
LSIMEYYSAIKNE / YLPFATTQMNLEDN 
IFSEESQA*KHGVFIFP 


1359 


15260 


A 


1367 


1 


376 


HLPGAESQPPPV/ DNSWDRPAGRTQLLW 
TPA/DPHSYG*GGAGPHPCPSQPGCCAP 
VQSCS * APSEAQSLGAADS\GPAATLPA 

KKKKKKKKDDSGK3G 














1360 


15261 


A 


1368 


2 


409 


EEAMPKAKMG*RPFAPNSQILNAKKKLL 
KEIKSVTPVNI*MIRK*NSLIADMKEV* 
VIWIEDQTSHNIPLSQSQIQSKVRMLFS 
SMKAERREEAS EE KIiEVSR/GWKS AGFM 
RFKERSNLHNIKVQGEATSTDGEGA 


1361 


15262 


A 


1369 


220 


416 


PQPLFDWQMYTYTLHNDILVNDGRLSPG 
GQGCSELSIiCHCTPI*MT\SETIiSQKKK 
KKMGQARGLS 


1362 


15263 


A 


1370 


310 


289 


AFR*AFHSKGSK* *H*EKKVN/WNFTKT 
*NFCTAKDIIKRMKRQPTKWEVIFANHI 
YLTGILIS\KIYKELRTTQ* PETQSLK 


1363 


15264 


A 


1371 


3 


322 


HASERTHRRGKSTETTTPAWATERGSVS 
KQKTNKKTNK/RNTGSS IHNMVSDFEKQ 
VTQTF * S SMQMSNKPLKRYLTS VI IEMN 
IKPEYHFTPIRMAIIKKTDNTKC 


1364 


15265 


A 


1372 


3 


332 


GKEVS*EDIMMIiHVYAPNNRASKHMKQN 
I*IEMQGEl/DE/HPVIXGDFS/TPriSII 
DRSGQKI IKDIV/ELE/STVKQIiDLI/D 
I YRTL YLKRVE YVF \ FTSLS KTFATINC 
ILGHKV 


1365 


15266 


A 


1373 


381 


2 


CSVAQAGVKWCNHSSLQP*TPGVK*SSC 
FSLPSHSDYRHE\ PALFFKFFVQMGS\T 
ML*FS*ESCIKGSRQPCSyTYLHSPPIF 
SSLFPSLAPESIKRQEPSFFFEMESCSV 
TQAGVQWRHLSSIiQA 


1366 


15267 


A 


1374 


1 


385 


YKKGIKAKI FNKI * *V\ESAISHDQVGI 
VLVLQGFLS I /RNKS I SVTHHVNRLKN/ 
HMLI * IDAEKAFDKIQQSSHGKIGIKGN 
FFDLLKSIYRKPMANVILNSEKMKCLPC 
KC\KTSQGCQLSPLFFLFFF 


1367 


15268 


A 


1375 


1 


357 


CSGVISPHCX5LKFLGSNDLPALASRVAG 
I IGMTP \ HAQL IKNFCCCWWR * CLAFGG 
*RFKTMCRPIVK*NKSrNK*NRTVCMSG 
CGDS /RQ LLGRLRWS PGLRNVQGCSEP * 
WHHCTSACL 


1368 


15269 


A 


1376 


1 


375 


HRPKFKSISLLEBNMGESFHDLGL/GSD 
LLDMAPIVQSIRDKTSDFLEIKKSCSSK 
GTV* IMKKQATDWE / RTFVKHTSNKGS I 
MYEEL*NL* KL*NNPI*KWAREFSRHL/ 
EDTQMATKHMK /KCSTAL 


1369 


15270 


A 


1377 


260 


400 


IOIGLGKTVFPWPKKKNGPFFFFPKTTSK 
GRKTLMYNPE * PKYYSVKKKKKKXGGGL 
PFPCKIFFPNIjTIQTTFFWPKTNPLTLK 
KRKXPPKKTPTFWGHFFFERGTKKTQGG 
KKNPFKKGGWEKLFFLGQKKKMAPFFFF 
QKPPQR/l<fRKTI^V3^BP*l!^IJCKNM 


1370 


15271 


A 


1378 


403 


1 


TGVSQALS LFFFKYFKPRMVBFLVEPSR 
* KGPSVYGLQIFFS IPKVPFPCPFWAF / 
SLV*SSPPCT.SLHLLPAFGVLPKRSIiPR 
PMLQSFS PMFFAR IG * FQ ILLLNFQSLL 
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E^GIutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, l=Isoleucine, 
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IIUUCUUUC UCJCIIUIJ, V— pOialUIC 
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S * FFFFFGDR VSLCRPG WS AVAP 


1371 


15272 


A 


1379 


195 


381 


AERYQTSRSTADRISCLFI *SAEI/VFP 

DSGKKTGGNKNNNNNNNNTN1OT 

ISLSUCP 


1372 


15273 


A 


1380 


361 


39 


PPVKETAKDVNRRFPKQPMTNST*SDA* 
VDcVVC.i»* r QIjIV»KJjK*SRK.1A* AQBFK 
AS/HGQHSETLSLKNYLKKKRFSTLLFL 
REMQI KT1TRYHFTPIRMAKK/ SHHTKC 
*QGLAMl»PRIiALNSWAQAIliLtiHFSLPI 

OWIN I Kill loo lyAoLiTlVljr V l\jL.i_ikjJNKi-i 

LTSFAVSFTGG 


1373 i 


15274 


A 


1381 


400 


90 


LPSSWDYRCAPPLLANFCVF/M* SQGFT 
RLVSNS * PQDPPTSASQSAGITGVSHHA 
WPAFFIMWFSSSYSR/TMH* +QHPHV*K 
LHICPN* * KS\LSCPQVPTI IW 


1 in a 
1374 


15275 


A 


1382 


1 


431 


usil P KJso A V i Y W I i Ijr KJ\G * DNV EDEAY 
SGTLAIS IC/EENIHLVCSLSDEEQHST 
AQT I ANT I D ITVGIiAYTI LTAKS KLNKL 
CT**MPKLSYPLF*KIL*KISnCrKNIV\P 
NQLQIKAELPVEILHNCIDQDPETWLGGV 
AHTYN 


1375 


15276 


A 


1383 


2 


432 


EIjSAD VS FFr I TTPLSGVL * QNWG IAAF 
IPIELRSPTEVTFSFDVGNGPFEISLQS 
PTHFSDHQWHHVRVERNMKEASL/QVDQ 
LTPNTQPAPADGHVLIiQIiNSHLFVGGTA 
TTQRGFLGCIRAL/ RMNGMTLDLEERAQ 
VTPEVQ 


I J /O 


15277 


A 


1384 


1 


421 


\TDt57lT TV CftDiri T IV TAT *T_T/~'7\ Til TIT T DT n vn 

QFSNL*SKKFALGDV*SSNIL*YIYIFF 
EMESRSVTQAGVQRHDLGSLQP\LPPKL 
SLLPPKLSI^PPK\SASCLRLaGSWNYM 
HVPPRPAMFCIFSGDGVSPCWPGRTRTP 


1377 


15278 


A 


1385 


2 


416 


IFSVDETALYWKMMPSRTWYL/RKEKSM 
SGFRDSEERLTLFL\GLNAPGELELE I L 

ijX w rlir Ei JLiAjlri-iTl X VJvr Ixjo Vr I w WxJNJSA 
LITAHMFTAWLTE CFKPSVETYCLGEEM 
PFNILLLIGNAPGYPRAPMEIQEINIIV 


1378 


15279 


A 


1386 


170 


1 


DGVFLI FGGQNEKLNKNKDG / DLTKLPR 
IA*NSWARQSSCIiAFSKCWDYQREPPCL 
j± 


1379 


15280 


A 


1387 


47 


418 


FWGFFFFFLKKKKAALGPWTPPPPGQGG 
PPPPPPRGPGKGGPNPPRQTIYGFWGQR 
GPPPGGGGEPGPPPPGEP\PAGPPQTGG 
PQNWPPAPGP*KGF*GASKGAP*KGVDP 
GGERSQVPNRGA 






A 


1388 


CO 

58 


503 


KNIALKYI KQKLINLKGKRDKLTLTVAN 
I STCP * VTDITSR * KI CKGTEEI*F*QPA 
\DLIDIYRTLYPTAA*YTFFSSAYNYSK 
IGNS IGHKTFSN/ CKRNDNIQWLFSDHN 

flTV+RTTvTT TT 
0 X IS- 1 * IS X JN X X X 


1381 


15282 


A 


1389 


o 
*t 


401 


LVSQWNS KNRI LKKI KNSTPVHT/RQM 
IRKQNNLIALWIETVLWWIEDQTSHNI \ 
SQSMTQSKALTLFNSVKAERGEEAA\EA 
GRGWFTRFKERRQL*NLQVESEPPS\AD 
VEAVAS YPEDVAKI I DKDG YTKQR 


1382 


15283 


A 


1390 


294 


3 


KIVIFDFNDVKNCSS*KIVIFDFNBVKN 
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nucleotide 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagtne, P=Proline, 
Q=Glutamtne, R^Arginine, S=Serine, 
I^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, £=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














CSSKTVIEGKVSDCKKIPSKHISDKYIiI 
S 1 1 Y KELSNLNNNQQ PN * KWAKDL /NNM 
SSKHRKICSISFGKMQIKTQIRNHYIPT 
DAW 


1383 


15284 


A 


1391 


3 


422 


PLFKKEEGTNFPFPPFWGRFFPGGKSFS 
PFFFPPPPFIKGGPKVSQKGPRFFFFFF 
F * DGVSLCRPGW I L»F PPQVTEALLRGFL 
ALSSLCFRLDKFFFFF*R*GHLIjSRWDY 
RHEPPHLRIS*RKTKE\KGLIMIiTRLVIi 


1384 


15285 


A 


1392 


3 


400 


FLYI YKDI YR* YKTFQINNKLKI I YFK* 
EKDLSKLFTKDVQMTNMHI *MGSTSLI I 
KQMQIKTTMKHHSSLPECLHFEtAANITC 
WKRLGPAGTIiIIiRW*BCKSVKPI,/WETF 
* QCLIQLSMQENYH/DPATPIilG 


1385 


15286 


A 


1393 


409 


2 


IVRHFSKEDIHSTNEHMKKGFLSL.VKEV 
Q IETTMGHHH ITVRRVKI KTDI FASA\N 
TKC* *GYGKTRTLINS/YWECKIVQPI*/ 
WKNILAVAYKIKHTFFPPRIHLSGSQKS 
HFS I YS KEMKTLRQHKDI FMATLSQN 


1386 


15287 


A 


1394 


409 


24 


KSPFFFFEKGFPSPPPGGAQWGGFPPPE 
PPPPWK*FSPPTPPNKWGNRPPPSW/P 
G*LFFFCSPGGFPPLPNWFLTPPLR*SY 
PPGLPKRWGFKQKPMGRG*RAFFCPPKI 
SKKLWSGKKKKKGRPLAI 


1387 


15288 


A 


1395 


83 


411 


QRDSVS KTKQKKINNKSRTDLNRHFTKE 

DRWM/ ankhlkryst* lvirevq ikttm 

RYYYIS IKKSKINKPDHTNCWQGCGRPG 
TLIYC*WE/CKMVPSI,K*TFW*FIiKRIjN 


1388 


15289 


A 


1396 


308 


14 


NFFFFYSWSNI I YCQKGTAFTPISDKID 
FN*KIISRQK*QYMLIKGSIKN/DITII 
NTYTSNNS SPKYI KQTLTGLKGE IAPST 
IWGVFNTTLS I IK 


1389 


15290 


A 


1397 


321 


1 


KQ*CSIEYS/FKYTR***SKL/WPGAVA 
HAYSPSTLGGRGGRIT*GQEFKTSIA 


1390 


15291 


A 


1398 


183 


48 


KWPGAVNHACNPSTLGGQGGRI /TLRSG 
V*DQHGQHGESPSLVGRVR 


1391 


15292 


A 


1399 


18 


422 


KAGMAILIDKVDFRTKNIITEVEKHFI 
ITKGSVREENISVLNV/WPHYRASKHMK 
Q/ILIEIiHVELGEHTIIVG/DFNISFSV 
LNRIDKESARRR*NI*NNTNHQIJSrLVDI 
Y*KTKNTTNGRTTFFPSAHKLFTIIIHI 


1392 


15293 


A 


1400 


28 


339 


YSCDHNSVQPQTPGLKQSLCQPPE*IAG 
ITDARHHIWPIFF\LKRSFVFIAQAGTQ 
WRGLL* KKKKVGGFKKGSCLTLPGRGDY 
RRFREEKI LIPGRGFCNELI 


1393 


15294 


A 


1401 


390 


1 


FGIjGKHVIjHMTSKAQSIKEKTD*IiDFLK 
I KNFNFS KDTIKGMKRQARNGEKI FPRH 
QS /DDFF * WAIDLTFPKEDKVMANKHI K 
RCLRLLLL\SMRYHYH*NKKTDHTKC*R 
ERG\ ELELLCCW * EYKMVQ 


1394 


15295 


A 


1402 


149 


402 


RFWIHLYRKRVFPPLLNPVITWKEKRGF 
VFVFVF * DRVLLCHPGWS AVP WS * LTAA 
STSWA\K*SSHLSLPSY*DHQHASPCLC 
N 


1395 


15296 


A 


1403 


1 


403 


KRKSVPELIC/APRC+APCSYRPNLPSS 
PGPHLDPVGPTFTPSPPRPGPARGPPAA 
RPKEPAE PGKEERRGL\ PAPGG 


1396 


15297 


A 


1404 


342 


3 


KEDNLIDKGGS PKQQI FKVN* TALY/ YT 
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SEQBD 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
VSSK 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny (alanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, ^Methionine, 
N=»Asparagine, P=Proline, 
Q=Oiutamine, R— Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














ALYSKKI PPRTLI AGKEKLMPGFKASKN 
R/TLLIK3ANAAG\DFKIjRPMLIDHSEL* 
/RPKSTLPVL*KKA*LRAHLFIALFTKH 
FKPTVK 


1397 


15298 


A 


1405 


498 


66 


KNYAKS T L P ML * KWNS KALMTAHLFTAW 
FTK\ FFMPTVETYCSEKKRF / S FKMLLL 
I DNAHSHPRALME I YKE INIAFMS AKTA 
STLKPMDQ\KVILAYKSSYLRNI\raKT 
TAALSSDSSDASGQSKFK*IWKGFTRTR 
GRTRGSAR 


1398 


15299 


A 


1406 


3 


411 


VSPLCLGRSQTPEIK*SSPLGLPKCVDY 
RHEPLYLASMGVLNCPISNLGQCIPP*A 
AWWSLLLGC\P I LMLNLVQTPDLPSVLQ 
PRAPGPR\HPPVSAS * VAGTLGTHHYAW 
LSMLLGSTL*DVFPFLSPSLIPSEE 


1399 


15300 


A 


1407 


361 


2 


IRVPQKKKIKSPGRGHKIFSF*RAGPPF 
F FS F F FLRQ KESHS VTQAREQWHDHGS L 
QSRLPGPK* FSH\PPPPHPPVTGTTPLF 
IYYYYYFLRQSL/DSVALAGVQWHDLGS 
LHPRVRPRV 


1400 


15301 


A 


1408 


393 


2 


PGFNISVLKKASGGLFFFSPLGKKGFFS 
QPFFFGSPRVFPPPPFFNPPPFFFFCCP 
Ii*KIFFSPPPGLKFFFFKRAPPFFFFFF 
FFFPDRVSL\CCPGWSAMVQSRFSATST 
LRAQAIL 


1401 


15302 


A 


1409 


393 


60 


LNVNPNL* S YLDKHRGMLYHI LQSKBFL 
SKTSKAQAKEVTLCKWDYNKLKMFCTAK 
KTL/ IKVKR* STB* KKI FVKY / STTK/ G 
*TYRI FKTKLNNKNNTQFKNRAETSRSKM 
AE 


1402 


15303 


A 


1410 


422 


2 


VNDRKMDGWMNGWKEGR*TQKKKGGRRK 
REKERSMFLEGRKEGREKEREEREKESQ 
KERQKRGKARKEQRQEARKQGGR/EREK 
ERERBRGREREGERGRKSSS * TGS KWK 
I ILLAVKMLVHLKSLHIiQKVCVFSFSRS 


1403 










fid 


PGFFFKKIFVENFFGPHFQFFFPPPPKG 
KNPTj* PPQRGPG * \RFPGVFPPLKKGSK 
KK^PPPPRGTQPRWGKKPIaLPKRPGFPF 
GPRVSFFCLIEIGWFFWAQVPKKKKKK 


1404 


15305 


A 


1412 


391 


125 


EPAPFFFFPPPGKRGFFP\PP*FGYPPG 
FS PPPFLKTPPGNFFLGPKKKKFFFPPP 
















1405 


15306 


A 


1413 


3 


387 


TPDLK* LTRLSLPK* WDYRRATTPG\LF 
FFKP FAS LGS ISM* YHS P VCLAS KLRIL 
FDSSFSPSATAKHPI/TSAFAAAWLSPFA 
LIQPHSSESSLYSSYCPVPILIiF/IYLF 


1406 


15307 


A 


1414 


409 


1 


KRSGPFFYF/SPPFKKGSFAKPFFLGSP 
VFFPPPSSPP\GFFFFSPPPKRGFFPNP 
FFWGAPGFFPPPLF*NPPPDFFFWGPKK 
Ki^rFPrKGrUtrrFFKGPrPJjFFFrFFF 
FFFFFFFFFYKI IS I KGE IGRSS WR 


1407 


15308 


A 


1415 


3 


385 


HRPP P VMGLS PR P AAAEGVPG P PAVLAH 
QHHAQFLARP*LPPHGAGLRTCSPPCLS 
LPNPPPWAPVWPEPP* *ALPPAP/AVPG 
PIN 


1408 


15309 


A 


1416 


2 


238 


PGGRGCSELLPRHCTPAWVT\SKTPSQK 
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nucle- 
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residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Htstidine, I=Isoleucine, 
K-Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 

0~f^l 1 1 in mino QsArmninA Q — Carina 

V£ vjiuuiiii inc, iv— >\ rg mine, a— oenne, 
T^Threonine, V=Valine, 

X=Un known, *=Stop codon, /^possible 
nucleotide deletion, possible 
nucleotide insertion 














RKKRRKPG^L * ETCTI. PPCOTT.VPGP KKK 
NGPPLLGRVWAKRKGGFHPPIiKN 


1409 


15310 


A 


1417 


3 


394 


NKHLKRCSTSSVIREMQIKTTNRC3IFSE 
*KHW/IINMCW*ECGQTEIiSALLIRC*\ 
WDSEMVOPT.WRP ITE YTJCE T TAR <5 S WDP A 
I PLLD I \YPKRNEAS C 


1410 


15311 


A 


1418 


307 


357 


LETRKTAEKIKRRAGFEK*NIjETNKNGN 
TISQNLRDVAKEVRREKFIAMNA\*SKK 
KBKP * INNLMLSLKELEKDEQTKPKVRR 


1411 


15312 


A 


1419 


341 


3 


I PVPLVGRGPPPLTRMVLI FWPHDPPPL 
PP*VIEGLFH/LPAFRPGAFLRKMVPPY 
*KNKIPPPRKNTFFPLKPPPPFFFF*DR 
VLLCHPGWRAVAQSWLATASTSQAPAIL 
ppp 


1412 


15313 


A 


1420 


405 


1 


WGEELVPSGRVHELFSSALRSERDTDSV 
CSCGGQCHCPGAI PGAALKAACRGD PAS 
RVGVETGSTMGNNGFFSFF IFKP YFFRD 
RVLPCHA\AMAQS*LTAASTS*\VKSSF 
rr^QT.p <5 <?wnvp nrs PWTiAitfPiTTP 


1413 


15314 


A 


1421 


1 


399 


QPTLLTELGLRPVPPHPATMNILY* * IN 
TrLHYSYKNYFIiFLFLRQGLQPIiOAGVQ 
WONHGSLOT.* tpgvs flss wdhpnpanp 
FI IFFLERQGLTLLPRQVIiNS * AQ/CNT 
PASAFQSAGI IGMSHCSWP YKN 


1414 


15315 


A 


1422 


168 


414 


EWREKAGESLEPG\GGGCSELRLCHCTP 
AWVTEQDSSYKKKKKTPGGGKGVF/C*K 
T/3GGTWP* i^vpnPRPnGPTinnrivjriRft 


1415 


15316 


A 


1423 


369 


1 


EHFCKSQDRDSVPKKKYIYIYPMSVFMP 
INLTNWIKFI^QKIiS*QIEEAIENLNS 
I Y* / PNESEFI I * SLPITKAPGP YGFTE 
FYYS FBCGEAMQILHKLFQKRG* RHSFSN 

V X IV x J-1XVO XvtTXiXx 


1416 


15317 


A 


1424 


403 


1 


KNTEHSKRYSIPQRYCGFSSREFFHFLV 
HMKVMFPLFKCAIALCLKQVYTVFFETE 
SCSVTR* /EVQQQDHGSQQPQPPRFQ\H 
P PTSASQ IAETTAILENFR KICI YFLRQ 
SIi/HSATOA*VDWPNHGSPGAPPPG i 


1417 


15318 


A 


1425 


401 


3 


FFFFFKKNFPFFFQVBGKGRKLG*QPFP 
PG\LKNFSAPPLPGGGN*R\RPPFARLF 
FFFL*KRGFSPLNQGGIiGSPPFFSPP\R 
APKNVGFQGITPPPGNFFFFFFFSFAES 
RSVTQAGVQWREPGS \QPLPPGF 


1418 


15319 


A 


1426 


386 


27 


CRINGVLILC*CBCKMAQTL*I*FGNFL 
QKKLDLTCUDSTIPIiLGIYPRDMKTYVH\ 
KKTCT * MFTAALL I ITKSWEKLLRPSEG 
EWINSRHPYDRIVLSNYETHNVNKIQRH 
YAEAKKTV 


1419 


15320 


A 


1427 


398 


2 


GLQVHATTKFFKFFVETGSCYIAQAGLE 
LIiVSSNPPG/SSFPKCWDYRCETRRWPF 
LFTCFSLPIFVF*liNHLFCriHLLPLFILK 

T.WTiPPPMYYTiPT PP ATiY<?T.<iT.T. AT .PPMn 
CDLSPIHLEVKKYFSKKDRL 


1420 


15321 


A 


1428 


401 


70 


HHAQLRCFFNSFVEVGVSLCCPGWSQTP 
GLKP^SHLNFSKCWDYKR/AAITLPSFF 
* IAl^*PaiJPPNCHSLPTPI^PPKLAALG 
NSCLHPWLPQPPVKGLSDSFVCMHPGS 


1421 


15322 


A 


1429 


388 


131 


RGGGCLQRH\RGFKQENWFNPGGRGSS* 
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Predicted 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alantne 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidme, l-lsoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=»Asparagine, P=Proline, 
Q^Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X—Un known, *=Mop codon, /=possib!e 
nucleotide deletion, \=possib!e 
nucleotide insertion 














PRS MYCP PAWGAQGGS LS QKKKKKKKKS 
KNYQKI LSTQKTQKSRVF I VWSKHCTRH 
CBK 


1422 


15323 


A 


1430 


184 


413 


VSFLFDFYFTjFF*RW/RSHSVTQAAVQ* 
CDCGSLQPPILFYFLRQSL/NSVAQAG/ 
VGSLQPLPPGFKQFSCLSLLSIWD 


1423 


15324 


A 


1431 


76 


533 


SCKRTTGRQFPDLPTRPPTRPQGFALVA 
HAGG*WRDLRSLQPPPPGFKRFSCISSV 
LWCTKAFNFDFQEIYSFSS* I*GFDSFE 
VNF F FLEMG VS L \ FARAGLKLLGSG KLP 
ASAS E /SGGI IGVCHWAQLL 


1424 


15325 


A 


1432 


373 


29 


RQGFPPMGRWLDSLKKKKFQIPNSKIF 
PGGGAPIjVIPPSREGEAGKFF*P\GGKG 
AIKQNYC/HCPPTWGKEGNFVSKKKKKK 
EKKMHQK*RCVKDQQ*T*MLLFPSLKS I 
YYFK 


1425 


15326 


A 


1433 


389 


73 


TDCPAIGRNHRDPVRPLLSSPHRKI / YA 
NLYYYIIVRILTLIQSTDLM* ISPWLV 
LTRVCVCVCVCVCVCI ILPS /CYPVCRF 
MYPTTTVKVQNI S IGQEKQKTKT 


1426 


15327 


A 


1434 


402 


39 


QAGIiQLLASSDLPALASQSAGITGVSHC 
ASPS ISL*APLGLiDTFSDFPCFNDLDSF 
EEHWSGML/SECP * TGI * DLS FFS * VYW 
GYGFLEDHRGKVPFFITSYQSYTPPTQL 
IIADVKFEQLV 


1427 


15328 


A 


1435 


105 


387 


E FS KVAGYKINT YKS I VI L YNSNE * FEN 
\KKI IPFIIASKRIKYLKI /QFAKAVQD 
FYAENYKTW*RIIKEDLNK*KNVLC*WI 
GRL/NI/LKMVLLP 


1428 


15329 


A 


1436 


242 


2 


FGGGFFWVGTPNKMGPFPQIIKNLF/TR 
PPENF*KPP/LPFIAFFSLGFLGVFFFF 
*DIVSIjCCPGWSA*AQSLIiAAALTSPG 


1429 


15330 


A 


1437 


411 


0 


LLFYAQECLKWFSALGGPGVPCSXLPAN 
QQPS VWVP*QS \EHVTVLQRFCLWPVLG 
PAYGQI WEPVPSTLK/PPPPGFK*FS * V 
SLLSAWDCRCTP/ PTDPANFCI FSRDAV 


1430 


15331 


A 


1438 


41 


412 


FFCTD * GSLCCSS WTQTSGLKRSFHLS F 
PNCWNYGH/RASVLGLIFFFFIiRKKT/ C 
P*KKFL*NEDIiLCCQLGLEIiIiAPRDSPA 
LGPQSVGTANVTPRPW/LLFL * TQNYLN 
SSRVRFS PAPLKYKEV 


1431 


15332 


A 


1439 


1 


406 


HAYN I V\NPSPVS LTGALSSLLMTYGLT 
MGCHFIiSITLLILGLLTNTLTIYH*WRD 
VTRESTYQGHHTPPGQ*GPRYGIILFIT 
SQVIFFTRFF*AFYHSSLSPTPQLRRHW 
PPTGITPI^PLEVPLLNTCVLLR 


1432 


15333 


A 


1440 


3 


422 


MEKvS WWIDQT SHNI PLSQS L IQNSAL 
SLFSSMKTE*GEEAAEEKFEASRAWLMR 
/FEERCHHNVKVQG\EAASGDREATGSY 
PDDLAKI IDDSGYPKLHIFNIC* KPTDT 
DIVDETALC*KKPSTYFLFFFLKQSLAL 
PP 




ID J j** 




1 /Ml 
1*141 




Z 


SVTQAARQWRHHSS LQP*TGL\ASQVAG 
TAGAHLYPSQNLIGRGIRSLA/ SVAQTG 
VQWRNIGSWQPLPVRIKGFSSLSLKSSW 
EYRCPPPCPD\LFIFL 


1434 


15335 


A 


1442 


397 


122 


GQASLELPTSGDPPASASQSVRITGMSH 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamtc Acid, F~Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
0=Glutaroine* R^Arzinine. S=Serine. 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CARPILGSCISLFAYPFCFLKCFLF/CC 
REV ILL* CSGWS *TPGLKQSSGLSLPKC 
WDYRCEPP 


1435 


15336 


A 


1443 


3 


299 


WEKMFAMYLSD * ATVSRP YR * LLQLNKT 
KYKLKLSQS /MNRHFTRGDVAMANKHTK 
RKS TS LVTRE I KTMRYHYLLKRMAEVQK 
/SVKCWQECG*GGGKOG 


1436 


15337 


A 


1444 


10 


410 


CTLFLSISPDSAGGICTMLLFQNTLNFK 
ISLALGVSDKCFSNFLIMTIVRNVFYIH 
GKKCILYCSPGCAHTHMHTLMHTHIHLK 
QFFLRRGLTLLPRLECSGAITAHCSLDL 
LGSDSPP**EMYFIYMV\EMYFIIiFPRV 
CTHTYAHTDAHTHTPET I F FETGSHS VA 
QAGVQWRNYCSLQPRPPGLRFSSYLSLP 
GS*NHRHVPP 




1 J J JO 


A 




All 


Q 

y 


GPAQSLDSSVSPLGTISSTVKWRHNAYL 
AGLSGGI /PCDCFLGPRVPWGSGICQWH 
WAVSPTLS\PTVRPPLFRETLYFT*V*V 
PPLETCPQ*HISHCRCVINWCEDTHTPK 
KHLVCVHACVCVCVCVCVCVCLGRSRQS 
EH 


1438 


15339 


A 


1446 


85 


416 


PGCLSLPKIWDYRCEPPRPPHFFIFSSI 
QGPLTDFFLTPLEQVQISPTAKGFLKKI 
PFFFF* * RRGVP ILARWFKS * PHDLAA 
SASQKFGITOVSPHVHPIFLY/CSSIQG 
PLTDFFLTPLEQVQISPTAKGFLKKIPF 
FFFLRQGLT\RQWSTHLGLPKCWD*GCE 
P /PMPGLHCEFFKG 


1439 


15340 


A 


1447 


376 


3 


I KS E 1 1 ETESRKP I SKIS / RAQS CFCGM 
VYEIDKLLARLDRKXMKTQLNNIRN*RD 
EINTDFTDIKRI I RENYKQDYGTKFNNL 
. DETDI FLERHQLPKLTQEEKDNLNS PIT 
IKDIEIIV*NLSTG 


1440 


15341 


A 


1448 


2 


192 


skwiknlnkelkp*ds*kkpegnlhdir 
fgndfldvtpkn/matkgkidn* tlpkf 

KMLMRIWRN 


1441 


15342 


A 


1449 


411 


3 


vflpplppfcflnff*gkggfffkiff* 
kkkgvegpqlgppppfgpfkkffff* if 
lkaplfvpptlkplf/cppkkknwennp 
ppfffcffvkktqfyfffffflrnrvll 
chprwsvmv*sclavastswaqaicp 


1442 


15343 


A 


1450 


413 


1 


ea*tgewcnpgggachepr* ppcpppwa 
tergfvskkkrkqek/ srimecfqdnlp 
gffqifsvmknkqegr*fwtk/vkmtky 

NA*NVNGS * YWKKRENEEECYDWKKLF 
QSILS*YVKKAR*KPGCWLMAAIPALW 


1443 


15344 


A 


1451 


4 


384 


dpai pllgtypremktyvhiktctqmfa 
avlftvaetskrpkcptde/vnki*cty 
i/mtyysamkrde 


1444 


15345 


A 


1452 


3 


390 


LPDHPGSSVSTPRGVT ITCRGFWLLFPW 
VSFFFFFFFLKRISLLPPNWRGGGQNLL 
NKSPPPRGF*NFLA* PPQGGRKKGPPHP 
PGAPGDKNP/QPFFFFGEKKKIFNPPTG 
GEKKKPPS P FKWGGGGPN 


1445 


15346 


A 


1453 


37 


381 


LILYINVCV'OTCVYIYTHTQMGSHIVTVE 
AGVQWCNIi04W*P*TPGLK*/FLLSSGD 
YRLTLPHLANLYI FFFFFEQRG\ FFFLL 
KLV*NSGPKATLPRVGITGLTHTPRPYR 
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Amino acid sequence (A=AJanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Giycine, H=Histidioe, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, ^Proline, 
V|=Hjiiutamine, K— Arginine, 5>=berine, 

T — T*Vt ronnina \f—\J a linn 

a — l nreunine, v— valine, 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GDLFF 


1446 


15347 


A 


1454 


3 


400 


TEMVLHTEMVLHI S VWDPLIiDRPGRGVT 
WIiMLTAELFLYGGTEAFSSSFDV*GPCS 
ETVECFSD/L*A*GPP*NT*SPIYRFAS 

CLdjAr V*l I LtLiii X Ki3 V \S V Iybvjv»K/ V4QU 
LTSLQPLPPRLK*SSCLSLPSSWE 


1447 


15348 


A 


1455 


77 


398 


RAEIVPLYSSLGKRVRPPSQKMYIYYEQ 
LHAHTFGNIiAEMDKLVERYKVLSKFT*E 
E 1 \XjNS PVS I KERE F \ AVI SIi PQKETIX3 
PDSFTGGFYIFKEELMPILQRLFW 


1448 


15349 


A 


1456 


397 


69 


CPPGLSGIiPWVA*AVPP/RRPAPPPPPP 
AGIDAGLGDDPLQQTHHA/PPAAAGSAP 
AGCAAGPARGAPRGRSPPRRGSAGAAPP 
PWPPAAAAASSAGGS S APCAS 5 PAAPPT 


1449 


15350 


A 


1457 


400 


2 


ALFFWAPKKKKNFS PPPGKKFFFFKGPP 
PLFFFFFFFFFFFFFFFFFFFFFFFFFF 
FF*RRP I FILFYFLNQKNIPRDFFFFFF 
FLRDRVSFC^PGWTAMV*SQLTAA\ *TP 
GLKQSSHLSLPSS+DYRWVPPC 


1450 


15351 


A 


1458 


343 


11 


QGVHSRKYKEISKLNS * KKNPTRKWAKD 
TNRHFTKDNIQMENKNMKDVQARCSGS * 
F*KN*NFKSYAL*PKCSEI/KKSVTGTN 
WRNSQIYDKLSTLQNN*WIFSRSLYNFE 




15352 


JV 

A 


1 A eft 

1459 


1 Aft 

J 09 


Am 

401 


QS AKATE WRKDS * NHQTS / WWKES FTRV 
KALEBITKRKVNSLAS ISVKTD VHVTKV 
♦IiKKKKKKRGGRFKE 


1452 


15353 


A 


1460 


42 


424 


CPAN*NSFSRDR/SIiPMLPRLILNSWPQ 
VVLlL/ PWPPKVMGLQ 


1453 


15354 


A 


1461 


419 


69 


IFPLKKKKRGGGVGPPLYPPPLGAKRGG 
S P * KQNLKP PRPKKENS PFFYKKKKKGG 
RGGAPFFPPPLGGKTKKFFLPPK/IKVS 
FNPNLFP P P PPPGGKKKFFFPKKKKKKK 
SRNTVWF 


1454 


15355 


A 


1462 


1 


277 


CHTNFLCMSFPPAISHHPtMPPHPVSRS 
VAQAGV* WCDDGRG/CSELRSCHCTPAW 
MTERDS ISQLIRKKKKLIKIKKKKNKNK 


1455 


15356 


A 


1463 


382 


3 


KAGGSINQNSPPPPPPGEKKKKPPPKKK 
KKKKNPFWGGPRILPHNPPPFER*R*KI 
FLGPKFLTPLGPKIKPIiFFFFLKKKKKK 
RMTY/HSVAQGPLLNXDTLQAG\LSKA* 
RSPPKSKSMGQSFHRN 


1456 


15357 


A 


1464 


561 


86 


TVTTIDTT GT WA T?l^ T T , I7/ r lVT^riC?'C T T Tt TO GOT \ 

JJUJr lLa IjJUUSJVJ, r titt-lyubr Jj J. V oooJU\ 

NKPGIEGNLLS* * KASTKQNP IMNLRV\ 
IiNVFPriRS*TRQVCLLSPLIjFNIVIiEIL 

SLFI *VKNLMEFAKKLLELINEYNKVER 
YXINIKNI LLAKNTWTLKF 


1457 


15358 


A 


1465 


3 


221 


RFHRVSQDGLDLIiTS * STRIX3LPKC / WD 
YRIiEPPCPAKTCLILNGSC*VFLCYLSC 

r IS^yilJKJrO VJNDVVvxrtX V 1 Jj 










Z 




WWPAWTHTPGLKQTSRFSLPECWDYRRE 
PP\PGLVKFLL/IQYRWTQHVGLGDWV 
QDHHPHIE ICLHKSVI *NGI VFAYNLRT 
SSLTLFYLRRGLF/SAPQAGVQRHKLSL 
LEPLPPSFKRSS\CSSLLSSWDYR 


1459 


15360 


A 


1467 


349 


413 


RLGL * PRKD I TD / HVS LRKDTG/ W PGAV 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M-Metbionine, 
N=Asparagine, P=Proline, 
Q=01utamiae, R— Arginine, S^Serine, 
T=Threonine, V=»Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














AHACNHSTMAGHRRQIPRSGV*GQPGQ 


1460 


15361 


A 


1468 


403 


2 


LPVBWKGGIFGSFHPPPIiRIiKKFFSPTP 
PKS *D/ YRGGPPPPGYFFFFFLKRGFSP 
FGRVIFKFPPPGVPPPPPPQNFGFKGRS 
/HPSRALFFFFKRVFF* I YP FFFFFEME 
PRSVPQAGVQWHDLGSLQALPPRFM 


If Ul 




A 


1 ACQ \ 


4.9 T 




IEFSFFAPSKDKGAILGPSTLPLPRFNN 
FFCPPFP INKDIRRGPPPRPNFFFFRKR 
GGFPHWAGMFLSFNLQEGPPPSPPQ/SV 
GF*GRN/HPPRAQFFFFKKFFF* CYPFF 
FFFEMEPRSVPQAGVQWHDLGSLQALPP 
RFM 


1462 


15363 


A 


1470 


13 


427 


RTRGLVFDKTEFKPPiQKKKKKA\GPFL 
RGNRSILKKKATFPKNIiFAPNPGPPKFI 
KKVLSDIiQKNLNPPPRMQGNFTPSLIjK* 
NK5SKQQITRDFRDLTSPLDQGDL*KIY 
KTFYPKTTE YTFS S APHGFY / S * FDHKI 


1463 


15364 


A 


1471 


378 


1 


FVRPPFFFFSSSRPFFKOTGGPLPPAPQ 
FFFKTPRGNPLLRG* KPPPQTPVGGAKG 
S PPW/VPPGFPKKGGGVFFFQQGL* KI S 
PPGPAPP *LVWGGESPPFKKKKSRPGW 
AHACNLALWEAKAG 


1464 


15365 


A 


1472 


412 


53 


SRLSFFLSSEPHGPPSPIiGPFATNKVAF 
C*/PPPPF*PSPPPKFFFF*GPKSVIYF 
YQR*PPTSRIFPPRGGAGPP/PPFWGFV 
NPKKK\LKPFSKPSPPAAKFKNPEGTKL 
GFFKKKKKKL 


1465 


15366 


A 


1473 


116 


34 


DWNIKPVLISNVMTGLMSMI ITI1I1LQLF 
LL*LFFFK/WELTFFGGKFHQRVDPSFQ 
GS IjGRGRQFFIjLTKTE YHI FFL I KVFNS 
KTEGAGLAT* IK 


1466 


15367 


A 


1474 


42 


428 


EIIMESINRFDVITF*NFCL*CYLMSKI 
NRQASS W/ENV*NQYVGEKILI FLI YKB 
CIQINKK\KLRPKIDTQVKDKQISEEGM 
QKANQHMIQC\QPSLVLNiOyiQIE LAEGH 
HLPYQINKDSKMMVEERRQ 


1467 


15368 


A 


1475 


87 


433 


PQSSPFSDHCSYQRLFLHLVKVSTYRLQ 
KIRKIHKSPOJNBYFGGFFLFFSFLFFF 
ETGFNFVPQAGVQGQDLG* LQPLPLGFK 
GFS/CPQPPRTFFFFFFLRVWGGFIQKK 
KFLFV 


1468 


15369 


A 


1476 


212 


430 


SLILMTSNGIHFIYFLFUiLLFFFF*DR 
VS ACHPGWRAMS* S YFT*ALTSQVK\QS 
YLSLSSS *NYRHTAIMP 


1469 


15370 


A 


1477 


493 


2 


PGAAAHACNPSTLGSQGGWITRSKIPEH 
PG\TLWNPRSY*KTKMCGLEAP 


1470 


15371 


A 


1478 


454 


492 


HRVGEERF/CLFETESHSLTQDRVQWHD 
LGSLQPPPPRFKQFSWYHI*PHAW 


1471 


15372 


A 


1479 


2 


213 


IDQERRRLMEGDRERDTETDAEKDMGRE 
/RNRYRERERLRG\RRERRKRRDRMT*M 
PRERERENLNSLYRETYRDFETEWVMDR 
ERQRLKRRLL*AVIVPSHSSLGSRRKTP 
FQKKR* REGERKSEFLIQRDI 


1472 


15373 


A 


1480 


77 


453 


SFGDSLTLSPRLAVQWVYLGSL*PPPPE 
LK\YSPTSASQVIHYLLFFFFFFGKKVS 
FCPQGGGEGPPFGLLEIFAPGLMPFFCL 
NPPKGWWR/RPPTMPKLFFVFFIKRGF 
SPGEPRGVSFPEPGT 
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Predict- 
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nucle- 
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correspon 
ding to 
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acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Aianine 
OCysteine, D=Aspartic Add, 
E=Glutamfc Acid, F-Phenylalanine, 
G=Glycine, H=Histidine,I=Isoleudne, 
K=Lysine, L=Leucine, M=Metbionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nudeotide deletion, ^possible 
nucleotide insertion 


1473 


15374 


A 


1481 


398 


96 


KRPWCFFSPPPKKGFFPPIjIFVGPRFF 
SPPPVFK/SPPPFLFFFPP*KKKLFSPP 
PRKIIFF*KPPPPFFFFFFFFFFFFFFF 
FFFFFFFFFFFFVFILLLS 


1474 


15375 


A 


1482 


484 


64 


QKRQMANKYMKICSPLFVIREMQTNTTI 
RYDDI PTRVAKNKTNK* KSNSTKYCQEC 
ESARP*IYCWWKYKMVQPLWK/S/VWQY 
LLKLNI *LP YDPVI PLLSMYPSQIKTCI 
PTKTCAQMYLVTLFI ITKNQKLAHSARV 
G 


1475 


15376 


A 


1483 


2 


400 


YNKKGQGVAPRYDVDTAANFPKELAKI I 
DEGGYTKQQSFNIDYTALYWKKTPCKSY 

SWR* ih/ slaskdrlt\llvgnwgdfk 

LKPVLIHCSKNPRVFENYTKSTLPVLCK 
WNSKAWITAHLFTAWVAEYFEPT 


1476 


15377 


A 


1484 


459 


65 


GGPPPPHKTIDFFFFFFFRGESPPPPKK 
KKKKKKG*DNAEDEAQS*RPPPSI*BKI 
MHL I CALTEEY * * LTAE 1 1 ASNIH I STG 
SAYI ILTEMLELSKLST* WVPKLFYPNQ 
LRT/RAEL*MVILNK*DQQF 


1477 


15378 


A 


1485 


2 


518 


PPPPPQRFGLRGGAPPPPKKKG*DNAED 
EAQS *RPHPS I *EKIMHLI CALTEEY* * 
LTAE I IANNIHI STGSAYI ILTEMLELS 
KLST*WVPKLFYPNQLRT/RAELSMVIL 
NK*TSDP 


1478 


15379 


A 


1486 


122 


501 


PRDPPALALQNAGITGLKQSYFLSLLSS 
WDYKHKPPCAPGYLK\VFFLEN*YLALY 
VF* *SYQGTSVLTQPPSLIPRHSSPGVS 
VQLFQKK/DLQHGCLLTPS I SGYSVTWD 
GVQWRDHGSL* P* PPRF 


1479 


15380 


A 


1487 


491 


425 


IPKNDI*AEP*RINKR/CAKRKMVRKGF 
EAESTICKALRYEE IVCSRNTEFVHGWI* 
♦ENRGQIMKGFGNS VEKLEHN* KFYEG * 
LLYRFE *LAAGFI VFQQ* VS KEI VKAWP 
GTVAHACNPSTLGGRGGWITRSGDRDHP 
G*HA 


1480 


15381 


A 


1488 


441 


17 


KKTNI YDQLI FNKGAKSTKLRKNSIiFNK 
WCQDKIS / 1 1 QKMKVDP YL I SNI \NLKW 
LKDLNVTAKTIKLLKENTGAILHDLGPC 
NAFLDGKPKAQTTKKKQVK* TSS KFKS R 
OQWLI P IISTLWETEAGGSPELRSSKPA 
WPT 


1481 


15382 


A 


1489 


413 


1 


LFWKEMVFLFWPGRVGNPGPK\GWLRLR 
PPPF* FSFQPFWGPPVGLLFPKNFYY* P 
PGGAHGPNPPPPGGKG*GPPRVGGGKFN 
* PKFGPCPPGWATRQKPVFQKKKKPNKI 
KNRTVKVPKVTKFIYYVPYLNGVSN 


1482 


15383 


A 


1490 


359 


407 


RHRMITFFT* I VTQNFPYL* KKMDIQIQ 
ED*RTPN/RDQHKHTPRHI I IKLSNSQ\ 
NSKRILKTRKK*LWY*GTHIRLSTYFW 
PKS 


1483 


15384 


A 


1491 


397 


39 


RKFRGRSFAKTLFSGPGQG/PKGNPGLI 
PEGP P r? ¥ rW PAEDGGFFr P PGQ IiPPQG 
PIRGPGPFLGFPVPPPEGEGPPPTGP* K 
PGQKRPKKRGWSPQKKGPKKKTLFFFFF 
CQMESRSVT 


1484 


15385 


A 


1492 


378 


1 


FGVFWFLAPRRKGGFFPSHLIWVPPGFS 
PPPGV/SNPGPGIKFGGPIKKIFPCPPR 
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OCysteine, D=Aspartic Acid, 
E=GJutamic Acid, F=Phenylalanine, 
G=Glycine, HNHistidine, I=Isoleucine, 
K=Lysine, L=Leudne, M=Methionine, 
N=Asparagine, P=ProIine, 

O^Gtutamine. R— Ar<nninp S==Sprinp 

Vc ^* UIV) AX i»l glUlllV, UtJVI lilt} 

^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X = Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GGKFGPFKGAPPFFFFFFPQDGVCSVAQ 
AGVLQ * AEIVP \FPPT*ATEQDPVSKKK 
PTRPPTRPPTRPPTRP 


1485 


15386 


A 


1493 


3 


428 


I KS P * PDRFIAEF YQTFKELVP I LLKRF 
HKI /EEGTLPS * FYE /ACVTL I PKPGKD 
TTK 


1486 


15387 


A 


1494 


3 


417 


I LNNARLKPFLLR* ttrqgilllsllfn 
KI * VLEFLAGAI SQSKEI KVI QVRNEEM 
SKTLFSQTT*S*GRNPFIKIPLPKMSW 
M/NFIKVTEYKVSIH 


l**o / 


iOJoo 


j± 


1 AQ^ 




A 
*f 


PFCLGSKRFPPFFNP PRGGKF* KKKKI F 
GPPRGGPPLFPPLWAPKGGGPPRAGGSG 
PPPPKGGNPLFINPKKN/PPPPGGAIFQ 
SRFLGGVNPKIFLFPGGKFLTDPGFPPS 
LPPGGKKKKPRFQKKXKRKKRKVKEKKY 
FL 




I Jj07 


21 








FSKHDTPMVNKHMKD / CCTSL VIREVO I 
KTTTRYHLMPTRMAMCIYIFNYILFLK/ 
SKNNKC* * RYRE IGTLIH / AQWKYKMLQ 
LLW/ KTI WQIiLRMLNTKLSKDQB\IPLL 
GI * KKKKKKKKKRGGRFKGIN FTDPGVE 
RINFYNSAPK 


1489 


15390 


A 


1497 


3 


326 


WPACL*AVAAVAIxLVPEATRIi™GNNLT 
VCTPHS IAEUjSSKG/DLWLTDNR/LLK 
YC^LLE\DLQLRTFTCLNPATF/VPEE 
TGEPEHDCEWVWQTGKRNNKDHCIiYSL 


1490 


15391 


A 


1498 


345 


22 


KSKWLIDLHGKCKTLKR*YRKPTK* \LG 
HGNDFLDTTSNAWFI /RKIDKLDFIINK 
NVCSGKVTVKRMKR*TTDRDKISSKGIS 
DKRLLCKI YIKI LTTQQ 


1491 


15392 


A 


1499 


194 


432 


P WCVCVCVYVCMCVFETQS /HVARAGM 
Q*HNHSSLQL*TPGLKQVSCLNLSSSWD 
YRHTPN* FFFF/ CNFYLERGGVS 








1DUU 


mi 


417 


RD*FMRFK\ EKS YSCNIKIOREAASANV 
ETMAS Y* EDLAKI INEGS YTK\SQIFNV 
DEVAL\KKMPTRTFIVRERKS I PGFKAS 
KDRL I LFLG / ANAPGNTKWKP VL I YHST 
NPRAFKNDAKS I LLVL YKLNS KAWRIAH 
IiF 


1493 


15394 


A 


1501 


414 


0 


SSSSSSSSSSSPKWTTGALQP/LIjSRTP 
LKDSSEEESSQ*AEIi*AVHLWHFAWKE 
KWPDM*LYTDSLAVASGLAGWSGT\WKK 
HDWKICDK/DWGRGMWMNLS 


1494 


15395 


A 


1502 


3 


125 


RLGLPKC*DYR/RAATTPGLH*F*SWKE 
QRPQ* LALGRKPVNRNKNTQKYIHI YTY 
TH/PIPTNVYTCIYTHIHTPV*YTHIIY 
I IYLFKVLVFGFIiRRSLT/SVAQAGVPW 
RDHSSMQPRPPYKQLTCH* *LASQSARI 
TGGSHHTWPAL I LKLERTEAPVI S FRTE 
TSK 


1495 


15396 


A 


1503 


406 


172 


DI ILDR* RQKQRIiRLRQKQRETETE/RR 
DRGRERORYROROVOROGRRORORLRHR 
Q*QRQIDRLRQRQ/RASRDRGRGRDRDR 
GRDRGRDRERQRLRQRQR*RBRQRHRQR 
/RERDRQRQRQRDRGRRRDRE TDRGLDR 
G* DRGIDRGRCSDRGGDRDRGLDIGSDR 
DR 


1496 


15397 


A 


1504 


3 


420 


ITGVSHRAWLPS * FLKF FVWEME SHS V 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenytalanine, 
G=GJycine, H=H is tiding I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamme, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AQAGLK/ PSGLKRSSHLDLPKCWDYRHE 
P\PHLAFFQFF 


1497 


15398 


A 


1505 


407 


1 


PISIYHSKDPGSLKSYTKSTLPVF*KCN 
NKTWLTAYLFTAWFPKPFKPTVETYC/S 
RKKS P FKMLLL XANAPSHPRALMEMHKE 
I P WFMS AVTS ILQPMDQGVVLTFKSYY 
LRNTFCKAIAAIDNDSSGGSGQIQ 


1498 


15399 


A 


1506 


» 


408 


PPGFKPFCLSLPSSWNYRHP P P/RPG* F 
FVFLGGTR FHHVG YTAS QHLT S RETHAY 
ALQ 


1499 


15400 


A 


1507 


2 


416 


EPRSHHCTPAWQLS KTLSQKKKKKKRKR 
MVFTGRKKGLFFGN/LKNLGTLSREYPL 
GPGIKNRLAQKRKPI>FYKKRF*NINPGG 
GAHPGGP KS WERGGGRKI * TPVGKYASN 
PEYHICIPPGKRNQNPFLQKKKKRRAD 


1500 


15401 


A 


1508 


276 


14 


SPPPYFLLIR*GRKGRGQF* * LRLFFI I 
NLRQCL/DSVSQAGFQWHNHSSLQPRTP 
GPK* PS FLSIiPSSWDYSHAPQLLAFYGA 
NCFN 


1501 


15402 


A 


1509 


1 


391 


NIFKBIMSENFPSLMSENFPSLGKEIES 
QIQEAQRTPNKMNPKRSTPKQVI IKF*V 
RE/MLKTAREK* I VI CKGTTLRPAVDFT 
AETLQIRRE* DDTFKTLKGKKKKKKTLP 
AKLTFPSILVFTOCTRGGSR 


1502 


15403 


A 


1510 


2 


419 


P RVRSRATNVI S KYKQHKKKNTS K* I KD 
LNVXPEPIKbL\EKTTGEKLLDIELGND 
FLDS/TPKTRAPKANLTP*NYFKLKSFL 
TAKETFN\KTPTKRGANHISDRGLISKI 
YKELTIQ*QKNNLISK*AKDLKRHFSKE 
DV 


1503 


15404 


A 


1511 


392 


122 


SVSLCVSLQSSGMFRC/LPTMPI»RR*RQ 
ENPLTSAGGGCSEMRSHPCTPAWVTQQD 
SVSKK* INK*IK* /Q*LRQNVKIKPSPI 
FCEQGKRRK 


1504 


15405 


A 


1512 


2 


281 


GGCSELRSCHCTPAWTT\SETLSQKKKK 
RKKIC I ECNTLKSH/ I LS FVCGGKYHS P 
VS CTCL * WS LGRS LKRS LERHQKNLNDP 
YPIFQNETOIH 


1505 


15406 


A 


1513 


242 


382 


QGPNLG * LHPPPGGLKGFSPLTLLRS WK 
NRLPPQHPFYFCFFRKNKV 


1506 


15407 


A 


1514 


1 


388 


RTRGTERDT/ RFRERRRERDRGREG YRW 
RYRERERHRE RES * V* RDRVRDGDRVTD 
RERERQSERERLR/RERDWERDTYIVRE 
TETERETEGERD I BRERDI VRDRYM/ RD 
RDRDILRE * ERDRETALDSEK 


1507 


15408 


A 


1515 


4 


285 


TRXQICNGDKTALY*KEMPSRMFPARVE 
LMPGFKAS KDTLTLLLRPNAVGDVNLKP 
MM I YYS ENPRALKNYDKTQLCLYS TNGN 
KAWMTDYRFT 


1508 


15409 


A 


1516 


415 


2 


MGDRESLPPSRVFLFFLAPCPKGILFHP 
V*FGSVRSFSIERCYR*GQRNEFWGPVI 
RVKASSSRAG*WFSSTAPPIjFF/CFFE 

FLCPSLPDAW 


1509 


15410 


A 


1517 


1 


324 


PTRPEMGFRHAAPAGLELLSSSDLPTSA 
SQSGGITGVSHRTWPILASNNY/SMDKL 
CAI/CERFILFFLKNIIiLWILILFIFIY 
FCLIK*CITINFSQNSIiLPHSIHLYI 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino add sequence (A=Afanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny)alanine, 
G=Glycine, H=Histidine, J=Isoleucine, 
K=Lysine, L=Leudne, M=Methionine, 
N=Asparagine, P=Proline, 

0={*liifcini inp R— A rtrininf .Q=Cprinp 

T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


1510 


15411 


A 


1518 


2 


163 


ACRYPWIaNFVLLVEMGL \TMIASDLERS 
G* PQ/CDLPASASQSAGITAESHHGWP 


1511 


15412 


A 


1519 


425 


52 


VLRGAQFLFQEARLKWDFGPGGMAPPCN 
PPPLGV*KGPLPKG\GGPNPPGPPRGNP 
VFSKKPKITPPGGGAPEVPPPWGGGAEK 
FFYPGGPRVQGTE/RKTPPPPPGRQKGT 
PLPQKQKKKKPWSIi 


1512 


15413 


A 


1520 


3 


404 


THASGKSNIRGLTLPVFKTYYKAAVIR/ 
TVWP/WLRANTORQNRL/DGPEVDPYKC 
SQMIFDKGAKAIQ*RTDNNLLNKW*/ES 
*TSTCQKKKKKNQFR/DPAFIiIYTKFNS 
NGRKNLRVTGKT * KVLKKNI KQNWGDLG 


1513 


15414 


A 


1521 


332 


39 


TSRETGNFQLDLTAFLNEVGVLFLWRKG 
KGKI D IGIGTKEMP * PP I * LliMFS YI VI 
ERERKPE/LSLSPSIiECSGMILAHKKNS 
LPGSSDSHDPASRVAG 


1514 


15415 


A 


1522 


482 


0 


RARTSGVLLCS PGWS * TPDLR * S LCLS F 
PKRWDYRT*ATVPGLLYSLYSRFSP/DE 
LKGCEKS RTS PA 


1515 


15416 


A 


1523 


1 


417 


NKC* *RCEEKGMPVYYWYECK*GQPFWK 
TKERF FKKNL /N I ELP YNTAIALLNMRP 
Q*I*SQ*RKVCSCMLC»T\PTPNK*IKT 
MW*VaVEYSSPIJCKN\IiFSTTWINLE 
DISSNK/LRHRKSS ISRFHIiLL 


1516 


15417 


A 


1524 


1 


397 


RDSTYQGHHTPPVQ *GLRYGVIIiFITSE 
VFFFAGFIi/WSAFTRSSLAPTPQDGGHW 
PRTGITRIi^n^EVPIiLOTFVLIiASGVS I 
T*AHHSIiIESNRNRIIQALLITILIX3LY 
FTLLOASEYFESPCTISDGIY 


1517 


15418 


A 


1525 


2 


376 


I^TGQKLGLLHQTVSKFVNAKEKFWK 
ELL\KSATPVDI *MIRKRTSLITDMETV 
*W*KEDQTSHPIPLS*SLTQSKAIiNLF 
KAMKTORGKGAVE * KS EANRG WFMQFKE 
RSC/RFCNIKVQG 


1518 


15419 


A 


1526 


3 


386 


ESMLKAKTGQKLGLLHQPVS KFVNAKEK 
FWKEI*L\KSATPVDI*MIRKRTSLITDM 
EOT* W* KEDQTSHHI PLS * SLTQSKAL 
NLFKAMKTDRGKGAVE *KS EANRG WFMQ 
FKERSC/RFCNIKVQG 


1519 


15420 


A 


1527 


127 


388 


KRKSQINNLLLQFKELEK/QEETKPKAS 
RRNKKKKI RVDLLKI KKGKPLEGVKKKG 
GFF*RTMKRDKPLLKLPKKKGGGRIKTF 
HKTS 


1520 


15421 


A 


1528 


3 


402 


HENHMKICSTSYVIRELQIKTTMK*YYT 
PVRTTAIQNTDHTKCWQG*REQ/GSLIY 
CW * QCKMVQ / PLWKRVWQFPTKLKHSLN 
I*SAVVIiLGIYP\KSGKFNVHTKTCT*M 
FLAK* PQCSSVDEERKKM/WYMYGFE 


1521 


15422 


A 


1529 


2 


365 


ieklyrsgakffcrdrvslccpgwsrti 
glkqsscfgfpkcwdyr/r*aalsgriil 
i/tissh/ reqieqefthypeks aklfmq 
dgpprrk/ hpptrpqasniqnkfsfflr 
qsl/nslaqag 


1522 


15423 


A 


1530 


1 


418 


GTDTE*ACDKIQKPFPDKRLNKLG/IRK 
KHLQLDKYL/WKNPIASIILKSRRLKSV 
RLRPGDGRQGCSFPAWLFNIILEGLARA 
IR*EKQKQKQGQ\QIGKKEVNLSLFIED 
IMLYIENLKESTKKPI 1 1 INBFSRKEDF 
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SEQU) 
NO: of 
nucleotide 
sequence 


SEQH> 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

ammo acio 

ICMUUC Ui 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
fling co 

iaai amiau 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
OCysteine, 0=Aspartic Acid, 
E=Glutam*tc Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, 1-Isoleucine, 
K=Lysine, L=Leucine, MMVIethionine, 
N=As para gin e, P=Proline, 
y^uiuiaioiDe) i\ Argimne, o — oenne, 

W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














tCf 


1523 


15424 


A 


1531 


381 


3 


ILQGFVTRKKS KI FQKACHTSDLWTLTN 
LSCLHMKGLPPHRLYSSCTVIiLFLRKPV 
MRKTTLSCCFtiT*DFFLPLI*PPCFFR* F 
ILLIF*LYFCKDKVSLCCPSWSKLLP\Q 
QSSCLSLPKYQDYKA 


1524 


15425 


A 


1532 


38 


479 


DEACGPQUP I IiTP YVKTyW I KD * TRNKG 
IQFLEENGKN\FDIGFSSDLLDMTPKTR 
ATKVKL\NDIRLRNFCASKDTINLSLLC 
R\KR*PVEWEKISANHISDKGLISGIYR 

QSALYRWVL 


1525 


15426 


A 


1533 


105 


447 


LIFCRVFEYLHSLHLPQEICLSLALFSR 
FTFCVT I CEVDVWS VI FKVP FCSKRNKV 
AVHTMLYIQIFVSIiFI*PQNWKQPKCPA 
rVJaKlIMKMiV x J. til V/ Six SANKR 


1526 


15427 


A 


1534 


76 


471 


VWVCLLS LEGS QS KFGNS I E FGVLLSSG 
GFSAWRLFFFVYFLRQSL/NSVAQAGVQ 
QWRDLSSLQPLPPGFK*VLKQRGVCLFV 

FKRFSCIi\SLSPPWPSG 


1527 


15428 


A 


1535 


45 


338 


SNNEPFLDWIVM*RKVDFIRQPAMTSSV 
TOPRRRSKALPKAKLAPKKVMVTIWWSS 
ARLIHCSFI21ASETIASEECTQQIDDMH 
*KLQRLQAALINRK 


1528 


15429 


A 


1536 


425 


1 


FFNXTFHS VSFSPIiSQKCILTIi* XLLKS 
IVKNN>fRNFQRSLVRK*AKDMNRHFTDD 
HVQMAS / KHMKRCS LVTGEMQ I KN\TVS 
YHYPP I RMIKVRNS SNTKCW *GCGQTGS 
LMRCWW/NK*NQLLWKTGI PPHGLVSTR 
SRVF 


i ton 

1529 


15430 


A 


1537 


4 


443 


ETFVUHYQCGG I RP FS DLQLHAG RTTAL 
FKAVRQGHLSLQRIiLIiS F\ VCLCPAPRG 
GAYRGRQ VS LS CGGLHP VRAS WLLCL P K 
♦AWTMEGTSTPASLPPCSLISDCCASNQ 
RDSVGIGPSEPGAEYNLLVPRFLSPSEK 
RSIWVGV 




1 CA1 t 

15431 


A 


1538 


A DT 

457 


3 


C*AFWWGGTQQAASSTAAMAAMKPLGIW 

ELS IGRQRAWREQPDQSAEPPFSQAPRP 
GYPPSPQPLSMRRGPGANPRLARPLRGP 
VRVRLRRASSERQKRSRGGSGPLG 


I Jo I 




7\ 








I YIjFI FETESRSVAWAGMOWRD'LnST /I P 
SPTG 


1532 


15433 


A 


1540 


475 


202 


PGGGWFSPPDNFSIiKENSQGGPPVSHPP 
PPPGNGEGGKTPGAGHSGI*NPPPPLKI 
N/LEKGGRGGPPGPIiLNPPFPiCEKPRGG 
KKKKKKKR 


1333 


1 CA1A 

15434 


A 


1 CA 1 

1541 


1 A 

14 


A HQ 

4oo 


LKE I KSVI PMNTLMVRERNSLI VAVEKV 
LVA*VKEQTSHNIPLCQSLVQSKALTLL 
HSVKAKRG*EAABEKLEAGKHWFIjRFKG 
KSRLHNI KVQGEATSTDGELQQVTYPDL 
VKIVGEGDYPKQ 


1534 


15435 


A 


1542 


479 


141 


RKTDSWDLIK\SFSTAK*AINEADGQTT 
EWEKTFANYASDKGLISRKELKQINKKK 
ANS P I KK* AKDTKKQPTNMKKCRTS L 1 1 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amifiA Acid 

residue of 

npntidft 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G1utaraic Acid, ^Phenylalanine, 
G=Gtycine, H=Histidine, l=Iso leu cine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glu tamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, Impossible 
nucleotide deletion, V=posSible 
nucleotide insertion 














RMQIKTTMRYHLTTESE*LIiIKSQGRA 
L 


i jjj 


1 J too 


A 




17 


474 


NPKXKKKKKIDLLDTYYTLYPK/AEYTF 
SSEIHKNWLYIN*LRANLNKFIN*KIKT 
I LSDHNRRQPE INKCNKTKR/RVT/TQK 
FKNLILSNYWVKGELQTQIAEFIjKRNVN 
KNTIYLNT*NTIKVIiNSGGST/AIiNTYI 
DKNFKKRVPS 


1536 


15437 


A 


1544 


89 


481 


ICLKVISLRQENATSVACIVSCLSEGSR 
ASENLVLHNASPNSCGLGIAFTCLCNVV 
SSGNKITDCYD*IiSSS/WLGFFFFFFLK 
KSFLFVAQAGGQGGNLG*PKPLPPGIiKQ 
FLPPPRANFCFLEKTGFFLF 


1537 


15438 


A 


1545 


1 


300 


P P P P APXXCRP Y * XP WXXP VYHS S WRHG 
SGAAQGAVLAGFGGVGRQGPGAASVSIP 
LCPENQGCREPGPSHAVPAPSALPSLRS 
LTGGQIGGTRAAQAVG 


Ijjo 


1 JHJ7 


A 




9 


416 


GAPEIKSIIKGYND/RLCTTKFYNLDEM 
DKFLVRHKLPKLI * E* IDNLNRWITSQE 
TDW*I*QQSSSSSSSSSS/PSSSSSRPN 
GFTTESYQSFEDKLIPIICKIiLKKIDKE 
\GHFPIiQL*GITQI PKPDI yh/ IENYRP 
ISIjM 


1539 


15440 


A 


1547 


54 


419 


PATWEDHLSQGGRDCSEPRLHHCTPAWV 
RE*DPTSKKKKBK/NDQ*LLIP*S*RRL 
*RSSHQTSHLMDWNSEAGR+EATFLRSY 
S KL\ KEELELUiI LDNNDDHNNSD* YLL 
SIYSIPGTVLS 


1540 


15441 


A 


1548 


37 


339 


KRWKCIiRA* LLMRPRH7VD * LNPDGVGYS 
EL*SRHCTPAWVTEQDLVSISTNRKNER 
HTLEYSHQHYSG* P * TG/ EESE YPSAL * 
QRE I IDYS FIQGMTDQL 


1541 


15442 


A 


1549 


477 


1 


PGSHDLGSYT*PQ/WSSPEVTSRDAPS 
HPSAPKFCSNPCRGRWLTSSKQPKLRLT 
SAVPGAPGARGTFHALGAGAAEAGGHSA 
SRPEAALCRPLPPLPMTLTSHPLLSGPG 
RLAWGCNGRR* IKGGG 












& 

o 


CWPGSSGTPDLK*STRPGLPTCWDYRHE 
PLCPASKTFLSPQIETPYPLNNNAHSPR 
RPALVNYSLLSVSMDLPIL\AFHTNGIT 
GYVAS PAP P VRWGSHS VAQAGVPWPNLG 
SL* PPPPRLK*SSRLSLLSSWDYSHMTP 
LN 


1543 


15444 


A 


1551 


2 


419 


ETSPS LQGWLGVLFPKRGAKTSRFIiI IR 
PQGGSFKDGDFFNPPGEIKTPPAKKKKK 
KDSARSPPARLQA*GAGLWDARASFRRP 
FQAP VPLS AQPRAQ PIiAVGTS RDGGSSG 
PTPGQESAV\PWREKHPQQPPPQPG 


1544 


15445 


A 


1552 


2 


387 


FRHVAQAGLELLG\SSDPSVSASQSTGI 
RGMTYRAQPGL*LFNSKNSILSGPKV/L 
QDYMW 


1545 


15446 


A 


1553 


393 


2 


KNPIFFFLKQGFPFFP*LEGRGKFLGPC 
HLCFPGSKKG\LPHPPQLIGAPGPPPSP 
GFFFLKRGFS/LFCPGGSFSLRRKGPPP 
PALPKF\GFLRVTPLAGPGNPSFFFFFF 
RDGVS LCHPGWSAVAASRLTA 


1546 


15447 


A 


1554 


1 


427 


LLLIiILYAIVEAVT*TEGAGYPPLPGSY 
SHRGASGHRAIFALHLTGGCCI *GAMHC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine» D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyl alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionioe, 
N=Asparagine, P^Proline, 
Vpuiutamine, K— Arginine, o=oenne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














ITTMIDIEPPAITQYHTPLIiA* SGblT* 

GL\LVLSLPVLDAGITILLTDRNLITTL 

FNPDGGGDPILYQHLIj*VFGHPEVYTRI 
t no 


1547 


15448 


A 


1555 


3 


382 


TLYLLFAA*AGGL/GSALSLLIQAELGQ 
PGNLLGNDHI YNVI GTAHALVI ILFIVT 
P 1 1 IGGFGN* LDTL I FAAPDMALSRLNN 
I S L * LLHTS VLLLLAS AIVEAVA* TG * T 

GYIjCIiAGNYSHP * AS V 


1548 


15449 


A 


1556 


380 


3 


EVSPCCPGWS*TPGFKQFACLGLPRCWS 
YSREPAPPAKACFLDI*VK*WFCK*GSP 
EVNSKPGLxLFI YIjIKQKKGVK* *PARY 
LVLFFCFFVFVFVFVFF*HRIS\SVAQG 
GVQWCDLGSLQPPP 


1549 


15450 


A 


1557 


100 


254 


I PTVPTY* TPIKSFHRARIESSGPGYSW 

TYtTT*\OT» lT*!TriT HtnTOT T\OT»\ T1 T\ ft /^XTT IfTTTk 

PVDSAK*VPIiAVVSIJJSR\RDSGNLVH^ 
LMRVTN* MKRHLVTLTQS \CYS 


1550 


15451 


A 


1558 


2 


289 


APGVSSTMEDEMGGGLEPQRRGCSKQRS 
HHCI PAWATE *DCLKFNNNKKNYLIjLI S 
LRTQSYLYF/C* VNS Y*LKLS I KLAGGT 
GSKEH*SQKRKSK 


1551 


15452 


A 


1559 


24 


354 


PLPSASPGPBGATPVPTS/ACPNKIKLY 
JCTKKKN KKKKQQLiNE * ETG 














1552 


15453 


A 


1560 


376 


2 


AARGSGVRDPLEEAVCLFSDLQLRAGRT 
TALLKALFK\RQGHLSLQRLIiLSF\VCL 
CPAPRGGAYRRRQASLSOGGLHPVRASR 
LLCLPKQAWAMAGAPPPASLPPCSWISD 
CCASNQ * DS VGVG 


1553 


15454 


A 


1561 


3 


408 


AASTVFLPFLERKGIDFGFLFFFFGEKK 
FPFLAPGGAPGGLFSFPEASSPGLNPFF 
WPNPFEiK* KKGGP PPPPGFFFFF * KKRG 
FPGG\PGGAPFPDPKIGPPGPPKGGEFR 
GGPPPPGPNFFFFFLKGKGGGGPP 


1554 


15455 


A 


1562 


355 


161 


FKPGDGGCSBPRWCHCTPIWVIMRDSVS 

KPKQSKTKKRNVFT/C*D*VF*NNNINY 

Tvrr"*co # d a t vtt vt?tr/>n»iuf 
INCr 5 * KAIix XJjxEKQrM 


1555 


15456 


A 


1563 


410 


1 


TPPPPPKNFLGTPLFPKNKAGKGLFPPP 
LGFPKGGQWPIPPQRFPPLFSPKKRAD* 
KPKPPPALKIRGPNPGVFPKGFCGFSFF 
PPPPFPRGGGLIFFLPP/TK*SGG*KKP 


1556 


15457 


A 


1564 


2 


374 


ADRNLNTTFFDPAG/GVGDP ILYQHLF * 
FFGHPEGYILILPGFGIISHIVTYYSGK 
KEPFG YIGMV*AMIS IGFLGFIV*AHHI 

FS * LATLHGSNMK 


1557 


15458 


A 


1565 


396 


0 


II FLIFLRHGFAWAQAGMQWCGLGSLQ 
PLFPGFRLFS/ CLLSSWDYR*RQGFTML 
AKLVIjNS * PQAI \SAbPSQSPGlTDVSH 
CA 


1558 


15459 


A 


1566 


186 


452 


KQKCNFKTLNTKYQPLP FFFLERNFCFC 
PPGGGEGADFTFLEPLPSGAKGFF\CLT 
LQRM/ WE* RVSPPTPLNFGFLVKKGFSL 
CGSTGF 


1559 


15460 


A 


1567 


453 


37 


KTALYWKKTPSSTS IAREEKSVPGFKGQ 
AHFLFRGKHKFKLVSKLI YHFENP/R/A 
LKNYAKS I L/ PYKWNKAWMTAHLFS P\W 
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SEQID 
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beginning 

nucleotide 

location 

correspond 

ing to first 

o m in a a r t rl 

annuo akiu 

residue of 
peptide 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

ICMUUc \}K 

peptide 
sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Htstidine, f=Isoleucine, 
K=Lysine, L^Leucine, [^Methionine, 
N=Asparagine, P=Proline» 
v vriucaniine, iv rvrginine, o — oenne, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

Ar*UUIUIUWn } — OIU \3 Luuon, /— posoiuic 

nucleotide deletion, \=possible 
nucleotide insertion 














PIKYFRPTVEM*CSYNKILVLIDTGLDH 
PRPLIETYK\FINLHVCVLFNTTPIVQP 
LPR 








1 

1 JOO 






TftT .YWVTTTPCDTQ T 2M? WW VQ.^TDdWnmi. 
X Hjj l jx IV l roKloi >AxCtL H Xvo V irv»I? 

HFLFRGKHKFKLVSKLIBHFENP/R/AL 
KNYARS /TLAYKWNKAWMTAHLFSP\WF 
IKYFRPTVEM*CSYNKILVLIDTGLDHP 
RTLIETYK\FINIiHVGVLFNTTPILQPM 
dp an 


1561 


15462 


A 


1569 


1 


352 


FGTRRERERERERERERERERERGRGGA 
GHQI PS IEGQHKXVGAI IY\CREC\GEK 
ISGSTSYIKVWDIRDSAKCIRTLTSSGQ 
GISGDACAATFTRAITSAQGEH* INQIA 

J-iX\Jtrovjrl 


1562 


15463 


A 


1570 


394 


3 


TIiDWPQSRGGSTGKPVYPSVCCCCCCCC 
CFNFCQEAEKNNAEGLLHNQAGRT KDGS 
FAPSHDH*A/PRGTEV/DLLBSTI*QTSI 
KQVES KPR \EQARTGAGGQKEKATQNPE 


1563 


15464 


A 


1571 


399 


2 


KHQLPVFWQYNKKAWTTRTLFLDWLHCC 
FVFE VRKYLAS KGLP FKWL I LDNAPGH 
PPRTP* VQY\KGIEVIYLPPNTMSIjI* f 
LDQGVIRTYR/SHYTQYSMQRTISAMQE 


1564 


15465 


A 


1572 


37 


400 


RGT I TGKAASADQE IADKLS DAINKIME 
EKG\ Y* LPEQVFNLDESTLFWG\KKKPQ 
RTLLS KE KKRAPGFKTGKDRLTLLFC /A 
NAVKLI I RTALTYKAENPQ ALKENVKHQ 

T.DWHT.TTWfTr. 
ucr v r ^ jj x x i\\jxj 


1565 


15466 


A 


1573 


84 


485 


AGHKDS PRPHQTQE PS WLHLWDPAPGLQ 
VELPAS P \GRALALLS P WWDGTGRPGA 
GGGTRRGGSGPTGAHGAGGRLMHGGLQV 
PSPAPREGS *GPARYQAQRMWARTAGGP 


1566 


15467 


A 


1574 


3 


463 


TPAQGLRDPSNMRKHAYCGCCV/CITLC 
VGAQNKNRAVCGLYSTCPRLCVYEHEHI 
CVNE*VCEHVCERESVRVCKSTH/LPLC 
A*TCGPIFGCMSEKHVFSYTPCVHRVCV 
pvpvwtjGCpvc /vnvrvpvrvrvnvpv 

VVV«vniAJV^.VV>/ VUVV«Vv,VV*VV,V\,Vl < V 


1567 


15468 


A 


1575 


1 


383 


FLSFGFAPQAGGQGHNHG* GP P * P / PKA 
KGIFPPHPPEKREQRVHATPPGKFLDFF 
FFWKKGGLNl^PKKNIjGPGGKKNIjLVS p 
PKGGGKKKETPGPGGVFFWGGI FFFFP P 
PCS PG * PLS LLKKP KGG 


I J DO 




j± 


ID /O 




AAQ 


R T PRCHO PVGP LGCR EGA KPOG Pfl PDAA 
DSHPPASPRAH* /P*SLHRDPIPGFRGP 
CRRNAGAGPRAHTAGICAPOPNS * SRHH 
PWAAAPAGDDSPTTSLGSHVPVGRKSSD 
GCRQRALCTPGSPAPSEAEVGGSPELRS 
TjRPAWA 


1569 


15470 


A 


1577 


473 


62 


SLEINPYISGQLIFNQHAKTNH*SMRKE 
\SF^NKWCLDN*ISTGRRMKIiDPYLKPH 
IKLNS K/LKDIiNI RDTTMINIjHDLGFGN 
GF*TMTTK*AIKEKIN/WDFIKI*KFCA 
SNDI I KKVKRPNVTAYTCNPSTLEGPR 


1570 


15471 


A 


1578 


59 


426 


lerrsfgaplspffapqfmkkgfgsp* 
kfffspkalnpgggwpffpppkkrffs 
knpqegf i ppplkkkkktsqpp/ ykfgp 
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pkesfkrappffffkggpffplvsnqwt 
GKPRJjP p pppketqnpss fq iggqkkgk 
GGPQSFS FPIKKKKKKGGHSRSRTSPRV 


1571 


15472 


A 


1579 


2 


419 


KANKTKNAYFEGINKMDKPQVILKKKY/ 
REKT* IPSIGBEKGIMSP\NSEDTKRII 
KECrKQTy/VHRFYSIiKE/PUJlIiLESHKL 
PKLTQ/ EETDSLNS PV 


1572 


15473 


A 


1580 


129 


403 


YMFFIPINCQDHPK*KKKKKKKKKKRGG 
/RPFKKTLRGPKLNRAEKNKFFF*KGS I 
KKKOjEILKKKLFFGGEKNCKNPPKKKK 
PSREKKKF j 


1573 


15474 


A 


1581 


317 


76 


PRFFFFFPPPKKGFFSPPFFFFSPRFFP 
PPPFLKPPPRFFFFGPFKKNFF\PPPRP 
LIFFFF*APPPFFFFFFFFFFFFV 


1574 


15475 


A 


1582 


259 


377 


PREMKTYLPTKKLGYEISHYH* WWGCKM 
VQPVWKTVWQFLKGLNIKLP*DSAVPLV 
GM * PREMKTYLPTKKLGYE /MFTLSLLI 
IANK* KQPKCPRMWKW 


1575 


15476 


A 


1583 


1 


415 


PTRPITSSICLRQSYLKALIAYSSISHI 
ALGV\TAILNPTP*SFTGAGILIIA\HG 
LTCSLLFCLANSNYERTRRRI I ILSQGH 
QTLLPLIAF* *LLARLANIiALPPTINLL 
GELSELRTTFS*SNITLLLTGLNTLGT 


1576 


15477 


A 


1584 


216 


406 


LNLLPLVLGGSSCLPPPCGGN*KPPPPP 
G\LFLEKKGFSPCX3PAGF* PPALRGPPP 
PPLPRGIiI 


1577 


15478 


A 


1585 


383 


3 


KKFGYPFYWIG* KILK* FPG* K* SLPHR 
KSPFFFFF*GRVLLCPPGWRGTTKGHGS 
QVT\LPAAAMTFQV\K*SS PIiRLPSR*G 
YRQAS P WS GNFF F / CLVESLSMLPGLIL 
NYWAIAIKPSGPPKVLG 


1578 


15479 


A 


1586 


126 


413 


NPTLKK* KMKENRMKKNEQSLRDL+DTI 
KLTNRCII/3IPKEERKKGAEGIFGEIMV 
GNSSNLIR\ENINLNIEVAQ*TLSRIN* 
KRATLRHVIMKM 


1579 


15480 


A 


1587 


242 


409 


GWM I FRFNFFFFLRGS FTLVTQAGG * GG 
DFG/SLRPPPPGLKRFSCLTLPRSWDYR 
H 


1580 


15481 


A 


1588 


2 


338 


E IE KKGKGKKRRG * RSNKKKRGGRLEGQ 
BCY*SPPCPRLSFFIiVDEKGPEVLQDSLG 
GWWKTP /SGCPBMTDSSQP Y/ YRAFYVL 
KN/QRVGFSVDVGE IEKDQDVEKNQDPS 

/""•DOT 


1581 


15482 


A 


1589 


360 


0 


NNFLEQLKFPPKKKITTDSYKAPRPFFF 
FPPSQKGVFPPTLFFGFPPGFPPPPFLN 
PPPGFF/CFWAPLKKFFFPYPGG*TWVS 
LKGPP/L/RFFFFFF*DGVSLiCHPGWSS 


1582 


15483 


A 


1590 


75 


412 


VEGQHCNFCAAQETINRVKRQHTELLET 
FANWS PDKGLIPRTYKELKHLNRKKHS Y 
* KWADDLDRHFSKETYTPPKTYSCTQYC 
*P/ S I T I EN* KuKTSNT KTRLQGGHKRL 
ER 


1583 


15484 


A 


1591 


309 


1 


FSTQGAHMQVCYMDTFHDAE I WTS I EP V 
TQIVNMLPNR* FFtfPCPHFFPPPFCSP / 
LVSFLLLLRDRVSLCCPGQSRSVGFK*S 
TOjGLPKRWDYKCEPPHVA 


1584 


15485 


A 


1592 


2 


415 


LSISLS I FSFLP \ FS FDLPSLSLS LFLS 
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nucleotide insertion 














FFLFIiSFFPPS FFFFFHRASLCHIjGWSA 
MA*SWLTAVSTSQ\VKQSSHL 


1585 


15486 


A 


1593 


3 


396 


HSAFLFFL*DRVSLCHLSWKAVAQSQLT 
ATSTLL/VLKQSSHLNLP 


1586 


15487 


A 


1594 


300 


62 


KMTGVIiKSSCGKS PKQVGWLCVCVCVF/ 
SFKRQGIjAS/VAQTGMQWGDHS SLQPPA 
PGIiR*SSCISLISSWDYRRVPRRAPG 


1587 


15488 


A 


1595 


1 


412 


FDRPAADQKAASALKASGVQAQMAKGTY 
HDWS LQDWKVLWMTH* VS \ QQEQDPTNL 
YISNLPLCMDBLQLENMLKPFGQVISTR 
I LRD YSGTFRGDGFARMES TDKCDAVI E 
HYNVLL I MTP PGVS APTB PLLCKFAE 


1588 


15489 


A 


1596 


428 


2 


QSESRE P ENFLLPTSTSS LLLVHCLSRT 
CWNSTLSCVPHFFHLTFNSHLLLTSQP 
RRVHFC* LSLSS 1 1 S WKLLNTLPGVP IR 
\ASEIFGLRTIRNFPFLSHSVLFPFSLS 
ESHSVTQQGVQWHELGSLQPLPHGCQWL 
SCL 


1589 


15490 


A 


1597 


2 


442 


QGSIi* PHPPMLK* SLTSAF \NYRHVSPH 
LANYFLFFF * KQGLAVLLGLALN* AQ / C 
HLPAIASQSAGITGVSHGAWLFFFFLNQ 
CYLI*FLILIF\ERRNSPVAHVLVNGGD 
LGLLKPPPPGLMGFSCLNLLRSGKYKPP 
ATSPGYIFC 


1590 


15491 


A 


1598 


419 


121 


NLGYPRVSPPPPFLNPPPBFYEGPPKKK 
FY/PPPPPAQKIDPP*TPPPFFFFGTDG 
SHYVAQAGLELIiASSDLPASALQSTGIT 
KHEPPHLAETI FLVFL 


1591 


15492 


A 


1599 


45 


397 


DRVSLCYPGWSAW\*S*LTAASNSW\V 
K*SACFSLLSRGDYMCMTLHIANIKKNF 
LGRVRGSHL*SYHLKLCLKK*RTKEGFL 
S FFWXGKRHFLGQKNI IxNPRLKVFFLPG 
PPKGLGY 


1592 


15493 


A 


1600 


3 


397 


SRRPGRFSUVILTLSWHS*VCRAALAARE 
EQWSGCFKSHF*LEVNFWVSGIDREV/C 
GY/LKTVLGEDLNDYVSTQ I+D * LMKPR 
CPEKQDESLLKEFGGGA*RLNWHRPGA 
VAHACNPNALGAKGRRIPRSG 


1593 


15494 


A 


1601 


244 


2 


KTKTSFHSLIDSCGYLLSYSNFQI*KKM 
I FK/ YLHLRAVPRHVI VRFTNLERQE KV 
LRAAREKG*VMHKGKPIRLTADLSA 


1594 


15495 


A 


1602 


435 


2 


PQEAIYTPTQHPTYGAICRIARIHGSRD 

Q/ svemkwhpqltitpsgp/ LGKSLLPI 

P VALCFADLEVF I P KGGMLS PGVTI I S L 
NWKLRLPWVILQ*RITPLARVTPPDYQR 
EIRLIJjNNERKKSYVWNTRDPLGHLVVL 
PCPMVIK 


1595 


15496 


A 


1603 


288 


8 


EFPLIREMQIKPTMRYRLTPV*MD\R*W 
*GCGKRGTLVHCWW*CKLVQSVCERSSK 
NITIELPCDPAVLILGMHTKERKLRPCG 
WTRGVGPGYC 


1596 


15497 


A 


1604 


411 


236 


LIPS EHFFFSG * F* AFDRSSLAPTPQLR 
RHRPPTGLLPLSLLQGP\LLTPS VLLS * 
GGRFKGPNFTPPGGQGKIFFMGPPKSNP 
GPGV 


1597 


15498 


A 


1605 


2 


437 


KCLRISPCAGPRRPWCPSFBPRVCVWPL 
GVPQVG/ PEGP *GEG \GFEGGDVRQLWQ 
GKKKKKKNTDGKRGPWFPKEKRGFSLTP 
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WFGPQGLWEPCLNPGGPLFLIGGFQKGP 
KVLIGKGKTPGPKKKNNRPRKRGSLGPP 
GKDDFPL 


1598 


15499 


A 


1606 


31 


467 


EFPGRRPRGGGCSELRLRVCTPAWTTEG 
DSISNNI *LVNEN/HTVNSWKCSDS 


1599 


15500 


A 


1607 


418 


32 


VGQVGLELLTSSDPPASAYPK*WDYKR\ 
DPRAQPNPNILKAQSNRSP PRQR* PAS S 
FSNQNS CMHRCI S LP * VLFL I FFKNRP * 
NNSFGYNSSGWWSGLNILNSC* FWFLFG 
NSDTEIRICRSFHRGREV 


1600 


15501 


A 


1608 


379 


3 


FPILSPSYNPSSTELP***SLQSPLFBP 
YDFFSVNMFSFLFFNLKSHTVAQAGVQW 
RNHGSLQPRSFQLK\HPPASASAGTTGM 
(3fflAW/LIYLCVYLFLEMRSHCVSQD*V 
QWHNHSSLQPETPTK 


1601 


15502 


A 


1609 


456 


99 


PFPTPFPPPHSNWGLGLGCX3L/EPTMTP 
SGLGLKGSPS*SPPAHRLSGAQLCPVLR 
APVLGPSQMPPGRKKKPTARGS PWRKGV 
FVMSGQSDPLGPSSCQELGPRQSTQGTF 
PGA 


1602 


15503 


A 


1610 


1 


420 


FRFSDGAAGQKCSS PPRPGRGRAEVLLT 
SQTGRQGRGAPHISDNGQPGRDAP\PS * 
M*WRPGRGAP\PS*VGWRPGGDAPHF\Q 
TGQPGRGAPHIPDDGRPGRDAPHFPDGV 
GAGQRLQTRHFGRPTAAAWKVKWTSLR 


1603 


15504 


A 


1611 


426 


3 


KNPFLLEAKVSFNPKWPPALPPGEQRDS 
VSQKKKKOTiLIHKKAHSKH*FFICRE\C 
ESALLLHQNIHAGGKS YVCNK *GRGFRN 
KSHFTYQRTHSGKKAFL*KKCG*DFIi*K 
AILTAYQKTHSGXKSFVCKECR*DFTQK 
TK 


1 C f\A 

loQ4 


15505 


A 


1612 


428 


273 


HHA* LI FKIF/CVETRVSLLCPGWS *TP 
ELK*S3LLGCPKCWDYRREPPIRPTT 


1605 


15506 


A 


1613 


311 


4 


ANKFKNLNEIKFPEAHNLPKFTQEGGLN 
NPVSLY* KN*TYSLRLFP\KKKKSGPDD 
FTGE FN* TS KEEVP SLHKLLQKI * ERNT 
LiPWiiF HKAKVTQ V S KSEM 


1606 


15507 


A 


1614 


338 


88 


PNPPPPSKGK\GFPPPTPGRKKKTRPPP 
PPGKFLGF * KKRGFPPFERGGP\NPAPG 
GPPPPNPPKGGGTKGEPPSPEGGVFFFF 


1607 


15508 


A 


1615 


2 


162 


KHGGTC^YSLLLGRLRHESCLNLGGGGC 

KJS/ lrJB/oWCllrAWAtJl w USVoriKJQCxij 


1608 


15509 


A 


1616 | 


3 


399 


PEVREYLTSRGLPFKVLLILDNAHGHSE 
PQRFNTEGINVFYLLPNPRSLIQTLDQG 
VTRTFKSHYTWYSTERIANAMEENPDRT 
S *KSRIMTPL/IDAIVMTEKAMEAIMPK 
T X I S CWKKLCPD WHDFTRFT 


1609 


15510 


A 


1617 


390 


1 


KRNCFGPFLTPPPGQRLGGFKFLK\HFF 
YYRGERGGFFFSPFNRGFPPPPPFWGFF 
LGALKF*RGVPPSKPPPPGPGKKFKLKP 
FPPRKSYGGCFKCFLGGPFPFEGPPPQK 
KKKKNFTAARDLE PNAW 


1610 


15511 


A 


lOlo 






MKLVNIWLLLLVALLi*GKKHLGDRLEKK 
S FEKAPCPGCSHLTLKVE FSSTVAEYEY 
IVAFNGYFTAKARNSFISRALKSSEVHN 
WRIIPRSNPSSDYPT*WP/VALKKKKKK 
AG V/ LP LENYS 1 1 TRV 


1611 


15512 


A 


1619 


421 


2 


SSRLSLPKCWDYRREPPRPAQPRILKKI 
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S**EPFFLIIiKCI*IiRTQGIiGRGHKGKL 
NSTYVLHKFMD* *LGQLMNGLNMLDESS 
FCNVLKHTKQ\WRDYYFVCLFVCFVRQS 
L/NPVSQAGVQRRDHGSLRAPPPGFAPF 

s 


1612 


15513 


A 


1620 


406 


2 


GTKKAS SNGF INGPGAAKKKS QTFAFLA 
KPGAFPSILLDEKNRPQIKPPACFP/SI 
RESQTSYPVFFFWSPSCGFFFFWGKPEP 
LLLKEAGKVPLFLFLFGFL * DGVSLCQP 
GWNAVLRSQLTAE SNPHAS AHAS 


1613 


15514 




1621 


41 1 




PPPPKKKPSGPPPP /PSSSSSPSPPRKF 
LGGPRVFFPPP FFKPPP PKNFWGPPKKK 
KFPPPPGGKKFFF*RAPPP 


1614 


15515 


A 


1622 


2 


403 


TARCGLNFPCSSSLPFIAS * VAGTTGTH 
HHAQLILIiIFCXJDELSLM\CPGWS/PNS 
SHLGLPKCWDYR 


1615 


iJJIU 


A 






411 
*ti i 


LIINVCWPGLVAHACNPSTLGG*GGRI / 
TMRSGVRDQP 


1616 


15517 


A 


1624 


263 


2 


DSVSKKKKNFKE*LIPVLUCLFQNIEEE 
GILPNSLYKAGV\liIPKPDKDT*RKEIY 
RPISLINIDAKI VSKIIiAtnCIQ * FIKKI 
TDAW 


1617 


15518 


A 


1625 


3 


281 


PFSCLSLPSCWDYRRPPPRPANFF/VYF 
YKKNTRTQOTTVKHGFTVLTRLVLIS * P 
CDPPSLASQSAGITGVSHRTQPHTVFFL 
NNPALPKLQT 


1618 


15519 


A 


1626 


300 


20 


NPGPRGFPPPGPPKRIiDFRGGAPRPGF* 
YFLKNFIiGFFWFQKNFLVF FLGM KT PFP 
PQFFFFF /RDRMSLCHPGWSEVAQAWLK 
AALTSQTPAI 


1619 


15520 


A 


1627 


394 


40 


PQPAAASLFSPGLFFFFPPVFSPPPFFK 
TPPRIFFFWPP*KNFFF\PPPAFFFFFF 
LGAPPFFFFFFFFFFFFFF*DRVSLCRP 
GWSAVAQS *LTAALNSQT 


1620 


15521 


A 


1628 


386 


3 


I FPTRCTHLHLGTNKLPTCS FFDQAKKN 
PFCS\HHSRGVGLRARLFCERLTIEGAG 
TPACPAP*FPGEPTRP/PGRVRWIiTPAI 
PALWEAETGS * YVARTGLELLVSSNPPL 
SASQSARI TGVSHRTWP 


1621 


15522 


A 


1629 


401 




ARGVLPLNPPPWGGRGGGS P *GKNSKP P 
RPRGENPPPFLKPQKLPPPGGGPPPPLF 
LGG*SKKTPPPPKGGAPINQKPPPGLPP 
PGKKGAPFPKKKKKKDKNIRTKKKARRS 
GSIiLQSQHPGRPR/RGGPPLTKNPPIiAS 
PPQEKRGPPFQKKKKKKTKI 


1622 


15523 


A 


1630 


417 


47 


PPPGTISSPNP/QKNLKKGPGPGGNPRN 
PPPLGGQRG/RGLWAKKSRP PGPPRGNP 
PLFKKKKINGGGGPPPWPPPRGARAGK 
SLYPGGGPPQ*PQMGPPPPPPGAKKGFF 
PKKKKKKPKKPRKT 


1623 


15524 


A 


1631 


416 


54 


E YWCKGR* IDQWNRM * FKMDQHLHSQFI 
FN*ATKAIQWGKESLFNRLCLKNWLTIR 
D K\ I YLD ACLTT Y \ * KINS S WHSGAFL * 
SQILERLRQEASLSPGI*VQPRQHSKTP 
SLKKFFFKFFS 


1624 


15525 


A 


1632 


2 


373 


LVFLDNMLKIiLRHSALASACFPEDIAKI 
MDEGGYTKGQIFKVHGTAFCWKKMPSRT 
FWRBQSVPGFK/ATD*IiLLGANAAGN\ 
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nucleotide insertion 














KLKPMPIYHSENPRVFKNYVKPILPALY 
TWNSKA/ QMAAPLF 


1625 


15526 


A 


1633 


3 


383 


NIWN+KAWMTVHL/FIT*FTEYFKSTVE 
NC * EKKKILFK I /LLL IDNTFGHPKALM 
EIYKEINVI FMPANTIS PMDQGVIA 
TLNS YYLRNTFHKT IAFINYDS S DRCQQ 
SQLKNFWKGFS ILDAIR 


1626 


15527 


A 


1634 


2 


182 


GACTQIjIGRLRQENRLN/SGDRGCSEPI 
LHLCTPPWATE*DPVSKKKKKNPCPKLK 
KGPPP 


1627 


15528 


A 


1635 


331 


1 


LFPPPPVLKSGPGPN*N*PPLKGKSFGP 
LKKKFFFFFF \ FRD WVLLYCLGWSQTSG 
LKRSSCWDYRCEPPHLTPNF/SYF/CRD 
SVSL * PRLEGSGS I IALCSLKLLDSSS P 
H 


1628 


15529 


A 


1636 


80 


381 


KLSKCHAHHSRGFYKYSLFQLGASQFPQ 
VLRITHPHKGSWAAPRS*G*SQC/SHFF 
SFFFETKSHFWQAGGYGRNFT* LQPPP 
PGLKRFSHLSLPSSWDYS 




15530 


A 


1637 


381 


38 


SKRQGFHHVGQAGLELLSSSDLPTMALH 
PTCPLQKCWDYRC/DATAPNLSSTFFMQ 
EKCUVFL\PLFSSAPHSL* *PKAVIEKD 
CPGL* IW* *VARSQRMNVFNFIK 


1630 


15531 


A 


1638 


3 


295 


PGPDGFTAEFNQTFKE*LIPILVKLF*K 
IQEVR\FFKFS itltpksnkd 


1631 


15532 


A 


1639 


48 


380 


ILGKAI S FTI E * KGLKYLGI YLTKEAKG 
LHTEtrYKMLLKELKKDTNIWKGIIiCQ/W 
TRRLNMVKISV 


1632 


15533 


A 


1640 


343 


23 


SWLTAT/FCFLGSSDSPASAGVELLTSG 
DLPALAS *TAGITGVSHRTRPALSINTS 
TLLCSSPYCPPHLQSIjQGTTQPVPFLKA 
QRTGCLLQGVILTSQAENFCNKH 


1633 


15534 


A 


1641 


3 


397 


LELFSSAHCCPSLTVMQYYPP\RPTSHD 
CQRE KS PHRTKKKKPLEGVFLG * KI KTN 
LENPPPLPFFGGGPPPKGGGPFKIV*GG 
WPWPWGLKIPTLPKGPTPRPPWGPLGTF 
GGEGKPU?AFPLQRFFRGPE 


1 &\A 


15535 




1642 


2 


308 


NKWR*GNWISVKIDNFNFYLMPYTNLS* 
IRDDLNAKATT I KLVGENIGENLGIGKN 
F*ERTLKA\LRGKKMDKLDFITIGNFCF 
SKDRIKNKNKARRGGSRL 


1635 


15536 


A 


1643 


16 


386 


EKKKLSIiFTENGIPYL*NPKESAKRLLS 
LINDFSKV*GYKNN\DEKSVAFLNTNK 


1636 


15537 


A 


1644 


533 


3 


PLSLSLSSLFLSFFL*DRACFVAQAGVQ 
WIiDIiGSLQPPPPGI K* FS/CGGNVAVTP 
RLSPLTLPAMTEVRLPSSKIQTNKEKNT 
VMSBIYQSLM*MGWG/M*VRMCVKFKKP 
WVDN* * VSVMGCLLPSFLPSFLSFLFLS 
FSFljSFFFzFrBTfiSCFvAEAGAQWLDIj 
G*LQPPPPAVP 


1637 


15538 


A 


1645 


341 


19 


GIGGRPP * FQLLGRIjKQEK/ HLERGKGF 
NEPKSRPCISAWATKGDCFQKKKKIHMT 
LOO IO P KOS W KTD I SOFRTD YTCTTT T TfT 
VGHGIWIDIFINGIELSENLNQCS 


1638 


15539 


A 


1646 


279 


3 


TFYHNEKDNNKCWLTI \GKI *MLTGCYW 
ECKMVQLPWKTAWQFL/R/DVNI *LPYD 
LAITLLS I YTRKRKT YVYTKTCIQMFLA 
VLFTTAKRWKQP 
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SEQID 
NO: of 
nucleotide 
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SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

nm inn Arid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

9CU U vUvC 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGlycine, H=Histidine, I=Iso leucine, 
K=Lysine, Leu cine, ^Methionine, 
N=Asparagine, P=Proline, 

0=0111 f a min a R=A rpinin** ^=55i*rtrif» 

T=Threonine, V=Valine, 
\V=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 

uuviwiiuc luavl uvii 


1639 


15540 


A 


1647 


392 


1 


QKLEIMLREEGMLKAKTGQKLGLLCX3AV 
THLVNAKEKFLKLNVLLCX3THKW*ESET 
AKSLIIiFNSMKAEKGE *AAEEKFAASRG 
WFMyRFKERNCLHHLKVDGBAASAWEA 
VANYIEDLVKKIDKGGCIN 


1640 


15541 


A 


1648 


425 


148 


SQSLI*SKALTLCNSMKT/ERAEBVAGK 
KLEASNF\ LKFKKRSCVXrNVNMOGKVA 
SLDGBAAASSPEDLVNFIDEGGYTKQQI 
FN*DKTSFFF 


1641 


15542 


A 


1649 


271 


462 


RRQKKRHKRCILSPLLFDIVLEVPARTI 
* OEKKI KGI OIGltKEV/KT 1 SLFADDW 
LYL 


1642 


15543 


A 


1650 


70 


398 


RPEASLRHMCLNAGQLIjSKRAKIXxAIjSL 
SFFFWKKSIAFAPQPGGQGGNLG* * KPP 
LPGLRGFSGLTLIiRN/WE*RWPVPPPT\ 
NFGPL I KTGFPLVGQAGFDLRTLGALR 


164? 


I J-J^f^f 










FKfi FP PTVfi FDT/3 PP FP <5 p VP P fivrPT". OD D 

PKRGPRCFPQPGAPPPKIWTPPGALPQG 
GVGPALPG/ALQKFGPKNPGGFFSGPPQ 
MAP\GGFPGGP*RPPRGGAPFFFFLRQG 
FPVAQARVHLPGSSDPPISAPQVAGTTD 
VC 


1644 


15545 


A 


1652 


385 


2 


KGNNPSPPEIKFFFFFF*KGI*IjPIiPQGG 
GQWGYFRSLQPPPSRLKLFSCPNLPSNW 
E YRGP * \ RLL*L I TGRGTSGS KTKVPT P 
CGPFNLNQLGLGTQARNFS /RLVFFFFE 
TESPFVAQAGIQLRDLHS 


1645 


15546 


A 


1653 


242 


3 


KNKNFGINRGFFFTFKVPGFFFFLGKVK 
IiF FFFRNIilCFFLKAKP P * \ VF F P T G P<5 F 
FFFFFFLDRVLLCCPGWNVWQSL 


1646 


-15547 


A 


1654 


3 


285 


HFIIYTKDLNRRFSKEDI * II IKHV/KK 
CS PSLAVREMQIKTTVRPGTVAGTCNPN 
TLGGOGRRI VODOLKOOS KTSKTiOKKTT. 
FRLARHGGTCP 


1647 


15548 


A 


1655 


1 


373 


KVSLFFFFEQGLI,CCPGWSAWSSIiQPQ 
CPRVKQFSHVS /LPSN*EYSCTPNTFSL 
QVCVSIHKYI*YIYIYIFKFF/CR/DRT 
IJUILHRLVSNAWSQAILPPWPPKVLGLQ 


1648 


15549 


A 


1656 


189 


2 


VQPGQQERNS ISKTKNKTKQKLPTKKSP 
GPD*FKEEIiIPILHKL/ F/HKIEDKGTL 
HNSFYWTI 


1649 


15550 


A 


1657 


385 


13 


GGPPPWGARSPPKL*NPPPPRKKPPGPP 
PPPGGAPPLGGFFLFFPPPGGPPPGGGK 
/SFSPRFFFFFFFLGGGGQIFLFSPGGG 
PRGVFS PPAP PLVPKPAF FS KKKKRGSS 
GGEKPEADGYFIK 


1650 


15551 


A 


1658 


352 


2 


HLSLLSNWDSIRAPP/RPCVIFB3SIVPLN 
I FFL*RGVTMLPRLVLNDPPI SASQVAR 
I IDVSHWAKLRRSV/CYVFETGSGSLSQ 
AGVQRYNHGSVOPOPSRVS*SSHHSLWK 
YRYPPPRR 


1651 


1i5S2 


A 




ZOJ 




HS GORDEGRMROGEWIiE S HCWVRARS rw 
TLKTSFLFTMAKI*NQLKCLFMDEWIKK 
MWHIHTME YYSAI KR/DEIPS FVATWME 
LEVTM 


1652 


15553 


A 


1660 


1 


163 


NQQNENRKT I EKI IGTKT * tiFKKI KKFD 
KPLARWT/ RGKKN IQITNIRN* RRDVI 


1653 


15554 


A 


1661 


56 


320 


KFFMYSAGESTKIRCLF/ SCLFLFIiR/Q 
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NO: of 
peptide 
sequence 


M 
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od 
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26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

■ VOJuUv III 

peptide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, {^Phenylalanine, 
G=Glycine, HNHistidine, I=lsoleucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 

0 == f>lii'fanriifiA R=Aroininf> Ca$Ar!n« 
>< vfiu miii me, rv — /^.i giiimcj o^oerinc, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=0 , yrosine, 

J\. KJ II KUUVVII, — OlUp CUUUll, / = pOSSlDie 

nucleotide deletion, \=possible 
nucleotide insertion 














ESRSVSQDGG*SBPRSCYCTPAWATE*D 
SVSKKQNKNKPKKNLNKCFASSLIiITLP 
PARHKA 


1654 


15555 


A 


1662 






fnpkkttsrdli i klpkikdkgs * kqke 
ktsi^arlpqphgllglglgksasspik 
kerkq\qitysgapiflvtdfsvetlqv 
rrewhdvfkvlkek/dfyprivylvkis 

F 


1655 


15556 


A 


1663 


362 


2 


VIFADEAQILKKEDKLDFIKMKTSVHQK 
TLSTE YKDNHREKI FVSQISDKELISRV 
YKGI&KLmTDKm*ILFYF*RLGHSVTQ 

GPVnWr*MTIQ QT UDfYP'Dr , T.V\'iaDTi'PC!7VG + 
v V n \-vta.& o JjJrUr y 1 JrcrLux \ JN P F I fc>Ao* 

AAGNTGVHL 


T656 


15557 


A 


1664 


79 


355 


IHLPSLIGDFNLFTGISIjLICLVLFVFI 
FETGSCSVAQ/ S/GVQWHNHGLLQPRPS 
GLRQSSHLS PLSSWNHRHGPG* FI YF/C 

V DltUUI lcLtr 


1657 


15558 


A 


1665 


47 


384 


KEKASGPLINFFFFPKLCKLAPFFFLPF 
FLWGGGGGKFSRNPQKHFPR* KRVFVNF 
FFFFFFLRHCLTVSLSHCHPGWSAVA*S 
QLTAGSNFW\VKQSSHLS/LPSSWDHRH 
APP 


1658 


15559 


A 


1666 


163 


601 


IFCKGGVLPCCPGLADLHFSTSNSISFY 
xoovjijjjta»jj»jNl\.lii j Jrinol JLJr J\JL»VC»VAWR 
FGNSECIFQELPLTLHHLLSTMLASFIH 
SHEASANALVGRSLTVGWGCRGVGVSVD 
PAAWLWRDLKGCF*DKSHSVTQSGGQWC 
NLSSLQP+APRLKRSCILSLPRSWDHWH 
VPPFLANF* I FCKGGVLP\ FAQGW 


1659 


15560 






*»lo 


i 
J 


£> v (_ J-Aj-LPKL. WDTGRKPLCPAPS FFY/ EG 
SITLIPKSEMHLPRNENYRSGFLLN/M/ 
DAKILNRIIJ\NCISN*I*]n14*\*KVKFT 
PGKKDWFNNRKPTDI IYPH*QNREEKSL 
VSSTDMAKVFNKIQPVLRELTIBIKGNF 
LNjj ( 


1660 


15561 


A 


1668 


411 


1 


IiRLHVGRTTT \ LFKA VSQGHLSLQR FlZT 
PSVEICPAP"Rrjn\7YPnpnaQT.Qrv2r«T itd 

r »_» v OXuriuriUjOv J IvVjlvvii-iO i-iO l A-H.-T 1 in rt 

VRASQLLWFLTQASAMAGPPPPVLLPPC 
SLI * DCCANNKGGF IGVGP FE P CVGYNL 
LVSHLLRPSEKPSIRVGVT*FSRC 


T661 


15562 


A 


1669 


151 


1 


PLEKEAEITGP*PHAWLIF/CFF*TESR 
YIAQARMKWHNLGSLOPTPPGFK 


1662 


15563 


A 


1670 


1 


389 


TFF*KLKMINLSEEGMLKAKISQKLGLC 

\tvsqwnskekflkeiksatpbst*mi 

RK*NSSVANTENV* ERSRTSHNILFS * S 
SIQNKDLTFFWSLKAWRGQEVAEEKSEA 
SSGWFMRFKERSHjjHTTK" 


T663 


15564 


A 


1671 


1 


363 


ECTGPKIAKI IIiEKKNNVGGLPI / PNFK 
I * YKAPVI * FWLKVPVI PSSSAILMKTV 
* Y\ yfkdrnodewypTjPVwktn*?h t van 
LI\FSKGTKTrQW*KSLFj^*CWNNWLF 
TCKRMKLD 


1664 


15565 


A 


1672 


203 


2 


ALNKRTDMPS * * IRRCNI I / KCLFSPKM 
N*VFNVIT/ IQCPSGHFFTETDKSILKF 
I WKS K * PRIiAKRTL 


1665 


15566 


A 


1673 


15 


378 


OTIilWQNNEHMYHSLPNFFFFFFGKGAP 
PG\ PKGGGRG* REP * I PGGKGNPPL* PP 
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sequence 


M 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Ala nine 
OCysteioe, D=Aspartic Acid, 
E=GIutaraic Acid, ^Phenylalanine, 
G=Glycine, H=Histidioe, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














KEGGPPGGTPPPGKPGKIWDFWEKGGFP 
LSPGGAGTPGPKGDGPPWPPKGGGKPGG 
TKQPGPKKPF 


1666 


15567 


A 


1674 


2 


349 


LFLTVLARLVSNS * PQ/CDPPALASQSA 
EITGMSHRAQPRGIR* I *CGS YTTCHPL 

LLLT\ yi*fi yfsteshsvt qagvqwhdl 

GSLQSQLRRLT/CILTLPSSWD*PHVTL 
PG*FCLF 


1667 


15568 


A 


1675 


348 


78 


lgpwvppvipaplgg*ggrsprpgfltn 
pgpkgkprvflkiqnlpgv/rgrpplfp 
/gspgsggpksp*plgpsfplt*nfspp 

FWGPPGGPK 


1668 


15569 


A 


1676 


2 


385 


TS RRDYRP \GHHNQLI F * / S FCRDGASL 
CCPCWSQTPGI KKSSCLVFPRCWDYRCE 
PGL* I *MGKNPT\LFSNGL* CDCIPLIH 
SIADIRKKPHS *LQGL*LCHQQNSQTES 
CSVTQAGVQ* CDLGSLQP 


1669 


15570 


A 


1677 


386 


1 


KSTRP VLYKWNNKA*MTEHLFTA* FTEY 
FKATIETFCSEK* IPLKILLLICNVPSH 
PRALMGMYKBINWSMPTDITCILQPMD 
QGVTSTFKSYYLRNTFDKAIAT/DSDSS 
DGSGKNLLKTFWKGFTI 


1670 


15571 


A 


1678 


2 


193 


EGGRI FFNSFFEVI LTLI / PKPKKWER 
K*SYHP IS IMNGDVKILAQTLLNQIQQY 
LKRIIHYDS 


1671 


15572 


A 


1679 


561 


830 


TLLLGT \NAWDFKLKPMLTY\HS \ENS 
RGL * KS WINLGLTVFYKWGTNNAWGDDR 
HHIiVYRHGFTGI F* GSQLKTYCSENIPF 
KILLFIDN 


1672 


15573 


A 


1680 


415 


2 


TSCAWLSLYPVYLYRSSSLPRFIFCHFK 
HLWYKNITKKAAEINTNFCCYINRVLLC 
HLGWS/ATWS2*LTVTSKLIiGSRDPHT 
LSLPSS*EGRCITPRLGDLLNLFSRDGG 
GIRGTSTLPRQVLNAWPQAILLVRIT 


1673 


15574 


A 


1681 


1 


78 


RPRIRHEVGQAGLKLLTSGQTPASVP * C 
WDYRPEPPCPALHISYK*NHAMCGLKCIi 
AVSA*RHVLGFIRG\WHVECCFPFCS*A 
GLKLLTSGQTPASVP 


1674 


15575 


A 


1682 


414 


162 


GGPGGPIPGA/AGIiRPPPPPLGNPPPP* 
KAKICPGGGAPPVFPGS*KGGGESP*PP 
RGRGPFIGVQPIiPFGIiGHKRGLFPKKKK 
KI 


1675 


15576 


A 


1683 


378 


129 


QPFGPRNFYQIFNFPG/PPV*HSPLFGP 
KFPPFSPGGGQWGPIiGNPRPPGAKGGST 
LRGPRTGGSRGGPPGPGKFFFFLKQSLA 
L 


1676 


15577 


A 


1684 


3 


374 


GI SATLPG IGAPGNKPELFKEVKLYNNAR 
EREKYDNMAEL F /AWKTMQALE KAY I K 
DCVSPSEYTAACSRLLVQYKAVFRRVQG 
SEISSIDEFCRKFRI»DCPIiAMERIKEDR 
PITIKDD*GNSLS 


1677 


15578 


A 


1685 


2 


373 


PFIRPET I KLLEE I PGGKLLDLGLGNDL 
LART PNABdNTWDH IKLKS FCT I KET IN 
TMKKQPTDCEK1 FSRUISDKGIM/SQMC 
KELVQLN* KK/TDDSVKGWAEDLDRHFS 
KKDIKMANRPGKVL 


1678 


15579 


A 


1686 


378 


201 


HATCLANF\CS YG * DRVS PFCPGWS * TP 
ELKRSTRLGLPKCWDYKCLGRSLLFPGA 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
^Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
m — j iireunine, v— valine, 
W=Tryptophan, Y^yrosine, 
X=Un known, *=Stop codon, /=possi ble 
nucleotide deletion, \=possible 
nucleotide insertion 














c run 


1679 


15580 


A 


1687 


1 


404 


MCVNPGGGACSEPRSCHCTPAWVTERDS 
V* KTNKKKVKRRKKI FHANS / KLKRVGM 
AVL I SDKI YYKLKKVRRDKE Y I 1 KGSTY 

nun tot tmtvpt Mipi i\ nvvutnvi'T mat v 
EEIDI YAUVGDFNTLLIiI ICRGT 


1680 


15581 


A 


1688 


307 


33 


DEGSCHDAHAGLKLIiDSSGLPASASQSA 
GIlwVSHGARPIilSSYNGTSHGGI^PAL 
VTSFNI*SHIjFKDLLSLQI/HVTF*GMGL 
GFAGAKLSI* 


mi 


15582 


A 


1689 


11 


394 


IFILEARTRISSRTFFKS* INIHAPYNS 
AI *L IGI FPRBKK/STCP YI YTQMFIAS 
Jjt V 1 AQ1 RKQPKCPSTGE \ WSKNLWN 


1682 


15583 


A 


1690 


20 


391 


SEGKGWYSCTKWQSMKLGGITFFFFFPP 
PKNPPPKKSGP/QKGPFF*GKGPPWPPK 
KRGHKN\RGFPKQRARPPKPVFFLIFGK 
KVjr IjIjGPKJoG* JN PGEKRNP PPWP* KGGK 
NPGNP KGGPHLTL 


1683 


15584 


A 


1691 


72 


392 


IKMIGSIiFFGFAFFFFFGKKTPFFPQPK 
R\GGKP* IT*TPPPGN*RNSGPSPPQKV 
GIKAPPPLPK\NF*FFGKNGVTPFPPGG 
FEPPTPKEPSPPVSPKGGKTNPAP 


1684 


15585 


A 


1692 


389 


161 


HGGACLRSQLLGRLRREDCLNQGG*GCS 
EPCTPAWVTE *D/SSQKNQKSKIKKSGL 
DNSFS XG*GXIiGIiSTCDRYS 


1685 


15586 


A 


1693 


286 


363 


DGISL* PRPPGLRQPSQPS \LLSNWGCR 
STPLCLAGFFVFVETGFLHVAQACTjRQG 
FTMVAQS / ASQKIHIT*GAFETIQVLFY 
WGGVGFGQWRFFFFFETESHSVTQAGVR 
6(Jb JLjSSJjQVPPPG 


1686 


15587 


A 


1694 


1 


356 


ELLEPRGRGCSERRPCHCPPVRVAEQDS 
VSKKKRERKYLYFLRFNWRSLRIFIYFC 
*/HPLQHNIQNISFTLQNSFGFFSRQYC 
P£> FXiE X I FDTPIjTENTjIiGIiFClKG I IQNW 
FFGVGLF 


1687 


15588 


A 


1695 


3 


298 


KY FETNENKN I QYQNIi \AVKLVFRENLT 
VVNACVKKEERFQVNNXALYPKN*EKSM 
I*NPKGKIIKVRSEKNDIE*KNDEENQ*N 


1688 


15589 


A 


1696 


3 


405 


RLWCGWRNRHLGS * NRVEN PETGLHRYA 
QLI FLTKVQKQVGEGQPFNK* CGGTWAP 
TGKT/MEQPVKASSSSSSSSSSSSSSSC 
KM*NIVFFKMGENLWDH*AKSYEVRTKA 
x x ivjiv v ivrujivr AjvAfxcir v*Xt?IvH 


1689 


15590 


A 


1697 


6 


392 


LQRTLLVGLFNAAGNLKLKPMLI CHSEN 
PRALKNYAKSTLPVFYKWIKIAWMTV* T 
V7AWLTEYFKPIFENYCS/EKKIPFKLF 
IJjTNNAPGHPTGLMEMYKTTNVGSLPAN 

JL AO VAjVrTIViyKv loir l\o 


1690 


15591 


A 


1698 


390 


3 


AI IBSDFLTTSREVAKKLRVHPFMVLWV 
LEPIGKVKKVNKWGP*KLNK/NKKNHCI 

J?\/Q Q QT.T?T.r*T\JTvTT7t>T?T TV'TUTl^'T^T? VT»7"TT v 

IS voooJur iJLJNINlSIrr LuAj± V 1 w/IsJvWXXiI 
HNW* *SAQWLNREVAPK\HFLQPNLHQK 
KVTVIVWWSAAGL IQNEH 


1691 


15592 


A 


1699 


1 


245 


GGGGE YS KI IAI KTALKNTT YLG IYLIK 
\ DVPDLYTKNYGTMLREIKYIiEK* RARP 
CS *TERFKIVKMSI /LPNLI YRFNTI 


1692 


15593 


A 


1700 


2 


324 


GTSGTSGTSGTSQTCRISRVRSTSSWTS 
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otide 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine T 
G=Glycine, ENHistidine, I-Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proiine, 
0 = Glutamine R— Areinine. ft=Sprine~ 

T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GRHLQCHVSKAESSWFVLSASSCSTCSP 
ISIiSGS/GSSIiSNSSKNLTLPCLPSSUi 
S *P*ANPVDSAFKI YPGLSSVGSCL 


1693 


15594 


A 


1701 


183 


372 


PGVLLCW PGWSAVTVHQCDHSAIiQ PRTP 
RLK*SP\STWDYRYTPLCPA 


1694 


15595 


A 


1702 


29 


382 


GKRFPCSWEPKFPLTQGGFPPLHPGQKK 
KPGSKKKKDKSKCW*G/ CREA/GTT/EHY 
RQELEMVQS FWK/TVWQVLRKLNVEIjP Y 
DPAFIiLGLHPTSTQKR\DTMFMA/AITL 
/ 1 SERWKQPRCTS 


1695 


15596 


A 


1703 


1 


382 


KKVKIIGEAAVEFPDTIKKIIEBKEYLP 
L*VY\NADESGLFWKKKLQRTFISKEEK 
\SMDRLTLIILCKCSWVYEQDGP*AIiKE 
KGEHQLPVF* L*NKKA.WTVRTLFIiD* FH 
QCFVPEVRKYVASKRL 


1696 


15597 


A 


1704 


2 


330 


KLNNLLLKNS * VNTE I KAEVSSSLLE IN 
EYEDTTYQNLWDAAKAVLKGKHVAPRHF 
LQEVKK/ RLKRFQINNLTLYLKEL/EKE 
HINLKASGRK*MTKIGDLPGLYFVLNG 


1697 


15598 


A 


1705 


100 


342 


APKWS I VCPELVGS WSH* / PSRMKPWTL 
TRQGFTMLARLVLNS *RRDLPALASQSP 
GITGMSHRTQPLL INLMEIFTEILS 


1698 


15599 


A 


1706 


600 


211 


SCSVARLEFSDVIKAHCHL/RTPGLKQS 
SHLSHLS S \ WD*GRVPHDLANF* I FCRD 
RVLPRHiQAGLEL\liASSDPPS * asekc 
WNYRHBPOTPRQNLGIjLKTTYGWVFLKK 
YIIiTVSVFSLMCLDHCLLM 


1699 


15600 


A 


1707 


409 


I 


RGPFFFSPGKAFLKLGSNSFFPIKKTRA 
PKKPIFSPVSP /LNFPPKTGFPPVFPPQ 
MGGFFF/ CFPSFLFFFLPPPPPFFFSFP 
PPFFFFFFFFFFFPIjFFF*DRVSLCHPG 
WSAVALSQLTAALTSMDSSNSPTCV 


1700 


15601 


A 


1708 


154 


2 


IGKPLAGLTK/RKRENT* INKIRNEKGD 
MTADNTEIQS IIRDFVS *RTAHQ 


1701 


15602 


A 


1709 


263 


37 


SAQLHPLNIQNHRQS ILLHDFFLKKQD / 
G/WPGAGAHACNPSTLGGQGGWITRSGD 
QDHPG*HISV 


1702 


15603 


A 


1710 


390 


42 


YAGGFRAI FFFPLPREGAKNPNFP VGPP 
PFGGPFFFFFAPSQKKEPGFFGKKGVF* 
GGAKGFPNARGPFFF/G* KKKKGPKNKT 
PGFFL/MGPPNPGGPPPRGEGGKIGAKK 
KKKSIRL 


1703 


15604 


A 


1711 


3 


167 


YTCVFVCLCLCDCMC/CVCACMYICVCV 
CTRVCM*VCVO«CVRVCVQALTVLCKSV 


1704 


15605 


A 


1712 


116 


391 


KRNFFFGPQGGGEGPKFN*RGPPPPGVK 
GIFPPSPPEG/GKKKGAPPPPGIIFWFF 
KKKEVPPCGPGGV*TPDPGGPPPPGPPK 
GGAPRQGPL 


1705 


15606 


A 


1713 


401 


47 


HHYATKPFTHAHTCTCIQDTCNTCMQHT 
QVHT/HTDTHTHTRKVSSVCVLIMAEQE 
RPCPHC*GGEAGAECGVCAWLGLSS*TW 
RNRKGPAHTVRWGQEQSAGCAHGLVFP 
RSYLWT 


1706 


15607 


A 


1714 


1 


400 


CVESCEVD IEMVSCCV/ CSG* SAVCSGT 
AS AHCSLP I PGSRDS PASACQVAGTTGM 
PH/LYPGVPLKPRBGLQFTELPSGQLEI 
QPTCENKR*HVPCAL*VQLTDIRPN*RY 
Q FRVAAVNVHGTRRFTAPS KHFCS 
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SEQ n> 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

tng to first 

H 1111 UV 21CIU 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

film a 

mug iu 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alantne 
OCysteine, D=Aspartic Acid, 
E=Giutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
V^uiutamine, k— Arginine, S=berine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


1707 


15603 


A 


i / 1 j 


Hi J 


j 


IjvlXr Ja riJK&lrv JL VJNoSEMII/liVBt^FVT 

SGWIYHERFLNPGREIDWATCYSETGPC 
PVTQDGVQWSNHGSL*PQTPRLK\HPTT 
SVFPVA/RQGL/NSVAQAGVQWHDIiGSI, 
^--tCUlroljJVOioWFKijt'2>oW« XRxAFPPC 

r 


1708 


15609 


A 


1716 


421 


144 


RLECSGGITAYCSLNLPGPSSSPASAS* 
VAETTGLLHWKKKI VETGS HF I VQTAFK 
FIiDSSDLPAIiAC/SWDYRSELLCPACFY 

KTBT'T.P TKTT DP 


1709 


15610 


A 


1717 


3 


384 


YSACVCVCVWVDVS VHRCMS ISGH/AJRI 
IRVGPAHKAQSSCRPVPAGCCTSAQTPP 
WCSAATADPPPR*GESLPGAYPGSHSTC 
CPGCCLDW*HSLPSTEPAACRAGAPGGG 
H*AWLGCGGRAGGRPG 


1710 


15611 


A 


1719 


3 


615 


PVGSWARSSGAGWPPGSPQSVSDGEAGH 
SIPAPRGQCSRHRAEGRTARVCLSHCSF 
SGPRPGLVP I R * SLGRPDAVQAIVPDS Q 
EGRKTG I HAEAVMF PLGPGKGVKGCAGG 
RLCPSSPPSRCLDLGRRGMPPSSGPAGP 
RPSGVGSDLGRPGAGAATSSSSSSSSSS 
oooboobSSSRDGEGP \TGS VEAPGSLL 
GPWLPSQPS 


1711 


15612 


A 


1720 


320 


3 


GLKWIiNLKDNSLYLILAKWDDCLDEK* 
YADKMLQYTKSVWVSRSRGCQGQVGEKK 
UA I KAAQEWELRK \RI* YWRKECDALRAA 
REEQKELRDVRKAKKVYVCVRV 


1712 


15613 


A 


1721 


44 


373 


KAMGQTLWKTVWQFIjTKEKIGPM*QSCP 
T/DLALVFLGICTIDLKAYIHTETCTQM 
1 JL JL 1 X1X1 X X AKXlRKJKAS CSS VGE / WNKKL 
YYTRTMESYSS1^*NELSSYKKHWGGGS 


1713 


15614 


A 


1722 


135 


396 


aqglfctsvklaseqplrilfqldkrnk 
\feiygtsg*l*siiicqnnlqskfqmy 

xttttv X Mbo LAjAVAHACNPSTIjGGQGGQ IT 
RSGD 


1714 


15615 


A 


1723 


4 


383 


MLRAKAIKIAEENIGIHLPDLGIjDDLF 
LDITPKSQATK/AKIGN*GFIKLKHFCA 
AKNI IKKMKRQYKEWKKIFGNHVSDKKL 

VT?RTVW*UT DT TT1TMOCC 


1715 


15616 


A 


1724 


2 


405 


NSRTSLILNQNLE / IKLSEKGTWKAKTG 

* KUjLLQQK\ iskianakekllkevksa 
TSMl)M*MimK*NNFIPEMQKVLVVW/I * 

NT /PTiPOQT.TnMTf 2VT. , PT.T?MCTVAt r Dr , T7l? 

A\KLEATKRWFMRFKESCLHNVKVQDEG 


1716 


15617 


A 


1725 


90 


400 


squirrlkqnsi^presssktttrtkvs 
hwhknrhvnq/ynkiensginlhiyg*i, 
trjkgdeasqyss /diiifnkwcwqkkkk 

V YTjnPVTiTPfTFrT CCTMTTCri?T 
\*uut jj J. mod l rji j.oor Xi 


1717 


15618 


A 


1726 


390 


1 


TFFPKI \EKKIL* FIWGPRRPKIANVFP 
* QNKPKIEGI PLPGFKI YYRALVTKTAW 
F*HKNPPIGQRNKVENSETNFHPPSELN 
FFFF\YKGAKNIHWGEDSiuFNKWCWENW 
IS I * RRMKLGPNLTP YTK 


1718 


15619 


A 


1727 


3 


365 


HAS AKSNIiRWI I KLNLRAKT I KLLE ENM 
GENLWDLELSRGFLDRT/PKLYS I KLKL 
RK/WNFI KI KNFCFSRDT*KS * MAGKNI 
LSGKDLYSEYLKK* CNL I IRQSKTS T+M 
FIAALFI IAKH 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etfa 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine ! 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla)anine, 
G=Glycine, ENHistidine, Msoteucine, 
K=Lysine, L= Leu cine, M=Metbionine, 
N^Asparagine, P=ProIine, 
O^GIutamine. R~Ar<*inine S=Serine 
T-Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


1719 


15620 


A 


1728 


368 


2 


RNCLNSEGGGCSEPRSRHCTPAWATEPD 
S VSKMK*NKKINK* IMFLY\EQQSETEY 
FKLPLTLA/SRNMKFLGIWLSKGVQDLE 
TENYKILLQEIEEDLNK*RITKrY*WIKL 
NTVKMKSILSKR 


1720 


15621 


A 


1729 


326 


30 


NPFGGPKKGGSQGREI KPPLPPMGKPLF 
F*KTKNKWVGGAPPVTPPS/SGVLSQKK 
TFTLEGGGFNKIiNSPPALP/ARGPKKNF 
FQKKKKKTKQNPSQREE 


1721 


15622 


A 


1730 


1 


374 


IFNADKIA\FWKKF*KPQGTSVGREEKQ 
APGFKAGRNRLTTI /L/GASAVGFMIRA 
ALICKAANPQVLKGRDKHQLPVFRLLYN 
KKAWTTRTLFLDCFHQCFVPEVRKYLAS 
KGLVFKVLLILDNGPC 


1722 


15623 


A 


1731 


389 


1 


FPPKIFFFSTLFFFFPRFFPPPPFL* P/ 
SPPIYFF/ CAPKKKNFFPPPPGKNFFFF 
KTPPPFFFFFFF*D*VCLCCPGWSAVAQ 
SWLTTTSVFRVPVT 


1723 


15624 


A 


1732 


118 


422 


DITTHLFKWIjKFKK:*EKGIjNILFTKEDM 
QMEKNLKRCLT*FWKELQIKMRYHYPP 
IQMAKI * KN/S TIS IAWQG YRTIGTLF / 

hc**eqpfw*flsklnmilpynpa\iml 
lsiypkalkkhvhtktcm*mfiaalfii 

TKNWKEPRCPS icew 


1724 


15625 


A 


1733 


407 


1 


nikgpfrgpliqwglliwpkdsfpilgy 
ppfpspkisffflarscsgapnhfplpn 
qspcfpqpsfflgeekkeflpgyslafp 
fn/rfggs*kvrkgnggpipmgbsflff 

F*DKVLLCCPGWSAVA*SRLTATC 


1725 


15626 


A 


1734 


322 


362 


TAIYIYI I * LFTDNI PSHPRTWIEIYKE 
INIFVPANTIPIVQHRNQGVICTFRKTI 
TVTDCD PANGS GQSKLKTWKGFTILDAV 
KN/IRDSWEKVKIG/TIN/GVRKLIPSL 
KNDFKRPKT 


1726 


15627 


A 


1735 


49 


395 


RGGPGFFFFFFFFSKKSQILPPGWRGRG 
EPRVNGTPPPRGKGNPPAQPPQEGGKTG 
PPHKPG* FLCFLKKKGVQKGG*GGPQ/A 
PGPKGAPRPGPPKGGEKREGPPGPTRPN 
LYYAH 


1727 


15628 


A 


1736 


417 


2 


FLFFFFFLFFFFSFASGPEILFTCL*HT 
HIHFLFFYSKST * * PPVFAGGMFQDPQW 
IiPETKMVPNFKKKRTTLT/YIP* * KXtCE 
CDLSNFFCFF*DRLLIiCRPGWSAVA*SR 
LTATSTFQAQANRTRG 


1728 


15629 


A 


1737 


316 


338 


FFFLSFYFETESYLHHP+GFIVKLSKVK 
DI EN\LKKTARGNY QVTYKGAS IRLAAD 
FSAEISQAWREWDNMFKVLKEKTNWQPR 
IYKTLFVPHF 


1729 


15630 


A 


1738 


197 


379 


QKRAQIDKAIFICRDIAL/P*FQ/MYYW 
ATVTKTAW* WYKNRHI EQWNRS W \ P PE I 
KSQSYSHL 


1730 


15631 


A 


1739 


4 


401 


RGYRHAPPSLANFCIFSKD/MGFTWLN" 
S * PQ/CNLPASTS * SAGITGISHCTRPQ 
MATFLIGPHKIIPWSVLWPNL 


1731 


15632 


A 


1740 


94 


117 


KDRPMVPPVGAGEDQADEPCRGHASLWS 
QLVSAPTTP I PLPGRDVPSRATPFPAAL 
AAQQPP * AS PYPLPPGIiGAGHASAS PVT 
VPFSPISESTGS*ESAL/PAPRPGGSG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

a mi no acm 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Vf=Olutamine, K=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=*Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, V^possible 
nucleotide insertion 


1732 


15633 


A 


1741 


3 


390 


VDQFLI SHDLSKLTEDBVHNLNSSTTIG 
\EVEFKVKKL\*KKKSSGPDGITGKFY* 
TVREEVTPIL / S YLF/HB I EKEETLVNS 
FYEARI IL/ IPKPDKNKP * TNIDAKTTS 
KVLPNRIWQYVKI I IQHN* VGFV 


j /jj 




A 
x\ 


1 /4Z 


3 


442 


DLHSRVEPRVRPSVRKQ*WLLKVLICA 
SKDTLKRAKRQP IGWEKI FVNHMPDKDL 
IPBYKINMQKSGVFLFTNNSYSSTTDND 
INKWAKDLSRHFSEEDIQMANKHM\KR* 
SVSLVT REIKI KTTGR*LFTPKCWQG*G 
EIITLVH 


1734 


15635 


A 


1743 


411 


2 


LPPFKHPPPEIILGAPKKKITLPPPRPK 
KCISLKGPPFFPFCRYRVUCCCPGWFST 
PGLKQSSHLGLPKWWDYRHEPYCTQSSF 
SLSFFLKQTGR*WFDLSNFFFY/ r CYCFI 
iFSRDRGLTLLPELVLNSWPQAILL 


1735 


15636 


A 


1744 


1 


393 


RPGGPPKGRSREQGREGERSRRRRPRAP 
WARSHMWGARVFSVPRSFDQRPQEKCVQ 
SSYKQQLPARARDGTGNLIRGAPLFFF* 
DGVLLCRPGWSAGFKQFSRLSLPSR*DY 
RRTPPHPANF*ML/ CLRRSIiT 


1736 


15637 


A 


1745 


395 


0 


PSAPSFFSTRIi* LGEPPGFPPPPFLKPP 
PRN/S I FGAPKKKFFTiP PPRGKKFVS LK 
GPPLFFF*DGVLLCCPGWSAGFKQFSHL 
SLPSS * DYRRTPPHPDNF* MF/ CLRRSL 
T 


1737 


15638 


A 


1746 


397 


1 


CGNFLKREKNFEARRFLQK*AARFRNIR 
*VTPEITAFCPL*HVASFLVFI/TPNFPT 
IPQLYCLEPLGEMGGSGSKLPPFPTSKT 
PNPLISVNLCFP\AIKWENFFF*DGVLL 
IiHPGWSAVAQSR PTATSTS 


1738 


\5639 


A 


1747 


392 


2 


FTKKGGRGGNLS P / LiPPQKI F * KKKTLK 
KPFFWQRVWFFSPLFFEQKGQGPFFFKK 
PFF*KRPPDTPQPPPPSFIFFFLIjFFFR 
/ HLVAQAGMQWRHLrGS PQ P P PPGLTQS S 
QLELPHTPPHPDNFCIFGRDR 


1739 


15640 


A 


1749 


33 


403 


IKGFKKKRGKGGAPQKKGGTGAKPPPPP 
PQGF*K*QKRKNGSPPNVFFPNPGGPPP 
PPPFWGDKRGGPPRGGGAPPPRGKRENP 
FFPTLA/ HGKKEKKK 


1740 


15641 


A 


1750 


3 


396 


KRQTTNWE KVFA* KN\ADGL I SL I RRKC 
LKVKK/W*R/DMNTQFTDKGILMTNKHM 
FCRSSTSLIMEMQHKAGVIFHPSDWQKC* 
STDNTQS*QGHEEMYTIKHSW*TFDYQQ 
pi? /qpqwt 


3741 


35642 


A 


1752 


30 


419 


NEGIGAGHEVSFAANKCRSMRXNVEVLA 
LNRRLDGELLSGLTSTQALPGWAYLHLL 
SHHAVRPLFLCFKRGWVLLCHPGWSAVA 
QSQ\FELLGQVILRPHLPSS*DYRSIPP 

pT 7\ M 17 IT KTT7 EHOITT} > OV7\ /~»T> 

l~UAN r 1LN if r KERAR X ACR 


1742 


15643 


A 


1753 


16 


410 


VGPKKS LQQVWAAVQATLPLES YDLAHP 
IILKVSLADRDAI *NLWQIPIVAS* YIP 
LGF/YSKAMPSSVDIYSSFEICKICTHFPT, 
TGGGGPPLYSNYLGRDGGANHLTPGVKN 
QPGQFGKPPPLQKVQTLAGRG 


1743 


15644 


A 


1754 


2 


17 


NSSLIKHRRIHTGERPYQCSECGRVFNQ 
NSHLIQHQKVHTR*RMYI*SR/CGKDFT 
QKSTLI+H 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cystetne, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I-Isofeucine, 
K=Lysine, L=Leucine, WHMethionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S^Serine, 
T=Threonine, V^Valine, 
\V=Tryptophan, Y=Tyrosine, 
X=Un known, w =otop codon, A=possiole 
nucleotide deletion, \=possible 
nucleotide insertion 


1744 


15645 


A 


1755 


138 


380 


vnwvo'nvnnnr'Dvnr oot ot tvpotw* 
XPXXXS PXPPPCPXPLiSSIjr IiTKCbAY * 

TPAHRPPPPGPFVPPPKPCCPPPPPLRP 

WPPLPLLTNPI PPPTLSFHPGPALS 


1745 


15646 


A 


1756 


194 


3 


WLCIPIRHTTEQQPGFPP/LIIWSQPYS 
PLFCLFVF*DTVSLCGPGWNAGVQSG1j1 
AASTSQAPSL 


1746 


15647 


A 


1757 


2 


403 


RVLFS PTLiAYTYLLF YLGAT I LMGVNI W 
KQPNCPTK/ GQQKI KLQiI YRMEYYS AJj 
KKK* ILLFAIR* VNLGDIMLSEVSQ\SR 
KKNIVLSHMW 


1747 


15648 


A 


1758 


398 


65 


PFFFFFFFFP+TKGLGCIHRCDHGSPQP 
RIH\GSSNAPSLAS * vggttgafhharf 
TLIQSSSVHVSTHTLHPYPSSLP 


1748 


15649 


A 


1759 


456 


31 


FAKRlTUKKLLSLI YLAT* KGE ITKALI 
KQ*\LKAQAKNSLKRHTIVLNHMKL/ ci» 

illiikemqik/stlryhfffitlakm 
*klgntfcwqgl*gtlihcwweck*hns 
yvggiwq/fsnklyvqi*ynsaisnldg 

RVGRPG 


1749 


15650 


A 


1760 


3 


378 


QFQ YFYYNGSVI KAVWYWS KNRK\ IDQ * 
NRTESPDKNLHKHMQLIFDKGTNTPQWR 
KDDLFKKWCWNN*TSTCQKKKKKKGGGP 
* KEQNIiTP PGWBDI ± FXjFa^P-KJJMPGAG 
VKTRWGGKNPGFPQ 


1750 


15651 


A 


1761 


69 


384 


YTSASWGGARYTASAAGWKTLLLLFLFI 
*DRVLLCHPRWSAI*TQPRI IAASTFQ \V 
KQSFWIWHIGDWDYRRCMPPCRANFFNF 
LCKKKKTTIjRRQEVNQTPAIjvRV 


1751 


15652 


A 


1762 


390 


1 


KFSTPGNKNLFFLKAPPFFFFCRGRVLL 
CCPDWYSTFGLKQSP/CFSLPKCWDYRR 
ESPPQAFFVFFGIiPSLPFSPPPAPSIiSQ 
SSSFFF*MEFHSFAQAGVKWLN\LFGSL 
QPPPPGLK* FS CiiSLPPTRP 


1752 


15653 


A 


1763 


2 


390 


PRVRGFFMRKFVDS YLVP YTKSNLiKWT I 
DLHVRAKAINLLN*NVRIHLYDRELCNG 
FLEMIKKTKAAATTTKLDFIKIKNFCAQ 
QMSS /MKVKRQSTKWJUvK.v-ib iHIoUKJGLi 
VSRKYVKKAYNSS IRTQSH 


1753 


15654 


A 


1764 


334 


3 


WSKRSGPPLSKNQTKKKNATKPQT* KNW 
INEIGPI I/NTSPSKEKTGSNGFTD*FY 

+»PT T/T7T7T T OTT T IfT.PftVPgTfQXrTTyDlfQP 

YVKDHSSCLSGIHPKDANMEQHMQINQC 


1754 


15655 


A 


1765 


259 


1 


KSTLFLMKKYWKGS FLKKHFFLETRSLS 
D/SPG/OTQWRDHSSLQPRTPGLKQSPH 
LSLLSSWDHR*APPCPANFRFRLKTRIG 

KUv 


1755 


15656 


A 


1766 


402 


386 


FKKKSRLRKI / KVQDEAASTPDLAKTI / 
DYEGHYTKQQIFHVDERAFYWKKMPP/R 
TFIARKKKSMPGFRASKDKLTVLSGANA 
AGDLKLKPVLTYHS /ENPRALK/ H YAKS 

rrrr OCT WTlI M'I'lf ATVTmrtTT^yCTJC "k P*Pt T CM 

TIiPEIj YKWN TKAWMK. I Qlvr Po * FT * IrM 


1756 


15657 - 


A 


1767 


2 


406 


PRVRPRVRKLITLIiNVSQRWS S EKKKKK 

RVNLFFWGPKKSTPPRVFKHRGGEK/PP 
PPPPKKPRERKPSLGVGSTWHGISPIKH 
SKKTKSTK 


1757 


15658 


A 


1768 


14 


409 


IASGFLFFIYFGVIGRPPKRGGVFFPGG 
GGAPHPPQGGC*KKKRGGGGGPPPFFPP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino jirirl 

residue of 
peptide 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

i*aciHiip nf 

lOIUUC Ul 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, D=Leucine, M=Methionine, 
N^Asparagine, P^Proline, 
V^viiuiaraine, K=Arginine, o'^erine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
a.— un Known, — otop cooon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














SLGGLGRPRPMSKIKTPPSAGGKPP/PF 

T»PWfYKT* , TY2f2/'l/"2'QI5r%*'DT}T WT« nr*r* zrKrr»e* 

utus\ivi * ivjoovarr'i'y ^Jrt'iiruoPGGKNF S 
PPRGGGGGGPKTPPPPPPGGE 


1758 


15659 


A 


1769 


305 


1 


TKTGPPPSNRAKKKKIVFQKKKKEKKIN 
CFVTAPFKGIKTEATD*EKIFAKHLSVK 

d /TA7VT.* VT + TG vtwtjt t trr *TXTn»T/T¥win- 
\jf JJ V X 4j w X JL w XolVX I JNiSJjbKliNNEKTTI 

PIKK*AKDLSRHFTRYTD 


1759 


15660 


A 


1770 


148 


402 


FSPVSLG*GRGNIYAGMSNV*EVPPEID 
J? UriK V KKAL&TS FQVKL VKI I FFKSTI * 
KSLAK/WLAWAHACNPSTLGG* GGWI T 
RSGVKSEPGQHGE 


1760 


15661 


A 


1771 


250 


2 


KKKKEKKIICFVTDTFKGIKTEATD* EK 
IFAKHIiSVKG/LViTi* YI* ISKI YKELL 
KJ-uNJN is r X KK* AKDHSRHFTRxTD 


1761 


15662 


A 


1772 


407 


1 


KKIRRGGGQPPLFPIilGGGREKKFFFPG 
KRGFY*TKAGPFPSSWAKKKKFVFKKKK 
KEKKI ICFVTATFKGIKTEASD* EKI FA 
KHLSVKG/LVYL* YI * ISKI YKELLKLN 
WcJvi i XFXKK*AKuIjoRHFTRYTD 


1762 


15663 


A 


1773 


1 


406 


KKKKKKPTTFPGLPFFFSLSLPPRVPGA 
RGPPSSFPSKPGLGC* IKKSGVSPAPKK 
KKPRKTNPKNFPRPNPKGGGEKFFNLPT 
HFi/Wjf 1 WS?Sf *KKRENPPGFAPQKG/EV 
FSPGGRNSRGBPRGGPKKKKKKGG | 


1763 


15664 


A 


1774 


2 


378 


AAGE WLHQGS LQSLPPGLKQS APLGVSK 
W*NPWHDPPPPAPRFVWWGGWLRWS 
FFLGAQAGIFFFF* IEMGSHYVAQAGLE 
/PPRLQGSS*LNLPSSWDYRPVP 


1764 


IJDOJ 


j± 


W/j 


1 


431 


QQMRDKRN LiF EHNK* GIRGI YLNI I KAR 
HEKPTVDTILSGESFSSKIKTMLISPFL 
FNTVLGVLA+ARKRKDI * VGK*EVKS YM 
t ±C4um± \iaiJNPKI/STPKKKKTGYFMGGP 
GSKPPPQRGGAFLSLTRDPLEREFPKTA 
LFTLGQKKIKGPKFFS 


1765 


15666 


A 


1776 


334 


402 


KGGGGVGG*QGP WRLiAHCTDK\ KEERKR 
ERERKRQRKKERKKERKKEKKE * MKNNK 


1766 


15667 


A 


1777 


406 


3 


SPSSSSSSLFSPPPFFWGGPRF/SPPPP 
VFKPPPPFFFLGPPKKKFFPPPPAV*FF 
FF\LGPPPPFFFFFFFWEAGFPFFSPG* 
GP/SGPMAGFRSLPPPGNSLSKKKKSEG 


1767 


15668 


A 


1778 


70 


409 


LISFLVSSLIVRLYRPLLCLFPDPSKGH 
CIPCLLPLFWIIiLHI FLFWHSFI YCLIN 
DRVLLCLPGWCAWRPRLTAAS ATQ\ I K 
RSGSHLSLPSSK* \ WDHRRCPGFFFFFF 
FF 


1768 


15669 


A 


1779 


390 


31 


SHLSLPKCWDY/ RL*ATTPSQKI LVFTH 
G*VLSLLSLSFLIPPD*TF*KMSIiMRPS 
LTQKSSI IQHDLLDKVPITIFLRQSL/D. 
S/VTQARMQWHDLGSLQPQPPGLKLSSQ 


1769 


15670 


A 


1780 


357 


1 


LTLFWGAQYLPKKGKGPFFLLSLPSPSV 
WGPLPQKKKSPPLCFFYFLNRVLLCHPG 
WIAWQSWHSSAHFSL/VLTRFK*SSCL 
SLLSSWNYRCTLLHPPNFLNFWYRQGNA 
VLPKLVL 


1770 | 


15671 


A 


1781 


122 


254 


RKNE \WSGAVAHACNPSTLPGPGGQIMR 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, X=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Vf^vriutamine, xv^Arginine, oerine, 
i— mreonine, v— vaune, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possible 
nucleotide deletion, \=possible 
nucleotide insertion 
















1771 


15672 


A 


1782 


45 


387 


TQTP KLRRFFHFS LLKS WHYRCS PL \ P Q 
HNG YFLTNLD IATSMFFFKDRVSL \ LAQ 
AGEQWCDPSSLHPQTPGLRR\ PPASASQ 

hAj&l iVjAHrttii WKWJLjXir t* 111 XlVr^Vjl 

YPMA 


1772 


15673 


A 


1783 


1 


401 


FATLARliVSNS * PE / CDLPAS ASQS AG I 
TVSPAHLAISPSFPAMPSSGLGLSYPAY 
HPGLGLRCHLPVLTSPWTSDTGPSSVLP 
DAGALHCPPEPQHICPL\LSGWLQTP 


1773 


15674 


A 


1784 


432 


1 


FljFLFFFIKKKTPFrTaAKPKJlRGF* KPP 
EFFQIF/ S/CPVFLGRFPKPQKEFFFPE 
REMGFFFLFSPQGGGPARGFPPPPLNQG 
EAPRAGHKKKGPNIiGGRTFFFFFFEMES 
\FSVTQAGVQWHDLGSLQPLPPPFKRFS 
CLTHAS 


1774 


15675 


A 


1785 


15 


434 


RLS IiS CCGRE EHSTLPGAP WRCT E IAwA 
DSPDPAPSPPSALPSLLPFHVYRDVCPV 
LCLRGWP* *MVERGRLGISPTWLLGWPF 
PGGA/ PHI KPE * YFLFAYT I LRS VPNKL 

n/^XTT AT T T PTT TT TV TTBTT tJT O ^r\AO T T 

GGVIjALiIjIjS IXj XIiAI I r IxjHXoK.QQoI± 
F 


1775 


15676 


A 


1786 


1 


258 


CWPETP VIxK * SHLDLTKRGEDRREPVWA 
ASTTIFFFETGL/NSGAQAGVRWVHLGS 
ZiEPLPPIPSLFMTP*GPCPGLGSSWPLR 
tsr 


1776 


15677 


A 


1787 


399 


63 


SLHNQWKSTPTIiKTSKKISHI *STWPG 
WAHACNSNTLGHHGGRTA* /RSGVQDQ 
PGHHSETSSQRLRNPYIKRCIKYLAHSK 
CC INDS FTVS VTS RKLIGKREVS PNNIT 
FR 


1777 


15678 


A 


1788 


3 


474 


MSISPvKWNSKEy^RAANRQFPSFSPWK 
DDSRDASPPEPASPTIG\PIRRIAESSW 
TWGSPCAEHPRARAGRRKAATDCPWAAG 
SQWRGPAGQGAPRS CLFPGSRTAARAQH 

DDl/B I3T3DDT57111T STTDTIOBT DOrtT T5DATT5T 

*VMTPRPPAAAPRSPVGP 


1778 


15679 


A 


1789 


66 


395 


LVQPLFDFIWHSRLSLWSGRDLLWSGKE 
T^PCIaNHSIGVI^EWQGSDVKKRRRLM 
ESLTGPAADVTRILKSNNPAITTAECLK 

■fVi Ifay V f UO V UOOlvUiiy JLJCvT XilM 1 X V^" " 


1779 


15680 


A 


1790 


413 


1 


PSPRALITDYS*EEGPRFWQV\EKKGQP 
LKPHPGLGS PHQES PRVGPP KRGGYNPF 
* KKRAKFFAPGENKGPPFWTGRDPTF *G 

LRRSLAWS PRIiECSAAISAHCidiRL 


1780 


15681 


A 


1791 


314 


1 


KTKPFFLKKTPPKKKNKKKGSC*DKARY 
QTRKG/IVNLGH/HPSFLFYFILFFETE 

CUCXfPXTTAn^TDWT D2kMCT/YPDCI>r , C , rVM70 
orto V 1S>UL3 v iCniuCATi olx/i tf o irv»r yyri> 

RHSKPSRKDYRHPPKRPQECVQ 


1781 


15682 


A 


1792 


104 


409 


EKQS FAFDWHFDLYFDNVK* KEGEESKA 
GEFNASTGWFGNFRKRL/RFKNVRVTGE 
P7AcruTnT?Aar)RFPr>WT?Tr!fTTP lnrrjvr.wn 

irt\D V X l^ELflXTlJCii; UEi C ruTvX X IjJVTv'J X XjIIVj 

KPLMYHEAAKYLNFWCPTK 


1782 


15683 


A 


1793 


392 


2 


GERERDRL*REREREREREREGERERAR 
ERERARQGTSTVESRF/HSYCRRDQDVA 
RPKAKGEVAAGRGS PDGLQVGRGQ * PGP 
SLRPGPWREWGFATYLACGGP I PTGEVE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

aiuino BCJG 

1 COIUUC VI 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
amg io 

1 o o+ am inn 
Idol allium 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGIycine, H=Hisddine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
V=oiutannne 7 K=Argimne, S^Senne, 
l — i nreonine, v^vanne, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SGEEWAQQGEWGQALGPP 


1783 


15684 


A 


1794 


405 


119 


IIKMAIVPKAIYRINTISIKVPMAFFTK 
\LKTIIKFI*N*KRARIAKAIIjSBCKNKA 
GSIPLPDFKLYYKAIVSKPTWCWYKNRP 
IVLVHSRTARKK 


1784 


1JOOJ 




1705 


405 


3 


V ryAGEQWCWHSS LQPQP PGLKWS / C 
PSQLPK\WNYRHVPPRPALTAHPALTAD 
F*RRKEYKLMRHRGK3CWHDFTLRSKKMK 
AMRHEYHS FLTYPNGHLYLHWQFFFFDT 
ESHS VSQAGVQWHDLGSLQ P P P PGFKRF 

J2»K V L» rUljM oQAlJ 


1785 


15686 


A 


1796 


2 


134 


PRLQHCTPAWATPQDSVSKRKRJDMTTDP 
AI *KG* *ATTTNI YIHKFYNLDEIDQFL 
KKHKLPQLTWYB I /DNLNSPITR\ IEFV 
ILNSKKKYPGSDGFTGEF*D * FCLBTKK 
KiDYRPCNIjKKI isdynkhlht 


1786 


15687 


A 


1797 


1 


404 


PTRPLTGSS ASGMMVE I FP KT YL I SSAC 
WVKIS KXDIiKLFS PFFFFGFLEGGFYFC 
PPNSRGGSPGGKFGLIEPLPPGLKGIPP 
PPPKRGGEFGPPPPTPAYFFFFLWGGGV 
/ PP / AVGGGGKP P I * GNPP P WP PQG 


1787 


15688 


A 


1798 


2 


383 


SGWLWACRS PDSE PIACPAGPRQ * GAYH 
PGQQLWMVR*VSQQCWPTPPALHSIS\P 
RP*LPPRRAGLGTCSPP/VPESPLIiPPW 
APAWPKPPRRALPPA/PPVPGPIDRPRA 
E ECRRMVRDWQAAP PAAP 


1788 


15689 


A 


1799 


35 


410 


ATGPSLGKVCS*AFSFL*PKIiDFWHLYF 
QASGFLL/ CHNS PS QTNSCS S FGKHYGM 
VLRVNS h * PDHP WNE FYFLNFF I FYRDS 
/SLTILPRLVSNS *AQTILLPLP/PKVL 


1789 


15690 


A 


1800 | 


65 


415 


KKGVLKGGPPPQPLGV*GPPPEKGGPGP 
F\GAPKEKPPLALGEPKGKPF*RGPKGF 
FFKKGLEPGAPPPPKPRGEKPPVFKGPH 
PFCKXGGGDKREKNGGL*RKKPTELGNG 


"1790 


15691 


A 


1801 


417 


3 


NLGPNI FPPRAPQNWGPAPPIiFFFLKKN 
FTWGGGSTPLFP /LNLGGLGGPFPGLEV 
YAPPS PHG* PRFFFKNQKLPPPVWGALY 
SPFFGGWEN\RKAPPGQTLFFFFFFIiK* 
GQDLPMLLQLVPNYWAQVILPPWPPKV 


1791 






1 8 no 

loUx 


i 
i 


All 


UtfUl fVjliA* 5>£>CIjSIjPSIGDYRC/ RTTV 
PS *FFFFFLEKGVGFI PRGGI KGLDNC * 

T/^DtlDDT^T TTV'NTD'D'D/ w »KTCr\t3 r irr"TMT«r«T»n'nn 
i-tvaf nir ire i^jx \l>lJr Jr zryjNoyii Vva i. AvjFfc'xrK 

PG*LFFFFETFFFF*KGGFFILAQP*MK 
WGALKKTPPAFSKRGGYRGGTPMEPTHC 
FLKEN 


1792 


15693 


A 


1803 


256 


399 


AIKNIHDS*EEVKISTLAGIWKKLITTIi 

JL Uuv JJvjr iv. 1 o V tS is V 1 VJJVME 


1793 


15694 


A 


1804 


407 


2 


FEKANLFPFLFKNSGPPNVPGRGMAGVP 
KFCCPSKRKVPGP I FFLRRLFLFYPKCR 
RPPLFWPAFGP*KFFLPQVFAPAF* KPS 
PKKKRAPVFFFFFFF /RDKVWLCHPPWS 
AMA*SQLTVTSVSWAQAILLPQR 


1794 


15695 


A 


1805 


429 


116 


LLTKKKRKiOCPI^CGKSGLGGP*GWTG 
V*QAEDTHVIW\VLSSAPSLSSEEMTDS 
MPGHLPS KDSRYGMEMLTDKKWTWDGGA 
WDSSPQGANGKRGARQASGFS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Add, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptoptian, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1795 


15696 


A 


1806 


2 


408 


F VI FVFIiVETGF \S I LARALACAI SNS * 
PRDLPTLASQGAGI TGVSHRTWPRRS CV 
F*EAFLTMLMVLHIPPSIXWHSVPTHLL 
AliSS * CFLFFLRGPWAVTQAEGQGHDIjG 
SLQPPLPGIiKWFSCLSLPSS*NYR 


1796 


15697 


A 


1807 


1 


196 


FRLGASLDLSGGCSGVSWTPAPPLPPPP 
PPSAASGIS/SGSTSAAAGL*SCRTAFF 
SFLSSFFFFFLKKINPFPFLGGI *PFKG 
GPRLV* GNI *PPPQG\DFLGGTRGGKKP 
WGGGKFGQGGNFPLPLKKPPGPQKNPPP 
\PPPPPPSAASGISMAAHLQLPACDRAA 
QhS FLS SHLFFFFF 


1797 


15698 


A 


1808 


395 


3 


LGKKMNNPKCTQGCEATGTL/M/HCWWV 
CKFVQSFWNTDS I Y* G* AAATHDSAMLL 
LGMHSMPACTFVHQKT* TKMFIAALFIL 
PLNWKQV/RCPSVI \ DGYI PTMDQSTAM 
KMNKLHAKTWMNLRNIMLNEKPR 


1798 


15699 


A 


1809 


7 


454 


IPGSTISUJPPPPGFG*FFGLSLT4SSWD 
RQPAPPR/LANLRR*T*LQSA*LWR\RG 
PILDEMKS FMCNCNSLTEGVKGRVEMMS 
QNGRLLTKFCHVGQACLKTPDLK* SARL 
GLLKCWDYSCEPPCLTQMPFS FFLFF*D 
RVS/PLSPGWS 


1799 


15700 


A 


1810 


20 


355 


PQCAHGCRAVAP VCV CFCFCV CVCSHCV 
SMCMCX5EVSAGWTLLCLCIiST\CSGAYA 
CDWGCG/ CYSACVCARVCACVCAQIiLTC 
IGMWE*GQAGQGKVLLDLPLCLCSCV?AP 
Q 


1800 


15701 


A 


1811 


3 


414 


S S KNDNNS LQEFMDKIAGMKKNLGNLTE 
LNNTV*EFHRAITSINNRINHAEERISD 
LKNWLS E I \NRQT* KI VTRNEQKLREVW 
DYVKRLNIi* IIG\ VFERBGGKAYYLQNI 
FEDIVHENFPRFARBANSQIQEMQRT 


1801 


15702 


A 


1812 


1 


443 


AGKSPS PKKFKKGPGGGRWS * FLGGAGG 
GVPFRPGIQGSNYRFF/SPPPPPLGEKK 
KPPFKKKKKTPQKLQNGNPHPYFNPRVH 
GCREQQPWEKAPDATRQPHYADKHVEAG 
EPREPPKPHSEPLFSSPRKRPLFTRSST 
/SGASPP 


1802 


15703 


A 


1813 


411 


66 


WKNNVFNKECLER*IF/IIQKKKI»DP\F 
LTRYIKI KSK* I KDLNIRLB I KKTPGKE 
SVTLAKWLIRSPYLSFP*QIQSKQ*INS 
YVIiIKITKEERWSTSKEYQKPWRERKAM 
YKIK 


1803 


15704 


A 


1814 


369 


20 


QEVRPS I YLSSSNRKYVKDSNARFTKEA 
IQIANIHMKNCPTSLIVGETHIKTSKGY 
HYVPIRMAKILKD\CNRROGETGTLIHF 
*WEWKMVQPFQKPF*WFPKKRKIH*QFD 
LATRS 


1804 


15705 


A 


1815 


2 


675 


GLAI LGRRLRGEACTRS P FS S I L I FMVS 
MGEWPGVP/GIKGTRTGAEAVPTRRRKS 
S I WPQTGGAGES SG/ PLRGLPRQGKPGS 
PRGPSGPPQNCARWWHPQAAPLGACCFS 
GPEPSRI»PPWRQGNWCLPSTPSSSA*EG 
WR/PVLQPGFL*SPSSLASICPGAERGP 
PGSSRP / GLRGAPGG 


1805 


15706 


A 


1816 


273 


2 


MESHIMVrPFTIGFFHLVQCLHGPSMLQC 
IAVLHSSYLFIYLFIY*DRVIiLCHPGWS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
pep it a e 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, (^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=JLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W— I ryptopn an, Y —Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \= possible 
nucleotide insertion 














AWQS * LTGASGSW\LRRSFHLSLP * C* 
DFRHEPR 


1806 


15707 


A 


1817 


40 


404 


GLHE \PGVQGCSEIi* SHDCTPAWVTSET 
LTLKKLKIKKSKKKKRANPRGFERPTFG 
EAGPGGLLKAIISKPRVIQGTQKSGALL 
GFNSNGGENHPPQKQPYLGGLYAKAHSP 
flGGGP P S PCG 


1807 


15708 


A 


1818 


392 


3 

t 


EKYNMSYDIKSTNHRKNCKLDFIKI/RN 
CCSLKDTINKMKMQASNLEKI FAIHMPD 
RGL I PKRKNSCNLVRR*QPPFPKEAKDL 
NTQKSQ* TNG/HGSKETSLI IREMQIKT 
T/MNYTTSIPT*MLKIKKMNI 


1808 


15709 


A 


1819 


311 


431 


EWGRAWWLTPVTPSLWEAEAGESRDQE 
*KLCTTVBK*KTISN/HDVPIRSSWTGM 
VAHACNPFTLGGRGG* ITRSGVGDQPD* 
HGEGL 


1809 


15710 


A 


1820 


68 


410 


AKKNQGPGPMVFGFGGIKPPPKQKKVRG 
GFFAICPKEQVFFFFFFWRAVVQSLNHC 
S LQPQPPGIiKQFKQSSHLSLIiSS *GYKH 
VLPCPANFILFSLVETGS\S I YFPGWSQ 
TP 


1810 


15711 


A 


1821 


408 


1 


T PF F FIiRVLRLTP PLIiGNF FGPGFPPWG 
GFSPGPLLKGPRP/CFPIFKPIFQPGKW 
GLVFFFFPPFFFYPRGSR*NLKKIFPNF 
PFFFPVFF* ILNPPFFFFFFFFFFFI»*D 
RVSLCRPGWS AVAR YRLTAS STSQ 


1811 


15712 


A 


1822 


362 


76 


SEIAPLHFNVGDRVSLHLETNKQSNKQT 
KKTLIFRDRVLLCRPGKNA/VEVQ* *LP 
AASN/FLRLKQSSCISLSSNWIYRHAPP 
HLATVFNFLI IFE 


1812 


15713 


A 


1823 


314 


2 


VTSKPCPRELTCITYGVSLTQCSMFGRM 
KGLLLIWPPVCEVRRASGRPPLMGSEEP 
LCPAATPSGRCTQQ/ LH* ERAMMTMAVL 
SNRKGGNVGKR*RNQIVAVS 


1813 


15714 


A 


1824 


57 


389 


NLHLLQLPTYTDADSTGPTLSGMNVKNIj 
HWSYEYKYRSITGVQWLILGSLQPLPPR 
FKCS CLS FLRR WD YRCAPPRRATF\ * FL 
VETAFLERLTSCDLPTSASQSADI TGV 


1814 


15715 


A 


1825 


410 


70 


VPIMSATQDYRHEPPRPAGRFLKKIiKME 
PPHNPALLLVG I * PKNMKSLHKDVCTPM 
FSGTLFAIAKIQKKPNCPSSMDEWINCR 
NY\MHI YDGICYSALKKNE I LARRSGTR 
Li 


1815 


15716 


A 


1826 


2 


411 


FLVEMGF\SMLVIAGLKLPTSGGAPASA 
S ESAGI TDVSHRAW P VFFFLKRCLVLVG 
RS**A\WPHTNLIPPLPSGIKGDLCPNS 
AiataWKiUjAPPJPb PGKF/ CEF * GRTGTTN 
FARG/WTKTPD 


1816 


15717 


A 


1827 


276 


3 


GRPGPADFRVRPQLLQRFLFI YLFTEME 
SCSVTQAGVQWCNLGSLQPLPPGLQ*FS 
CV\ K* FSCLGLLSS WDYRHMPPHLDNKS 
IFSRNGVS 


1817 


15718 


A 


1828 


1 


3Q1 

J71 


IiEPRRRFIjQCVQDCATAIiQPGQQS KTLS 
QKKKKKKGGPP/S * YQKGQGCPSGKKGR 
GVAGKGAFGPGFGGENKTPPGGGPTGEG 
PFPQKGWGPSQGPTKGPNLWGPGGPKL 
GGKGGPPGPTKGGGGPSSF 


1818 


15719 


A " 


1829 


2 


134 


DHLS P WWNQPGQHSEAPSL/ LINI WKL 
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Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=dycine, HHHistidine, I=Isoteucine, 
K=Lysine, D=Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
0=Cwlutamine. R=Aroininp ^=St»rinf» 

>C ********* i****, AX ill gllllUC, o vjci uic, 

^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X = Un known* *=Ston cnrinn /=nn«thlA 
nucleotide deletion, Impossible 
nucleotide insertion 














AGCGGAHLSSHL / LREDHLNLSVQGCSE 
P*LHACTPAWATE* DPVSQTKLN"*NIWK 
LAGOGGAHLSSHLLGRI T 


1819 


15720 


A 


1830 


427 


107 


VQHQPGQQRESPS J I IIQKLARGGARCL 
♦SQLPGKLRLENRFN/SGSERSHQCTPA 
WVTDRDCLKRTGITRASSQRLSVGIKQS 
CLNPRTAQLQVSAQSPSTVSTNL 


1820 


15721 


A 


1831 


3 


540 


VQFPNFKIYYKATAIKTVL\QHKQR*ID 
E/MNKI * TPEINS YVYGYLNFNKDAKAI 
QWGNDSF\FKKWC*DNWISACKYSQTSV 
SASSASSSSS 


1821 


15722 


A 


1832 


385 o 


2 


AGRQSETPYHNSTIMKVSLQVCVDLSCL 
LGPGSLVQDS PNI PAQRKRFLRLGS FWL 
PLSRFSRVGWPLPHWGQRSSGFSLPR\P 
P*SQIPAPRSPPPAGFVPARSWVCGPRP 
QTRPLPAERPSPRPRRL 


1822 


15723 


A 


1833 


7 


399 


RISRSYLSEYGGSGKEHPTLGASYARIM 
VFG/VPIIYF*RQCLA*AQWYSHSSLLP 
QTPGLKHPP\AQAS *GAGTIGTHHHT*L 
TFAF / 1 FVLGCFFL* NKI SVTOAGGOGC? 
NFGSLQPPPPGLKRVSCLTLPR 


1823 


15724 


A 


1834 


2 


306 


LARI*VSDS* PQ/CDPPASASQSAGI ICV 
SHRAQPAEEIIiKVFGTLCLEPQRPTPDI 
FIIPYLFC\LFEMESCSVAQAGVQWRIL 
GSLQPLAPGVKRVSCIjSPP 


1824 


15725 


A 


1835 


12 


400 


KKGMVPKQLKVGKKPPFSCWGPNMKKRD 
SPVFHHQDPIFPIPFFFFGNGFLFFPP/ 
LAGGQGGNLN* PNPLPWGLKEFPPPTPR 
GRGEKGGAPP P P INF VFL KKGGFPLGGR 
GGLEPPPLGDPPPLPPKRG 


1825 


15726 


A 


1836 


220 


401 


KGSFVFI PQPEGEGP FLG* LKPRFPGLK 
QFSCLTLLRScafYGPLPPPPVIF/CX5FL 

R 


1826 


15727 


A 


1837 


12 


357 


GLLGGQMNGSLGTQTS YEDXjMSSS \FKP 
NSPPPTPS/VRTGHLPK*PLESSNGPPP 
PQVSHSFQGQWARGHPSPPPQWNTPFSP 
PQQYTQCSKTDP*PPPSPPYLGQEGSNA 
PSLA 


1827 


15728 


A 


1838 


8 


380 


imkthaiGjLnkrvah* iqqykk/hh/nq 

MGFIIiGVQIYFNF* KINLIQLINSVKKK 
KN/HSSSSSSSSSSSSSSSPPVLIKSLC 
NliSKNKRNFLCLTKG I YKNKTE \NSMKI 
II*NGEQLNAFPLRLGTK 


1828 


15729 


A 


1839 


2 


444 


VPGDAKWFSVLHLKDAFFFIPLVPESQY 
PFAFEWENPNTREK\TWAVLP*GFWDSP 
HFFAQPLERDLRGLQIiEDGS IIjQYVDHL 
LVYS PTQEASDQNT I KTLHFPADRS YKV 
SKK^ITLQQVHCLGYILTPGTCK/LS 
PERVQAI 


1829 


15730 


A 


1840 


1 


642 


EIKGIQIGKEEVT/ SIiFADDIDYLRIDS 
TKIOiLEVICEIiNKVG*KINM*T* IVFLY 
IGNEHLELE I /R / ETiMPC T fCT <i <? TMTTVT . 

twutmni *v/ ri uric v» HVIJO i. I'ilvX *J 

ElOTiKKDV*DLYTElTYiO^PREIKIT*A 
NQ* B I LCLiCSRRIiNTKM /STFPQVFCVY 
YAI PVK/ 1 PSRNFLVLVDKLILKFI*KY 
RGPRTAKTTLKKKKKVVRLTLL I FKSYY 
KTIVITIGWYGFQDRQVD*WNRIE 


1830 


15731 


A 


1841 


3 


423 


HRITSE*DLHI*ARELIW\SMYGSIiDHKN 
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Hinrv 4fk 
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peptide 
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Amino acid sequence (A=Alanine 
C-Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, P=Phenyla!anine, 
OGIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X= Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possib!e 
nucleotide insertion 














YDAELDRMAMPCLCAIAGALPPGYVDAS 

I PE KLDS FINKFAE YTHBKWAFVKI QNN 
WSYGENIDEELKTRPMKR\PY\KTYSEK 
D 


1831 


15732 


A 


1842 


33 


179 


YMLGGRGCSDLRSHHCTPAWVT/TA*LS 

JfV/TVnDrtAKrDT /WITT /\OVT?ntTV 


1832 


15733 


A 


1843 


349 


10 


LAGGIiNSMEGER\LR*ERBECTQQQMVH 
DKYCKDLMGFGTKPRHITPFSS FQAVQP 
QQSNALVGLLGYSSHQGLMGPGASPS PA 
KSTLVESRCCRDLMEEKFDQRK\QWVLK 
CR 


1833 


15734 


A 


1844 


15 


856 


AAEQLSFIYKLPQNPSPSTPGSSLSGTH 
GMQTMLGSTHLPNLTDMLGPELGVQGI P 
SPGCACQRGRG \GGGRECCS P PGVS PQG 
\SAVGRGAEGPGGLTRSGSGAASAIiVRP 
GEKGCWCRTASGAGPQRRQRTRGPGSWG 
L*S FSQTS EEKCP S PAGSAGAGP VOQRRQ 
SSFAGGCGTGAGAPGST\GDAHPAQGGS 
GGPIiRSLPAVGGPRPGPSFLKTSGSGSV 
PQGVP ILWSLT * RALAAPGSQGPAGLAV 
SCTGGRGYRDPQAPGTGAG*HGNSTRIiR 


1834 


15735 


A 


1845 


402 


2 


SKARRQGRPLRQG*APG/AARIPEQKRI " 
GGP/ EERRRPS ARGPRATRVAGEGPKPK 
GQTAMAGGGHDPLPLPPARSRSQES IGA 
RSRGSGHSQEQPAPQPSGGDPS PPQERN 
1 EKPPKLCSTIjPGQGPPPNV 


1835 


15736 


A 


1846 


446 


32 


TSRKIS*KTGNQFLMKECSCWSHHKAAF 
TKKDVLNI \LAWKHVNTKASETFHFFQ 
SGQAKVQQGFVKEGCELINEALNLFNNV 
YGAMHVKTCTCMRLLDRLQYIMGDYAEA 
LSNQQKAVLM\TERVMGTEHPCIRPL 


1836 


I J / J 1 


a 


1 54/ 


440 


4 


V IX3KHVE VSJSKGGQVNYQAG \ KTVEIWA 
DKIXK3)MiGTADMVECLKSTRYKELIQQ 
AITAGGAP IAFGP VIDGNVT PGDRQILM 
EQGEFLNYDIMLGVNQGEGLKFVDGIVH 
NEDGVTPND* KFSVSNFVDCMRPRRGPN 
YSRFQ 


1837 


15738 


A 


1848 


526 


0 


PRRDPPPKRQTPIPTHVSPVWEKGPWGP 
APIiRPDHPSLSPCPAMG*K\PGLPRGCP 
QTQISPLFNRSPASPLICHHHPPSEP\K 
PGPEPPPTPSSSI PSLARFTRPGESSPL 
PPPOTPSGPP 


1838 


15739 


A 


1849 


417 


31 


QATGQECGCHRGGPPPGPAGETEPQAPL 
RLPGGTGIPWGAGILCPS*LPGPGSLSP 
AAGRG/SGPSAGPGAAWFSSP*/PACPS 

ooiwnv t vj\ortoor XvIvrioirVjJjr X LLtirnirir 

WCGRGASNKI IQMPGLVC 


1839 


15740 


A 


1851 


3 


285 


YTVCECVCVCLCVCLPVSL\SLCI,SVSV 
CFFPSLCGFVCVCPCACVSLAECALCAT 
KRFV/CMAACLW*ASFCVSTWVMRPAVN 
RFRRGGSALGA 


1840 


15741 


A 


1852 


128 


524 


KIPGLGQRSEGVGQGKDDLHVTAPVPTH 
GWGEGAASKPTVLPPPPP\PDAPTVFFF 
FFFFWEKKYFFGPPPKGAGPQIYLLGPN 
PPGFKPFFPPHPQAAGI*YIKPPPWYKC 
PLLKKRGVSTLAP \GFPKPP P 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Acid, {^Phenylalanine, 
G=Giycine, H-Histidine, I=Isoleurine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=ProIine, 
V^vriutamine, K— Arginine, exocrine, 
T=Tbreonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


1841 


15742 


A 


1853 


1 


1648 


MTVPLHSGLGKGVKPERKTTVRRPFVSA 
GKKYACPFKKAETPQWRRLRMRPKAPAA 
SLLATFPISQRLIPLTPARKHCPSESET 
STWWSKAPATPRTRGAGPTSPPRPTRR 
R\*ACSLQKLFAVEEEFEDEDFLSAVED 
AENRFTGS LPVNAGRLRP VS S R PQETVQ 
AQSSR/PAAVTPHCSLRGFGPARIjGP/P 
ASLPPARPVLTA/OPSCIGAAPLRPVST 
SSSWIGNQRRVTVTEVLREPARPQSSAL 
HPLLTFESQQQQVGGFEGPEQDEFDKVL 
ASMELEEPGMELECGVSSEAlPIIxPAQQ 
REGSVLAKKARWDLSGSCQKGPVPAIH 
KAGIMSAQDESLDPVIQCRTP\DPP*DL 
VLWVTFLFQ/PALTVPTQQLHWEVCPQR 
S P VQAMPKAARGTIQSSPQ1TOFPCQP 
FQSPSSWLSGKAHLPRPRTPNSSCSTPS 
RTSSGLFPRIPLQPQAPVSS IGS PVGTP 
KGPQGALQTPIVTNHLVQIiVTAASRTPQ 

ADTtlDCTDA VFDD'CDr'nTVr'TT T>TT/"V"^ ppn 

UrlHJb'oiKAlvlKKr r^FAtjlJLiPHyUbCjJfcC 
SLEDIMVSAPQTPTHGALAKFQTE 


1842 


15743 


A 


1854 


235 


223 


IHKFIHQIWLAKITCQRTKY*KERSVLL 
PTSFSPVPSQGHYTCKQ\liCSLASDLSQ 
PDLVYKFMNLAVLHAMWNS RKVS CYPWT 

MT VT7T HIWOT 
Ml If JjflANttl 


1843 


15744 


A 


1855 


373 


3 


I KDG I YR YFYKALDS FCLCCEFMNQFFS 
CMDRBLSQRCFLNSAY*FPSNL*CYLCY 
LFFFF*LKKYFLTFFLFLRDRVLLCCPE 
*SAMVHS*LTVPLDFW\VKGSSCHSLLS 

GtjTTYVDinVT "DtJT.V 
OriUxKxliVurrrlljZ 


1844 


15745 


A 


1856 


378 


1 


RQRHS PAGNTGRPQVTPOG * IS WPSITK 
DRTSMS S VTSGALGHTAAS PHARLLPLA 
LPSVRTQHGSPPPGQEQPTIICPSNLPT 
HPSLPLGMHPSVRASPPLCK/P/SPPSI 

r*\Z> Vili-VO WlrOrr V X 


1845 


15746 


A 


1857 


3 


379 


YMVRKVIEVWFLLLLIiLFFFFRGGFLGQ 
GWGPPAPGFSFGKKNPQGLGSSNLRGPW 
* TNP * PVPGGALFIoVGPPTPAEFPFKNF 
SRGFLLV/ALADF*TRPRLVSPHGG/RG 
AKGTPAFLESMPPWMP 


1840 


15747 


TV 

A 


f oco 

1858 


452 


1 


AGNNWAKGHYI EGAKLVDS WDVVQEE * 
ESCDCLQGLQLTHSMGSGMGTLFISKIR 
EECPDCIMNTLSVVPPPKVSDTVVEPYN 
ITLSIHQSVENTDETYCIDNBALYDICS 

"R T r , KT ,TTT? C T 

XV X UI\iJ X X XV Vh jl 


1847 


15748 


A 


1859 


1 


385 


otssdylfpfffixfrnsihsvtoaggqw 
hnqgslqpwpsrlk\cptasas iclhmp 
pflanfliffveigspyvaqagsrdppa 
iiasq/ sagitgmghctqp* vflffffff 

<? T7P HTTP G. DttTD Df3t"2t*iP P T 1? 


1848 


15749 


A 


1860 


470 


17 


IEMDSRRVPRDKLACITKCSKHIFDAIK 
IT*NELASAD/DFFPTLIYIVLKG/NPP 

V*JL-»^^ XU^ \£ XXX XVXT Vw^X^I XT O JVXJ1 * X \JuiAJ X X X X 

NIxPJ^TOCSWLMOT^ 
WI^YVLRFRNGITGVSHRAHP 


1849 


15750 


A 


1861 


3 


790 


CSRPEFPGRRFVEAVRSKPYLSLPF*SR 
* S FFNVPAE * TSAKD ILASSEF I KQNND 
VSS\lxQKF^QEVK*IxDHIHTA*ADGSW 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N-Asparagine, P=Proline, 
V^vyiutamine, iv^Arginine, o=£>enne, 
T==Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














KAELERGPDGSAPSPFPQKPAASRQPEL 
vjc IoA T r LJ3RWD P W YQS NVw 1 Li CPA! HKXi 
AEMPPSLDTSRTVDPFILDVITYYIRMG 
TQPIYFQIYTVKIFFSDLSQDPTEDIFIi 
X biltxLV K. XyDS KF P KUGr o PRRRG V AEG P 
GAELS LCYQKALLSHRPREVTVSLRATG 
LILKAIPASMY 


1850 


15751 


A 


1862 


192 


3 


SSGSHSVTQA\GVQWHEHSSIiLpy LLT* 
FPCjKK*ASHLTIJ\SS*DYRRAPPHPANF 
♦IFCREGV 


1851 


15752 


A 


1863 


82 


370 


SLCQKRAFVGEKLVHGLLVSPSGGRVPS 
CPDPWGCRPRFHAI AVYS FLKLRWI PE 
VS I IiPEDLEELYDijFKVRS SGKMRGSRP 
♦AGLSRG/DPACP 


1852 


15753 


A 


1864 


2 


325 


IQVYSISHLSIYLF\IYHLSRGSMHVSM 
SLS I * S I YPCMDV* MYLSNLCMHVYI YL 
FYGS I YRFYLS ICLSVYLS IYLS I YLS I 
x LjX UHPS I FK.TV IDEHAI FATWRH 


1853 


15754 


A 


1865 


3 


377 


YSPWCKLFREIiCKINVFD\LDSPLLSGK 
EFTOTTHNTFDHMWRTKEHNEAGWLIiIiS 
SVDKVMKENDELRDSNSWIiQKQV\* PLK 
SAKTAIiSGSLNSCREKAE I VEKQTQSLT 
M*VADIjQRKMHVQP 


1854 


15755 


A 


1866 


10 


378 


GWKNGEFIDAL* KVYGHKAPNKSAVYKW 
IT/ * FKKGQDDIEDEDHSGRASTLRKKI 
HLVYALIDKD/*RLTABAIANTIDISIS 
LAYRILTEKLKLSKLSTQWVPKQLCPDQ 
LiQRRAELPMEILK 


1855 


15756 


A 


1867 


346 


1 


DILWKIiQKPQPNGKMLKAAKERKKFAF 
KGVPVRMNADFS I AAMKA/RRRWNS I FS 
F*KENNCHLRLLYSAKNIFP/EIKTFSD 
RE FVT I RSAVKE I LKDVLWAEERLSHVK 
SRNV 


1856 


15757 


A 


1868 


1 


377 


GTFFQRTQCKGIKQ\YWGLI IKASSDP 
TCVEKEKVY IGKLNM I LAQMLKQEWTEH 
WPAFISDIVGASRTSKSLCQNNMVIIiKL 
vz>t&K vtUbb £>GQ X TQVKS KHVKDSMCNE 
FSQI*Q\LCQF 


1857 


15758 


A 


1869 


90 


384 


QWLLFTEYSSLYHPVLFFFFFFPGGRTG 
PNPPAGGEGNETG/PNGPSTPGGGGNPP 
PLPPGGLGLLSMPPPPRQILLMETKKRP 
PP*TNKCCSPGYSPP 


1858 


J5759 


A 


1870 


2 

• 


578 


FVVKHALLLGLDFLPGKMAPWSGQGCSL 
<aH J. iSLrfj I b WUr AVGGAS WRLKVVCvKGD 
SHKGPATPIASCKGPLGRPCPLLAQSKA 
*GS *KRG/VAPGSP*LALGMGGG\DRLT 
LISQVHGNQVTQI IPFSTEGETKAQRSP 
SLPPRDLIRGRHSWNLDSTQLLGYCPIjIj 

ff f ljfllrAljf lit* V Ft IWCaEXQKENSRE 


1859 


15760 


A 


1871 


1 


382 


sgqdagscllygagsgamvsgaynpyie 
iieqprqrgmrfiykcegrsagsipweh 
stdnnrtypsiqimnyygkgkv\ritlv 
tkndpykpyphdlvgkdcrdgyyea* fg 
qerrplffpnlgirca 


1860 


15761 


A 


1872 


490 


1 


ADSLS KD PGRPLHPPNI WGDRGRQ* PGE 
TPQ *HRRSCASQDPGRSQ* PGKILPS PK 
PGRPPPMTGEDAGPPK\HGNPSPNDQKK 
NPPPK* / PQGNRIHRDPGRFPPQ* PRKK 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G\utamic Acid, F=Phenyla!anine, 
G=dycine, H=Histidine, I=Isoteucine, 
K=Lysine, L=Lcucine, ^Methionine, 
N=Asparagine, P=Proline, 
V/^vjriaiaiuiiic, iv ai gin id e, o — ocrine, 
^Threonine, V=Valine, 

X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














SFPNNIiGDSNDQRSRFPVA* KNLPQSSS 

f5TP A T CR PT.VT , PPOPfi A RPN^TT 
OHUiiulVCiiJXUr Ir virvJO Ortiz irlNO J. 1 


1861 


15762 


A 


1873 


373 


1 


GGGVPGALFSHKKKSLFFFPPPPFFFRK 
GL*NF\KIKGCGSPFLPLLFFLEKESCF 
VP\RVECGGVILGPCKVCLPGSPPFSAS 
AS * VSGATGAC/RPRPGKFFSPFFFFYF 


1862 


15763 


A 


1874 


3 


374 


YMLGKE I VS KTKIGQELGLLNQ/TSQ W 
MAKEKFL* ETKS ATPMNT* IKRKQNSPI 
TETEKALLVWIEDQTSHNILLTQNVI *N 
MALTLFNS I KAERGEEDTEEKLEGSRYW 

TTKroPVTf RTfATQTT 

e rixtr ruMViSurvj. o x x 


1863 


15764 


A 


1875 


2 


364 


IHSGKGESLWDLGLDTEFLDLPSKA*HI 
KAKNDKLDLIKM*NFCSAK/ET*/IRMK 
TQAIHWERIFVNNI *NRKLVYKELLKLR 
N\ KKNT I RR * AI DMDRHFTK/ EKMPMTM 
*YTKGCSISLVIL 


1 QUA. 

1864 


15765 


A 


1876 


41 


461 


WMKLLSENATIMLKDSDLSQV\FCVNIP 
FVQAAKAKIEE*DYIKLKWFCSAKGAI+ 

CMEFTQLTSLITTPTITLPWNPRQSYHH 
NPT 


1865 


15766 


A 


1877 


2 


185 


VRPTKl^PLERTQYTLPLLYKWNNKAWM 
TAHL/FTA* FTE YFKS \ SVKPDE +VRE I 
DYRMLYL 


1866 


15767 


A 


1878 


■ 
1 


Ant 

491 


PASAAAATAAS PS \PTAPQGPPR/RRRL 
LIQPPLYPRGLFTPGVPPL/APGGSREP 
S * SLT * DARTL P LP VLG PRRARLLGAC P 
AVQAEEGVDPVGHLSEVLAAPCKQP* TP 
PPACHCLTCEGRPSGVQAPLHKAKLYP 


1867 


15768 


A 


1879 


24 


A An 

449 


FGKKSQFLFGPPGGGEGEKLG*REPPPP 
GTKGMPPPPPRERGKKRGGPTGRENIiGI 

KGGK*GGGRLPPPQIKGPKRGRPKKRKG 
RP 


1868 


15769 


A 


1880 


190 


2 


PLYCHKVGQVGLELLTSGDPPPLSLPKC 
wdyry\dhhaqps F*LFI»SVQI SGIKFK 

HSWOPCI 


1S69 


15770 


A 


1881 


I 


458 


FAIRAGRNLPDKEFRYLRTVIVTAAVYW 

GLNSKLRCLTS \lltfqhragvspytsp 

FGFA+TCVFAKQLLEPILC/RPCFHRAP 
IiLPKLRGHFAE/FP*QCFFR/QALGFSP 

/rstcvglrhg 


1870 


15771 


A 


1882 


3 


392 


ymmryhyilirmakvki \sstnddtq* l 
klihc*qecrmvqplwkmv*qfliimii 
*lp*nlaillwgiyliemstyefr/rki 
ciqmfmtdlxvtakywtq/ pqcpsvagw 
ikq/irsihtveyysavkrnql 


loll 


1 37 72 


a 
J\ 


1 OOl 

looJ 


z 


473 


TNMRRNKQKGLRRCKPRW/RSRGMAWKV 
DKPRDAR* KSDRDRQ/RGETDG*RVSGG 
LTA*EAYRH\HRGQ\EAASQSGRSRQAG 
RQTP*QEQAGNEAEIRREEEKSRGQERE 
RE\ KDRDADRHKG 


1872 


15773 


A 


1884 


138 


444 


CYLTliI KCRI I YINDKVIVLTIVWY*HK 



655 



WO 01/64835 



PCT/US01/04927 



SEQH> 
NO: of 
nucleotide 
sequence 


SKQW 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQTD 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine j 
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DRHID*WNRLQSP*IiNPSIWVQLILLNN 
AKOTQWVKDTLLDKWYW/ENVJIYQCMRM 
PIjGSYTI pmnqfkmnygls 


lo/j 


15774 


A 


1885 


27 


470 


TICCYIFLFFSFFSLFFFFFGKGVSF*S 
PGGBNTG/ ANLG * WNPPPPGKGIS PA* P 
P KE PGMEGAGHP PGTKS * PGKFG PGPGG 
GPSFPYRGPNKTWGKPGPKWFPGGIGYP 
ATIPQKKQPEGKRGR\PGGPMS PYNGP P 
KTPTHGKGPG 


io/4 


15775 


A 


1886 


478 


1 


KNVQENSALQAAMCRKMLI VCQTQCVI I ' 

SGESGAGKSVAAKYIMGYISKVSGGGEK 

VQVRRGKQKDKLGRPHLHGSAAPPICTP 

ASV*LLPTPPASAP\HVKDI ILQSNPLL 

EAFGNAKTVRNNNSSRFVSLCRPAWSSC 

SLRADLSTHTPTHTHTRV 


1875 


15776 


A 


1887 


1 


402 


HSLERPHYIG*LFKNIFSRD/RVFAMLA 
HWSRTPGLJOQSTPLSIjPKCWDYRCEPQH 
PAGSFFFFFFLKNGFWGCSLGGRAGGQQ 
*LKS*WRPNPLG*GNPPC*PSKEVGTTG 
AHKKIANREIRTRACGGTNFSL 


1876 


15777 


A 


1888 


511 


124 


GTRRQHFAGAHVPPEGP*S\MLDPKLK\ 
DDRPARDMWIREPGLLLPRAPAQDAGKY 
YCHRGNLTMSFHLKITARPVLWHCLLRT 
GGWKVSAVTLAYLIFCIiCSLVGIIiHLQR 
GESCPQWVCPNPTPSSPG 


1877 


15778 


A 


1889 


667 


310 


QLKP * ATKS V* KDTAFGIDVGNDFIiAMT 
PKAQAMKGBCIDKWGFI YR*S ICTAKETI 
NRVKR* PRK*EKIFAKPTWQKGQIS/RI 
HKEFQQLN/KQKSNNIilERQTKDLNRFL 
SKECSKDL 


1878 


15779 


A 


1890 


462 


3 


KWFFPLGPPFLPPP/PPPI*NPSFQKNK 
KLTRGGCARYFPPLKS PRPRI PFPFFEK 
GEGSPNSKHSPAPPFWGPKBTFFFRRTP 
PPPPPPPLCSFLKNNIiFSYIPKGFLGGK 
GQNFHS FLPSFFFS I KKKLLGLGVHVRF 
YCRWSCTGFFVQMY 




15780 


A 


1891 


1 


455 


NTCLGFGNGFLDATPKA*SMKKI INKLD 
FNETENFCSVKDTVKGMKRQATHWEK/V 
RKTHILYKDLILKIYNQLLKHHNKKTST 
IKQ*AKDLNREDIQMTOKHMKRCSGWGR 
WFTPI IRALWACKVEGSLELESSLGNIV 
RPHRKRERE IHCY 


1880 


15781 


A 


1892 


1 


537 


RGGIQAPKEVSPEGRQEPARKSLI *TA* 
ETPP* SQ* /PIPEEP/TGVFMKKPVSVS 
LETGKHAVWPKVNGKELPDNPTIKWFK 
GKWLELGSKSGARFSFKESHNSASNVYP 
VELHIGKVVLOTRGYYRLEVKAKDTCDS 
CGFNIDVEAPRQDAYGQSLES FQRTSKR 
SLYAIiALEDPGM 


tOoi 

lo51 


15782 


A 


1893 


2 


514 


VRCQRRCHE*RACX3SSLVNAKKLYEDAL 
MARKVKQSLFSLDVETDEDKFQMMSLQ\ 
CSLAYGTLTKILSEKRSAKSYGMSSVRM 

p G AttOT^lTARTiWriDPPVCirnTTjrtVDaTTiaT 
Jtonuy l oivxuiuny riut voy "ijv V rj\v DiLi 

LPFRKKGQTKDPALNTSLPQKVLGTTEE 
ISGKKHTEDTISVASSLHYS P PAS PQGS 


1882 


15783 


A 


1894 


473 


2 


VMGESRGFS PPPTFTGGNFVFFWKQGVS 
LFCPBGFKTLCSSSPPPPPPQKAGVLGG 
SFHARPPPFFFFQI*IPFFFGIKKP\LC 
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nucleotide deletion, \=possible 
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SPPP*PIi*KKRGIFPVPKGIFCEMESCS 
VAQAGVQ* YDHGSLQPWPPGF+ +SSYLN 
LV*LSS*DYKCVPPHPMY 


1883 


15784 


A 


1895 


613 j 


11 


FRPRS PACGLHAVS S PKLPGAQALSSPG 
l^LVFTAGSWnAG/LLRLSPRANMPRVA 

SQPACL»SPFLLI.APARPSGCAWPGLPP 
VCSLHLRGGWGGADPTGCLDAWAMALAS 
LRPCLCPALPLSREPPSPLLNDLVIiPRM 
PRRAPVLPI*GRTPHSLIJXHVQSPSEQL 
PSKGPERli 


1884 


15785 


A 


1896 


20 


449 


EPSAQTWGPGTLRLPNRALLVNVKFEGS 
EVSPCVACGIQAALSMGSTSSVKDLSHP 
QAPLPQWHQMVFARCLCMCGAQLNVPP\ 
ESFTFQVSTKDVPIAI^IACAIiRK/KATV 


1885 


15786 


A 


1897 


393 


3 


RPTAQS KGNI * VRVAS \EALS PKLLDFL 
PGKVLNGEKVDVRPATRQNLSQFEAQAR 

JvtvE»v~ VKV rtL\j\j± c irKAnoKiJooJJoiiLAaK 
ATPSENLVPSSARVDKPPSVLPYFNRPP 
SALPVMGLPPPPI PPPPCI 


1886 


15787 


A 


1898 


395 


217 


RER/ CKS FR/ PPA/ HLQAKI KGAQ * QVN 
QAAAAQAAAPAAAMVSRDISSIiLVSSQK 


1887 


15788 


A 


1899 


1 


375 


OTVLVQ*NNKAWMTVHPF/TAWFSEYFK 
ATVEIYCS /EKIPFKILLVFARVHSHPR 
TLIEI * KE I YAVFI PANTPS ILQPMDHG 
I ILSSKPYYLRKASRAQRLTPVIPALWE 
AEAAGS PEVGSSGLA 


1888 


15789 


A 


1900 


47 


326 


WWLGSELRSPSYSGAAGGAEMGRSLESW 
RS RPQRAETAPLHSS PGSGSEMIxS L* RQ 
HLTPTAWAGVQWRGLCSLRPRP PGFK* / 
FCPSQHHPQPPSSWDYATRCQATKQFVF 
FLEMGF\VMFARLILTVELNN 


1889 


15790 


A 


1901 


181 


837 


AGRVDRREPGMGTGICKELETGSRETRS 

GAQAKARGGTPLACSSHLPNSHVGSLKA 
QRDEAMVQSGLAPAVSSTCTRWT*GSEW 

IiCFQQTSWASSCSLEHSAQPSEVQVRAL 
SVPSHSPMWV\PQLSLPRDHRKPPGE 


low 




A 


1GfY> 

iyuz 


i 
l 




YTWGFRGKKPLIHCL* EYKLVOPLWRAA 
WRFSK* LRVEL* FNAAIS PIX3VYPE ENK 
LFYQNSTCTCWFITALFITGKT/WNQP 


1891 


15792 


A 


1903 


207 


3 


FREMEFLHLGQAGLELSTSGDPPTLASQ 
NVG/HYPJIEPLRPANTLQS * PLGLKQPS 
CIiSLPSSWEYRHMY 


1892 


15793 


A 


1904 


2 


391 


IQPLISQPJCYKTLGQNSLS/CCAHP/PP 
HFFFVLDSYFH5LLSQLKNAPFT*G*CP 
YWYSKSYSLIHSWGSSSILPCPLTSSGF 
PSLPPSYOPLPCPSLFIiIi*NSLSTLCriL 
FFSLLSQPSFSNRWPSQVYL 


1893 


15794 


A 


1905 


3 


424 


YRAGCLQSLLPPPLLLFLLPCDVPFPPS 
S\LP*VKASCGLIRSQQNVGTIPCLQNR 
KSNKPLHKLPSLRHSLRAMQNRILPILR . 
IGKFF/IFFS*DGVSLCHPGWSAGVQLQ 
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1894 


15795 


A 


1906 


2 


389 


I QRGLDKSCIiI TDNI PSSQSL IQKKGVN 
PSFKSMKADRGKEAAEEKSEASRSWFMR 
EKERSHLHN/IK* VQGKAACYPEDLAS I 
IDEGGYTKQ*IFNEDY/MWKKM*FRSFL 
TREKLTPGFKASKDRLTPLV 


1895 


ij tyo 


j± 






13 


AKEE rQSVWDFPM/ VPQGVYVNFPVSR 
NANLSTIKQLLWHRAQYEPLFHMLSGPE 
AYVFTCINQTAEQQELEDEQRRLCDVQP 
FLPVVRMVDCEGARVNKLLSSQISLFIG 
KGVREIiDSLSDPEVSDFSTKMCQFCEKS 


1896 


15797 


A 


1908 


409 


3 


EKTDGLYRASQRGKDLRJRLCAQSFHASW 
KDGMALCAL IHRRRQ\DL I G YAKLRKDD 
PIGNLNTAFEVAEKYLDIPKMLDAEDIV 
1 i Jrlsi<JJhj|^lMrYVSWIj*IVIAGAEQAE 
TAASRICKVLAVNREKKKLMEERV 


1897 


15798 


A 


1909 


116 


379 


HSGPRREGALLLPKCLPHAKRCLLLFKM 
CSDGATALCCPGWSSAAPS * LTQSP/AS 
TSQAK* PSHLGLPSCWDYRCIPPHPANC 
LDYYY 


1898 


15799 


A 


1910 


418 


3 


QD*YATANRWFICMLSQACFLPSL*PAH 
LTj*L*QIiMLFSFPGTPVFSYGDEIGLnA 
SALPGQHMEAPVMLWDESS FPDI PGAVS 
ALMI VKGQSEAPGS LLSLFRR\LS VQRS 
IU!j<bljijHGDFIiAFSAGPKLFSYIRPMY 


1899 


15800 


A 


1911 


394 


3 


ILEAYPEVKDPAVKGASSKKEMYGH* /A 
AEQALPVASEQEQQRHERSEKKQPQVKE 
GNNTNKSEKIQLSENICDSTSSAAAGRL 
l\W^IGKTYPQQFPKKLKEEHDRCTLK 
QENEEKTOVNIMSKKNREDV 


1900 


15801 


A 


1912 


499 


141 


PGLGERDWTSKYGQGAGEGSTREWASRC 
G/IRPGGDAGQQQPRPE*SVCPRGAHSP 
UraoWKASPAWHSAEPGGRCSGLGVQKE 
GFPGHLLQPGCRTPGPGIRKERFSGYLQ 


1901 


15802 


A 


1913 


127 


387 


ISFVFPPTLPKMPQLKPETISMTGLNLF 
QHLCNLARLATSAYDGCSNSEV/CDLDL 
LY*AALFLKLDYQIRFPNYFSTKYRIYY 


1902 


15803 


A 


1914 


504 


0 


PGPGQRKHSTAPMCLLNIYAQILRFVLA 
NQIYKCIKRIIHHDQVGFIPIMQSWFNI 
QNQ*\INLIHHINRLN*KNHMIISFDKT 
HHLFIIKTFIKLGIEGNLLNLIKNIF/S 
KNPAANI T LNSRKV 


1903 


15804 


A 


1915 


46 


415 


YTSNKQLQIQILKITYNSTKKYKIIilNW 
XKDVKDQYTENQKILLREIKDLNK/YRD 
TSCSW/NIVQMSMFS/KLIYRFSEPPNK 
NliSTL* ILTTSF* FAWKYTGTG I YKVTM 


1904 


15805 


A 


1916 


420 


1 


ENADCVERARKSPDSIP* \RGGQISVTM 
VSPNEQEKAGQLAIGVRAVRYNGV\LLA 
KMWRKKLHLTLSANLEKIIAIGLFFSNF 
ERKPPENTFLKLTAMATHSESNLSCFAQ 
EDIAICRPHPAIKMPEKAEQYKPLTASV 


1905 


15806 


-A 


1917 


384 


3 


TRTITSGQYSTHVIRASRVPTRS * VPVF 
RSCTSNRRFSQAIEPRVMH* KVHIRAST 
VRYDSGGHVAVYPANDSALVYQLGKILG 
ANLYVVMSLimLDEESNKKHPFPCHTS/ 
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RRTALT*YLDITNPPMY 


1906 


15807 


A 


1918 


405 


105 


KAEAQRRREICSGPHSQADSAWHPLVTW 
WH*PPPVARSSQEPGVHSPVP/HVCRKI 
LLIRPKMALANEGNYRELRWFTPWSRSR 


1907 


15808 


A 


1919 


409 


3 


GGQEEGVFRVLNS KEFETERGKP* IPPV 
KDQBI IH\PTKFNHVAHMGPGDGKQVLM 
DLPLSAVPPSQEESPGPAPTNLARQPPC 
RNKPYISWPSSGGSEPSVTVPLRSMSDP 
AQDFDKEPDSDSTKHSTPSNSSNP 


IQftR 

17UO 




A 




o 
y 




& P &PWP T .PP PrTWDTYIP * PT .P DAD D ova 

SACLCGGTWNLALWLCS PGT/PVPFLTP 
PCS/SCEVQQPASHSVASNQSKEPAKSA 
AVAHECPPGGTGSADPGWP PGATCPES P 
GPATPHTLGWEPGKSSPPTMEEEPWAP 
QGS PCWTVRQRMTMMM 


1909 


15810 


A 


1921 


556 


1 


IQLWAADGGILRRVGRDPPSHRIGKEPS 

*SQHAAHPIiI»IGPPGQQNLGDSKV*GFP 
SPRLEENTLENGGWGSKQLHGSPGSQHA 
GGSWKNGETSLKG/PH+ADGAGRHTMPQ 
SPPSPPFKPHSV*HNPPAS /PPPHGSPA 
PSGTSPLPMSAVSLLPPGSIj 


1910 


15811 


A 


1922 


567 


41 


GGWGETFSRLGNDIjQAHR* srfnaqaqe 

LLPAAGLDDVSLPVAPRYHLLLVAfijCRQ 
KAQVTGDPGAVLWLEEIRQGWRATQDA 
NTAQRMM/GVAA INQAI KEGKAAQTERV 
LRNAAVALRGVVPDCANGYQRDLESAMA 
ICTORPAAV 


1911 


15812 


A 


1923 


2 


405 


IQGOGITSSSVLHGMVFKKETEOT/VTS 
VKDAKIAEYSCPFIX24ITETKGTVLIKT 
DEELMNLSKGEENLMDA*VKAIADTGAN 
VVVTGGKVADMAIjHYMOCYIJMMLVKIiNS 
QWDVRRIiCKTVGATALPKLTPPCL 


1912 


ism*? 




lyzM- 




37 
5 1 


GECVQQRTGNVGLSPNTARWGTPLGPSI 
SSSAPPWSFSAFPGPPGSGK+AAKDGPG 
CSLKSGLKR\RSQGGLR*TRGSGPSPMP 
PS PS PS / ERPPPGDEGLIiPCTPRGGLPG 
PKINTACVCAADISPGLEPV 


1913 


15814 


A 


1925 


74 


429 


ATIPGHKTiT.T.TiFFFFFFFFFLGKGAWPIj 
G*GKG\GGPIRG*GNRAPRG*REFPPPT 
PGKRGNTGGGQPGQPI FGFLKKKGAPPG 
GPGVPKTRGQIEPPPWPSKRAGVTGWTL 
GPGPKV 


1914 


15815 


A 


1926 


515 


304 


ALAASIiALALNGVFTNTIK\ * IVGRPRP 
DFFYRCFP/GMG * PHSDLMCTGDKD\ w 
NEGPKETSPSGHSSPV 


1915 


15816 


A 


1977 


i 
i 


t j j 


NTVGSNKKSKKLYFAEXEKS YLNP TWTVJT. 
KGPQ*VKIILKKKSSVTKCTLIHSKA*Y 
KV/LKIVWYWQRDTHM/DHWNS IKQHMV 
KQFFDGDTKTVQWAKDCLYNNWCWEK\W 
ISTSNRMKFNIYPTP*ANINCHQDPNSK 
ELYRYKN 


1916 


15817 


A 


1928 


316 


356 


GGT*PPQAATPIS*LYLPITSSL/TLPP 
APSLPLQIAPIPSDSPSPRYS /TPTPPI 
SLGLAPPTTLL I P VPSLPVS PRPLNSTA 
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PP/ TFLTGSGAAVTWSLAVSLS PAPAS 
AMDKAQNM 


inn 

1917 


15818 


A 


1929 


490 


11 


PTRLTCPGSHVPGPVTFPPYETHSMP \ P 
PCHVHPLVQPPGSSHVPGPVTSSPSPPS 
ATHIHPLVQPPGSSQLPNTWVHPSQRMR 
TWLQQHPPLPAPPQP/ PSPGS I CSSMTH 
LTE YLAS FPAPQ/ PREQAPQGGVPTSLP 
QW*APQPTAPCRATQSQQIaI,DG 


1918 


15819 


A 


1930 


1 


552 


RNPRKRAFPSPPIPAPPS*NQENERSRH 
PQSLL\PFVKSRKRAFPSP/LSPCSSFV 
KSRKRAFPSPPVPAPPS*NQGNERSRHP 
QSLItLLREIKEMSVPVTPSPCSSFVKSR 
K*AFPSPPVPVLLRBHTFLPLPEPRQPS 
TVPVGCFGFSRIPRRWNHTGCAFLLICH 
SARLLCDPRVSSCWRHS 


1919 


15820 


A 


1931 


414 


3 


RVPGESRK*ERVLDSRHKD* EGRRRGNI 
\MGKRVDYTT*TVITDVTNLS IDQVGVH 
RS IAANMTFAE I VTPFNIDRLQELVRRG 
NSQYPGAKYIIRDNGDRIDLRFHPKHSD 
LHLQTGYKVKRHMCDGDIVT FNRQCI 


1920 


15821 


A 


1932 


521 


103 


ATEPAGVRLKEGGNI tesfvtvgnvisa 
ladfsqda/srys * kkkqvlvpyrdsvm 
twllkdslggnskaimmatispadvnyg 
etlstlryanrakni inkpt inedanvk 
LIRELRAE I arlktllaqgnq iallds p 


1921 


15822 


A 


1933 


1 


490 


NTGWVVRKGEGNGTIiMVRRLPCVPlliCSS 
AGGLTEEDEGELCAAGFPLLAEDFGQAL 
QQLQTAHSQAVGAP /K3GGDQVLGGGTGA 
PPTAYPNHALSIS* IPSVSWHDEGALPA 
VKIBILGI IHLLPERHALLSIiVQARSGL 
LLHGPPATGKILLNKAGTTECCLT 


1922 


15823 


A 


1934 


383 


151 


EVAFLENLIKDDIERGRLPLLLVANAGT 
AAVGH3 \S *IjGRMKELCVQCVNRPKMEG 
YMHVSQHPVPESHKMRAKAI F 


1923 


15824 


A 


1935 


379 


1 


YVPWS KDKEYFFNSQ *GLTTHQ I LPYI 
DGFRHVQKISAEADVKLNLVRIAIQILL 
* VGLQS YLGQGHQPGKS /CRGPGCEGWE 
GMVLRS * AQLSLSGTTAL*HWCPSSRKL 
IQFGIIKNLIRRLCI 


1924 


15825 


A 


1936 


376 


2 i 


GCLFIYMKPTAMSSSQVARSGEVSPFTA 
VPA* S * K/QGHGAVLGCHITSEACFQAC 
FPFLRPGRSTCFA* SGC* DVERSSSHSH 
GTAHSP\HGTAHTPMEQHTHSHEQHTHS 
HEQHTHSHGTAHMY 




15826 


A 


1937 


426 


1 


KHGE I I DDLLKVYRDNAS KKSAI / YSKW 
I1HWKKRRDDVGEEVHSSRPATSVCEET 
IHLVCALI *ED*LIAAETIftNAVGI PRC 
SAYTILT/E/KLKLSKLSTRWVPK/P/L 
LPAQLQIREKRSMAILNKWNQDHEAFLH 
IIIAGLY 


J yZO 


15827 


A 


1938 


2 


46? 


KRRKLVS S IAAAIHPALS S * TRQRFLLL 
LFLFHIVREDLVQLRIKKKHTIQIGNVE 

LI/NFS IVAAYKINMHISLVFLYSNDE* 
LENKTKQTSSFTIALKRIKYVGSNKKSK 
KFYFAEIEKSYLDFIWN 


1927 


15828 


A 


1939 


468 


920 


IPLSTHSPGMGTSTHIjSTRHHLSFWTYH 
ERLLNPGRETDWA/ IMLFRWGIiLMLPRL 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glydne, H=Histidine, Msoleucine, 
K=Lysine, LF?Leucine,MHVlethionine, 
N=Asparagiae, P=Proline, 
V=oiutamine, K— Arginine, j>-oerme, 
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X-Un known, *^>top codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














VSNS *AQVIPPTSASQSAGITGMS \HAS 
GHLIFNY\CSM*WLHLRNSQ*KRCTGQE 
TGPCPVTQDGVQWSNYGSL* PQTPRLKQ 
SYHVSLPSSWDYS 


1928 


15829 


A 


1940 


448 


474 


GIPGG*GSTTRNALyHVMNGJ^VvXIjJ. I 
CKHGKDWQKHKDSRCDRDNTEYEK\YDF 
GEMLHNATFCLVHRGRRHGS FRFDEAMQ 
AACVPVMIiSNGWBIj P r SEV J. W W MiAAVX 

GDERLLLQI pstirs ihrdki ialrqrs 

QFLWESCIATAIEDPNLIGWR 


1929 


15830 


A 


1941 


1 


423 


M xTLt X r AWjiVl* W V C P(jPJjw\KLiljriKalj J, 
HATPDPPLIiTLP* IPTPTPSSRCAPKAP 
PHLICPTSCPLCSKPHPWPP/CSSCQ 


1930 


15831 


A 


1942 


48 


417 


RLTOYQVL YRLHCTALQPLP PRVK* FS C 
LSLPSTWNYMHMPQCPTOF\CIFTRDEV 
PPMLP\SWWRTP 


1931 


15832 


A 


1943 


450 


2 


VHRRSNRQNMDTSR * RGAMQS VES V \GV 
PY\EQ*TIVDGI\NSGVWEGIAYA* IBE 
RYPEEFALRDQEKYLYRYPGGE/AYKVE 
TIKLNVEAVNTRRDKPTNIFAKNQAPVR 
MRRNSFTPLSSSNTIRRPRNYSVGSRPI 
KPLSPLRAQECI 


1932 


15833 


A 


1944 


451 


3 


GRITRHIiPRRAEDDKfiRiiRis PS PJjPSKH 
PMFPPSVTPKASSDWP/PASSIPCQACH 
G/PPPVSLPRKPAHRSC\PVFPP\ASGG 
DTSIHSGKTVYVKRKSQPAWPVLPPGGL 
*APSHGAPSPSPPDQHRHCPEI\R*DLL 
PAPAPoPFS XPPJj I 


1933 


15834 


A 


1945 


402 


3 


VR1^VSWKVMYP*SRSKASL/HPQITAS 
LTGSCVNCIVILILNFFYEKISAWIAKM 
EIPRTYQEYESSLTLKMFLFQFVNLYSS 
WFYVAFFKGKFVGYPGKYTYLFNEWRSE 

ECDPGCyCii X KXj I JN UM I X XMAfcxt) V 


1934 


15835 


A 


1947 


1 


405 


NTGWRVFAIIiCSLKGRPRGIERV/GGKK 
KKKKKPGG PLGPAQGKTPKTQKGGGAQG 
NRGKPPLFSGGNFGNGRIFGNPLPAPGP 
GGGGGPR * KTR\KRKNGNWDLLKGGGKL 
VFCPMVGKKLSGVPGV* QKNFKGGW 


1935 


15836 


A 


1948 


443 


1 


LTPGAANAbLtljCj \CC^Ci/XioVNG*RJ2GXi 
WEALLTHNMVAGCRLEEVDNAYGHYEAF 
STLAPKAffLSVELAEPCVPEPGLPPVFA 
NFIQIJ^/PVVVTEGGTAWLEWWHVQP 
MLALMBAEIJyCSQVLNRVT*GAHYSDCI 

TV TV AT PTKTT 


1936 


15837 


A 


1949 


396 


2 


GNGRGGSVPPNSIiNKDGISCAI * /RHIN 
WLNG*TPTIYCL*ETHLICK/DH/HRLR 
VKGKKKILHPNGNQKPAGVMiLT*GQTD 
IK* KAI KSNKEGHYE I KGS VKQEN I TVG 
NTYAPSTRAPRYKKQLLDLKGVN 


1937 


15838 


A 


1950 


419 


1 


RWQ/PSARPPTPSGK*GASLPARPSSGT 
*GAXi1jPGCFvWKVRSAS£> WPPSfUjGSEJs 
PLCPA\PSHliGSEER\PSRPPSHIGSKE 
p\ PARPPTA*DVGSASAPP /LPSGM*ER 
PAAPPSGR * GASLPGRPS PEMWGVPLPR 
RPVWDVRAPSRAPVWEVRSVSAWPPHLR 
REKTLRIiATAPV 


1938 


15839 


A 


1951 


422 


3 


QNHVTMSE/DAKIAFGKIQYPFKMESIiN 
TL * MKVNFLNLiI KI TQKNP IANTMFTGE 
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C=Cysteine, D=Aspartic Acid, 
E=Glotamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P^Proline, 
V£=uiutamine, R-Arginine, S=Senne t 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














MLB/AFLLRQGKRSC/CSLSPILFI ILL 
EVLPS * VWQEK* / IQG*EREE *NCDFVD 
DMIVYVEN*KKPLYNLL*IIS*VARFKV 

TT TVPr«T 


1939 


15840 


A 


1952 


2 


409 


CAHYSKNPPQRPLRTDARLPAV/DPWSL 
GAILFMVGCGQPPFQEANDSETLTMIMD 
C3CYTEPSHVSKEGKDLITRTLQIDPRRR 
ASLQ* I ENHPWLKGVDPS PATKNNI PLV 
*YK\NLSEEEHNS I IHRMALGDITDR 


1940 


15841 


A 


1953 


2 


367 


IQCVRL*VHVCLCASSCVCICLYVHATL 
CVSTCL* CVAVCM/CLLCACATVSACLC 
/V/CXJWSTCVCVCPPVCTCDCVCMSAC 
VCT/CLCVCVTVSTCVCERLCVPTCVCR 
CP C I CS S LNGNEWMG 


1941 


15842 


A 


1954 


374 


2 


EAPWLLRAGRGALPPCWSKTPPSPLLF\ 
PPLLPGTLVYQPWVPLPPWNRLACAVTL 
ST*ARAGTSNPSWHLPPVSTAPQHPSTW 
QSPGAGTMGDPSLSPWTSHLLPGCCHNP 
CHCP FPHHKPTRV 


1942 


15843 


A 


1955 


1 


411 


NTPSPELHPC* PGL\PPLPSPPQQPPTW 
APPRTSTQQKLPILCLLKPSAHTDAPCT 
QPGSTLPLHTPHTQQAQGTAYQIHTT*A 
APPPGAKPG*RCPPPPPSRQPQRMQTPA 
PGNPQPCPRLTTLTRVLVPAVPPLPI 


1943 


15844 


A 


1956 


33 


451 


RGRNTFGPLQSPPPRFK*FSCLSLLRSW 
EE\RDYRCMPVHPANTMLASLVLNS \CD 
LPALASQS PGI TGVSHPTRPHLS * FLMP 
LE*GHHHLLLYLLSQPPNMPP*FQRFST 
GQLECSSFRTNLIVPSPSLNFHSDFPLT 
S 


1944 


15845 


A 


1957 


3 


399 


YMQVRTTMSDHSSHS ISKINTDNTKS CK 
G*GSTESLFHDRWEYKFVQLLWETVWHY 
VR*TFTILYNPE I ILKR/I FRHTYKNVC 


1945 


15846 


A 


1958 


47 


399 


AANPTLPAVFFFFFLETEPPFGPPGGRA 
GPQSRLTEPPPSGVKPIPR/PPPPGBPE 
KWPD \QGGGGGGDPKPPGGPPP * ATGQN 
SAGKKKKKKXPRPGAHGADPLLPGKAGG 
EEWLDPA 


1946 


15847 


A 


1959 


407 


238 


TQAFALI/KDGGVIGGICFRMFRTQGIR 
EIVF*AVPSNEQVKVSGSPRRQPQCTAH 
S 


1947 


15848 


A 


1960 


106 


1854 


NEAVKSKPNQTRNHKEKNNYVHLPDAGP 
SQPASAGGSSSACRRSTKATLHKRWVSS 
PAGPGVQPLSWQHPPMMA*GLQGNPSPQ 
AAAPPARPGLVSGNCLT*EMAQAGA\GT 
GGSLSASLGKRCPHIPVPCE\PVL*GLC 
VPGRSSLGELGGNPVTVQSFPGPGAESD 
PW*GDAECCSLLQASFGDRAGWSIR/RG 
SVGRPAGVPRGKGRKPTLSGA/ SGPGS V 
LGGFCCPEPLSREAESGMVQE* *GRFWT 
GQERTPTGR*GCKVAGYLHSSATVGHSG 
AGAGGS PGKTSATLDVGQGLGGT/PSGP 

AQPPASGPSPWWQPLLLRQRLRQSTGL 
*LLFPGVGGGLQPGEG\GQPFPLSPWGS 
LTGRKNSNKARPGGNEGTEG+GARNGAS 
LTWVWTVPNGGYCPQAGRDFWLVDSRFK 
PSLGLRAMV*NSRKAPLSFEDGRMGTV* 
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Hi no 4n 
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Amino acid sequence (A^AIanine 
CXIysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PbenylaIanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P*=Proline, 

vriu kaiu iLiC} iv-'Argininc, o — ocnnc, 
T«Threonine, V=VaIine, 
W^Tryptophan, Y=Tyrosine, 

V— I T n If n f\xu n *=Q •#■<-» r» rntlnn /=ni\cci hlA 
j\. \j iiiviiLr wii, ~~oiop cuuun, / — plIJoIUlC 

nucleotide deletion, V=possibIe 
nucleotide insertion 














PQGQEGRASHRKGDSGLSPHSGKG*DP* 
G/PGGKERSISPPLPRGL*GP/VGERGR 
NGDWPW 


1948 


15849 


A 


1961 


76 


427 


QRQEPTPGHTTSGTRHPPGPGGRGPIQT 
HSHSHLFPWLETSAGSSPPNFLCSLPPY 
TPL.IiRIJjP SPPTiCPS OT*K^RMP T PAPHP 
SPASPCKVP*DARGAPAGGAPSTIi/RGH 
HPPHP 


1949 


15850 


A 


1962 


375 


131 


FLVGR I RLLYCLRELFVYLLREQKAKRN 
SQWVHTLPISSRHVDAVPCSPSINRNRM 
GRDKKRPPPIi\CVDA*LHSPSWIQP 


I SOU 


1 D5M 






on 




uu J. J-U-iV: JVW rliriirl V O O 1*1 V JSx<Aj * X» JurlovjlJ-P 

FS FPDKVS S WQAGVQ WHDLS S LQAPAS 
GSKRFSSliGDHRHAPPHLAHFCIF/M*T 
QDFAMLAPJjGTSFIYIjFGCCC\FETGSH 
SVAQAEVOWHNHSSIiQPHSPRLRRSFHL 

Q T PTJQLJTYHPM"A P 


1951 


15852 


A 


1964 


402 


42 


CCHY PGT PGLQGRL S ASHQ AS / SGPQLP 
APLPLQVLPFIALG IGVDDVFXiLAHAFT 
EALPGTPLQVGPCP \QGSS EAAQLTG* E 

CPKGYCLDQ 


1952 


15853 


A 


1965 


8 


14 


SSRAAACPPRPAI>SIiPRPPVFTRRMGPQ 
GSGLPKLPV5APSSRMGSFPAPSPLLLA 
SGDRHLCPDAAALPI»LAIBTGCDS\PPG 
H*PPPSPGGLHPCSPPS*PLIL 


1953 


15854 


A 


1966 


521 


2 


YBLYG 1 1 KRHRRE VSNNQQ VR YKEHS I P 

PTYWQ <5VPf*TYD r PWfTDPPPPPP Q Of'tlPMV 

DHTGTAAGGGGGMILASPKLGATPLPPfi 
*APA/PPPPPPPPPPPGVGSGHLNIPLI 
LEFXRVIiQQRQIHQMQMTEQI(^QVLLL 
GST fiOTVGAPAfi P ^ T? THRTOTA<5 QTKPT, 
LPLTV 


1954 


15855 


A 


1968 


111 


395 


YLMRGFIjIjHHNMVKGKEDERKTKGARLI 
LFFFFFFKKESPS PPRVEGRDP \NLGT* 

Mp T ,pp Tn/TTKT »RP PT P PRTTOTR PfJ RVDHPD 

NNFLFFRKNGGS 


1955 


15856 


A 


1969 


413 


101 


DVNRHFSKKNKHCQ* LVTKEMQ IKTKRR 
YHFS PTRMGfCLKN\ KKENKFWQGYGETG 
IFTHCWWECKMVQPIK3CTVWQFFLKVK* 
LLVFGPACKELGSYYSLVTKS 


17JO 


1 JOJ/ 


j± 


107A 

17/U 


A1 1 
**1 J 


i 


*v xi x rAUu x zv v nr iyt» v i Hj» _l rvi j avj rv. x x w I tXJ 

ENNTRKYLHNT K I WKGLNRTK \ KP * T I K 
KKTDKFDYDTIKNIiSSSKNKTRQTIDWE 
NIlAYLQYITDK* I ISGIHKAFLKV/YK 
/ KKTNOTVGKWARDLNGHLI KNDIHMAC 
I 


1957 


15858 


A 


1971 | 


3 


828 


GQACHFIFRSAQAGGSRLRIWRAKQLGL 

WP VCAT j T M P AT iF W Q FPTR T .T XYIN <? VPf? P 

S ISI^TSCI,/CNRCVWEGGRG*TGPGLP 
SLGKVLVEGIPSESPGPTASHPCCSPRP 
DPDQ/LSCISAPSATPTVH*SHLTLPMG 
WPGPQ*GLHLSDQPGPRKP*PGCSGLGG 
GDAAPRGMEKP\PPPQLP 


1958 


15859 


A 


1972 


398 


3 


GPGCYFSVRLiQCP/ RKI PAWKRAVCSAT 
LI\SL(^PSI^APHVLGLAALAVHLGES 
RSALPEVDVGPPAPGAGLPVPALFDSLL 
TCRTRDSLFFCLK*ALLPQSRLLSSLSK 
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Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucme, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q =3 GIutamine, R^Arginine, S=Serine, 
i— inreonine, y— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














SDTCSFDKIRRDRHADIPPLY 




1 JOOU 






I 


433 


QDPT KMTKEELNALKSTGDGTLG/ RAS E 
VEVKNEIVANVGKREILHNTEKEQHTED 
i v KJJl-VDXlsVr lAbENlilDQKSSEDTAP 
FLGTLAGATYEBQVQSQILESASLPENT 
AQVESNEVMGAPDDRTRTPLEPSNCWSD 
LNGGS 


1960 


12861 


A 


1974 


373 


580 


TIFSRQVIiRIQNALSDKPNVSTVYSNNG 
otSUiUI bEASV/ YnWKiLISTEYHKTL 


1961 


15862 


A 


1975 


47 


413 


KWKHLRGSEHWRPQENQVHQRIAELRKA 
GLWSQRR/PAEAAGGPRPKSHWDYLLBE 
MQWMATD FAQERWKVAS VKKMVRAVARQ 
LQDRTRREAGARREEPSRLRQTSPVLPE 
KSSVPGLVLRR 






n 


ly/o 


444 


3 


GYERSRGrrSGTHSSRDYDGIFGQATVK 
WAMLDQFRMLS PCSKEVMHQPFYLKRVE 
IMAQCEEWIADIQQYSSDKRVGRIMSRH 
SAAIKRRTGQLREEIiLKLPCPEGLDPDS 
GDAPEVCRAATGAEETIMHDQV\QPSSS 
KVLP3VFQ 


1963 


15864 


A 


1977 


281 


567 


PRSSLQGQELSRPWGRKGNFPGPPVPPP 
QRMFYDSELFSGISDPSCCF/RSAPTAD 
QVYGDQDMHEWRKHCMDYLVRKWRPLG 
KGQGGKJ. S S PHRP 


1964 


15865 


A 


1978 


357 


2 


KQILGPPHPQAQPGRWPP PHGPKDAPL 
WS SRAAP PGRGLGRAG P AAGVEAGATLR 
DSSPSTWTREGLHVQAQRKRPSHVHKG/ 
SGPGCLEDGEFPTSLRLQAQLAEIGRGN 
GLSVRRQ 


1965 


15866 


A 


1979 


29 


434 


VQAEAEGLPGDTEHPQPQLMSRSLEGQS 
DVT I KHVACGD FFTACLTDRG I I MTFGS 
GSNGCLGHGSLTDISQPTIVEALLGYEK 
AQVACGASNVLALATERELFALGRGDSG 
RTGARTKESH/YLPQQVPMP 


1966 


15867 


A 


1980 


3 


2082 


SSEGYLRGNMSENEEEEISQQEGSGDYE 
VEEIPFGLEPQSPGFEPQSPEFEPQSPR 
FEPESPGFESRSPGLVPPSPEFAPRSPE 
SDSQSPEFESQS PRYEPQSPGYEPRS PG 
YEPRSPGYESESSRYESQNTBLKTQSPB 
FEAQSSKFQEGAEMLLNPEEKSPLNISV 
GVHPLDS FTQGFGEQPTGDLP IGPPFEM 
PTGALLSTPQFEMLQNPLGLTGAIiRGPG 
RRGGRARGGQGPRPN I CGICGKS FGRGS 
TLIQHQR I HTGEKP YKCEVCS KAFSQSS 
DLIKHQRTHTGERPYKCPRCGKAFADSS 
YIiLRHQRTHSGQKPYKCPHCGKAFGDSS 
YLLRHQRTHSHERPYSCTECGKCYSQNS 
oljKoxiyKVn IliwRFr SGGICGKSFSQRS 
ALIPHARSHAREKPFKCPECGKRFGQSS 
VLAIHARTHLPGRTYSCPDCGKTFNRSS 

TLLQHHRVH\SGERPYKCDDCGKAFS\R 
ASDLIRHQRTH 


1967 


15868 


A 


1981 


2 


188 


LPETNFAELFLPYISQHNLIRKYKK/WP 
GAVAYACNPSTLGGQGGWITRSGDQDHP 
GLHEEWP 


1968 


15869 


A 


1982 


3 


424 


EGQAIVERMNLCLKQQLQKQKGENRYYR 
TPHKQLN\ALLTLNFLSLPKGRILSAAE 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutflmic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoteucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possib!e 
nucleotide deletion, ^possible 
nucleotide insertion 














QHLQKPAAKTEAEQIiVWWRDIiITESWEI 
GKI ITWGRGYAYVS PGL/NP S RHLKPYH 
ERMLRKRFREDPEDP PSCSHVKTDAEED 
PN 


1969 


15870 


A 


1983 


3 


399 


YSKLSFKGTLTKFRRIYSSSFYKEFQGC 
ISDLCHP/R/TLTNCIiRGELLKENIiVWG 
AFGCHPHF PCYYNKSQERNLIjQALRHPT 
TVAFGETGFNYSYKCTMPVPEHHKVFKR 
QLELAASIiEK/ PLVI LGQQADE 


1970 


15871 


A 


1984 


1 


405 


RRHIGGGVRLYYIGGEVFAKSLSDSAIF 
AQTPNCNQRYGWHPGTVCKI PPGCNLKI 
FNNQB YADLLDQSVNQGLEADY/ QLTRM 
CTILMSLIiKSGWGAEYRRQTATRTPCWIE 
LHLNGPLQRVDKVLTQMGYPS ILM 


1971 


15872 


A 


1985 


27 


452 


QGREHAQGGQSPGAGHIiGPTPEPQPEPQ 
PRPS SQAVPAGRWEPAQE/ PTRHPHPRL 
SPASRPLGSPPAPLVRSSPGRCRLHEHT 
IWSSTVGTSVEPAPSLGRPQAPLEPGTV 
TSSVRLQQPHMHTPGKIMPDPSKRNGKF 
TFT 


1972 


15873 


A 


1986 


414 


220 


GAEQEELLSP/GSGGCSELRSCHCTPAW 
ATRAKLRLTKQNKTKRESYRQGNQNWGR 
XLGELXiGRL 


1973 


15874 . 


A 


1987 


52 


412 


TRERKLFFACDNVWKHLKRYIiRKNSFGB 
NLWSSRNIKKKKKKKKKKKKKKTKKFST 
PKNFFFFKACX2CMGKKNTKHDFHEQNFI 
EHALLQQLNXNFSTNYIEGRVF\HPGAP 
IESLLLMCH 


1974 


15875 


A 


1988 


2 


143 


EKRRGLDKRTP /AQAAFEKMQEKRQMER 
ILKKASKIPPPFVCMWSVDS 


1975 


15876 


A 


1989 


3 


163 


TEFQPS EK/ WGEDLGDNTWEYI FAIDLL 
CCHQKWI CHPL FLVG WRAGAEV S GVF 


1976 


15877 


A 


1990 


1 


439 


DKTAAEDAIRNLHHYKLHGVNINEEAST 
NKSKTSTKLHVGNISPTCTNKELRAKFE 
E YGPG\ IECDILKDYAFVHMERAEDAEE 
AI KGLDNTE FQGKRMHEQLSTSRLRTAP 
GMGDQNGCYTGGKDGHWSKECS IDRSGR 
VADLTEQ 


1977 


15878 


A 


1991 


1 


145 


WAASKAMK/ MGDWKTCHSF I INEKMNG 
KVWDLFPEADPVLLKRLRESR 


1978 


15879 


A 


1992 


2 


425 


NTSTLKKTLESDCT \KLFSQGIGGEQAQ 
AKVDRCLSDLAAETNKFRDLLQEGLTEL 
NSTAI KPQVQPWINS FFS VSHNIVEEEF 
NDYEANDPWVQQLI LNLEQQMAEFKASL 
S P VIYDS LTGLMTS LDAVELE KWLKST 
FN 


1979 


15880 


A 


1993 


3 


449 


VAGPAPGAGARPGUDLQFLQRFLQ ILKV 
LFPSWS SQNALMFLTLLCLTLL\LKSFD 
QFTCNLLYVSWRKDLTEHLHRLYFRGRA 
YYTLNVLRDDIDNPDQRI SQDLERFCRQ 
LSSMASKLI ISPFTLVYYTYQCFLSTGL 
LGPVSIFGY 


1980 


15881 


A 


1995 


1 


410 


rtOnnnnTTVT I/Tnt^^miT»r , »T Vn^TT»T%TT\/'\T rmr 

S SRRPF IAIjK^KSMRDLNPEDIDQL I TI 
SGMVI RTYQLIPEMQEAFFQCQVCAHTT 
RVEMDRGRIAEPRVCGRCHTTHSMVLIH 
NRYLFSDKQMIQLPES PEDMPAGQTPHT 
VILVAHNDLiDDR/VQPVDRVNDSGFF 


1981 


15882 


A 


1996 


1 


154 


LFFFRIiLVRYTKKVAQVSTPTLGKVSRN 
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peptide 
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nucle- 
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ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HMHistidine, I=Iso!eucine, 
K=Lysine, JL=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R-Arginine, S=Serine, 
1 — I nreonme, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LGKVGKKSWKHPEAK\KRPCGKN 


1982 


15883 


A 


1997 


2 


475 


KKLKRQTRREAQKE\LPDKV\RWGLVTP 
SETPVGISNLSPFLGPRCL/QNPAELKP 
PWRFPWAQRPXAH\ERANAGPKTPAEQR 
KVKKIKKLKEDISQGVHISVYRVRNLSN 
PAK2CFKIEANAGQLYLTGVWLHKDVNV 
WVEGGPNAPKTITRLMPIiRI 


1983 


15884 


A 


1998 


3 


431 


QLRTRDRGWPSRRPEREKRTSQSARRPT 
CTESRWKSEEEVESDDEYLALPARLTQV 
SSLVSYLGSISTLVTLPTGDIKGQ/SPL 
EVSDSDGPASFPSSSSQSQLPPGAALQG 
SGDPEGQNPCFLRSFVRAHDSAGESSLG 
SSQA 


1984 


15885 


A 


1999 


1 


400 


ALDLRGLQILVGFPKRRVTTCSYPTALQ 
S P IB YQRKERSTAVMRTE PDS AYQASPR 
PYSAGPADSKKPTKGYCYNPTLP\RLEI 
MTLEGTTG 


1985 


15886 


A 


2000 


1 


372 


QNIDLVISFFSSRIiliQAGAELSVERVLE 
I IKQGWALPKDRL/RGS CAPTLSAAGR 
SSGGQSPCMPGLCVCSFWVLTVSWLVCQ 
KFPELKFKYVBEEQPEEFFI PYVWSLVY 
NSAVGliYWNPQDI 


1986 


15887 


A 


2001 


393 


1 


GGIGRGGGAGGGVGAAGSASGGVGRRGA 
GGVIADSGAPGGGVEGGVGASGGWRE/G 
RGTSGGVGGSGGACGSV/GGSGGAGGGV 
GACGSTSDGVGRSRGTIGGLGGSGSAGG 
GVGACGGASGYVGIRGAGGG 


1987 


15888 


A 


2002 


2 


362 


WVTFISLIiFLFSSAYSPGVFRRDAHNSE 
VAHRLKBLGEBNDKALLLIAFAQYLQQC 
PBT2DHVKXLNEVTEFAKTCVADESAEDC 
DK\SL\HTLFGDKLCTVATLRETYGEMA 
DCCAQQEPE 


1988 


15889 


A 


2003 


2 


358 


EANRGWF IRLKEGSQLYNIKVQGEAASA 
DVEAAAS YPEDIiAKI TDEGGCAKQQI FN 
VDK\QTAFS WMKRPCRTL IAREEKSVPG 
FKL/SKNRIALLLGANAAGGFKLKSVLI 
CHS ENS RTF 


1989 


15890 


A 


2004 


190 


1 


DQTCFLS FTVKAVTFNGWVWWLTPVI PA 
LWD YRH \RP ANF FVFLVE TG FHHVAQAG 
LKLLGS 


1990 


15891 


A 


2005 


1 


132 


GMCHHAQLI FVF/CSRDRVLHGCSQTPC 
IjKQF SCliGLiPKCWDYR 


1991 


15892 


A 


2006 


2 


134 


PMTFFTELEKTTLKFIWNQRRIQIAKAI 
LTNQK\NKARGITNIC 


1992 


15893 


A 


2007 


315 


127 


SEIAFFFCSFLKIILDTR/FSFFARAGL 
KLLASNDLPSSS SQGAGITGVS YGTQ P V 
CFEYNVG 


1993 


15894 


A 


2008 


3 


325 


RCSMLAVREM/QKATKRCHFLPTRIALI 
K/ND/GSNKCWK\HCWWECKMVQLLWKI 
VWQFLKKLNIELPFDPEIPLRDIYPKQL 
KTYVHTKTOGQMFIAVLFIIjNVPHIMNV 


1994 


15895 


A 


2009 


369 


3 


vgqaglefltsgdppasasqsavitgms 

ahlslpsswdyrcmpphlanfyfcrh/r 
vccpgwsqtpglk\yppalvsqsvattg 
tsprawqdtilpv 


1995 


15896 


A 


2010 


114 


287 


aplclclrhlrllikrltvlgteahtlV 
npittrgrggqitwgrefetslanmvkp 



666 



WO 01/64835 



PCT/USO 1/04927 



SEQfl> 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 
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peptide 
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ed end 
nucle- 
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correspon 
ding to 
i«oi ammo 
acid 

i~ni_ir1iin nf 
I Co III tic VI 

peptide 


Amino acid sequence (A=AIanine 
OCysteine, 0=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
\^=vji uia mine, tv-/vrginioe, *>— oerine, 

* JL III CvIllUC, Y t allllw, 

W=Tryptophan, Y=Tyrosine, 

U 11 tvll If W 11 ^ — CUUUIa, / UU&oll/IC 

nucleotide deletion, \=possible 

llUuCUUUC 1I13CI Uvll 














CF 




1J07/ 




ZAJl 1 


Z Jo 




IMKFNI*IjKFYF I TLFSROSIi / SS VAOAG 
VHWRNLGSLQPLPPKF 


1997 


15898 


A 


2012 


310 


1 


SSSSSNTHFXSIPKYLINPDTCFLAyKVN 
NSSLTGliEYTBTLKPGIK 




1 <OQO 
J J099 




2013 


3 




OUUuiiaWyivUOUU Vv7C^ILir/\KlVJlXXJl~V / 

CNSLDLIE " 


1999 


15900 


A 


2014 


332 


3 


LiRDPLEEAVCP FSDLQLHAGRTTALFKA 

VfO^xjn ijjXj^rtljXiXi or \ v V-Uv_ Iri-Wxr X 

RGRQAG\SLSCGGLHPVRASRLLCLPKQ 
AWAMAGAPPPAWPRPCSLISDCCASNQR 


2000 


15901 


A 


20.15 


2 


325 


LLLHHAPPVNLFLRDRVSLCGPLCX3PGC 
SQTPGLKQSSCLSIjPKC/WDYRXRATAP 

\7jJ 


2001 


15902 


A 1 


2016 


373 


3 


WAHGLQPGKPKOTSLSKKRKKKKGKKLN 
TFPLRSRTRQACLI*SPLLSNIVLEVLPN 
EIRQ/QKKEIKGTHTGREELTLSLFTDN 
MI I YVDI PKQSTKKNQGSYS VARPGAVA 

WJVr*MP QTT JT/tlT/Vl 


2002 


15903 


A 


2017 


343 


1 


EFPFVSGSRATGKSSDIRATKYIWRVIiE 

VT.PZUflPPnnPDT.JTQ Q\ HT^T.T J^QVUPflA 
X XirCfrvW rK.VJsj.KXv 1 1 IV ■> o \tlXoXjXjuoXtlJruxl 

FRGDKWSCCHQKDETGGRGEDEVLLCCP 
GVLGCSALCRSGVRTKFGIRGRPWKERE 
R 








2015 




o 

z 


xrc T <5 TJirKTTTn^WQ^MVaRaPNP^Tr.PrtPn 

INC? J- O X XvP* XINXHVVOv7l J VAIUiTli JrO XXJt\V3X\.V3 

GQI \RGQDFKTSLANMVKP 


2004 


15905 


A 


2019 


3 


322 


ARELVFFFGAYRKGFFHRDMKPKNIxLCM 

lair JMaj JXJ^nLJV \jLu\Js. Cm i-l ivj JC1 xr Ir 1 X UIuo X 

KWR/YRGNPALLRPTPMAKFPYEGWLHI 
SGSSIWNIKTSDSNFIFFKESKG 


2005 


15906 


A 


2020 


205 


377 


NIVENIWCW^VCFLOTCTVCINPE-TS 
DE/WPGAVAHACNPSTLGGQIXSQITRSG 

'HPT? 


2006 


15907 


A 


2021 


3 


324 


KKWGKR \LNRAPNGRRYPETRWALEEDQ 

nJTPlTPT lITTfT^7PTTUr v T T DTPM2i'K*TW r r*rri 
\_rlJLl~fULLLElXi\X VlCC xr XrCTXrtJS.Xrll If 

x^SWPRCGTGTLIHOlRGCKTV/R/PL 
WTCTVRO /PTj /P.fCt*NT PTiPCDPAVLSTiCT 

VRy/ C LJ/ f\l\J > 1 >< -L tr BIT V^L/XTX*. » 1-KjJJV— X 

Y 


2007 


15908 


A 


2022 


2 


382 


RVSQIXSLNliTSRSTRLGLPK/ CWDYRC 
EPPSP 


2008 


15909 


A 


2023 


1 


421 


RWNPGGRGCSELRLHYCTPTWVTERDS I 
SKTAKNTt/NNTCRPRNNC!R 


2009 


15910 


A 


2024 


339 


3 


SWDHRRVLIFVFLVQTGFCHVGHAGLEL 
I»TSGSOSAGTAGVSHPA\GOKHOFRPFH 
RFLKFGFVFRDRVSLCCPGWPQTPRLKQ 
SSRLSLPKSWGPATALDPDSSYLEEMVL 

s 


2010 


15911 


A 


2025 


2 


146 


lTrFGRSRQEDHLGPGGQACSELRSHHCT 
PAW\VIEQDPVSKKKKPPKP 


2011 


15912 


A 


2026 


3 


380 


RhECSGGISlHCNhR/LPGPKRFSCLSh 

PCPWTYVTPT.D /Pki/l^f ATP f TY^ PITH T/lfY&f2 

LEIVLTSGDPKCWDYGCXDHCTWP 














DYMNSx^IYFHSLVTiASTDEGFLPKTVST 
QSAGITGISHCARPWIPF/CFFFFKNRK 
TRFVAQAEGQGGNFGSLNPLPPGFRGFP 
CLSLT 


2013 


15914 


A 


2028 


2 


187 


FTLLPRJ J ECSG\MILVHCSLlNXPGLaRWS 
SCLSLLSSWNYTCVAPFSIFYYFIFLTW 



667 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
. sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 
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amino at-ia 

IC3IUUC UI 

peptide 
sequence 


Predict- 
ed end 
nucle- 
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location 
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Ulug CO 
Inst ominA 

acid 
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peptide 

sequence 


Amino acid sequence (A=>Atanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, I=Iso leucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P^Proline, 
v^=Vriuuiuiine, iv— /vrgimoe, o = oenne, 

W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stop codon, /= possible 

fl lif» 1 onti fl A Hf»lftinn \ — nnccid \c* 
UUdCVllUC UclCLIUU, \ — pUbSIUIC 

nucleotide insertion 














GGTLNC 


2014 


15915 


A 


2029 


197 


3 


AAGLSQAGMQWGDFGSREHLPSRPQRFP 

L.iJlVVJriNiNWl^IKlliar'irKJr voir r e or \1jV 

BTGFHHPGQ 


2015 








fift 
oo 


jOZ 


I\J7 V3JLrilJA^JVI!«JVI\V9xr^JM>i<^^II? C ir X V Lit? r r 

FFBTBSHS ITQAGVQWRDLS PLQPLPPG 
VSLCWQGW 


2016 


15917 


A 


2031 


7 


131 


GTIMAWAPGEGGCSEPRSCHCTPAWVT\ 


2017 


15918 


A 


2032 


1 


343 


LEGSE PRLCHCTPAWV/ TGDS VS KKKKK 
KKKK 






a 
j\ 








y iUjAr W ir FUfaAuJNfi KK V Fir xi FAoMAr jj X 

KIEKTALKFIQNHKRPQ1AK\TILSKNR 
VGSITLFNSKXHYMTTVTKTI 


2019 


15920 


A 


2034 


238 


3 


FLALPKVLGIIGCVPPHPASMAPLTTIE 
DTALKFIQNHKRPQIA\KPILSKNRAES 
J. 1 LufnaAXRx Mir 1 V 1 JvfJjKV 1 fa T 


2020 


15921 


A 


2035 


345 


2 


LVKI QKEMNVFMTANP I S I PQ FMDQGVI 

fa fa LAo 1 1 Jr KiNiVX Xj I V \ IJlAmU 1 U f SSDii* 

SWQSEWKTFWKGFI ILDGIKNTDDS WEB 

MKT C.TT.TYZT KTCTfT. T PT gTnTWITE 1 PlfTT .\T 
V Mo 1 JJ luJL IVJVIVJJXJr 1 d 1UUC I\JE»ff JVX Jj V 

EDVT 


2021 




A 


ZU.50 


i 
i 


1 A(\ 


vjKLL \njSxjKfanl1A~l irAn/\lK/\lsJjJMUSJSjx 
KKREKKTQKGKNGLGFWAF 


2022 


15923 


A 


2037 


199 


348 


RSSNEGGRDCVFCFGDRVSLCLPGWSAV 

tvULtrtLt lAlo loU \VlwXJorlxAjJj 


2023 


15924 


A 


2038 


3 


193 


NGIjNAPIKRLRU^ 

/injj x \»-jcviJ x ri / i\.v»» i xjJSX? / wiisjhi: 


2024 


15925 


A 


2039 


116 


337 


skllkilprlcwgwwqapvipatqevea 
edhlnpg/rs rp awatqrtpvs IKKKI I 

TiPPaNnir/r'WfiPC'w i 

XJl\jrruMV9JV/ V^fXwJroVI 


2025 


15926 


A 


2040 


368 


252 


VFFFFALFYIiFFFFFFFFFFFFFFFFFF 
QFLGFYFRFKF 


2026 


15927 


A 


2041 


204 


399 


VSSHKINGLTVCSTSPFFLSLLPPSEES 

tif/vC LJlUL.X\i7irxVAfa/faVMxjF 

L 


2027 


15928 


A 


2042 


103 


356 


WHFSPPQPLPPP/PLPNPPPPPTPPPP\ 
PPPPPPPPPPSPPPPPPPPSPPPPPPPL 
PPPPPPLSPPPPPSTPP/PPPPPPPPPP 
PLPP 


2028 


15929 


A 


2043 


3 


344 


LYKWNNKASxjAAHLFAAWFTVYFKPTVE 
TYC/SGKKKIPFKILLLIDNTPCYPRAL 
LEMCEE IS I VFAPATTTS S /LKPMDQGV 
IVTFKSlTnjROTFQAGGGEKKEKHERKK 
NIIS 


2029 


15930 


A 


2044 


2 


349 


PRVRKSPGPNGFTANFYQTFKELISirlT 
KLFQKKKKIKKGENPPNSFYGAIIPRIP 

FxjAGHFKQSFGREIHHLXJREFIPGIQGG 
FNIGN 


2030 


15931 


A 


2045 


280 


462 


CXFFLVVVLWCVVlxLFXVVVLWLCFFG 
FWCCVVFCXCGWFFVFFWCVLCGVL 
IiGCWC 


2031 


15932 


A 


2046 


3 


284 


PSPSFSLLLPPSFSLLFPPSF/SPPPPS 
FSLIjLPPSASxXLPPCTSLIjIHPPTSLQ 
LPPLPS F YLLLPTS I SHHL PTHNLPTTS 
IQDPSTPCSIK 
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NO: of 
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SEQID 
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M 
eth 
od 
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correspond 

ing to first 

amino acid 

residue Of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Ala nine 
OCysteine, D= As parti c Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 

rV-rinlnininA TQ A Q— . Carina 

V2=ij!utamine, K— Arginine, s=oenne, 
T=Threonine, V=VaIioe, 
tt — irypiopnan, x— ly rosin e, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


2032 


15933 


A 


2047 


95 


1 


GRVDAIIAHCKLR/IjPGFTLFSCLSLPS 
TWDC 


2033 


15934 


A 


2048 


211 


I 


tri iPl L\wuvWv»oU 1 1 1JM irliyiljWoJjKXW 
VRVDPFFFFCTDR/SLSMLPRLVLNSWI 
PAIILPHPPQVLGLQ 


2034 


15935 


A 


2049 


35 


266 


EVRRPSTWQPPRLRSEEPLRL\QPPRLG 
v-»t/"''e , /"*ttt rr*m\r*T v~r r Teem "D*tt lone* 

GIT\GHSYRRASEEDKKESSMSS 


2035 


15936 


A 


2050 


281 


3 


QSNFKRALLSILLKAIYRSNI IPIK/VP 

LSKKNKTRRITLSDFKLYYY\ IAVFVAA 
WYWKKNRLSDQWN 


2036 


15937 


A 


2051 


251 


12 


ILRFFLPKFLGFQKPPSIiEKTHFFLPAF 
GNSIiFWPPjKKljGyKJvVrrr r r 
L/NSVSQAGVQWHNIiGLLQSPIiPRLR 


2037 


15938 


A 


2052 


2 


325 


AUHJjKi^vyj^pyp^jyxvt»iv.i PoPIjJSJIIsk. 
\LAGCGGGHPRLREENCLNPGGRGCSEP 
RSRRCTPAW/VNDS KTLARKKKKKKGVE 


2038 


15939 


A 


2053 


3 


166 


SLLLPRVECNGAI S AHCNLHL \HNLHLP 
GSS6SPASASQVAEVRGSLEPRSSSLA 


2039 


15940 


A 


2054 


227 


3 


LNENIGRITGMSHHARLILLFCE/YRVS 
A lyAl* V/ yiiJL/Jk , vo±jyP±/±*K Vf JSJ*£r £>llr o 

LQSSWDHRCAPIRIAIFWVFC 


2040 


15941 


A 


2055 


301 


0 


QRKSHMFIjTINQKljEMIKJjSBSGMbRVc 
TGQKLDIMC/QVSQAVNAKEKFLEKIKG 

irrcvtrmwrn) vhqq Twvnivn?in/r.i t wwtt? 
l/1 ova iyn j.xcruo o xjn v unsut^ v ui vnic 

DQAP/HNHIPIiSHLLMRAR 


2041 


15942 


A 


2056 


1 


109 


OT">T T3T3T DnDMOT WDr** GD/^VCC 1 t)I?T C* /ITT 

CTPAWAT 


2042 


15943 


A 


2057 


313 


250 


PQPPPPSPPPQPSPPPPSPPPPSPSSP/ 


2043 


15944 


A 


2058 


134 


2 


ekesrsva\rlkgsgaisa\hcnl\clp 

HQ. CMCDD C A CD VZk C* Q 


2044 


15945 


A 


2059 


120 


326 


NSLVADME/KVLWWIEDQTSHNIPIjSQ 
S/LIQSKALTLFNSIKAKRSEEAAEKIF 
EASRDWFMRFKXKQK 


2045 


15946 


A 


2060 


102 


413 


VGIT/VHEPRTRPG 


2046 


15947 


A 


2061 


311 


— ; 

1 


FSGPTPLKNGVGFSPGPKGGFFPPPPGG 
FPAGRPKWLTRILGKGGSLLRG/RYPKK 
nFTiTfP T TiGDNS PORAPKRGP 


2047 


15948 


A 


2062 


229 


380 


WYDLGSLQPMPLRFKQASRVNLPRSWDY 
RHPP^RLN\IWIVDTGILHVG 


2048 


15949 


A 


2063 


142 


383 


tyoLr o xrir* yixLiy r^S^c*ii 1^4.0*1 v ovlvli 
S\VCVCTCVCTCTCAGAMCVCAGA/CFC 
VOTC^^/CIjCVCVGA/CLCVC 


2049 


15950 


A 


2064 


408 


200 


NLIQIKALTHFSS IKAERGDKSTE/EKF 
EGSRGWFVRFKERGHLQJ I KVKHEAANA 

V7M77i 7V/^OAT CT3DCC 


2050 


15951 


A 


2065 


348 


3 


WVSPYS PCVCVCVCVCVCVCVCRDWVS P 
v^.v-.ir v> v uiiuv cvuv \^i\uri / j. vi/ Uny vv. 
VCVCRD WSPCCPGVCVCKDWVS PCCTS 
WSILKLLSSGOTPTTVSQSAWITGRSHS 
AWPARA 


2051 


15952 


A 


2066 


73 


286 


NLIRGLLESHILISMRYGYCKSYTLSMR 
DIPBALNKWKS /IPMFCSWSRRL IVSMA 
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OCysteine, D=Aspartic Acid, 
E=Giutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Vp^jrin (amine, k— Argintne, s— Serine, 

i iirtjviiiiic, v— valine, 
W=Tryptophan, Y=Tyrosine, 
^ — u ii Hii own, — oiop cooon, /— possiDie 
nucleotide deletion, V=possible 
nucleotide insertion 
















2052 


15953 


A 


2067 


329 


0 


PSSTFLLPFPRPPPSPPPPPPPPPPSPP 


2053 


15954 


A 


2068 


221 


3 


ELKVILHCLRDFSLQSSIMKVLILKILI 
LSVCCVC/VCVCVCAVCSCVC/VCSCVC 

/ V^O^VV^U.VijL.VwV^VlioV. 


2054 


15955 


A 


2069 


I 


167 


GTREMPLNPGGGGCSEPR \ SCHCTPAWA 
TKSETLSQKKKKKKKISNYKTPFKSYRI 


2055 


15956 


A 


2070 


2 


354 


ARACLGLPS S WD F / S VBNRFHRVGQAGL 
blxATS GDP PTS AS PECWDCRH\ DHHTW P 
LL 


2056 


15957 


A 


2071 


20 


341 


CIVTVNTRGENINICWSPDGQTIAVGNK 
DDWTFIDAKTHRF/LKQNSSSS /SEVN 
al i> wNNDNNMlj YPUTGNGCINIIiSyP I T 
ESRAIYQRPIFHVHVNQDLPHGDVLS 


2057 


15958 


A 


2072 


353 


58 


LQLLTTSDPPASASRGAGIADGVWFTQ/ 
SLNGAQAGVQWRDLGSLQPHPPSRIi/LIi 
ASQSAEIAASARPPPRLGS EERLCLAAH 
RLGCEEPLCLAAQSGK 


2058 


15959 


A 


2073 | 


1 


338 


GSRLQRVCINYRLSFFPFLSQGWINFTW 
LFCLCVCFLRDRVSLCCPGWPSTSGFKR 
SSCPSLIjSRWDYRHMPQHIiASHTLif 
/TILPR 


2059 


15960 


A 


2074 


1 


322 


GGGREAGEARGGGEGGQGSGRRRRGRGG 
PRTCGAEGGRGAGETPGGGARPEREQGR 
GRHSERQGPT/RQTKRPKTKTKQLSQNK 
NT 


2060 


15961 


A 


2075 


450 


225 


TPVGRGCSELRSCNCTPAWVTD\ETLSQ 
KEKRGGVKIGWKKRRTRI ISLHLPGSHE 
KFNNLETVKNCVNHFCRLNT 


2061 


15962 


A 


2076 


2 


470 


TPQNKPHPTTKNTQPQPTQKK/ PTQTQP 
/TTPKPKTKKNTTPNPAPPNTQKNHTTK 
PTTQNPHKTKQKNKTKPNTTQQQNKPTN 
PK. I vTQQKl i lwTKPPKKKKKKTADT 
TSPNPISTKKIKKXiAEP 


OA/CO 




A 


2077 


304 


1 


NSATPPCSFTA/ KPHPTPPTPPPTSFIP 
TCQHSPPTKICPQARPRTPPYAHPPRCP 

PTTT TPimPPHDT.lXDDrHiaPPriT.T^WTrDT 
K J\x J. ir Auwuir iinir rUrwu&luJljNXxilJr J. 

IWRGRVCVCVCVCVCARA 


2063 


15964 


A 


2078 


167 


i 


TILQTNSTWSNVLLWQGAVAQAC\NSST 
LGGQGGRITRSGDRDHPGQHGETLSRA 


2064 






o/vjo 


J 


1A. A 


PP r PPQPH"Qr'2MJr , Qli2iT.WP2iM7Ti QQrDCC 

/IPVTPPQCLYWP/WKVPLQCPPDL 


2065 




A 








RTPRRLCKSQLDRCSPLKERHKFGLSRS 
VLSAMTQSGIY/WQPPPPEFKRCFSCL1T 
LLSSWDYRHAPPRRARA 


2066 




A 


ZUol 


4 


w ■ — 

J ZD 


ftnTTY^MQWPaWT.PT.VTTTiW !T2iPT\ VPT V 

rtu i. i. vji»ioixrvi-iVi JjX: Jj x r jluN i\r /vr l \ loij V 
LNFFLHKIQEPSLG\SGSGP/LSCNS 


2067 


15968 


A 


2082 


1 


343 


PPPPPPPPPPPPPPPPPPPPPPPPPPPP " 

tf r if if r f if tf ±r xr r ir x* t* ir f ±r ir tfxrlrjrtrirctrtrir 

PPPPPPPPPPPPPSPPPPPPPPPPP/PP 
PSSPSSPPPPPHTPHP 


2068 


15969 


A 


2083 


330 


88 


ACDRSIISRIHKEIi\EQIiCKQK\SNNPI 
KKWPKDMI RYFSKQDVQTVKTMNKCSTS 
LI I I EMQ IRSTMRYYLTTVRTPHPS 


2069 


15970 


A 


2084 


1 


340 


RVRSHGTTHLAQLI FVLLVQTGFHHDGQ 
DGPDIiL/NLVTRPPQPPKVL 
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add 
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peptide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G^GIycine, H-Histidine, t=lso leucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /=possib!e 
nucleotide deletion, ^possible 
nucleotide insertion 


2070 


15971 


A 


2085 


2 


348 


ARALSLGANAAGNFKLRPLLRYHSQNPR 
AliKNYAKYTLPVLHRWHSKSLETAFTKC 
FTEYFKPTVBTYCSEKD/l /SYKAT.T.T..I 
EKALGHPRALMEMYKDI S I VFMP VNTTS 


2071 


15972 


A 


2086 


369 


2 


KGLPPSPANLPFIiSVY/KGPPMFTRLIL 
ISCPCYPPPSASPSAGIKGLNPPGWPPF 
SFFYQKFVRFFVLARNRTFGFVDYFSIL 

VT TVOl^eXIT VDOT T C 7\ T WVXTCIT t7tn?T7T 

IijiyCyV«oWlixKr JULio/Ujx r r Vovr Vi? vlj 
TQGIEAMVPRA 


2072 


15973 


A j 


2087 


314 


3 


LLESVGPTRNSRPFKGLIGRILLDPEFY 
PSLVSDDJjPASASPNftGKIJjPrr rrEib 
SHS\TQAGVQWRHHGSLQ\IiKRSSYLSL 
TSSWVYRHTPPHPANICIFSRA 


2073 


15974 


A 


2088 


330 


16 


CPCFFLSALSVLVGWCFAFWCGCV/WW 
CVCFVVVFVCVCFLGCVVLCLCFV/VCL 

V Cj V CA* V V r rViVa J-i*-V- v LX. 


2074 


15975 


A 


2089 


1 


337 


GTRTFLPPSYiODPCEY/IWAHPDNPGSS 
SNOmLNFSTSD/PPVHSSGNWKVLSSP 
NRPYYYSYTATPHTDPTPHLPSPNPSSP 

OnOVOT T CnCTT ^rtTTTJTTTO TTC CtJTT 

oroi r-blibiJo 1 KJyl lrl 1 lFllbbnlLi 
LTS 


2075 


J5976 


A 


2090 


350 


84 


QRKBNKRTRKRATERRDESREEKAGRKE 
GENQEKRNKEQQGRQRRSRDRT/EEEKE 
EAKRREHKNPKKKKTKPPPQKKKKTEKK 

T> fill >XTT flUT 

KilJNIiSN 


2076 


15977 


A 


2091 


i 


355 


SDPPTSASQSAGITYVSHRTWPLLEFSG 
TS IRLAGKPAGVLVEVTGK/ SVCGGGVT 
KTHWNECHTGYPKCCWSSQAGESSLQPP 

nivoimiJt'riT pt ticp t»Tr\v"D T t dd\ rxKtvf* 
PPGr lUiFSUXliiJPboWUIKxlJJPt , \yiNlrL. 

IFSRDGL 


2077 


15978 


A 


2092 


27 


345 


ASIPCLKKRKKKKKKTGGKPPP\GGPKP 
KRGGGGGPPPTKRTLFGPPQGNQTPGGV 
S/GPPKGDPPrSPiPPuUKK5rPr EtKovr 
KKGPPPWRGVKREKGHFLKNFQQK 


2078 


15979 


A 


2093 


3 


301 


HEHVAQAGLKLLGSSDLPTLASPKCWDC 
KR\DYCAWPHI FP ISGPLYVFFPLPRMS 

RAQFQCHFFKGWPDP 


2079 

- 


15980 


A 


2094 


76 


342 


WFXXFFFLLFVFFVFWFVCFCVFCGFFF 
r vjt n VA-.V tic v c\ac v~ur uurur wr\.r rvr 
FGFFGFWFFVFFLVCFLVFVCFFGCFFF 
LFCFF 


2080 


15981 


A 


2095 


115 


325 


MDERKKIRGGGRQGKECKIHCKKKLSPG 
IRSYPVEN/F/VDTMYDYLQPAYYKLND 
LTNADPCAVRYLLFDQN 


2081 


15982 


A 


2096 


80 


227 


HREREREREREREICICMSVYA 


2082 


15983 


A 


2097 


308 


1 


NSTVTMENSVNIHYRTRVFTEAQFTIAK 

QWKmDVYT>QTT T7WT7VTA WTVTWnTVT 

CVCVCI CVCVCVCVCVCVC I Y I YMMEYY 
SAIKRNEIAAFAVTWTRA 


2083 


15984 


7\ 
£\ 




1 




ARACLGLPSSWDF / S VETRFHRVGOAGIi 
EIATSRDPPTSASPECRIX3«I\DHHTWP 
LL 


2084 


15985 


A 


2099 


1 


221 


LLWRLRHENHLNLGGRGCSEPRLHHCTP 
SWMTR/ GKTPSQKKXTKQPMEWQNLCLK 
KVFGNTGVRE I FNG I KLS 
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M 
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od 
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location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Add, 
E=Glutamic Acid, F=Phenylalanine, 
OGIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamme, R=»Arginine, S=Senne, 
T^Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X=Unknown, *=5top codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2085 


15986 


A 


2100 


186 


2 


ERVYVCVCVCVCVCVYQCYIiPQIRSL/V 
AS SLKANARVCV CV CVCVCV CVCVCVCV 
SLWHPLTA 




1 

ijyo/ 




OlAl 

ZlUl 


305 


2 


HNTLLSQSL IQS KALTLFNSMKA/E/ RG 
EEAAEEKLEASRGWLMRFKE\RSHLCN\ 
TKvQGEVAS ADTEAAP S YTEDHSKITDB 
GGYTKQQ I FNVDKKAS YWNMS 


2087 


15988 


A 


2102 


6 


166 


EQTALEILARAIRQEKEIKGIQIRKEEV 
KLCFFVND\MVLYLENSKDFI*KVDAA 






7\ 

M. 


2103 


53 


176 


EIKNNRPGMVAHAY/NPSTLGGRGRQIS 
WGQEFETSLVNMVK 


2089 


15990 


A 


2104 


3 


267 


FRHVGQGGLKVLTSGDPPALVSQSAGIT 
GVSHCAQPIVGDFNTPLSIL/D/RSTRQ 
KINKDIQDLNSALDEADLLDIYRTLHPK 
/STEYTFF 






7\ 
n 


2105 


268 


3 


PPKEHGSSPATEQSWMENDFDELREEGF 
RRSNYSE/LREDIQTKGKEVENFEKNLE 
ECITRITNTEKCLKELMELKIKARELRP 
ECRSLR 


2091 


15992 


A 


2106 


3 


170 


GFHHVDQAGLELLTPQVTHP \LGLPKCW 
DYRREPPCLASPHFHQIAISQKRHREAK 


2092 


15993 


A 


2107 


1 


398 


SARGPDGFTAEFYQTFKEELVQ ILLKQF 
QRIKGEILL/KNHYVKPSITLIPK\PG\ 
RD ITKKLLTRSF CVS LCPVLS PLQS LQS 
RPSSLSMISLHSVCFSVLSASAHPHVHL 
CPTCPTLVLSGSHCVCCVSLFFF 


2093 


15994 


A 


2108 


3 


370 


HENWNNKGWGDSTIYSMKYLYFKPMLRP 
Y/C/ SQKKI PFKILLIiFDNAPGHPRVLM 
E IMYKDMEVF\MPVNTTF/ILQPMDQRV 
ILTFKSYYLRNTFHKTIAAINSDYSDGS 
GQSQLKTFWKGFIVL 


2094 


15995 


A 


2109 


1 


213 


HFPVENESAPG/FKAAGDLLTLLLGGNA 
AGDFKLKALLVYPSENPCFLKGSFKPNL 
PLVWCSHKKAWVQLG 


2095 


15996 


A 


2110 


391 


3 


K3CKKKNPHRKKIKKRWKXIFNAERNKKR 
AGVI IHI LDRLD FKQQTIRRDKR\GYYT 
MINGTIQQEVTTILNIHAANTKAVRYIK 
QVLLKLKTELGPNTIITGDTNTLLLSTL 
NRSSRQKNQQTLDLICNI 


2096 


15997 


A 


2111 


188 


365 


FQNT IHI CVCVCVCVCVCVCLy CVCFCV 
FFFCLLCGGFMCGCWCDFCILFCFYGVG 

nnnr 

iTctrLt 


2097 


15998 


A 


2112 


3 


340 


RMESALDRLKQFTTWGDTGDFHAVDEY 
KPHDATTNPSLILAVAQMPAYQELEEEA 
IAYGRKLGGSQEDQI INAI DKLSVLLGA 

PTT trtlTTVttTrPWOtffMlTiT f» I'll » /nnminrn 

E IXiKNI TGRVSTJwlJARLSFD/ SDAMVA 
TA 


2098 


15999 


A 


2113 


11 


305 


FLFTDFCLFMTHILGHKINYITN/CKRN 
VIIT/SYFSPHNRIKLKISIRKISRKSS 
NTWKLNNRIjIjHYPQIKDEVSREIRKYLE 
IiNINENTNF/QNLWDIHK 


2099 


16000 


A 


2114 


3 


387 


QTNH/NIPLSQNLIQSKAITA/NCMKAE 
RSEEAAGKFEASRGWFMRFKKSS Y\ IKV 
QGEAASAGVEAVAIYPDLAELIDE/GCY 
TT/QI FS VFQTAF FWKKKPSRTFM /REE 
iCLI PGLKASKDSSSLLLRVHAAGD 


2100 


16001 


A 


2115 


152 


393 


VYCPICWLVFFFCFCSVLILFVMFV/CL 
CFSFFCFLGFWVFSSFYLFICVFFFIV 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H-Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 

T^Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














LGCLFLCWFDLFLFVLFVLCLFFVS 


2101 


16002 


A 


2116 


2 


373 


ARACLGLPSSWDF/SVENRFHRVGQAGG 
EIiATSGDPPTSASPECWDCRH\DHHTWP 
LL 


zluz 


loUUJ 




Zl 1/ 


o 
o 




" " I lnMI '" ^ XXC< V tiO A J, 

I PKDQKYS FLHDSQTSFCFSDS I PTPFN 
MEETOOKSNIjELIiRI SIiLLIES WLEPVR 
VLRSMFANNIiVYDTSDSDDYHFLKDLEE 
GIQTLMGRAEKRHCRTVQNLKQTYRR/F 
DTNS/HNHDALL 


2103 


16004 


A 


2118 


146 


15 


FFFFYFFFLFFFFFI PFLFFFFFLFFFF 
FFYFI FI LFI YS LLVF 


2104 


16005 


A 


2119 


406 


3 


LFSVNETGFYSKMLSRTFTATEETSIPG 
FKVSKDKLTL\LGANAAGDFKLKPMLIY 
HS/ENPRALKNYAKFTPPVLYKWKNKAL 
VrTTKTiFTARYTRYFKPTVETY /TIiIJ.TD 
MAPTHQRALMEMYKE INVFMPANTTS I 


2105 


16006 


A 


2120 


109 


396 


YYFYFYLVLFYFFIFLFFYFFFFFILLL 
FYFFFSFIFFFFLFVYFIFLCYFFLFFF 
FFFFYLFFYSFFFFFI FIiFFFFFFFFFF 
SYYIiI FFLiS YI I 


2106 


16007 


A 


2121 


15 


413 


ivlarntnfwlsfltfpvalgilivlikgv 
kyi fwple ycqrlkmfvs ys fhy\ fflg 
sllflkygfhmylilll/cifi i imcff 
ikysfffcclyhfffsfylfflyflif\ 
cylvilffsfllflflssycfff 


2107 


16008 


A 


2122 


1306 


429 


SSSSSSSSHVLRIIKDEDFKIIiEQRQW 
LSEKBAQALCKEYENEDYFNKLIENMTS 

JjtrOUriJJ V 4JjJ£\JJl.il7JJU * WIVW*J**-'* *>> A Villi 

AIEYFPESLCAQFAMDSLPVNQLYGSDS 
LETAEREIQHFFPLQSTLGLIKPHATSE 
/HKRGPSMVMILTKMNAVAEWRRLNK3PT 
DPEEAKLLSPDSIRAQFGISKLKNIVHG 
ASNAYEAKEWNRLFEDPEEN 


210S 


16009 


A 


2123 


3 


206 


LRRLRQENHLNWRGGGCSEPRSQHCAAA 
W/VSNSETPQKKKKKKEKKKKKKNLPNS 
ALKKTYSQRGKLF 


2109 


16010 


A 


2124 


23 


401 


IASGRPFFFFFFFFFFWPPPPGGYPHFS 
FLKKKKKRGGGGGKSLLPPGKGPNPPK\ 
WGFPLFPP PLFPQKTPP PPFFFYKTPP I 
PPQGPRGGPLKFTPPGGGGGNPPTILLD 
KKRGVLGPPPFFWTN 


Zi 11/ 


lOvl 1 






J 




" MFDVSLLTFTHSLFHFS PQFHRKCELST 
LCDGGELRDHILLPTSICPITR/DKCSC 
PGEGC 


2111 


16012 


A 


2126 


1 


213 


HFPVENESAPG/FKAAGDLLTLLLGGNA 
AGDFKLKALLVYPSENPCFLKGSFKPNL 
PLVWCSHKKAWVQLG 


2112 


16013 


A 


2127 


104 


419 


NSFFFEEELYNPFGGLGKKTFFGGGEEF 
GHTPPENEALGGKNKFFTGEGGQTFSNN 
GEEKSVS FWISIEKILHRALL/AHALCK 
NCWELNFGQKEEPFFPPPEEF 


21 n 


16014 


A 


2128 


109 


140 


PPPPKNFFFPPKGKFFG/RG/VGPKFPP 
PKKRVFSQKPPRGFFYPPLKKKKNPFPP 
PGNFGPPRGFFKRPPPFFFFFFFFFFFF 
FFFFF 


2114 


16015 


A 


2129 


10 


457 


iCTSWTWCPVPWPATQKAE/AGGSPEPG 
RSRLL 
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Amino acid sequence (A^Alanine 
OCysteine, D=Aspartic Acid, 
E^GIutaraic Acid, F=Phenylalanine, 
OGlycine, H=Histidine, l=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, Y=possibIe 
nucleotide insertion 




loOlo 


7V 

A 


2130 


409 


0 


PPPPSPPSPPPPPPS/PSPSPPPPPPSP 
PSSPPPSPPPPSPPPP/PPPPPPSPPPS 
PPPSPSPPPPPSSPPPSPPPSPPPSSSS 
SSPPPL 


21 16 


1 /Alt 

16017 


A 


2131 


317 


177 


c FFFFFFFFFFFFFFFFFFFLRIAJIFFF 
FLPILIFLFLPFLFFFLFF 


2117 


16018 


A 


2132 


3 


542 


EPWSVAQAGVQWRDIjSPLQPLPPRFERF 
SRIiSLPSSWDYRRLPAHPANF\QFLVET 
V r Kxl V VaUHJj XuS XiXj X ouriXi rAJLi X oUoAb X 
JAtPIoiCXo XjJ? tvv 


2118 


16019 


A 


2133 


146 


439 


lksvts iaktwiqpk/ cptnsevdkcps 
tvkwikkmwyi/yftmbyyaaik 


2119 


16020 


A 


2134 


95 


1 


GRVDAILAHCKLR/ LPGFTLFSCLSLPS 
SWDC 


2120 


16021 


A 


2135 


2 


380 


THTHHTH/ THTHTHTHTHTSLS 


2121 


16022 


A 


2136 


4 


337 


KRNNKAWMTVHLFTAWFPEYFRPTVETY 
CSEKKIPFKIL/LLVDNAPGQPRVLVEM 
HKEMNVVFRPANTASIIjQPMNQGGISTF 
NSYYLRNTFHKAIVAIDSNSSDGFGQNK 


2122 


16023 


A 


2137 


2 


356 


PVSSSQVRASVYLKKKKKKGPPPEGITN 
TAGEKPPQSFRGKG\PPFPLISPKKEPV 
I\SFLKNFGPCTIGKKKPPHPPAKNGGP 
LQ/ RPPTQGGGRGKKKGKNNQRPLSGLG 
GNRSPKPPF 


2123 


16024 


A 


2138 


2 


357 


FIiGSSDPPPSAS PVGRATG/RVFFFFFL 
VEWSHYVAQNGLELLDTSNPPAVASQS 
VRITCVSHRTWLLSPLYKI IQVCVSK/S 
PELEQSEDKSLKK 


2124 


16025 


A 


2139 


379 


16 


LLQVRCFVSTVNRGSSCQKTI QVYYVQE 
AIPPSFLIjSPFLM/PYTKINSRWIKDSN 
VKPKTIK\TLEENIiGMPLNI FFTYQLLW 
LYxjHPESQLE I CNS FRALQEGNLII FFI ' 
GRVGRPGTTGL 


2125 


16026 


A 


2140 


73 


411 


NYLLNNLFFFFFLERGLTFAPRAGGWGG 
NLTSWNLGPPGPNKPPPFPPKRP\GTPK 
PTLKEGLFGFFKTTGFP PGAQKGPELPG 
LRGPPGLAPPRGGNKGGNPPPGPIiKTFN 
G 


2126 


16027 


A 


2141 


46 


421 


AGVSWRDHSSLQPCLTSRARAV\SHLSF . 
LSSRDYGPMPPPPKK3CKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKQ 


010*7 
Zi2/ 


16026 


A 


2142 


26 


479 


IjAjl LiXiFolr vImi JvfyWWKX XXiSioXoNKoXi 

I CRI YKETLQLNHKKTNNP I FFKWVKKS 
ts±ir X KvjAXU VAN K. \iiX KM jvXlN X X o V ivaN 
AQKNTMRYHHTP I KMKKMGCALWLTPVM 
CQEAGYILKMYPGWTPYPGRWGRPTAP 
ALPPRGTFPRD 




16029 


TV 

A 


2143 


9 


166 


aktpt T apt TTririrprinanawD /TCTrnvrn 
UMKljltAKLiX JUNJui^KWUArtP/ XKNDKGD 

ITADPTEIQAS IREYHKH € 


2129 


16030 


A 


2144 


1 


328 


LEKESRPPPPPPPPPPPPPSPP/PHPGS 
LPKPLGLQPLPRPEVSFLTEYLPKLKAC 
EGGGVEI AAAS FPRI YLMGMCSRQDRIQ 


2130 


16031 


A 


2145 


1 


440 


KTFLRSLWQLVE/CYISSGLIDDHRRPM 
ALTPRHPRSGITAPRPRPQPPGRVGIPE 
PTALSPSPGPPPPPCSTPGRCQVPSLER 
RRKEGREPPS VGRGCX5GHGI S PSSDIFF 
HLNFCliAHPPDLFVFVCSISNQLYIFHE 
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E=GIutamic Acid, {^Phenylalanine, 
G=Glycine, H-Histidine, £=IsoIeucine, 
KNLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 

0=f»liit3 mint* R=A roinint* Q=Slf>rinf> 

T=Threonine. V=Valine. 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














KKKKKGGR 


2131 


16032 


A 


2146 


23 


422 


IASGRP F FFFWGAPGGEEGKKNF FWGPR 
GGKKKIRGAPTPRGKGNPPPKPWGKKGG 

KSPPPGPXKGKKNRGRP\GPPPQIFFKP 
FFSTGEKNPIiGKKTPNWRLIiPW 


2132 


16033 


A 


2147 


3 


372 


KKNTPFFLTPRGPPPPPRGPLSPPPKRA 
TPPPPPFKKKKPGFPPKKKNFFPPPGGG 
PPPP/LPGEKKGGYF/WW 


2133 


16034 


A 


2148 


363 


1 


FNCRWFETKSHS VPQAGEQWHS FGPLQP 
LPPGFK\EFSCLSL 


2134 


16035 


A 


2149 


2 


361 


FFFLVETGFHQVGQGGIiEHLTSDDPPTF 
GIjP \ GGWDYR 


2135 


16036 


A 


2150 


387 


2 


KKSFFLVSPARVQWGDPFNPPIPGSNNF 
PFSTPPKTGVIRGPPPARKIFVFFIKTG 
\ FPQLGRGVLKS LPQ /CDS P P PAPPKGG 
VSGGNPPAPPFFFFFFEMESHSVSQVGV 
OWRNLGSIjOPPSPGFKOF 


2136 


16037 


A 


2151 


23 


421 


IASGRPFFFFFFFFFFFFLGGRGVFFYP 
PGGGGGPNFFKKTPFPPGKKKFFPPPSP 
KGGFLTPPPPPPFFFFFLKKKGVINGGG 
GGVKISPPGGTPPFLPQKGG/NKHGGPP 
PPRKKKI FLFFFPGAIKQRPPP 


2137 


16038 


A 


2152 


3 


366 


VT WAHCI FHLLGY TDRPCLKKKKKGFPG 
GEKMGLKNFFPPLGKKPPPPQIPNFWSV 
I EKP PRG FYTiGGAAP PKKFFIiLiKP \ G P P 
FFKEPPPPKKKGGASPPIiKKKFFKGKKS 
PPFFFSDPTS 


2138 


16039 


A 


2153 


423 


253 


FFWGGGAPIFPPPKKGFFPKNPPGVFFS 
PPKTGGCTPFFP /HPVMPGPPKDPPfCRPP 

p 


2139 


16040 


A 


2154 


287 


1 


KEMNKKKNPKKKKVSKKISTAFYNKTLN 
NINFF\ FAFFFFFFFFFFI FFFFFFFFF 
FFFFFFFFFFFFFFFFRFLFLFNYKQTTA 
QLMQHSAPTRP | 










All 


z 


RPVT.PTPPPr. /HDV<3PPDT.PPQPT?PPPP 
ivi\. vlir jLr ir trUf txc r orrrLirtarr xrccc 

PMKFFSPPPTFFYFYKSPPHPP/HPPQV 
VFSPPTPPMPFITPPPPPHTSPSFPSFF 
FFFPFFFFFFFPSFFFFFFFFFFFFFFF 
FFFFFFFFFFFFHI ISIKGE IGRSSWR 
VMSRTRG 


2141 


16042 


A 


2156 


69 


606 


LWWPPLSRHAAYRQWPGPTAPRGLGHKV 
KGPGAS PA\ PCGAAAGSTAQG \GGGAAC 
LPGPAAGAVTWPAGPGGGRATGPVLQR 
PAGAGQPTQQGQHDHAGRVLWQHGIGRP 
GAHRPGPCAP /DSAPRAPGGRCG 


2142 


16043 


A 


2157 


449 


0 


SSPGSSSSCSPPPPGGGVGFFL/YKPPQ 
KKPPPPPVGGPGFFPQKKLKSFFPLPPP 
FFLGGGGPPPPPPPKIKSFYPPPKKVSF 
PPPPKKAFLPPPPPPLPPPPPSPPQPPP 
LF 


2143 


16044 


A 


2158 


241 


5 


KKFSFFPPGGSQGGFFGSLQNPPPGFTP 
FFCLNLGKKWGQRGPPPRPNNF /SFFFF 
FFFFLVETGFHHISQDGLDLLTS 


2144 


16045 


A 


2159 


434 


2 


LGSASQLGCSGVRDPIiEEAVCPFSDLQL 
CARRTTAL FKAVRQGHLSLrQRLLLiS F\ V 
CLCPVPRSGAYRGRQASLtSCGGLHPVQA 
SRLLCLPKQAWAMAGAP/PPASLQPCSL 
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sequence 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyl alanine, 
OGiycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Prolinc, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
yy— l rypiopnan, i — tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














ISDCCASNQRDSVGVGPSEPRAGYNLLV 

"DfTTT CD 


2145 


16046 


A ' 


2160 


448 


84 


FLGGFFFFFFWFPPPS PPQKEAL\QKKI 

FKGGGR\FSPWFLFFFFFFWFFFLWFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFLGKCPLKYT 


2146 


16047 


A 


2161 


3 


156 


RGCSEPRLRHCTPAWVT\KETVSQKKKK 
iU*LilJj(JGKTrANrr FrNKAPGTFP 


2147 


16048 


A 


2162 


412 


37 


FFHSPPPPPPPAGGAVFPPKKKKKSPPP 
PifliLiXj/ KGGGGPPPPPPPKRGPPPQKP 
KRGFFSPPKKKKKFFPGPGGPPGPPQKT 

PDDPPDDPPITPPPPPPPPPTJPPPQf "COT* 

FLRTLWRKGRGRGI 


2148 


16049 


A 


2163 


5 


370 


QSSCVQWLVFWPTIQEAEAGGFLEPRS 
ERLQ/CSHCTPAWA 


OtdQ 


iOUjU 


A 


Zi o4 


437 


36 


if r r oEAJibC^VAQAGVQWHARSLPQP 
LPPG\SSDSCLSLLSSYMCLSPRTRGS 


z l j\j 


lOUDi 






3 


623 


RQG r i b VAWAGVQ Wx D LGS LQ P P P TG FK 
RFSCLSLPSSWNYRHAPPCPANFVFLEE 
TGFLHVGQAGLELPTSGDLPASASQSAG 

i- l\sWorlt-A/ K_rj\.VCo inl ih* r 


2151 


16052 


A 


2166 


337 


0 


PPPPPPSPPPPS/PSPPPPPPSPPS\PS 

ppp 


2152 


16053 


A 


2167 


350 


156 


AEATSKIRCQKCYIMIAGHSGTRLQFQL 
LQRLRQENHIiNPRA/RGCNE PRSHHRTW 
PTWXISKSFIiA 


2153 


16054 


A 


2168 


1 


337 


DERSLSQRSRSWSYIvGYYSDLSTARHSG 
HHKKRT KKKK/ 1 KKKKKKKKRGP FKKKG 
PLKTRKKPRGGLLKAHPFWGGPPPGFFL 
TGEGAPPPVFFKKKKKKPPLGGKGGFLW 
G 


0 1 KA 
1 134 


16U55 


A 


2169 


2 


606 


RVLRAVAAHEEPDKEGKEKPHAGVS PRG 
VKRQRRSSSGGSQEKRGRPSQEPPLAPP 
HRRRSQPP\HPGPLPPTNAAPTVPGPVE 
PLLLPPPPPPSLAPAGPAVAAPLPAPST 

HH\PAPMVIPAPAEPI 


2155 


16056 


A 


2170 


311 


2 


GLQPLGLGSVKQCMDLACVPETVCVCVC 

V U V v_/iKAll 1 f V k» 1 wVjv_ V flio A V V-WrlV 

C/VCVWACTCVPVCMHTCVGVGASVCMQ 
RNELGRWAWENDAIRQQRC 


2156 


16057 


A 


2171 


78 


337 


NTPNS I YKAMSLKGP ITGTFLPNYPGHK 

Vv-.VV-.U1 X J-iv_ v I y xl J.xllillrlJLrllxll yrUt? xr 

HI/HIYTHT 


2157 


IwJO 




<£1 /Z 


Z 


Z/o 


TPSQKKKEKRKRKGRKRKRKKKIVYTKI 
EKGNAKMTCESFIHFCKKSSSVPQVALS 
AEYRCSST 


2158 


16059 


A 


2173 


2 


365 


FYHVGQAGLKLLTSSDPPALGLPKCWDY 
RRJE\DCAQPNVKS 


2159 


16060 


A 


2174 


195 


389 


FRVFSKLKYYYDFFRGRISLSCPGWSTT 
PGFKRLS CLSLPS SWDYRRPPPCPAN\ F 
CIFSRDGVSP 


2160 


16061 


A 


2175 


23 


374 


IASGRPLI FFFFFFFFFFLFCRGGGPPF 
NPRGKERGGFLI KGPP PPGGKKNFWAPP 
PGGGEIKRTPPPPGVFFFFLKKKGFSFG 
GGGGKK/PPPPGEPPPPPPQKGGKKKKK 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














TRGPLP 


2161 


16062 


A 


2176 


2 


158 


FFFLKQSLSVTQAGVQW \PVDSLQPLPP 
RFKRFSCLSLPSSWDYRCVPQCPAN 


9169 




A 


9177 




120 


PRFHFIASPSEMSQNTTVKAKTTVPASEC 
AYPKIEPFFPF\DPRGSES\FDKLGVHH 
HPLFPLSGEPLM ILHKNTH 


2163 


16064 


A 


2178 


3 


256 


YIiKKDLNVNQKTLTLLB/ EKQGK/HLHD 

mrlckefi^ktskaqyilkkisqy/lik 
vqnfnavkdpvkrmkrqasd/went ft 


2164 


16065 


A 


2179 


95 


1 


GRVDAIIiAHCKLR/liPGFTLFSCLSLPS 
SWDC 


916^ 


16066 


A 


91 RO 


i 

i 


143 


ARGERERERERERERERBRERERERERE 
RERVFV/RERAALSRERGDKCERSGLF 


9166 


16067 


A 


2181 




304 


ARGQGHLiSLQRLIiI*SF\VCLCPGPTGGA 
YRGRQASIiSCGGIiHPVRASWLLCLPNQA 
WAIAGAP PAALLPP CSL I SD CCASNQRD 
S VGVGPS KPCVGYNLLVW 


2167 


16068 


A 


2182 


3 


163 


PQRRSTESCGWDKDARSQSRSPPRQQAH 
GHHS /HTHTHTHAHTHTHAHTHAWTRP 


2168 


16069 


A 


2183 


3 


205 


NFKLFCKGFFFLRQSL/DSVSQAGVQRH 
NIGSLQPLPAGFQLFLQPPPPGSKDTHP 
GVQRHDHNSLQP 




16070 


A 


2184 


228 


88 


WFFRIYDSFFLVFLFVFFFFFFFFFFFF 
FFCVFTPFPVYFIQYWYI 


2170 


16071 


A 


2185 


287 


•> 
3 


CT.T7T. A T DOU17PAD V T VT D VOVTTTT.T TLf~*f> 

PSKGLSIVQWRGQLNSPVLKSQDPTP\ 
PRFKRFLCINFLSRWS YRHVPPRPDS FV 
FLVETGFLHLV 


2171 


16072 


A 


2186 


252 


105 


VSISCHPQ\PCLIYTOT/HIDGYSNIHL 


2172 


16073 


A 


2187 


267 


459 


KHVPGSSYTWL/ IFFFRGNFLKKGANFV 
PQGVLKGGNLSSIiYPPPPRLKQySdiTL 
LRIGNYRHA 


2173 


16074 


A 


2188 


170 


15 


RSVMKDLNSHFSKENIQMANKRTKECSA 
LVVFREIK/TTMRCHLTPIRMATIKK 


2174 


16075 


A 


2189 


2 


504 


DVT I S TCHASAKV GTRL VFDHYGKI I QK 
TPYPHPRGMTVSVKQLFSTLPVRHKEFQ 
RNIKKGLG\RRSCFCFDF 


2175 


16076 


A 


2190 


1 


565 


FFETESLSPRLECSGAISAHCaTLHLPGS 
SDSPASASRVAGTTGACHHTWV / IFFVF 
PVETRFHHVSQDGtiDFIj/NLVI RPPRPP 
KVLG 


2176 


16077 


A 


2191 


142 


382 


NTPPLLFFFVIRDRVSLCCPRWSGVAQF 
WLSATYASRV\KRFSCLSLP/SNWDYRC 
VP 


2177 


16078 


A 


2192 


138 


365 


KHQ YHHCC/ LLKKKKKKIQCKKKKKKKKK 
KKKKIKNKKHGGPFKKNFW/EGQPRNWA 
GGV 


2178 


16079 


A 


2193 


385 


21 


RGEI FFFKTRRKKFS PQGGRGG VFPPSP 
PKNFFFPQGGKFFGG/EGGPKVPPPKKG 
GFPKKPQGGFKVPPKKKKKIISPPGVIG 
GPPGNFLKGAPPFFFFFFFFFFFFFFFF 
FFFLVRAVKLS 


2179 


16080 


A 


2194 


415 


56 


PPPPTTAPVFSPPPPPRPFFSPPPPVFF 
FRSFPPAPPPPPIiFFPPPSPP\PSFPPP 
PPPTRPCPPPPPTIPPPPPPFSPPPPPF 
FFFFFFFFFSLLPFFFFFFFFFFFFSFL 
FLFFPSRLW 
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ing to first 

amino ficid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

can |] Ati re 


Amino acid sequence (A= Ala nine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P^Proline, 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, ^possible 
nucleotide deletion, ^possible 

UUV.IVUUUC tllfiK?! LIU 11 


2180 


16081 


A 


2195 


35 


369 


MKRPSPPPPTPPPPPPPPHPLLPPFPEK 
KTGPP/RGFPKGGERGPPPPKKKKDPPP 
QKQKKKKKWGGGGGGGGQRKKKHPQKNP 

•c c c jt v3xvwov7X\jtwf xjrvuixf zwjtr JTJryo\j\3VJ»v 
K 


2181 


16082 


A 


£m 1 Zr\j 


o 




SKPRSCMCTPAWATO \ ETPS YKKKKV KTC 
KGLPLGGPGEKT I FPSLKPGQKAQPKNK 
IPFKKKIPFFSTPKGKKKKWFQGGPKKK 
KKEI 


2182 






91Q7 


1 
1 


Ml 1 


ARfiF P f? P WQ FF FP FP RP FP IT P FP FP 17 P TP 

REPJERRHP/RPRERE 


2183 


16084 


A 


2198 


1 


285 


ARGERERERERERERERERERERERERE 
RERERERERBRERERERERERERGGAPP 
PRLYRERVLKKQRGRFLPPVRERKK/RS 

PVnTTiPflV 
r vy i J_ii vjt x 


2184 


16085 


A 


2199 


346 


1 


vprhglpcfcfphqijqppkrkkfnnrkr 
altshtqffvfetescsva/recsgais 

hctpvgqqteiwspknkpikkeegcpyc 

QPLSC 


2185 








j 


joy 


VHHHVQIjILFIFVETK/FSHYAVQAG/L 
ELLSSSDP/PTSQSSRITGMSHH 


2186 


16087 


A 


2201 


3 


204 


HECHCTPAWAT\SETI£QKKKKKKKKKK 
LFHLKRGPLKLF 


2187 


16088 


A 


2202 


69 


379 


KKKRVFFWGPGGSKPPT/SGNPPPWPPK 
GGGLRGIiPPPPGPRGVIFKNFGKRGPPP 
PPRTiTT c ;wrjppriPiiV5T.aT^pnciT QnT.wwn 
APP FFFGVTjKKLP PLFFVS KG 


2188 


16089 


A 


2203 


81 


379 


KKKKKPPPPKKKNPGPKKQEGGPLRGPP 
TFSGGGGGKKTCPQKLKAAGG/SKKAPG 
PPPGAEKKNPALGGEKKTIjGPNGRGPKR 


2189 


16090 


A 


2204 


1 


341 


GQSLPVSPRLQLSNGNRTLTLLSVTRND 
VHP YECE T DNP A Q AltfFQTYPVTTJJVT .fT?V 
LU?\PVAQATSPNPRSQRPGLSVPLSSK 
NADSPPTPRNPARHDFLPQANMGRPSLD 
QE 


2190 


16091 


A 


2205 


2 


370 


DAP PRPANFVFLVKTGFPP VGQAGFKLP 
PPGDPPPIiASOITGKG /HCAOPPF 


2191 


16092 


A 


2206 


362 


24 


PRGSSSASSSSSSSSSPPSSSSSSSSSS 
SSSPF\PPPPVLKPPPPPPKKSPPPKKK 
NPPPPKKKTFFPFFFFPFLRRGLCPFGR 
GKRAKPPFQKKKKKRKRESSFYQQVIIH 
M 


2192 


16093 


A 


2207 


357 


3 


AGQALWLARVIPVLWEARSYYVRISWYR 
NIVAMDSDSLDGSGHSKliKSFRKGFPVL 
YAIKKTHDSWEG VS \ MS AL I TOKS L I PS 
CVDAFEELTS SAE EVAAQWG I ATDLEL 
LVGCARA 


2193 


16094 


A 


2208 


280 


411 


GNDVYFLiVFLF/ CLFEKES CSVAQAGVQ 
WHDLSSLQALPPRRMA 


2194 


16095 


A 


2209 


3 


392 


PIIYEKYTQQINEMPRKLQHQQLAIjVSR 
NGPILLHDNAQLHITQC/LOKIjNELGYR 
VLPHLPYSLDLS PTDYYFFKHLDNF\LQ 
GKHFHNQEDAENVSQKFVKSQSMDFYAT 
/GNKLISHWPNCVDCN 
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sequence 
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M 

etb 

od 
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NO: in 
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26 
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beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 

CO/1 at nnAA 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
uing u> 
last amino 
acid 

lOIUUC VI 

peptide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pfaenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, 1^= Leucine, M-Methionine, 
N=Asparagine, P=ProIine, 
\^ VFiuLaiuinc, xv — /vrgininc, o-oenne, 
T=Tbreonine, V=VaIine, 
W=Tryptophan, Y=Ty rosin e, 

nucleotide deletion, Impossible 
nucleotide insertion 


2195 


16096 


A 


2210 


248 


417 


GIIiANCASIKDLISRIYKKLKQIN/KQK 

X ViU XT XI IVL\. rl rtTvX/ I lis* XV. X r OV^JQ Ci X ^ VJ V Vi XVXU i 

KK 


21% 


16097 


A 


2211 


213 


493 


GKRFYFACPGKIGALQGFFKGGGPFFFF 

FPPT .TP P PNT^TVT .TTvTirr .K"T TTQ PT .PTPYl? 
r r ruirr trvt ±m x xj x in zvliivx i\X3 fxjv— x n r 

SNILGFYPSITHRTIPDAWVTA/SQSGP 

VLMRSIS 


2197 


16098 


A 


2212 


159 


413 


LAPRVI FGPPKKPPERPPLFFFFFETES 
SSOTQAGVQWHNLSS\PPRFKRFY/CIiS 

T .P<2NPTi VPTP PP PTIP ZVMPiTT? QPTYWQ 


2198 


16099 


A 


2213 


455 


60 


NPRREVGPICPPPKIRVPPQNPQVGFYS 
PPLKEKTFTSPAPVNIiGPPRDPFKRPPP 

AQPRVARRPLVPS S \PPLPSxjCLAPAPR 
GPGSLCPRGSLEGDNGSSPG 


2199 


16100 


A 


2214 


2 


243 


LTLSPRLKCNGMVSAHCNFCLLSSSDS/ 

rvLirv/ ^CU^VMXJWXAJVJXgVgV^OIaXM^XAlvrXwX JrflW 

CTEPJ)SVSKTNKKN"SLKIFTCFFVDAA 


2200 


16101 


A 


2215 


1 


286 


FSQLLRLFIC/SQGGRIxIjCCESCPASFH 

rVWVKt^31TxTiQVFPRTKRKyS I IVQTS F 
ILWIQSDLEIDR 


jLZKJI 


i£.t AO 

ioWJ. 




1 c 
zzlo 


1 11 
1 15 


Z 


TTT i7FP\7PfY3T.T02Vf5VnWP \ nTJ?m?TT.V 

IXXcf TV tv^JU X yiUJVyriA \i/iJOV3Ci X XJ V 

ILPPLFAPCSLQSSWDxTvRVLPHLPNFC 
IV 


zzUZ 


lolUi 


TV 

A 


ZZI 7 


ZlO 




v x Jriror ri v coyvr uooinioiiororr nx* 
xXFIYLFIYLFIYLRWSL/DSVTQAGVK 
WHNLGSLQPLLPGFK 




1 6tf\A 
10IU4 


7V 


ZZlo 




ZZo 


K^SHLFTDDi ILYMFOTKHSTK/NLLE 
LIKE I S KVTG/YQKS VAFLYVNNKQAI K 
KTIPLTrASKRIKMSGOA 


2204 


16105 


A 


2219 


244 


3 


EVLNQNSGLPRWPNILRAKASIiRVPRQC 
S RG WFS ANGAGTTR YLYAKE WGG LGGG 
GCSKLRSCHCTPVWAT\SETLSQKK 


2205 


16106 


A 


2220 


162 


2 


INMVGNLFFFGDR VS LCHAG WS AVAPS W 

ptvastsl\vkossflslpsswnhrh 


2206 


16107 


A 


2221 


146 


3 


GRVDGVPWRNPGSLQPPSP\GSSDPPTS 
ASOESGTTGAHHHTRLIFVF 


2207 


16108 


A 


2222 


239 


2 


Sx^SKPDKFPHDSSPEIKPVTVimiNSF 
SFIFPFFFLFPEKESHSVTQAGVQWRNL 
GVSSYWP\AGLKLLTSGDPPALAS 


2208 


16109 


A 


2223 


2 


159 


xjNRDLGGGGCS blrs chctpawat \ ses 
PSQKKKRKKKKKKKKGGRGRNSKI 


2209 


16110 


A 


2224 


3 


345 


RFKLFSCLSLPSS WDYRRVP PRPA/ NFF 
VFLIET\GFAILTSSKTERQSRLECIFG 
FYGLPCREKRASERRSVEG /HERKILFS 
FDFFFLGGTESCFVTQAGVQGCYIiGSPQ 
ppppG 


2210 


16111 


A 


2225 


103 


319 


FSEExTUmrim^TMSCSCLTIiVE/C/ 
WSEGYMATPCTIIiIiLLFFQRHCLTLSPG 
GVQWCSHS SxjQPQTPGIK 


2211 


16112 


A 


2226 


2 


110 


FHHVGQAGLELLTQWSIHLSLPKCWDF \ 
RHEPPHPA 


2212 


16113 


A 


2227 


2 


178 


I FLI FTFIxEMSSHYYAQAGLE FPG / FKL 
TSRLSIiLSSWDYRRPPPRLAKFFAFLAK 
GDAA 


2213 


36114 


A 


2228 


173 


3 


FIFIFISLFFIFFLRQSIVI^PSAVOSR 
LQPPPPRFKQFS\CLSLLSSWDYKRVPP 
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C 


2214 


16115 


A 


2229 


333 


1 


HTPPPPPTFHSLCTEGAPPPPQNT/PNP 
PKNFr ilPIjSKbHTKANEPSijSTPRTHP 
HSPRPPLLTHPHPNPRASAPPGSRYPPR 
ARHRERPRERPTQRERERERESVCVCV 


2215 


16116 


A 


2230 


350 


3 


KKKD I PPVNI YAPNTGAHKYI KQI LLDL 
NKDGP \HTIXAGDFTTPIiSVIjNSSAESR 
CSRPFI ISPSLWGLRE 


2216 


16117 


A 


2231 


277 


1 


QMHPARGHLPQALI P VQKPAL ISQGMSA 
SGSTQVS P FhS PCFVrvcTvNCSNTKA/LT \ 
PFCGAGTGSHS FAQAGAHWCNRGSLQPH 
PPGFKRL 


2217 


16118 


A 


2232 


3 


335 


ETRFHHVGQAGLEPLTSDDPPASASQSV 
/ GITGVS LRARPGES / GSWKTVCCNNMS 
EPT 


2218 


16119 


A 


2233 


29 


448 


CPSLRQAWHEAAIDEVRTGTYRQLFHPE 
QIjITGKEDAANNYARGHYTIGKEI idlv 
LDRIRKLADQCTGLQ\GFLVFHS FGGGT 
GSGFTSLLMERLSVDYGKKSKLEFSIYP 
APQVFTAWEP YNF I LTTHTTLEHSDCA 


2219 


16120 


A 


2234 


1 


365 


GARL I LVFLEETGFH YVGQAGLELLTS S 
DPPASA/FPKCW 


2220 


16121 


A 


2235 


28 


460 


DRLIDHISKLGTRGLQGFLVFHSCGNLG 
\TGFGFTSLLMERLSLDYGKKAKLEFS I 
YPAPQVSTAGVE P YNS ILTTHTTLEHS D 
CAFMVDNEAIYDICRKNLDIERPTYTNL 
NRVI SQ I VS S ITASIjRFDGALNVDIiTE F 
QTNLV 


2221 


16122 


A 


2236 


614 


84 


LAASLAACAQLSALAASHRMWALQRLRK 
LLTTEFGQSININRLLGENDGETRALSF 
TGSALAALVKGLPEALQRQFE YEDP \ IV 
RGGKQIiLH/SPHFFKVIiVASRLVTLEAG 
HFCPCCAETHKW\AWFRRYCMASRVAVA 
IiDKRTPLPRVFLDEVAAVRVCGHILQLG 

tvt*ot /v\it t r*Li r 
UlTiijQQn I CrtL 


2222 


16123 


A 


2237 


1 


393 


GPMLAQLSVFRCGSTSAPNDLWYHFIBL 
PYHGES ITMLIALPTESSTPLSAI IPHM 
S TKTIDRWMS IMVPKKVQVI LPKFTAVA 
QTDLKEPLKDLGITDIVDSSQGHFCQIT 


2223 


16124 


A 


2238 


3 


402 


HVGQAGVQ I GKACWE L YWLEHG IQPEGQ 
MPSDKT IGGGNDS FNTFFSETGAGKHVP 
RAVFVDLEPXVIDEGRTGTYRQLFHPEQ 
LITGKEDAANNYARGHYTIGKEIIDLVL 
\DRIRKLA\DQCTGSQGFLGFP 




16125 


A 


2239 


2 


478 


GKGGIiHK I PWTPIjTPS FARGLVPSLiAR 
GVEARNGAGPIKSYPRPGSRLKMQNGSK 
GSGLQNKTFHWE I CDAHVNSKI QLKQ\H 
ISSRRHKDRVAGKPLKPKYRPYNKLQRS 
PS IliAAKJjAc ^KDMMNPiiAPAFIiS SPLA 
AAEAVS SALTLPPRPFCFV 


2225 


16126 


A 


2240 


255 


2 


FLFVKPHQISCPTKKGIKSFLVLCPLNF 
FLFWRODLAHPGWS AVTOSWLTAASN\ Y 
GLKQSSYFSLLSSWDYRCIPPHLGGKRP 
L 


2226 


16127 


A 


2241 


365 


11 


EPPPPGGEKKKRGKPPPKKTPKKDGPQK 
KSAAFFGG\GKI KKKKGAFRKKKKKGGG 
PPPKKKPPPRKKKKKKNFSPLPPPKHTP 
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0=tf^l tlt^mino P=Arainino C— Cartwa 

v£ vjiu i4iuiiit, iv— /vrgiiune, o — oenne, 
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SLFQKPPTKI FLFSPPIiLFLNIIiFFFPL 
SPFIF 


2227 


16128 


A 


2242 


3 


143 


PNFLS PRLD CRGT IRGHCGL YLPGS CD P 
S/CLSPPSSWNYRSTPP 


ZZZo 




A 


ZZHO 


1 J 2. 


z 


CHI F\ VFLVQTKPHHI 


2229 


16130 


A 


2244 


334 


43 


TSKHMKRCS I SLATREMQNKTTIGGHFV 
PTRLAWIQKPKNAKTOQGCLMC/WCWVC 

\IYPEEVLASSVPGT 


2230 


16131 


A 


2245 


312 


2 


PPSKLPEKNFFFFFFFLRQSL/SSVAHS 

WHWPrJT ^SPT.TTPP PPP^ffOP 4 ? PPTT.T.<? T 

WETQTAIiSQDCATAIiHPGGQRETLSHKI 
CVCVCVCVCVCVCAVCIYIN 


2. J. j 1 


ioiJZ 


a 

Jx 


Zz4o 


5 


ZJU 


P AnAMVT?T<3RRP rT.T.P PPDT 7?TP \ T4\7An 

AGlxKlxLASSDPPTSASQSAGITGVSHHT 
WPOPLTPCPHAKS IiPPTNOI 


2232 


16133 


A 


2247 


2 


214 


GRVDLVIQAGVOWHDLGSIiHPPLP\GSS 
DT.P / AS D 55 OVAGT AGR FHYA WT » T FF \ VF 
FVETGS / HTQVSNS YDVL 


2233 


16134 


A 


2248 


3 


139 


EGVQGCHHNSLQPPTPGNSSDPPTSASI 
EAGTAGSHYHVWLIFLLF 


ZZJ4 






ZZ4V 


i 
j 


i fin 


RnVflflniHQ <?T ,OP PT Pf3 \ fl <3 ID P PT<? A<! t 
VAGTAGSHYHWLIFLLFCFEGDAA 


ZZjj 


IDI-jO 


a 
*\ 


ZZ DU 




•a 
j 


RliERESVSKKPKSQKPPIKKPKFTPRKF 
KEVLAKYGTKFFFVFFGLFVWFFLRWSL 
/NSVAOAGVO WRDPGSLQAS PRP 


2236 


16137 


A 


2251 


15 


394 


FVSFSFFPSFPSHLFFSSSSFPPFLPSF 
HFSFIiPSDRPSWPS /FLPSFTjP 


2237 


16138 


A 


2252 


60 


306 


GRERILEEIMABDFLNLMKDLNISIQVA 
Of) T PS KMNSKRPHRYOHFRS OSORTIiKA 
TREKQLATYKGSSVT/VSPSPGPQTVNS 


2238 


16139 


A 


2253 


3 


351 


GFHHVTXJANLKLLTSSDLPASTSQSTGI 
TGI\DHCTQPNFPI 


2239 


16140 


A 


2254 


150 


2 


RPRRPDHSRLGAGDQPGQHSKTPS\HQK 
KTTCTRO AWPHAP AT AGTRfJAP A 


2240 


16141 


A 


2255 


357 


1 


LNLl^IaTLYAKII^KWITCTNV^ 
KFI/^KNGAtn^MDTRIiDNAFLDLTS KAQ 
LTKEKIDKLNFI KIQTF/ CS I KDLLGNL 
KRQAPEEKKILRNHISNKELVSRIHKEV 
PKLNNKK 


2241 


16142 


A 


2256 


397 


1 


FSLFPPVGGQGGFFSSCKSPPPRFRAFF 
CPNPFRKGGNRGPPPHPGKGTLGFFFFF 
FRQS FALS PRVEYNLCLPGS SDP PAS AS 
QVGGAPGIiPPPAWVNFGIFFF /CFFLRQ 
SRSVAQARMQWRHLGSLQAMP 






A 




z 




TLTLTPRIiECSGTIS / R AHCKTiHTjPGSR 
HSPASAPRVAGCGGGHL 


2243 


16144 


A 


2258 


214 


347 


KISLILGVHKICCEF/CFFEMKSRSVTQ 
AGVQGHDLSSLQPPPPGL 


ZZ^Ht 




A 


ZZD7 


"5 /IT 


* 0 

z 


PP*?PFPPPSEAR < ?P t ?VA\RT»Rr ,< ?riTl7<3A 
HCTLHLPGS 


2245 


16146 


A 


2260 


333 


1 


SDQRWTENAFVELRDEGFACPSFS EL/R 
STPSTSGEEVENFEKKLDECITRITNTE 
KCLKELMELKAKAREPREECRSLRSRHN 
QLEERVSVMEDQMNEMKREGKFREKRT 


2246 


16147 


A 


2261 


2 


357 


S PRS CS VYGIAI LLFIrYFLYKIiAFADL Y 
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G=Glycine, H=Histidine, I=Isoleucine, 
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QLALNS FLHE IQEP / SLASGSGPLSRNS 


2247 


16148 


A 


2262 


355 


0 


PPLPSPJuSPPLSJjSFFSLPPFPFFFPFF 
PLF/CPSFSFFFPSPFFP 


2248 


16149 


A 


2263 


348 


35 


YFLKKGFSFFPRGEGRGKDSRSLKQLTF 
GFKKP/SCPSFLRKW\ETRLCPPAQKIF 
FFFFFLLEMGFCYIAQGDLKGSSQSSGI 
TGVSYHIWPTFIGHLTMCLAKC 


2249 


16150 


A 


2264 


270 


92 


DRPARRKMFFTYS SRIiFFFFFFFFFFFF 
FFFFR\ FFFFFFFFFKFFFFLFSRWSLM 
IFIS 


2250 


16151 


A 


2265 


1 


417 


FRPPAGVQWRNLISLQSLPPGFKRFLYL 
SLPSS / WDYRCMPPQLA/NFFVFWFDV 
VVVVVVVLVEMRLRHVIiARLDIVVLICI 
SIjMANGMEYLFIjCSFAIHLPSLIjKCRFT 
YFAHCLFLYYFLETGSHSVTETGWQWCI 
I 


2251 


16152 


A 


2266 


344 


427 


LIGLCEDTNLGAIHAKRVTLIAQDLQLA 


2252 


16153 


A 


2267 


310 


417 


RVPY GTLGSG PLATMAG FDD KLKPHMEG 
AAAPILVR 


2253 


16154 


A 


2268 


186 


464 


NSCLSHNQRQLLFRLEKGMGPISAQEKL 
VLHTLSGFSGLV\VGWLVFEMESCSWQ 
AGVQWRVHPP PRFKQFSWLS FPSSWDYG 
HVPPCPANF 


2254 


16155 


A 


2269 


1 


427 


PEPPSIiPPDGAKKQKTQKSKKWRICFLR 
KKKKKKKRKKQKKKKKKRGGGSQKKSRR 
GPKLTTETNI I ILI KGGLKKMNYREIEK 
KLLFGGGGVIGPTPTQDIKGREE INYLE 
AVGREKQRFISLVK/ tnva/heatrdt I 
FRGYL 


2255 


16156 


A 


2270 


399 


44 


ISFQLLLPITVLPGHVRVLMEMYKEVHV 
VFVP\STTTFMLQSMGQGVTLTFKSY\Y 
LRNTFNKAVAALTSDSSEESGQSQLKPV 
WAAFSSLNASKNI /RDSWEAVKI PALKG 
VWKKWSLKM 


225* 


16157 


A 


2271 


19 


420 


AAGIRHEERERERERERE/QRERERERE 
RERERERARRNIYHTYRPRPPRVFFFFF 
FFFFKKKRRGFLVPTPPGGGGAQKKKTS 
LEGGKGVFFKGGGKKTPLKNPGGWGEP P 
QKKNGGGGPPPRDPPPPPRPLF 


2257 


16158 


A 


2272 


463 


20 


S YNI PLSQSIjIQ/ SRALTLFDSTKAERN 
RRGK\RSGKLQWEGSRGWLMRFKERSHL 
HNI KVQDEAVS YP EDLDKMDALNTKQQ I 
FSVHKIALYSKKMPSRIFIAVERESMPG 
FQASKDRL/ LLLG/ ATAAGDLKLKPMLI 

VTCrCVTJD'D'\7DD3VT?T? 

I W o JxTl xr K V IrKAC r 


2258 


16159 


A 


2273 


474 


82 


VGVWADFLKNTSQAQATKAKMDKWDPI K 

TSDKGLI / SRIDKELKQPYRKNPNNPVL 
KWAKGWAQWLT P VI LAL WEAKA.GRTQGQ 
EIEATLALFSGLFCQVFIiC 


2259 


16160 


A 


2274 


152 


3 


iu^KJNoDollUftool \K11j1joNDK-NPvN 
IHRRPTRFLTMLYQQNLCHLGL 


00 £ft 


10101 






ii 
Iz. 


Jul 


LGASARYEKPTVNLILNGERLN I FPIR / 
S KTRLG YLLSLLL FNIGLAI LASAINQ K 
KEIQI IQIVKKKIKTQRKKKKNKTKKKK 
ALFKFKGGPEKEKGPPKNPFKTPPWF 


2261 


16162 


A 


2276 


329 


487 


EFVNITIKIATSLHYKAIVIKA/AMVIW 
YWHKNRHIDQWNRIESLEINPHICNK 
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E=Glutamic Acid, ^Phenylalanine, 
G=G)ycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
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W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


2262 


16163 


A 


2277 


2 


466 


AHHS CS HLLTTAAHYS CS PQ LLTTTAAH 
TYSPPQLLTPAHHHSCSHLLTTTAAHHS 
CSPPQVLTAAHH\GAHTCSPQIiLTPAHH 
S S PQLLTP AHHS CPHLLTTT AAHHS CS P 
PQLANSYRG\SPYCSSWSQTPGFRKSSH 
VGLPEPW I FHGLQW 


2263 


16164 


A 


2278 


4 


467 


KNVTQQGKIHIRDKLDEMWGNTSVFCTN 
HMKHQTNFNAKKCNVFKECGK\TACNFQ 
LTQYQISHANQKPYECQICGKPFRKRAH 
LTQHNRIHTGGKPYECKECGKVFICCST 
LIQHKRTHTSEKPYECLECRKTFRRSAH 
LIRHQRIHTGEKPYK 


2264 


16165 


A 


2279 


383 


3 


FLCVCACVCVyVCMC!MCACAC\C31CMCG 
CVCVCVCVCVGLRGLGWGAWCRSWGPP 
LCFLLLG ILPLKSRIiLWLPRRT I S ICTL 
P S AQGPLPAPGFGKYAS NTTG VKGS SS V 
FSLLSRITALSHLHW 


2265 


16166 


A 


2280 


47 


219 


VCSELKSCHCTPAWAT \SVTLSQKQNKT 
KQRRTLGSIFFQHTFMHLKKEKSLILQK 
W 


2266 


16167 


A 


2281 


294 


160 


NKTFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFLFLF 


2267 


16168 


A 


2282 


68 


490 


RTFAYPKPPVFGSHSGKTTIPKNARGSF 
PS/PPLPFPCPSPSPGPSPVSCLPPRLP 
SPSPSPSLSPISRLPSP/SRSPSPSLS 


2268 


16169 


A 


2283 


3 


454 


CQSAPLGGASQLGYSGVRDTLEEAVCPF 
SDIWLCAGRTTTIiFKAVRQGHLSLQRFR 
LPFVWLCPAPRGGVYRGRQASLSCGGLH 
QVRASRPLCLPTQASAMAGAPPPASLPP 
CSIiISDCCASNERGSTGMGP/SEPGTG 


2269 


16170 


A 


2285 


1 


452 


LKDSGRDYVSQFEGCALGKQLNLKIiLDN 
WDSETSTFSKLREQLGPVTQEFWDNLEK 
DTEGLRQEMS KDLEEEKAKVQP YLDDFQ 
KKWQEEMELYRQKEEPLRAELQEGARQK 
LHEL/QEELS P/LGQEMLDRARAHVDAL 
RTHMAPYSDELRQ 


2270 


16171 


A 


2286 


3 


266 


NSSPPSSGHSTPRIiAPPSPAREGTDKAV 
SALKSPQPNRGMGRGQR/PGLPS /DTAT 
ITPHTSGFPKQPQLSLKVQTQRAKGRLS 
HWDLEP 


2271 


16172 


A 


2287 


274 


453 


I YTFVKS S S KTLRPRH IDQWNRI ENPE I 
KP/EYSQIiIFDKANKNIKWEKDTLLNKW 
CWDN 


2272 


16173 


A 


2288 


81 


487 


TVYFKPTVETY\CWDKKKI PFKGLLRRD 
NTPCYPKSLLEMCEKXNIVFAPATTTSS 
/LKE^DQGVIVTFKSNYIiKNTF\RLGEG 
RKKKKKKGRKKSFHKALAAINSDSSDGA 
GQS KQKT / FWKGFS I PNAI KNI QDPWE 


2273 


16174 


A 


2289 


75 


469 


SRGVAGAPPKPSPSTPSPGPLDVTPGPH 
SSHHAASPGP\PPPPEPTASSMASAP\P 
PAPQPTPL\PPATLGPPSAGPE/PSPGS 
CTSTGWGYSFCCPRCRRWMRWQPPQQGP 
AWHWWPREFP/PPPRVSGS 


2274 


16175 


A 


2290 


256 


55 


PTQPLRVELTTVTLRCD INKWDYI KLKS 
FCTA\KKRQPTEREKIFPNHVSNKRLIS 
KIYKELIHRI 


2275 


16176 


A 


2291 


497 


29 


SLTHRVAGGAAVTPLAHAGARQIFFLGD 
PHPTSSLLGWGPAWDPCAFQVSDHPASS 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, {^Phenylalanine, 
G=Giycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucioe, M=Methionine, 
N=Asparagine, P=ProIine, 
v^— vji u utiu me, w— Arginine, a— oenne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 

r» 11 plort+i Hf> Hc^U^tirkri \ — r*neoI Kin 
IliltlCI/llUt: UclcUUII, \ — UOSSlUlC 

nucleotide insertion 














RVGKTLLSCPGGGRLPWVTCPGGGQLSC 

GMCPRCMCVCVCPCRDVY/ ICVCVCVCV 
CVCMCLCPCWDVFTEti 


2276 


16177 


A 


2292 


141 


392 


SCSTEVKQPKIGVREVDFVAAPGIAPSS 

POTf HTiTT'CTQCT.irQT T.DT CTTVDI7T UMTU 

LDVLARE I RQBKE I KGIQ/LGKEE 


2277 


16178 


A 


2293 


14 


291 


YVGTEGISFISFMRVTNYMTRHIiATLRE 

C\ ^VCD\ 1 7VT3T3 T7 T T? CT ©TJTlTr^O'TVSO / T3 *r 

& \LioK \ V IfKf X£tC LiKr Ultjo loU/KX 
TSRQHP PR/DLRDALLYLNRRITLVRTR 
CKSVAKRPPGSP 


ZZ/o 


joi /y 






1 1 A 


293 


WRQRETVKKKKKKKKKPGWGFLNPPPQR 
GKLCF 


zz yy 


i £ i on 
lOloU 


TV 


ZZyj 


96 


313 


WTPA\WRQRETVKKKKKKRKKPGWGFLN 
PTPQREKLGFLKRGPGF 


2280 


16181 


A 


2296 


234 


2 


CCLBTTRSLFDKGTKENTQWGKDSPSNK 

DPiaifMUTOTPirDMVT nnvr td /vtmtmtt 
KUnr^WXoiCJ\i<niUjUr r i Ltlr/ x 1NJLJNV 

KY/IKDLNLRPEITKLLEENLREK 


2281 


16182 


A 


2297 


334 


6 


KLFSPPGGGGGPPPPPPKKGWVPPETP\ 
KRGGGAPNPPPPKGGGAPKPPPQKKNSP 
PKKKKKI FCPPPKKKGPKGVFLRGPRGP 
YRVFLKGPPLFFFFLKKSWRPLAMYA 


2282 


16183 


A 


2298 


467 


8 


LiPGFKAS TDRDN \ VAGDFKLKQMLI YHS 
ESSSALKNDTKSTLPVLYRNKEAW/VTA 
HLLIPWCTEYFKXrAETCCSERKISFKI 
LLLI \DNAPSYPRALMKMFKINVFMSDN 
llblvxb iUybvlLiUNbi I iaKN if I jfj\ 
ITAIDSDSCRMPQEGN 


ZZoj 


lolo4 


7A 
J\ 


2299 


1 


449 


O TUT CTT5 / DMT OTXTT CTVT OTOTVT C TV 

/HISVYIYL/9IYLSSlHIiSAIICLSIF 
QSISLSLSINLSTYLSTIHIiSLSSIIYS 
Q T TVWT . <; T w /t.<; yyt .5? 


2284 


16185 


A 


2300 


1 


445 


QAGI*QLLTSGDPRTSGIiPQCWDYRC\DH 
RSWQT 


2285 


16186 


A 


2301 


241 


22 


KimX3TCACVHVFICVYMCLWCAWV/C 
VSMCLYVCACVCtMNQC/VTVCMCVCVCV 
CRFVCVGI PPHSKWISIG 




1*1 ion 

lolls/ 




Zlvl 


3 


AnO 

478 


vjo\J1 IS 1 li J_i 1 oUKJMii? W t/CiALili 1 o vJMvj Kir 

GRGAPHI PDDEQPGRDAPHLPDGAAGQ\ 
SAPHL PDGE \PGRGAPHI PDGAAGQRRS 
PLPRWGG / EI/3PGAPH I PDGAAGORP <? <5 
HPR/PGRPGRGAPHFPDGATG/Q/DGAP 
HFPGIPPDGT 


2287 


16188 


A 


2303 


440 


41 


KSHHLSFLSFLLFFFP/TKSHSVAQ/CW 
SAJCSAHCNLCLPGSSHSSAliASRVAVTT 
GVYRI 


2288 


16189 


A 


2304 


2 


395 


FFLVKTRFLHVGQAGLKLPTSGGPPALA 
SQ/SL/RITGMSHRTQPE 


2289 


16190 


A 


2305 


184 


2 


S IKKPKPLVNLRNH/NT/WQGAVTPACN 
SSTU3GRGGRITRSGVQDQSGQRRESEF 
LRLGYGLD 


2290 


16191 


A 


2306 


86 


472 


1 1 KLCKWQNK/RFEAGSLVPEMGFYYVT 
QAGLELLFSRDPHTSASQSAGI TGAAFH 
QRWSVGTVXiLQVDRGTPPVGDCGSRTPQ 
WPGQAFIiRTALKSEAHPPHSSTDVTPVL 
WSEGSPCLLSPPSLSFTG 
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E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
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2291 


16192 


A 


2307 


133 


2 


RVIHVWRSHPAIP\TTREAEAGESPEP 
GGGGCNEPRS CHCTPAW 


2292 


16193 


A 


2308 


3 


369 


LTTAFPDFPGSTHPLTSASQVAWATGAH 
HHSWLIFVFFVETGFHL/SELLSSSSPL 
AS ASOSAGTTG /REPPCLTSPFFF 


2293 


16194 


A 


2309 


253 


365 


TDYF5TLFIYI»RQSIi/SSVAQAGVQWQDL 
G<?T.OPPP c ;GFTC 


2294 


16195 


A 


2310 


362 


0 


TKAQQCVHENHFKIiKDANTimKVWRNI 
GDEE/HFIKIKKSVTQEAIIIINIYA 


2295 


16196 


A 


2311 


167 


2 


LCYCVI FVFFI FWRQSLTLSQGGMQWCD 
prn<? t .n pt .p p \ gt ■kt?5?c;wt .PM^wn yp p 

dajO i-i yj -tr l irr \V3Xjjv-c\jO J ni-U-u it imo nu x xvjv 


2296 


16197 


A 


2312 


67 


482 


DHL I PGGGG CS ELQSHPCTPAWVTA/ ET 
LSQKKKKKREK 


2297 


16198 


A 


2313 


188 


437 


AGFFPPENQLTNMKVRKANESDPWGVKP 
x^x/tto CxHvjoxjoxjoXjoXjr fJur 1 r Xiity o j-i 

/NSVIQAGVQWRNLSSLQRISP 


2298 


16199 


A 


2314 


41 


325 


TTTLFXRVRQGYSSLLMFLLPFVCLCPA 

LPTQASAIAGAPPPALLPPCSLISVCCA 
SNEQGSVG*DP 


2299 


16200 


A 


2315 


417 


0 


QPPPWWnVPr'VPPPPiVSlPV/FTATP'Mfll? /V 

HVARPEFGLELPTSGEPPTLA/FSKCWD 
YR 


2300 . 


16201 


A 


2316 


98 


288 


LMAVVPATWZ?AEAQESIaEPGGWGGGEEG 
CSELRSCHCTPAWVT\SETXiSHTHTKKK 

KTrWGjMVT. 


2301 


16202 


A 


2317 


1 


410 


IjI^I PKX^LTKRKPSPHSIjNLKKKKK^ 
KKKKKKKKKKKKRGGXjVKKK^ 

GGKKREKTOQKKKSPKGKKKKnjR/EERG 
EKNPKRGGEKKRS SS PRNNNLRGEKK 


2302 


16203 


> 
A 


2318 


249 


5 


DT V'fcQQDPiV'jllTKrPrTOWVGPTPPPPTrtfiTV 

P\PKIPKLVFIPPPIRKKLLPCPPPLTL 
APPRVPLKRPP 


2303 


16204 


A 


2319 


2 


393 


AH^LPKC^YRHEPPRPAPLFIilaNSYF 
GLDLLT/ S/GDPPALASQSAE ITGVSHC 

r\\if XT J7i XVI 


2304 


16205 


A 


2320 


389 


2 


RGNSNIGGPGPLRGKKFSPPPPLKNWGT 
KLGPPPPPPFFFFFF\GKPGFSIFFKPI 
T.iTFICGPKTn3PPPJ^IfW5GSPGPGWFFFF 
FLSNFTQLreiKl^KGGKKNFASPPPPF 
FFFVHxTClXjypSATPDAW 


2305 


16206 


A 


2321 


389 


206 


FQWRWGF/NHVGQANLELLTSNDPPASA 
^RSAGTTGMSHi^OLKNYFI^^ 

OXxwJTVtJ J. A *w in liiTwV/ OIMl J> J- ■ 1l w X V C\l i FT rvJ.1 f f 

IAVGM 


2306 


16207 


A 


2322 


371 


42 


SFPPQSGFFSPPPPHEFFFPPPPSFFSW 
VGVRQIPPPPKIFSSPFPPGGFFSPPPE 

V UK 17 r O JTJT XT J7 V 17 XTXT JTO J7 X" XtOCtTCifC T 

FFFFFFFFFFFFFFFFFFFSFFGVKKRK 


2307 


16208 . 


A 


2323 


82 


386 


PFLTQKYFF1TPEEGFLKKPNRREGPPS 

f X X l/±r X Xj ri trSHPm tsxs X. V f AAX' lr X X X*T3v7v?4jr 

INMTFSGFVTTKAPFPRPIiRFNPMLRQG 
FDLLTLKAS\WGSSASWY 


2308 


16209 


A 


2324 


1 


413 


RGSGDNRHGPPCRVNFVFLVETGFLHVG 
RSGLELPTSPALASQSVGITGVSC/RPP 
PQASY 


2309 


16210 


A 


2325 


58 


400 


SETLVSKKKKKKKKTOPPPKNPKKKKNP 
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PPPPFCjAKKGPCnSxvr / PPPPI rQKKRGG 
FKK/ SPPKKPPKKKKTPPPIjGKKGPPPP 
RGGPLGGFPPPPPQKKNGPPPPKKKPPF 


2310 


16211 


A 


2326 


462 


306 


AHHNLRLPGS SDS PASASG VAGTT/ GMC 
HHARP ILYLSGDASALLHCFSSAQLF 


2311 


16212 


A 


2327 


1 


393 


SRPPSRTEKIRHFFFVETGSCYVAQAAL 
KLLDSSDPPTSASQSAG/ ITGMSHRAQP 
TS 


2312 


16213 


A 


2328 


391 


2 


AWFKETKAGWI I PRSWDRGSWVSQPYSA 
LTSS PESGFHSVTNGDSMTCIjEIXSKHSL 
VAPHGIPQRGSSLQDGVSFCPAPTMKPY 
LASTKM\RLHEARELGFFSFLStiRQGLA 
LS PQLE CSGAI LAHCSLNL 


2313 


16214 


A 


2329 


2 


115 


GCSELRSRFCTPAWAT \SKTASQKKKRG 
KKKKGGVFYF 


2314 


16215 


A 


2330 


2 


406 


AAAPSALALRDGWAVRPELDLLPPCX3EE 
VAPGAHCLjGCGPS P CLiFLS PSHTRQS P P 
APTSSPGLSTSPPLVPTHVSAPHSSKGP 
PSIPGAQAIiRGCGLGGWDR\PSSPSLP/ 

DTTT70TJVT3T XTT77VT5 


2315 


16216 


A 


2332 


226 


377 


KRKSKHITFLFKTLSWPDTVAHACNPST 
LGGQGGRI/TPRSGVRDQPDQHG 


0*5 1 /C 

2.3 10 


16217 


A 


2333 


3 


191 


CLS PGGGGCGGLRLCYCMPAWVT \NETV 
SQERKKERRKEREKEIKKERKKRKKBRK 
KEKKKKG 


2317 


16218 


A 


2334 


295 


81 


FFKPFFPPKKTCPEFS\FFWEKRGFPPP 
PLRSFFQNPPPKGGPPILLKGPPPPPSL 
GWPPPPPIiFFFFFFLR 


2318 


16219 


A 


2335 


375 


8 


TQIVPLPSNIjGNKTRIiRLKKKKRNEQGN 
IPTDTIDNRIKQirQ/TYYEQLCANK\N 
LDKMDKALESHNFPKLKQRESLNI/HSA 
xUSIHr 1/ ILNISTKKTPDPTGFTGKFjjQ 
IFKEKKMAGHSGSHL 


2319 


16220 


A 


2336 


399 


97 


FFFFFFFKQNLA/ SVTQVGVQGQYFRSL 
QPLPPRVKPFSFPNPLSNRGYRGPPLGR 
VKQENCPKIjKSKRFQIjNKIPRIjGKKKKIj 
RFPKKKKKEKEKKIVKT 


2320 


16221 


A 


2337 


411 


57 


KKPRSFSSCSSSPPFFFFFSPPPKKKIF 
PPPQIFWGPPFFPPPPFFKPPPPPFFFS 

FFFFFFFFFFFFFFFFFFFFFFFFFF 


2321 


16222 


A 


2338 


97 


354 


AKAPSLSIiVLFSFSTFFLGIQGLALLPM 
IiECRGAITAYCSIjNLPDSSGPPTSAPSP 
TPY/RIAGATGTHHNALLLFKFFSRDGL 

DT 
rij 


2322 


16223 


A 


2339 


391 


45 


LMFFHLSHKHRSGEAPSIHWSIYLSIHP 

CTTIDQT.VIJQQTHT.CTHQTTVDC'PfT OTU 

/CISIHPSIYPSICPSVHLLAHSFIHST 
ICPSMHLPIHVSIQHFLSAQILPVSVFG 
i?v<?nT 

uVOUX 


2323 


16224 


A 


2340 


506 


0 


RDHEQLG I VRADKKKKKKKKKKI KKKKK 
KKKSSSSSSSPGTFFRG/VPLKDPVG 


2324 


16225 


A 


2341 


203 


1 


VELRVRATEPGFNFLKGIYCTSMVDWIK 
KMWYIYTMEYHAAIKG/DRIMSFVATWM 
ELEAI FLSKLMQ 


2325 


16226 


A 


2342 


402 


40 


PYPPKKKASPTDA\FSSSSSSPPPFFFF 
SPPPKKGFFSKPFFFFSPRFFSFPPFLK 
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PPPQFFFFGPLKKNFFFPPPALKFFFFK 
SPPPLFFFFFFFFFVEAGVLLCYSG 


2326 


16227 


A 


2343 


IS] 


1 


QLKDQASDUjGKNGDEVKETI PS FLPSS 
LPPF\LPLFLHQFLPSFLPSFFPPSLLP 
FLS 


2327 


16228 


A 


2344 


2 


407 


FVASQLGCSGV \RVRDPLEEAVCPFSDL 
QLRAGRTTALFKAVRQGHLSLQRLLLSF 
\ VFLCPAPRGGAYRGRQAFIiS CGGLHP V 
GASRLLCLPKQAWAMAGAPPPASLPPCS 
LISDCCASNQRDSVGARPSEPGAGH 


2328 


16229 


A 


2345 


405 


178 


IIKFIYEKATA/LLNG/EKLKAFPLKLG 
TRQGCLHSLLVCNTVIiEVLTGTLRQEKE 
IK3GI I IEKKEVELSLFADRIIi 


2329 


16230 


A 


2346 


2 


412 


FKASKASLSPLLGANTAGDFKLKPGLIY 
HSLH\liKNYADS ILLVLCQWNNKAWMIA 
HLFTAWFTEYFSPPLRPAQKKISFKMLL 
FIDSAPSQPGVLMEMYKEINVVFMPANT 
TSIIiQPMDQGVILTIiKSYWLRDTFH 


2330 


16231 


A 


2347 


2 


397 


ESLEPGRRRLQGARIMPLHSSLDNRVRL 
CLKKERKKEIGVL IRCWQECKIVQPLWE 
IVWYFLKKIiN\ESPYEPAVPLLNIYPRE 
MKIHVYTNTCTQIFIVALFTIAKSGKWW 
GTVACACKPSSSE/VJLRWVDYL 


2331 


16232 


A 


2348 


3 


423 


EGCSELGSHHCTPAWAT/AESVSQKKKK 
PKKK 


2332 


16233 


A 


2349 


49 


262 


QMCKGSNRRRGKRVGSRQISKKKTNAPI 
KKWAKDMNREF/DIQMANKHMEKCSTSL 
I IREMQ I KSTMRYHL Y 


2333 


16234 


A 


2350 


356 


2 


FVTAPLHSSLGNRARSYCKEKKKVQVAA 
KAVLRGKFI IAYTVFKKRKISNINLSIS 
LKTLEKEEHTETKADGAQYVTKI \RAKI 
NKIETANETKSRSLEKTSKTVEGKCLRD 
ILLSAQI 


2334 


16235 


A 


2351 


360 


19 


LDAQFLEVGLGELLFRSTVPTLQPPGCG 
ASFPVLNTLPF\SLSPSQSSSSPASLVR 
PWVAPPFLCPHGEPDGGPDSTTSLPFPG 
PAATGPAARLIQHPASRQPRPASHTHCG 
V 


2335 


16236 


A 


2352 


360 


0 


NTFLAAFRLVFCQMTS YSLAIiLSHKLAL 
MPIiNLSDLLTRWTHCMGELFFLDILAIQ 
NPFHTVFFLGHPEWGMESRFWQAWQW 
PDLRQLQPS\PPGSMRFSCLSLPNSWNS 
PSYGRL 


2336 


16237 


A 


2353 


34 


474 


EGWRPCKEIAARQ VGCPHSCFS PHWQLL 
QKQEKTAGAVSVCVCTSTLCVCVCVCVC 
VCAQAMCVCAGA/CFCVCVCAGA/CLCV 
CVGA/CLOTCAEAVSVCVQ/VAVSMCVC 
RS/VSLCVOyiQGQSLCVCAGA/CLCVCV 
CGIPPPVLCLN 


2337 


16238 


A 


2354 


297 


16 


KFFFLKSFFFFSFFFLTPPRFFFFFPKK 
KKZFFFPPRKIFFFFLI PPPPXFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFXRHGG 
WFEEITILTV 


2338 


16239 


A 


2355 


3 


315 


PVTPATRETEAGETIAHDLGEPGGRGOG 
ELRSCHCTPAWVTEQDSVSKKKKKKRGA 
RFKESNFTTPGLQRNIFFU3ALKLISGA 
GVriKRRDGKTLGFPQFNRPWG 


2339 


16240 


A 


2356 


399 


154 


PGQRGE I PSLPKIQ/ ELAGCG/GHLNPG 
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nucleotide insertion 














6RI FGEPRSRHCTPAWATBQDS I CKSHS 
RSGLWCYKSSSVYLSTRGVWVRGIASV 


2340 


16241 


A 


2357 


416 


2 


FFFFFS ETESRSVAQAGIiRT \QWRNLGS 
LQAPPPGFTPF 


2341 


16242 


A 


2358 


209 


2 


KKNLVPWPRGGYFKSLQPPPPGV/SCPN 
PPKKLEYRVLFPQPSNPFFFFCIFSRDG 
VSPCCPGWSRTPD 


2342 


16243 


A 


2359 


279 


380 


RGYNP / WPGAVAHTCNPSTLGGRGRQIT 
RSGDQDH 


2343 


16244 


A 


2360 


415 


224 


FFFVFFFFLLITFILMIiNLLPCHKFLFL 
OFLLGYLFLLYlA CAFFLLVAIiNTFITS 
FQQLDYYTF 


2344 


16245 


A 


2361 


97R 




I SD P FRFWNI YR THTGT.T S I» I O I <? FNCV 
SCQKFQILEHFRFQIRDAQSVLRK\RKA 
WTGAVAHACNPSPLGGRGGWISRSGDLV 
HPGQHSETP 


2345 


16246 


A 


2362 






KKKKGGRGGGGGRGGGGXXWGGTlCICKinr 
GGEKKNLWGGGKGGGKGGGS 


2346 


t6247 


A 


2363 


19 


409 


PKPPSVLGGGGPARYPSPLGGPNRPVPL 
GPGVGAPPGPPGKTprrFTiKT KTCT YPARO 
GPPVIPASPGGBGKKSPLPPRPRVPLTQ 
IFPP/PPPPGGPNQG 


2347 


16248 


A 


2364 


159 


383 


HSHFKNLSS 1 1 KKLHRNNTFTEHFSLSS 
SLNQCFLNLTVFYSHLGNFKNSN/SWPG 
AVAHTCNPNTLGGRGGQITR 


2348 


16249 


A 


2365 


400 


221 


GRLRQENCIiNSGGRECSEPRS /HIiHCAP 
AWAT\EQDSVSKNKKQNKQKQTHIYTVL 
LCARH 


2349 


16250 


A 


2366 


383 


14 


GGRGCNAPRSCHCTPAWVTERASPQKTK 
KQKNTHTKKRISSCCYKMEDPLRQA\TF 
LQCPRAEGPSQKAARMELMEKQEKNQGP 
ARHRROEOPIjT5;RPCPDFn'ir , WT»<5nVS <> 
TPAQGLSLLICK 


2350 


16251 


A 


2367 


274 




PRKI LKARGKEHLASRGTM I RMTS DFYL - 
OTMQARREWSKIL/KVLEEKIHQHRIL / 
PVKSSFKSEEE IKTFSDKQKLRGLVTSR 
SDLGKDVK 


2351 


16252 


A 


2368 


161 


2 


PFFFFSETESCC\VTQARVQWRHIiGSLP 
GSSDSPASASQVAWITGTRHYAWLIF 


2352 


16253 


A 


2369 


361 


198 


NGRL I FVFLVEMGF \TMLARLAS CDPPA 
SASQSTGIRGMSHNSQLKCFTEFDSFC 


2353 


16254 


A 


2370 


116 


300 


HLNGDAVEERDFMKCTVSG 1 1 VAHCNLE 
FLG/SSDPSASAPRVAGTTGMCHHIWLI 
FVILVEM 


2354 


16255 


A 


2371 


2 


192 


M KLDPHliS PYTKVNSRW I KDLNLRPKT I 
KILEPNIR/ITLLGIGIiGKDSMTRNPKA 
IAIKTKLAR 


2355 


16256 


A 


2372 


1 


133 


AGELLEPG \GRGCS KPRS CHCTTAWATE 
QDSSPEKKKKKKKGGF 


2356 


16257 


A 


2373 


43 


403 


LHDSPAIiASQGAGTTGVSHHARPAAGIN 
SRIGQAEDRISELEDWLSEIR\RQSGRN 
MDKRMKMNKQNLQE IPJDYVETKSM /NTR 
LIGVPBRVGENGSNLEKIFQDLIHENYK 
GKPIRLMVDL 


2357 


16258 


A 


2374 


404 


215 


GQGGRITRAQKFETSLGNWRPGSEAQE 
LIiE\PDDRGCSEPRSCHSTPAWTTEQDS 
VSKKRRKC 


2358 


16259 


A 


2375 


3 


397 


SKQLEFTQLYTKLNQLNKTKISDLKKEK 
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nucleotide deletion, V=possible 
nucleotide insertion 














YPIRLFSKENTQMASRYRTKYSTWLI IR 
EMQIKTTFRHKLAALASSRCLLGLG\AT 
SAHCCTVGAPLW 


2359 


16260 


A 


2376 


413 


1 


PKKGLFPKPIIWVTPGFFPPPRFKKPPP 
KKI FGAPKKKKKSPPPPAKKFFFFKGAP 
PPFFFFFFFFFFFFFFSGDSQERVREAM 
PVAGGP\PRPHSLSAPHAPGGTAWTPMH 
PVQTHKAQSPKLPASECPPPTTPIiS 


2360 


16261 


A 


2377 


398 


247 


RRFHHAGQAGLELLTSSDLPAIiASQXSA 
GITGMSHHAQPSATHFQKHLVS 


2361 


lozoz 


A 


Zj to 


z 




QENRLKPGGGGCSELRSCHCTS \AWVTT 
AKLCLKKKKKKKFFGKGGG 


2362 


162o3 


TV 
i\ 






y\j 


FFFFFFFPGGGRLQVCPPPGILIiFCFIiY 
KGGSPPGGSTIFPPPPPGEVGPPGPPPP 
GGFFFFFLE / QGGVSPDGPGFFVLPTPK 
KPPPPAPQKGGBPKFKPQVWGPPWPTF 


2363 


16264 


A 


2380 


36 


427 


VHPLNHHDQKGQASSTQKKKKKKKKKKK 
KKKKKKKKG/WKGGGAF 


2364 


16265 


A 


2381 


402 


2 


NFFLKGRGWGPLPPFLPRGPPPRGGPQK 
/RGGLGGPPPFFFGSKTPPAGGSRTPMG 
QKKKGPPLPEGPLFGGAGQAPFLPPPVP 
RGGVPSPKKKKG/APPPFFPPPFFPPPP 
PPFFFFFFLRYNLALLPRLGCSGT 


2365 


16266 


A 


2382 


166 


5 


THGHVI YVDQEMQMMIENM/ WPGTVAHA 
CNRSTIiGSRGGWITRSGDLMVKPRLY 


2366 


lozo/ 


TV 




z 




TSQ PSIiLSS VfDYRSTS PRLANF\ ILFYF 
IYFFFAFSVETGFHRVSQDGLNLLTS/S 
/IPSIPRIPKHWDYRHDP\RTWP 


2367 


16268 


A 


2384 


2 


417 


GRVGFSQSNGNPS SLS FTLLKVDFEVTI 
PGEGKDRIFKVS I KWLAI VS WRMLHEAL 
VSGQIPVPLBSV/QALDVAMRHLASMRY 


2368 


16269 


A 


2385 


318 


32 


TMB IMIiDKKQIQVIF/ FBFKMGREAVBT 
THNINYTSGPETVQWWFKKCCKGDESLE 
DEBCSGRPEVGNDQLRAI I AHASADAWV 
DRDSGRCCCACP 


2369 


16270 


A 


2386 


409 


3 


ISQAPSTPPRKRGFR/PPQKTSFLGPPF 
YAAFYQEKKFLFFSSDPPREtGDKGKQK 
GFPPPKVAPKKKGFFKKGPPPGKKKDPP 
SFFKHKSQPTRPPRAPALEGPRSR/SAA 
LQPGDRRR/PPSQKKPTRPPTRPPTRP 


2370 


16271 


A 


2387 


415 


10 


KKGFFPFGPFIFLFPPPGFLPPPPPPIF 
WFSGFCPPPPPLFYFFFRGGPK/HILVF 
PLFFLPPPFFFFFPIFAKESPPKKGPGP 
LNFFFFGPKKN/ S PPPFFFFFFCEMEFH 
SCRPGWSAKWHDLGSLQPPPPVFK 


2371 


16272 


A 


2388 


410 


95 


KATMDKSDH I KLKS FYPAKETPTKVXTQ 
PPEWEKVFANYPSDKGLIPI IYKELKQP 
YG\ KKSNNSI KKWAKDLNRYFSKDIQMA 
NRCMKRCSRPGAYAHTYNPS T 


2372 


16273 , 


A 


2389 


362 


92 


RFLFFFSPPPKKGFFPKPFFFFS PRVFP 
PPFFLNPPPKLIF\GPPKKKIFFPAPGG 
KKIFFLKGPPPFFFFFFFFFFFFFFFFF 
P VENT FY 


2373 


16274 


A 


2390 


131 


487 


ATEHEKTEKS SLSFFSXS KRKKKMEKLH 
DIGFSSNFLG\RPKAQATKGKTDWTSAK 
LKI CSSRDTI SRMKRQPKEWAKTFANKS 
CNNKKPEKIDKNKKKKKKKFLGGALLKK 
TNLKPRG 
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pepuue 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
\£=\?i (114 mine, iv— /vrginine, o^oenne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 




lOX/J 


A 

A 




4U5 




P W Q T . 2\T PP T .PC* Qf^VQ C 21 Dr'TTlT'PP Df^rP 

PFSCLSLPSNWEDRCESPPPAQ/IVFIG 
EGFYILHGFF/RRGPKIRCF I SGCPPPV 
LSFPT 


2375 


16276 


A 


2392 


1 


203 


LFFFAFSVETGFLHVGQAGLEPPTSGDP 

DTTC A /"PDM/TATTYVTatin 
rvon/ c rWv^Wl/lKnJJ 


2376 


16277 


A 


2393 


386 


16 


TPSAGGRHIBLSLSTCPSCAQHQGKEHL 
AVEDKLS IRLETDI IiKTKS / WPGAVGHT 
RPRPPKVMGLQA 


2377 


16278 


A 


2394 


3 


396 


QLLERLKQEDRLSTGSQGCSEIiRSHHCT 
PAWATEP\DSVSK 


xo /o 




TV 


2395 


273 


416 


r r XoU IjiJ^KWKKQPGTWAQACNPSTIiG 
GQGGRI /TLRTGVRDQPGQHG 


2379 


16280 


A 


2396 


329 


76 


FI P IESACSQBCLKPNLRQEW/ YIFGTL 
Jvbl YsjsX. iSoriPVIV/ DWbAGBIiIi\DPRG 
RGCSELRSCHCTPAWATRAKLCLKHTHK 
IKK 


2380 


16281 


A 


2397 


210 


7 


GGKKMYCRKPGGGGFFAAVWSPKGLFS/ 
ANRHFSKEGTQQ 


2381 


16282 


A 


2398 


48 


393 


S ILKTRKPPLKKGRGRRKKEKESVRTHV 
FFSYQSNAFPSKPLNNSMTLGRLLSFSF 
cuvtlrc r x ±UQi JuxjvJCPGWSRTPGIjKGS 
SHLSFPKCWDYIHEPPHPAYHS/LFCR 


2382 


16283 


A 


2399 


128 


383 


EEAPKHFPKPNLH/QKKVLVTAWWAAAG 
LIHCSFLNPWBTITPEKYAQQTNEMHQK 
i^UXiy XjAJj V JNKA.tr P X LdjHHN VKlAi 1 1 J. H 
AS 


2383 


16284 


A 


2400 


54 


384 


LFTFILNSVFHTYMCLYFWTLFFSVNPF 
VSMPIPQCLDDSSFIISLDSEINPNIYS 
QL I FDKKTQRGKNS LPDRWCWENCI FTY 

ITDMlTUf TIOVT . /CDVTVTTOMMMirnT MTV 

l\J<raiVJNlJr Xij/oir x 1 I^XaWWWlUJLiiNXR. 


2384 


16285 


A 


2401 


393 


1 


HRGENTHQQGGGLSRWRRHSRQRGTSRW 
IRHTRQWGPSRWITPSRQRGSSRWRKRS 
RAGDIEVEKTQQSEGALEVRIRSRQRGT 

QPWPQPQPnPVQPPWPTPQPrtPPQO /w 

RRRSRQRGTSRWRRHSRQR 


2385 


16286 


A 


2402 


1 


330 


RPPPPPHCWDYR/HEPPR\QPTLWVIFK 
I^VETPXCYVAQIGLEIJ^SSKSSSRLD 
LPKCWDYRHELLCWMVTFQEKLV\SGFL 
FKI PRFFKAGME I FKOTEGFWS SHPIAT 


2386 


16287 


A 


2403 


3 


407 


ADAWGLRGTHGPWEQAGISGI S PSNS FL 
PVCFRQSVALVSQAGEX3WRRLGSLQPPP 
PGCGRESC\ PS FOGGITPSCFFVFLEKM 
GLHRVGQAGLQLLMSGEPPALCSQRRGI 
TGVSHHARPPSKGF IWHTGAPAM 


2387 


16288 


A 


2404 


239 


573 


VCFGVFS LSHFFES E FSS I TQAGVQWLN 
rjGSLQPPSRGFKQFLCLSLPSS\YWFTG 
TPPLVGGGSAGLRGG 


2388 


16289 


A 


2405 


1 


122 


PTRPGRGCGELRSCHSTLAW/VNSETTS 
QKKQRKEKKLPLL 


2389 


16290 


A 


2406 


417 


130 


QALRVKjWQLVPSSDFL^TRAQSVNNL 

lrhsssssssssss /sssssssssssss 
sssssssssssssssss\ssssssssss 

SSSSSSSSSSSSSSSSSSSFLFLFFFFG 
LRGAFLGRFCPIRR 
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correspond 

ing to first 

o mi nn m*tH 
dllJIUU aviu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, I^lsoleucine, 
KpLysine, L^Leucine, M=M ethionine, 
N=Asparagine, PHProline, 
0=C*liif"afniiiR- Tt— A roi n i n S=Serine- 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno>vn 5 *=^top codon, /=possible 

nucleotide insertion 


2390 


16291 


A 


2407 


327 


3 


RQSQSLRYMAHQPVCCFTCSILIRAELN 
E I /ETKNKKIEKINET /RKSWFFEKINK 
MGAI TPDATERQNI IQGYYEQLYMQNLE 
NLEYMDKFLGRFKPPSLNQEEIiDTIiNR 


2391 


16292 


A 


2408 


178 


404 


AAYRINSLQEVQHPTKRHSQPRKQSKIiR 
F^3NAORNNIRRGPGMVAHACNPSTIiGGO 
GGQV/TLRSGVRDQPGQHG 


2392 


16293 


A 


2409 


94 


386 


TSFALVALGGRSCSEPRSCHCFPAWVS\ 
SETI^QWKRKTPTLNNAKyWGGRFLRFF 
FLTRGGGGRFFFFGGTKTKSLGAGFKKG 


2393 


16294 


A 


2410 


1 


384 


PTRQVRI KLFPRI YVHTRKRIiKNI YGNT 
ETFITQPFKNHSVWQLIRNNRNAKKQEN 
VAHNEEKNKKQS IETN\QKYTHNKNLDT 
TKKKKKKKKKKGGGPLKKTIFKARGGEN 
NFFFLGPQKLNSGAGF 






r A 






*> 


FPPFPPPOKKST,FPPFFFC^PKTFSSPP 
VFLTPPQKI FFCPPKKKKYFPPPRGKIF 
FFTjK\ PPPPFFFFFFFFFFFFFFFFFFF 
FIFFFFFFIiGFIiLSSLLIGVPFIiLLNAN 
IMHiLQKKILKDEVQI ISSFPSTKNRNV 
L 


2395 


16296 


A 


2412 


218 


112 


LKKENNTKCYYGWGTTGTLIHC \ WW \ KL 
TOVLWKTD 


2396 


16297 


A 


2413 


64 


403 


LKNFFFFFFFKKKGPPGAPGGSKPRGLG 
EPPPPPPQRGGNTGGGPGARPKKNGGGF 
FFFWRGEPP /PPPPAKGGGODLGPGPFG 
REETNNFPAPAPPGPKNKNLGPRAQKIL 
CF 


2397 


16298 


A 


2414 


217 


409 


VKYLSYSVLLTI I CTVQACYQELRPGAM 
AHTCNPSTLGGHGGRI/TLRSGVRDQPD 
QHG 


2398 


16299 


A 


2415 


416 


0 


ERPPPSSFFPPLAPGQVGQFFYGKGPWP 
EKSKKKNRFWGTKPFPPEGGGPPPPPNS 
P PGGGEKS P LFF FGGKGNPTWS PG PKFL 
KKRVIKGPLHPGGKGYKKTREFGLKPGP 
PPPKFFLGGG/PGFQNWEKPPQP 


2399 


16300 


A 


2416 


277 


0 


PFVLLSSSSSSSSSSSSSSSSSSSSSCS 
LLVRYLCKVKHKSSLCAHPGTVPFIiELI 
HDTDS FAH/VFPADLL 


2400 


16301 


A 


2417 


205 


486 


RHGYGFGFWKNTCMQHRHSYRRVYIHRH 
IHLTFLAYVHRESPEAMSALLTQIIiLFF 
KLLSFLRRCLA/ SVTQAWQWCSLGSLR 
PRPPGFKRFWC 


2401 


16302 


A 


2418 


364 


1 


KESLGDPVKDTVILFATRNPMQMGSNYQ 
FFI YLCLTHLLSRS YCWFGCKMLQ/ PLW 
KTVWQFLKKLNAEFPYDPAIPL\DICIP 
VFTASLFI IAQRWEQPKCASTDEWINQM 
WHMHTIEHYPR 


2402 


16303 


A 


2419 


281 


492 


LFPPSLPASHPKLSTQQPERSHSVTHCG 
/VDRRGSDLGLDQPRPPRUOPSSRLSLW 
SSWDYRPAPPRPANSC 


2403 


16304 


A 


2420 


63 


465 


RLQRFLLS F\VCLCPAP I GGAYRGRQAS 
LSCGGLRPVRASLLLCLPKQAWAMAGAP 
PPASLPPCSLISDCCASNQRDSVGVGPS 
NPGAGYYIiVARRFLSPLEKRSIQVGVTL 
FSRCRLS PliSLTRKGNSLTPCAS 


2404 


16305 


A 


2421 


61 


482 


QRARITGVSHHAQ/ LRFCLFDMRSHS VA 
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E=GIutamic Acid, ^Phenylalanine, 
G=G1ycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leu cine, MNMethionine, 
N=Asparagine, P=Pro1ine, 
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T=Tbreonine, V=Valine, 
w—iry prop nan, Y— Tyrosine, 
X=Un known, *^=Stop codon, /^possible 
nucleotide deletion, Wpossible 
nucleotide insertion 














QAGVQWCDNSSLQPPPSGLKMIj/STSAF 
QSL/WDYRCEPQHP IHY 


2405 


16306 


A 


2422 


90 


444 


YCFSSECSEKCRCPGHDLQLSPGLCHHG 
VSGIAKPLLCPWPWIWKPQPLCSPTTSL 
PFLPLPPFSQCPQGPAWLEIdSGRPSLWK 
QGGLQSLAIKR/REGSRAQWLTPVI PAL 
WEAEAGG 




16307 


A 


2423 


1 


444 


PGPGPGCGPVATLGSPSRPARTDS PSLP 
PHSQLREAEARNRDIEAHVRQLQERMEL 
LQAEGATGESLMCPLPRRT/WEEVGRLL 


2407 


16308 


A 


2424 


189 


418 


SRRAEPGSFRGCLRVGVPSCTCV\SLWV 
CVOTCWG/VCVOTCVCVCARA/CLCLCA 
NFSL/CSHVSLCLSLSLSL 


2408 


16309 


A 


2425 


30 


895 


IjDEQCTSEIHRRGEATARPRAPKHPAPP 
ATAVRGRDAASQNLKRRPGSGTDGLRLQ 
GAEPSRLLRTYAGGAVIPRGTPSRAQPP 
PPQDPLGRRRWLSRNTWGPWPGTTQPPS 
PQLLRNDWGSCGFMVPEAARGKVFQDSQ 
EGAHIRRETVS KSVCAEPWRHQRARDPA 
PTNFPLKCQKQRGASTSSGQHGDRVNLV 
FTIDDDYSPPSKR/PCTNBPPQPPVPEP 
ANAGERKMREFNSGPHNPVEETKLICLC 
PSGHASCQVHLWTGAMLLGFQSWRKLPG 
SGLKARILQ i 


2409 


16310 


A 


2426 


494 


154 


SSRVRCQCA\LLGGASQLGCSGVRDPLE 
EAVCPFSDLQPRAGRTTALFKAQMEMQK 
SPVFCVAHAGSCRLEIiFLFGHLGSSLHG 
LWN 


2410 


16311 


A 


2427 


416 


1 


PQRGPLLALEPGRQGAAPVEDLQPQGPD 
KPPPPPIiPQPFRARTWTAAVPRHPPPV 
ACHPPQPLAASKPWP/ SVAGGDLLPLPG 
PERPVHAFFIGFI FVHLGLGGVSGRGAV 
APARSGPVPPRPPS SSTSRFS LFFLHE 


2411 


16312 


A 


2428 


84 


409 


DYKHAPTMPRIQNFVYHSPQQPRCENRI 
NFTPKKKKKKKKKKNKKKKKKKKK\KI F 
WGG 


2412 


16313 


A 


2429 


1 


389 


LRDLSSDRSNPGRFLSTSNSSLY/EKDK 
RNKAYFTK/RPSPVNDIIST 


2413 


16314 


A 


2430 


456 


0 


PGWPGRGAP\PSQTGWPGRGAPHIPDDG 
QPGRGAP\PSQTGWPGRGAP\PSQTGWP 
GRGAPHLPDNGWPGRGAPYI PDDGQPGR 
GTP\ PSQTGRPGRGAPHIPDDGRPGRD / 
GSSLPRRGGSRAEALLTSQTGWPGRGAP 
P 


2414 


16315 


A 


2431 


3 


344 


CRERRSCHCTPAWAT \SETPSQKKKKKI 
¥ FGNG P PGGPQAGIjKIjRAWGFFQKRGTM 
GPGTKNHPGQRGEP PLLQKKQKITRPGG 

IP 


2415 


16316 


A 


2432 


1 


109 


RPLRRLRQENRLNRGSRGCSEPKLC/HL 
CTPAWAT 


2416 


16317 


A 


2433 


1 


239 


QSFAVLPRLVSNSWAKVICX/ PSVSQNA 
EIT 


2417 


16318 


A 


2434 


3 


464 


DWLQLEMQGEIVALVHSHHGGLPWLSES 
ARRLQVQSDLPWWLVCRGTIHKFRCVPH 
LTGRRFEHGVTDCYTLFRDAYHLAGIEM 
PDFHRENDWWRNGQNLYLDILQAPGLYP 
VPLSAAQPGDVIMCCFGSSVPNHAAI/Y 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














CGDGELLPHIPK 




i&i to 


A 




J 




LNL VLKGLS PS PFPALPISFPIFFFSPH 
FLGTPTLEGGRADLPFLQPPGAPG /qpa 
PLNYGPGPYRNPCP/RLPQLKPAGPGHG 
LLKSPPPNPGRNWPLLGSLFDFFKKRTS 
IPLP 


2419 


16320 


A 


2436 


238 


400 


QFRWKRGKAI FFFFFLRQS P SVTHAGMQ 
WSNLSSVQPPPPGFKQFLC\PSVPSS 






TV 




l i 


/MO 


HPRPQRWCKVNFSYSPBQADELKLQAGE 
IVEMIKEA/CGNPDMPSVS PGPQRPPKT 
TEDKGWWEGECQGRRGVFPPNFVLPPPP 
IKDAOPLIlLFGIDOEAGPTESGISGTRF 
RRLSC 


2421 


16322 


A 


2438 


411 


80 


PQAEEGAPTPGSENFNPPPPRGGGAPPP 
PQKNFFPPRGVNPGGGGGQKRPPPKKGG 
/S PKKNPGGDKNPPPKKKKNT GEGGGFT 
GAPRGTPKKTPPPRDGYFQVFFFIVSLK 


2422 


16323 


A 


2439 


290 


1 


QLNKIKKTPLLFLPWANBKVPKIPPNYP 
PGPKICJCGYPPS I FFFFFUSOGI* /NSVTR 
AGVQWRDLSSLQPPPPGFKRFSCQKPNS 
AFPHASADAWVDP 






A 




AAft 
*rr\3 




PFSRPLFFFS PP PKKRAS P PP FFFCFPR 
VFFPPPFF/SKPPPKFFFFPPPKKKKIS 
PPPPKKIFFFFSPPP 


2424 


16325 


A 


2441 


131 


408 


GCVPPEPAFLCFVLEIVSSVAQAGVQWR 
NliSSLQPSPPGIiKRFYRPRIiPSS/QDYR 
RAPPIiA 


2425 


16326 


A 


2442 


102 


351 


QASSSVLKLCVCVRARLCV/ CACVCACV 
CVC/VCVCECVCVC 


2426 


16327 


A 


2443 


190 


3 


PQGAREKSHRPGPIGRRIiKL/DPPFLSP 
EiVKIOTRWIKDIiNVKPTTIKTLEC3^LGN 
TLLDTCPG 


2427 


16328 


A 


2444 


410 


30 


vcvapplcvcigtlcvrthtqfcvcvht 
hslcvcr/cqflcvotq/vavsvcvcra 
slcvc^\alcvcv<^\tvcvcvhgtvcv 
(^crdsfcmctcagavcvcvcvcvcv\c 

LOTCVASAQLLGRGFCSSVNIKGAGP 


2428 


16329 


A 


2445 


243 


1 


KVMVQNKGPFSNLFFFGPTINFFTPQFK 
QQGGQNPNPFLFFFFFMRQGIi / DS VTQA 
GMQWCHAGSLQHLPPGLKQFSCUjP 


2429 


16330 


A 


2446 


276 


408 


MLKNCAFWPGTVAHACNPSTLGGQGGRI 
/TLRSGVRDQPDQHG 


2430 


16331 


A 


2447 


24 


405 


LGDVCAFFFFFYLKNQLNPGGKtilLPVG 
PMENQRWEQFDKIjQKGGIKMNPLRGGI 
SVPLTKKKNQ/WARGE 


2431 


16332 


A 


2448 


239 


3 


SPLCGNNVYKPSTVKKTNQVEKMPPSKQ 
IWELCVELLWO /SNRGIAGSGAHACNPG 
TLGGOGWRIMRSGDRDHPGQHGETP 


2432 


16333 


A 


2449 


406 


144 


GCSE PRS RPCTPAWVTSETLS / RKKRKR 
RKKKSCLLRAILSTSPELTHFTLTTPLF 
SRYNDYPHFRDEKTEARRVYATCSGSHS 
WKRLG 


2433 


16334 


A 


2450 


349 


1 


GGAIFLTPRPKVPLTPITPLAFNPGHQE 
EIPLPKKKKKIVRAIYDKPTANIILHGQ 
KVEVFPLiCTSTRQGCPIiSSLLYQHKTRM 
PSL\PIxLFOT^H^VIARAIKQEKEIKGI 



693 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ED 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acio 

rpc|r) tip r\f 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

/linn 4rk 
Qllig U> 

luct ominn 
ladl aliliuw 

acid 

residue of 

peptide 

sequence 


Amino add sequence (A^AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Isoteucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 

v»iuianjine, n^Argmine, o — oenne, 

M. 1 Ul CUIIIliC) T TallUV) 

W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QIR 


2434 


16335 


A 


2451 


382 


19 


DIjGRMTAGSGDQRCAVGVKLLS fslgaa 
GKGPKHSEAEASIiPKNPGSYNRQAQWPQ 
SSLHLKGTI PDTTS LNTPWKMLSSLKVP 
SWQGAVAHDCNHSTLGG\RGGWITR5RV 


2435 


16336 


A 


2452 


3 


396 


FTOjKLMLIYHSENPRALKNYTKSTLSML 
YKWNNKAWMMAHVFTTWFTEYFKPIAET 
YFWGKKKI TLKI LPLVDNAPGHPRALME 

MWE! TtJV /\/n T P T T .nDMIWUT Q TT DOV 
VI i JVB JL1>* V / vyj.V-XljyjrlTUJv^jV Xo XiiKbX 

R\KNTFCKDIAAIDGDSSDRT 


2436 


J UJJ / 


A 




IK 

i Ij 


41 1 


RKPPPPPPGGGGKGGHHPPPGPIFFFEK 
KKKGLFGG/PGGAPNSHPKRNPPPWFPE 
GGELTNPPTFFFEGGAF 


2437 


16338 


A 


7454 


3 


114 


HHV/ GQAGLELLTS DDP PAS VFQSAG I T 


2438 


16339 


A 


2455 


397 


1 


WSWQK\NRRIDQRSNIESPEMNPRMYGQ 
VIFDKIAKNTEWEKDSLFNKWCWKNWI / 
LKRMKLDHPHLTP \ KQKS KWMKDLIRP E 
TEKLPEETGGNPHDIGLSNDF\LDLTPK 
AQGAKVNTDKWDNI KLKNFPTRP 




J0J4U 






424 


3 


PKKKKI FF PPPGFKIFFFLRAPPFFFFIj 
CX£HFLLNP^RSLSCTTSCCVSTIPTSL 

PNTVOOPlA rv*lT it/iOT t IV T /rtf T TTTT mr n 

UNJSoob V \ UvjixriCo LiLiAl / L.O JL» 1 HLiTIj C 
PFCVLLVCTCDTVCVCVCVCVCDWCVC 
PCPYGTIiDIAFKHFFSRWSLTLYAQAGV 
no 


2440 


16341 


A 


2457 


266 


379 


HWPGAMAHTCNPSTLRGQSGRI /TIiRSG 
VRDQPDQHG 


2441 




A 




m 
i\) 




RSVDSATREAEAGGLIi\DPGGRGCSEIiQ 

iJU Ol— .1 Jr/lrV V / x X J-t 


2442 


16343 


A 


2459 


184 


387 


IVHFQMHKMXNVAYIIPQFySFIIiEIQS 
HSVTOAAVnWHSH<? QT.T.PT.PPfTr.TfrYQCU 
/LSIiPSSWDY 


2443 


16344 


A 


2460 


110 


1 


KNRVS FFF/CS FETESRSVTQAGVQWCN 
LGSLQPPTP 


2444 


16345 


A 


2461 


380 


2 


FFFFFSETESRSVAQAGLRT\QWCNLSS 
IiOAPLPG 

j_iV/rVE xj it 


2445 


16346 


A 


2462 


382 


29 


NGPGHGGPPVIPGTWGGQGGGFPRF\GS 
KPGFTWGNPPFLKNKKNYPGGGPPPVIP 
NFLGGKPGNFFYPGGGGFO /SGPGAVFP 

PPPGQRRSIFLPKKKKERKLILFFSLEQ 
RLQNCDA 


2446 


16347 


A 


2463 


228 


1 


KKGTLFKPPPPGGRFFFFFFFETQSRSA 
VTQATVQ/WMPSRLTANPTFRTQGILLP 


2447 


16348 


A 


2464 


395 


1 


RLRRENHLNPGGGGCSEPRS/HPAWQQS 
ETTSQKTKTKTKQKALASDTVLSPRQSS 
EERFHLSLFHSSFVTPFGIFSFLTTSPH 

WLSKBLQESLSRLLWTQAFS 


2448 


16349 


A 


2465 


29 


299 


ETPNEAS P1CTS WD YRHVLPCLANFFIFF 
VKTOL/HRAGLELLTSSDIiLCFPKCWDY 
RH\DRSTWPLSSVFGSIFLVYYWPLYLI 
ETNMLTTL 


2449 


16350 


A 


2466 


1266 


1473 


YFVPQSTQNHAAVFRVGSLLQEGCGKIS 
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peptide 
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Predict- 
ed end 
nucle- 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leudne, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaroine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KLYGDLKHLKT \ FDRGMVWNTDLVETLE 
LQNLMLCALQTVNG 


2450 


16351 


A 


2467 


49 


356 


VQVILLPQLLRRLRQENCLNPGGGGCSE 
PRWSHCCP/PAWVTEQDSISKTKK 


2451 


16352 


A 


2468 


115 


3 


LSHTKWS AWPGAVA/ STLGGRGRQITRS 
GVQDQPDQHGE 


2452 


16353 


A 


2469 


3 


404 


FVAPGGGQGTFPGSLQPWPPRGPGASGS 
PSRAPLI FGFFGGKGVSPVGPGGFLLPG 
SRD/SGPPGPPKGWGSG/GLGPAPGP 


2453 


16354 


A 


2470 


404 


218 


FAQVVIKWGALSSPKPPFPGFKLFSRPS 
PPSWDYR/RRPRPPRLFFFFFLRQGIA 
LLPRLE 


2454 


16355 


A 


2471 


386 


258 


AGGLLSPGVQGYNKLLSCPCKPAWTT\S 
ETLSQKKKKKKLPFY 


2455 


16356 


A 


2472 


2 


364 


HHTQI I FVFLVBTG FHHVGQ AG VELLTS 
GDPPASWDYGR\GHRTWPYSHI FNNL 


2456 


16357 


A 


2473 


1 


292 


DRGCSEPTSCHCTPVWAT \ SETVSQKKK 
KKKKKTPLLRGPE KKNGNP P FRG FFRKN 
PGFKRKKGGAPPPGLFKIGRKKS PVGGH 
RHTFILGGPLPRF 


2457 


16358 


A 


2474 


1 


307 


SLSCSS I VRRACFPFTFYHDCKFPEA/S 
PVMLPVKPVEL 


2458 


16359 


A 


2475 


2 


596 


MKNAEDILTMEVLKSTMKQELEAA\QKK 
HSLWBLLRIPNICKRICFLSFVSSSSSS 
SSSSSSSSSSSSSSSSSSSTKITAW/LP 
PLEAS FYRSTCLMPARALLFASTIPFWG 
LTLHLQHLGNNVFLLQTLFGAVTLLANC 
VAPWALNHMSRRLS QMLLMFLLAT CLLA 
riFVPQGEKSSQVEERKCLSLFSQGLPW 
SHLS 


2459 


16360 


A 


2476 


367 


44 


YQEDITIMN/TYALNIGVPTYLANI/DL 
NREIESNI IIVEYFNTSLSKMDRYRSSR 
QNIDKETVDLKYS I /HINQMDLTDRYRT 
FHPTATERDSISKKRIKIKIKINKSLT 


2460 


16361 


A 


2477 


12 


362 


HHEPG \GGGCSEIjRLCHCTPAWVTELDS 
VSKKKKKKKSQKKK 


2461 


16362 


A 


2478 


401 


1 


FFFFFSETRSHPVA\RLECSGAISAHCN 
QCI 


2462 


16363 


A 


2479 


3 


353 


YMCVCVCACVCICVLPWMCLCVC/CYCV 
OTCWYIiCVCLCVSLCGHQHIiAVSGKRSQ 
PPSHBSFKTSLIiLWVFKNLPPLIiCAWVP 
QVSQNMPLDGTRTLNNVSLPDEKVHDLL 
LPLTAA 


2463 


16364 


A 


2480 


57 


361 


MNESLLSCKRWSLLAFSHI/CVFCLICS 
TOISALCnm^CAC^MHVCGCRCTmCLCV 
CI^CTCTCMYGRVCMMCLOTCTHCVL 
ARMCACGCGCVYKAECP I 


2464 


16365 


A 


2481 


3 


381 


YMCACG Y I CMHA/ CVP VCMCVCTCbCAH 
VCL/ CCMCVCTFVP VCMCVC I CVCLCVC 
TPIITLPFLSQERISFCTDLRSFRATAK 
RSHEEVKNCVYLQTI WNQYLI I FYLQNP 
NFGWAQWLMLVT P AWE 


2465 


16366 


A 


2482 


2 


380 


IHVLGNCLYFSEPQFPHWQQRITRSFLE 
GYGENLMRLHMSYIHMCLHICAYICI/C 
/LCOTICVLU^YIWCVYLCVCICMCI 
FVYLCTH I CVFVCVLVCI YLCV/ C / VCV 

crcvci 


2466 


16367 


A 


2483 


296 


1 


NGTTIRRMIiPIFSRYRPGMPGLVQCTIQ 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R—Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyros»ne, 
X=Un known, *=Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SCTANSVPYSHGFIYFFIYLLRDRVWLC 
YSGWSAVAPSWLTITSN\YRLKPSSCLD 
LLGSWNYRQMPPCI 


2467 


16368 


A 


2484 . 


1 


319 


NTVGLCVCV / CLFVCVLSLCV / CLSLCV 
/CLCVCWSLCVC/VSLCM/CVSHCVCL 
CVYLWLCVSVCVSLCVSLCVCVYECVI S 
STKRSLGAPSRAEAAKLPRNWAPAKQSH 


2468 


16369 


A 


2485 


3 


334 


NHFI ISIDI/DKHFRKIHQPFVPKAITK 
LGI KGNFLNLLRGTS VKSTGNINM\NGE 
KPNNLPLRLVIQGDVLFHLIiLEVLASAV 














KK 


2469 


16370 


A 


2486 


17 


409 


CLDNKKPRKBYCQKTHLIKKNPLLSRIC 
KELLKLYDKRMINPIKKWAEALYRCLSK 
GDV/HEMASKHMKRCSIS/L/SIRKMQM 
KMTL 


2470 


16371 


A 


2487 


410 


165 


IjECNVT IMAHCNliRNDGSSDPPTSAS \Q 
TAGIHHSLiQHFFFLFFVBTGFCYTQAGIi 
ELGSSDLPTLASQSVGVTGTGHGTWP 


2471 


16372 


A 


2488 


142 


417 


VEQLLEGYRTKS LYLRS FFLFVCMFLRW 
SL/DSVSPGLTNCHHLGSLYPPPPGFKQ 
FSCIS\LWAWRRTP 


2472 


16373 


A 


2489 


j 


404 


KAGAQMGIKHPQHSSRLRLRRPSPRLPS 
SQKPIiNTHYFPSFS FCAHRHTHTHTHTH 
THTHTHTGV /HLHPHT 


2473 


16374 


A 


2490 


373 


146 


YGLERKISFKIL/LFVGKSPSYPRAIiME 
MCKINVVFMHANTTPTLQPIDQEVILGP 
GTVAHACNPGNLRGQSGRNA 


2474 


16375 


A 


2491 


427 


85 


KTPPPP\RGPFFFYFFFFFKKAKNFFPP 
PPTP PIjGKKKNP\ PPPEKKIS P PPGFPP 
PPFKKGPPKTLFXTPLKKKKGPGAPQKK 
PFFFKTPPPFFFFFF 


2475 


16376 


A 


2492 


443 


240 


FLFFFFFXXFXXFFFFFXFFXFFFFXFF 
FFFFXXXXXXXXXXXXXXXXFFFFFFFF 
FFXGCLFIYLFI 


2476 


16377 


A 


2493 


3 


135 


ISAHCNR\LLLPGSSNSTASASPVAEIT 
GACNNNRQDFSFI ILQX 


2477 


16378 


A 


2494 


37 


420 


AHVCVCTCVCVCICLSRRVWCAHSHVC 
AGI FLHLINQTFTTHL/ cvckvmalrvt 
PSPCLQGA\QTQLAGQC!MCAKVCIMNFMY 
IFIBGHICPQTCSCLGEPIRGGLSPLVC 
IC/MCAGFFVSFC 


2478 


16379 


A 


2495 


402 


178 


RQNKTPFGVKKQITGGGGRGRVSYFLRG" 

VSQENPLNPGGGGCSEP\RPPGWGTKNP 

LFQKKKKSSKQQEGRGED 


2479 


16380 


A 


2496 


138 


439 


RTFIIiGDHHHHPSAVHFHLPRRKLCTHE 
TGAAPPPPALGATVPPSVSVCLTGGGPQ 
ITPPPPALGARDLPSVSVRLTMGGPQIT 
PPHPDQQPL/SPPSVSVC 


2480 


16381 


A 


2497 


118 


382 


VKS FCASMRACV CECVYMCVSMCVCVCV 
CVSACLICVCLCQ/CDLINKCV 


2481 


16382 


A 


2498 


164 


439 


VKSFC^MRACVCEOmiCVSMCVCVCV 
CVSACLICVCLCQ/CDLINKCV 


2482 


16383 


A 


2499 


1 


411 


ISTSFYTDTINRTALHFAVGRWHLSAVD 
FLLKHKARVDVADKDGMNALHFATQSNH 
VRIVEYLIQDLHLKDLNQPDEPKESPLH 
LWINNHITWNSLLSAQHDIDIL\IRS 
SKPPLHVAADRGNVICLVE/LLLKAGCD 
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ing to first 

4X 111 |UV «Mu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
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residue of 
sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Gtutamic Acid, F— Phenylalanine, 
G=Glycine, H-Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine> P=Pro1ioe, 
0=G|ri famine, R— Arainine. S=Serine. 
T=Tbreonine, V=Vaiine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /^possible 
nucleotide deletion \=nn«ihle 
nucleotide insertion 


2483 


16384 


A 


2500 


80 


404 


RTAAAVS WSFPQDFEGQS PKCTQGVRE 
ALRQIK/RLVPTGSLRH/WPAGSLA/LC 
QPLSDEKDLTQLFMFARNAFTALAMMDY 
PYPTDFLGPLPANPVKGRRKLPRRRERP 




lOJOD 


j± 


Z3UI 




1 


AAI QQGS LACSHS VPPATTPRAYTPVP P 
QLLVRNF \ YP KTLELRSQLRCARRFPRE 
TGADCRHAGAGRQTK 


2485 


16386 


A 


2502 


423 


214 


WQAQVGGLLEP \GGRGCCBLRSCHCTPA 
WVT\ GE I LSONNP KROQNKTKOKMG P G V 
PGSWGSPRAGGLTV 


ZHiSO 


ioio/ 


2V 


ZDU.J 




1 
1 


IiHWNORORCPPPPPPPLPPPPPLPPP PL 
PLR\HRIAGAGQTGAKIRWGRWRVGA 
GLGVAAA/DSRESCPDGAAAGGGEGGGD 
SAATAGPGGGGGGGTK 


2487 


16388 


A 


2504 


331 


411 


PFLKTGNKGAPPPTKIFFLFFKKKGG/W 
PGAVAHACNPSTLGGRGGRITRSGDRER 














GGRGCSELRSCPCTPAWVT \GEVIjSQKK 
RKSSNKKSRCLEDQAEVGGERQLVQVSV 
PSTCQRRVGAWFIYL 


2489 


16390 


A 


2506 


1 


468 


RQGSMNKLETBRQIKKAPARNPERERER 
ERESKKGEDRHTDICRP/RERERQNYRD 
RO 


2490 


16391 


A 


2507 


27 


380 


FVCLCPAPRGGAYRGRQA5LS CGGLHP V 
RASWLLCLPNQAWAMAGAPPPASLLPCS 
LISDCCASNQRDSVGVGPSEPGVGYNLM 
VRRFLSRSEKRNIRVGVTRFSRCVCHPF 
L*LGKG 




lojyz 


j± 


ZJUo 




ZJ*T 


RQENCLNPGGGASSEPRSRLCTPAWATE 
RDSISN/QNKQTNKSHILRAAFSRPSCY 


2492 


16393 


A 


2509 


1 


421 


AKKIKPPFPLKKKKKPGRGGPPPPPPP/ 
AKFGRETPLNPGGKGSIKPKLGPPPPPP 
GGPPNLPPKKKKKKAIPK/PLVPSHVKS 
TGFPPSCS/RSALIiRAETQ 


2493 


16394 


A 


2510 


197 


3 


TGPPDFNFFFFLEMKSHSVAQAGVQWRD 
LGSLQPPSPQFXQFYYWGLQGA/LPPCIi 
NKICIFSRGGV 


2494 


16395 


A 


2511 


114 


454 


QGPLEKKGAPFKNKFFKTGVLFFGSQNI 
PPWWFKKGFSRTPFE IKKKNPPGILGGS 
PKKFFFFFFFEMVLLHHPGWSALAQSLL 
TTASDF/LRLKQSSHIiSLSS 




JUJ7U 




«J I £t 


3 


495 


FFLSRGIjFIHLBSAPAIFQCIiLFWFLV 
FVATGSHYVPQAGLKLLASNEPPASNSP 
KCWVYRHEYPALKCLCFGLCLF/AFLP 


2496 


16397 


A 


2513 


25 


437 


PljLFS PPAGHAMEED ILPPAPS FLFYFF 
FYWLVWVKI IYLYVLAQAATTNYHRLSG 
LNNKHLFLTILEPRMP/SIKVPA 


2497 


16398 


A 


2514 


2 


256 


QAPSAEGEMTSYVLLAYLTAQPAPTSED 
LTSATNIAKWITKHS \NAQGGFYYNQDN 
LPYSLCSGRNESTAFGTNGQDI IHPVHS 
S 


2498 


16399 


A 


2515 


2 


284 


KCQCDELCSyYQS CCTD YTAECKPQVTR 
GDVFTMPEDEYTDYDDGEENNNAIRHEQ 
A\GFTSVTSDLQAQI IWISEQKHAMSFV 
EEMTVLEVTL 


2499 


16400 


A 


2516 


2 


432 


RDCERESSRAAEYTASLKASCWMGDLAE 
SPSLDPQVAMGTPPAGPCGWSVGTRSQP 
FPRGPSRGPCCVSPPSWTPEPGTGQ/CG 
EGGGRWSPQVPP 
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Predict- 
ed end 
nucle- 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glydne, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,A=possibie 
nucleotide deletion, Impossible 
nucleotide insertion 


2500 


16401 


A 


2517 


401 


2 


GGEAAIRGQTKPQIjEGKAPECSECEKKI 
VRS\PRLIRRQRTPTGEKPYECEECGKS 
FSRRYRLAQHQR\TRAGEKTYECNECGR 
GFSERSDLINHYRVHTGERPYKCDECGK 
NFSQNS DLVRHHRAHTGEKPC 


2501 


16402 


A 


2518 


398 


1 


ETFGK\ SGGRS I VPGQFLAVGPKGRAVM 
ISAIEKQKLVYILNRDAAARIiTISSPVE 
AHKASALVYHWGVDVGFENPMFACLEM 
DYEEADNGSTGEAAGNTQQTIiTLYELDIi 
GLNHWRKYKEPLEEHGNCI 


2502 


16403 


A 


2519 


384 


3 


GGGID/ SDASLVIAGVRLEDEGRYRCEL 
INGIEDESVALTLSLEGEALPLPPHSCV 
AAGPPRLGLPGLLPS ISSAPLGTPAPSP 
RPRRPSSPSAPIRWPSPGSPPPPRCGVS 
VPTQPGPVPVQLLRGVY 


2503 


16404 


A 


2520 


1 


426 


GDRQMITALLRKLKQQSRESVEENRPRL 
LKALKELGDFYLELHWDPQSWVPLLSRI 
LPSDACKIYKQGINIRLDTTLIDFTDMK 
CQFjGDLS FI FNGDAAPSESVWLDNEPK 
VYQRLRHEES/QEKNRTQVDCLTP 


2504 


16405 


A 


2521 


2 


425 


ALPIGRMPIMVRSSR\CVLTGKTPAEFA 
KLNECPLDPGGYFI VKGVEKVTLI QEQL 
SKNRIIVEADRKGAVGASVTSSTHEKKS 
RTNMAVKQGRFYLRHNTLSEDIP I VTIS 
KPMGVESAQEKIQTVSPQEPEWR 


2505 


16406 


A 


2522 


2 


376 


IQSFSWYGYHLCSQGILSERVSASKFP 
LLFFFFFKKGGPPFMP / QGGKTKGPLKP 
GDPQSPGIKGFPCLAPPNKGKKGGPPPP 
RGNFLGFKKKKRFPQVGGGGSKPPREGK 
PPPIiAPQGGGNRGG 


2506 


16407 


A 


2523 


403 


3 


GAPRSLSEKERQLMGM INQLSS FREQLL 
HAHYEQKKLAASQIEKQRQHMKLGKQ\Q 
QEQIARQQQRLIQQQHKINLLQQQIQVQ 
GHLPPLMI PVFPRDQRQLDALAQQGFLL 
PPGFSYKAGCSDPYPVQVIPLY 


2507 


16408 


A 


2524 


2 


369 


NARCL I LRAAE YMAACVYV C I S VFACMC 
FCVRVCVPVOTSGCLCVCVSMNVHLSLC 
/VEQCACLWMCVSLHVCLYICACLCVSV 
CEAVCLHLESHCRGGSRPFPVVGPAFSL 
SSCIilLAPPSVT 






7V 

n 


ZjZj 


59 


380 


RNRNFNKNS S F FFKKGP WGGGPGGTK 
GPKQRQGEP\NHWGQKNFPGQTRKPRKP 
TPGPPPPKKI FFFKPLEKKGPGQGPQGG 
QKPGGQKKPPGGAPQKRGKTGGET 


2509 


16410 


A 


2526 


1 


371 


VMPLSRKHPKGFFSNHTLVLHVARSEMD 
KERVFQATRKSSP/ CFVPLIjPWPT 




10411 


TV 


2527 


445 


3 


RWLGLAL I ALKRELKKGDL \ PEMRWWDS 
SIIPNG\FDLTEETPKREDYFGIANLVE 
HPAQLNPPVDNDTPVTLGVYLTKKEQKK 
LRRQTRRE AQ KELQE KVRLGLMP P PE PK 
VRISNLMRVLGrEAVQDPTKVEAHVRAQ 
MAKRQKAHV 


2511 


16412 






i 
l 


A1 ft 


Ivir 1 K.1T J\I\I\JLl AOIuUiyO l\\£J2*& MxVKVL/ J. 

NAEDNEMEEETDDGPLLVPRVKVAEDGS 
IILDEESLTVEVLRTKGPCWEENDPIF 
ERGSTTTYSS FRKNYYS KPWSNKETDMF 
FLAISMVGTDFSMIGQLFPHRARTEIKN 
KFKREVYASAIEDQATSVHT 
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SEQ ED 
NO: of 
nucleotide 
sequence 


SEQ n> 
NO; of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
OCysteine, D=Aspartic Acid, 
E>Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arg'mine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrostne, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


2512 


16413 


A 


2529 


385 


1 


qkkcedlk\dqdnp I vrppptpgscghg 

G WNI SKP YWQLNWRG FYYL YGAWGRD 
YSPQHPNKGLYWVAPLNTDGRLLEYYRL 
OmiDDFLLYINAREIRITYVCK3SGIAV 
NYNNKYVSMYNTVW 


2513 


16414 


A 


2530 


1 


1228 


FRATIiRPETMFGQTNCWVRPDMKYIGFE 
TVNGD I F I CTQKAARNMS YQGFTKDNGV 
VPWKELMGEBILGASLSAPLTSYKVIY 
VLPMLTIKEDKGTGWTSVPSDSPDDIA 
ALRDLKKKQALRAKYG I RDDMVLP FEP V 
PVIBIPGFGNLSAVTICDELKIQSQNDR 
EKLAEAKEKIYLKGFYEGIMLVDGFKGQ 
KVQDVKKTIQKKMIDAGDALIYM\EPEK 
QWSRSSDECVVALCDQWYLDYGEBNWK 
KOTSQCLKNIxBTFCEETRRNFEATLGWL 
QEHACSRTYGLGTHLPWDEQWLIESLSD 
STIYMAFYTVAHLLQGGNLHGQAESPLG 
IRPQQMTKEVWG\YVFFKEAPPPKTQIA 
KEKLDQLKQEFEFWYPVDIjRVSGKDLVP 
NHLS YYLYNHVAMW PEQR 


2514 


16415 


A 


2531 


335 


2 


KXKALFFIHPPPFVNGRPHKNPKIKSLG 
S PTP FSL I FLKKKFCFVNQDGVRWL YFG 
SLQSLPSRFPPFFCUULLSSWEYRGLPP 
RPGKYFFPFIFFYFLVEM\GFTVLARMY 


2515 


16416 


A 


2532 


3 


380 


AINSYIRGDDPSSYPEWQSASRSSK\W 
SPLPRALHLTDAK 


2516 


16417 


A 


2533 


3 


2083 


SSEGYLRGNMSENEEEEISQQEGSGDYE 
VEEIPFGLEPQSPGFEPQSPEFEPQSPR 
FEPESPGFESRSPGLVPPSPEFAPRSPE 
SDSQSPEFESQSPRYEPQSPGYEPRSPG 
YEPRSPGYESBSSRYESQNTELKTQSPE 
FEAQSSKFQEGAEMLLNPEEKSPLNISV 
GVHPLDS FTQGFGEQPTGDIiP IGPPFEM 
PTGALLSTPQFEMLQNPLGLTGALRGPG 
RRGGRARGGQGPRPNICGI CGKSFGRGS 
TLIQHQRIHTGEKPYKCEVCSKAFSQSS 
DLIKHQRTHTGERPYKCPRCGKAFADSS 
YLLRHQRTHSGQKPYKCPHCGKAFGDSS 
YLLRHQRTHSHERPYSCTECGKCYSQNS 
SIoRSHQRVHTGQRPFSCGI CGKSFSQRS 
ALIPHARSHAREKPFKCPECGKRFGQSS 
VTiAIHARTHLPGRTYSCPDCGKTFWRSS 
TLIQHQRSHTGERP YRCAVCGKGFCRS S 
TLLQHHRVH\SGERPYKCDDCGKAFS\R 
ASDLIRHQRTH 


2517 


16418 


A 


2534 


434 


3 


APLHSGKRSPTKCN\ECG\GAWNRSSLL 
DRHKI IHSEENPNKCEECGKAFKQASRL 
TIHKI IHAGEKPYKYEECGKVFSQSSHL 
TTQKILHSGENLYKCKECGKACNLFSNL 
TNHKRIHAGEKPYKCKECGRAFNISSNL 
NKQECI 


2518 


16419 


A 


2535 


46 


454 


PSTFSSKVMDKQTLCSSQATSNTSRYAA 
ALYRQGS IYPKEMKTCVHTKPYTVVHSS 
IFMIAKKWKQPRCPSA\DSWRNRMWSIH 
AMGCYLTMEKNKAVMLPRR / WS LENIVL 
SERSH 


2519 


16420 


A 


2536 


13 


476 


ALKTYKYSHKKAFNPKKVYGKCSQKDPF 
PINHCLPTEKLHLCDKIGBGVFGKVFQT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

nentirie 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, ^Phenylalanine, 
G=Giycine, H=Histidine, I=Isoleutine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
0=Clutainine R^ArcHnine. inp 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 

nucIpTttirip dp!<*tinn V=nns<;ihlp 

UUutVUU^ UvlvilUII) ~UVNlt/IC 

nucleotide insertion 














IADHT PVAI KI IAI EGPDLVNGSHQKTF 
EE ILPE III SKELSLLSGKVCNRTBGF I 
GtiNSVHCVQGSYPPLIiLKAWDHYNSTKG 
SANDRP/DPFKDDQLY 


2520 


16421 


A 


2537 


162 


296 


YVCQRYKL\NPFLPSYTKINCKWVTDIN 
VKPTSVKLLQEKRRKSL 


2521 


16422 


A 


2538 


495 


348 


PGW\HAFIVKEPRVEKLCKASARAKPQP 
PAI I AKTFKAGGITGQYAQAAL 


2522 




a. 








AR^FAWTQT AHAVMAQPfirVP T AOT T T.T P 

YI KVGNS E I KRTGGFGS TD P TGKAAYWA 
SRVSENRPACKAI IQGKQFEGLVDTGAD 
VSI IALNQW/PKNWPKLKAVTGLVSVGT 
AS EVYQSTVI LHCLGPDNRDS TSHYS KT 
R 


2523 


16424 


A 


2540 


2 


509 


NVDADDVRLAIQCRADQSFTSPPPRDFL 

CLTAPNYRLKSL I KKGPNQGRLVPRLS V 
GAVSSKPTTPTIATPQTVSVPNKVATPM 
SVTSQRFTVQI PPSQSTPVKPVPATTAV 
QNVLINPSMIGHKNILI/TTNMVSSQNT 
A 










DU1 


i 


f^Wri\ATPriDrSPfiPPPPG , 172\GC VVPW Dtjrt 

\LPQYVRDFFRKKAE\SGMDSSRNLEKL 

CHlxlxLTQVKRESRAHTTTiSD I YLNNI I P 
RFVQVSEDSGSLFKKVQRYFFTEVRGWS 
NDT I FKI LLD T ML T TWVTOIiS VHCYrP V 


2525 


16426 


A 


2542 


473 


287 


EKDFNLPPKDLRL/ KTSDV/ TSTKENEF 
EDYCLKREIjTiMGT CEMGWEKPQ P TOVr 


2526 


16427 


A 


2543 


268 


482 


KKKKAWGLQRGGKNFPAKAELPTHGIQI 
NS CS VRL / VD I KKE KPFS I L KVEGQAQA 
RTHLNRAFDI I VLTRGG 


2527 


16428 


A 


2544 


407 


1 


CKKICYLIHYWWEYKMIQPLWKTVWQFIj 
\KKLNILYDPAVILYDIYPEELY 


2528 


16429 


A 


2545 


28 


399 


FRHSSFQRSGRGSQLMVHFIiSLt/SVMPK 
IGSVAGINYGLVAPPATTGETT,DVnM / JC 
GEADTENH 


2529 


16430 


A 


2546 


2 


365 


FVVim5EVGGEALGRLLVVYPWTQRFYE 
SFGDI^TPDAVMGNHKVKAHGKKVLGAF 
SLX3LALLDNLKGS FATI^ELHCDKLHVD 
PENFPJjLGNVLVC^LiAiiH/FDKEFAPP 


2530 


16431 


A 


2547 


375 


1 


GFACRRMQKRVREVSHAEAESAASKKVR 
ANGSGKQNEGMNVTWTATLALSQAVISS 
ATOTQMQPHSLIQQQQQIHLQQKQWIQ 
QQxAIHHQQQFQHRQSQLLHTATHLQGA 
\QKQKQQQHEWR 


2531 


16432 


A 


2548 


3 


376 


EI^GSDVA\gAEAIiVDRRQERKGEIDAHE 
DSFKSADESGQALLAAGQYASDEVREKIi 
AVx^EERAAxxLEIiWELRRQQYEQCMDLQ 
LFFROTEQVDNWMSKQEAFIiLNEDIjGDA 
VDS/ IKEILKKHE 


2532 


16433 


A 


2549 


360 


1 


RGEMLOTV\NNRFLKNFVPGKIEPFKSH 
SLYPPCYVHDVS FWI DQKKGFDEIiEFHT 
VGRAVSQDTI IS IQFLSRFQHPKTQQVS 
LCYRIiTCQTCDKALTQQQVASMRSQIRK 
EIQQHQY 


2533 


16434 


A 


2550 


2 


403 


WAEBDTELRDLLVQTLENSGVXNRI KA 
ELRAGVFLALEEQEKVENKTPLVNESLK 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
uing id 
last amino 
acid 

rtaiuuc oi 

peptide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, H-Htstidine, I==IsoIeucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
vjF^vjiutainine, tv— /vrginine, o^oenne, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
a. — uhkiiuwh, — oiup tuuon,/ — possiuie 
nucleotide deletion, \=possible 
nucleotide insertion 














KPLNTKD\IjQGLEGRENLARDLGIIKAE 
nrwiRPTtTiT.RVTi? p f*cynfCRV RPTTY^Rn & 

VJX VVjv?irLil JliEt V AIwy^yRDftWr X luliu/i 

LDLSDVHSPPKSPEGKTSAQTT 


2534 


16435 


A 


2551 


1 


409 


VPRNPTPLGGPGGP I LRSREGGPPGLPR 
EPPSVLKKREPPTRGGGRPPGFPLPRRV 

QTKNPFPKKK 


2535 


16436 


A 


2552 


256 


1 


HEMLXJTI^IiGPFPPPGSSKGPSLTGPPIL 
FQSPGP/APHTPSSSPAmKT/CTPVCP 
SHLPW/CCPLCLPMRLPWSPVPPVSKSS 

DDBT.V 


2536 


16437 


A 


2553 


398 


3 


ARQQQQLLQQQHKI /NWLQQQIQVQGQL 
PPLMIPVFPPDQRPIiAAAAQQGFLLPPG 
FSYKAGCSDPYPVQMIPTTMAGAAAATP 
GLGPLQLQQLYAAQIiAAMQVSPGGKLPG 
I PQGNLGAAVS PTS IHTDKRCI 


2537 


16438 


A 


2554 


20 


517 


DRPPS TKMKDTPQhti&APDuS PRGPAP V 
PECPEH/SPRKKTSACRPPLPLRPSHSS 
PLP/SPQPSHSTPQASCPLPEAIiSPPAP 
FRSPQSYLRAPSWPWPPEEHSSFAPDS 
SAFPPATPRTEFPEAWGTDTPPATHRSS 

n±rL*12rxCtr O cU 


2538 


16439 


A 


2555 


361 


3 


KSSQEALEEALRQRIiEELKKLCIiREGEL 
PGKLPVEYRLDPGEDPPIVRRRIGP\AF 
KLDEQKILPKGEEAEIiERLERBFAIQSQ 
ITEADRRIASDPNVSKKLKKQRKTKYIN 
AVKKLQVY 


2539 


1 £ A Aft 

16440 


A 


2556 


470 


2 


LiKAAVT^ujIjIS VPoD VoJUKAr r. \WLiSAF P 

l\ds PYS ihhprri QVSSEKEAAPDAGA 
ERITADSDLAYSSKVLLLSS PGLEELYR 
COILFVDDTAEPRETPEHPVKQ I KFLLG 

KxnJSB a r* V Jj Vv3V7C W b c oJjJAjlAjt' yALUry V 

KVSNAIRCAQAQTGTV 


2540 


16441 


A 


2557 


2 


315 


WAEEDTELRDLLVQTIJSNSGVIiNRIKA 
ELRAAVFLALBEQEKVENKTPLVNESLK 
KFLNTKTCKMFSLFLFIYL\EFLNLDTI 


2541 


16442 


A 


2558 


372 


3 


CNIS SHFANKKCQDVTVAARNVMTSQIH 
HAV\KIIPGFNINVPGLPPPDEDTELEV 

SJav. voiir rxci v na iidliuh -l jxl/^ixo lour 

TCRISEYVKKLMELAYHSLIiEAASSSDQ 
CADQLFYSVRCI 


2542 


16443 


A 


2559 


64 


435 


WGDAS CTGRAQLGIAHKSVL PTLTDKFN 
PRTVK 


2543 


16444 


A 


2560 


113 


375 


VPGPARDSTQRRAVKNDKKELMS IPGIP 
GPVQVPGGGLST\GMRGFPGSSLGFLTI 
PGGALI PFSPAFFSRVGGDLS PRNTGPG 
QKPG 


2544 


16445 


A 


2561 


431 


51 


RKIYRVYERENFRVEIMFRFSHTSKKVC 
KGNV/DVAKFIKLHRIX5WHVLNVQCACH 
QKGGIYWFRSIHVELIGYPPPRSSSHIK 
IGDKTOVKASVTTPKYKWGSVTHQSVGV 
VKGNI IWVTKFLMLTFH 


2545 


16446 


A 


2562 


1 


455 


PEGIALEWTESTGKEPJEHTFQPGDNVE 
VCEGELINLQGKILSVIOJKITIMHKHB 
DlxKDMLEFPAQElxRKYFK^DHVKVlAG 
RFEGDTGLIVRVEENFVILFSGLAMHGL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etfa 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

Amino ncirl 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Htstidine, I=Isoteucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

QeaGllltainillft. R^Ar^inini*- S&&Arin^ 

T^Threonine. V=Valine 
W=Tryptophan, Y^yrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














KVLPPPIiOLCSETAS /S IN / VGGOHEW 


2546 


16447 


A 


2563 


407 


1 


KWDHPLSKCEVPCGGSIISSNGPVSYPG 
FPSPY\TSSQDCVWMITVPSGHGVRLNL 
SLLQTEPSGDFIAIWDGPQQTAPRLGVF 
LRSMVKKS VQSSS I QVLLKPHRHAATGG 
I FAIALSAYPNTKCPPPTIHPCI 


2547 


16448 


A 


2564 


382 


1 


TQEVEVAVSLDRAPAFQSGCQSETLSQN 
SNSSNKTWILDHFGLSDFLDQRFSACTV 
FTPREQVSSHTRMF IAALFTIAKTWSQP 
/ KYPPVIGWIKK\MWHI YTMEYYAAIKK 
DEFMS FAGTWMKLEAMY 


2548 


1M4Q 








u 


TPEQVGGS PAQVP I PYLDDDIPLLEVBQ 
EPVSLELGDVSLVSVSREGLQPAS ITGS 
RGHL I VQLQELLHHWVLWSAVKSRWVI V 
GLFVSrLILSLVFAIRLR\SASRAPVIiL 
RP 








ZJOO 


AA.A 


1 
1 


I YES PFVTETGEYYKQEASNLLQESNCS 
QYMEKVLGRLKDEEIRCQKYLHPSSYTK 
■VTRRfrrnp mvahwt .oft .rat? prm t t pop 

KKNDMANMPVLLRAVSKGLLHMIQEHVL 
RAUGRIPTSYVRMQS 


2550 


iu*tJ 1 


A 






i to 


QKKKKKKKKKKNFLPETRERALPGKKKG 
GGG 


2551 


16452 


A 


2568 


215 


411 


IHLIFIHLVFIDYLSVRHSSKILGYISE 
ITRSGVQDQPG 


2552 


16453 


A 


2569 


170 


430 


TSSQIAPQCIiAV/VSGPIAACPELTSAT 
S PWLQVRTNAMAS PLLKFSAEDLLFKAA 
LSQFCVIMLNAKLSVQKYEK\LISAFSD 
SR 


iJJJ 




^ 


Zj /U 




AC/1 


AVPNI PS P IGVNGFTGLPPQANGQPAAB 
AVFANGMHP YPAQS PTVADPLQQAYftGV 
OO YAG P AAYPAAYGO I S OAF PO P PAMI P 
QQQREWP EGCNLS I YHLPQE FGDS \ ELM 
QMVLPFGNVISSKV 


2554 


16455 


A 


2571 


3 


424 


LKTMKGGTGNGLE IMLDIQQDEYLPVWG 
ETGTSPTSGAPLHGSRPQP/PARGFIiGF 
TVRPG 


2555 


16456 


A 


2572 


1667 


2046 


YIFFFTAFLWAALTFQVTTTLAPIALLV 
RSAKMMRASHDKPTANITLN\GKTGRAS 
KQRREERNRQEVKLS LFTDDMFL YLES P 
IVLAJPKLLIjLINFSKVIAYKINVQKIiLA 
vlytkssresneehn 


2556 


16457 


A 


2573 


1 


399 


FAISQDHPALPSRP PS IiHHPKPGTLTFH 
PDLPHQATCSRPIRHQRTWPEDAPLAKA 
DTVSPAEHPPAAATKAP / TR/PAPDKPG 
GTSDPQTGPAP\PASPPCSGP\APQPVP 
RKPSRAAPSKVSVTVPRRVPRTFPP 


2557 


16458 


A 


2574 


2 


452 


AKVNEMKSPMRKGHTIjIjKNKEEKIiNQLE 
S S LWEE AS DEGSLGGS PTKKAVTFDLSD 
MDSLSSESSESFSPPHLDSTPSLTSRKI 
HGLSHSLRQISSQMSSVLSILDSLNPQS 
PTPLLASMPAQLPP\RDPKSTPTP/SYC 
GRCRGFSLTS 



702 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alamne 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Htstidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^Giu taurine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=0Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


2558 


16459 


A 


2575 


2 


369 


TLVYPAI T FILLS I CICYW IVTAVFLTT 
SGVPVYKVIAPGGHCIHENQTCDPEIFN 
TPEIAKACPGALCNFAFYGEKSLYHQYI 
PTFHVYmiFVFLWLINFVTALGQCALA\ 
GAFATYYWA 


2559 


16460 


A 


2576 


406 


I 


RQEGTGPSYLLLVGITPRSFWGFSPPLG 
PSGKNP I KTPGGPLQGFFFRVQMGVIjLP 
LPGPSGN/ CS I KVSALNSS FS P SGVNPG 
EASLPWF / CFFBMESRSVAQAGVQWQPP 
PPGFKRFSCLSLLGSWGYRRPPLV 


2560 


16461 


A 


2577 


3 


410 


YISPFYITHMRAHTNLPGPF/KLNQRAD 
ALVSAAFADAQTFHS LTHLNAAGLRKRY 
GLSWKQVKEIVKHFSAGEVLHLPHQGAG 
VNPRGLSPNSIWQMDITHIPTFGKLSFV 
HVSVDIYSHFIWATYQTGEATAHVK 


2561 


16462 


A 


2578 


386 


1 


TERIRQRQYQRETEKDKRYTERQRKTER 
IRESDRDRERQNQIERDRRAT\RERDRE 
KQRESDRETYRERENQIETKRDRKRQSD 
RDRETQRETGR I RYRERERQHQTESDRD 
RERESDRETQRHRECI 






7^ 


9^70 


it 




LIiYFAKRYGAAFGE CCOAADKAACTJiP K 
LDELRDEGKAS S AKQRLKCASLQKFGER 
AFKAWAAARXjSQRFPKA* FAEVSKLVTD 
LT*VHTECCHGDLLECADDRSDLAMYIC 
ENQDSNFSKLNECCEKPLIiBKFHCIAEV 
END 


X.JKJJ 










496 


NLLNDALAI RE KTLGKDHPAGAATLNNL 
AVLYGKRGKYKEAKPMSKRALEIRKKVL 
GKDHPDVDKQLNNIiALLCQNQGKTEQGE 
YYYPRALHIYHTKLGPDDPNAANTKNDL 
VACYIiKHGTVKQA* SLNKENLTSAHEWD 
V 


2564 


16465 


A 


2581 


10 


389 


KLAG * GL WGHTL ITNPLTE PLTYPFLGL 
YL*SIIITSSICIiLQTDL*ALIAYSSII 
HIALVITAILIQTP+SFTGAVILIIAHG 
LTSSLLFCLTNSNYERTHSRI I ILFQGL 
QTLRPLIAL* *LIiA 


2565 


16466 


A 


2582 


1 


192 


LIPTLAI ITR*GGQPERLNAGTYFLFYT 
LVGSLPLLIALIYTHNTLGSLNILLLTL 
TAQELSNS*ANNLI*LAYTIAFIVKIPL 
YGLHL * L P KAHVEAP IAGS I VLAAVLLK 
LGGYGI IRLTLILNPLTKHIAYPFLGLS 
L*GGQPERLNAGTYFLFYTLVGSLPLLI 
ALIYTKNTLGSLNILLLTLTAQELSNS 


2566 


16467 


A 


2583 


2 


438 


QAHGPLAGCRLRPRS PSPVLWRRRRRWW 
RQRRKWKTKTATAAAGMYAS WTKACRGL 
EELINLTRLNVSYNHIDDLSGLIPLHGI 
KHKLRYIDLHSNRIDSIHHLL+CHVGLJf 
FLTNLILEKDGDDNPVCRLPGYRASIFQ 
TFAQL 


2567 


16468 


A 


2584 


3 


415 


GRAILLLEEAIQYLSGIEIDLIDTDRGE 
WDSLTPQARR* KEAGLQMFVQLARFHNI 
VCKEAFGTLAFLTSE I KSLFGHPFLAER 
I ISMLNYFLQHLIGPKMGALEVKDFSEF 
DF*PQQHVSDSGAIYLDLGDEENFC 


2568 


16469 


A 


2585 


2 


453 


DAKMYLS YK YATV I RNLREGTC PEAPSD 
ECKPEKRCALSHHQRLKCDEWSDISVGK 
IKCVSAETTEYCIA* IMNG* ADAMS LDG 
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peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
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Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
ami hip <>uu 
residue of 
peptide 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ujng iv 
last amino 
add 

peptide 
sequence 


Amino acid sequence (A*=Atanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
1^=01 uia mine, rv /\rgi rune, o — ocnne, 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 

Y — ¥ T r» L" n rt XAm *— Qf An <*rkH a n / — or»ccih»lo 

A-uuiuiutvii) -~oivp cvuvn, /— *possiDie 
nucleotide deletion, \=possib!e 
nucleotide insertion 














GFVYISGKCGLMPVLAENYNKSDNCEDT 
PEAG YFAVAVVKKS AS DLTWDNLKGKKS 


2569 


16470 


A 


2586 


3 


413 


MAVESRVTQEE IGKEP * KP I DR* KTCPL 
MLRVFTTNNGRHHRMDBYSR*NVPSSEL 
Q I YTWMDATLKELTSLVREVYP * ARKKG 
THFNFAIDFTDVKRPGYRVKEIGSTMSG 
RKGTDDSMTLQSQKFQIGDYLDIAI 


2570 


16471 


A 


2587 


1 


798 


LEVMimVKAGADQRAKNQDGMNALHFA 
TQSNHVRIVEYLIQDLHLKDLNQPDEKG 
RKPFLLAAERGHVEMIEKLTFXiNLHTSE 
KDKGGNTALHIiAAKHGHS PAVQVLLAQW 
UUXNEMNELiNI SSIjQIATRNGHASIjVNF 
LLSENVDIiHQKAEPKESPLHLVDINNHI 
TWNSIiLSAQHDIDILNQKQQTPLHVAA 
DRGNVELVETLLKAGCDLKAVDKQG* TA 

r AT7TV COCMUOT Xn/TMUIT TTTnPDWHUD17T7 

HHBS IRDPSTGFTL 


2571 


16472 


A 


2588 


2 


285 


AWS IAPSHPHS KVPPGPRRGKAEGRPGA 
AAQAAEQAEVHPPSSGPSPLPARQPPVW 
QIPPTPSLKTTTRRGAQPQHSRKRIiA*S 
RSVSVLFRKM 


Zj iJL 


1 £LAT1 
I04/J 


TV 

A 


ar on 

2589 


11 


438 


TVW/~>T %TT»TCTT H TIT P If ytTirtTV 1/ 1/ 1/ kf p -r\ 

AiYGIiNWHJA^TIjSQKKKK^ 
RGGGPPRNPHFWG IGGRPGTGPPRGEKN 
GLKNQKKEKFLAPR*KKSTGKPLKPPQG 
iv^WCK5v»fc p Kr VlUsj^PJ?KAGGGbljKPKGK 
NREPFIiALKSLPGVFTiGGFGG?NPGMGLK 
NQ 


2573 


16474 


A 


2590 


313 


391 


VHLVRVKLGL * SRI YKELLQLNNTT* DN 
PI * KWATDLDRHFS TKYTQMVDRLMEKC 
STS 


2574 


16475 


A 


2591 


62 


616 


EVHQGTEVRDSEVRRRPQARGPLMPAER 
ACjRQRWIjVPAIjQPRWGGIjRR* RGAVRQH 
GAHPHGLIiLQDQKI PALPGRKQAGSLHA 

TADHLNPGEHRRGEAHVRAAV* PAAGAE 
GAAKERRAHOANTAIjnVHPR * T/5<?FAT?T. 

VJflfAIVlJ IvI\JlClYffll>t X r\±J^l V XLXwXv f t i_7 , J n * 1 f 

RLLRKPGRTS VWPS PM 


2575 


16476 


A 


2592 


345 


438 


HKRWLPVPILYQHLF*VFGHPEVYILIL 
PGF 


2576 


16477 


A 


2593 


479 


313 


QDGLDLI,TS*STRFSIiPKCWDYRREPPL 


2577 


16478 


A 


2594 


3 


419 


LTLHSNTIiPPPEMSGLT* PPATNAYWTF 
LPSAIRLFPEI FFIAVLLSVSLFDETET 
LSDAHSWRLTFKYERDANYHLLMSAQER 
IiERLFGIiHGGTI P I VPTADFHDS ISGAS 

DTD T AH55 GT , AYTMFP °. AP OT MP TAMKY 


2578 


16479 


A 


2595 


1 


419 


HKCEGRYRGKT YGG YWS LCATVNKALDE 
RIPITSASYYATVTLDHVRNILGSDTDV 

NCVR*RDNXAKKLMHLVVESFPSYRDVT 
IjFEGN* VAFYTRAO I IiVTDTGTVIiEGK 


2579 


16480 


A 


2596 


199 


397 


SPTALNTDDVNPFMGSSSRGHHRQFQTG 
I VSDHPAEAGP I S *DEQELP YAVIiHFHK 
VQPLEPKVTD 


2580 


16481 


A 


2597 


154 


3 


MLSLRGFFCLFVCFETGS * YAAQAGLKL 
LGSGSPPISSFQSARITGVSHL 


2581 


16482 


A 


2598 


425 


223 


Q ITGHGGKCL * SQLLGKFGQENFLNLGS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

J- „ 4_ 

ding to 
last amino 
acia 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G)utamic Acid, ^Phenylalanine, 
G=Glycine, HNHistidine, I=Isoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argimne, S=Senne, 
T=Threonine, V=Valine, 
w— tryptophan, v— tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RSFN* PKIiLPGTSGWVTMLNSIiGKKKKV 
TAYLNKWMPTP 


2582 


16483 


A 


2599 


3 


441 


QGPS RDIjVSEIjYQ inafdtpe s ulm igk 
DHSDPIHHTFDHMWRTKBHNEAGWLLLR 
SVDKVMKENDELGDSISQLQKQILSLKY 
AKIALTERLISCPV*TEIVLNYT*SLIM 
LLTDLL* KVHAQSHHAHSGQASNCNVMT 
LIGLQH 


2583 


16484 


A 


2600 


131 


423 


GPAPMVILFHLTGFtiLAFLPLSHLLTSY 
LVPWII^GTDGHTFRSACLPRWIjEAEW I 
FGGVKYQYGGNQEGK* CFFTG* SYVYNG 
SSGKVPWETFSRT 


2584 


16485 


A 


2601 


222 


447 


SGI PS FGL* WEQNRPGRLNAQ I LKDLG 
VSPGS DYGKLKNG I S SGLDNGVTTSDHD 
VLKMAIVGRKIWILGDWSG 


2585 


16486 


A 


2602 


2 


457 


FICDPLIKAIGTEGDTDVLSE IMNSFAK 
SIEVMGDGCLNDEHLEELGGILKAKLEG 
HFKNQELRQVKRQEDNYDQQVEMSLQDE 
DECDVYILTKVSDILHSLFSTYKEQILP 
WFEQLLPLXVNIiICSSRPWPY*QWGLCI 
FDDIVKHCSPSL 


2586 


16487 


A 


2603 


1 


297 


DHRQKLYAHTECGKALiLWKS VI CVHQKI 
LEEE KPCEGTKYDNI FSNRGCPPVPRMV 
HAVEIPCK*TECBKATGVHGPRGASEFL 
PERPTGMNMAKCESR 


2587 


16488 


A 


2604 


1 


405 


RFDVSDGLELRP KYNG IAHRLTTI WYIiD 
GLRGLYKGVTPI IWSAGLS WGLYFVFYN 
AIKSYITEGSSERLKASBYLASTAEPGP 
MTLCITHPLWVTKTGLMLQYDAVVNSPH 
*QYKGMSDTLVKI YKHGGVQGLY 


2588 


16489 


A 


2605 


165 


3 


PEELLLVTFPVLKNTGIRPGAVAHTCNP 
STLGGRITRTGD*DHPGQHGETIiSL 


2589 


16490 


A 


2606 


3 


455 


KRYGCFSKRMNKRSATNVFFCARKGEVL 
GLLGHNGAGKSSS IKVTTGDT* PTAGQV 
LLKGSGGWDALEFLGYCPHENALWPNLT 
VRQHL* VYAAVKGLRKGDAEVAI TRLVD 
ALKLHDQLNS P VKTLSQGLKI KLGFCLS 
ILGKPSCGILA 


2590 


16491 


A 


2607 


1 


429 


VDYTVRJCFCIQQEGDMTNRKPQRLjITQF 
HFTSWPDLGVPITPIGMLKILQKEKACN 
PQYAGAIVDHCKA*VGRTGTFGDIEAML» 
DMMHT*RKVDVNGFESRNRAQSCQKVQT 

TPW 


2591 


16492 


A 


2608 


166 


435 


KFLSNNYVHFKQNFKKVLKFIKHLVLNY 

EPRSSRPTWAMWRNS ISTKNAQI KNIVL 
RILDPG 


2592 


16493 


A 


2609 


228 


431 


IFSKICIFNGFNYFPIVGHIGRIYLFIE 
TGoMC V I VQW GO YGS J-iQ P * 1 PGIiK* 
SSHLRLPSSWDH 


2593 


16494 


A 


2610 


439 


175 


RNGGLHLWSQLIiRRLRWDYHFSPGSQGC 

TYTS ILSVLNETVYI IVYYTKKLDTSFK 
RKKL 


2594 


16495 


A 


2611 




439 


LLGSIIiSSMQKPRGLVDQETLRKAR*QA 
ARLNKl^EHEKQQKVEFRIRMEKEVSHV 
V* DSGHI TT* VQS MNS I ERS I LHEWEV 
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SEQD) 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanlne 
OCysteine, D=Aspartic Acid, 
E=Glutamrc Acid, F=Pheny!alanine, 
G=Glycine, H=Htstidine, I=Isoleucine, 
K=Lysine, IHLeucine, M=Methionine, 
N»Asparagine> P^Proline, 
v^=v»iutanune, K— Argimne, ^—serine, 
^Threonine, V=Valine, 
YV=Tryptopbao, Y=Tyrosine, 
X—TJn known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














DGLTS FYFGEDDDCRYO^I FTKDAAPSD 
EELDSYRRAEEWDPHMADDKRKLTELAQ 
RPDDDA 


Mjyj 


lo49o 




2612 


1 


439 


V XRKVTGTEGS SSI VLDYT I PS S TGGMP 
TOKSEDQTDTKRTVIKTMEDYNNDNTAP 
AEDVI IMIQVPQS I WDQDDPESEDQDDK 
STQPISSVGKPATVIKNVRTKPSAXVKY 
PEKESEPFAK1LKFTMDVSHEVI PHEVK 
SS*YSA 




16497 


A 


2613 


2 


453 


EKPEKEECNLWTEMWQENVPGS FGGIRL 
YTjQELMT ITQKALHSQPWKMKAQGAIAM 
ASMALQTOSLVPPYLGMILTALVEGLAG 
RTWALKEELIiKAIACVETARSAEL*KSV 
PNQPSTHEI IQADLKECSKENGPYKI* P 
VICAADILHA 


ZjyJ 


16498 


A 


2614 


2 


441 


LEPALPGRWGGRS AES PPSGS VRKTRQN 
KQKTPGNGDGGSTS * APQPPRKKRARAD 
PTVESEEAFKNIMEVKVKIPEELKPWLD 
EDWDLVTRQKQLFQLPAKKNVDAIL*EY 
ANCKKSHGNVDNTE YAVNEDVAGI KEYF 
NEMLGT 


2598 


16499 


A 


2615 


1 


430 


RGDRDIiHCTESQSEASTEEGHDSLSVGI 
FEEDSQLEFILDPPKSKPPAWLNGIMTC 
* DFELLNPRRWFLLEVKDLALTRRQ I L 
LHKGLSDYEKSTTLQELVLKKSSRSGPP 
LS IEDLGLNFQL* PSSRVYGFTAEELKP 
SGE 


2599 


16500 


A 


2616 


373 


552 


ICKIKYKLFNSALLFFRQGLALIiPRMEC 
SGAIKVHCSUDPLGSSNPL*RKIKESTL 
NLEKSLCTRGIFLCKYBEVPKQFLKICF 
QIFLETGSRSVSQAEVQWHDQNSLQPRT 
PCjJjKbi? SCI^JjPKCvroYRCEPPRRAIjCS 


2600 


16501 


A 


2617 


498 


157 


QIiIGSSTHQAALRSRSLPPPAGPGTFHF 
HYQGKAWWKGGTS PDS LPSLLGRGVS V 
QLHPRGKEQRGASDT* HKCPVKLWTIGG 
KYRVSETS RI FSLPPTTLQRAGIiDSGSG 

CT 


2601 


16502 


A 


2618 


255 


389 


I*SEFYTYEGPSIRPPIGS*GTNLPLPLS 
YIPRSPSAVDDENLLDE 








2619 


3 


207 


AUCCT TTTDBT7VT1TT riT ad KyTTTji -4- t m ■*> 1 77\ 

Urlbo Jb 1 ± 1 KAX AX KXjG I AP FHF * VP * VA 
QGTPLTSGLLLLT*HKLAPISIIYQISP 
SLNVSLLLSGT 


2603 


16504 


A 


2620 


3 


207 


QYSSLIIIMGVGIKLGIAPFHF*VPEVA 
QGTPLTSGLLLYT*QTIiTPISMIYQISA 


2604 


16505 


A 


2621 


45 


447 


WRRIDCRLNLCVTTASLKHFFSIAHKFH~" 

t tnvtr2CHT.ocnticr , i7ar»r DDTBDCTTpfr 
iiniNoonJjyoUrliLiSAUijKKi AKr XXL-LUj 

PYGESGQSWPTLTLERANLNWLTVYFNN 
WKDWEYDSDQYMSNFDRSSEMMNI IVW* 
RSIDNLPHI I CSTLINTKNSFL 


2605 


16506 


A 


2622 


3 


142 


GNQATPKTAPATMSTPTI LVATAVHAYR 
* * VAEKEHPLKFGGRACS 


2606 


16<tfr7 

lOJU / 










riV3Irv3V7UlJU 1 -L LTVCjIv. _L tvU J? j_U>4 \J a v_l i ^ Hi V XT 

DGGLVPKSLYRTA*DLENEDLKLWTDTI 
YQSASVFKGAPHEILIQIVDASTVITWD 
YHVCKGD IVFNT YHS KRS PLP PNKDYLI 
AHSITFPDGNNVHIjNNKVWMLGPDYIMW 
ITTLITNNEN 
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SEQfl) 
NO: of 
nucleotide 
sequence 


S£QU> 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQD) 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

Hin<7 tf\ 
Ulllg W 

last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, R=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 

0=filntaminf* R=A roiiiin^ S^Sprinp 

•^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion. \=nossible 
nucleotide insertion 


2607 


16508 


A 


2624 


1 


315 


EEKQAPGFTTGRGKIiTPLPCAHAVRF 1 1 
S TSL I YKGAYP * ALKGKEKY*LAVFWVY 
NKKGLNlSn^FLI^VHIiCFVT 
GKGLPFS IGLYHLILIiIiFFFF 


2608 


16509 


A 


2625 


232 


478 


LDCGNYFTV*MYIKTSHCIP* IYIILNF 
HLYNFRYNLGDYNGEIVSEVMAQRQPMK 
PTYAIPIIPITNSSQFKHQEAMDVKE 


2609 


16510 


A 


2626 


3 


Ann 


T^TPNVHMG^TfJTjPGDSTTIEDAlOSHS 
ESASPSALSSHPNNLSPTGWSQPKTPVP 
AQRBRAPVSGTPDRDKLRPCGQRD*GYY 
WEIEAS EVMLATR I VSGS FGTVYQGKWH 
GDVALAILQWDSTPEQFQGFRNDVAGL 

PT 


2610 


16511 


A 


2627 


2 


341 


AliQKHEDTDCPCVVVSCPHKCSVQTLLR 
SE*SAHLSERVIAPSTCSFKRYGCVFKG 
TYQQIKAHEASFWQHVNLLKEWSNSLK 
KKGS S FD KECVDKNKS IHCWH 1 0 1 CS S V 

I 


2611 


16512 


A 


2628 


2 


1 *7Q 

1 fy 


T3HTGT.WVTST.PAVFPGOVRRTLF ITGLP 
RDARKETVESHFR*AHWLVGDFSPCCIS 
RTGEADPVHHRTPQRCQEGDCGEPLPVS 
RWVGSFLIHYIQEHCL 


2612 


16513 


A 


2629 


1 


418 


GFSFCHPVPVKWRHRDSPQP*TPGLK*S 
CLGLPKCWDYRHBPLRPANISY 


2613 


16514 


A 


2630 


3 


AA"i 


T?TrGTT T TAT PNGVKEFT* LATLHGSNM 
K*SAAVL*ALGYIFLFTVGGLTGIVLSN 
SSLDIVLHDTYYMVAHFHYALS IRAVFA 
I IEGFIH* FPLFSGYTLHQAYA* IHFTI 
I FIGVNLTLFPQHFVGLFGMPRQ YSDYP 
DAYTTR 


2614 


16515 


A 


2631 


2 


ASA 

4d4 


ARFTGAPIjKVHKIINPW+SPSGTLPALR 
TSHGBVISVPHKI ITHLLKEKYNADYDL 
SARQGADTLAFMSLLEEKLLPGLVHTFW 
TDTKNYTjEVTRKWYAEAMP fs LNFFLPG 
RMQRQYM*RLP 


2615 


16516 


A 


2632 


131 


410 


WMWSSKAPHCFRLPS IGEADTVHQCAMS 
FQ KGHS AIoSGVLHLVFKPDLVYQTLJjQM 
P PRKCCLW PGAVTHACNSMTLGGRS R * F 
TRSGVQDQP 


2616 


16517 


A 


2633 


2 


230 


FFSETS S LLE IQG JJ^RHGGTWRLRQENN 
LNPGGGGC^ELRSCSCAPAWVTVRLiDIiR 
KKRNRNP * K1XKNYLKI FVI 


2617 


16518 


A 


2634 


2 


370 


GTSSSDPAQPGDDKEFIDASRLVYDGIR 
DIRKAVLMIRTPEELDDSDFETEDFDVR 
SRTSVQTEDDQL IAGQSARAIMAQIiPQE 
QKAKIAEQVASFQEE * S KLDAEVS KWDD 
SGNDIIVLAKQ 


2618 


16519 


A 


2635 


2 


376 


MTDTDALY*RELFDPADKDKMDHSRRGI 
ALVFNHERFFWHLSLPERRCTCSDRDNL 
TRRFSDLGFEGKCFNDLKAEELLLKIHE 
VSTVIHSDADCFVCVFLSHGERNHI YAY 
DAKIE IQTLTALF 


2619 


16520 


A 


2636 


1 


461 


DMAFLILTERKILGYGQGRIGPNVACPY 
GLLQPFGDAI*LFTKEPLKPVTSTITLY 
ITA^TLALTIALLL*TPLPIPNPLGNLN 
LGIXF IIATSSLAVYS IL* SR* ASNSNY 
ALIGALRAYAQTIS YEVTLTI ILLSTLL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
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SEQID 
NO: io 
USSN 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

nmtnA ncift 
auiinv nviu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylatamne, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

v< miuuiiuiuc| jv t\M gi in uc, o — tjenncj 
T=Threonine V= Valine 

M. M. Ill VvlllHV) W w 49 at It \^ j 

W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 
















2620 


16521 


A 


2637 


2 


384 


ENFKAIiAMIAFGQYRQKRPFEDHVKEiAN 
D*LNFAKTCVADESABNCDKSbHTLFGD 
KLFPIATLRETYLELADCCAKQ*PDIHE 
CFLQL KYDNP YLPRLVKPNVD VMCTAFH 
DTEETFLK* YLYE IA 

xj x x xjax x xj x A-i x m 


2621 


16522 


A 


2638 


1 


373 


TFIYLLFLFSSAYSRGVFGRDAHKSEVA 
HRFKDLGDENFKALVLIAFA0YLC3QCPF 
EDHVKLANEVTEFAKTCVADESPEN*DK 
SLYTLFGDKLCTVATLRETYGEMADCCA 
KQEPDRNECFLQ 


2622 


16523 


A 


2639 


2 


375 


KGPCYRLVSTGTQRRHPGAVYLNKHLC* 
CNVGKA*GPHCEKCTLPCTFNEEPLKAL 
TFFREHGP*VSDPEVATAPTEKEIPSIiD 

VIEKTSPPVPVEI 








ZO*RI 




*kfo 


KYFLASHTSLFI ICYTAHLTCTIAEPKO 
IESHFGKRLDADLVF* KSDDSTWDVI P 
SKP V* TSGLFSGKCL*HIVEG I IRAVDP 
RKLY 


2624 


16525 


A 


2641 


208 


376 


ILRNLIKNHFWPGWAHACDPNTLEAM* 
cm t T»p <J7? n wnwpnp p <? vt .r tht? t. a 

V7\4 X X xvO IvU ri UXxxr X/X\_n.O .C^rvc O V l ifv l nn I i>a 


2625 


16526 


A 


2642 


424 


2 


KGE P L P PLGG WKT * GPRAF PQAQI PHAG 
FKTRGCPFPLPQGRNKAQVPNPFCPRER 
PLPCiKQ^QPVT^Plj.Kl^GI^TjGGPP'FYPnP 
FGGPS PRVPMFGVQTHLGNKPKPPPFP 
KKKKGGTGCSVGNWGPKSSQTLPSPQAS 


2626 


16527 


A 


2643 


190 


3 


ELSTRGFQGPHRRIAGLKQCLVMFPELE 

R QRnT l QPW*WT.HT.Pf5QQn^P3XQIVQ* T BfJ 
ivovj\jij«zjE»rx nuiuj it ujcuoiuuho x>vur 

ITSVCH 


2627 


16528 


A 


2644 


2 


339 


CCEKPLLEKSHCIAEVENDEMPADLPSL 
AADFVESKDVRRNYAEAKDVFLGMFLYE 

AAADPHECYAKEFHEFKPLVEEPQNLIK 


2628 


16529 


A 


2645 


81 


369 


VEVTGQPQNAS FVKRNRWWLLPLIAALA 
SGS FWVFRTPDG * TSWR * IfiNHMTQLI 
NNHLRA* YSVYRDIYFLWIAI * *SS*PA 
SVLLFTADYCPE 


2629 


16530 


A 


2646 


1 


348 


DMDMNPLRPQNYRFG* ELKAD IDYHFKE 
DNDEDDHQLSLRTVSLWAGAKDD * HI VE 
SEAMNYEGCPI KVTLATLKMSVQATVTL 
GGFE ITP PGDLRMKCGSGP VHIMGLHLV 
AGEE 


2630 


16531 


A 


2647 


1 


178 


GYTDTILDVRSQRVIiSLLGLSNSEPNGS 
VETKNL * Q ILNGGESPKQKGQE IKRYDL 
YLT 


2631 


16532 


A 


2648 


181 


244 


TIKRYKNVMIFYYFNFF*R*GLNSLAQA 
AAQWCNHGSLQNQPPGLKGSSCLSLPCS 
*GYTIFYI*SIHQLIDIWXVSTFSKLFLH 
VSAYSSIKMS 


2632 


16533 


A 


2649 


2 


369 


KWITFISIxLFLFRSVYGKFEFLLDAPKN 
DVAHRFIDFGEQHFKALAEIAFDHYFLH 
CPFEDLVTLVNE * TEFA*TCLADESA*N 
CDKSLHTLFGDKLCTVATLR *TYGEMAD 
CCAKLEPERI 


2633 


16534 


A 


2650 


1 


349 


VTFISLLFLFSSGYSRGVFRLDAHKSEV 
AHRFKDLGEENFKALVLIAFAHYLQQCP 
FEDHVKLVNEVTEFA* TCVADE S AENCD 
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peptide 
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Predict- 
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otide 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, I=Iso(eucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
v^=oiuiamine, k— Arginine, 5y=oenne, 
T=Threonioe, V=Vaiine, 
yv— irypiopnan, x — tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














RSLHTLFGDTLCTVATLRETYGETADCC 


2634 


16535 


A 


2651 


21 


403 


CV CCVCVRES RTES LPGDNE D FNVKNAS 
VKDTOCVHFDCDSQNDP PMEATGFTAQV 
TIYSLINtiLHRLKYFE i niDLH*IAIFFS 
KVS I LNHPGQ I IAD YAP ALDCHTCHIAC 
KY Ab JLilUsnXUKRo V K. 


2635 


16536 


A 


2652 


1 


355 


ARMS I TDTYGQHLIAGGLMTQEDVSE I K 
SSYTAKSNDHLNWMAHYRPTALNIjQAHW 
HGIAQAEAQITTWSSGVPFDLLLMVGMM 
TVQVP * ELQMHSHMiKTHDQSRMENMMY 


2636 


16537 


A 


2653 


2 


360 


LFEHLG * YKFQNALIjLR YTMKVPQVLT P 
TLVKVSRNL* KVG I KCCKHPEAKRMPCA 
QDYLSVVLNQLCVLHEKTPVSDRVAKCC 

mnOT 71 T ITWrHTTVl/TJg'PPKrR X? 

liSoijVNKKJrCr isALiiS ViJiSl z VFlUSrJMAJS 

TFTFHAD 


2637 


16538 


A 


2654 


360 


314 


YTCAIVPCKLFWWCC3^DRVSLCGLYWSQ 
TPEIiK*SSCIiSFPKC*NYGHEinVPARI 
I FLNL YNKSG * TATRKCP PTKVTPSSHQ 
Q*HLTi WPYFYFK 


2638 


16539 


A 


2655 


1 


353 


NWTL*TLKKIFSLKDAIKRIKIQSVRWK 
KI FAKHTS DKGVES KYI * RTL INQ * ENN 
SIKNWSKNLNRRFTBCDIQKANKHMLSAT 

HT TTTJ+TG VT v* r\* vnv^PCTP^rRPTuivypn 
o1jVH t J£oJ\JjJ\ v s2 Xrlx lo±KVAl\MJ\ixi.U 

YVTSW 


2639 


16540 


A 


2656 


340 


10 


GREDKSEKCYLKPGRSQPDRGAPKSSPG 
VP* PPAFPAJUj?tjFv*KooPr PUvoIj2uh\D 
LRPRQPSQPGGPGELIFPVKTKKKKIKE 
RKTLLAWTRSNERP E IAQREGGCLRQ 


2640 


16541 


A 


2657 


509 


261 


IPYFKIHCLHSALGVAETEKETAEHLDL 
AGASSRPKDSQRNSPFQIPPPSPDSKKK 

0"tJf"* TXffXTT CV"*xr*T7W"\*TCnM/TI3T7T UT771 

oKuiWALirTjlv* VK.y w ZovjfKiJr' VijrlvA 


2641 


16542 


A 


2658 


345 


3 


SAHLSHPKCWDYRREPLHLARISFLFKA 
SSPLYGRATFYLSTPPLMDGRWVAS I FW 

T T •* TT7T 13 * 7\ TATCrVWT OT7TOT CT1?*VWC 

LSVAQAAVQWHNHGSLQPQTPGLKRSSH 
L 


2642 


16543 


A 


2659 


271 


56 


VIXYIVKPIPRPCPHPHPYIHTHTOTHT 

IT XXi X fl. 1 lUjUJUvDr laloVY JjJtroIo XIsvjiM 

fGCFLEIYLIiPIIiLKM 




16544 




2660 


lac 
ioo 


O© 


EREGRKEGRKNEGTETVIEGGSSKTQEQ 
TR I KAS * L.NS L YTTVKPORRIiKGGinCRP 
KDSNFKD*RNISPHR 


2644 


16545 


A 


2661 


351 


199 


LL1UILRQENLLNPEGRGFSEPRSHYYSP 


2645 


16546 


A 


2662 


1 


160 


ERAWLHLS PGDGVCS EPRSHHCTPTWVT 
E * DS VANQPGTORTOLCLKPTWVTE 


2646 


J 6547 


A 


2663 


233 


3 


NPFPVYILFLPQAPKGKWIiSPFPPFFFF 
FFFLRGSLAM* PGCS AVAQS 


Z04/ 




j± 


zow 






FVTQAEVQGRS IS * PGPLVPGFKRFLGL 
TLl^TWNCRRAPPGLVraCGFKMSLLTL 


2648 


16549 


A 


2665 


322 


74 


DTYTLPRLNQVBSLNRPITGSE IEAI IN 
RPPTKKSPGPDRFKAKFYQRYKEELVSF 
LLKLFQ* IEKGGIGWARWLMPVIPVL 
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peptide 
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Predict- 
ed end 
nucle- 
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location 
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residue of 

peptide 

sequence 


Amino acid sequence (A==Alanine 
OCystelne, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=G!ycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine> 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Th reon in e, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=otop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 




JOjjU 


A 


2666 


2 


228 


NLALS PRLECKSQGFTMLIRLVSVS * PC 
DLPASASQSAGIMGVSHHAQPFLHSYLR 
PLILIKGDQFKYAFNLFSC 


2650 


J 6551 


A 


2667 


129 


301 


VRPLAGLGSPLIFFFFFEKSLAFVAQAG 
GQRHNLG * LLPLP PGLKGFSCLNLPSTW 
D 


2651 


16552 


A 


2668 


318 


21 


FSIjLVKDFQNDIiAVNSSSIFPSTFLFLV 
RS*LDFVNLNFFNFYLFRDBFSLCCPGW 
TRIPRLKQSSRLRLLSSWGYKCVPLHPA 
INRFIj*CDFSISGY 


2652 


16553 


A 


2669 


2 


322 


QDGLDhLTS *STRLGLPKCWDYRREPPR 
PAPINKCLLS IYHGPGPSLGTEGEKGQD 
YIPAI EQLTDQWRRLTWTGCYCAAVAED 
SMGWWPLLPGYEAEKEAGRFH 




16554 


A 


2670 


29 


261 


E FNTLS KCVWI Y* LFSLSLSLSLCVCVC 
VCVCVCVCLCTSVCIiFYGSPICLHFWHP 
DTLSBES FHFLQI ISKELLKC 


Zo54 


i / ccc 

16555 


A 


2671 


5 


224 


GSAWWLTPVTPALWECKAGRSPEVGSLR 
PA* PRDP PAIiASESAGITGVKKNIilAEIi 
WEVKSCILLIFISLDKV 


2655 


16556 


A 


2672 


126 


321 


WGRRGARARRDSHRALRVSPELALLELR 
PS TMAH I CNPSALGGRGGW I T * DQEFET 
NLANLEKPC 


2656 


16557 


A 


2673 


360 


74 


SRLLRKLRQEVGLNLGGRGCSGPRIjHHC 
TLAWATERDPDS KNKTNQKNKQRR * *TL 
PETNSLLAWEPDCV CKTNKLATRLEIMA 
QKSCSQRPQYH 


2657 


16558 


A 


2674 


192 


357 


RIRK^LYYYYYYLLRWSLALSPRLECRG 
TILAQCNFHL*SSSRSPASAS PVAGIT 


2658 I 


16559 


A 


2675 


236 


375 


EEVLYSRKDRHVASYSRK*W*RSGAEAH 
ACNPSTLGGRGGWITRSG 




16560 


A 


2676 


236 


2 


ATKVS INS IGTLGVKVGGLLEGRRIiRPG 
PKQSSCLGLQSSWDHR*VQPHPGNNFTL 
FVETRSMLARLVSNSWPQAI FQ 


Zoou 


J 6561 


A 


2677 


164 


389 


LTVQHGLRGLRKLTVMVESEGQASHLLH 
KAAGRRSECQQGNCQMLIKSSYQPCAEA 
HACNPSTVGG*GBRITRSG 


2661 


16562 


A 


2678 


208 


1 


CSVYEGSCFCILRVRRVCVCVCVCVCVC 
VCARSCIYVRSENH*KQLPSSSILT*RF 
* IQSTFRSNKRLR 


2662 


16563 


A 


2679 


159 


382 


RWLIKNHPTQARI QVRGLLDRDCQTQTW 
LWESGVQPIiATT PRRHS EMGGWPGAVAH 
ACNPSTLGGRGG* ITPPRPESKLEDCLT 
GTARHRLG CGSQGCS LWLPP PGG IQKWG 
VGRAQWLTPVIPALWVAEAGRSLEVR 


2663 


16564 


A 


2680 


29 


367 


DCQSEQLRKLRWEHHWSQGV *GCSQP*S 
HHWTPAWVTEQDPVQNKQTKNYNTHQRA 

AlWTTiiriTj TITBIT AmnAT him t tt/ifivMAv* -» « 

GEVWRE INCLSTEQLTEIiYSEITQItiAL 
SVRN*KLL*YVKSSVTSAETQISGGKIF 


2664 


16565 


A 


2681 


256 


1 


ENIVPVRFLCIHRLFAILIMVQ*LERKR 
DHIVIHLNLTLETVYLKKWQTRPNAVAH 
ACNPSTLGGQGRRITRSRARDYPGQHGE 
T 


2665 


16566 


A 


2682 


1 


395 


LLIEHILIAMALLILTERKILGCIQLRK 
GPNWGPYGLLRPFTDAI KLFARE PLQP 
VTSTITLYMTAPTLAVTMAVLL*TPLRI 
PNPLVNHDLGLLCILATSSLADY* IL*S 
R *ASHSNYALIGALRACPH 
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SEQIO 
NO: of 
nucleotide 
sequence 


SEQW 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQTD 
NO: in 
VSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K-Lysine, LHLeucine, M= Methionine, 
N=Asparagine, PHProline, 
t^^oiutanune, K—Arginme, a— oenne, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=XJn known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


2666 


16567 


A 


2683 


3 


349 


VRIDKAAVCLLY* AWAEFI WALANLI IR 

PT1T ,T /INT)!! T YNVTVT AMTVPUT T 
rr UAjUXlir VTIN LlLAjli Lfil. i. UN V X V Xx^flrxT VII 

FFIGI PI I YGGFGN* LVPLI IGAPDMAF 
ARITNIRF*LLPPSLLLLLASAIVEAGA 
ACT 


2667 


16568 


A 


2684 


99 


2 


lil vAriAUWlro X LKjVjyZao^ JL lKoVjVKJSyir 

GQHG 


2668 


16569 


A 


2685 


325 


4 


SSFPMSFIINSFIIIFLYTTIFLIFYDV 

V XolrJ&f Ivir VL-E V_Jj 1 tS*w.lfl|JJ X K.iUir\ < Jv_ojr 

LPCKRV*VLAPFIKNFFFFFFFLRWNFI 
FVAQAGVQWRHLGSLQP P PPNFK 


2669 


16570 


A 


2686 


382 


269 


MGFHHVAQADLEIiLGTNDPPAIiASQSAG 
ITGVSHHAQPKLTF * TFS Y*ACVGGSS P 


2670 


16571 


A 


2687 


3 


381 


GHLYALTFFFNSQ I CLFPRI YKTCKKSF 
WLPFLNLVSLCAI EGYKT KKVPSNV* KG 

ISRIEDKNHKTMFIDAEKWFHNI*HPFI 
IENIRPGWAHICN 


2671 


16572 


A 


2688 


289 


91 


JLivxlU!iJ^Vi\^oivlojxl>iVo X xv~xtlnnniUl 
RCHHYHHHS * RLAALCEESGWRKALGST 
SMGRVAFHRH 


2672 


16573 


A 


2689 


3 


211 


TEPPFPAPNEFLIiLHSIiLSKHYAIiGRPH 


2673 


16574 


A 


2690 


2 


197 


DLLTS*SAHFGLPECWDYRREPPRPASY 
AFI KYFCPALSCFWCIETGP YLIFAYLR 
EESFVRSRT 


2674 


16575 


A 


2691 


253 


356 


AVPVKMAI VKKTRNNRCW* GCGE IGTFL 
HCWWEC 


2675 


16576 


A 


2692 


211 


339 


PGHPLS I ILQWIWGEDTRRGPMHEALH* 
PCCSELRSCHCTPAWPTE*NSIF*KKKK 

LQWIWGEDTRRGPMHEALHGKSGDGVYA 
VRLLTAY 


2676 


16577 


A 


2693 


1 


175 


KrtbVjJUM Jj(j<jKo^\jllir*t>oV-il^-l rAHV 1JS W 

DSVSKKKRKGKKKSALILIFSNGVLFCH 
uu 


2677 


16578 


A 


2694 


234 


13 


KTSREPWSEFLSIKGEQQCSFGIiLYVFC 
FQMESHYAAQAGV*WCNLG*LQPPPTGF 
KDFSCLSPLSPSS PASAS 


2678 


16579 


A 


2695 


149 


2 


SQGEDFTKTRALQWRSRRSWAHACNPS 
TLGGRGWWIT*GQEFKTSLAN 


2679 


16580 


A 


2696 


2 




SETRSHHCTPAVT\n^E*DSISKKKRKRNR 
WGGTRFSN 


2680 


16581 


A 


2697 


338 


357 


T LGGRGRRTTRS AVRNQPGQHGDP KF FL 
VSS 


2681 


16582 


A 


2698 


53 


293 


TISIKSKKWKNKQGEimW<34ENWIiAIC 

DPT VT.r , PVUT.\7HQr i TliTPrWVT.PTy!l>JT.*P 
Kxd-iivLiV^xr X v LiVilo^ — L>\JL iVLN X 1 iMtoHL rv. 

KEV*LAHSSTCCTGNMAEKPHGRR ! 


2682 








i. 




LYAAKVPLQKACI RTFS YISFFI FFFFF 
FFGKQSS FGPPGLKARGGTPLIGTPPLG 
VKRNFLPQFSGKPGITGCPPLPQKFWFF 
KKKRGLIF*PGGVPNSDPKGIGPPNPP 


2683 


16584 


A 


2700 


166 


3 


DYRRVPPRLVKR *GFTW*AGV* WRNHG 
SLQTQTIGLKQSSHLSLATCWDYRYE 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

auiijiu aUU 

residue of 
n en tide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

si in a tf\ 
VI 111 g IU 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
OCysteine, D=Aspartic Acid, 
E=Giutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=lso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
V^vriuianiine, iv-A.rginine, s s3 oenne, 
T=Threonine, V=Valine, 

W=:Trvntf»nhan Y=Tvrnc!nA 

X=Un known, *=Stop codon, /=possibIe 
nucleotide deletion, V=possible 
nucleotide insertion 


2684 


16585 


A 








T* CHE IETTLGNMVKP 


2685 


16586 


A 


2702 


78 


341 


E VACNCLLPAIGCFASVTYV* SSVLL * F 
EL* VLNEN<5S P*NPTH* VV+VTR\TT .1 CCO 
DWSVMA*S* FTEALTSQTIRSSHVSLPS 
SWDY 


2686 


16587 


A 


2703 


258 


185 


TM I AP VHSSLGDRARPYLFKQKR*GLAL 

^PPT.FWTYJAT TWRNT.RUT.fQ QMPPTCIl 
orivxLCinlunl X v nivtN 1 J raj Uv*oolMJrzr X OA 

SGVTG I TEMBVERHGW I QETFWK 


2687 


16588 


A 


2704 


361 


41 


NFGPVYKTNPGPSRPGWGPLGKPPGPQK 
FFPFGVPPSPPFPKKGSVFSPPGGQKGP 

X" xwj v r nor tr x'xtoJVixi if 1 vw "or or A v r 
NWDFPKGGFPKKKKKKRKILRN 


2688 


16589 


A 


2705 


12 


329 


SCTLQLTTPRLK*SSRPSLPNN*DYRHV 

S * YYLVI FFFKME PCS ITQDGVQWCDLS 
SLOPL PPRFICPF <3 CT.SLP <? TP P 


2689 


16590 


A 


2706 


232 


1 


KRKRKS FKTYYKKFFLNSS FKNQCPYAI 
GH T FRS I T F FFFT.POSFGT.VA* AfJVOWf* 
DLGSPQPPPPGFKPFSHLRLR 


2690 


16591 


A 


2707 


135 


1 


CFFLI FFFFSSDTQAGV* WPQPPGIKRS 
SQLSSPRSWDYRRISPR 


2691 






z/Uo 




J}/ 


XjXJXxu I o OiMxCljnrllrlVV^WlJ I xCKJS f JrKIr AU 

YKYFLKEVASLIVKLYLFCKLNFE 


2692 


16593 


A 


2709 


88 


344 


MQEHYYFI KRGNKMI IRKYVQLHANKLD 
SLDKIDKFQVKNHQN* L* KK* KS WPGAV 
AHACNPSTLGGRGGWITKS * DRDHPWLT 

Q 


2693 


16594 


A 


2710 


1 


383 


LHKVTTVLiRFLS KFHILVLYFYYTCVHV 
SVSDMWCWFLY* I FTDMYNAVFQ*LDFF 

YLICLFLSYFLFNIYVMLLFYILCIIYF 
YVIFSSSSIF 


2694 






9*71 1 


iu. I 


i 
i 


xrrr oiioriiAioiiur rxuNC aj* j? * ivf-lvf r 
PFLSFKINQNFFFGFPFNFLKIFFKFFP 
NKFFKF* IYLTFFKKTKPK 


2695 


16596 


A 


2712 


1 


136 


KNTKI SQGWW PTP VIPATQEPEAGGSLE 
RGRQRLQ*RNLGSLQPLPPTFKRSSCLR 

l.iUtfijHi'A InXvRJT Jr JrXXrlLM J7 LIT 


2696 


16597 


A 


2713 


3 


349 


LI PRGQGSTWLP YNP ATS I FGNDLNE I 
KOTGHAKTCIWMFMTSSFI IDRTRKRQ* 
CS SVREWINKQ* CIQTMELVFGRNFFFF 
FFFFFFFFFFFFFFGGPPPPKIQIFKEI 
NPQ 


2697 


16598 


A 


2714 


181 


342 


TSQLLNLPLPI IKA* AIRQEKEIKGIQlT" 
GKEEVKLFLFAHDTIVYP 


2698 


16599 


A 


2715 


3 


205 


FRHVAQAGLELLSASSLPTLASRSAGIT 
GVSHCVQP*TVSSLPJjGFYDTSFKIADI 
OS CFSWVLYYCLOS AEL 


2699 


16600 


A 


2716 


388 


23 


ASNQSKNSFEKGEKNQQSVMVKTSQQAL 
ISTKRAFFDMIRSVYQKTYK*RNA* * ERQS 

GHKEK*KAHRFDRNI IQFANDMIVYVEN 
PKDSTKRLS 


2700 


16601 


A 


2717 


4 


194 


FSCLSLLSSWDYRSMPPHPANFLYF**R 
RGinmARLVLNSWPHSAGITGVSHHAQ 
PFNYFFI 


2701 


16602 


A 


2718 


1 


212 


LCCPGWWETARLKRSSRLSLPKCWDYRH 
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M 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

Amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metnionine, 
tt=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














ESLHLAFGFFLL*YFGPYIMVYVFSFNL 
YDFSLVYSLYGTSI 


z /uz 


1££fY3 
1DOU3 


A 


9719 


53 1 


299 


KAVHVNIFFITKVHLTMETQIVSLSIiFP 
FPSFLHPSLSLSSFL*RDRVLLCYPG*S 
AWQS*FTWPQIPGI 1 R*ASCISLVSS 


2703 


16604 


A 


2720 


139 


320 


QPTMRHSFSF*KQEISIFLKVSSSLHHT 
LRFLFFFFLLDRVSLCHPGWSTVALS* L 
TVAW 


2704 


16605 


A 


2721 


1 


99 


IILAYSS ITHIG* I IAVLPYNPNITILN 
LTIYM 


Z/IO 


10 WO 


j± 


979? 

Z/ XX 


91S 
xi j 


361 


I/TFFFFFFERESNFVTQAEMQGGNLV* L 
KPRPPGLKHFYYLSLQSTGNY 


z/Uo 


1XZ/Y7 


j± 


7771 
Z/Z-J 




94 

X*T 


AIPLRPEF* THPGQNGKTRFPLKPQNLT 
GVGGNARYSQFPKRLRLKIPLNPEVGPS 
INPNFHPGIPFWAQKENFFPKKKKKKEI 
IMLNFSLICYNLFFLYICY 


2707 


16608 


A 


2724 


27 


314 


WKQLKYPPSDEWINKMWCIHAMEYL&IK 
RNKVQLCVT* RWRWIiMDTKKK 


Z/Uo 


lOOw 




9796 

Z/ZO 


1 
1 


112 


GFHHVSQDGLNLLTL* STHLSLPKCWDY 
RHEPLCQAR 


z /uy 


10O1U 




9797 

X/X/ 


971 

X/ 1 


3 


RSPMLKNQAIKSVIMVYIGTIWRSAGPF 
AIFFIAE YTNI III ITLTTTIFLGTTYD 
ALSHBLYTTYFVTKAVLLTSLFL* IRTA 
YPRFR 


z/lU 


lOOl 1 


A 


777 
Z/Zo 


191 
JZ1 


1 


ENKYRPGAVAQAYI PNTIiGG* GGWIT*G 
REFETSLTNVEK 


Z/il 


iooiz 


A 


9799 


9 

X 


290 


NRHFTKENIQMPTRHKKRDPQS SLVTRE 
MQTKTKVR YYFI LTRMAKVKKI DNTTC * 
* G YE KLNS PKLQTTKCP I FEWVNKLWHS 
QIVEYFSGIQHA. 


Z/lZ 


1001 D 




773/1 




OA* 

XU*T 


hVSSE* * * I YQILFFFPQ^IECHSVSQW 
VRGSNLC*LHPLPPDFKRFSCFSYWEG 


7711 


lOOlH- 




9731 
Z / J 1 


7 
z 


286 


PFYKATVIRTVWYW* KN*QIDQWNRKES 
PEIDLHK*SLXiIFAKGAKAIQWRKDSLF 
NKWC*NRWISTCKNESMDGVVAHTCNPS 
TSGGQGGRIA 


2714 


16615 


A 


2732 


323 


181 


RDFVLLVE ITMLARLVSNS*PCDLPASA 
AQSAGITGVSHRARPFLDL 


2715 


16616 


A 


2733 


272 


131 


GRVDRLNPGGGGCS * LGSCQCTPAWVKE 
TPSQKKKNBCI ICFYAFLFI 


2716 


16617 


A 


2734 


227 


87 


AASTLTI*TSLIPP ILTTLVNPNKKNS YP 
HYVKSIVASTFI ISLFPTTIFMCLDQEV 
IISN*H*ATMDFT 


2717 


16618 


A 


2736 


25 


415 


FSKYLLDHQLSAGKLVEQHRGSLCPLSL 
LHSDSSLSFAVSGKVSLAASIRNKLELP 
ET*RALMM*NHGHPLEFG®1* * PWT*EP 
GS *RC * ELGQPGQARRIiACNPSTLGGRG 
GWIMRSGVRDQPGQDVET 


2718 


16619 


A 


2737 


390 


1 


KGFFLKTKRKFFFTNGF FPLJjGGKGV PF 
PGKRLGPPILGFYPPQRVFPFPKPPFFP 
PGPFKRGQKPPGGGKNF*RGPPGPPKKG 
GAPTGKKKKKEEBRKEKEKKTPRNISKP 
LSHPFPNFP FCATVRGFS 


2719 


16620 


A 


2738 


96 


331 


QtTYI'HGLFPFNFNRLCVFFFS * KLVGG 
V*LCCPGWL*TPGLKQSSCT J SLLSTWDY 
KGVPPCIiAEHFFFFEKKSPLSL 


2720 


16621 


A 


2739 


379 


14 


LREVFPFFSPPKIKCFSKNSPQKYFPPP 
WFKTPPTPPFFFPPLFGKNFFFPSLFN 
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G=Glycine, H=Histidine, I=Isoleucine, 
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vjiu wiii nic, rv /\r guiinc, o^oenne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

3£s=f Fit If n A im *aStnn rnHnn /=rtncc t Kla 

nucleotide deletion, V=possibIe 

MUl-ltUlIUC IlldCIilVll 














FCPPRGFF*RPPLFFFFFFFFFFFFFFF 
FFFFFFFKVFYSLIiLLYFLFILK*MLTF 
KTSTSTOCY 


2721 


16622 


A 


2740 


198 


38 


ANPLFFFFFFLRDGVLVYRPGWSAWQS 
* LTADSTP WAQVI LQPQRNP IEHDF 


2722 


16623 


A 


2741 


290 


39 


MS KVGPIiIjGVLGTS frvllkgpeikgdp 
LLANPLFFLFFFLRDGVLVYRPGWSAW 
fiQ*T.TanQTPwanvTT popup Tpum? 


2723 


16624 


A i 


2742 


1 


399 


ilirkhsdigtlyllfda*tgtigtdls 

VLNRTELEPDQATLIjGNDPVHEVIDTAH 

algi I FFI VI P 1 1 iggfgn* lvpli iga 
pdmafarinnisf*llptslllllasai 

*Ha/iVirO J. V7 J. V 1 rOJjft II I V»Jv 


2724 


16625 


A 


2743 


398 


2 


sppplifflwgfsyifpppkkffflinp 

DDV P Q PDD P PP1TP P PPPPPTPPP P* P HYTC 

nfcspppfsppppffllppplffffffs 
kkkkkkkrarsrsrtsptrp 


2725 


16626 


A 


2744 


295 


185 


fffffffffffffffffffffffffflf 

PPT.PTTPPYT? 

r rJurAf cat 


2726 


16627 


A 


2745 


374 


4 


lkkhkstrvp*nvksgk*nfsppfkirp 
waqkrakkkgarekkad*ekgefgklkk 
kksfrpgkkv*g*pkgfqrnfekkpvks 
qgpvnmgvipafekkprippvakipi*g 


2727 


16628 


A 


2746 


2 


382 


QDATAPIIDEIiISFHDHALIIICLICFL 
ILDALFIiTLTAKLTOTNISnAQEIETV* 
TILPAIILDLIALPSLRILYITDEVKDA 
SLTI KS IGHQWY * TYEYTDYGGLVFNS Y 

TT.PPT.PT .PPPnT /2T J . 


2728 


16629 


A 


2747 


2 


374 


DWPPTVKRKTNPRAQSTAABRFILLFTV 
RGLTG IGLANS S LD I VLHDTYYVVAHFH 

WTt^T/^ZVVPaT TnnT7TH*17DT.l?<Jf3VTT.n 

QTYAKIHFTII FIGVHLTFFPQHFLGLS 
GMPRRYSDYPDA 


2729 


16630 


A 


2748 


55 


230 


QIPAK* INKYLQN* KKKKKKKKKKKKKK 
KKKGACLKKPHGGAKEKS WGRQKLFS FK 
GG 


2730 


16631 


A 


2749 


161 


3 


NGNYTLVKKCFNTKEKIKWVKN* IFFFF 


2731 


16632 


A 


2750 


362 


1 


PEKKS FAI KGVKKAGKQCKFCDAPKKGN 
PPKKKNPTGTPKRVLKK*PPREEKKNSP 
QRKKKNTAKNKEKEAPQEEI *KGPPRYI 
FFFIS FFFFFFFYFFFFFFS I VINLFGT 
FIATTLEA 


2732 


16633 


A 


.2751 


311 


110 


ITLPRHGGPHVQSQLFKRLRWKDHMSLG 
DRGCSETjSSCHCTPVWRQSKTFSKIKYG 
RNGTDKAATCNPGRFRSWGRKVA*GQBS 
ETSLSNSRTRL* KNKIWSKRDRQGGYL 


2733 


16634 


A 


2752 


223 


16 


KKTPQKKKlTraGKGPVKKSPPQKFLKST 
PP*IFFYVFCFFFFFLYFIYLLFRIFSF 
FFILFVYYLFI T.P 


2734 


16635 


A 


2753 


367 


10 


PAPRGGVYRGRQASLSCSGLHPLRASWP 
LCSPTQA*AMAGAPPPASLPPCGLISDC 
CASNERGS VGVALS + SGAGDNLLVCRLL 
SGKCRNHPSSASLTliAAVDWSCSYSAIL 
APPLKNS 


2735 


16636 


A 


2754 


14 


341 


PGPNF * FGGQNS PGKNQNLSLPT* KGNG 
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T^Threonine, V=Valine, 
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Y«*nntrnAOfn *~C+/\r» rniinn /— nfmi hip 
A- UDKuUnUj *-oiV|J VlHiUii, # pU^ai Vic 

nucleotide deletion, V-possible 

UUCICUllvlC IIISCI UUII 














PGNP KNGGPTVPGTPP2TRGS hLRTLQG 
PLRPSGTPPTRRAPGTPVGNPGF* PRDF 
FAQKGPNVREGLKQGPNPKGLNPPP 


2736 


16637 


A 


2755 


15 


172 


UQWWFP XX.VOPI, * KTVRRFLKKLKI DP P 
YYPAIPLLGIYSEERKSVKQSRRG 


2737 


16638 


A 


2756 


263 


2 


QKPLPPRNDPVFPKGPERTPRPTWGFIP 
(VZ\rt>T ■«TrFFTT,T.V < 5 PFFFFFSPSPNLFF 
F*NRVSLCRPGWSAW*SWLTAASTSQA 
QAI 


2738 


16639 


A 


2757 


311 


106 


FVCFLRQRIiALLPLECSCTIIiAHCINIiRXi 
GNL* RSLDMKGR 


2739 


16640 


A 


2758 


23 


328 


NITYYT* LFVLYFYVPGHTWFFFXFXF 
FFFFFFFFFFFFFFFFFXFFFFFFXFFF 

TPWFYFWFFFFFFFFFFFFX'X'FFFFFFFF 

FFFFXFFFFFFFFFFFFF 


2740 


16641 


A 


2759 


1 


338 


np*r l?CTWWnVT/^TT.VT.T.FOA*AGAI*GT 
Utt XiV & Xvit\\3 X 1U1UIUDCUX4 J"»jriijvjA 

Al^LLMRAELGOPGNLLGLDHTYNGIDT 
AHAFAI ILFMSLAI I IRSFGN* LVTLI I 


2741 


16642 


A 


2760 


120 


2 


INKYAWIiGTVAHACNPSTLGGRWTA*GQ 
EFKTSLANMVK 


2742 


16643 


A 


2761 


251 


3 


CRGQIGTPTLHYSSSSSSKWIKDIiNVRV 

Tf^TTrrj^Tn>3TrTVNTiPRTiRTiGDC!FTiDDTKS 
fi_ j_ j_ jLijtj r_in V n i ir* pi i f*^** » - * jjiw^ivM 

KNSTRKI RK* SLKLETFFLGGTR 


2743 


16644 


A 


2762 


11 


303 


EILMYRFCL*IFDIFFYWYSREIDLIFV 
Y*FFFFFLETGFWFLLPRGDIj6*WEPPP 
PGYKGSSCLGLPR 


2744 


16645 


A 


2763 


236 


356 


DWLI FVFS VERGFCHVAQVGL* LLGSGD 
PPASASQGAGIT 


2745 


16646 


A 


2764 


2 


332 


T.TOTPftf.fC* AtTLCTiPECWNYRCEPPHP 
AGFLF* RHFDKGVS * LLIYPGTGWS 


2746 


16647 


A 


2765 


297 


its 
19 


TTKTf Q WFF FKTTfTK'PWfinATICFKlI* RGFFIi 
KFVFYFFFIG*GVFFFFFFLISIIIFFF 
FFFFFFFFFFFVFFYFFFFYFFFFFFLF 
FAMKINVFY 


2747 


16648 


A 


2766 


1 


147 


QILRRLRWENCVNLGGGGCSEPRLHHCT 
PAWTV*NSVSKKKKOTIiIiFF 


2748 


16649 


A 


2767 


357 


loi 


NWDYRPLS PRPGKFLDF* VNRGFPLLTK 
LVSNF* PCNLPPSTSPKAGISGFNPRAQ 
p 


2749 


16650 


A 


2768 


2 


193 


RWESCLNPGGGGYSEPRSCHCTPSWVTE 
♦DSVSTTTTKEKERKQNRTWSESNSLQK 
YKDTNHF 


2750 


16651 


A 


2769 


333 


1 15 


VS QDGLDIiLTS * STHLGUPKCWNYRRE P 
PRPAQKLLMVIWLGFMSSPKSPLEL 


2751 


16652 


A 


2770 


3 


ill 
oil 


CMENCMVT PPKKVKHKNYHMIQKFYFWK 
AVQQLSTELNIL*PYDPAITLLGIYPKE 
LKTS FRTKTCT *MLTA5LFVIAKTWKQP 
RCPSVGE * LSML * YVQTMECYSVLK 


2752 


16653 


A 


2771 


209 




RPGRPQVliRRVRPQNRLNPGGGGFSEPK 
♦GPCPPAWGAQRDFISKKKKTIT 


2753 


16654 


A 


2772 


213 


57 


RPGRPQLLRRLRIiQNRLNPGGGGCSEPR 
♦GPCPPAWVTQRDF r SKKKKTIT 


2754 


16655 


A 


2773 


148 


1 


KATGRKTl^VKSRC WG WAHTCNP STLG 
GRGR* TARAQEFKNSLRTLAK 


2755 


16656 


A 


2774 


2 


364 


WSAVRRDWITALQPERQRETPSQKKKKK 
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T=Th reonine, V=Valtne, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














^\J^a^J^J\J^J\J\J^J^ loGK 

PGRKNHWRQIQGQPKTPRERAEALKNRV 
GLKKLFKTPGHVNGAGNPKFRNGKSGKP 


2756 


16657 


A 


2775 


121 


2 


HLPJSGVSDQPGRHGKAPSLLKIEKLAGS 

vrvj 1 V- Jj w o yJUlAj 


2757 


16658 


A 


2776 


195 


3 


GRVDINTLLALLLI IITF*LPQLIGYIE 
KSTP YECGFDP I S PARVPFS I KFFLAAI 

DWT.T.imT. 
trr UUB JjJj 


2758 


16659 


A 


2777 


3 


285 


FLNIRNKVSLCCPS*A*TPGLKQSCCLG 
LPKC+DYR 


2759 


16660 


A 


2778 


135 


379 


HWSATVKLYAILLGLLKWBSGTLLCSKD 
VLFFL*G*SSAYSIESVIMQIKAT*VKG 
KAKVQIjGAKKIjKAYWQKKSPGI PAG 


2760 


16661 


A 


2779 


364 


1 


GSFTGAVILI IAHGLTSSLLFCLANSNY 
ERTHSRI I ILSQGLPTLLPLIAF* *LLA 
SLANLPLP PT INLLGELSVLVTTFS * SN 
ITLLLTGLNILVTALYSLY I FTTTQWGS 

T fLftTTMWT V 

L1HHINNIK 


2761 


16662 


A 


2780 


356 


1 


GCXRAHIWPQKGNHEGQVHLFIDKVCRQ 
PMTEDCINEITTQVAQIFLVHFLLRQSL 

YOQLIFNKGAKKIK*G*NSLFNKTYIiDP 
WISTCK 


2762 


16663 


A 


2781 


1 


204 


AQVGLQDASSPRIEELITFHDHALIIIF 
LICFLVLYALFIiTLTT * LTNTMISDAQD 
SETDCYSQSYIL 


2763 


16664 


A 


2782 


1 


522 


YKCNECX3KVFNQQAHLAQHQRVHTGEKP 
YKQTECGKTFSQMSNLVYHHRLHSGEKP 

* KCNECGKTFSQMS KLVYHHRLHSGEKP 

* KCNECGNTFHHNSTLVSHKAIHTGEKL 
YKCNEOSKVFNQKTTLARHHRrHTAEKL 
YK*EECD1wTX3CKSNIjjSTHKKMQ 
SAFSLQ 


2764 


16665 


A 


2783 


1 


327 


ENRLNSGGGGCSEPRSHHCTPAWATERD 
fa vc» ciLb i^QKQNFHIjNAQSNCQKLiKS P * 
KYLKHPE KTD I LS KASQYNNQLTADQS I 
LKFKTKTFDQKKGRP KGHGMTYSNS 




1QOOO 








IOC 


SSLPKCSDYKQEPPYIACATLKCYQIPN 
FYCWPHI FKRMFY 


2766 


16667 


A 


2785 


208 


3 


RPICPLSLWGVSFLFFFFFLRDRVLICH 
PSSWNHNLIPPR 




lOOOo 


a 


Z/oo 


AO 


jdZ 


GVLNDLGIAEFNTCSKSLLLLIFFKGRI 
LLCCPGWIEVVQSQLTASSTLGLK*TFH 
LSLFGSWDHR 


& /Oo 




a ! 


2787 


11 1 


187 


bUJa irxto rf uuuwAl WjUo V o JSXtruxjsH 
KKREEKKH*HNTSGEKINLI 




loo/U 


A 


2788 


1*y< 




noriT Dni.DCMOTO* T*WODT D+T ODDDT 

Ko^LiKVj Lifts Mo -Lo * X w WoKJjKT LiKf KKJj 

F I FCRDRVSLCCSGWSQTPGFKQSTHRS 
LPKCWDYRREPSPLAQI RYLGLFFSQGL 
SSAFCYVAMVSGF 


2770 


16671 


A 


2789 


3 


150 


AASTRQLI FHFTSKHHFGFEAAA* YWHF 
VD W * LFL YVS I YWLG 


2771 


16672 


A " 


2790 


86 


311 


NIHP tiNDI FTRLKNGFKKKE I SLVKNEP 
NKGT*SLISIKVLFHT*K*FIRPGVAAH 
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nucleotide insertion 














T*NPSTLGGQGGWIT*SQB 


2772 


16673 


A 


2791 


2 


123 


GGGACSEPRSHLCTPAWETE*DSISEKK 
EEKK* LQI YTNI 


9771 


JOD/t 




9707 


109 • 




PLLRRLRPENCFNLLGGGFN* PRSRNCP 
PAWATKRDSLLKKNLKNF 


2774 


16675 


A 


2793 


1 


258 


GGCSGLRLCHCTP AYJETD * DP VS KKKKK. 
KNFTRAKLLIFGTLVIiGKVKRGGPFKPR 
ILKLPWETWQNLVFTKKKKIFFLKNRGG 
GE 


2775 


16676 


A 


2794 


199 


3 


VGP I FRHDKPPHS KQLCTYS PAFTQLFS 
SGKILKNIHLWPGTVAHACNPS TLGG *A 
RWITRSGDR 


£.110 


100/ / 


a 
t\ 


Z/iO 


i 
i 


1H9 


GGCS E PRS CHC I PAWATEPDS I * KINK* 
INK* K* * KINRNKKGKNHFLSS ST* * PQ 
VASS FLTGQHRYTKL* SSQKVLLDSATL 
QSKSNAEVKRVNRTTD 


Lilt 


loo/o 


j± 


Z/yo 


ZZO 


196 
J ZD 


NDNGO S GWAHACNP S ILGGOGGWI I *G 
REFKT 


2778 


16679 


A 


2797 


270 


1 


KFGISAPFAPSPKV*KRGFF*NFLGNGP 
PVKIPPF*TPL*NFSKPGKNFPPFF*KT 
RFSKI FPNGFFFFFFFFEGVLLCCPGWS 
ALTRSW 


9770 

z/ /y 


l£6£0 




970ft 
z/yo 


107 


JO 


fknfccg*evslccpgwsqapgvk* ps c 
LGFPKCWDYRCELPCLASCSIj* cdrnqt 
RISAVVLWIBFHLSPIIjFVLPLFI.KKPQ 
AGLLCF 


?7»n 

Z/OU 


166R1 




9700 


119 




I FEVTV* CTK*HKRNMQPGWGHACNPS 
TLGGQGGWIMRSGV 


7*7511 


16689 
lOOoZ 




9Rftn 

ZoUU 


19 


114 


KOHPGNGPNPSGKGPGRASRFLKXKNFF 
F*KTPPKKPHTPCKPSPKGKGPERLPWH 
FFNQKFGPLPMGPNPDQRAGFCLRDP*G 
GGKNRSPPTIi 


2782 


16683 


A 


2801 


139 


3 


AASTFYIFFETTLIPTLAIITR*GNQPE 
RLNAGTYFLFYTLVGEG 


97R1 


166R4 




9R09 


918 
ZJo 




AASTSHVISSMYNIYIIIQFKTFPVFFF 
RDKVSLCHTGWSAVAQS*LTAASAALTS 
CVQGILPTQPPEYHTRPLRLFL 




lUUOJ 


A 




llv 


291 


KKLGVFGFGFGFKTKSNFIMQAKGQLPN 
LG*LKPLPRGFPQFSGLSLSGTWDYKHT 
PATR 


2785 


16686 


A 


2804 


97 


2 


SYNNQDSWIiEKEKTNRSVKQN*APIPE 
KKKKKNKVGGLTLPNCKTYYKATIIKTV 
WYWRKKRQIGQ*NRIESPEIDPHKY 


2786 


16687 


A 


2805 


268 


1 


ATFIiLFYYVFRPQ I P YYL* IFILSLF* Y 
RHSGPPYVGPPQQYPVQPPGPGPFYPGP 
GPGDFPOUiPIRKWWLVWLMPQLNPTVW 
PTIIT 


2787 


16688 


A 


2806 


153 


3 


IPIiPKGLLVPIiFGVFLKVFFFFFFFF*D 
RVSPCRPDWSAVAQSRLTASS 


2788 


16689 


A 


2807 


3 


216 


NAARDHI VKPSP * PLTGALSALVMTSGL 
AM* CHSHS ISLLI LSLLTSTLTI YQ *RR 
NVTR * TS YQGHHT PTDQKGLRYG I ILFI 
TSEAFFLS * FF*AFYHSSLSPTPQLRGH 
WPPTGLS PLNPLEVPLLNTSELLAS * VM 
PLSLHIASNTKPTNQHTNHIPMKAQCNT 
INLIPRPPHTN 


2789 


16690 


A 


2808 


168 


354 


FLERE SG FVAQAEGQGQNLS * WKP F FP I 
LKHFSCLSLLRSWNYGPTSPVPAKVGGF 
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n=niiifomin» D— A r-rrin I n C— Q*»!w* 

vjriuiatioiiic, iv — /urginmc, o — oerine, 
T— Threonine V~Valine. 

* i ill dflllllv, f Tallllvj 

W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stop codon, /^possible 

nucleotide deletion, possible 

IllltlCUUUC IXlbcrlllMl 














WRNKIY 


2790 


16691 


A 


2809 


61 


360 


YVSNS KCSNHRK* SLFFFFF FERESS FV 

LRNGDDGPRPQPPANIJGLLVKTGFSPVA 
HDGVNLGTLGDCPALP 












•5 
J 


TCMP POMP VTMT.Trpr* OAT W3PTT 2VT."KTffl/T 

KTFLAP I FFFLRRNS ALVAQAGVQWRDL 
GPLPPLPSGLKRFSCLSLPSC 


2792 


16693 

IUU7J 


A 


2811 


IH-V 




LLWKTGYQFLKKLKLELLSNVPI PYL+ 1 
WKKGNPYALLLAI 


2793 


16694 


A 


2812 


230 


1 


VKLCPVSLKTGVRPLLTSSEVIFILNCK 
FNIGV*LLPGSLLLLLASAIVEAGAGSG 

* TV7VP PT AHMV Q HAP PV21T.V 
i v x At lr liHuuN X o Una Jr I AH V 


2794 


16695 


A 


2813 


3 


345 


HEVRI DVHTRTCFTFGTI I IAI PTGVKV 
FS * LATLHGSNMK* SAAVL * ALRFIFLF 
Tunnr/m t vt jqwq q T.nnrr.uriTWi Aran 

FHYVLS IGTVFAI IGGFIH* FPLFSGYT 
LD 


2795 


16696 


A 


2814 


2 


184 


ARVGEliHIDQAGLKIiLTL* SAYFGFPKC 
GEKI 


27Q6 






/.ol J 


L /O 


j 


SWD* * SMQPCPDDYGTRSDQKKPTFKSP 

YVQ(YFf5T ,1?PT /5 <? Cm . P AQ2V Q flP AITTTY^M 
x v oy x vjxjanii^joaxvijtrrtJtfio^i\Ar\^L x\jl v l 

SHHALV 


2797 


16698 


A 


2816 


3 


94 


HENQYSSLIIIMAIAIKLGIAPFHF*VP 
EV 


2798 


16699 


A 


2817 


93 


1 


GGGGCSEPRLHHCTPVWVTE*NPVSKKK 
TRA 


2799 


16700 


A 


2818 


1 


197 


GTRAQSLLLGRLRQKNQLNLGGRGCSBP 
KS GHLTPAWAT * PDLVSTKS CTLI YLVT 
QALHITLLP 


2800 


16701 


A 


2819 


109 


2 


GVF*GVFIiAQGLTLVAQAEVQWHDU3SL 
nPr'DPPT.TTRT.I/ 

\£ tr V— ±r Mr JUI Xv-EtXj V 


2801 


16702 


A 


2820 


2 


347 


ARAPG FAENEVFVFS SCFVWMVPGGKGE 
NAELMQPSSY*BSQHFFPAPPRCSSCV* 
LCSLGPSLLGTLIFCQFTLSELPRFRKG 
* FSSSLKKVTCTODRVlxLCHPGWSAVVQ 
TQIj 


2802 


16703 


A 


2821 


1 


353 


CTRRDVTRESTYGGHHRPPVQRGLPYGI 
ILFITSOVFFFT* FF *TFYHSTLTPTPG 
L*GHWPPTGITPx^LKVPLLNTSVIxLP 
SRVS IT*AHHILI *NNRNQIISSLLITI 
LLGLY 


2803 


16704 


A 


2822 


261 


400 


VEHSNSWKSNFLGQGTGCHACNIaNTIjGG 
RGGRITWRSGV*DQLDQH 


2804 


16705 


A 


2823 


273 


97 


LNTPRMQRLTQlxATVILRFHKDQGFAML 
PRQVSNS * AQAICHLGLPKC* DYRREP P 
GQK 


2R0S 


lO/UO 


A 




1 so 


jjj 


jvivJc roirrr rrnr * uaiojjunjjunort v 
VQSWLTAAST 


2806 


16707 


A 


2825 


67 


338 


LSPEL*LY*IF*KPFS*QLSLTSHSPSL 
FSALLLLS ISPPTWLHSQSDKNLHLFSF 
LFPJ}SV1 J LCCLGYSAVT*SWFTVASNSW 
AQWLS 


2807 


16708 


A 


2826 


3 


359 


HEQKYS FLHDSQTLFCF* DS I PTPSNMD 
ETQQKSRLELVRISLLLIEPWLEPERLL 
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nucleotide insertion 














RSMVANNLVYDTS DSDD YHLLKDLQEG I 
QTLMGRLEYGSRRTGQMLKQTYSKFDTN 
SHNHDAL 


2808 


16709 


A 


2827 


3 


338 


LERNLDTTFLDPAGGGDPILYQHLF* FF 
GHPEAYILILPRFGIVSHIVSYYFGRLE 
PSGYLRMG* PI ISVGLLGFI V* SHHIFS 
VGVDIHARAYS YFGSLVMAI PCTLEVFT 


2809 


16710 


A 


2828 


1 


406 


RHEGE KLRRPTFGPRHRGAGTAKMS AS L 
VRATVRAVSKRKLQPTRAALTLTPSAVN 
KIKQLIiKDKPEHVGVKVGVRTRGCNGLS 
YTLEYTKTKGDSDEEVIQDGVRVFIEKK 
AQLTLLGTEMDYVED*LSSBFVF 


2810 


16711 


Jt\ 






i 


GI IVMQATIATALIGYVLPLGQISL*GA 
1VIAYIIAAIPFIGTDLVQ*I*VGCSVD 
SPTVTGFLAFPFVLAFI IAAVAALIiLRF 
LQGTGSPNPLGGASQSDKIAFQFSC 


2811 


16712 






1 tt£ 
150 




YSBRE VAFFFFGNGFCF * TQAEWNGGNL 
G* LNLLPPR* KE FSCKI FPRTWNYS P 


2812 


16713 


A 


2831 


303 


590 


NILTTLLNKQSKSNQLQNKIIHTVYIKI 
KIFFFFIiRQSLSVTQADVQWHCLCSIiQP 
PPPGFK* FSCLCT J I.SSWDYKHAPLHPAN 
FCIFSRDGISPS 


2813 


16714 


A 


2832 


1 


370 


EELITFRDHAVIISFLICFLVLYALFLT 
LTTKLTNTNILDAQDIQTV* TILPAI IL 
VLIGZiPFLRILYITDBVDDPSLTIK*NG 
HQWD* TY* YTDYGGLI FNS YI LRPLLLE 
PGELRLRDVDN 


2814 


16715 




OC37 




i 


QFFIFLRHSFTLFAQAGMQWRDLGSLQP 
SPPGFK* FSYLSTLRGLARKITLAQEFQ 
TSLGNMGGPHPRA 


2815 


16716 


A 


2834 


223 


I 


DNLAHKGKTRVYLKSRNKLGKGGGAGNL 
VSLDS IGSRRDHRRAPPCPANFVFLV* M 
GFPHVGQDGPEIjLTSCSC 


2816 


16717 


A 


2835 


1 


321 


GTRKPS P * P VTGALSALLMTS GLTM * LH 
FHS I TLL ILGLLTNTLT I YQ * WRDVARE 
STYQGHHTPPVQKGLRYGI ilfitsegg 
lfagff*afyhssldrtpqlggq 


2817 


16718 


A 


2836 


57 


370 


iwmgrvlwkdrlygvfcrafnravtsrv 
waedpwrvpktls vdprklppfs * isvr 

GQI YFT* FFFFFETESHTVAQAGVQWWG 
DLG*LQPSPPGFKRFCLSLP 


2818 


16719 


A 


2837 


2 


345 


ARAHRQLDEP*L*RRPGERHPSW*SEET 
VERQRTKT*SESSQTGTSITSSRNARRR 
ESEKSLHLETLNKEEDCHSPTFKPSTPD 
HPLKVMPAPSPKENAWVKRSSNPPARSQ 
SS 


2819 


16720 


A 


2838 


15 


376 


AK I E PLYSSLGNKS ETPSQKGKKKKKKK 
KGKGGGEKKKKKKTEGGTRGLTQETQFF 
GNPKGPKHLGARN* KHRAKQAKRGNQPR 
EYRFTEEKKGEKLFNPGGGRSQKPKTGQ 
WNSAGGKK 


2820 


16721 


A 


2839 


3 


336 


HELLAS ILLIY* * CRDVTRERI I PLAHH 
TPPVQKGLRYGI ILFITS * VCFLAGLF* 
YLYHSSLPSTAQLRGHWP PTGI TPLNPL 
EVPLLITS VLLASLCSFI *AHHSLIEH 


2821 


16722 


A 


2840 


338 


3 


KIIFYLFFIJtelAFLAQIEGHWAVFSSF 
KLWP PG FKHFS AS T FLVNRVFKGGPRAR 
VNFGFF I KSGFFP I F * VGF * LFFFFFFE 
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W=Tryptophan, Y=Tyrosine, 
a— unicnown, — oiop coaon, /—possible 
nucleotide deletion, ^possible 
nucleotide insertion 
















2822 


16723 


A 


2841 


2 


149 


ARGCSEPTLHPCTPAWVTE*DS ISKKKK 
KKVECSNGNVCRTMCILCCFW 




i am a 
15/24 


A 


2842 


348 


175 


bACKVVS XTGMHCHA* V I VS VIJjV* TGS 
LHVGQVGLElxRTSDDMHTLASVSGGITG 

MP 

Plo 


2824 


16725 


A 


2843 


338 


118 


DPGGGACSEPRLRHCTPAWVTKRDFVSK 
KKKK*NIKC* HAKSGTVLYS FLYLFYS * 
irxjyLKVvovHJ.lrvjfc'ollv 


2825 


16726 


A 


2844 


189 


47 


KCVAIiLLSLFIiSCSVM*RNACPRPAPCH 
DCKFLEASPAMLPIQSVEL 


2826 


16727 


A 


2845 


335 


139 


EVSPSWPG*SQTPDLKRSACLSIiSKCWD 
YRPGVSHRARLNGSFLTQLYEMLTYFPI 
IWVTLQVFR 


■2827 


16728 


A 


2846 


3 


352 


HEQRLTPEWKKAATALGDVVKVGAVDAD 
KHHS LGGQYG VQG FPT I KI FGSNKNRP * 
DYQGG ITGSAIVDAAIiSALRQLVKDRLG 
GRSGG YRSGLQGTSYRSS KKDVIELTND 
SFDN 


2828 


16729 


A 


2847 


2 


269 


ARGLFSTNHKDIGTLYGLFGA*AGVLGT 
AIiSLLIRAEIiGQPGHLLGRDDISRIHVK 
RI I PGCGTLDGISVGRECACLWTPACSR 
KCARG 


2829 


16730 


A 


2848 


317 


28 


SFLPN* KLAGHGGRPL* FQVLRRVRLEG 
CFSLEG*GCSEPWSCPCPLTWATETDPF 
SREKKKRRRREEKAEKRRENRRVPSLSV 
FLEEYVELTLIL 


2830 


16731 


A 


2849 


326 


107 


FHHv GQDGliDLL PS * SAHLGFP KCWD * R 
CKPPCPASRNFKTLKLYHIFSNNLIE*N 
*KS I TRGTLETVQLPGN 


2831 


16732 


A 


2850 


224 


3 


SILQAWLKVPSPPGNPLGLLQSQVDFFF 
♦RQGLALS PGLDCSGMI MAHCS LKLLGS 
TDPPTLAS*VAGTPSSC . 1 


2832 


16733 


A 


2851 


27 


329 


VRTAAINRPGFLIiPCFIGQNFFFY*RFE 
TYRLGV*-MILTPH*YLLSSLSGLDCVIF 
FS Fi LFFYWF I VYCHVFMVYLFFFLFVI 
LFVLCVFWCFYFIiLFV 


2833 


16734 


A 


2852 


351 


60 


NLLSSLQGGRLPRPPGLS PHPQIFTPHG 
L> V FXi T LiFLa P W KPKASliN PKb * P P PGQ 
PRGNPFPPKKNFFFFETVLLCRPGWSAV 
AQFQKKKNLLIC 




10735 


A 


2853 


1 <>£. 

186 


365 


Li I \2 iiliN Li 1*i QciQ I BY PU P YAQCG PRSG 
NAQQIGHSFPNG * QVPASGMHGLPWTHQ 
GLKK 




1 CSYlC 

lo/Jo 


A 


2854 


17 


23V 


uttttt j^iT omct t •putt 7kotrr»o*mxT'T»ior\T 
rir X Xx^XoNAXjJjXhtXXjAKxlUo IHJjWbyJLi 

L* RLRRNDLLKSGAGGCSELGLHHCTPT 

nil a\iiJtr v orvf vnocn 


2836 


16737 


A 


2855 


325 


140 


GFHYVSQLX3LDLLTS * S ACLAFP KCWD Y 
RHKPPCI1AAYIFIQSRVFI1AVISISTHS 


2837 


16738 


A 


2856 


1 


157 


GTRGFHHVGQAGLEVLTSDNLPVSASQS 
AGITGESH*AHERSDGYANTNSPG 


*>OJo 




A 


LoJl 




JO I 


r .T.T.R T VS LiPWP IT*? Q PT /WO A VMYP PnT.P 

VTKPPPPD*DKDSCIiSLLITWDYRLAPP 
LPANLEF * I * TG 


2839 


16740 


A 


2858 


3 


346 


HEGNHKFKI INVHLR* LNQNL * SGLEGD 
KVIWRWI *TS VTFTiFC *MNGKMNPDI*FR 
QFTYEVGRIFFFSLKD*VWLCCLGSSTV 
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nucle- 
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E=Glutamic Acid, F=Pheny (alanine, 
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K=Lysine, L=Leucine, M==Methionine, 
N-Asparagioe, P=Proline, 
Q=Giutamine, R=Arginine T S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














DLGSLQPPPLGFKQFSCLSIjPSSWEPPC 
PA 


2840 


16741 


A 


2859 


18 


233 


KNPTILCCKGGGAQA*KPRIiLGRLRQEK 
GLTPGGEGCRDLKJSPLGIPPWATRPKLL 
LKKKKKKGGKKKTLF 


2841 


16742 


A 


2860 


356 


3 


RVNFKIFGKKGFCPGGPGGFEVLTPGDL 
GPLFFQKGGDS PSKPLKPFFFCFALKST 
TLNYFQFAFCVLRPTPN*SVTQAGMQ*H 
DHSSLQPQPPSLK* SSQLGFPSS WVYRH 
APPSC 


Oft AO 


lOtHD 


j± 


ZOOl 


1 1 ^ 

1 ID 


^74. 
j if 


IVCGKILNV*YTQ*HNKLQNKNKVD*PT 
IKLMLISQSLGILIFIQKRDFEVNYGMI 
LKKRGWPGPVAYVCNSKTLGGQGGQIT 


Zo*»J 




j± 


ZoOZ 


lO j 




LKR I WLL I LHSH PHTSPHDRG VTQLEAI 
GPRWY * TYEYTDYGGLI FNSYI LPPLFL 
EPGDLRLLDA 


2844 


16745 


A 


2863 


1 


236 


GTSPFFSFFFLS FFPSFLPFLSFFNPTT 
LLLSLHSNI S * KKVLFSFEMESHSVNRL 
ECSGTISAHCSPAWATGTDSIS 


7RAS 


lO /HO 






fn 

V / 


364 


VCVOTCVCVSVSVCVCVCVYICTYICMY 
VCA* YRIRRYIMLPTLVYNSCPE IHDSK 
SCADHTGAITRATR* PCLCIHIPVCLTV 
CLTVWLHI CLS VCMD 




io/**/ 


A 




0 

£. 




ARGLTQENRLNLGGGGCSELRSHHCIPA 
WATB * DALSQKRKKKKGTYRG 1 1 




1fi7 Aft 


j± 


ZoOO 


-IAA 
J*tH 




IQPNSFIFLKIFISRDGALIjCCPGWSPN 
SWAQQSSRLNLPKCWDYRHEPPHLWQF 
L*KHYMYKWHKVHCVFSHYNKIFTGQTK 
TEN 


2848 


16749 


A 


2867 


217 


340 


RVFSDFYMNLKWSLRKRRTWPGLVAHAC 
NPSTLAG*RGQIT 


2849 


16750 


A 


2868 


3 


349 


HEATSPI IQELI TFHDHALI 1 1 ILICYL 
LLYALYLTLTTKLTITIMSDAHE IHTA* 
TILPAI ILDLIDLPCLRILYVTDEVNDP 
YtiTITS IGHQWY* TYEDTDYGGLI FNSY 
ILT 


2850 


16751 


A 


2869 


1 


352 


GTRIDVYTRAYFTS ATIGIAI PTGVKVF 
S *LATLHGSNMK* SAAAL*ALGFI FLFT 
VSGLTGIVLANSSLDIViaDTYYVV7AHF 
HYVLS IGAVFAI IRGFIH* FPLFSGYTL 
DQTYA 


2851 


16752 


A 


2870 


2 


342 


ARARFRTS T J >T t AFALL* LP WLQEAGAAQ 
TVPLTTLFDHTMLQDHRAHQLAIDTYQB 
FEETYIPKDQKDSFLHDSQTSFCYSDSI 
PTPSNMEETQQKSNLELLRIFLLLIESR 
L 


2852 


16753 


A 


2871 


254 


2 


YPGYQAISQSDMQSYAYRHL*LQPPGVK 

* SS I VS LLS S * DHRP VP PCLANE KNFFF 

* RQGLAMLQSRLVLNS WLLL YSHAE PRA 


2853 


16754 


A 


2872 


282 


3 


HTHILHYIYIHHGHIST*HLSPPTNIYS 
LCNIKI LI YTPYDH* I IRIPLHQEHAVH 
RRRLIHIHIHTSVCVCVCVSVCVACVCV 
CVCVWVLV 


2854 


16755 


A 


2873 


278 


3 


LWPLKNSGPQPWPQMDLGKPQVKTPWY 
YKKGP* KXFFRKF* KWPGFFFFFFLRQS 
HSIAEAGVQWRDLGSLQPLPPGFKRFSC 
LNQISSC 


2855 


16756 


A 


2874 


2 


348 


ARA*SLILVSLI IPNATSNLLGLLPYSF 
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TPTTQLS INLAMD I PL* AGTMVIG YSSK 
IRHALAHFLPRGTPTHLIP ILAMIESMS 
T.T.TQP I AMAVRVTDNI TAGHLTM PLTGS 
PTV 


2856 


16757 


A 


2875 


1 


260 


GTRENHLDPAGRGCS KSRSHHCI PAWVT 
D*DSLKKKKKKKPOI ILGNGGTEGNOK* 
RGNPFRGPWGPSRQKIKELGPGGKHGKS 
KF 


2857 


16758 


A 


2876 


176 


1 


EHLFCKKLLGEEKGSLSKEFGKCWVNPR 
♦KWNSRPGWAHTCNPSTLGGRGGQITR 
SG 


2858 


16759 


A 


2877 


1UO * 


I 


GVRVFGFSSVPE FSFDTCAGAOWRNTif5«3 
LQPLPPGFK* FS CLSIiLSSWDYKRTPRA 


2859 


16760 


A 


2878 


67 


333 


WQGLGRAVARVPRSYEITWAGNLKGAA* 
FQQLFFFFFFHLLFFFFFFFFFFFFFFF 
FFFFFFFFFFLFSFFFLFFFFFFFFFFF 
FFFFL 


2860 


16761 


A 


2879 


335 


1 


GRKEGRREKEKBRKKKKERKKERKKTI F 
KKHIPSPLAFPRKAWKLKEIQIKliVAH* 
KINRQAIRGCPPNKRPHPFQNMI KPGRE 
EGEGGLETSRDGERNLWTFGPSVOPT.R 


2861 


16762 


A 


2880 


3 


345 


HERHETASIILLIAILFNNILSGQ*TIT 
NTTNQYSSLIIIMAIAIKLGIAPFHF*V 
PEVAOGTPLTSGLLLLT*OKT»AP t QT t y 
QISPSLNVSLLLTLSILSIIAGS*GGLN 
QT 


2862 


16763 


A 


2881 


347 


156 


WLIFVLLVEMGFFHSGQAVLKLLASSDP 
ASMTSOS SGITGIiRHCADPP * AFT* T ATiP 
LPPSAKI 


2863 


16764 


A 


2882 


338 


2 


IKKAL*HE*LREKKKSPIRTSQSSKAP* 

MGKLLETPNLPLLSQKVAEPLNQPITSS 
KIETLIKKPYHPEKSYGHDEVTAKFSRA 


2864 


16765 


A 


2883 


206 


1 


IRTRPFTPMF I FYMFVYLATRS CP FT LA 
GVQWHNHGSLQL* PPGLKESSCLSLLSS 
WDYYACLHAELV 


2865 


16766 


A 


2884 


341 


140 


DGGCRE PRSRHCS PAWVTE* DS KQNKTK 
QKQKRYTNISSPQTSPLCYSFSAYTPLH 
NMPGNIMFSSL 


2866 


16767 


A 


2885 


360 


247 


FHRVSQDGLDLLTS * SAHLS LP KCWD YR 
REPPRRPKAL 


2867 


16768 


A 


2886 


229 


351 


KLNNRPGVVAHAFNPSTL* KLNNRPG W 
AHAFNPSTTjGG*GGWIMRSGVRDQPNQH 
GETP 


2868 


16769 


A 


2887 


359 


2 


FYSSSSSSSELVPCRQDVQVPHYLEGLF 
LRSCFMEIKYDT* KRKI KYTHVHTHTHT 
HTHTHTQREKERDFPRFTAKWKANLEAG 
SGYATAS S IRAVLWTTITLV 


2869 


16770 


A 


2888 


4 


305 


ADSHMWKYKAPGITKILLKKSKI *GYHQ 
DWISSRYQDLFHKDMI FKTMS YWYKKRQ 
VDHWDRIESPETSSQMYEHLIYNKDNIA 
EW* EKDS FLSKWCWDN 


2870 


16771 


A 


2889 


1 


436 


IAILT * YDYTLL* RVGPRGQEFGTRATV 
ITNLLSAIPYIGADWK*G*GGYSLDSP 
TLTRFFTFHFILPFI IASLATLHLLFLH 
♦TLSNIPLRITSHSHQITFHPYYTDQYS 
LRIxT^FLLSFTTLLLILPNFLFYPP*TT 
LTFPF 
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2871 


16772 


A 


2890 


1 


218 


RKISPL IKLINHSFIDLPTPSNI SA* *N 
FGSLLGACLILQITTGLFIiAMHYSPDAS 
TAFS S IAHITRDVNYG * 1 1 RYLHANGAS 
IFFICLFLHIGRGLYYGSFLySET*NIG 
I ILIiLAT IATAF IG YVLP * GS LLGACL I 
LQ I TTGLFLAMHYS PDASTAFS S IAHIT 
RDVNYG 


2872 


16773 


A 


2891 


3 


368 


LSNS * ANNIi I * LAYT IAF I VKI P LYGLH 
L*LPKAHVEAP IAGS IVLAAVLLKLGGY 
GIIRLTLILNPLTKHIAYPFLVIiSL*GI 
IITSSICLRQTDLKSblAYSSISHIALV 
VTAILIQTP 


2873 


16774 


A 


2892 


3 


342 


HENLI*LAYTIAFIVKIPLYGLHL*LPK 
AHVEAP IAGS I VLAAVLLRLRG YGI IRL 
TLILNPLTKHIAYPFLEIiSL*GIIITSS 
rCLROTDLKSIilAYSSISHIALVVTAIIj 
I 


2874 


16775 


A 


2893 


2 


361 


ARVCLRQTDLKSL IAYSS I SHIALWTA 
ILIQTP*SFTGAVILIIAHGIiTSSLLFC 
LANSNYERTHSRII ILSQGLQTLLPLIA 
F* * LLASIANLALPPTINLLGELSVLVT 
TFS*SNIT 


2875 


16776 


A 


2894 


229 


3 


YTFVQSFIFSWLFICKVCIiL* PAHSMV 
CACEC^CVCVCLCVCLSSLKI I PLLECL 
IYIiQLILVLSSAIYNSRFB 


2876 


16777 


A 


2895 


3 


391 


GFLITNNISPASPFQTTIPLYLKLTALA 
DTFLGLLTALDLNYLTNKLKIKSPLCTF 
YFSNILGFYPSITHRTIPYLGLLTSQNL 
PUiLLDI»T*LEKIiLPiCriSQHQISTS 1 1 
TSTQKG 1 1 KLYFLS FFF 


2877 


16778 


A 


2896 


381 


2 


GRCRVSGS I IDHRMAPQKWR YTKGGPHQ 
WHNFCFF* KKGVFPYGPKVFLFRAFVFS 
PPGPPKRWEIKV* TPPPALFFFFFFFFV 
EMGSHCVAQAGLKLLGSSTPPTSASQGA 
EIAGVSHRAQPHAS 


2878 


16779 


A 


2897 


324 


1 


IiYTNNTKHINPQQNNTK*YQRNTKNRRK 
KEQQTKNNIKRHITESARKNS* *HRNIQ 
KANKSQQSNKHYRARSRHNNNNNNNNNN 
KKKKIENSHAASD* ITSSGGRSRA 


2879 


16780 


A 


2898 


120 


345 


P PAPS S * YTTS L I QDRLFLMMAVLSSAS 
LMRGNVGSNIMNALSHFLPQGTATIiFIP 
VLDIMEAISLLIQPIALAV 


2880 


16781 


A 


2899 


381 


219 


CVSQDGLDLLTS* STRLSLPKSWDYRCE 
PPRPAGSGHLYSSFKKQTGDTHHNF 


2881 


16782 


A 


2900 


1 


224 


LSWCLTLYFAYLLAL FYFYFLKILDLAI 
LPRLVPYSWP*ASSPASASQSSGITC24S 
HYTWPLPQTLNPFLFFVP 


2882 


16783 


A 


2901 


1 


228 


KYIiINNRLITTQQ*IiIKLTSKQMITIHN 
T*GY*YNRSGSSFNGVYDYHLLRSESEH 
PWMIVDNTEYDEIYTRGGIE 


2883 


16784 


A 


2902 


2 


332 


LPPSFSLLLAPSPSSSIiPPSPSSSLPPS 
PSSSLPPSPSSSLHTSHSSTLTPSASCT 
LTITESFTQRIiAQS YITILP * ASRTTMQ 
LLKYRTLQTTRNNTHSMTRPKALTKI 


2884 


16785 


A 


2903 


3 


243 


DIGFGTDFSDMTPKAQTKQVDQLDFIKI 
♦NFWPGTVAHTCNPSTLGGRGM* ITKLG 
V*DQPGQRGETTQKLAGHGGVDAV 


2885 


16786 


A 


2904 179 


2 


CGYFNAYICYSLLCFLYLSLCNQPFWKK 
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OCysteine, D=Aspartic Acid, 
E=Glutaroic Acid, ^Phenylalanine, 
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I FLPLFGLVFFFFFF *NRV* FCCPGWSA 

vsq 


2886 


16787 


A 


2905 


108 


2 


GRVDIKLTSKQMITIHNTKGRT*SLILV 
SLIIFIA 


2887 


16788 


A 


2906 


318 


2 


YKESNSKNNEKKKITRQRKRQKKNRPSM 
MVHTCNPSPMAGRDGQI PS ILFNQPHS P 
RI IRLTLILNPLTKHIAYPFLVLiSL*GI 
IITSS IGLRQTDLQSLIAYSS 


2888 


16789 


A 


2907 


17 


308 


KLAGYGGMCLWSKLLERPTRMNHLSPRS 
EGLMEP*SHNCIPDWTP*QNPVSVQITG 
FCFLTPNLAMPPRIiNLSGPICCQLSLRL 
LGPRIiFAASVSGV 


2889 


16790 


A 


2908 


378 


1 


RSLHSTETAAKTHGHQTHPGLSTFSK*T 
PTP IAS I P I PNPHNCLPAFNS TTSASYF 
SBCVLTSSFKEX IADFR+ KYSSTN*HPY 
HLFLiLKFVFFCSRDGSLTMLPIRPGLNS 
RSQAILPPWPPKVL 


2890 


16791 


A 


2909 




129 


HLFGTNHRDIGTLCLLFGA*AGVLGTAL 
SLLIRAELGQPSTRP 


2891 


16792 


A 


2910 


1 


323 


CV CWGYRI PRCNIGHHLKFFFFSLFFFF 
FGNKVWFCSEG*RAGDQLLIMEPLASGF 
KGI FFLNLPQNWE * RVSPKLPGKFWIF* 
LKTGFPLVAQWFELRTSGDQMA 


2892 


16793 


A 


2911 


367 


136 


ETPGNHLSPGV*GCSEL*SCLCTPAWAT 
EQDPISKQKRTKKS KTLLKNTKADLTRW 
KHIHKSACVCICLCMYMCLRE 


2893 


16794 


A 


2912 


258 


140 


FLIFFCLDCFLNIKVLGVPGFFFSLKKK 
QT * RQGLALS PRLE CSGL I IAHCTLE PL 
GSRNPLTLAPQ IAGATGMSH* VQGYNEL 


2894 


16795 


A 


2913 


48 


242 


YSMIFCVLILPVSFTSSDGIFLLINLFIi 
FY * STYPIiAFFKTESHSVTQAGrVRWHHL 
GSLQPPPPR 


2895 


16796 


A 


2914 


309 


76 


GF 1 1 CVCVCV C VCV CVCWF I FTEY S FH 
SLFC+ IROCVFYFLALSIiLIGFYLFWNR 
YLEVYNISFEVWASLFNRFLLL 


2896 


16797 


A 


2915 


1 


122 


FSVETGFHHVGQPGL*PPTSGDPPSLAS 
QS AG I TGMSHHA 


2897 


16798 


A 


2916 


167 


1 


GNMCSKESVSGTNINRKPD**YNPRLGT 
VAHAYNPSTLGGQGRWITGGQEFKTSL 


2898 


16799 


A 


2917 


2 


356 


ARGILLLRIMLTTLTRYP* *RDVTREST 
YQGHHTPPVQKGLRYGI ILFITSEAFFL 
PRFLPPFLSSSFPPTPSLLKPFPPSSFP 
SLPSFFPPFLPSSLFL 


2899 


16800 


A 


2918 


2 


351 


ARBYTSLQLILQMTFIMAFTCTDRTLYE 
LAFEC* XiMRTLD I ITR*GNQP * RPNAGS 
YWLFYTLLGTLPLLIALIYTHNTIiGSIiN 
ILLLTLTAQELSNS *ANNLI*LAYTIAF 
IVKI 


2900 


16801 


A 


2919 


32 


321 


ALMGITFFFFLGKGVLAPRGGGRGGNPG 
LWGGPPPGLGPFSG*SLQGGGA*GPPPQ 
AGANFGFLILRKTRGSPGCPGGFWIPGP 
GDRPAGAPQGGG 


2901 


16802 


A 


2920 


I 


361 


GTSTRLGVLLLSLHHAGS INPLGITLHS 
DKVTLHPYYTIKDALGI^LFIiIiSWlTLT 
LFSPDLLGDPDNYTIANPLNTPPHIKPE 
* YFLFAYTI LRS VPNKLGGVLALLLS I L 
ILAIIPIL | 


2902 


16803 


A 


2921 


3 


152 


HERLYSVPLRILRRLPDPLSIPQGWRHV 
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I RESAYKSHLTLPARRGLQDG I ILF ITS 
EWFFAGFF*AFYHSSLCPTPQL*GHWP 
PTGITPLNPPESPLLNTSVLLASGVVIT 


2903 


16804 


A 


2922 


410 


1 


RAGYRQKWGSLATVIGQLGLPVERGWYG 

NHRPRCGGSLLSELKFAPLALQPGRQSK 
TI^QKKNKQTTTKKNTKKQPGSVAHACN 
PSTLGG*GRQITLPSGVRDQPGQH 




IOoUj 


A 


000*2 


o 
L 


143 


&dzv zii>nr»\/T + c cut n? a our* t ott T.ccnnT 
PALASQSTGITGMSHQCPG 


2905 


16806 


A 


2924 


29 


340 


EMNEVERRGRK*LFHDSIQSR*EYRCAP 
QCPANSCVLM*RWGFAMLPKIiVSSDLPA 

T. A CAQfinTTfnrCtlPRUTT A7T7T "DT T OTODT 

SQRHLYLPRA 


2906 


16807 


A 


2925 


2 


337 


ARVLILPGFGIISHIVTYYSGKKEPFGY 
IGMD*AMISIGFLGFIV*AHHIFTVGID 

LHGCNMI * SAAVL* SLGFI FVFI VSGQA 


2907 


16808 


A 


2926 


76 


388 


RYCTPAWATETPSQKKEKKKKCQTFSPE 
SSFPVCLENGGGGNFISLFFLFLIILSL 

WWW* PPPT TWVQVW17V17PT.l?GVT?P 

FYFFINLI IFLLLNYFFFFI 


2908 


16809 


A 


2927 


48 


374 


KDQBPTDMGSAHFQVFKGWRQVGGANED 
RINSHGGPVMFSGQYYGQLVRFLVY* I I 
irrrtLturr our loir Lir ±o±r r VJjVr i L. 
LFIFLFYFSFFFIFVFFFYYFIVIF 


2909 


16810 


A 


2928 


3 


294 


HEGFHYI SQDGLDLLTS * SARLGLPKCW 
DYRREPPRPARFPTLRIWLYILKALYTV 
LGCTREYEATOPWCTTHRMDGGQEVEGR 
REGTGGGHIIjGF 


2910 


16811 


A 


2929 


7 


420 


i rrrdvllltlts ls ptaar i cynggrr 
gnrlnsiiaylskellaawslrkpshgll 

imgaaggtg*tv *ptlarnysqpgvcvn 

LAMVSLHLSGVSSIIiGAITFITAIMNME 
APAITQYQTPLFV*YGLITADLLVLSLP 

nT.TZVrt TTTT .T.TTiP trT.CTTTyiTriP 2i^2/2T2ni> 
JJJjJLn\?.L J. J. Uli 1 L/lwIULlO JL Jl C c lJJr/^JCxovjJJJr 

ILY 


2911 


16812 


A 


2930 


11 


253 


GLXiHKAP S P * KFFFS PKPFNF FWKFS P I 
FSPPKKKFLSKNPHIVFKFPPFKGKIFT 
FPPPLKFGPPRVFFKAPPPFFFFFFFFF 
FFFFF 


2912 


16813 


A 


2931 


70 


341 


DSSFFS*LATLHGSNMK* SAAVL* ALGF 
I FLFWCGLTDIALPNSSIJDIALLOTYY 
WAHFHYVLSLGAVFAIIRGLIH* FPLF 
SGYTLDOS *AICIHCAT IFIGVYXTFFPH 
HSLGL 


2913 


16814 


A 


2932 


3 


340 


NYSHPGAS VNliI I FTVHLAGVS C I LRAI 
NFITTIINIKPPAITQYQTPLFV*SVLI 
TGVIiVl^IPVI^AGrTILLTDRNLNTT 
FRDP AGGGDHTT .VOT .T.F* T.RfiWP F 1VT T 


2914 


16815 


A 


2933 


2 


364 


RVQKGLRYWI ILS ITSEVLLFAGYF* AL 
xTISSLAPTPQLGGHWPPSGITPLNPLEV 
PLLYTYVIiLASRVSFT*AHHRLIQNNRN 
HIIQALLITILLGLYCTLLQAS*NFEPP 
FTISDGIYG 


2915 


16816 


A 


2934 


2 


367 


PRVR P RVRYLLFGA* AGVLGT ALS LLIR 
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AELGQPGNLLGNDHI YNVIGTAHAFVI I 
FFIVI PI I IGGFGN*LGPLI IGAPDMAF 
PRINNISF*LLPPSLLLIiIiASAIVEAGA 
RTG*TVYPPL 


2916 


16817 


A 


2935 


3 


365 


YHIV* PNP*PLTWALSALLMTSGLTM*F 
HFHS ITLVILGLLTNTLTI YQ * WRDVSR 
*STYQGHHTPPVQKGLRYGIMLFIT*DR 
FFFAGVF*ALYHSSLAPTPHLGGHWPPT 
GITPLNPLE 


2917 


16818 


A 


2936 


3 


144 


DSHGRHVING* TCVTICFIRQLIGHPTS 
KHHFGFEAAA* YWHKKKKK 


2918 


16819 


A 


2937 


3 


422 


QRLLATNHHDIGTLYIiSFGA* PGALGTD 
LSLLIRAELGQPRDLLGNDHIYNEI *TG 
HAIX3ILFFIGLPI IIEGFGN* LHALI IG 
APDMALPR INN I S F * LLPPYLLLLLAS A 
IVEAGASTO+TVYPPLAGNYSHPGASVE 


2919 


16820 


A 


2938 


3 


382 


RTRGLFS TNH I D I LTLYLLFGA* AGVLG 
AALSLLIRAELGQPCNLLCNDHIYNVIV 
TAHAFVI IFFIVIPI 1 1 *CFGN*LLPLI 
IGAPDM^PRINNISL*LLPTSIiLLLIA 
SAILEA*SGTG*TV 


2920 


16821 


A 


2939 


225 


3 


NISWRMKQIVPKGEPYNGVPVFVSLWFH 
KMFI FETGSHS VYQAGVQ*RHLGSLHPH 
PPGIKQFYLSLPSSWDY 


2921 


16822 


A 


2940 


1 


247 


PTRPPVIYSTIFAGTLITALSSH*FFT* 
VGLE I NMLAF I P VLTKK INPRS TEAAI K 
YFLTQATAS 1 1 LL I A I LFNNI KKKKK 


2922 


16823 


A 


2941 


3 


369 


TRDSTYLGHHTPPVHTGIiRYWRILFITS 
DAFFFAGFF*AFYHSSLAPTPRLGGHWP 
PTGITPLNPLEVPLLYTSVLLASGVSVT 
♦AHHSLVEDDRYQI IQALLITILLGLYF 
TLLQASEDFE 


2923 


16824 


A 


2942 


3 


401 


LTVCCVIVIiRLKTLFFFFNKPFLTQKGY 
FNTPEEGFFKKPNRRVGPPSPMTDPTMIj 
TNLIKGKVPBCAPPRILMGGGINMTF*GF 
VTPRAPFPLTLRLNPMLQQGIDLLTLKA 
SGGSPASGNFLKELGLRSINF 


2924 


16825 


A 


2943 


189 


348 


RNARGWVTDKEKRLRI/3MYAHACNSSTL 
GGDGR* IA*GQEFETSLANMVKPCL 


2925 


16826 


A 


2944 


2 


349 


ANSNYERTHSRI 1 1 LSQGLQTLLPLIAF 
**LVASLANLALPPTINLLGELSVLVTT 
FS * SNITLLLTGLI ILVTALYSLYI FTT 
SQWGSLTHHINNIKPSFTRENTLMFILL 
SPIL 


2926 


16827 


A 


2945 


621 


929 


GCSSGTGCCPILCDLPRPWSCRGVGGSP 
SSTAHLCPRGWRSGRCFTiPPLSAS*VDS 
AMSLIQAAKNLMNAWQTVKASYVASTK 
YQKSQGMAS LNLPAVS WKM 


2927 


16828 


A 


2946 


379 


395 


SQHFGRLRREDHEVRSS * PRDP PALTSQ 
SAGITGMSHCARPLVATSIHKIDNRYID 
DRS*NINIGTI FINLLIFYLSI YIjSIYL 
SIYLS IMYSFTVAQAGVQWRDYGSLQPE 
SPGPK*SSCISLSSSRNHSHTPPHGWVD 
PKIP 


2928 


16829 


A 


2947 


2 


357 


HTYD I RKPRR * TLACALSALLTTS GLAM 
TTCFHYITLLILCIJjTNTLTIYQ*WRDE 
TRESTYQGHHTPPVQKGLRYGI ILFITS 
EAYFFAGFF*AFYHSSLAPTPQLGGHWP 
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PTGITP 


2929 


16830 


A 


2948 


2 


357 


PRVRYSTOHSDIGTLYLLFGA*AGVLGT 
AISLLIRAELGHPGNLLGNBHIYNVIGT 
AHAFVIIFFIVIPIIIGGCGN*LGPLII 
GAPDMAFPRINKTS F* LLPASVLLLIiAS 
AIVEAG 


2930 


16831 


A 


2949 


2 


217 


KNLKIVQYGDMCLWSQLLRELRWEDRLS 
SGG * GCSELCS CHCTPAWTTQQDCLrKKK 
KNFWPVGEVKGRHML. 


2931 


16832 


A 


2950 


2 


354 


AIPMTTARLTIHEAYLI I LERTTTTTKD 
VKNPRRIAAAITASCLGGGLEDAISCQY 
GIATKDRPTGLGTPEVLLGALPGAGGTQ 
RLPKMVWPAALDMRLTGRS IRAHRALK 
M*LVD 


2932 


16833 


A 


2951 


3 


150 


LFS CS PTFSSDPLTTPLLILTT* LLPLT 
IMASQRHLSSEPLSRKKKKKK 


2933 


16834 


A 


2952 


283 


344 


EKKKK}QCKKKKKKKKKKKKK*KIjRKKKK 
KKKKKKKKKKKKKKEK 


2934 


16835 


A 


2953 


3 


378 


DAWADAWNQTPIiFN* SVLITAVLLLLSL 
RVLAAGITILLTDRNLNTTLFDPAGGGH 
PILYQHLF*CFGHPEVYILlLPGIiGIIY 
RIVTDYYGTKEPFGYIGMV* AMRS IGFL 
RFIVRAHHIVTVG 


2935 


16836 


A 


2954 


3 


387 


ILYQHLY*FFGHLEGYILILPGPGIISH 
MVTYYSGKEBPFGYIGMG*AMISIGFLG 
FIGRAHHI FTVG I DVDTRAYFTYATI I I 
AIPTGVKVFS*LATLHGSNMK* SAAVL* 
ALGFIFIiFTVCGLTGI 


2936 


16837 


A 


2955 


2 


432 


PRVRPRVRKHIAYPFLVLSL*GIIITSS 
ICLRQTDLKSLIAYSSISHIALWTAIL 
IQTP*SFTGAVILI IAHGLTSSLLFCLA 
NSNYERTHSRI I ILSQGLQTLLPLKKKK 
KKKKKKKKKKKKGGGLLKES LGGPNLTG 
EGK i 














2937 


16838 


A 


2956 


12 


391 


SYFISSSKPHLSPPWLSSPDEATSKKKK 














KGGGFK* KRIGGSFKKGGGEKKIFFFKG 
GKKKKTGGFFEKKFFLGGEKY* DNLPKK 
NKTVGEKKNFLGGG 


2938 


16839 


A 


2957 


1 


352 


PTRPYFPVDAGEAQHHPRTCRRPLRALW 
SSHHERWKVTLCTHCSLGVFFLYCCTYY 
IFVLFIP*SSCGLTIiIFITCI ILFGSIS 
FFLFFTI VFS 1 1 1 VTTFKFRLLYSIIFIi 
SYLLC 


2939 


16840 


A 


2958 


1 


243 


NLPRLNPKEIEILNRSIICNKIKAI IKS 
LLSKKSSGHNDVTAEFY*TFKEELITIL 


2940 


16841 


A 


2959 


373 


3 


FSSLKKRVTPPPPPKTGFSLEGLHLLKK 
NFPQKPPPPKKSFSQKNPPPPKKKPPF* 
KKKPPPPPPI*HPPPKILQPPPPPFFFF 
FFFPFFFFFFFFFFFFFFFFFFFFTDMR 
GFIVRKFRTRG 


2941 


16842 


A 


2960 


2 


136 


PRVRSTLPISYKWNNRALMKAHLIMK* F 
TEYFKTTDELYLHDNT 


2942 


16843 


A 


2962 


1 


358 


HTPPVRKGL* YGI ILFITSKVFFVPRFI 
* AVYHS SliCPTLQLGGLWPATGMTWLNP 
L*VPLCTTFVLLA* * VSIT* AHHNLIDL 
NRCHVIYALVITILLRLYFTLLPASEYF 
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nucleotide deletion, \=possible 
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EAPLTIS 


2943 


16844 


A 


2963 


2 


373 


RAYDIVRPRP + PLLGALSALLMTYGLAM 
*VHWDCITLLILGLLTOTLAIYQ*WRDV 
SRBSTYQGHHTP PVQKGLRYGI ILFI TS 
EDVFFAGIV*AFYHYSLAPTPQLGGHWP 
PTGITPLNPLEV 


2944 


16845 


A 


2964 


403 


61 


LFFPLEKNFLPPGFSAFFSPFSP*KFFF 
SPKALIFWGNFSPFFPPPKIRFLPKIPP 
WVFFSPPFWBKLFSSPPPLNFGPPRVLF 
KGPPLFFFFFFFFFFFFFFFFRFEGECW 
RL 


"2945 


16846 


A 


2965 


2 


360 


LI PNLAMVTR*GNHPQRLNAGTYLLFDT 
LDGSLALLIGLTYTRNTLGSLNVILLTI 
TAQELSNS * ANNLV* LAYTLAFIEKI PL 
YGLHL*LPKAHREAPIAGSIVLAAVLSK 
LGGSGKN 


2946 


16847 


A 


2966 


321 


2 


STGMHFPHINMAINPPPPPRPPPPLFPP 
NPKTKPNPTQ * KGGFPPLGP F * KKLS PF 
LTLFFFFFFFFFGHHPGPWQKKKKKKVF 
FFCVCFS FQRVHNI YKNTHQQQ 


2947 


16848 


A 


2967 


200 


1 


RRT YTSHLLACLRQGLAFS PRLEGGGKI 
RAHCSLQLYGSSDPPT*APOTAGTKQHN 
QRIAQCNADN 


2948 


16849 


A 


2968 


2 


354 


LLTASSSEIAPLQSSLGDRARLCLIK*K 
EGVhHS I * SGNQGKS YANVYRLL YLD P I 
PKI YAEAYTP *NVNSTNLBTKS PKTIQK 
FPEDRE FKNDFFE KTKNGDRETGAFHPF 
LLFLV 


2949 


16850 


A 


2969 


99 


400 


ALGIHFIFTVSGLTGIGLANSSIiHIVLH 
DTYYWAHFHYVLS LGAAFAI IGGFIH* 
IPLYSCYTLDQTYAKIHFTI IFIGVNLT 
FFPQHFLGLSGMPRRY 


2950 


16851 


A 


2970 


2 


377 


NILLLTLTAQELSNS * ANNLI * LAYT IA 
FIVKIPLYGLHL*LPKAHVEAPIAGSIV 
LAAVLLKLGGYGI IRLTLILNPLTKHIA 
YPFLVLSL*GI I ITSSICLKKKKKKKKK 
KKKKKKKKRGGPF 


2951 


m52 


A 


2971 


1 


407 


GTRSYTHLYRVFLELIKMSVYDLNHTVI 
MVI SGHVRLAFYG I VHLTL I LNLLTDH I 
LYPFLVLSLSGVI I ISSIFLRRTDLESL 
IAYS S I SHIALWTAILIQT P * S FTGAD 
ILIIVHGLTSSLLCCLANSNYER 


2952 


16853 


A 


2972 


10 


259 


SRSVAIYFKGMASA* *RMFSSKKKKKKK 
KKKKKKKKKKKKKKTAITKKTTTAKWKN 
*RTEKIRHTRLPLISSRDAKCVDFLYT 


2953 


16854 


A 


2973 


12 


400 


LNCRTPSLYLQRAGELLSVBNPHIWCQK 
CVRKNNFFLFFFFFFKTDLYCPH I CXAL 
S ITCFLAI S ISGLFAFFLLNS FNYHF 1 1 
VP * NFS LS I YLCLHS LFYVG FFFS FDMM 
I FS I LT * TLFFCLLFHL 


2954 


16855 


A 


2974 


125 


3 


RPRRPAAQVGVQ * KNLSSLQPVPPGFKL 
FFPLSLLSSWAS 


2955 


16856 


A 


2975 


2 


332 


RECTNYPAQPPPPPLEAEEGFQPLPPTD 
TG*LECLPPLGGGQHRALGLGATWHLEN 
ACALVLADLGTGPKPSGPFSGQACGASG 
RLPLLSGPI LPPLGQGHTLS AWGHHA 


"2956 


16857 


A 


2976 


7 


329 


SRDILVMTEDGEFF+GHV*P+DRSRRAD 
TPRLGGHWPPTGITPLNPL*VPLLNTSV 
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OCysteine, D=Aspartic Acid, 
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nucleotide deletion, V=possible 
nucleotide insertion 














LLASGGSIT*THHRLIEYIRHOI IHALL 
ITILLGLYFTLIiQASEYPBSSFT 


2957 


IUOJO 




7977 


9 




AHHI FTARID VDTRAYFTS AT 1 1 IAI PT 
GDKAFS * LATLHGSNMK * SAA VL * ALGF 
IPLFTVRGLTGIVLADSSIxDIVLHDTYY 
VGAHFHYGLS IGAVFAI IGGF IH* CPLF 


2958 


16859 


A 


2978 


394 


0 


QGCSEL*SCHCTPAWVTE* FPVSKKKKQ 
RK 


2959 


16860 


A 


2979 


149 


383 


PLCFPSQHWGKSSNCLSSLKYFPSSGLP 
LRCKAELDDVKQKADKELDWAHACNPS 
TLVGOGGRITLRSGV*DOPGOH 


2960 


16861 


A 


2980 


2 


378 


ARVS IGFLGLI VGAHDMFTGRMDLYTRT 
YCTFATILIAIHTGVPVFI *LATLtHGCN 
MK*SAAVL*AIiRFIFISTVCGLTGIVLP 
N* SLDI VLHYMYYAVAHLHYVLS IGAVF 
AILGRFIHRLPVF 


2961 


16862 


A 


2982 


2 


409 


PAVAEAYLKPWDDSKGSFLWGKPDLDG 
IRECCQRNFGWNRTRTDESLFPVLKQLD 
AQQTQLRIDSFFTLAQQEKDDAKRIKSQ 
RLNRAVTCMLRIEIEAASSBIEAVSVD* 
QKELELIDKAVACHLEYMYETDP 










a 




ui\c au* q n wa<3GOR ff rczva p a p inrprrnT? & 

IlnQnluunlUu uyKAunnirnu IvivV— L3/n i% M. 

QHEGLELRVENLQAVQTDFSSDPLQKW 
CFNHDNTLLATGGTDGYVRVWKVPNLEK 
VLESKAHDGEIEDLVLVPEGML+ IVCPD 
HYSPVGINDFMWLL* LNWGNRLYFPHI S 
VYTjM fhprtp 


2963 


16864 


A 


2984 


3 


417 


LILPGFGI ISHI VTYYSGKKEGPG YICM 
V*AMTSIGFlJGLIVRAHHIFTVGTDVY < ? 
RAYFTSAT 1 1 IAI PTGVKVFS * LS TLHG 
SNMK* SAAVL* ALRF I FLFTVSGLTG IV 
LANSSLDIALHDTYYV*THFHYVLSI 


2964 


16865 


A 


2985 


273 


408 


MKIFIF*VSP£>LFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFLAVFTFI 


2965 


mow 


A 








TFOPSELWRTALLPRLECSGLI IAHCSL 
ELIjGSSSPLASAS * IAGTTGS ILRY 


2966 


i UOV / 


A 








KKEFKIGRKAAEKMTRNINNAFGPGTAN 
ECTVQWWFKKFCKGNKSIjEDEECSGRPL 
EDDNDQWRAI lEADPLTTTREVAEEIiS V 
NHSMAVQHLK*VGKVKKLNKWVPHEt»S 


2967 


16868 


A 


2988 


2 


158 


PGWSLTPDLR*STCLSLPKCWDYRHKRP 

sptsrqtfdhapnknsshspp I ym 


2968 


16869 


A 


2989 


435 


23 


GWPPNPQKSFYFPQRLKIWGGGGRKRP 
PPKKKGFSQKTPEGF*KPPPKRRKKIFH 
DPGKKGPPKGIFKRGPPLFFFFYFFFFF 
FFFFFFFIITIECLHSRFPHRHHNKKFP 
PNPPS PR FWPQHLNTS LPNP KNKE P 


2969 


16870 


A 


2990 


410 


3 


GGRGHFFSGAFFIKFPWNKKGISQPLCF 
PRGGGS PPPLGPVRGGGPPCWGPLCHKG 
PVKKTGAPRGKNGISPFFCFPPLGSFNR 
SGFFWALLGSPPLFFFFFF*DRVSLHHP 
GWNAVAQS * LTKTSTS WVQVILL 


2970 


16871 


A 


2991 


371 


132 


KQSLPFKVKGFFFPPEVENAIYPDSHTA 
FYSW * KRSTFSKKKKKiCKKKERKRKKEE 
RKEGRKEGRKERKKEGKKEIENA 


2971 


16872 


A 


2992 


336 


1 


CPRWSRTAGLK*STCFGLPKCWDNRHES 
PCPASSVLND*QLSTVQRAT*WQTKSLV 
LKAPPARGALPVSRAVSPPPSSHHFAYF 
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nucleotide deletion, \=possible 
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LHMI *QVSCLPLR*SLALVPQAGVQ*RD 


2972 


16873 


A 


2993 


317 


440 


HLGRLRQEHHLSPG* LRKFFFFFGRDGV 
LMLSMLVFNSWTQVMLLPQPPQVU5LR 


2973 


16874 


A 


2994 


2 


181 


FHHVGLDLLTL*SACLGLPKCWDYMEP 
LCPALVI LLTAKFTNFR YRVKNVCT FHV 
ESN 


2974 


16875 


A 


2995 


1 


165 


GFHCISRDGLDLLTS * YARLGLSNCWDY 
RHEPPSPAPLFIS YS ICLFLSKLIQFL 


2975 


16876 


A 


2996 


405 


2 


KKGFPIFSRMFFLYKHIEKPPLASQKFG 
DPKWSPHPRP I FFFLLKKGVLYVWREGF 
KFHPP*FPPPGPPKRWE*RVKPPHPPPF 
FFFFFLYVEMGSHCVAQAGLKLLGSSTP 
PTSASQGAEIAGVSHRAQPHAS 


2976 


16877 


A 


2997 


2 


415 


CLFTGGGIiTGIGS AGSSLD I VLHDTY YV 
VAHFHYDLS IGAGFAI IGGLIH* FPLFS 
GYALDQTYARIHFTI IFIGVNLTFLPQH 
FLGLSGMPRRCSD* PDAYTT* SMLSSVG 
SVTTLTAVIVLI IMIREAFDSKRTVL 


2977 


16878 


A 


2998 


349 


370 


HHFY*SITAFRPFQHMELRNFFFFFLFF 
FFFFFFFFVFLFI FliFLFYFLFLYEI YL 
ILFFL*HHFYSAV 


2978 


16879 


A 


2999 


36 


139 


LCHCTPAWETFSKEKKKKBKRKKKKKKK 
KKKKKKKPGGGS * KTALSHDCATVLQPG 
RLFQKKKKKRKKGKKKKRKKKKKKKK 


2979 


16880 


A 


3000 


104 


373 


SADREXXS KTDNLLGH * TNVNKCKVPRV 
IQSVFSSHSGLKIiEINNRKMKEKSLNT* 
KINNISYWVQCSLYNLKSNSYTKSSDNT 
TTQYMY 


2980 


16881 


A 


3001 


1 


396 


LDCSKISSYLQKSSSHVLFFSFSFFFFF 
GGGGGFAIiENPYPPAGLGPQKKKTLSPP 
WLVGGPPFPEKAPP*GGHFLGKKPQKNP 
GGQNPKRPSGEGNPFFQPPGGEKKQIGP 
PTGFGGKPFFSF*PREPPGP 


2981 


16882 


A 


3002 


412 


104 


FFFFFLPVRQTFYPQPFSVFFPLFPFKF 
FFFP * AFNFFWGFFP I FSPPKFGFFS KI 
S RLVFFS L PFWEKFFFFL P P FFFAPLRF 
FFKGPPXFFFFFFFFFXXY 


2982 


16883 


A 


3003 


400 


46 


IiFFFFFYKLFPPPAFGGFFPPFPI**NFF 
FPPGPFFFLGGFPPFFPPPK*VFFPKIP 
PGFFFPI?PF*KKFFFSPPPFFLPPPGFF 
FNPPPPFFFFFFFFFFFFFFFFFFLCW 
VQVEAY 


2983 


16884 


A 


3004 


411 


69 


YSPLPPLFFSSPPSKFPWPPFSLFFLTR 
VYKGFFFFPFFPFELFGPPGLSLGSKPP 
PVFLGGDPAFFSIPHPRVGSLPPPPNWP 
FIGPSFW*ALFPAPPLFFFFFLSSFFFF 
F 


2984 


16885 


A 


3005 


3 


200 


DAWG*LFSTNHKDIGTLYLLFGA*AGVL 
GTALSLLIRAELGQPGNLLGNDHIYNVT 
VTAHAFVII 


2985 


16886 


A 


3006 


259 


146 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFGLLNDILL 


2986 


16887 


A 


3007 


118 


414 


QNQPPQNKATHTVKIEKKEKPETKTVAK 
EHNKAKTAEKSEE+TKKEVKGGKQEKVN 
HTAAKVKEVQKTPSKPKEKEDNKKAAVS 
KHEQKDQ YAFLRYM I 


2987 


16888 


A 


3008 


427 


111 


FFFFSPVGNFSPPQQFPFFFPPFPPKIF 
FFPPPL*FFWGGFPPFPPPPKKVFFPKS 
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PPGFFFPPP*GKKFIFPPPP*FCPPPGF 
FLSPPPPFFFFFFFFFFFFFS 


2988 


16889 


A 


3009 


3 


302 


SLASCLSYLVCVIFljGQPKPTI*LQNST 
PHKK*NPTERYVKTCTQIFIALLFKKEK 
QPRCPSAGEWINKMLYACTIEYWLAIKR 
YE XLI YATV* MYIiEKI 


2989 


16890 


A 


3010 


1 


409 


RLHDATFPIIEELITFHDHALVTTFLIW 
LLVLCALFLTLTTKLTNTNI SHAQEI ET 
V* T IliAAI ILAIjMVIjPSLRIL Y I TDEDN 
DPS LT I KS IGHQWY * TYE YTDYGGLMFN 
SYILAPLFLEPGDLRLLDVDNRW 


2990 


16891 


A 


3011 


157 


2 


GRVDLKIQKLARCGGACLQSQHTQQNHL 
NPGE KGCSES * LP P CPPD * VTKO 


2991 


16892 


A 


3012 


2 


423 


ARAARAHTVTYYSGKKEPFGYIGMG*AM 
ISIGFLGFIGRAHHIFT* * IDGHTRAYF 
TSATI I IAI PTGVKVFI * LATLHGSNMK 
*SAAVL* ALGFI FLFTECGLTGI VLANS 
SLYIVLHDTYYVVAHFHYVIiSIGAVFAI 


2992 


16893 


A 


3013 


2 


140 


ARANILLLTLTAQELWDPRANNIiI *LAY 
TT iAF T VKKPTj YGIiHTj* LPKAHVETPMDG 
PILlxftAKlxLKlXSGSGIIRLTLIlxNPMTIt 
HIAYPLLGLSL*GI IITRS ICLRQTELK 
WLIAYS* ISHIALVVTDILIQTP*SKHF 
TTNSHCPRTMGPQSQQLNMTSLHTSFYS 
KETSLRTPLMTP 


2993 


16894 


A 


3014 


2 


420 


PYLAAG I S IliLTDRDLLTTLFD PGGGGD 
PILYQHLF* FFGHPDDYILILPGFGI I S 
HTVTDYYGRKBPSGYIGMV*AMTPVGFL 
GFIE *AHHIFTVGIDAHTRAYLTS VS II* 
IVI PTWRQVFS*lxATLHGSNMT* YAALF 


2994 


16895 


A 


3015 


6 


292 


AHHIFTV*IDVYTRAYFTSATIVIAIPT 
GVKVFS * LATLHGSNMK* S AAVL * ALGF 
I *LFTVGGLNGIV* S Y*LLDIELHDrrY* 
FVMGCPRKVYF 


2995 


16896 


A 


3016 


3 


422 


TPII IGGFGN*LFPLIISAPDMEFPRIS 
NISL*LLPPSIlxLLLASAIVEAGT*TD* 
TVYPPLAGNYSHPGASVDLTIFSLHLTG 
VSS ILGAINFITPI INIRPPAITQYQTP 
LCA* CDLMTAVLLIxIjSIjPDLAAGITILL 


2996 


16897 


7V 

A 


3017 


2.1 


AQtC\ 
HyKJ 


TP F PGRHLTMFS LHLAGGCS I LGAINY I 
TTLINIRPPAITQYQTPLFV* SVLITAV 
xxHLLSLPGLTAGVTIJXADQl^lxOTTFFD 
PAGGGDPILYQHLF* FFGHPEVYILILP 
GFGI ISHIVTYYCGEKEPFGYIGMC*AM 
ISIGFLGIIV*AHHIF 


2997 


16898 


A 


3018 


98 


402 


LRSQHSKSFQISGKPSQEEWPQISPDST 
DYI INT*HFNAQMLKNIY*HQPHGLHDF 

KKASSSSSKKDPPGGGPT 


2998 


16899 


A 


3019 


399 


229 


PPPPGGGGPQGPPPPRGGFLPKSPGGVF 
YPPPRGGKIFSPPPGFFGPPRGFF*GAP 
p 




16900 


A 


3020 


2 


401 


S DA VL* ALGF I YLVLEGGLTG I VLADS F 
LDLELHDTYYVGAHFHYVLS IGAVFAI I 
GG FVD * FPLFSGYTLDQTYAENHFTI IL 
IGANVTFLPQHFLGLSGMPRRYSBYPDA 
YTT*NIT^SVGSFIxTiAAALL 


3000 


16901 


A 


3021 


413 


67 


PPPPGKIFFKKTPKKKIFPPPQF* IFFP 
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K=Lysine, L=Leucine, MNMethionine, 
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PLPPKKFFFS PNP * FFLGOFS P FFPPQK 
KI FFPRI PPNFFFSPPLKKKIFFFPPP * 
IFPPPRFFLKPPPPFFFFFFFFFFFFFF 
FFF 


3001 


16902 


A 


3022 


2 


332 


LTLS I LS I IAGS * GGLNQTQLRKILAYS 
SITHIG*IIAVLPYNPNITILNLTIYII 
LTTTAFLLLNLNSSTTTIiLLSRT*NKLT 
♦LTPLIPSTLLSLGGLPPLTGFLPKW 


3002 


16903 


A 


3023 


1 


412 


RGPPFFFFFCVFFFFFFFFFFFFNKGCR 
HSKRFFFFFKKKKKSSRPTRDRV*FYPK 
GWRS PLFLFFS PGGRGFFPLSHQ VGFSN" 
BVLVVFKKNLELFRGSPARKKKKKKKHIj 
B 


3003 


16904 


A 


3024 


279 


1 


LGRNTELWKSGKGMDILKTNCGKLANEP 
FRQPRVLGIGGEAPRAGSGPPSRAPPA* 
TPGPSSAGS WP * PPGTGRAPRGPAPSAP 
GARSPGRPG 


3004 


16905 


A 


3025 


151 


401 


KKPLGGPNLTGEGKKKFFSLKGGKKKPP 
GKFLKKTFFLGGEKMGKTPPKKIjKP * GK 
KKIFKGKRGKKNPKTLAVKKFSKKKKK 


3005 


16906 


A 


3026 


416 


140 


YLSPLKKFFTPPPLRMFLPPNPLKNIFF 
PPQLKI FWGGWAQNSPPPKKGFFSKNP K 
RVFLPPPIRKRYNFPPHGKILAPPKNL* 
SAPPPIFF 


3006 


16907 


A 


3027 


379 


31 


PPPRRAGVFFFKNPKPKPPPPREGGRF * 
PFPPLKF*FFPKPQNFLGGGGAIPTPPP 
KRGLGQI PTERFNLSPPTQKRINFPPPG 
KGGPPPPLLKPPPPPPFFFFFFFFFFSL 
NSFI 


3007 


16908 


A 


3028 


28 


420 


MQQTTMAHI FLCNKFANCAHVPRT*SET 
KPMSTPLQFD * TYKGEKS CKYAEHERTW 
KQ* CVFSLYQI I PT* EKTWKCNQCX3TNF ' 
NQFFKQTTHL*NHTRDNQICFSKIGLEY 
YYRITTRQHLLKLRTVCIL 


3008 


16909 


A 


3029 


I 


401 


LGNNGEAVSEKRKEKSQKEKSHNVWGF 
FFFFWGKPPFVPQAEGQGRNFC* PKPWP 
PGWGEFPGPAPRGGGNGRQKQPGRGNFG 
V*REKGGSMGGPRGA*NPGPKNPHPWPP 
QGPEITGGTTRPHPSGIFKKT 


3009 


16910 


A 


3030 


285 


397 


MFIKGDGLNKLRPGAAAYACNPST3jGG* 
AGRITRSGD 


3010 


16911 


A 


3031 


265 


2 


KKESSCIKATNSNSLFFLF*SVFFFFVF 














FFFFFFFFF*HFKSALLWVTDLGFLNSD 
PRV 


3011 


16912 


A 


3032 


410 


49 


GFSPPPP*KFFFSPKPLNFWGGGGPNFP 
PPKKRFFPKNPPGVFYSPPKKKKKFFSP 
PP*NLAPPKIFLKSPPPFFFFFFFFFFF 
FFLSNVSNGLTNMYILPCKDPSCPTTFP 
ILGSLISL 


3012 


16913 


A 


3033 


177 


2 


VTPPFFFFFETRSHSVARTGV* *SDRCS 
LQPHPPSIK*SSHFSLPSSWDYRSMPPH 
AS 


3013 


16914 


A 


3034 


75 


395 


MS YKHXXKKKKKKKKKKGGAP * XKP WGG 
PKLTRDGKKXFFSLKGAKKKPTWKEWK 
TLILGGRKMGTTPPKKLKPLRKKKIFKG 
*RGKKHPKSLPVEKFASRGRIKK 


3014 


16915 


A 


3035 


400 


83 


KKEGPARWSPVTPPLLEGPVGRSPQTR 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
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/» viUIUIVWil, UlUJJ LUUUll) 1 |IUMIUIC 

nucleotide deletion, \=possible 
nucleotide insertion 














NF*PTPPTHRKSLPS * KSKIDWRGIPLY 
PPPPRVKPKKSFNPGNRRFC+TKIFPCP 
SPWAPKTHPPFQKKXKKRVYK 


3015 


16916 


A 


3036 


3 


196 


DSMPQT*NKSFAKARKKKKKKKKKKKKK 
KKKKKKKRGGPPLKKTLGGPKLPGGKKK 
NFFFFRGG 


3016 


16917 


A 


3037 


1 


411 


FCYDVCVESGCADYS IVT IMKKKKEK* K 
KKKKKKGGGPLKKNPGGAQNYPGVEKKI 
FSLKGGLKKTPRGNFEKKPYFGGGKNGA 
PPPOKNKPLGEKKKFKRKKGGKKliOFPW 
GKKISLPGFYLKKIYPPGRGFFNFS 


JUl / 








•a 
O 




NFFFKKPRGGNFFPPPKKGFFSPPSPLK 
FFFFPPPFFFFGGGGPHFPPPPKRFFFQ 
KPPRGFFFPPL*EKNFFFPPPVFFAPPP 
VFFLTPPPP 


3018 


16919 


A 


3040 


1 


111 


IGLSGMPRRFSDYPDAYAT*NILSSVGS 
FMYLQQDNK 


3019 


16920 


A 


3041 


76 


967 


QLLKGGVSGVCPLLMFRCVRSFFLLVGS 
WSSLASGVKPQTFAVSVTVLKAARLELF 
I PPRGLWSLASGVKEjQTFAVSVTAHKS 
SVDPKNSGAQLASPSGSRTRAAGGAACQ 
SRCRVPALIjSPWWDGTGRRGAGGGARR 
GGSGRTGAHGVGGRLRHGGIiHVPSPAPW 

AGPGAKSLTALCEQGWPAAPSAGPTKPT 
PTRNSS WPASVARS PGSRSCLSLHTSLQ 
AEGVGSSLGQPSKGLPQCSGGAEGIiLKC 
RQSGSPGRGGTESERGL+GLPQCSGGAE 
GLLKCRQSGSPGRGGTESERGL 


3020 


16921 


A 


3042 


39 


141 


LSIRGLNIIIKRQRL*DWIKQQDSTLCC 
P*EIH 


3021 


16922 


A 


3043 


2 


405 


LFSTNHIYIGTLYLLFSTRAGVLGTALS 
LLIPAELGOPGNLLGNDHTYNAIVTAHA 
FVI IFFIERP 1 1 IGGYGN*LGPLI IGAP 
DMAVPRINNITF * LLPPSLTiLLLASAIB 
EAGAGTG * TVYP P LAGNYSHPG 


3022 


16923 


A 


3044 


3 


134 


HLNPGGRGCS EPRLHHCTPS WATE * DS V 
SKKKKSRKGWTGLFI 


3023 


16924 


A 


3045 


44 


187 


DPRVRQYQTPLFV*SGLIFAGLLIiLSLP 
TTjGAGI T T LLTDRNIiHTAVF D PDGGGDP 
ILYQHLF*FFGHPEVYII*ILPGFGIICH 
IVTYYSGKKEPFGYIGMV*AMISIGFLG 
FIAGAHHIFTGGIDVDTRAYFTSATIII 
AIPTGVKVFS*LFJLIRPNU^TSPISP 
NPRGRHHYTTNRPQPPHRRV 


3024 


16925 


A 


3046 


434 


40 


, GAPPPPPGRFFFFLNPREDTFPPPPQKG 
GFSPPPPPKFFFSPPALFFFGGGWPNSP 
PPQKNFFF* KPPEFFFFPPFLKKKI FFP 
PPLFFPPPQIFF*TPPPLFFFFFFFFFF 
FFFFFFFRGCKINFIVRGF 


3025 


16926 


A 


3047 


1 


399 


LFTGGGLTGITVLTNSSLDIVLHDTY*VV 
PHFHYGLS IGAGFAI IGGSIH*FPLFSG 
YTIiDQTYAKIHFTI I FIGVKLTFFPQHF 
LGLSGMPRRYSDYPDAYTT*NILSSVGS 
LISLTAAILI IFMS *EAFASK 


3026 


16927 


A 


3048 


116 


367 


GASMILSSMIFLECIVGGFPVFSVYLFK 
LQILRQSSTMCFVLFCFFEIRSCSVTQA 
GV*RRGHGSL* PQPPGLSHPSSRDHGHV 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamk Acid, F=Phenylalanine, 
G=G!ycine, H=Histidine, I=Iso leucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

0=Glutamine. R=Ar<»inin*» Q^-C^ r ; llA 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *==Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3027 


16928 


A 


3049 


6 


345 


SQLLRIU 4 RKENCIiNIjGGGGYSHQR*HQC 
PIAWATE*DYLKKKKKNFSLFIjKTGGFY 
PFLKKAPLGTTLS PKNNT.PT.fi PT.YTflfPT 

GPWNWERKNFGKKKGGVIGLQGGKTI PK 


3028 


16929 


A 


3050 


261 


2 


NKKSPPVNLWWKMGFFFKFAKRVVLSWK 
GGGGFFFFSWRRS FTFVAQAGMQWRNLS 
SLQPRLPDLR*SACLGPPDC+DYRREPQ 
YP 


3029 


16930 


A 


3051 


160 


2 


ICVDEQAGVQWRYLGSLQAPPPGLATLS 
CLSLMSSWECRQPPPLG* FFVCPR 


3030 


16931 


A 


3052 


89 


3 


PLTSGLLLLT*QK3jAPIS IIYQISPSLN 


3031 


16932 


A 


3053 


344 


2 


HVEAP I AGS I VLAAVLLKLGG YG 1 1 RLT 
LILNPLTKHIAYPFLVLSL*GI IITSSI 
CLRQTDLKSL I AYS S I SH IAL WTAI L I 
OTP * S FTGAVI T.T I AHGT.Tfi T .T .PfT. A W 
SN 


3032 


16933 


A 


3054 


266 


2 


FOP P I S AYTK I S PS TjNVS T/LTVTtiR T T. Q T 
IAGS *GGLNQTQLRKILAYSS I THVG* I 
IAVLPYNPNITILNLTI Y I ILTTTAFLL 
LNLN 


3033 


16934 


A 


30,55 


3 


32 


KYNSLIMPTMIATITLLNLYFYLSPLLY 
**SSCPP 


3034 


16935 


A 


3056 


3 


33 


KNNSLI I PTI IATITLLNLYFYLSPLLY 
**SSSPPS 


3035 


16936 


A 


3057 


57 


445 


ANVWAPHGPAKLTNKDNYHIWKSKRLKI 
ANMTIKKLNEVIGLTLPDFKTYVEIjVQ* 
RQNAID*RKHKQPVKQSPBA*PHSYSQL 

SQKLNLHTDLTNFTKIN 


3036 


16937 


A 


3058 


311 


1 


RVGLLLKLNKISWPPPPFYGPS * EKEPL 
CFSQlGLFLTRTMVIiNNLHSPPVKTRPY 
NKIAP PPF. T.PP PFKT)R V<I T PT .PftUQ Tl\HJ 

*SQI/TATSTSQASSDPGRV 


3037 


16938 


A | 


3059 


138 


411 


WERPWKAQEAVFWI * VSAFWAPPPLMEK 
OIPPDxxEOHYRNVPCTVNRNOPPVQPPT.P 
WSLTVAQAGVWWRDLGSL* PLLPGFKRF 
LCLSLLS 


3038 


16939 


A 


3060 


1 


189 


FCRVGQAGLKLLTSSDPPASAS *SAEIT 
GVSQRAWSKITILKSSSFSYFPNSCKMC 
FWLICLCT 


3039 


16940 


A 


3061 


3 


406 


DAWADAWVLILPRFGRTSHIVTYYSGKK 
EPFAYIGMV* AMISMGFLGFI V* AHHI F 
TVGIDVNTRAYFTSATI 1 1 AIPTGVKGF 
S * LATLHGSNMK * SAAVL * ALRF I FLLK 
KKKKRGAVLKVPWGGPSLPGCG 


3040 


16941 


A 


3062 


451 


82 


PPPTNYFSPPPAFLPGGGGPPRPPPFKK 
WSPPTPPPWIMPPPKKKKKFFPPPRGW 
GPPPKIF*KPPPFFFF*KKNPPFSPPGE 
NRGVFFSTKPPPPWGKKNFAAPGAPPPP 
PFFFFFPRGGG 


3041 


16942 


A 


3063 


/HO 


i 
i 


pfppkoTj t PWRfw;PTc c iPPP'K'FrKPF , P'K"K' 
PPGVFFS PPKKKKKFFFPPPLNIiAPPKI 
FFKSPPPLSFLFFPFFFFFFFFFFFFFF 
FFFFPLSVQTLLKRTRAPPQPPLLD*EK 
APAPRVP +TGBGMPAVNVAFAPPPFYKE 
RPS 


3042 


16943 


A 


3064 


1 


409 


PTRPRESTYQGHHTPP VQKGLRYGI ILF 
ITSEVFFFAGFF*AFYHSSLAPTPQLGG 
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HWPPTGITPLNPLEVPLIiTT 


3043 


16944 


A 


3065 


1 


137 


HTFNFS I YQKATVI KTVWYWYNNRHTVE 
SPEINPYIYS*LIFFLF 


3044 


16945 


A 


3066 


4 


426 


KLEN* KMVLKEIKEDLNKQTDILFS * LQ 
RLITVRMSILPKIiIYKFSAIPIQIPA*? 
L* IKI I IKCMRKGK*TRIAETI FFFFFL 
SQSFILSPKLDHRGGITANCTPPWAIKG 
KI.T.LKKKKKPPKTKPEKFFIQKATGAEG 
GVH 


3045 


16946 


A 


3067 


411 


187 


RNLPNVPPRPTHFVIiLVKTGVSQVGQGG 
GKLLASKNPPSPAPPKSWDYRGEPPRPA 
PRKFFF*LNKFKIYGGPKN 


3046 


16947 - 


A 


3068 


193 


471 


QCTCIKVHSGQKTGSTPLVIGELQIKIT 
LGCYYTPTLMAQ I KKTDRTKCW * GYGAI 
GMIi ILCWRECKI VQSL* KRVWQFI I * LN 
IYLAIKLNI 


3047 


16948 


A 


3069 


323 


478 


FFXFXFLFFFILXYFYYF*KLFYLYLFX 
YI FKI YFI YNFI YI ILFYI I FI I F 


3048 


16949 


A 


3070 


1 


378 


GTRRFFFHSITLLILGLLSNTLTIYQ*W 
RDVTRESTYQGHHTPPCQKGLLNGI ILF 
ITSEVFFFAGFF*AFYHSSLAPTPQLGG 
HWPPTGITPLNPLEIPLLNTCVIlLASGV 
SIT*AHHSLIENNR 


3049 


16950 


A 


3071 


201 


2 


TTPIQLFLKHYHT*NFNYNFFFEIGSCS 
IAQAKVQWCDLCLLQPQTLGLKHSSHLS 
LQSSWDYRHA 


3050 


16951 


A 


3072 


261 


1 


EKAMGGGPFKVKKBPGEGPTTKGWPLKG 
PLEGGQRGLTGPFKSNGRLFFFFFGSNE 
VSLCCPG*SRTPDFK*SACLSLPKCWDY 
RHG 


3051 


16952 


A 


3073 


333 


52 


EIFKKKKKGGGRFFNKRVFPRPRVSNGR 
PRAQFFLETFFFFPERGFFFFFFFETES 
YSVTQAEV*WNYLGSQQPPRFKRFSHLS 
LPSSWNYRCK 


3052 


16953 


A 


3074 


7 


214 


SQLQENRLNPGGGGCGEPRSCHCTPVWA 
TE*DSVSKKKKKKKKRGGWVPPSLGGGP 
KKKNPFFNQEGGI* 


3053 


16954 


A 


3075 


180 


419 


QKHLS ILHYIiFKRDVFFFFLKGSFWSQ 
VGGQGHNIjG * LKAP P PRLTHFS CLTLRE 
TWKNRPRYFFCFF I KTGFHHVTRE 


3054 


16955 


A 


3076 


371 


208 


QKLSGHGGSRI**S*PF*EAKAG*QLVSG 
GRGCSQL* SHQCTPAWVTE* RLVCKR 


3055 


16956 


A 


3077 


349 


3 


TFCWQKYTMCRSLCHPHPPRTWSTKKKR 
PFFQKGQGDAPPYKKVQRGNPPPPLKGR 
PSRGPPKKCKVFKAPVFLFPRFSPPPFF 
PPPLFFFFFF* DRVLFCCPGWSAVAQLT 
ATS 


3056 


16957 


A 


3078 


124 


1 


ISSETPAVKSSFWPGVWHACNPSTLGR 
*GKEITRSRDRDH 


3057 


16958 


A 


3079 


205 


1 


CLVQNI WACVSHWRYIMSCKGC* RLGLM 
CVCVCVOTCVCVCVYFCCCCCCCYC*DE 
LSLCHSDCSAVA 


3058 


16959 


A 


3080 


3 


413 


OffiTQPVQKGLLYGI ILFIT*EGFFFSG 
FF*AFYHSSLAPTPQLGGHWPPTGITPL 
NPI»*VPLLNTSVLliRSGVSIT*AHHRLI 
♦NNRNQIIQALLITILU3LYFTLLQASE 
YFETPFTISDGIYGSTFFVATGFHR 


3059 


16960 


A 


3081 


1 


321 


NSLNPGDGGCS * PRLHYCTPAWGTELDS 
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ISKBKKYPYHVYHPSMKMTIYNYRQQNV 
NHI YIMLVEHSQTQENTCFM I SGNFFFN 
LP I VLGEGEKNQHS I SFKLFFNF 


3060 


16961 


A 


3082 


117 


484 


VLKYFIDSEVNAVLFSISCSFVTDFVFL 
FFFFGKGVSFCPPAGIKGGGFGFLEPLA 
' SGFKRI FFPNPLEKWE*RARPPPRGKFW 
NFFFFFFLKRELFFAPRWEGRGKILVYL 
KGPPWGYPHF 


3061 


16962 


A 


3083 


381 


227 


CISRDGFYHLGQAGLELLTSSDPPALDS 
*SVGITGVSHRTRPLLLRLNVQF 


3062 


16963 


A 


3084 


2 


391 


SHAYHIG*PSR+PVTGALSDLLMTSGLA 
M* IDFHS ITLLILCLLTNTLS I YQ*WRD 
VTRESTYHGHHTPPVPKGLRYGI ILFIT 
SEVFFFARFF*AFYHSSLAPTPQLRGHW 
PPTCITPRNTLDVPLLNT 


3063 


16964 


A 


3085 


2 


248 


IMRSGDRDHPG* HGETPSLLKYKRLAGH 
GGMRLWSQLLGKGGTADSHHHVLLILET 
FYSLRERRHLTSVPTLGMNYWAQDIR 


3064 


16965 


A 


3086 


276 


243 


EKWPD+SRAACPVLCRGNGQYSKGRCLC 
FSGWKGTECDVPTTQCIDPQCGGRGICI 
MG S CACNSG YKGE S CEEAPR Y I P EKE 


3065 


16966 


A 


3087 


3 


130 


GFYHVGQAGLELLTL*SACLSLPKCWDY 
RREPPRPAHTPPHS 


3066 


16967 


A 


3088 


277 


2 


SSSSVFCIjLVWTSSSSSSSAARLPPLTG 
FLPKWAIIEEFTKNNSLIIPTIIATITL 
LNLYFYLRLIYSTS ITLLP I SNNVKIK* 
QFEHTKP 


3067 


16968 


A 


3089 


404 


60 


FSHGKMRFFSPPSPKKIF^SPQSFYFLG " 
GGGAKMPPPKKRFFFKKTPRGFYFPP*K 
KKNFFFPPPVKFGPPRGFLKS PP PFFFF 
FFFFFFFFFFFFFS PTFFTVFHLMLKSD 
ND 


3068 


16969 


A 


3090 


405 


186 


INKKPEAFTNTVDQMVLTNSHRTFYPTA 
TACSLSGAHRTFSRMDHV*DHKTSIiNKF 
KTE I TLS TLSNHKLE P 


3069 


16970 


A 


3091 


371 


1 


SPPLVQKGLRYGI IliFITSEVFFFAGFF 
*AFYHSSIiAPTPQLGGHWPPTGITPINP 
LEVPLLNTSVLIiASGVSIT*AHHSLIEN 


3070 


16971 


A 


3092 i 


263 


3 


NKRS P PVNL W WKMGS FFKFAKRVKIS WK 
GGGGFFFFSWRRSFTFVAQAGMQWRNLS 
SLQPRLPDLR*SACLGPPDC*DYRREPQ 
YP 


3071 


16972 


A 


3093 


1 


392 


FFFADFKKMFILINHFKMELTTYFELKR 
NEATASENC*DAVKAVLGGKFIVLSTYI 
RKEERPRINNISFQIKHWTKKN*T*GKQ 
KKKKKKKKVTGPE I P KFLIVKSGKPPKV 
ILTGAWGPIKFLSFITRL 


3072 


16973 


A 


3094 


3 


367 


EM* I E I TMRYHYNTARRLKF * KTDNIKC 
* *GHGTSGTLIYHWQE * KMVQPLWKIV* 
QLL 


3073 


16974 


A 


3095 


3 


257 


HE VSQDGLNLLTS * SARLGLPKCWDYRR ~ 

EPPCLAWLXLPDDCVIFQKLKLLHHNLL 

NI*SCIDVLMGIYSLSNFSQSNFPFFFFF 


3074 


16975 


A 


3096 


189 


414 


KGLVMVTS*CKMFFCLSIFFFFFERGFC 
FFAQAGVQGHNLSSLEPLPPQLKQFFCL 
TLPRS * KYRPAPPCPANFY 


3075 


16976 


A 


3097 


I 


299 


ENYRPI SFMWTDAKII1NKII1ANQIQQCS 
KRITHRDQVGFL PGMQGQFY I * KS I KS I | 
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NAIHHISRLKKRNHMIPSIEAKKAFNTH 
S* FMKKNKKLAFQET 


3076 


16977 


A 


3098 


362 


126 


FIHTHTHTHTHTHTHIYIYI YI IASFQW 
NAIRGRMKCINKP* KDMEELKCILPSER 
SOSAKATYLLYDSNYITLBKAKL 


3077 


16978 


A 


3099 


3 


423 


RHEHAYHIVKPTP * PLTGALS ALLMTSG 
LAM*FHFHSITl2LILGLLTOTLTIYQ+W 
RDVTRESTYQGRHTPPVQKGLRYGI ILF 
ITSKVFFFAGFF*DFYHSSIiAPTPOIiRG 
HWAPTGITPLNPLEVPI^YTC^LLASGV 


OKI iO 


loy/y 


A. 


linn 




03 


IxP^JTNCaMGAPFKN'SPFFPOFLVKWS PA 
YFQNKEKDVGKPPFFFRAPDGAPFFKKK 
KK*GlxALSSKtxirYGGMISxxELLGSSDTF 
ASASRVARTTGQCHHAGPT 


3079 


16980 


A 


3101 


81 


247 


GGWGGPPLKPPLF*KKTGKNFWPPFFKG 
KEKPPPNPRGGGK3K3GPKPPPVTFFFF 


3080 


16981 


A 


3102 


3 


388 


HEKKEPF* YIGMG*AMIS IGFLGFI V*A 
HHIFTVGIYVDTRAYFTSASII IAIPTG 
GKWS*IxAJIXJIGNNMK*SAAVL*AI J GFI 
FIFTESGLTGIVIiSNSSLDrVIJIDTCYL 
VAHT.HYVTiS TGAVFAT 


3081 


16982 


A 


3103 


3 


387 


HERHEKLSNS * ANNLI * LAYT I AFIVKI 
PLYGLHL * LPKAHVEAPIAGS I VLAAVL 
LKLGGYGIimTLILNPLTKHIAYPFLV 
LSL*GIIITSSICLRQTDIiKSLIAYSSI 
SHTALVVTAIL IOTP 


3082 


16983 


A 


3104 


3 


381 


HEQSHAYHIVKPSP* PLTGATJ1AT1T1MTS 
GLAMGFYFHFITLLILGLLTYTLTIYQ* 
WRDVSRESTYQGQHTPPVQKGLRYGIIIi 
FIT* EAFFFARYF* AFYDFRLGPTPOLP 

C x, X £*nxr t v-rtXN. x x xxjc xvxjvjir x £>^XJK. 

GHWP PTG ITPLNS L 


3083 


16984 


A 


3105 


412 


111 


FIiGHPLFKKKMGGKKRGLPKMGV*HPPA 
PKGKPPPLKKKKKKPGGGGAPLYSPFSG 
GFGKKITPT^TPEGGGPKNPNS P P PP PPGR 
KKKNPPPLSPKKKKKK 


3084 


16985 


A 


3106 


298 


98 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
F*L*IIY+AICAINNIRGAGAEIiYRSRD 
FVCY 


3085 


16986 


A 


3107 


2 


417 


ATHVGLQDVTS PMI EQLITFHDHALMNI 
LI ICFLDIjGALFLTIATKLTxOTJM* YAQ 
EI*TD*TILPAIILNLMALPSLRMLYIT 
DEGDDPSLTIKS IGHQWY* TYE YTDYGG 
LIFNSYIRPPLFIxEPGELRLLDVDNR 


3086 


1U70 / 


A 


JlUO 


o 


din 


VGLVLFLSE LP LNGG I LTFFHOGI YSPF 
PGGRTWALMVGSWGLVMASTDxxLGPLCH 
AFTPATQLlxLNLAVAS PL * PAALR IGCH 
SKTTNALTHFLPRGTPTPPRP ILVTIET 
MSIxLIPPIAPIX}lUAAGFTAR.FT,T,T,HLS 
V 


10X7 




A 


Jiv7 


-a 
J 




PRAI KFYRDWPGHERJCRI AWKGDP C5TMV 
L I KDE KGLMCQKKKKKTPFFWAP KI PLV 
FPPAQKNQGS*PNPPGGGGNPPLRPGP* 
RKKPPAMGPPPSSPGGINPKSFFYLNP 
GPAH* PRGEN*LGPVFENLFLPLLKKIF 
LWGELRCSQT 


3088 


16989 


A 


3110 


398 


2 


SLFQKNPNPLVG* KKKGKALPQGPPPLF 
PPLGGAGPGGSQGRGWGPPRPPWGNPFF 
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G=Glycine, H=Histidine, I=Isoteucine, 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














P*NPKIWPAKGGAPFSPPSBKG*AGGFF 
LPRGGGFPLTQIPPPPPHLGGKSKPPFQ 
KKKKKNRQGTSWMMQGWGPQ 


3089 


16990 


A 


3111 


3 


371 


SSDPPTSAFQGSGTTDMCHHHHAQLI FN 
FFVBTGSCYVAQAGVQWHDHGSLLSQTP 
GLR*SSGATVLNWAPALGPRRPDPTRME 
SLVLMKPRGSLIRSACPDCFLVFWFSFF 
HEAEGCASEC 


3090 


16991 


A 


3112 


233 


449 


FALFS*LATLHGSNMK*SAAGL*ALGFI 
FLFTAGGLTGIVLANSSLDIGLHDTYYV 
EAHFHYGLSIGAEFAIIGGFIH*FALFS 
GYTLDQTYPKIHFTI I FIGGNITFLPQH 
FLGLSGMPRRYSDYPDAYTT*NILSSGG 
S FNALNAGS IT 


3091 


16992 


A 


3113 


458 


2 


RGPPPPPPPKFFFF*TPGKNSPPPPPEG 
VFFPPSPPPNFFFS PPPLFFFGGVPPI S 
PPPPKSFSPQTPPRPFFPPPPLKKNFFS 
PPPLFLPPPPFFFYPPPPFFFFFFFFFF 
FFFFFFFFFFCGDLEGELPGTGMLACVI 
LLRANRKARTRG 


3092 


16993 


A 


3114 


1 


418 


LNTTFFYPDGGGDPNL YQHLY* F*GHPD 
PDIHIIiPRPGIRSHIDTDYSGKKE PYAY 
VGMGWAMTSIGFLWLMVRARPLFTVGVG 
VDAQAYS SFAS ITIAI PTGAEVFS *LCP 
LPLSGMK* TGAAVWALGLRF I FTCSGR 


3093 


16994 


A 


3115 


1 


425 


PRINNISF*LLPPSI*LLIiLAYArVEAGA 
GTG*TAYPPLAGNYSHPGASGKLTIFSL 
HLAGGSSILGAINFITTIINIKPPAITQ 
DQTPLFV*SVLITEDLLFLSLPGIiAAGI 
T ILLTDLNLSTTF FD PAGGGD P I LYQHL 
F*FFDPAGGGDPILYQHLF 


"3094 


16995 


A 


3116 


2 


383 


GLSCTNHKDMGALYIiLLGARAGVLSXAL 
SLLIRAELGQPGYLLGNDHIYNDIVTAH 
AFVI I FFI VIPTVI/3GFGN* LGPL I IGA 
PDTAVPRINNIS I * LLPPCLLLRLACAI 
EEAGAGTG*TVYPPL 


3095 


16996 


A 


3117 


1 


259 


PTRPAL WTAIL IQTP * S FTGAVI LI IA 
HGIiTSSI^LFCLANSNYERTHSRIIILSQ 
GLQTLLPLIAF**LLASLANLPAPTPTP 
HQ 


3096 


16997 


A 


3118 


3 


174 


LIRGGRGCHELRSRHCTPAWATRARTLS 
QKK*KTTNPKKKLCLIFFGGKKKKLKKG 


3097 


16998 


A 


3119 


155 


1 


PDFFNKSMDKKKTARGWEDSSSS FCFFK 
RDRVLLCCPGWSAVAQS * LTAAS 


3098 


16999 


A 


3120 


420 


2 


PPPKFFFPPTPPFLGEGGAKTPPPKKIF 
FLKKPPGVFFFPPLKKKKFFFSPP * FLA 
PPKIFFKRPPPFFFFFFFFFFFFFFFFF 
FFFFFFFFFLVFIVLFQVKVHFLKKCFN 
IQFPLASDNS*PSMIHEKFYCESNIEF 


3099 


17000 


A 


3121 


48 


387 


RDPVLQKKEKKKKKKPKNQKKKKKKGGP 
F*KIP*GAKIKPGKEKKNFSPKRGAKKK 
NPGNFEKKTNFGGGKKWGKPPPKN*RFK 
GKKKFLKGKGGKKTQ I PWGLKI FFNGFD 


3100 


17001 


A 


3122 


237 


2 


PFPWLPPFPLKASSSP*SL* FLLGGWP 
NLPSSPNKGSFPKFPSWFLFRPP*GKNF 
YLALPR* PWPPQGFL*TAPP 


3101 


17002 [ 


A 


3123 


448 


130 


PPRFFPEFFYSGPPKPPFFKTPVFLGVK 
PGVFFFSPYQKKPTNFGPKMGAL*RIPL 
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nucleotide deletion, V=possible 
nucleotide insertion 














FETPIWVFPIKGFHKKKPPVLN*PPTRK 
PPPDKIIiKKKKKDDCISRLARN 


3102 


17003 


A 


3124 






QQttPTTf^nfSVPPPPP'K'T 7 FPPPDPT P^Wfi. 

GGGTKSPPPEREVFPKNHGGCFFSPP*K 
RGKYFSPPPRMGPPPGVFFKGPPPNFFF 
FFFFFFFFFFI FFFFFFFFFFTKKKNWF 
FFKAFRMSPKPVPPFFFFCNYRWFRPR 
VRPRVRPRV 


3103 


17004 


A 


3125 


443 


2 


VT7 P PTiYJP VT Wnn Q T SZ h IT\7T .TJP DP Pff TrrZT 
l r err vjk. v a rv iajhij v uivjf fxvxrxiivur i 

PFFPKKIFSIRPGWGGGPPLPPPQRQMW 

GGPPPPLFGLFRPQGKIPFPPKKKIKPP 

f2f2vr i vni\7rn7r r \7* TfinrKTtfPPPPPT.WAfiT .tt 

EPCFFFFFFFEMEFCSCCPGWSAMARSQ 

JLU-i ).in 1 1 


3104 


17005 


A 


3126 


3 


186 


PVI YSTIFAGTLITALSSH* FFT* VGLE 
GGGL 














3105 


17006 


A 


3127 


171 


38 


KKKKLFFPPREKWGPPKNFLKRAPPFFF 
FFFFFFFFFFFFFWSERSS * VA 


3106 


17007 


A 


3128 


16 


189 


ILGEVI WV+ * FF* FIKKKKKKKKKKKKK 

W tfW PfiAPWK 1 KTPfV^POTT'P/TPl'flif TTTP 

SL 


3107 


17008 


A 


3129 


401 


85 


LVNFFSPQEKRGFFPPPPPKNFFFSPRG 
PPFIjGGGGPI FP PPKKSFFS KNPPGVFF 
SPP*KKKIFFFPPPLFWAPPRFFFKGPP 
PFFFFFFFFFFFFFFFFFSQF 


3108 


17009 


A 


3130 


2 


312 


ANNL I * LAYT IAF I VKI PLYGLHL* LPK 
AHVEAPIAGS IVLAAVLLKLGGYGI IRL 
TLILNPLTKHIAYPFLVLSL*GGGVF 


3109 


17010 


A 


3131 


449 


3 


J?T?I?1?1?T?PT /2T!PPP T WPPFPPP DTHf P VPP 

LGGGKKKPPFF*NFPQKPLGVLGAPPPP 
T.rpp TTCfCTn? f^GKKVF<3 P PT.P * KGGPPK 
KFFFFPPPPKGPPPFLKNLRGWVFFKPP 

DLELADAW 


3110 


17011 


A 


3132 


95 


448 


VINRE*KX^C*MKKKKKKKKKKKKKKKK 


3111 


17012 


A 


3133 


60 


442 


LGGFFFFGGKKGFCLWCPRWGAKAGI PV 
NGTPPRGV*RNFLAQPP*EGGITGPPPL 
PQ* FGFLRENGVPIiRGPGGFEPPILGEP 
PPLPPOKGGKNGRNPPPPLKGFLVLFKK 
EFS SLVP S WKARGDP 


3112 


17013 


A 


3134 


236 


45 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFGAAD+VILFRYFQ 
FEATIMGV 


3113 


17014 


A 


3135 


441 


88 


KQQTPPGLI FF* KAPRREI FLPPPIMVF 
FSPPSPFKFFFFLSPFIFFGGVLPFFPP 
PKKGFFFKNPRRVFFCPPLKKKNFFFLP 
PFFFGP PRVFFYP P P PX FFFF FFFFP PF 
FFFFFF 


3114 


17015 


A 


3136 


2 


328 


TMLS PKPQQLNQQNCSPEHYE PQLKTQR 














KKKK3CRGGGQKKKMVGGEKKKPG*KIFF 
FFIKKVKKKTALGDKKKTQFWGGS 


3115 


17016 


A 


3137 


281 


2 


KATKSGTPI PSQGQQSLAWSWAGIGSAQ 
PPAIiLHS * PIGKIFKNCMPVGRKSPQLP 
RNTSWQLGAVAHPSNSSTLGGRGGRITR 
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E=Clutamic Acid, F=Phenylalanine, 
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W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=posstble 
nucleotide insertion 














SGVRDRTRG 


3116 


17017 


A 


3138 


44 c 




FFFLPPS PFFP Y* KRGF^IT.FRRVirB'MTrD 
PPVFPPPWPSQIFGFQALIFLPPPPPLF 
FSFPVGFFQfTALFPFNGFFPGFFHGLFP 
LLRFCPRK*VFWGGFFFFFFFFLRDKVS 
LCHPGWNAVAQS E FTTALTS KAQAPTRP 
PTRPPTRP 


3117 


17018 


A 








DR*TiF<»TMHKT?TflTT.VT.T.B , fl2i*2if2\7T QT "" 
JJf winiUUjlVJiiJXJJiJf Vjii AVJviJDl 

ALSIJ^ILAELGQPGNLLGNDHIYNVIVT 
AHALGKI FFIAI P 1 1 IGGFGN* LAPLI I 
GAPEMAI PRINNIS *GLLPPS ILLLLAS 
AIEEAGAGTG*TDYPPLAGNYCHPGASG 
DLIIF 


3118 


17019 


A 


3140 


362 


2 


KPRRGKFFPPREGGGGFPPPPPKNFFFP 
K3GGKFI/3GGGGKNS PPQKKGFFQKNPGG 
VFFPPPKKKKNIFFPPGKMGAPPGFF*R 
GPPPFFFFFFFFFFFFFFFFFGOSGOVK 
LKSPKCKL 


3119 


17020 


A 


3141 


440 


102 


PTPPPCCKFSFKRPPKKTLFFPTTNLVF 
FSPTPP*WFFFSPOAr.TFUnPT.aPTFPT> 

QKKFFFSKFPPIiFFITPPLIKKFFVPPP 
PFILSPLKIFYKPPPPIFFFFFFFFFSF 


3120 


17021 


A 


3142 


1 


79 


FKLDYFS I IFI PVALFVTWS I I E FSL* Y 
INSDPNINQF* KLDYFS I IFI PVALFVT 
WSIIEFSL 


3121 


17022 


A 


3143 


3 


441 


FFFPPPLKKKKFFPPPPNIGPPPKSL* K 
PPPPPFFFFF 


3122 


17023 


A 


3144 


223 


3 


LPYWKLPYUCH* * *LQimJQESRG*HFL 
RPRPFKNQMKSGTVAHACNPSTLGGRGG 
RXTR5GV* DHPGOHGET 


3123 


17024 


A 


3145 


65 


414 


KKKKKKKKKKKKKKKKKKKKKKKKRGGP 
PKKKTRGGPQNPPPKKKKFPPQKGGKKK 
PPLGF*KKTPPLGGEKIPPPPPKKNTPP 
KKKKKF*GGGGQTPPPPPPGKKFSPPRN 
KKKK 


3124 


17025 


A 


3146 


410 


31 


RANQKAFRGKPLCDLAVGKNLS SRTQIA 
LTI * KWINWTILKLRTSGH* KTPIKTI K 
R YP I EGEKI SDEEL YLQ Y* KVL * IGKKK 
PDNPVEKWANDLKRS FMKGNILTVFKGM 
QR*LGSMAHICSLS 


3125 


17026 


A 


3147 


182 


241 


SHPSHHSTINITNKGLL*TPLPIPNPLV 
NLNLGLLFILATSSLAVYSIL*SGGASN 
SNYALIGALRAVAQTIS YEVTLAI ILLS 
TLLIRGSFNL 


3126 


17027 


A 


3148 


254 


1 


KTKKGLKI KDPLTRF * IS VC * ITKS I * F 
FKLLPFFPLLKEHIPLKYLFFFFFLFET 
EFRSCCPGWSTMV*SLLTATSTSQIQAI 


3127 


17028 


A 


3149 


499 


2 


NTPPAAGGGCFFFFFFGEKNFPPPHPTP 
RFFPPPPLKNFFFFSRVFFFWGGGAQKA 
PPPKKVFF*KIPRGFFFPPPKKKKIFFF 
SPGFFGAPPGFFLSGPPSFFFFFFFFFF 
FFFFFFFFFFFFFFFFLINFLLS LQGFF 
LVSKELFLFGLTVKFTRGFRGFCGQ 


3128 


17029 


A 


3150 


256 


146 


KNAKVTQVGPEFNKGPG*HTHTHTHSHT 
HTHTHTLQ 


3129 


17030 


A 


3151 


333 


1 


TISCLCTRGEHPLSPRRAGPYTGSPLHC 
CVDWDKVFS SWKDLTDWPLGDLD I EYF 
TDGSS F I LRGVCRAGYAAVTLDS AVEVL 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














S VSAETSA*KAEL IALTRALWLGKEQK 


3130 


17031 


A 


3152 


43 


379 


NKTSLFGWDYIWEWGAPEPBTPPKRAAG 
ALHSLAQPFSVALPPCFFDPCAPSLSPG 
*HALRPLPLPGLASEIQTAPSWHVPPKS 
LSPAPQPCPIPTLVPVGYKTPP 


3131 


17032 


A 


3153 


398 


12 


NTTPGGGKFFLKKTREEKFFPPKKKRGF 
FPPPPPKNFFFPQGGNFFGGGGGQISPP 
QKKGFFQKNPRGVFFTPPKKKKIFFSPP 














FFFFFFLKKSWRPLAI 


3132 


17033 


A 


3154 


371 


105 


SPSPQVNFIKGPKPPPPK*IL*RAPNPP 
LPKKKFSNPPTWGPGPQPPPSKFSKFAR 
FPFLPPPFPFKKEPPKKKIFFPTKEGTV 
I*KNPPFSGFQSPDSIK 


3133 


17034 


A 


3155 


3 


371 


DVGADP ILHTSTGL* LAMQY* PEA*TAF 
SSIAHIT*DVYYG*VIRYLHANGA*IFF 
ICLLLHIGRGLYYRPFLYSKT*KrGLIL 
LLAT ITTAFIGYVLP *GPI * F* GATVMT 
NIiLS AI P YIGT 


3134 


17035 


A 


3156 


1 


398 


IAIPTGVKVFS *IiATLHGSNMK*SAAVL 
*ALRYIFLFTEGGLTGIVLPNSSLNIVL 
HDPYYWAHFHYVLS IGAGFAI IGGFIH 
♦FPLFSGYTIiDQTYAIIHFTIFIGVNLT 
FFPQHFIiGLSGMPRRYSDYP 


3135 


17036 


A 


3157 


401 


46 


PPGGRNFFKKTPGKKIFSTKKKKGFFPP 
IiPPKNFFFSPGGFFFGGGGGPNFPPPKK 
GFFSKNPRGVFFSPP*KKKIFFFPPG*I 
WAPPRVFLKGPPPFFFFFFFFFFFFFFG 
KKSVRA 


3136 


17037 


A 


3158 


404 


18 


FFSPPPPPPGGGVFPPNPNKNPFPPPPP 
PFFLGGGPPPPPPPPP*FPPPPTPPNVF 
FFSPPKKKNFFPPPPGPPPPPPKPPPPP 
PPPFFFFFFFFFFFFFFFFFFFLFFFEP 
MEKGPRPGDIGSNPPS 


3137 


17038 


A 


3159 


3 


192 


SLVIAGCPR*NLSSTLNLPTEPSKSPCK 
FNC*KKKKKKKKKKK3CKKKKRGGALKKN 
PWGGKK 


3138 


17039 


A 


3160 


205 


80 


VQRDNFGFLQPSPSGVKLFFCLSLPNKW 
DFRCGPPNPG* FFS 


3139 


17040 


A 


3161 


3 


384 


L I VPTI I LLPLT * LS KKHI 1*1 INTTTHS 
LIISI IPLLFFNQIKNNLFSCSPTFSSD 

FKRTPGGAAHWGGGGRETFFPKGGEKKN 
RPGVFWKQTFFWGGK 














3140 


17041 


A 


3162 


348 


70 


GPPPKKRWSKTPKVVLNKPPQKKKKFI 
FPPFVNLGPPKNFLKGPPPFFFFFFFFF 
FFFFFYFFWMGCDR*CSWRHSSPPRLSG 
TPKCSPSVT 


3141 


17042 


A 


3163 


2 


353 


IiKTI PLTSTCLTIGSLALAGI PFLTGFY 
SEDHI I * TANVLYTNA* ALS ITLIATSL 
TSAYSTRI ILLTLTGQPRFPTLTNIETK 
KKKGGPFNRYPLGAQVYGGGQNEKFFLI 
GREII 


3142 


17043 


A 


3164 


1 


221 


PTRPRDCSELRSCHCTPAWATEGDS ISR 
KKKKLSTRTAF*YTEAXNSLIYSLNIGT 
FKTYFPKIKTYDRHFDF 


3143 


17044 


A 


3165 ! 


26 


383 


I P FYQ * S h I * YTROXKKKXKKKKKKKK 
KKKKRGG 
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«< viuuiimiiibj rv rii gimnc, o — oerine, 
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W— Trvntonhan Y=Tvrn«inA 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \-possible 
nucleotide insertion 


3144 


17045 


A 






JO I 


QE IGAIIiANTVKPHL 


3145 


17046 


A 


3167 


391 


115 


LFKKISPHAGIWGFFSPI»TP*NFFFSLE 
PFIFGRGLAPIFPPPK*RFLSKNPPWF 
T P PPT ,Mfi FCPFP P P P DVP T /S D D T VC W Vn a 

PPFFFFFF 


3146 


17047 


A 


3168 


94 


389 


S PGILGQKGQ IGP IGNHVPGGLAAPVTP 
* FRFKPRLP * GFGPKAS PPLALKPERAQ 
VGGTPPPGPRGPNGQPPPFKENQPGLGF 
R RRfJTTT .tt RJTP fJFPT . 

t\_T KVjft, 1 Ii-vC»X\J\.vjX7 X14-I 


3147 


17048 


A 


3169 


3 


363 


WATAliQPGQQSETQSQKKKKKKKKKKRG 
GPPFFFFFKKKI FFSPPGAKNKRGKFF* 
KPPKKKKKIFFFPPPPLKKKKKKKKKKK 
KKKKKKMGGAFLKKPRGAP P P PGKRKL I 

FFFT.TCfW 

C IT X J_i.Tv.v7 v 


3148 


17049 


A 


3170 


2 


171 


KEPLGYIRMV*AMISIGFIiGGIV*AHHI 
FTVGIDVDTRAYFTSATI I IVI PTGVKV 
FS*LATLHGSNMK*SAAVL*ALGFIFLF 
TVGGLTG I VLTNS S LD I VLHDTYYVVAH 
c tlx vi_Jjioo w r I ijivxrlJNilv- 1 FJixCKyox 


3149 


17050 


A 


3171 


390 


27 


QSLTVKSPYPWILIKTKGHH*VMNAGL 
TRYQSLLCENPHIRSEVCITLNPPPLLP 
VSESPVKHSCVQVIiDSVYSSGPNL*DHP 
* TS VDWELYVDGIS FANPCKVSIiKKMTS 

fntr V X JtrKO 


3150 


17051 


A 


3172 


127 


310 


KNPGGAKIIaRGGERKNFFLKRGGKKKHL 

\J±C\Jl\J\x B C Y1\J\3 K*Juni£\l\j? tr IX* WJXt? Po if J? 

SP*KFFFSLKALIFFGGFCPFFSPPKKS 
FFSKNSQVFFFSPPFKEKIFPFPPP*NF 
GPPRVFIiKGPPPFFFLGVFFFFFFF 


3151 


17052 


A 


3173 


376 


3 


FFFFFIiRRQSLAVTLAGV * RCNL.SSLQS 
PPPGTK 


3152 


17053 


A 


3174 


365 


14 


RENFFSPEGGAPKNKPPPPPPPPGGKKK 
IFFQKKKKKKIFYPWKOTPPPPKKKKKN 
P*KPGPLKSQGFFFFKKNQNIiPWGPPQI 
SFPKKKKKKKAKTVQERKYNSNTQLVSA 
ETOIiL ( 


3153 


17054 


A 


3175 


1 


377 


VPMSSLGGKARLHLRKKKKKNPGFLKN 
FGPIiALLGMGVGNlKGLKGQKGKNPAFG 
AHTGGGCFSTjRfrRWTPFPlCR Al?r3 T TPYM 

SPH* KET*KALEPRGYKGL*QALALPNL 
KSGKMEHILRGAP 


3154 


17055 


A 


^176 






GHEVLDS SDLPAS AfSOSAG I TGVSHP AI* 
* EILLSMFETTWACDLLFQNI S F I KSS I 
PCFIGLDFIMPHRYCRFFF 


3155 


17056 


A 


1177 i 






TAHTS *GY* VKNYINIiSFCFFFFFIiFPN 
HSALQPGGQGHNRS SIjQPWF PGLKQFSC 
LSLPRSWDNGLVPQHLVNFF 


3156 


17057 


A 


3178 


206 


1 


KGTLFKKDFFFKNFPKKVFLGVPKNSLF 
* KITjTIj P PVT ,NP YP FFFFFFFFTVQT .PW 
PGWNAWP RLE P 


3157 


17058 


A 


3179 


330 


1 


IVSTLETCYIAYNEEEKDTFITLRIYVI 
GGNGKFLGI*IKQHIKKIIHHDQVGFIP 
GMQGWFNICRSINVIHHISRIKGKSRAQ 
WLTPVI PALWEGDAGGSPEVRSSRPA 


3158 


17059 


A 


3180 


3 


399 


HASAP jjQSSLGYRARP CFKKKKKKKKGG 
GGGGFFFPKGGFGPLPKKGFFSGKGGLG 
KWGLGGAGKTPGIKKPLGKGPPKKRGGK 
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KGQKGPHF*KKKKKKGGKKLKKGPF* KP 
FKKPLLGRGVQP *NPHLLGG 


3159 


17060 


A 


3181 


2 


204 


CPTACPFW*NKKTjIiMPKKKKKKKKKKKK 
KKKKKKKKKRGGAPFKNSLGGPHFPGAG 
KKNFFFFWGGY 


3160 


17061 


A 


3182 


71 


377 


PKRGGQPKQKKIWGPPPPGAPPQKGMGF 
FNPRGFKKQITFFPPPPPPPEKKPPFF* 
KKKKKKKKSNCSD 


3161 


17062 


A 


3183 


151 


2 


FFFLEELVPLLLKLFQKIEEEGCSPNSF 
NEAFIILIPKPFR*TTKKENF 


3162 


17063 


A 


3184 


152 


387 


YFQGFIYWACICTSFI1FFFFFI1KREFC 
FVTQVEVQGPNFT*LNPPLFGLKKFFCL 
TLQIGWNNRPLPPPQVIFCFFK 


3163 


17064 


A 


3185 


22 


336 


YEKCTALLQMVS S F I WMBREGTHQYS FY 
RKDFSLASKVNIVS YYLSP I VE* FFFFL 
RGSPFAPQAGGQGPNLGSWKPLPPGLMP 
FSCLTLPGGWNC!RPPPPGPVN 


3164 


17065 


A 


3186 


344 


1 


WVLKKI FFYPGRGGPPFI P P PLGGQGGP 
IPWARGF*PPRGPPPKNGF* KKKKKRGG 
GGPPPGFPPPGGPRGGVPFFLGGGGPRK 
PKKI TKKKNPGEKKKTS FKNQKRKTKI K 
TT 


3165 


17066 


A 


3187 


296 


1 


NPKKILTLPKKTKVYKCEGENQVPIIFQ 
GIKNI FWKGI F* PKKEREVCV* SMRHVI 
PVFPKKRGSKRSNKSCCYKDTCTRMFIV 
ALFTIGKTWKQPKY 


3166 


17067 


A 


3188 


2134 


1 


GVAAHACNAS I LGGQGGRI I *GWEFETS 
LANMVKPC 


3167 


17068 


A 


3189 


1 


159 


LQDHPG* HGEPPS I VKIQKLARHGSLRL 
* S * IiLGRLRQRMRQETCLNPGAR 


3168 


17069 


A 


3190 


119 


340 


QIKKNRLVSARGKNNKRK*IYKP*VDIF 
FKEDIQMAGKQMKRDLISLX IRE IQ IKT 
TITCYLIHARMGTI TRD 


3169 


17070 


A 


3191 


75 


1 


LS VNNFWPGTVAHACNPSTLGG *GG 


3170 


17071 


A 


3192 


343 


1 


IFILGGGPCCSPVFFQFFGGGGGGFFLP 
QNFFPPRGKIFRPFFF* KKKLKRPNWGF 
F*NFFNPPLGFFNLFFFFFKKKPKNFFF 
LGGFFFFFFFFFFFFLAGGDSLAIjSPRIi 
EC 


3171 


17072 


A 


3193 


105 


368 


KFKDPPFPPPFFWPPPKQLPPPPGKIGA 
PHF*TPKGPPPPKKKKC*KKKILKGGRG 
KKKKKTPPKRPQKIWGPSKKKNPWGGGK 
TPPL 


3172 


17073 


A 


3194 


1 


365 


FCRDEVLLFCPDWS PTPGLKQFSHLCLP 
KCWSYRC* PPYPAQGVFLK* HLTKSLSC 
LKLFMASLCLQDKVPAPQPCVKGLSKFF 
LCTLTuSTLIPFTHSLFFLFLGWFLRQHR 
SVTQAGVQWR 


3173 


17074 


A 


3195 


1 


362 


GNQPKRLHAGTYLLFYTLEGSVPIiLIAL 
MYTHKTLWSLNIIiLIiTLTGQELSNC*AN 
NIiX*LAYTRALI VMIPLYGLHL* LPKAH 
VEAPIAGSI*LAAVIJjKLGGYGIIRIiTL 
ILNPLTNT 


3174 


17075 


A 


3196 


137 


3 


KGQPRFPPISLKKGSQGKRGFFFFFF*Y 
RIPLCHPGWCPWQSW 


3175 


17076 


A 


3197 


1 


266 


EGRGCSEL* SCHCTPA* VTVRYPVSKKK 
KKKKKKGGGKGKKKGGEKNTLFGPKRGK 
LRGPQKRGKKIGPEKKVGNNLKKGI FFR 
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BKTF 


3176 


17077 


A 


3198 


217 


446 


YWSEHYAMTQVLEGFSYSLQDHFYFCFR 
SIRRIIFTSLIKPSIND*GEREIiEPITT 
SQALQ I AGRAGRFSSRFKEG 


3177 


17078 


A 


"5 TOO 


145 


114 


OPGPEGKIR FFLK T PMT .TP ctn/^ifCT .VVO 
LFKRVKPENCLSLRG*GCN* PI 


3178 


17079 


A 


3200 


2 


330 


SRHYTPAW*QSKTLSPKKKKKKKKKKKK 
PP FP KNTQKKP KKKKRGGLNWGVKTP P P 
LKTOKMGTS PGKTCT.KTntfT,FPTi1("K'KTCPPn 
NPLF* FRGF * KKKKAPFKKKKNPPK 


3179 


17080 


A 




7 




FPPFI.firiPflTPirFT.KPPT.PAN+MT.FriT 1? 

FLGGVLKKIFLRKPFSLWIKPFPTPFKG 
KKIFFKTFFCjKPIiFFF* kthtpppppppt? 

FFLSRGFFVFFSPKKPFFFFFFFFFFL 


3180 


17081 


A 


3202 


355 


2 


FFFFFS ETESCS VAQAGVQWHYLGSG* A 
LPPRFTP 


3181 


17082 


A 


3203 


156 


1 


LKSLLWEAKVGGLLEVRSSRPA*ATWRD 
FISTKN* KISQVWWYILWVTTWE 


3182 




A 


T704 




I ZD 


nppRRPAP&WPP*r , pr.PT»aT.pr**r , nr , D2i 
CCPPWAEASPSGVQASPARAPACPARAL 
LNE 


3183 


17084 


A 


3205 


280 


373 


QRGTRIFSDLQT* KKKNKSPFKILLLID 
NALGHS RVLTERYKD I FRPANTTSFLRP 

MVDRT .T PTPH Q 


3184 


17085 


A 


3206 


1 


367 


E I E S I Q I DGHTKNKFTjG I HLTYLTKEVK 
DLCKKNYKTLLKE 1 1 DDTEKYDMLMD * N 
NIVKMAILPKAIYRFDTISTKLLMSFFM 

ITLLDFKLYY 


3185 


17086 


A 


3207 


368 


12 


FAQKKKKKKKKKKKKKKKKKKRAPPQN* 
RAPQKPLKPPPRVFDIPPPLGSPPPPAF 
FWRGGGPPPGPFSKKKKKMRLTGGGRLF 
LXiA 


3186 


17087 


A 


3208 


2 


356 


KYLFSS I PEGKEKMKGIANLFNE I ISBN 
CPSIARDLDIQKTRHANPYNLKKSSPQH 

TP IRPGMVAYACNPSTLGG * GGW IMRSG 
DQDHPG 


3187 


17088 


A 


3209 


nq 




IN I KOLPRCY5GTCT.W *50T • SP P T iP R OnPV 
SlJGGQGCSEL*SSQCSPAWATERDSVSK 
YKINNFL 


3188 


17089 


A 


3210 


295 


122 


VTKTV* NWNKDRH I DQCSRI ES P KIHLH 
IDDQL I FDKDVKTPE * RNNGLFNKWCWE 
N 


3189 


17090 


A 


3211 


218 


466 


ATFDTTLPSCYQNAPNNRFDDLSD* EQE 
IDTMTVNIILPLP^lxNIVITNPYMI*HH 
QHDLDYTYPDTTGQLVNCAYFINLLQP 


3190 


17091 


A 


3212 


340 


355 


IiGLQG*REGNFGSLQTSPPGFKRFSGLS 
LLSSWDYRV 


3191 


17092 


A 


3213 


254 


40 


WPGTMAHACNLSTLGG*GGCITRPGDRD 
HPG*HNNNN 


3192 


17093 


A 


3214 


198 


25 


YNEKEKPVKMLKI IANSLGAVAHAYNPS 
TLGGQGGR I MKS GDRDHPG * HADAWVET 
VL 


3193 


17094 


A 


3215 


376 


3 


PPGVF* RGPPGFRTLPPKSSSSSPPPKI 
LGGPPFLAFFGGAPPQKKPPSSSSSASS 
PPSSSSRAPLKKGGPFNPAGGFPLFGGP 
PRGGPPFSSSSSSSSSSSSSSSSGRRSR 
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SRTSRTRGRTRG 


3194 


17095 


A 


3216 


393 


260 


VSQDGLDLLTS * STRLGLPKCWDYRREP 
PRPACTLCISSTYNDP 


3195 


17096 


A 


3217 


3 


387 


HASASSGSRAPPFFFFFNKKTCSKELET 
TRGTVLKTCLFB'KKKKKVLGGPKPPLGN 
PKGVGKPLKGFPGPNPWPPP*KGPGCHF 

luiur i^troirxvivoxix trow vvjvji\.vjooJ_ii_iir 

GPLGGPSEVYFSPLRA 


3196 


17097 


A 


3218 


77 


406 


RMARP EIjGLPGNLLSNDH I YNG I VTAHA 
FVIIFLIVIPI INGGFGN*LIPLIIGAP 
DMAFPRINNISF*LLPPSLLLLIiASAIV 

PAnanTY3* TWPPT.AfSNV^WPfi A 


3197 


17098 


A 


3219 


394 


2 


KRRYFPDGLNFFWGPGILKIFVKKKV5S 
LKKKKKIO^PPVFLQGWGGNKKFKGGGL 

KT7QTTPNT.T *T7PPP(YKT7&P* KM W ffP PWPQ 

PQKDPLRE FFFFFLRQVSLCHPDWSAVA 
RSQLTSAS ISRAQGAGRV 


3198 


17099 


A 


3220 


376 


157 


KFFFSP*KFFFSPKPLKFGGGVGPIIPP 
PKKRFFFKNPQGVFKKPPQKKKKIPFQP 

PVTtfFrtPPPnPr.TTf^PPPPFPPPFPP 


3199 


17100 


A 


3221 


394 


1 


VPPPQKFKTPGPPPPPREFFFF*KKKGF 
P PLGGFLNP AP KNP P PGP P KKVG FPGGP 

P P P PflflFPPQ PPT J? F * NPfTRP VPPYTP1TT 
PKRKFLLKWGKGGKFFPKKPIFPPPPKK 
KKKKKKRAAARDLELADAW 


3200 


17101 


A 


3222 


207 


402 


SlLM*IiCCIiFPLPGVTPIDGAPHRSYRE 
FKVMREKRI 


3201 


17102 


A 


3223 


309 


3 


YPFFHLI DLAIHPCVCFTKFYKATVTQT 

AWSWYQIRYIDQ*NGTEISEIPPHIYNH 

VI CDKHDKNKQWGKDS LFNKWC WEN* LA 

injlfTiTrT .DP PPTP Y AK" T 
xuruuJMmrr xr x xr inni 


3202 


17103 


A 


3224 


3 


382 


IxDRERPP FFFWGARHMDI PQLVNLS INK 

RGGPP * KKP PGGPK* P PGGKR* I FP YMG 
GKKKPPGGFLEKPPPLGGAHLGNPPPQK 
YTPPGKKKNLKRET 


3203 


17104 


A 


3225 


133 


2 


FFFETESHSVTRLECSGTISAYCKn^CLP 
GSSDSPASAS *AAGI 


3204 


17105 


A 


3226 


349 


2 


AGVPPGNPPLWGGEGGGSPRGGGLKPGF 
PQRGNPFFFKKSQPTPPGGGPPLIPPPW 
GGGAGGSPLPQGQRFQ*TKIGPFPSPRG 
KKKKPPFPKKKKKKKKRKEKSEMPGFMV 
LNA 














3205 


17106 


A 


3227 


3 


239 


IjNKVGRGCSE PRSRHCT PAWATE * DS I S 
EKKKKKPESRGILKVKGLTQAbFYlaALW 
IiYYLHPTAKQQIWGFF I YFSKP 


3206 


17107 


A 


3228 




i 
i 


IGLKI0NSCPLKDSIiKKIKROATDWT?KY 
LQNTSDKAS VFI I YKEHLQLSNYKAVDP 
IK*WAKEMNKLH 


3207 


17108 


A 


3229 


174 


1 


VQMLEDKS FEETP * FSSETLVLKI PHVQ 
PGAVAHTCNPSTLGGRGGRITRSGDRDH 
PG 


3208 


17109 


A 


3230 


2 


340 


KNHSVxTjLCVPSIPFPTFCFF*F*VFNT 
F* FKIiN* IN* FTYIDRVLLCHPGWSAVA 
* S * LTAALNS WAQAVPCX^IxLLAHHHAW 
HI FKKTAYI IHIWYVWYDTTYPFKVYN 
S 
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3209 


17110 


A 




i 
i 




ITFLLFDIiEDALLLPLP * ALRFIFLFTV 
GGLTGVVLANSSLDIVLHDTYYVAPHFH 
YVLS IGAVLAI IGGFIH* FP 


3210 


17111 


A 


3232 


207 


355 


ELS PFRLKKTLY* LGMVAHACNPSTLGA 


3211 


17112 


A 


3233 


3 


355 


TTQQ * L I KLTCKQTI AI HNTKGRT * AL I 
LISLIIFIATTNIiLGLLPYSLTPTTQLC 
INIiAMAI PL * ADAEVIGFRSKI KNALAH 
LLPQGTPTPLIPILVIMBTINLLIEPIA 

f.APPT. 


3212 


17113 


A 


3234 


2 


355 


KYLINNRLITNQQ*IiIKLTSKQMITIHN 
TKGRT*SLILISLIIFIATTNHLGLLPY 
SFTPTTQLS INLAMDIPL * SGAMVIGFR 

CVT V~KT7\ T A LJ ITT PHP'TTyrPT TT5TT TV T T l,»M« 

ISLLI 


3213 


17114 


A 


3235 


43 


370 


QGCVKGWVLEEQVRRGWILDSSEGKTDL 
KQRGS PGSWPEHVGGWSGVMG * SEAWTG 
QARWLTP I ITALWEAEVGGSLRPGVQWH 

UT/2CT rvnr bob pvdpovdct nop r/rp\ 


3214 


I7I15 


A 


3236 


15 


356 


LIQPSLKLMISIHNTKGRT^SVILLSLI 
IFIATTNLLCLRPYLFTPTXQLSINLSM 
VIHL*AGAMVIGFRSNIKNALAHLLPQG 
TPTPLMPILVIIETIRPLILPIALAVRL 

XA 


3215 


17116 


A 


3237 


1 


376 


GTRTNTLT I YQ * WGDGTRBSTYQGHHT P 
PVQKGLRYGI ILFITSEVFFFAGFF* AF 
YHSSLAPTPQL*GHWPPTGITPIjNPLiEV 
P1J^CVLLASGVSIT*AHHSLIENNRD 
yxxyALuLx x x vxa» 


3216 


17117 


A 


3238 


1 


358 


GTRNG* YTNA* ALS ITVIGASLTSAQGS 

PIKRIAAGSRFAGFLITNNI SPGCPFQT 
TIPLYLKITDXjGVTFLGLLTCIaDLNYTiT 
NKLIIKA 


3217 


17118 


A 


3239 


258 


390 


RQGLLMLAGLVl^SWPLQSSHIiGFPKCW 
n VCIP V. P PPT *• f. T T . 


3218 


17119 


A 


3240 


2 


372 


ARARFHHVSRDGLDLLT* *NTHLRLPKC 
WDYRREPLRPGKTFFLKKKKKNSIFLFF 
REGFKEKSILGIKFFRPTGGVLILTGNH 
GWGCKTGTEIj'LVP c iP PPfiT, A FIT TPfSf .W A 
HDTPHRVRNWL 


3219 


17120 


A 


3241 


j 




HERLWGGWKTGAAGLGRTSSRPTAST.Tn 
T*TTMTH*SRTTGCSTGSGRTWTRSRHS 
CAWGMAGLYRVAVASRGPRGMM* PTPPH 
WLCWAWKVP 


3220 


17121 


A 


3242 


170 


3 


I KSQAGLVGFLGP FS FQDSLNLVFVFGF 


3221 


I7I22 


A 


3243 


2 


385 


ARADVDTPS YLTSDTIMRDI PTGGQVFS 

* , TATTiWn^KTMV*TAAPT T TT y^CTCT crn 

GGLTGLELTNSWLDIVLHDTYYVGPDFH 
YVLS IGAVFAI IGGLLH* YPLFSGYTLY 
RTYAQ I HFAI I F I GEN 


3222 


17123 


A 


3244 




106 


GTRYVGQAHLKCLTSSDSPASTSQSAG I 
TGVSHSA* PASTSQSAGITGVSHSA 


3223 


17124 


A 


3245 


2 


360 


ARANTLTI YQ* WS DVSLENTYQGHHTPP 
VQKGLRYRIILFITSEDYFFSGFL*AF* 
HSSLSPTPQL*GHWPPTGITPLNPLKMP 
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LLNTS VLLASGVS I T * AHHSL I ENNRNQ 
IIQALLI 


3224 


17125 


A 


3246 


17 


160 


GG*GCSCSEL*SCHCSPAWVTEQDFDSK 
KKPAI LASCLKHLNPLSSH 


3225 


17126 


A 






9 


WAHI YMTPLS P PS PT iTTPVO * TTMPV T Y<5 T 
YSLDN*NPSSPKAKRAPKKSYTLPYLHL 
CVCTCECV*VCVCVCVCVCVWV 


3226 


17127 


A 


3248 


2 


2220 


FFGGGRPSPPQGYFLUONHSSPSPPVKL 
NPGPA* FYPPTKGKKTFPPPQR* PSPP KN 
TKTPPPSFFFSS 


3227 


17128 


A 


3249 


2 


372 


AYTISFIGKISFYGIiH* *LPKAHVEAPI 
VGS IVLAARTJiKLGGFGI IHLTLIFNPL 

LKSLIAYSSISHIALWTAILIQTP*SF 
TGAFFLIIDHG 


3228 


1719Q 

1 / 1^7 






90/* 


jo 


* fffffffffffffffffffffffflyl 
IAMFYTjS ff fkodnoryroys t k 


3229 


17130 


A 


3251 


169 


370 


LKMTELRGAPAS KPRGQE P P PHYPCHHH 
HHHHHHFL* VTKGQGPHHWPS PTRDPGW 
L*SPS*EDQRR 


3230 


17131 


A 


3252 


22 


156 


GERIGLGLGGQGCSEP+LCHCTIiAWVGD 


3231 


17132 


A 


3253 


3 


400 


QNQTPLLD*GGLITAVLLLLYLAVLTGG 
ITILtADRSLDSTLFYPAGGGDPILYQH 
LF*FFGHPEVYILII.PGFGIVSHIBTNY 
WGGKEPFGFVGMV*AMIAMGFLGFIG*A 


3232 


17133 


A 


3254 


373 


31 


REVGPPTP+KIFFFPKGLNFWGGGGPKF 
PPPKKKGFFKKS PVGVFPPPGG* KSGPG 
PGFKKPPQKGKNISFPAGGKIGPPRGTL 
KRAPPFFFFFFFFFFFFLLWVYVQVERP 
XL 


3233 


17134 


A 


3255 


1 


379 


LNLIQRQ* R*V* KFL* LPPQT * KKKKKK 
K3CKKKGGGAFKKNLGGAKFNGGRKKKIF 
FLKGG VXKKKAGGGFKKRGKGKKCYLG I 
FEKKP FFGGGKNWENP PKK I KGT.REKKK 
F*GBKGEKKPEKAG 


3234 


17135 


A 


3256 


42 


376 


FCYISLVHHCIYNDLSFERKKNIFVPGQ 
INSISSIA*EAHCKNKSLLHAVXKKKKK 
KKKKKKKKKKKKKKKKEKKRGEKKKKKK 
KEGRSShKKEK 


3235 


17136 


A 


3257 


jjj 


67 


CYPLSPLKFFFSPRSLKFWKGVGPI ISP 
PKKKVPSQNSQEAGFPSPNVLKRRPGPN 
FKTTP * KEKKNPFPPPVKFGP PKESLKR 
PPLFFFFCGLQ 


3236 


17137 


A 


3258 


2 


129 


FHRISQDGLDLLSS * SARIiGLPKCWDYR 
CEPPRPAKNKILLS 


3237 


17138 


A 


3259 


205 


415 


QQKNRRLLHFKGARTHNSYNRGQPTTPS 
I TAHMYPRLROSHTT YVTiRVHWP * VP<? a 
IEGPVSV*AIXHSSTIvVAGNIJiLIRFH 
PIAEKSPLIQTVTLCliGANTTLVAGGWA 
IiTQNDSKRIAAFSTSRGLGLI IVTIGVN 
QPHIiALLHICTHAPVKAILFMCSGSIIH 
NLSKEQDIRKIGGLLITIP 


3238 


17139 


A 


3260 


3 


393 


S FNLSTLITTQEHL * LLLPS * PLAI I *F 
ISTLAETNRTPFDLAEGEAELVSGFNIE 
YAAGPFALFLIAE YTNI 1 1 INTLTTTIL 



747 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, LHLeucine, M=Methionioe, 
N=Asparagine, P=Proline, 
\g— \MiuuuuuKf i\— ArginiDe, oenne, 
T=Threonine, V=Valine, 

X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














LGTTYDALSPELYTTYFVTKTLLLTSLL 
Ii* TRTAYPRFRYnflT.T WT. 


3239 


17140 


A 


3261 


2 


400 


ISDLSEK* FKRLWKLIMEAPEKGKAQC 
KEIQKMTQEVKGEIFKE*IA*RKKKSKF 

ORTT.DTT . T RMD G 2V T .13 G P OMO T SrmTCTTDItf 
\£o j. uu x xj x anyQ/uiiis) c oiNiCJ. ivy " » *iKW 

SELKDKIFELTQSNKDKGKRIRKYEQSL 

* VAWDYIQ*PNLGI IGIPEBE 


3240 


17141 


A 


3262 




170 


MMT.aPW* \T*Wi7\n.TL T KT.T.TTVrifD VTDTWIaI 
luiiuu m v ** Cix v_ v xirtx f\J-LuJ\ v I\_K i rlbn 

EKMPANHISDKGLVSGIYKELPRLSNKQ 
AI DLTF * KWAAGHGGS PL 


3241 


17142 


A 


3263 


238 


3 


KEKKIGLKKCLQGSHFSIHTAWS 1 1 IYM 
FSPLTIISRKRMGQPGIVAHTCNPST*G 


3242 


17143 


A 


3264 


350 


3 


S PTLLGSKDPNLLG FRFPL WKKGKI IRA 

TFLWCFLKRGFPFLSQKKKKKNPGAVA 
HACNPSTLGGRGGRSQGQELETSLANTV 
KTR 


3243 


17144 


A 


3265 


69 ~" 


200 


RLECSGVISAHCNIiNLPGSGDS PAS AS * 
IiAGITVMVKLPVIAK 


3244 


17145 


A 




771 


*HJo 


FFCLGPPRRGD*GPRSSSPGSPWVFKKN 
GVSP 


3245 


17146 


A 


3267 


185 


420 


DQGLWGFI IYFYRQSLA*VHWNNPSSL* 
PRTPGLKHTPVPSLLI SWDYGRTPPHLT 
NFCIFFDRGSFFF*DRVSATHA 


3246 


17147 


A 


3268 


3 


392 


TGCHS I PQAGVQWHNHGLLQPQP PGLR * 
G <5 PT J3 T.P Q Qwuv 


3247 


17148 


A 


3269 


1 


398 


KFSCISSKHQKLKLTPKPPKPPPKKSPL 
VLPIGKKIRETFWGAFKS PPPNQPKGAQ 

1 xj FJjJXx w U XUWwuVjuJjAJj V V * KAir rbNF 

KGPPGGKPMEQP * LGPGPPLKWKGGLPH 

v£ R*3\jr o aatu oxvxvxvvjvjIj tvu 


3248 


17149 


A 


3270 


422 


183 


ETEWSLFKVI ITERS PNIiEKDTNIQVQ 

RSYRT'PSPPTT.MTif'r'FQPlJT.TTTT DOT/MM 
DomiroKT J. Xd*U\ J. x oivMXjx X xxjir rvviMXN 

TERIL* MQQDRGNNIQWSSGCSA 


3249 


17150 


A 


3271 


3 


35 


KNNSLIIPTIIATITLLNLYFYLSPIiLY 
**SSSPPS 


3250 


17151 


A 


3272 


155 


i 
i 


ifDFFPPFIiPO Q PTT.vanzk a ap wnriT .pc r . 

JVXsiT c c C C JUIvyOC X Xj V/i\4n>lnILrf v^iJXJVvOXJ 

QPKPPGFK* FSYNSLSSSWDYIG 


3251 


17152 


A 


3273 


413 


71 


PPSTGLFLTEEYEVSFPPFPL*KFFFPP 
SGLFFGGGVPPFFPPPKKGFFPKYPRLV 
FKGPLLGGGGLPPPPP * I LPPLGS FkPA 
PPLFFFFLFFIi 


3252 


17153 


A 


3274 


336 


45 


DRVLSCSPAWRAMARSHDFG*LOI.PPPR 
VKVFSCXSfTPSSWDPPJIWPRKGNFVFL 
V1CTGNPWNLGGQGCREPJILCPCIPAWGT 
DKDSVSKKKKKS 


3253 


17154 


A 


1775 


zw • 


Ififi 


f? VFFFVT* Ff5FFVf3 ATJ ,PF PMPWT .1? PQfTT 
ILVHOSILHFPGSKDSSASPS*VAGT 


3254 


17155 


A 


V)16 




7^ 


FMTRR VR RP P PTM T T WUT7I7 * PWMWD PT7 

FFI FFDGRFSCVFFFFFLFYFFFFFFFF 
FFFFFFFFFFFFFWFSARSFIYFLFPRH 
VT 


3255 


17156 


A 


3277 


144 


17 


KAPPLFFFFFFFFFFFFFFFFW* FRG*T 
HWNGDACMCNLTKS 


3256 


17157 


A 


3278 


2 


109 


YHIVKPSP* PLTCALSALLMTSGLAM* F 
HFHSITL 



748 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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peptide 
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Predict- 
ed end 
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correspon 
aing co 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Ala nine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, F=IsoIeucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 

— f Immtn m in a D— -A r<Tin!nO C— O^— Inn 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


3257 


17158 


A 


3279 


404 


65 


NLVSSPKRS VFFISPSQKI FFFFFNWKE 

FYFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFL*TACLF*TFKFTKTK 
R 


3258 


17159 


A 


3280 


226 


383 


GI* KNSMDGCCLGPFWFETGSHSHPGW 
<5 T VWRNIiGS IiOLIiPPKFKRFS C IiS 


3259 


17160 


A 


3281 


91 


26 


LFFFFFFFFFFFFFFFWSERSS 


3260 


17161 


A 


3282 


3 


321 


HWIYYFGKNEPFGYIGVA*AMISVGSL 
G FMA* AHHI FTGGI DVYS PAYLTCAT 1 1 

vAXlr i.v»VlvVr J. Lnl JjJJoOvjPl^X^Orirlw ij 

* TLGFIFLFTARGLTGIVLSNS 


3261 


17162 


A 


3283 


393 


112 


PPLRVFFPPFPLKNFFFSSRRFFFWGGV 
APFFPPPKKGFFSKI PPGFFFPPPLKKK 

TT?CO COD* ITWZiDPnPPT»1?/TDDP"G*PPPPP 

FFFFFFFFF 


3262 


17163 


A 


3284 


360 


47 


P PLHRWVPP PPP * KFFFS PQTKFFGKVF 

KFFLPPPI FFPPPRFFFTTPPPFFFFFF 
FFFFFFFFFFLPIiFFWSVSF 


3263 


17164 


A 


3285 


386 


157 


FFSPNPLIFWGGGGPKFPPPKKRFFSKN 

DDr*\7PPQPDT,KT?Tnn?PFPPP *NT lAPPPTI 

FLKGPPPFFFFFFFFFFFFF 


3264 


17165 


A 


3286 


3 


385 


DAWVPCKLYLGVFFCRFVKISAILILKI 

T.TPWT PMTTQ T*VT.T.K KWK\CfrtCKTC1CKKTC1C 
LiX C IN Ifili J\o J- X Jjl i^I\J>j\JVJ\J^rv^jvi\j\-^R. 














3265 


17166 


A 


3288 


81 


270 


IIKIiEKKiaGCKKKKKXKKKKKKS*KGNE 
KTRGPKKKVHRDVTKKI LCYKI CP FNKG 
ISLIGT 


3266 


17167 


A 


3289 


3 


145 


IX5RLJ?QKNHLNLGG*GCREPKSHHRIIiA 


3267 


17168 


A 


3290 


404 


47 


GPGGKIFFKKNPBEKI FSTQKKKGFFPP 
FPPKNFFFPPGGFFFWGGGGPIFPPPKK 
GFFPKIPRGVFFTPPKKKKIYFFPPREI 

rjA DTVSTP* ITfiPPT.FPFPFPPFPPFPFTiV 
WAJr IrVj-Li? lUurlr trite r rccrrrrrr rruv 

LNDILLA 


3268 


17169 


A 


3291 


106 


362 


KKKKKKKKKXKIOCKKKKKKK^^ 
KKKKXGRGGGKKKKPRGAQI IRGKKKII 
FFFKKVFK* KGGGDL* KKKS FCGENNLE 
NTHKE 


3269 


17170 


A 


3292 


3 


226 


OTLVOTiNLGLLFILATSSLAVYSIL*SG 
OA <5NSNYAI\ IGAliPJ^VAOKKKKKKKKKK 


3270 


17171 


A 


3293 


337 


2 


FFNRELLKSKNKETEKNHDYEKERIWHF 
DRPYKFYMLFFCFGVLIFFLYLPFFIYF 
pp FFPPPFPFPFPP P F FPFFLNS LKfT * F 
FS INTHKCGNRGQGGGAIGPRLSNGRV 


3271 


17172 


A 


3294 


327 


21 


KFFFSPPNKYFFFVFPHKFPTPTKMFL* 
KIPPPPLISPPPKENHQCPPPPHIFAPP 

odt) ppY^p DPTFFFFPP PT .P PPTiWPTn QR Y 
Dorr i. vjir trixr rrrrrruccr unAiuom 

IAQAGLELLGSSYPPAS 




\n\TX 


j± 




JO/ 


101 

till 


FFFSFF*KKKNFFPPPPLWPPPPNFL*N 
PPPPFFPFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFF FFFLVNSLCRRYRG*SLLY 
YAWL 


3273 


17174 


A 


3296 


178 


2 


KGPPFFFFFFFFFFPDSITLFIITKNEK 
KTRCTSIGE* I TWYTHTGE YYS VMKRNE 
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C=Cysteine, D=Aspartic Acid, 
E=Glutaroic Acid, (^Phenylalanine, 
G=Glycine, HNHistidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, PHProline, 
0=Glutamine R=Ar**inine j^^pnnp 
T=Threonine. V=VaIine. 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














LS 


3274 


17175 


A 


3297 


147 


1 


RGPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFTW*FALTFLV*NF*AI 
ITF 


3275 


17176 | 


A 


3298 


3 


347 


PFGEGETNGRFDLDTKKALVSKKKKKKK 
KKKKKKKKKKKKKKKKKKKRGGALKKKN 
RGGQKKTGEKKKKFFFKKGEKKJCTPGNF 
* KKKKFI/3GDNLAQTPPKKKNLWGKKKN 
FWG 


3276 


17177 


A 

i 


3299 


405 


2 


WRPKPENSPSPGGPNPPGQKN*TPFPPK 
TPKLGRGCGGAPQFPLPVRARGENPPNP 
RFLGPN* PKFPFPPFAL*GGGNNBTFFQ 
KKKKGPRFWAGVLPTGGAEGESIPHLSP 
ASGSSRHPWCFMLCRPSRLCIili 


3277 


1717R 
I / 1 to 


A. 




1 4. J 


j JO 


PLFPHFLGGWGGGGTRPPG* RKRGKSGV 
PQPPPPGGRVPPPPHPPKKWGNKGGPPP 
PGKSFFFFFFFWRDEVSLFCTGWSQTQT 
PGLKQSIHLSLPRWLDYGREPPHLAIHF 

fPTT? PPTRrt 

L K J. IVl IV X i\VJ 


3278 


17179 


A 


3301 


348 


1 


PQRLKILGGGGPPNSPPPKKGFFPKTPR 
GF*NPPPKGKKKNFPPPRKIGPPQGFF* 
RAPPLFFFFFFGPWFFFFFFFFLFLENG 
VSLYCPGRSRTPGLKLSSRLLASLRAGI 
TGVS 


3279 


17180 


A 


3302 


2 


354 


GRVGGRVGSPTTLTT*NPHSHEKTPSCS 
YPKKKKKKKKKKKKKKKKKKKKKKBCKKK 
KKKKKKKRKRKKKK3CKT*GGGAYIKKKG 

KKTPH 


3280 


17181 


A 


3303 


362 


26 • 


SSQAAHPTKKKKKKKKKKKKKKKKGGAL 
KKNFGGGKNKGGEKKKIFFFKGGGKKNP 
GGNFKKKNFFGGGKIGENPPKKK*RFRK 
KKKFLRG FCGGKBT^ 


3281 


17182 


A 


3304 


371 


3 


NQKKP P P PVFFFFFFKKKKNPPPATPPP 
GRP * FFPPQKKFFPEGGPPPFFKKGAFF 
PP P KKNPFE P F FPGGVFPP * KKKN1 KW 
PP PK3CKKKKKKKKKKKKKKKPKKQKKGR 
PARDLEPHAS 


3282 


17183 


A 


3305 


2 


196 


DR * LFSTNHKD IGTLYLLFGA* AGVLGT 
ALSLL I RAELGO PGNLLGNDH I YNVI VT 
AHAFVIIF 


3283 


17184 


A 


3306 


4 


339 


LFSTNHKD IGTLYLLFGA* AGVLSTALS 
LLIRAEIjGQPGNLVGNDHIYNVTATAHA 
FVI I FFIVRP 1 1 IGGFGN* LNPLIIGAP 
DRAFPRINNI S F * LLPRS ILLVRASAI A 


3284 


17185 


A 


3307 


2 


367 


KPPAITQ YQTPLFV* SGLITGGLLLLFL 
PDLAAGITILLTDRNLNTTLFDPAGGGD 
PILYQHLF*CFGHPEGYILILPGFGIIS 
HIGTYYSGKKBPLGYIGMV*AMRSIGFL 
GFIB*AHHIL 


3285 


17186 


A 


3308 


243 


370 


KGHRPGMVTPVCNPSTLGGQGGWIRRSG 
I*DQPGQQGVTPSL 


3286 


17187 


A 


3309 


2 


363 


LVPLIIGAPDMVFPRINNIGF*LLPPCL 
LLLIAYAIMEAGAGTG+TAYPPIAGNYS 
HPGAS VDLTI FSLHLAGVS S I LGAINFM 
TTI INI * PPAI TQ YQTPLFV* SVLITAG 
LLLLCLPG 



750 



WO 01/64835 



PCT/USO 1/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


3287 


17188 


A 


3310 


1 


364 


LYITGPTLALTIALIiL*TPLPIPNPLVN 
LNLGLLFILATSSLAGYSIL+SGGASNS 

MVftT TOST PTfttnWPTGVOV'PT RTTTT OT 

LLIRGS FNLSTL I TTQEHL * LLLPS * PL 


3288 


17189 


A 


3311 


215 


2 


GMAKLKARFLPKKGGKRSLSPLLWGSGF 
QS * LTTASTFWAQAI 


3289 


17190 


A 


3312 


153 


3 


MNKCICMCTCMCVCF^SHKHTVCAWVCV 

X * V *x X 1 Vv» Vl^ Vv-VV».Vv»Vl«. Vv- 


3290 


17191 


A 


3313 


149 


1 


KTPPGFFPGFKGPF*FKGPPLFFFFFFF 
ETEDGVLLCRPGWSAVSRDRA 


3291 


17192 


A 


3314 


282 


69 


KIGKTAPFFSWPRRVPFLKKKKKKK*GL 
TLSSRLEYGGMISIiELLGSSDTLAiSASR 

VAKi IIj^K^HAWJjI 


3292 


17193 


A 


3315 


479 


60 


MMQHS I KHALVE I TGWVRWLRP VT PAL W 
ClAlsVGErNoISK^LNNRl^SNIYl\J^ILr 
NLIYRINAPFFKFLIALLNKNSKSKSIW 

xrr.trc* *G , pr\*cois'KrKn7/2nT*rTcr4i?* f i*iin 
PilJivo *o ly^ooKIMJN VovjI 1 J.oiJi?^ xriy 

QATKLRQEFLHTGLVHRSTVSRFRFHA 


3293 


17194 


h 


3316 


129 


3 


SFHQKVWPDAVAHACSPSTLGG*GGRIT 
GSGDQDHPGQHGE 


3294 


17195 


A 


3317 


3 


156 


EFHRVSQUGLDRLTS * STLLGLPKCWD I 
RCEPPRPASFPSFLTVRISPFL 


3295 


17196 


A 


3318 


36 


340 


TGLVIAEHLLFFFFKKGFWFCPPGGRAG 
PQGNLMEFPVSGGKGIFCPGPPKSGE*R 
APPPSRGKFWFFKKKGGLPFWPGGV*TS 
DPRGTPPPLGPKGGSYR 


3296 


17197 


A 


3319 


326 


2 


KAr FIjNF IvPHPGFA*GIiICri^NSLiLP I * 
SFIRKT*AFLICL*IQLNRADGKIPIiKP 
G VI P FKKCFPNFTP P LFFKKNCS S TPNL 
FFFFPFFFFGDGISLCRPGWSAVA 


3297 


17198 


A 


3320 


175 


312 


QAls VQWGDIjSSPXPPPSGI»T* fy*cslp 
RTWEYRCPPSRPCNFSIF 


3298 


17199 


A 


3321 


363 


51 


RWGP RLPVAWLPS VSRRAEECS PGRGAH 
GQGGGGGGQAR* REGPDEWTGGAPVGMG 
SPSLNRGAGRGPPQPGPSS KSRRGRVRC 
TPHSCLIGCGLSFFICKMG 


10OQ 


i /zUU 


Jn. 


552.L 


i44 


187 


RGWVSKKKKKRKKRKEKIRICVLT 


3300 


17201 


A 


3323 


3 


313 


TRRERERERERERERERERBRERERERE 
REREFFFFFF*KKNLSPPPGGKKRGGGG 
FHSPPFWGEKDFLPPRPPKGGERGGRPP 
if if m c r per rv K rv_n»vxttjr£*ooov 


3301 


17202 


A 


3324 


365 


1 


SLGWWRVTPPPFPLGPRGGSPPRPDKRG 
SPGRVKFGGVKPFPKVKP+PPPKRLDPS 
GA*FFFSPKKICQGPPPRKKKKKEAELS 
ILCTKFPLQEEEVMQVPPPPCTCSSEPG 
INCICKRHC 


iJ02 


17203 


A 


3325 


1 


269 


A T57\ T rT GT Or OT OT PT C T 1313D171717>T X nf 

QILGGGLFYPPKGGAHLSNRLGGGRVFF 
GPPGKKTAPLGGGRVIKKKTPPGTPL* L 

G PPPHfiP PP PfY3fir>TTKT iGPP P /IWSP 

PPWGGKKAPPPKFGSGPKKKKKTLERER 
BRERBRKRERELV 


3303 


17204 


A 


3326 


1 


316 


ARGERERERERERERERERERERERFFV 
SPPGPDTLQDRGGVSLFASDPTHEIYLP 
CPLRGPLSERDPEELHGVCFFSEPSHRW 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
OGIycine, H=Histidine, Msoleuctne, 
KHLysine, L=Leucine, MNVIethiontne, 
N=Asparagine, P=Proline, 
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nucleotide deletion, V=possible 
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3304 


17205 


A 


3327 


2 


369 


ARGDVTRESTYQGHHTPPVQKGLRYGI I 
LFITSEVFFFAGIF*AFYHSSLSPTPQL 
RGHWPPTGITPLOTLEVPLLNTSVIiLAS 

LGLYFTLLQA 


3305 


17206 


A 


3328 


1 


259 


GTRKKHSTI LI IRE IDIKFMPRFHISP I 
RLVKI*ALANIRC*QQC3GKVGNIiLHC* F 
SKLVQSTGRAIWHALVLSLARSRALSLS 

T tJ 
Lit! 


3306 


17207 


A 


3329 


2 


352 


ARGTLLLLCLLFFALTIYPRWCDVTRKS 
TYQGHHTPFVQKEPLVLGGKLFITS * VI 
FFAGFF*AFYHSSLTPTPI*LGGHWPPTS 
I SLFNPLKKPLLNTSGLLASGVS IT*AH 


3307 


17208 


A 


3330 


3 


361 


HEEPLGY I RMA* AM I S IGFLGLI VRAHH 
I FTGGIDVDSRTYLTSATI 1 I AMPTGGK 
VFR*LATLHGSNMK* SGAGL* ALGFMFL 
FTVGGLTG I VLTNS * LDIGLHDTYYWA 
trcuvrrr e? 

rlrni VJLo 1 


3308 


17209 


A 


3331 


1 


340 


GTSGDTRAYFTSAT IIIAI PTGVQVFS * 
LATLHGSNMK*SAAEL*ALRFIFLFTVS 
GLTGIVLANSSLDIELHDTYYVVAHFHY 
VLSIGAEFAIIRGSIH*FPLL+GYTLDQ 

1 


3309 


17210 


A 


3332 


2 


352 


ARGDVDTRAYFTSGTI IIAI PTGVKGFS 
* LATLHGSNMK* SAAVL* ALGF I FLFTV 
RGLTGXALANSSLDIVLHDTYYGGAHFH 
i VLio iKyAv r AXliKyt 1H* FPLr SGYXLD 
QTYAK 


3310 


17211 


A 


3333 


85 


370 


QVSHRVRPCFERBRERDRYRHTQRECER 
AKEIYSGFFLSASEMESCSLAQAGGCKB 
LRMCHCTPAWVTQGDCVS *NKK* HWEKK 


3311 


17212 


A 


3334 


362 


3 


QEVKVSMNRDHTHS LQRGEQKRNS VSKK 
KKKSSQEKLKRVHTQTHT* IFIAVLFIV 
AKNGKQFKFPSTCEWINNM*YIHTMEYY 
S ATKNQL IHVTT * MNLNNI YTKFLKARQ 


3312 


17213 


A 


3335 


3 


351 


HEGTTIiHHLFLLETGSNNPLGITFHSDK 
KTFHPYYTIKDALGLLLFLLSLMTLTIiL 
* PDLLGDPDNYTLAHPLNTPAHIKPE* Y 

AIIP 


3313 


17214 


A 


3336 


17 


350 


FI FTLTETNRTPFDLAEGESELFSGLKI 
KYAAGPLALLFITBYTNI II IDTLSTTI 
VLGTTYDALS PELYATYFVTKTLLVTSL 
FL* IRTAYPRLRYBQLIHLL*KNFLPL 


3314 


17215 


A 


3337 


2 


349 


ARDSLISPILRLHPRPNLRQNAFHYHIH 
KvjitSWrliFI ILiSKPIRNAPTLLGIiPRCI 
HHMKHP I ICRLIH* FPLFSGYTLDQTYA 
KMHFTI IFIGVNLTFFPQHFLGLSGMPR 
RYSDYPDAYTT*NILSYVGSFISLTAAI 
LI IFMM*EAFASKRKVLIVEEPS INLE* 
LYGCPPPYHTFE 


33 b 


17216 


A 


3338 


2 


386 


ARAILNAMAFLILTERKILGYIQLHRGP 
NWGPYGLLQPFADAI KLFTKEPLKPAT 
STITLY ISGPTLALNIGLLL*TPLPI PN 
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P LRNLNLGLLF I LATSSLADYS IL*SG* 
ASNSNYALIGALRAW 


3316 


17217 


A 


3339 


175 


16 


ILGDLFPPAWLHPFLFLLLLFSRPSLAV 


3317 


17218 


A 


3340 


1 


369 


GTSNLSLLFIVATCGLAAN*ML* *GGKI 
N* IYTIoRVAIiAAI IHTILYB ITLS I ILL 
*TLLISGSFNLSTLITTQEHL*LLLPS * 
PLAII * FI STLAETNRTPFDLAEGESEL 
c * LaluN xe» X bAK 


3318 


17219 


A 


3341 


1 


363 


GTRGGTILPAMVLMLIDVPSLRILYITN 
KA/NEPobTIltor*GHyWx*TxQx IDxGGLi 
ILNSYIIiPPLFLQPGDLRIIiDVDNRWIi 
PIETPIRIINT*QDGLHS*AGPTLGLKT 


3319 


17220 


A 


3342 


181 


315 


PKSPPTPPPPTXXXXXPGGNPPGPTL*G 
PFYWGGERGPPF*TGGP 


3320 


17221 


A 


3343 


3 


183 


HEVSQSCRELLTSGDPPASASQSAGISG 
MnnRAwF vS w Xrnl rNXxULjfc'vj I vixKF x 
ItNL 


3321 


17222 


A 


3344 


83 


354 


VLGARCPCAGVSRVSMEPWFLWQQACIiL 
P WWRCCLSTLYANRADRDVP * TSRTGP V 

QDGETP 


3322 


17223 


A 


3345 


124 


379 


GQGI CFMWSLGLSHRLAHCRYTVGVNN 
LLCSSSLPFIiLPLIiPLPPPEGPGWDDIN 
I IIFI I FRDKVLLCCLGWSAVT * SQLTT 

A 


3323 


17224 


A 


3346 


3 


368 


YEPIP*PLTGALSALJjMTCGLAM*IHLH 
S ITLLILGLLSNTLTIYQ*WRDVTREST 
YHGHHTPPVQKGLRNGIILLITSEAFFF 
AGVF*AIYHSSXxAPTPQLGGHWPPTGMT 

r\T xTDT WIST T 


3324 


17225 


A 


3347 


1 


352 


GTSAGDVNYG* I IRYLHADGAKI FLRCR 
FLHSGRGLYYGSYVYSKT*NIAINLLIA 
TIATAFIGYVLA*GQILF*GATVSTNLl4 
SDIPYIGTDLSQRI *GGYSVDRPTLTRF 
FTFHF 


3325 


17226 


A 


3348 


2 


360 


ARAEISPLHSNLGNESETPSQKKKKKKP 
PGGGGKNWNPYPSGTPPPNPPKGGE *RB 

KGKKKRGFPAR* KNP VFKAKGKGRNFG P 
QAFFFKK 


3326 


17227 


A 


3349 


226 


366 


PQSFCSVASWPAGDLMEI*AWQKFEYKP 
GKVPMDPEGCUjPLAKKKEEEEEEEEEE 
ET*?FrFrET^^tEEF^tP-E EES EEEEEGE E EELV 


3327 


17228 


A 


3350 


2 


374 


ARGGGYS PHRATLTRTVTFHS ILPFI IA 
GLAAIHLLFLHETGAYTPL* ITSHSEKT 
TFHPYYTMKDALGVLLFPLSLTTLTLCS 

P>JT .T /UZ P PTH VTT .TNT) T .WTO? PTT T TTt5 T? * IV 

LFAYTNMRSGPN 


3328 


17229 


A 


3351 


1 


266 


♦NILTAI ILNLMVLPSLRMLYITNEVFD 
PSLTIKS IGHQWY*AYEYTDFGGLIFNC 
YILP 


3329 


17230 


A 


3352 


207 


362 


ILFLLKQKI YVLFGFSN*GSS I PELAHS 
DAYQTKE I CSSGKVYKYLQCWEKW 


3330 


17231 


A 


3353 


178 


365 


NHGIDKGLRYRIYTVQQEDNSVKR* LKD 
FNQHLS KDNIRSADKHMKRCPTSLVXNI 
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vis AR.rr 


3331 


17232 


A 


3354 


359 


3 


LQVRS LMPLGQFHPP * NFFCSFFPKPKI 
PFFFLFFLKGVFVFFPGWRARGQFWVFG 
PPPPKFKKFFFFTLLRNWGFRDFPPSRG 
KFFFFFFFWYF* * RLRFTMLTRLVFNS * 


3332 


17233 


A 


3355 


2 


354 


ARE P S P * PLTGALS ALLMT CGhSM * IHL 
fISITIiLILGLLTNTLTIYQ*WRDVTPES 
TFQGHHTPPVQKGIjGYGI I LFITSEVFF 
FAGFL * AFYHSSLAPTPHLGGDWPPTGV 
TPLNAL 


3333 


17234 


A 


3356 




364 


GTREMICCSALSPRIHLSFHRRWPDWHC 
ISKLITRHRTTRHVLRWSSLPLCPINRS 
W I CHHRRLHSLI S P ILRLHPKPNIjRQNP 
FHYH IHGRK* SAAVL * ALGFI FLFTVGG 
LTGIVLANSSLDIGLHDTYYVGAHFHYV 
LS IGAGFAIIGGFIH* FPLFSGYTLNQT 
YAKIHFTI I FMGVNLTFFPQHFLGLSGM 
fKKxbUxPDAYTT 


3334 


17235 


A 


3357 


2 


376 


ARGGQPEDYILILPSFGIIFHIVTYYSG 
KKEPFGYICMV* AMIS IGFLGFIA*AHP 
lr i> VCj lUVNTKAi rTSATI I IAI PTGvK 
VFSWLATLHGSNMK*SAAAL*ALRFIFL 
FTVSGLTGIVLCN 


3335 


17236 


A 


3358 


357 


139 


AQPWFFFLETVFCHAAQAGLKS IGSSDL 
Irl oAbQbVGTTGMSHHASfPERSC*QNTF 
TYYVQSIYSDDIRKSMF 


3336 


17237 


A 


3359 


277 


1 


SCQKD IS IGFLFVPGFRVKGVAQEHLVG 
VRMYVLVPTVGGDGKERER*RGIRHTPP 

OOJ?'DTJ r DrYD T3 r DT7T3/"*DT"VD TYEJ L* I^L?T5 7\0/"\T>Tr-r» 

fSK v KyKJSKl£KljKlJKljKri lUSKASQRKP 
EMDRETSC 


3337 


1 /ZJO 


j± 


jjWJ 




i. 


TTPOlTP&tJI ypoiTTT t rmvT r P/"<7\ ooot\t tvt* 

A JL t-£j I^CAWLiCr SKTXjIiTWTCASRSDIiAH 
GP * CPNLCSG I FSSAWTD I SRLLLFFCL 
FVLRQSLTVAQAGVQWCNLCSPQPLPPG 
SC 


3338 


17239 


A 


3361 


199 


354 


HLPYWPMFMATFWFFFFFETKFHFFPQA 


3339 


17240 


A 


3362 


3 


364 


HEDRDTPAYFTCANII IAIPTGVKVFN* 
IiATLHGSNMK* SAAW* ALRYI FLFTVS 
GLTGIVLSYSSLDIVLHDTDYWAHFHY 
VLSIRAVFAIIGGFIH*FPLFSGCTIiNQ 
TYAEIHFT 


3340 


17241 


A 


3363 


3 


360 


HEETY I PKDQKYS FLHDSQTS FCFSNS I 
PTPSNMEETQQKS VSGCRLP *AGMGBTC 
GQGSRAAQPLPSLPLQNLKLLRISLLLI 

HLLKDIjE 


3341 


17242 


A 


3364 


352 


27 


VYLSSQRNSAASVNRMTVVTDRSLSPYT 
iXa WiiKJr.UKlir itr V iris Jjr V X 1 F V iSKGSQS 

lAQAGVQWCDHSSIi* P*TPGLK* *TCSC 
LP * VAGTTGTHHYTGPIiI LLNL FN 


3342 








7/1*7 


QO 

yy 


PPPPPPKTAPQKKKKAPSI PAGQWLVEQ 
NPRPQPFKKPPGVGF*NPPPPLNMGF 


3343 


17244 


A 


3366 


3 


319 


HEECIJUrriXFFNLMS*RTTVILHRMFS 
IVTAI * YLLSSQYVIGYYIYVCIHTHTH 
MYIHTHRHRTFICVCVCVCIYIYTSIC 


3344 


17245 


A 


3367 


3 


474 


YELLIiLNGPILNARAFLILTERKILGYI 



754 



WO 01/64835 



PCT/US01/04927 



SEQD> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQ n> 
NO: in 
USSN 
09/515,1 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sentience 


Amino acid sequence (A—Ala nine 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrostne, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \~possibIe 
nucleotide insertion 














QLHKGPNAVS PYGLLQPFADAIKLFTKE 
PLKPATSTITLYITAPTLALTIDLI»Ii*T 
RLPIPNPLDNLNLGLLFILATSSLPVYS 
IL*SG*A*NSNYALIGALRAVAQTI * *E 
VTPTI ILLSHYYKVTIjRN 


3345 


17246 


A 


3368 


2 


348 


ARGLQDATFPIIEELITFHDHALIIIVL 
ICFLGLYALFLTLTTKLTNTNISHAQE I 
ETV*TIliPAI ILVLIALPSLRMLYITDE 
VNDPSLTIKSIGHQWY*TYQYTDYGGLI 
FNS ! 


3346 


17247 


A 


3369 


64 


363 


KKWGFVPQQKKGGGRISVNGTPIFKGKG 
NFPP*PPGEGGTKGPGPKPK* IWIFKKR 
GGLIFWPKQFPTPGPKGTHPPGPPKGGK 
*RGGPPS PGGFNFKRG 


3347 


17248 


A 


3370 


34 


355 


AKGKEYIETVAEKKKKKKKAPPPI*KAP 
♦NPGKKPPLALKFPPPSQKAPPPALFCP 
*KGPPPRVFPKKKKKFPPFGRKGFFFTG 
GKPP 


3348 


17249 


A 


3371 


1 


374 


FLHSLHLSLRLLYLP*KFSMFIILLKAY 
YTLCVCVCVCVCVCVCV* PLNKIHINHF 
TCSTVRGRSNP I FFPPLLN * RPFYTT 




1 /Zjtf 


j± 


jj / z 




j 


FFFFLKKOKI SRVFPRGPE F FGORKPFW 
AKGPPRAPQKGGEKGKNPSPPPPLIFFK 
KGEKPQGGPLFKKGLGFFS P PKVFFKKG 
VFKSGPGGP * NFGFRGFS P PS P PKKGGK 
KRKPQSPGFFFFFFFFEILGCSGRSRSR 
T 


3350 


17251 


A 


3373 


2 


404 


LGTDLSLIIRADLGQTGDLLGNDHIYNA 
IVTAHAFEIIFFIEIPIIIRGFGN*LIP 
LIIGAPDMAFPRINNISF*LLPPSILLL 
LASAIVEAGAGTSGTDYPPLSGNYSHPG 
ACVDLTILSLHLAGVSSILGAI 


3351 


17252 


A 


3374 


2 


417 


AFV* IAHITRDVYYG* IIRYLHAKGAR I 
FFMCLFLHIGRGLYYGTFLYSET*NIGI 
ILLfiATIATAFIGYVLL *GQISF*GATE 
ITNLLSAIPYIGTDI J IQ*I*GGYSVDSP 
TLTRFFTFHFILPFIIAGPNPLHLTT 


3352 


17253 


A 


3375 


85 


1 


FWPGAVAYACYPS TLGG * GGWIMRSGVR 


3353 


17254 


A 


3376 


396 


1 


KEGYIKGPRGENFF*KNPERKIFSRQRK 
GGVFSPLPPKKFFFFPKGLIFLGGGCPI 
FPPPKKRVFSKNPPGVFINPP* KKKKI F 
SPPPENWGPPRVFLKGPPPFFFFFFFFF 
FFFFFQTLFLGGCGYNTKLR 


3354 


17255 


A 


3377 


3 


118 


LLPPSTiT iTiLTi AYAI LVAGAGTG*TDYPP 
LTGNY*KTKA 


3355 


17256 


A 


3378 


301 


82 


ERKGPPQRGGKKGPKAQKPEKKKPKKRP 
PPPQN* FFSNPPKKKKGGKDPPSKKRC3G 
GKGVPPPRKKKKKKKKE 


3356 


17257 


A 


3379 


2 


195 


DR*LFSTNHKDIGTIjYIiLFGA*AGVIjST 
GLSLLIRAEIiGQPGNLLCNDHIYNVIVT 
AHAFVTIF 


3357 


17258 


A 


3380 


21 


411 


VFCTNH * DIGTL YLL FGA * AGVLRS ALS 
LLIRAELGQPGLLLGNDHIYNVIVTGHA 
FVMNLFIVIPIIVGGSGN*LGPLEIGAP 
DMAYPRINNIGF* LLPPSLLLLLAYAIV 
EAGAGTG* TVYPPLAGNY 


3358 


17259 


A 


3381 


2 


421 


GRVGGRVGSTNHRD IGTLYLLFGA*TGV " 
IiGTAIiSLLIRAELGOPGNIjIiGNDHMYNV 
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N=Asparagine, P=Proline, 

0=i^l lit?* mine T)~ A roinin^ C — ^orinf* 

T=Threonine, V=Valine, 

W=Tryptophan, Y=Tyrosine, 

X=Un known. *=StoD cndon /=nn«ihl<> 

nucleotide deletion, \=possible 
nucleotide insertion 














IVTAHALAKIFPILVPI IIGGFGNGLVP 
LIXGAPDMAFPRISNISL*LLPPSVLLL 
LESVIAEAEAGTGRTVYPPLAGTYSNPG 




1 /iW 


A 




•a 
j 




V T V<3T T 1? HflTT . T TAT .Q e H * T7 VT * \I(~ZT M T 

NMLAFIPVLTKKINPRSTEAAIKYFLTQ 
ATASIILLiIAILFNNIIiSGQ*TITNTTN 
QYSSLI I IMA1AI KLGI APFHF* VPEVT 

QGTPIiTSGLLtjLT*QKIiPSSISSRKQPH 
p 


3360 


17261 


A 


3383 


5 


398 


FFGHPEVYILILPGFGIISHIGAYYSGK 
KEPFGYLGMA*AMIS IGFLGFIA* AHHI 
FTVG TD VtfTPAYPTS ATI I TAT PTRVKV 
FN*LATLHGSNMK*SAAAL*ALGFIFI 1 F 
TVGGLTGIVLANSSLDIVL 


3361 


17262 


A 


3384 


1 


250 


I/3vrrGKTIiSIiQKNV* KIiARCGGTCLRSQ 
IiIiRRLK^EHdiTPGGRGCSEPRSCHSSP 

AW ATROnT,QfYV*inOTfYPT ,P T T? Q JV T T ffQ 


3362 


17263 


A 


3385 


456 


0 


I FFFSQVETGIHHIGQAGLQLLTSGDAP 
CVGLQKSWDYRPEPLHIiAWLIHSIiIiSNF 

FLFS YS *NPCAQ * KNEGPLLNKNPCRIT 
DAWADAWVAPVF 


3363 


17264 


A 


3386 


1 


225 


PSRNFLVLVDKLILJCFI * KYRGPRTAKT 
TLKKKKKVVRLTLLIFKSYYKTIVITIG 


3364 


17265 


A 


3387 


3 


400 


LLILGLIiTNTLFI YP A WGDATRESTYQG 
HHTPPVQKGLRYGI ILFITSQVFFFAGF 
F*AFYHSSIiSPTPQLGGHWPPTGITPIiK 
PLEAP1^NTSVLIiASGVSIT*AHPGLIE 
nnptjptfpat.t.ttmtt/2t»vi? 


3365 


17266 


A 


3388 


24 


407 


iasgrpfffffffpkgkffi srgggpfs 
knpslsfypfggkpplgpppklvppss* 

- PGKKAFFFF *TPPPGFLTPTjGGGIiSPIiK 
GGYLIiAPHFVPTEGPL 


3366 


17267 


A 


3389 


148 


413 


L^LNNNNWGKimLLNTYSCT 


3367 


17268 


A 


3390 


157 


14 


RVSACCQVWW*MPWPATREAEARESLB 
PGRHRLRS CHCDRDPVS KS 


3368 


17269 


A 


3391 


2 


398 


LFSTNHKDIGTLYIiLFGARAGVRGTALS 
LLIRAELGQPGNIjIiGNDHIYNVIVTAHA 

fvinffivipiiiggfgn*lvpliigap 
dmafprinnisf*llppsllt j lvayaiv 

EAGAGTG * TVYPPLAGNYSH 


3369 


17270 


A 


3392 


2 


395 


DQKDIGTLYLLFGA*AGVLGTAVSLLIG 
AELGHPGYLIGNDHVYNVIVTAHAFVI I 
FF IVIPI I IGGFGN* LVPLI IGAPDMAF 
PRIi^ISW*LLPPSLLLU^ACAlVEAGA 
RTG*TVYPPLAGNYSHPGA 


3370 


17271 


A 


3393 


396 


54 


GYPPNLTFVTSSLTSVPSPSHPPSHVCS 
PTPAIPQTPRSPPSESTPPTBVSLTTPT 
IiSLGAISSLASAGPQGASPASPGFWGTS 
PP* PYS * NRPQTAACGVFSQLFF IR PEV 
P 


3371 


17272 


A 


3394 


417 


60 


KGWVFQRILWPGPGGSPRYSRPFGGPRG 
IiIPriGSGV*TPPGPHWETPFFFNNQPIS 
GQGGGGPLIPliLGGVKPEKIFYPGNQSF 
H* PKIPPHPPPLGAJCQNFPPKKKKPQKN 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=Pfaeny!a)an ine, 
G=Glycine, H=Histidine, £=Isoieucine, 
K=Lysine, L=Leucine, M-Metbionine, 
N-Asparagine, P=Proline, 

r^==/3ttl4om inA PaArftinSna G— Carina 

V=\jriuiani!ne 9 tv^Arginine, cr-oenne, 

J.~ 1 UI VUI1I11C} T ToUllC) 

W=Tryptophan, Y=Tyrosine, 

In If nrkvvn *sStnn rnHnn /=nnecihlp 

A— UUIUIUWII) — OUJp LUUUII) / — £JU»olUIC 

nucleotide deletion, ^possible 
nucleotide insertion 














TKGFWV ! 


3372 


17273 


A 


3395 


417 


3 


FFFFFFFQHHFGFEAAA* YWHFVDW* L 
FLYVSIY**GSYSFSINS 


3373 


17274 


A 


3396 


229 


1 
1 


NLWDTAKAVLRGKFI ALNVP INKLBRSQ 
ISNLISQVKEGQAWWLMPVI 


3374 


17275 


A 


3397 


344 


165 


ELKS PQPRKRNGCFFLAVDPDEGF * NLN 
FCGNGQGLNAPGGYPCE I PMGFVPNAYG 
Mr 


3375 


17276 


A 


3398 


1 


409 


HKD I GTL YLLFCARTG I LGTALSLL I RA 
ELGQPGNLLGNDHIYNGIGTTQALVRIL 
FIVIPI ITGGFGN+ LIPLI IGAPNMAVP 
RINNISF*L* PPSIiLLLVAYGIVEAGAR 

X\j X vj X rtrLuiAjri X on.r'Vanl— V Jjlj Xi*> 


3376 I 


17277 


A 


3399 


3 


396 


KDIGTLYLSFLS * AGVXK3AALRLLIRAK 
173QPRNLLRNDHIYNVIVTAHAFVI I FF 
IVI P 1 1 IGGFGN * IjLTLI IGAPDMALPR 

TKTKTT QC* T.T.DDQT.T .T.T.T.2VVf5T VPATSZVPT 

G*TAYPPLSGNYSHPGASV 


3377 


17278 


A 


3400 


49 


350 


KSNSHTEVQAGQEQKSFFRSKHKAGRAG 
SDPAPYVLAGCLCRSISSFLFFFFSLFF 
FFFFKRDGV*QGYSSGSNTPGVKQSSCL 

oXixrlVX WJJXrvrt-olrXaIlr*i* 


3378 


17279 


A 


3401 


1 


398 


KPPAITQYQTPLFV*SVLITAVLLVLSL 
PGLGAGITIIiLTDRSIiNATFFDPAGGGD 
PILYQHLF*FFGHPEAYILILPGFGI I S 
HIETYYSGKKBPFGYIO!G*AMISIGLL 


3379 


17280 


A 


3402 


388 


42 


TPLFNPPGAKCVNPFFPKKLKKKNPSRA 
MGGPPLFPPLWGGKPKGFP *NRKSGAP V 
THFAPPAPPPGGKREPPPPKKKKK30CRK 
EEKESNRAVQ I KKRGNKKGRLP VLD PAE 
tsioy 


3380 


17281 


A 


3403 


413 


2 


SPHPLKNFFLPPKPPNSGGGLAPFVPPQ 

V-*r»C T. DVT OPfiDYlfCJ DDF k" P JfTNTZVl*. D DP 

GKPGPPRASSKRPPLFFFFFRGRVPLCH 
PG*STVQOTQAQCTLELLGSSNFLTSSS 
PVTWGHRQGPPRP INFFIFFRGRVP 


3381 


17282 


A 


3404 


433 


274 


LGWAHACNPSTLEG * GGRTAWS PGVQD 


3382 


17283 


A 


3405 


131 


427 


GPPPQPFFFWGKRGGPNPFFFFFSEKRD 
VF*QFFLRRRGPQKFFKPPFFCLQiCrSL 
FSEKKKKKGFGPPLFPQKKKGCGGGPHY 

NISSLQALPPGFT 


3383 


17284 


A 


3406 


413 


137 


SPPPPPQGGIFFFFKPR*ETFSPPPNLG 
FFS PPSPLKFFFFPKGFIFLGGGGPKFP 
PPKKRFFFQNSPGGFFFPPF* KKNFFFL 
PPVILGPPRVFF*GPPPPKK 


3384 


17285 


A 


3407 


319 


540 


IVFKIX2CEESFTTLNVDIRNHQNLLDSL 
EQHGKGDLL*GANAYHCEKCNKKVRAVQ 




1 1 AO\J 


A 




9 


401 


SLHLGGGSCIVGAIYFITTIIWIKPAGI 
TQYQTPLFV*SGLITGI LLLLSLPVLAA 
GIT I LLTDRNLFTTFLNPGGGGDPIIjYQ 
HLL*LIiSRPEVYILirjPGFGIISHIGTY 
YSGKKEAFVYIGW*AIISIGC 


3386 


17287 


A 


3409 


2 


142 


KKENYRP VS LNRDAKS LNQ IQLHI *KI I 
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C=Cystetne, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=G lycine, H=Histidine, I=Iso leucine, 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














HCNO VG F I PGWOCI W FN T T . 


3387 


17288 


A 


3410 


407 


48 


PPGGKIFFYKNPEKKIFPTQQKKGVFPP 
FPLKNFFFPPRGKIFGGGGGQNAPPPKK 
GFFPKIPQGVF*SPPKKKKKNFFPPGKI 
WAPQGVFLKGPPPFFFFFFFFFFFFFFF 

r c c v rv 


3388 


17289 


A 


3411 


3 


402 


SYRLE*QLEDKAAPPQDFTQQtiRWPEKQ 
HRMWPDKEDMHKQLVEASETLKSQAKE 

NHKVWDKEEEMEVAMHKADMMWQEI*RS 
KKLTKRMLFTQMRR ITPVI PS 


3389 


17290 


A 




i 
l 




HWTP PWGTE*DCL* LKKKQSAYLPNNT I 
KKLTGSRTIEDICTEYNK 


3390 


17291 


A 


3413 


202 


1 


IVMSWPGWTHA* I PSTLGGQGGWITRS 
GVRD PLAS AFQ SAG I TSVSHCS QLI FVT 

\/T7VT.QDT.CITJT.V 
vc XijojrUotlu 1 


3391 


17292 


A 


3414 


407 


309 


PSFKASKDRLTLLLGANAAGDLKFTPMF 
IYHSENPRALKNYTKSTLPVPYLCIRNI 
KAWMIAHLFTAWLNEHFKP I VETHCSGK 
KZSFKIIoLLINKAPGHRRAQMERHKMNV 

l/BMT.JVRrnTDTT.WT MTYY2\TT?*T1?TT rT OT 
W cnilnH J. X wr J. JJJULiVUJuov t?" i.f ILUlibl 


3392 


17293 


A 


3415 


404 


46 


LKKPPFQKFLKILKKKKKGGGGGPPP+T 
PPFGGGRGGVPPRAGVSKNPVPQGETLF 
FFKNPKFPGVGGPAP * FPLLRGVNKKNF 
FNLGGGGFKKRKFSPPPPPRGKKFFFPK 

ivcvr\.ivX\.xvCi 


3393 


17294 


A 


3416 


23 


404 


SVLITGAMjLLSMPVLGAGITIIiLTDRN 
LKTTFFDPAGGGDP ILYQHLL* I FGHPE 
DNIIjILPGFGIGSHIVTYYSGKKEPFGY 
IGMA* AMKPIGFLGDIAGAHHVFTG* ID 
VNTRSYFTSDTINMG 






a 


J41 / 


Jo 


4oy 


GDPEVYI LILPGFRI ISHIVTDYSGKKE 
PFG YIGMA* AMRS IGFLGFIG* AHHIFT 
V* IDVDTRAYFTS ATI I IAI PTGVKVFT 
* LATLHGSNMK* SAAVL*ALRFI FLFTV 
GGLT 


3395 


17296 


A 


3418 


1 


419 


FSTNHKDIGTLYIiLFGA*AGVIjGTGLSL 
LIRAELGQPGNLLGNDHIYNGIGTAHAF 
VIIFFIVIPIIIGGFGN*LDPLIIGAPD 
MAFPRINNISF * LLPPSLLLLLASAIGE 
AGAGTC * TVYPPLGGNYSHPGAS VDLT 


3396 


17297 


A 


3419 


397 


107 


KNSAPPFSPIFIaP*GPKFAFFPPGKKVG 
KKI F * KFPAPGPOGVYLREGGGS PSPPO 
TPNWPNKKAAFVLPPPPFFVFGNPKKKK 
KKAVAFSS PNRR 


3397 


17298 


A 


3420 


3 


253 


RGCKEMRLCHCSSAWAIAGVSEEKKKKK 
KNWQ * CLLG I CS YLDLWLPGLFKWLSQ I 
GRI VELOLIiELKO YIiRGTTKOTTjK T T.H 


3398 


17299 


A 


3421 


1 


393 


RTRTIKTPAITQYQTPLFVGCALITAAL 
LLLSLP VLAAG IT I LLTNRYLNTTFFDP 
VGGGDP I LYQHLF * LFGYPGDYI VI LPG 
FGIISHMVAYYSGKKEPFGYIRMD*AMI 
SIDFLRIIV*AHHIFTVEN 


3399 


17300 


A 


3422 


142 


377 


LSKNCYKPSYVGFVCQCKFFNQLSKFLE 
VKLLYCMVKIiC* * EG* MAQPL* KQFVLV 
ALKLNIVLPYNKAISLPGIYSTD*KTYT 
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N=Asparagine, P=Proline, 
\/ v> i u uii u ill xv— Arginine, o — ocrine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

Y=I Inlf nnwn *zSfi\n j*n/lf\n / — nnrrl K lt> 
./*« uuMiuwiij — oiup cuuuii, / — ^>Uiaiuie 

nucleotide deletion, V=possible 
nucleotide insertion 














DTQNLHTTVYSSSIjI I IKN* KQQRCPSV 
GE* INKCPI YTTE YYSLTKRNKIiSSHMK 
A 


54UU 


17301 




1 ATI 

3423 


2 


Ariz 


Jjil_ilrljr vjlionl V 1 X X oLj JVTLci rfVjI J.Vjn 

G*AMISIGFLGPIV*AHHIPTVGIDVYT 
RAYPTSATI I IAIPTGVKVFR* LATLHG 
SNMK* S AAVL * ALRF I FLFTVGGLTGI V 
LANS *VDIVLHDTYYVGAHLHYVLSIRA 
VFAI 


1 At\\ 

J4U1 


i Tim 
17302 


A 


lA^A 


2 [ 


A1C 

43o 


T XI r r*PT?I?n O 7S. t~ % f2T!T\'0 T T VflUT TP* TEYTJ'Dty 

VYIMLPGFGI ISHIVTYYSGKKEPFGY 
IGMG*AMISIGFLGFIV*AHHIFTVGID 

LHGSNMK* SAAVL* ALRFI FLFTVGGLT 
GIVLS 


3402 


17303 


A 


3425 


1 


193 


PTRPPTRPACS ELRSRHCTPAWVTE *DS 
VSKKKKKKKFSPARGAHVCTPTFLGGQR 


3403 


17304 


A 


3426 


3 


446 


HKKFLFPKSFFNRVFPPTKKKNPRGPGF 
FFFFLKKI FFFP PGGI * SGGFGSLQLF P 

FFLKKRGFSFFGGGFLK*NPGFPPPPPF 


3404 


17305 


A 


3427 


202 


1 


FSPGRMLGNTIKLPYTHSFILMNSQNNL 
EKYKVGGI PLSNFKTNYKATAIKTMWYW 
HMNRYMD*WNR 


J4UD 


\TifUL 
1 / JUD 




I/ion 
j4Z5 


IDS/ 


1 
l 


LSSLQPSLRGFKRFS * VSLVSSWDH 


3406 


17307 


A 


3429 


22 


298 


ESYASTAKATERDMSDRMLSASGMPRRY 

IFMI *EAFASKRKVLIVEEPSINIiE*LY 
GCPUTiPGR. 

w v-.* x\ L i rvvjxv 


3407 


17308 


A 


3430 


267 


26 


GITFFFFFWLEI*LYYWLFLANKLYFHB 
QIWFGLVFFPGLLLFDFSFYSDYFPLLL 
CTjGSFSMF* CTjIHCFFSVT T VLIiF 


3408 


17309 


A 


3431 


274 


376 


NKG*ISCSISPPQHTVRRGLTLSPRLEC 
SGVI SAHCRLCLPGSSDPPASAS 


3409 


17310 


A 


3432 


177 


47 


FYFFFFFFFFFFFFFFFSLLFFFFFFFF 

ffffffffffffoinrsfiasfkyhk*v 
lifkkkkm 


3410 


17311 


A 


3433 


379 


83 


fkrhftkedlrmvtehikscssslatra 

MOIETTMAHCYTFMRLiAKIKKYOYTKCC 
*R*RVTGTLTYCWWBCKMTQPFWKTIVR 
RFLKQYIYHVTGYV 


3411 


17312 


A 


3434 


214 


3 


AASTFFFFTRSTYYVCIFLKSKRMINK* 
NYIKLKS FCTTTTKINKMKRQPTDWEKM 
FANHINHHEGLILK 


3412 


17313 


A 


3435 


1 


353 


GGQGQPGQ*SETLSQ*KVSWAW*LQLHR 
RMRQDNLLSPGDQG*SEPC* CYYTPAWA 
RVRIiCQWRSVNEKSLHDICNDKK* QNI * 
1 1 S VPEGKQRMKGLENLFNE I LDENYPS 


3413 


17314 


A 


3436 






VFMEVDFFFFPFFFFFFFGKRNFFFPPR 
GGGGGPF* IFGPPPPRGKGNFPPPPPGK 
GGKKGPPPPPGFFFFF*KKGGSPPGGGG 
FLTPPLWDPPPWPPQRGGFGV*TPWPGP 
GSDFLNFFFF 


3414 


17315 


A 


3437 


108 


2 


PVAVAHSCNPNTSGGRGGWIS *GHEFQT 
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nucleotide deletion, \~possible 
nucleotide insertion 














SLANTVK 


1415 


I / J iO 


A 






3 


uJ-jJ-iliKH.]? e ljFJVnKJi Vb IFVr I r MRQGIjT 
LTEAGVQWRSHGSLWPQTPGLK* SSCLS 
LPSIWEYSAYHHIR 


3416 


17317 


A 


3439 


1 


186 


QTSSFSFRRSVITNFFKLKDDV*TH*KE 

7l l/"HTT I/Tit t**\YT»T-^Tr t n t mrrmT ^ * >s.y m**-*-* 

AKNjjJB KRIiDEWL I R I KSVBKTLNDLMEL 
KTSTRP 


1417 


1 / J 15 




1A A A 


290 


3 


PGJjGGGPRYSRFSGGLGPKI PLNPEGEG 
SIKLNPPPAPPPGGPN*DSFIiPPPPKKK 
jS-cuS. 1 rl V Lie WjjijDi iiijy Fijy t'ljAr i IK VIM 
GGLAQQS PKVL 




1T11Q 


A 


3441 


lOJ 


I 


PAJLjJjP VKPPKr QDKJbNFPFGP FFFFFFF 
* I *DKVLLCHPGESATVHS SIjGDRA 


141Q 


1 ijZSj 




"5 /I /IT 

344z 


131 


1 


KPKKP IjT I KS IGHQ WY *TYE YTD Y GGIj I 
FNSYILPPLFLEPGS 


1490 


17191 




3443 




2 


PUAAHS I GHLiP LiR FHIjRXjGAVTHAGNP R 
TLGGQGRWIT*GQEFKTSLAKRVRP 


3421 


17322 


A 


3444 


2 


130 


WEKI FANHVSDKGLIYRIYNELQQLYK* 
KTNG pi kkrpks trp 


3422 


17323 


A 


3445 


2 


117 


IiAIJjGIjKPGLK* ATCLGL PR YWD YRHQ P 


3423 


17324 


A 


3446 


350 


101 


kafssslgssrkkgdsdagfcihfktvp 
fgffwtlelkkgipfflnffffpflrvl 
kgsknpf* krgpffffffffdnfyiwn 




17325 


A 


3447 


1 06 


1 


LLQPNGQLGPGTVAHGCNPSTLGG* GER 
ITTSGVR 


3425 


17326 


A 


3448 


28 


333 


GQQEQNSISXKKKKKKKKKTPFKKGKRK 
I PGGKKKPLGKSRAPPPLGGGEKNPPPG 
KRAPEKI FKKKPRGNWETFKRMEKGFLK 
GQ * PGNP * KVKPL WGGKK 




17327 


A 


3449 


81 


339 


PIVSSCPGVAGALKQAMTLEFKVYQHHV 
VANCRALS EALTELG YKIGTGRDTDGVQ 
QALlLi WGIiRLiAS * P I V SSCPGVAGALKQ 
AMTLEFKVYQHHVVANCRALSEALTEIiG 

L A~Lvj lijKlJ L VyyAJjLiNA/GJjRLiASQrV 
QPGCGPGSAAAAAMM 


3427 


17328 


A 


3450 


166 


322 


RDFSTTVLITDRTSRQKIPTDVSQLNLI 
DIYKTL* PTIGKYTFFSSTYRSLR 






n 


3451 


257 


354 


JU? lorr JfcuJKV JjLCCPGWSAVV*S* FTAA 
LSSW 


3499 


1711ft 




345Z 


OCA 


3 


TDQQKTKEIi* EEQLYAHRVDNLDEMDKF 
LEGYK*PSMTHGKIENLC3?PITSKDIES 
YTKNTjTjTDPP 


3430 


17331 


A 


3453 


358 


353 


R* *KLLMMPC* IETNKT*NTCTVI KEEK 
GQPGTVAHACNPSTLGG * GGRIT 


3431 


17332 


A 


3454 


108 


353 


THIiGGNPVRIiYAWLLLHYVAS YRI FSVS 
nUrsJATi XijPi.laL*NFIjMEUGTVAHTYNP 
S ILGGQGRRI T*VQEFETSLANMVKC 


3432 


17333 


A 


3455 


342 


50 


GPVTPNIPPVPPLGGQKELFPPKKKKF* 
KFWGSPPPLFFKKPKNPRETPPESCPKP 
KKTPIPVKV*GKKTP*TFSKKKKTPPIP 
ELFLLKKKKKLIN 


3433 


17334 


A 


3456 


343 


2 


KAPFFFFFKKFWSQKKRGLFFFFFKNFF 
I PGFPGPTFFFWGFFFFKKGPPVF* LKK 
FFGGPLPPLKDPPGVFPFSPF*FWDFPP 
PPPFFFFFFFFFETVSLCRPGWSAYSRD 
C 
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Amino acid sequence (A=A la nine 
OCysteine, D=Aspartic Acid, 
E=G!utamicAcid, ^Phenylalanine, 
G=Glycine, H=Histidine, IHkoleucine, 
K=Lysine, LHLeucine, MNMethionine, 
N=Asparagine, P=Proline, 
Vp=vHutamine, K— Argtnine, S=oenne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1 A*i A 

3434 


17335 


A 


3457 


2 


259 


SLRSQNGGENSV*SCPSTSSTAALNTAA 
AANPLALNPQVSLSSLSCCPISIKGSFV 

f 


3435 


17336 


A 


3458 


186 


334 


AGGNKKMEKRNM *NKE I F* WFKKNWPGA 
GAHACNPTTLGGRGRRIMRSG 


3436 


17337 


A 


3459 


207 


1 


YFKPTLKINDKGYIiNETIDKFDPRDINR 
TSNJjTIj* EHAIiFPSTCGI rTflRRDY I EGH 
EVNLNQFQRIRTQ 


3437 


17338 


A 


3460 


268 


3 


PLNPGPKQSSPPGLPKPWPFRGEPLPLG 
PWFFFVKNRGRP * AFFGTNPLAFFLPFL 
*DRVLFCHPGWSWVQSQLTVQPPPLRL 
KQFS 


3438 


17339 


A 


3461 


350 


2 


AETAHNINNAFGPGSANKCTVRWWFK*Ii 
CKXSVKRIiEDKALGGWPSICVAXNQLRAI I 
KGAPLKPTRKVAKELNIEPSAVIQHLKR 
IGKVKKLHKWVPHELHENKRNRFEVWS S 
IxIIP 


3439 


17340 


A 


3462 


'312 


1 


HTVTYYSGRKEPFGYIGMV*AMISIGFL 
GFIV*AHHIFTVGZDVI7iRAxFT5V 
IAI P PGVKVFS * LATLHGSNMK* SAAVL 
* AT/^t?TT7T T.pv^nT/TPTVT.2! 

~J-iXAJr X g i-tLtf V OVjXj X tvX V Lxt\ 


3440 


17341 


A 


3463 


181 


3 


FLPLIAAFSGLLLFSGFLGVFLFCFFET 
GTH 


1 A A 1 

3441 


17342 


A 


3464 


176 


263 


VvanxJAiiltiroxxiVJvjJr^^ 

A 


3442 


17343 


A 


3465 


239 


336 


EPGEVAYSCNPSTLGGRSGWII *GQEFE 

TOT *P 
XoXtX 


3443 


17344 


A 


3466 


88 


326 


QTSLLNTI**PLH*KNIPLPPPPI,PPTIi 
X rr if ofr'Jrrr i J FJrir JSJUSJSJUVKJx 


3444 


17345 


A 


3467 


4 


345 


eeerrrerkkkrrkkeekkkeerereek: 
erkkkkkkke kkkrrgrqekrkgtg * ie 


3445 


17346 


A 


3468 


265 


335 


QMNPDRHLQF * DKVLLCHPGWGTVARS * 
LTATSRSSC^SPPSGWLX^VPPOLASF 


3446 


17347 


A 


3469 - 


115 


342 


INKQINKIKDPTSYNIIiLSQSLI*FKAL 

I XxuIN o V JvAy KvjiS IV_A. I JBalLr ISuo lt5 T rWK 

FKQINHLYNIKVQGEKPSTD 


3447 


17348 


A 


3470 


56 


328 


RTYPTKPQKMRDPEDPTSRPRYIPRHLY 
ITDBVNDPSLTIKGIGHQWD*TYEYTDY 

Cim .TffHQ VTT.D OT.PT.nDCnT.TJ T .T TMmtJD 
vjoJj 1 rKb X Xlxr Jrxtr JJlSJroUlxKIiiAU V JJIlK 

VEPATRA 




i lyv? 


j± 


1/171 


j 




VNDSLVMPRPDKNHQWAFNKNCFPLVDV 
GIDPYLVYHLRPHQAQGI I FLYECVMGM 


3449 


17350 


A 


3472 


154 


2 


PNLLYPKSYWKKKDIWTPMFIVALVTIV 
KIQKQPRCPl J MDKSIR*Ii*YIH 


3450 


17351 


A 


3473 


132 


340 


AIxKTEKITxTiCxiDF*LIETVR* *MCVFK 
IK 
















1 lOJC. 






3 




vnooxriujijui iniioon rri vvjxjxvx 
NNUjAFIPVLTKKINPRSTI^IICYFLTQ 
A17^IFLLIAILFNNIIiSGL*TITNTTN 
QYSSLI 1 IMAIAI KLGI APFRRRGREFQ 
IY 


3452 


17353 


A 


3475 


1 


254 


FHHVGQAGFTPDLR*STRLGLPKCWDYR 
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REALCLVHILLYLPMIGPFRIQKLLKAA 
VFVFLQLHQAFIQSLLTLFYFISFPCNS 




\n*l <A 


A 


j*t/0 


1^ 1 
j5 1 


loo 


LAPAS * SAGIRGVRHNTWPVLT ITLN 




1 iJJJ 


A 


J*f / / 




•3 
J 


Ur» K. V r fc» 1 Pxj 1 KJJJN V S b X JjRL.r xjMAYGVE 
PALCDRLRTQPFQAQPHQQKAAVLAPLV 

NVKERLRRIiKTARA* RLGRSQVKK 


3455 


17356 


A 


3478 


373 


248 


NRLNPGGGGWSBPRSQHSTPTWVTEYDS 
VSKK* INKNKNWK 


3456 


17357 


A 


3479 


3 


92 


EGQGCSEP*SRYCSPAWVTE*DCVSKNF 
r 


3457 


17358 


A 


3480 


380 


2 


VEVPLLNTSVFFASGVSIT*AHHSLIEN 
NRNQI IQALLITNLGGLYFTLLQASEYL 
bbrifl ISAbl YGS i r rVAxvjFHGIjHVTI 
GSTFLTMCFIRQI I FHFTSKHHFGLEAA 
A*CWHFVDWCIFL 




1 lijry 


ri. 


"3/101 


225 


1 


&^AERVVEQVMIAIiFSPNY IRLSFPI S P 
TPVVRHKFRFHLKGYIWPGTVAHTCNPS 
TIX3SQGGQITRSGD* DHPG 


3459 


mm 


A 


3482 


332 


1 


IKYFLTQATAS I ILLIAILFNNILSGQ* 
TXTNTTNQYSSLI I IMA1AXN*GIAPFH 
F*VPEVTQGTPLTS 


3460 


17361 


A 


3483 


277 


1 


LSMSS FSISSI PSAS INS* ALLFSSAES 
NSIjE CT FS SS FS PNS S ESFS SS FPSCSC 
AS*FSNSQSLIKASSSTEASPLNSSASC 
FLPLPLVG 


3461 


17362 


A 


3484 


404 


209 


ALSVQPLWRPRQENGVNPGGKASR*PRF 
P P L.G PAW Ax KG Eli V P KKKKKKI S FLGHD 
QSLELLLQR 




1 /jOJ 




1AQK 




"5 /IT 


c L.XxjERluUfLxvv * CWKFFFFFFFFFLE 
KGFFFLPLIiVFFWCYYHLKGASLP*VFL 
FFYLHCYSSLFTLFFIFLQSLTPHFFLV 
* TOFYYDPLMFFFIj TLFLELV 




1 /JM 


A 


1A O/C 




161 


y iSJvVKUP i isyij* AN KFKN* EEMPRFLE K 
HKLPKLTQEEI KNQ 


3464 


17365 


A 


3487 


311 


3 


PRRGRFQLTQNFALPFNDGNKTKI PFKK 

J\J\J\iVX ElrXj^FL.XUvt-AAjVLJjCirillj^X * 

PC^CaHDCDNIOrO/NIC\WCVVTHLH 
TGLCS CVDCDHI CVGTCC 


3465 


17366 


A 


3488 


245 


1 


RPRRLLALLLI I ITF* LPQLNGYIEKST 
PYECGFDPISPARVPLSIKFFLVAITFL 

T.TmT.17T31T.T.T.T>T.O* JVT J^TVTKTT TJT TV 

ur uud XAuxjLUrJj ir « f 1 w 1 1 r 1 1 1 V 


3466 


17367 


A 


3489 


2 


400 


AAP I I^MLGLRQRGKYKRQRQLIiCLWS A 
ILLFLYFLN1CLL* LTMCKKKKKK3CKKKK 

KPPGGFFKGGGGPKKIFFFKNRGALFFK 
K^POKKNMPGPPGLKNFlxKEKKKA 


3467 


17368 


A 


3490 


57 


366 


INVFAGKDQLQPLCACPSSPI ISVESDR 

iri-fi xyV-V3HJl>ltvr X XXiXX I LiKno £ Au Vi^A 

GV*WHHLGSLHPLPPEPKCFSC 


3468 




t\ 




25 




1Kb if r r nvjvjijvivjxjo xvxnWPJj* JKJvxMr oP 
PKLGVLPPPLFFPGGGFFFFFPGAPPGF 
KKKKPPPKKFFF* KKGAPPFFFFFFFFF 
FF 


3469 


17370 


A 


3492 


170 


41 


AQPDQHGETTFLLKIQKLTRHGGI CL* S 
QLIGGLS F ITQKITC 


3470 


17371 


A 


3493 


1 


240 


VLS PADKTNVKAAWGKVGAHAGE YGAEA 
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LERMFLS FPTTPS * FIiDFFFFIiEKIGFH 

IXW/T^C CT J~*T7T TCDHDCSVHOT T^C 
» Vvi^oo LAsC Li 1 OKU fb AxiNo LtlA?o 


3471 


17372 


A 


3494 


359 


204 


TQWLMP VI LAP WEAE VGR * PEVRSLRPA 
LNS P FFKRL WIMMYIMNI I ILS 


3472 


17373 


A 


3495 


1 


178 


DRI*SPGV*GSSEL*IiC2ICTPGWATEPDH 
VSRKKKKRKKVGLGKNKAKQKHPRGYYR 
AEL 


3473 


17374 


A 


3496 


361 


1 


LGPKLLKKPFWIS IGIKGVAGLPGFFPT 
PPQ * G * w FGliKKGGrpr Y FFfciAKGWFNT 
PGGKQRGPFRSLEPFPFRVKLFFPPNLP 
NNWGHKGAPP KPP FFFFFLRWS FALVAQ 
AGVQWCDL 


3474 


17375 


A 


3497 


276 


79 


AEVYKIKCELGAVAHACNPSTLGGQGGR 
ITRS * VSASFTKKYWQLAMKEIiLNE 


3475 


17376 


A 


3498 


3 


476 


FGTSMS I ITLITPRIARGNNNLFSCFPT 
FTFDHLATPLLI LTT* LLPLTSMASQRH 
JjrREPLSRKKIiiLiS 1 IjNSXiQSSIjX ITFT 
GTELI IFYIFFETTLI PTLAI ITR*GNQ 

PERLNAGTYLLFYTLVGSLPLLIALIYT 

tjtit) pr /igT »ttt T.T.nrcani? 
liiN 1 Lrfjo JjW IJjjjjj 1 VoAUc* 


3476 


17377 


A 


3499 


581 


2 


AHACNLNTLGGRGRWIT*GQEFtrrsiAN 
K 


3477 


17378 


A 


3500 


380 


135 


RHQSGQIGfCTPSLVKI PKLTPPKGGVL * 
SHLTKRARQEKGLNPKSKGFN* PKLRPC 
PPT I VAKKQNP VS KKKJvQTKQE PSA 


3478 


17379 


A 


3501 


1 


386 


GSR * GNHPERLNAGAYFLFYTL * GSLPL 
LIRLIYTDNTLCSLNILLLTLTGQELSN 
S * AHNLI * LAYTIAFIVKI PLYRLHL* L 
P KAHAQAP IAGS I VLAAVLLKLCG YG 1 1 
RLTL I LNPUKKHI DYP 


3479 


17380 


A 


3502 


2 


389 


EKSQL * KDKQVYRATHRLLVIiGAGESGKS 
TIAKHMRILHVNGFNGEGGEEDPHAARS 
YIDGE KATKVQD I LJONLKEGI ET IGAVM 
SNLVPPVELANPENQFIVDYIIiSAMNVP 
DrD F PS a FxKHAKALiW K 


3480 


17381 


A 


3503 


2 


379 


PFGYIGMV*AMISIGFLGFIG*AHHIFT 
VGIDVYTRAYFTCATI I IAI PTGVKVFS 
*l4AliirlooJMrfUV*oAAal*^Aljw IV 
SGLTGIGLTNSSLDIRLHDTYYVVAHFH 
YVLS IGAVFAI IGG 


1 A O 1 

3481 


17382 


A 


3504 


281 


68 


NK*KNIPCSWIGRISIVKMPTLPKVTYR 
FNAFPIKLPSPFFTL 


3482 


17383 


A 


3505 


139 


263 


ETGSHYVPQAGL* LLASSDPPATASQNI 
GITHVSYYVQPL*PIFWEAVAGGSLEAR 
S 


3483 


17384 


A 


3506 


2 


427 


T VPeMTMTCTTT T «T**T 01)71717114 7V/W/VPT1 

liKIfaMlMlb 1 IliLtT* JjKPAWH*AQYQTP 
FFV*SALITAGIiFLFLPVLAAGITVLL 
TDRNLNTTFFDPAGGGDPILYQHLF* FF 
GHPEGYIIiILPGFG 1 1 SHIGTYYSGKKE 
PFGYIGMV*AMISTGFLGVIG*AHHVFT 
Gvj 


1AQA 






"7 CAT 

3507 


343 


i 
5 


\j ri r±o JXO ouunoo Ir oo f onKoKiiK V oauo 

LGRL*WSKSSKTSVSCGVVARDTEKPPIi 
SKIEASSSDSCESVMAKANAVPFTSTTC 
EKPASSASLSLSVWASGSSPPGAANSSS 

s 


3485 


17386 


A 


3508 


3 


363 


GPGDKAARRIALENLD IT * KWGMRTHKC 
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AKQLLS KQPF *SSRTGDLKLEIjQIELVR 
ETKRKC * RVL* IDRALTSHLYSLLQTQD 
ALGDAS ADL I Q KTS * LQEE FG YYAKTQK 
LLCRNGET 


3486 


17387 


A 


3509 




AT. A 


TKELLSQRFGLGS PTVDLBL * LQIELLR 
ETKRKYESVLHLGRAIjTAHIjYSIiLQTQH 
ALGDAFADLRQKS PQLQEE FGCYAQTQK 
LLCKNGETLLGAANLFVSS I YTLGTKTM 
EDTL 


3487 


17388 


A 


3510 


1 


401 


GTRKNS FLHDS QTS FCFSDS I PTPSNME 
etohksnliellri slklt e pwt/rpvp ft . 
KSMFANNIiVYDTSDSDDYHLLKDIi*EGI 
PTLMGRLEDGNRRTGQILKQTYSKFDTN 
SHNHDALLKNYGLLYCFRKDMN 


3488 


17389 


A 


3511 


117 


2 


FKFIFSEMESHSVA*AGVQWRNLGSLQP 


3489 


17390 


A 


3512 


491 


90 


K3CVKIVLAGFFFKPPPPKN*KKKGLK*R 
KGVG PRG YTVNPHP CGYKP FGGPGVQNP 
PGQKGETPFFQKKNKKPGGGGGALIPPP 
KGGGRGVPIjSPEKFGLKFPPGPPP*GKK 


3490 


17391 


A 


3513 


2 


31 


GKGAPTTSLINVAGTKI IAKWEDNKLP 
GAICSLTCGGAHIGTPMARDERVNLLSF 
TGSTQGGKQVGLMVHERFGRRLVELGGN 
NAI I AFEDAHLNLYVPSTLFAAVGTAGP 


3491 


17392 


A 


3514 


2 


390 


KEFHKTHPLENITIiAPDPEGPDGLPPVA 
YNPWMDIRQREDGQTLNIRBPYGPIPVD 
F* RKIRQS YFASGA* LDTQDGRLLSALD 
DLQLANSTI IAFTSDHGWALGEHGEWA* 
YNNFDDDTHDPLT FYDT 


3492 


17393 


A 


3515 


3 


410 


RGTGNNWAQGHYTKGAELVDSVLDWRK 
ES*SCDCLQGFQLTHSLGGGTGFGMGTL 
LITKIREEYPNRIMNTFRVMPLPKV*DT 
EGEP YNATLLVEQLVGNTDETYS IDNEA 
LYDICFRTLKLTTPTYGDRHHT^VQ " 


3493 


17394 


A 


3516 


1 


396 


GEDAANIYARGHYTIGMENIDLGLDRIR 
KLADPCTGLQGFLVFHS FGGGTGSGFTS 
LLMERI SCDYGKKNKLEVS I YPAPQVCT 
AVFEPYNYILSTHTTLEHSD*AFMVDNE 
AIYDICRINLHIERSTYTNL 


3494 


17395 


A 


3517 


2 


399 


EKIGYNPDTVAFVP ISGWNGDNMLEPNA 
NMPWFKGWKVTRKDGNASGTTLLEALEC 
ILPPTRSTDKPLRLPLQDVYKNGGIGTV 
PAGRVETGVLKPGMEVTFAPWGTTEV* 
SAERHHEPFREALPGDNVGF 


3495 


17396 


A 


3518 


1 


402 


GEDAANNYARGHYT IGTEI IDLLLDRIR 
KLADQRTGLQGFL VFHS FGGGTGSG ITS 
LLMERLS IDYGTKSKLEFSIYPAPQVST 
ACSEPYNS I LTTHTTIiEHSDCAFMVHNB 
AIYDI *RINIiDIERSTYTNIjNR 


3496 


17397 


A 


3519 


1 


408 


GSCUGQKLYNSTTKTAT*WSSNHTKKNP 
FSKFPKLQNVWTHPESHRDWTAPTRLYW 
ICGHRPYVKLPNQWAGSCIIGTIKPSFF 
LLPIKTGELLGFPVYASHAKRS IAIGNW 
KDNEWPPERI IQYYEPATWAQDGL 


3497 


17398 


A 


3520 


3 


162 


ETVLLCCPGWL*TLGLRQSSCLSLPKCS 
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DYRHATVPGQKYRFFNRYYYGSYS 


3498 


17399 


A 


3521 


184 


354 


ETGSSLLP*GSSELPVSHPYFFNTHSI*S 
CQRLVAVYRQKGS FKDQCTDI 


3499 


17400 


A 


3522 


2 


Tin 

313 


wt .q DDwirzririPnAwnpTT .pt .nTfr.AO * 

X I Xio if if l v iV7 Lf \£ r^Vo^norj X XJlrijy JNJ-irlvj OVJ 

ARLRSQIiLRRLRQENCLTPGIQACREIiL 
DKARDSVSTKTKREGRGEK 


3500 


17401 


A 


3523 


13 




F*VETGFHHVGQAGLELLTSTRP 


3501 


17402 


A 


3524 


148 


1 

1 


PGLKRSF*LSLPSSRDHRLMT 


3502 


17403 


A 


3525 


1 


98 


VNPGGEGCS EPRSHHCTTAWVTE * DS IS 
KINK 


3503 


17404 


A 


3526 


138 


3 


AASTHDLG*LQPPPPGLTPFSCLRLLSS 
WDYRPVPLRPANFLYF 


3504 


17405 


A 


3527 


10 


206 


PTTSLISFTSYSPCLKCSNVYEQVNRMR 
VLFYKSKQRMMVAHAYDPSTLGGRGSK 
J. i T byWKo 


3505 


17406 


A 


3528 


345 


3 


VTELIQSNKAKLKKKSKNKNEQSLQKI * 
KI IEENFPGLTRDLE IQIQKAJQRPGKFI 
K 


3506 


17407 


A 


3529 


3 


90 


AVL*ALGFI FLFTVGGLTGI VLANSVDA 
A 


3507 


17408 


A 


3530 


3 


90 


AVL*A1^FI FLFTVGGLTGI VIiANSVDA 


3508 


17409 


A 


3531 


354 


3 


AVI NS LPTKS PG PDGFTAKS YQR YKEEIi 
VLFIxLKPFQRIEKEGILPNSFYESSVIIi 
I PKPHRGTP KES FRP I S LMNLNAKI LNK 
RLAS *NHQHIKKVTHHGQVGFIPRTQGW 

JflNAJU 


3509 


17410 


A 


3532 


2 


357 


FTRPRFSFTFFVLIRKYALSCGKISIQH 
SY*KKKSKGKLGGL*GINFCFFERKFHS 
VTQVGGQKHDPG+LKFPPPGFRGFSRLS 

T.P<5 CWKrWTTRnTPPT .ATtfPPTPtJTrftfRnPPfi 

RKFQIY 


3510 


17411 


A 


3533 


76 


365 


KKKKKKECKKKKIOCKKKKGGAGPEIPKLK 
MGKSEKPPGVFLRGVGEKI KFFFLKKGG 
PFF*KKKPLKNKNWAREPSFKKIFKKKK 


3511 


17412 


A 


3534 


7 


353 


FFGFGGVFLVFLFFFF*KYPFFFPPPPIj 
FFFPFFKWAPRRPPPPFFSSFFKNFFFF 
FPPPPPFFFFFPQKNFPLFFSPLFFFFS 

HPQTQRK 


3512 


17413 


A 


3535 


354 


41 


AGGVPPGIPPPWGGRAGGSPKGRGFGPP 

WPfTPf5TPPPPP*TOTTirPf FTPfTTTf PrJftlPATPA 

TREGEGGKSP*PWKPRVQPT*NAPPPSP 
PGEGAKPPFQKKKKSMFHKR 


3513 


17414 


A 


3536 


174 


364 


TFNSLAFTFFFFF*RETGSHFFLEAGVQ 
GNHSSLNP*PPGLRGSSCLTL?KC*DYR 
REPLPPA 


3514 


17415 


A 


3537 


38 


368 


PGRLRLQEAVNHCTIxEPG<MSV*KKKKK 
KRGGAFFKAKLTIiAGWKGEKKKKTGVFF 
♦TLKIRPKEIRNQGGSQQRKKAELGKLQ 
QTY P VLNFKATFYGEQGD YY * T FFKP 



765 
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E=G!utaraic Acid, F=Phenylalanine, 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 

viuwiuiHCj i\— '/\rginine, oenne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

V— 1 III it It Awn ^I— 5fftn /»r»rl/ir» / — ■ nniridla 
A 1/IIK.IIUlTfl) — JlOp CQOOflj/ — pOSSiOlC 

nucleotide deletion, V=possib!e 
nucleotide insertion 


3515 


17416 


A 


3538 


107 


349 


SVTFPFFFFFLKQTGSHFW*AGGQGGN 
PTGPSNFGI FFEKRGFCLVPQASFE 


3516 


17417 


A 


3539 


274 


3 


PRRWSLPLTQMGPLPSRMGPKTKLCPKK 
KKKKIPPRKILEPDVFTGQFYLMFKEIT 
SIFLKLFQQCEEEGTFSNSF+EARIAIjK 


3517 


17418 


A 


3540 


1 


374 


ACALVRNDHIYYWGPTYALGELFFTET 
PIII*G*DN*LGPLVIGAHWALTLINY 
ISL * L* PPS VLLLLACAI PEARAGTGCT 
AYPPLAGDYSHRAAYVNLTILSI^HLARV 

DO X linL/1 LSO 111 


3518 


17419 


A 


3541 


427 


76 


RGSLSPGV*K* PGGQNQTFPKAPKLRPG 
HrHLr irAolXvjo x W 1 r*w> r FVjVtKKo Jl FC 
SKKKKNFRETPPSFFWGGPPFLKNRGGF 
MFWAS KTKNGLNKTIiGPNPGP PLGGLRS 
PFWVG 


3519 


17420 


A 


3542 


234 


88 


FFFFFFFFFFFFFFFFFFS LFFFFFFFF 
r cirrr a v err ft r »\i ww xjji r v 


3520 


17421 


A 


3543 


30 


419 


TFS FFPFLFFFGFIiVLGGEKQGKWDPKP 
PGAEKPPTFS PPKI *KKGAPPPPRVNFK 

AJ'J&tr Litre orAJV?i\XVijJw r WolVvoXuiM 

PLWVTPPKNPQKKPPFFKGPKKNFFFKG 
PKPGGKKTQI FFFLWEKK 


3521 


17422 


A 


3544 


159 


411 


VSRDISPVRLSPSSCFH*PFxiHISSSVE 
GRFSSFAQFLLSCIiNFCTFPLLFLLLRF 


3522 


17423 


A 


3545 


2 


351 


CliAHTSGVES *ASLPCGQNS YIHYLKLL 
CMEELSFPPYLLIYSISYLY*HTIMVIY 
FILVL* SILLNLI YLFVTQI FPAIASGS 
SENWLLCLFNTTLAVHVCV CVCVCVCVR 


3523 


17424 


A 


3546 


397 


1 


KPPPPCSYGKKKKtFIiLVFFRTLPTKPP 
GAGRKIFPRGPGFFLKNPKAL*TTFAPP 
PFIQKKGPFLLQQA*IPFPVFFLFKKKP 

ITT .PC P * rST2T/^\/CT .X?t?T7WI70T?'Tn'C , O OOXTTr* 
rij^or " vj'alo volif c c 17 roL i Jtt*ovJ£> v iy 

AGVQSCDU3SLQPLPPGLKR 


3524 


17425 


A 


3547 


76 


354 


FLFNFMGKIFKNFRGGVFFFFFWGGYNI 

FIiGFFFGKISQKIFFFFVN*lFLTjXXGA 
PPLFFFFFL 


3525 


17426 


A 


3548 


197 


1 


INKMKQQRMD *EKI FAKHVSHKGLXSQ I 
CK* LIHLNSEKTNS P VRKW AE DLNRHFS 
KRDTKMVNM 


3526 


17427 


A 


3549 


175 


309 


KSTLKVMLISISIFNFYFF*DRVLLCHP 
GWSAVA* SLLTAASTRP 


3527 


17428 


A 


3550 


3 


196 


GFHRVSQLX3LDLLTS*SACVSFPKCWDY 
SHEPPRPAHKTSLKKNKMRVRIIIMMTI 
TYPAPWr 


3528 


17429 


A 


3551 


3 


360 


RAVEIFCYDVCVESGCADIQLLLIiCPTS 
IKVCFSSLSLDIVTVIPDIYICIYIYII 
FFFFFKQARS*CQCLNDKE*YCSLWPTP 
PGFKKLS CVSLPS I WDLRCAPSHPPNFC 
VFGKNGV 


3529 


17430 


A 


3552 


326 


1 


KDNHRILKIjMIiAKSGKVRHYVFDFGKQI 
SKMSEKNIGVKPPIGKEKTKNPKKKPSR 
TEDPKI CTYVL * PD KKKWRRFLNT* GIQ 
U3 * KEI KLSLFADDMI VYLENP IV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
.09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

ominn of*ifl 
itlUIUU dUU 

residue of 
peptide 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla!anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 

C\ — f~? 1 n+ti m in A R=AroininA ^ — Ca>i n <> 

T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 

Y— I Ink Tin wn *=Ston mHon l—w\ ncc ! \\ 1a 

nucleotide deletion, \~possible 
nucleotide insertion 


3530 


17431 


A 


3553 


186 


1 


RPRRLPIKLPMTFFTEIiEKTTFKFI*NQ 
KRVRIAKSIIjSQKNEVGGITPPDFKIj*Y 
KDTVTN 


3531 


17432 


A 


3554 


3 


366 


KTSLGNMVKPYNFFV*TFYI**KKEIKK 
NSIGVL* PGTEQSKHEFKKILPSKRNKI 

£VLv X ivi. XJ XXX XI X ijVJyL/i 1 X X CtH X i\-X 1 JXI 

KEIIEDIxNIOrKDIPCS*IIRLDLVMMAV 
tiTKIiTYRFN 

J— 1 X *\i 1 JL -L IVi li 


3532 


17433 


A 


3555 


239 


1 


GRGKMIiP VS LLN I TRVTFAP CSGS 1 1 LV 
LIALPSLRILYITDEVNDPSLTIKSIGH 

rj&TV* TYEYTDYGGL T FNSYTT. P P 
x x x Ei x li/iuviiiriiui JLJjrr 


3533 


17434 


A 


3556 


3 


123 


QAGLEV* TSSDLPTSVSQSAGITGGSHR 
ARPANIINSVP 


3534 


17435 


A 


3557 


237 


2 


KGFSPPTQKFGFIiGVFPPQKKKTKKNPR 

4r rOUU irvo X\.V3I*1 XjX&r I irifJ J? J? J? J? 

SRSVAQAGVQWGNLGSLQALPP 


3535 


17436 


A 


3558 


87 


379 


K300CKKKFTI^KPVGFLNPPNGEAKSQI 

■XXl rii 1 1 1 1 IIVX/1X XVXVJ\-C\XNXJ-I\J IIW J I\XVJ\,\JI\XVX->X7 

RGNYNPLFKKIKDDPTNGKNFPCS*MGK 


3536 


17437 


A 


3559 


390 


181 


KKiCKKNPGGGGPPFFPTFRGG*GGGPPQ 
PGGGGAPKV* ISPPPPPPGGKKSPPOKK 
KKKKKKKGPFLPHL 


3537 


17438 


A 


3560 


2 


101 


VIVTAHAFVIIFFIVIPIIIGGFGN*LV 
PVDAA 


J J JO 


I'M "JO 


TV 


x^£l 


I 


17A 


TWTT»T»ttT»T.TiTTTTTi , *T»PnT»TJnYTF , K'ClTP 

XXM X IJUrM iLlDlll X J? "iirUUNVJl lHADir 

YBCGFDP I SRRRG 




11AA A 
i /44U 


7\ 

A 




Jo 




r «xT \^XV.x X Xxru?£JLx\Ax any x OV3/-ix\-*vriV».^vj X 

NNIMKFLKKKCI PTTIFWISFFSFLFFF 
FETKPHFFNWPEGKGPNI/3*LKPLPIjGIi 
KQFCLTLPRKGNYRHVLPPPVNGFFFF 


3540 


17441 


A 


3563 


6 


130 


TLRRILEDHFSLRS *GCX5EPCLRHCMPA 
WLTE* DPVSKNI Y 


3541 


17442 


A 


3564 


3 


152 


GFHCVRQDIxDLLTS + S AHLSLPKCWDYR 
RKPPHPAGKGTYFYIDYSTIB 






TA 
f\ 




7/11 


i 

j * 


NTiVSHVI SMRiCVINIiVEONVTSSNRSNH 
TWLSFI KLYFI 1 1 IS IVFGEQWFGYVA 
KFFSDDF * DFGVPITQTVYTAPN 


3543 


17444 


A 


3566 


71 


256 


SVFlXTVFLYPLATSFFFIiNRVSLCCPG 
WL*TPGLKQFSCI»SIiPKRWNYRHEPLTP 
GSNFF 




1 fVrJ 




jJO/ 


i 
i 




AGAGTSGTDYPPLSGNYSHPGACVDIiTI 
FSlMlJ&VASMLGAISmTTVMNIEPPA 
ITQYQTPLFV*SGROTAGLLLLCVPVLA 
AG ITI LLTDPJHxNTTFFDPAGGGDP I LY 
QHLKGKRGIIICSHRDRI 


3545 


17446 


A 


3568 


334 


409 


LGTV7UIACNPSTLGG*GGHITRSGV 


3546 


17447 


A 


3569 


3 


467 


PKNPPGGFFSPL*EKKYILPPPP*IWPP 
PGFF*KAPPPFFFFFFFFFFFFFPTFQI 
YFITF 


3547 


17448 


A 


3570 


3 


420 


FFDPAGGGDPILYQHLF+FFGHPEEYML 
ILPGFGILSHIATYYSGKKEPFGYIGMV 
♦AMIS IGFLGI IV+AEHI FTG* IDVDTR 
AYFTSATI I IAIPTGVEVFS *LATLHGS 
NMK* SAAVLS ALCFIFLFTRRGLTS IV 


3548 


17449 


A 


3571 


1 


418 


NLOraSFIDLPTPSNIFA**NFGSLLGA 
CLIU2ITTGLFLAMHY* PDASTAFSS IG 
HITRDVKYG* HRYLHAKGASIFFICLF 
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M 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acio 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, HNHistidine, I=Isoleucine, 
K«Lysine, ^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
v/=vriuiamiDe, k— Arginine, b-oenne, 
T=Threonine, V=Valine, 
w— i rypiopnan, i — lyrosme, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LHIGRGLYYGAFLY*ET*NIGI ILLLAT 
XATAr XGxvljFGGQISr *GATVITNIiIj 


3549 


17450 


A 


3572 


3 


414 


SSGLAM* FHFHSITL»LILGLLTNTLTIY 
Q * WRDVTRES TYQGHHTPPVQKGLRYG I 
ILFITSEVFFFAGFF*AFYHSSIiAPTPQ 
LGGHWPPTGITPLNPLEVPLLNTSVLLA 
SGVSIT* AHHSIiIENNRNQI IQALL 


3550 


17451 


A 


3573 


2 


423 


GTLYLLFGA*AGVLGTALILLIRAELGQ 
PGNLVGNDHIYNGI VTAHAFVI IFFIVI 
PIIIGGFGN*LVPLIIGAPEMAFPRINN 
ISIj*LLPPSTiTtLLIjASAIVEAGAGTG*T 
VYPPLAGNYSHPGAYVDLT I FCLHLAGV 


3551 


17452 


A 


3574 


411 


117 


DGGPLLQLLRRARQENCFNPRGGGCS * P 
RLCPCPPAWGAKLNSLSGKKKKKKRHVE 
KYAQHDYIINVLCWCVHIKWLSLYMYS 
XTKS IDKICKKAS 


3552 


17453 


A 


3575 


2 


394 


ALANMWEQIRSIiHQYAVHRI ISLFSLLS 
KKHDRVLEQATQYLRGSIiNTNDVPLPDY 
AQDLTVIEELIPMMLEIINS *LTNSLHH 
IPNLVYALLY*RDIiFEQFRTHPSFQDIM 
UN-LJJJjVo VNiSDx I iUbr Lit* 


3553 


17454 


A 


3576 


262 


418 


GWREWTELI IFYI FFETTLI PTLAI ITR 
♦GNQPERLNAGTYFLFYTLVGSLP 


3554 


17455 


A 


3577 


232 


443 


PSVQTFPCCPLSEEGPWLLCVAMTPSPG 
VPVPTERALYSMECAFHPLFSLTSGACR 
JjDYRTPDNR*VQLP 


3555 


17456 


A 


3578 


48 


410 


GGIiLLHRAGCWCGHADRRADQPDSSDPH 
S LHAACLP PPGAARAAGAPS PA* TPAAA 
CPAPVPAPCSEHRC* PPGIiSQRPCLPGIj 
PDARPPGAAAHGGWS LPTRLVLHDRSHP 
EEAPRRHED 


3556 


17457 


A 


3579 


422 


186 


VYSKFL* SQLLRRXiRQENRIiNPGSRGCR 
EPRS*HCIPAWVTQQDSLSISSSTSHEI 
CGSHCSPGYLTVHICDQPVFSL 


3557 


17458 


A 


3580 


442 


3 


GSAISSRCTQESTYQGPHTPPVKKGLRY 
GIILFITSEVFFLAGFIi*AFSHSTLAPT 
SQLGGHWPPTGITPLNPLEVPLLNTSVL 
LASGVS IT*AHHSLI ENNRNQ I XQALLI 
TILIX3LYFTIjI^ASE*LESPFTISDGMY 
CGRSRG 




17459 


A 


3581 


428 


1 


TPLTIYQ*WRDVSRESTYQGHLTPPVQK 
GLRYGIILFITSEVFFFAGFF*AFYHSS 
LAPTPQLGGHWPPTGITPLNPLEVPLLN 

ATiT.A QRW TT* A WHQT .TT?NWPPT AAA 

LE 


3559 


17460 


A 


3582 


160 


415 


RCLSHCLAFPMLS ILFI PSQLLSLPFHV 
KWIFFFFFLNKKFNCVPQAGGQGKNLI+ 

LHPPPRGLKKFSCIiRITSS*NYGREPQP 

e 
o 


3560 


17461 


A 


3583 


399 


2 


STHLGLPKCWDYRCEPPRPALIiAIVKLL 
CKTIHVRINNVCCLPLLTGAASVIGLSA 
SLLFS VS PGTSLQLEAR*TFPWPHMSST 
CVWRF I FTSTaGRAQPQHS DLQTCPRRDV 
DLAEPGLLFNGQMLVPPGRS 


3561 


17462 


A 


3584 


215 


1 


KS PGLLNAFSRGSQI YQASRAMGA * LNQ 
RFWPGTVAHACNPSTLGG*GARIMRSGD 
GDRNSKDGVS PCGRV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ n> 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-AJanine 
C^Cysteine, D==AsparticAcid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, I^=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryp to p h a n, Y=Tyros i n e, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


3562 


17463 


A 


3585 


138 


19 


LNQRFwPGTVAHACNPRPLGG * GAR I MR 
SGDVYRNSKDEV 


3563 


17464 


A 


3586 


262 


3 


GRPEVFYKPPPRGKFFFFFFFLFFPFPF 
FFFFFFVLFFFFFFFFFFFFFFFFFF *Q 
VRLYLLEKYELFFFFLFYPQEISLCCPG 
WS 


3564 


17465 


A 


3587 


381 


1 


KKFTPPPPRKMGPPPGFF*RAPPFFFFL 
GGGGGPLFFLGGPPPPGGGFFFSPPPPV 
FF*KSPPLFSFF* KKRGPPFFKRGGS PK 
TPPCFFFKKKNPPKKKKKKKKKKKKKKK 
GRPSSKS DAWVRTRG 


3565 


17466 


A 


3588 


198 


1 


PKSPPRPLARGKEPNFFSKKKKKRSI *N 
LNTNSWLDMVAHI CNPSTSGGRGG* PHb 
RSGVRDQPGQ 


3566 


17467 


A 


3589 


373 


1 


GIFFPPPKRGGFPPPPPKKFFFFPGGFI 
FGGGGGPFWPPPRKGFFLKKKKGVF*SP 
P*KGKKFFFFPGGGVGPPRGFLKGGPPI 
IFFFFFFFFFFFFFFFFFPATI FIjITYK 
DLTCHPGTHFTH 


3567 


17468 


A 


3590 


506 


98 


HTSKGP PPPPKkKX PPTkPFFFCCFW V V 
FFPPKKKFFFFCSPRSFIHSPPKKKKKN 
FFFPKGGGPPLSPFLTPPPPLFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFF * LFVSLFS I 
FVLLIFFIiFYNAITTI 


3568 


17469 


A 


3591 


407 


1 


LVRGQITNKKNLFMFIiGEELREWRVVFF 
LPQQRLFFFKL* LQ * KEIYLF I * KMTCS 
RGGAGFKSSLYFFFRSPPQWLFFFFFF 
IFFYFF* FFI FFFFFFFFFFFFCNDFMI 
YLYISPPLPILEKKKGGRSRSRS 


3569 


17470 


A 


3592 


380 


68 


FRRGVGPQWPPPQKKGFSQKPPGGFKRP 
PLKGKKITFPPPGKFGPPKGPFKRPPPF 
FFLEGKTLFLGPDP* *NLGWGG*RGHGK 
* GPANPRVPPVFFFFEEYS Y 


3570 


17471 


A 


3593 


394 


45 


ALMFSTGQEGRQEKHPPPHQBCKKASRGE 
EPRGGVRKNYPPPKKKSFPKKNPPGVTP 
PKKKKKAKLPPRPL*GGPQRRPQKTAPP 
PLNFYPPKEPTKVLSIFFSFFFFFSAND 
VELY 


3571 


17472 


A 


3594 


402 


62 


AMIVLLHSSIX3NKARPCLFKKKWAKDLN 
KHFS KEDSQMANKYMKRCLI SLVIRNMQ 

VQKI PTLKHCWWECKI V* ILWTNLAARV 
G 


3572 


17473 


A 


3595 


220 


425 


FISGFLNLQEEREIAFFFKVNIRLGAW 
HTCNPSTLGGRGRRIT*SPGV*DQPGQH 


3573 


17474 


A 


3596 


13 


412 


AEIAPLRS I HRARNCIiKKKGRKGRKNPG 
GDGGNPQIGPAFFGGGNGKTPGQKPHQP 
QTWGQGRHLKGGGGGPKRKGVPPKKKTP 
T I WGGT P IiArXjVE PN W WGHTGNPGGAD P 
KPPGREK* SGGAQI TGPHTGP 


3574 


17475 


A 


3597 


2 


293 


iTif .C D * VI jTITVD "CD 171? 1?D T?T3 T? D 1TO T7 D C D 
v_JjL*o tr aLAa X r^ht rs K r. rf r. K M. K r. K r. f/ p. K p, K 

ERERLPPRDLREKRAQDVFATHPNYEFV 
CVIRHYRGOTVYTPLTTAHS IDAHKMCV 
CVKKRPLYKKKKK 


3575 


17476 


A 


3598 


243 • 


9 


IHjRQEKQCWSKGTRFQLQDEQVLNT*D 
RTWEVMWPGAVTHACNPSTLGGRGGRIM 
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nucleotide 

location 

correspond 
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Predict- 
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nucle- 
otide 
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correspon 
ding to 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, ENHistidine, I=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Vf^uiutamine, K—Argmine, S^benne, 
i— • 1 nreonine, v*— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RSRDRDSCRIHQGGSKPKSPR 


3576 


17477 


A 


3599 


1 


329 


PSARRDWKNIPKLPPPRGKNPTGFFFLK 

PPQFCPPPPRGGFFFFPPGGPPGEKKGG 
VKGPPPPFFFFWPPPPPNLFFPPGA 




1 /4/o 


a 

A 


3600 


206 


I 


iiKALiO X MERAAHTE VRLVF I Q I IiGSPL* 
YIF*RQDLSPSPRLECSGTIAARCSIjDL 
LSASNSPASASQ 


3578 


17479 


A 


3601 


322 


99 


KRTGAFGFFFPPLFNKKKKVFLGPEGPK 
IVKKKKKKEKKEKQNKNRKQTKKK* KTN 


3579 


17480 


A 


3602 


410 


25 


LFFFKKKTGGGGGGPPPFFHPFGGVNGG 
VIRVGELSPPCPPGGTRFFFKKTKNYPG 
G*GGPFFPPPSGGGLGPKKTLSPRGGAS 
L* PNFGPSIiP P KGKGGP PFS KKKKKKKK 
QSRPRDS*SLRRGIRK 


3580 


17481 


A 


3603 


27 


349 


RSSIQCGGILLFYFLYFMVFNDIFWL* 
GFSAITSILISMLFSCI*YYFFKMYCLL 
FNFRCHWIFYILF*MKTIAQIGN*IFFR 
DAYIjGYFFLFIjFFFFFFGGKIiYL 


3581 


17482 


A 


3604 


88 


435 


RAISTCLQNEQYKWLTTITSVEKSIiNDL 
TELKTMV*ELHDKCTSFSSRFIQLEBRI 
SVTEDQMNEMNNPPLXHGFI FYGSS YS W 
STMVRKQDKLRVKDAGIKHRNQNKGFTA 
EKSA 


3582 


17483 


A 


3605 


331 


428 


YGHYGQIEATFLPAITTEKETIiKTRKLT 
*SS INLI*K*T*NVRPGTVGHACNIjSTIiG 
GRGGQIMRSGIRDQPNST 


JJOJ 


1 1AQA 
1 /4o4 


A 


3606 


3 


294 


0C3 * GCb JiL* S CHCT P S SliG * NTVRKKER 
KKGKREREKERKREIIPGFGPCAHHKGA 
VTLGNLFPRVQVSQLVQRRKI I ITVI PD 

nt /*3T7 TT7T TfOAOT 


3584 


17485 


A 


3607 


263 


426 


RKLAGTFPFLWKSGPFQTILAAAQRMGG 
NDP KETRVFHAGLS SSLGENK* VWKI 


3585 


17486 


A 


3608 


213 


439 


LSWACAAEWTVTQPANFPLDYILPFFIjIj 
IRESSKF*RHMATQLETAASRPGVWHA 
CNPITLGGRGGRITRSGVH 


JD50 


174o7 


A 


3609 


2 


A At 

441 


PxTDIiESFPIjESELSANSPVQCSGIaZKPV 
LT I FQVISHKGHCHSKYYLLNANYNVFL 
HLSNSVLPRLI.SGQRTETQKHFYTAVLS 

VPT.Q T VT J3PTTO'\/T.T.r , V"Df2WQ 7AVM* T7VQT 

*PQIHGIiKQSSCI j SLPKC*DYQHGISPP 
WCN 


3587 


17488 


A 


3610 


102 


354 


EPGAWADRCWIiGRVIiLTHLLWPCCLYFF 
LSLPECWDYRCQPLHPACAVFIENSKR 


3588 


17489 


A 


3611 


462 


371 


GtjNFLTL* SARLGLPKCWDYRHEPPRPA 
WS 


3589 


17490 


A 


3612 


463 


319 


RNSHDGLELLTL* SAHLGLP KCWDYRCE 
PPRPVLC 




n>!Gi 

J /4yi 


TV 

A 


3613 


49 


476 


PRNS ITIjLIIAjIjLTNTLTLYQGWFDVTR 
ESTYQGHHTPPVQKGLRYGI ILFITSEV 
FFFAGLF * SFYHSRLAPTPQLGGHWPPT 
GITPLNPLEVPLlaOTSVLLASGVSIT*A 
HHSLIENNRNQIiMQALIilTILLGLYFTL 
LQ 


3591 


17492 


A 


3614 


223 


467 


TPS * PLAI I * F I S TLAETNRTP FDLAER 
ESELVSGFNIEYAAGPFALFFIAEYTNI 
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nucle- 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E^Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso1eucine, 
K=Lysine, D=Leucine, MhMethionine, 
N=Asparagine, PHProline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














MI INTLTTT I FLGTTYDALS P ELYTT YF 
VTKTIiLLTCLFL* I RTAYPRFRYDQL I H 
LL*ENFLiPLTLALLI*YVSIPITISSIP 


3592 


17493 


A 


3615 


2 


405 


ARGLQEAGAVQNDPVSRLFDHAMLQGHR 
AHQMAIVTYQEFGETYIPKDQKY* FLHE 
SQTYFCFLNSI PTPSSMEETQHRSSLEL 
uR I S Ixlilj 1 clS WUi F VRrLKo MNANNLi VY 
DTSDRDDYHLLKDLSEGIQTLM 


3593 


17494 


A 


3616 


328 


63 


PSKF*KPCLISPPPLGPGPPN*TPGFLK 
KPKFPPRGGAHP*SPLLFRVRPGDPLNP 
RGQGFP* PKLPPGPPTRATKPNFVSQKK 
KKKE 


3594 


17495 


A 


3617 


1 


379 


GTSVGIPTGGQEFS *FAALHGSNVK*SA 
AAL*ALRIMILFTVRGLTGIGLANSSLD 
IVLHDTDYVEAHFHYVLSIGAVFAIIGG 
FIH* FPLFIjGYTLDQTYAKSHFTIIFIG 
VSVTLFPQHFLGLC 


3595 


17496 


A 


3618 


494 


17 


GSANPRNPRRSGSKTRETSPRNVMSLMS 
PMSPTLPMRPTSPGTPMHLTNSMGPMQP 
PNSSRPVNPASSATPVSPAS PARPRKPS 
CFPAPQ* *SPPI»*SPKPRHGLSCLSWVK 
KECETWHWSMHIVEDRHSKPYHGPGMV 
AHACNPSTLGG 


3596 


17497 


A 


3619 


350 


480 


IiGS WT> YVCKP PHLAVLLL * RQS FVLL PR 
JjVTN S * AQ VIHIiP W P 


3597 


17498 


A 


3620 


2 


400 


AAABPHGCYAKVLDEFKPLGEEP*NIjIK 
QNCELFEQUGEYKFQNALLGRYTKKVPQ 
VSTPTLVEVS *NLGKVGSKCCKHPEAKR 
MP * AEDYLS WLNQLCVMHEKTPVSDRV 
TKLLTESLVNRRPCPSALEVK 


3598 


17499 


A 


3621 


3 


476 


LGYSVHPPQPIiAQVLSPPLPGASRAGWL 
LRMRGPPS PHPPRTPAGLQAPHAALVPA 
RASPSTPPCKLREWAPAIASPERGSHSA 
VGG*RAPQMPPKWE PRQGRCQEQARALR 
TASMLSPLSFTHPVTLSLLNFPTSLPPR 

KTOTTt TO \ TO 


3599 


17500 


A 


3622 


40 


390 


MKGALTQPGRAGGLGSQSQMDMASTPKP 
SPDDQGCSIHLWGFPFLVSLLPPSLSPS 
PPSPPRHSVGTHCAP*VLPRPLGALRVP 
HSSVPRPHAVPWGTTYRYSRGAWPLSI* 
DFRGL 


3600 


17501 


A 


3623 


52 


475 


NSPK*NQI ICDLFVPFHLA*CF*DLFML 
WRAGAQFWLTKTP PHS PARLKQFS CLTL 
P 


3601 


17502 


A 


3624 


333 


466 


SIIGKIRLWKMRPWPGAVAQACDPGTLR 
G*GKRITRSGVRDRPG 


3602 


17503 


A 


3625 


2 


138 


iiKKG^KuN PWJWjCGE P * bKHCTPASST 
E*DSVSKNQKTKNSQWH 




J /3U4 




iozo 


*i 
i 




T SIT iP P P AIj P PP P PP PT .PTPfiTVSHPn c; Af3 
FPGRPTL*RI*PPLPDPPPSLGI»SPGDDP 
GLAREELKSPCPWGPHPLCHLPFP*CWA 
S SQPS INAYEWPJCKKKKKKKKKKKKKK 


3604 


17505 


A 


3627 


2 


393 


LGTDLSLIilRFELGQPGMLLGNDHIYNV 
I VTAHAFVI IFFI VI P 1 1 IGGFGN * LFP 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
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nucleotide deletion, \=possible 
nucleotide insertion 














LIIGAPDMALPRINNISL*LLPPSLLLL 
IASAIVEAGAGTG*TGYPPLAGNYSHPG 
ALVDLTI FYLHLARVCS I 


3605 


17506 


A 


3628 


2 


379 


QDATYPI IEELITFHEHALI I I CLI CFL 
ALDALFLTLTTKLTNTNISDAQEIETV* 
TMLPAIILILIALPCLRILYITDEGNDP 
SLTIK* IGHQWY+TYEYTDYGGLIFNS Y 
I LPPLFLKPGDIiRIi 


3606 


17507 


A 


3629 


1 


400 


PTRPPTRPPTRPMTNTLNS IHPPI,PR * K 
KKKKKKKKKKKKKKKKKKKKKTAAGGGA 
* KKKNRGAKTHRGSKKLNFFFKKRNKKN 
PPVKIEKKTFFLGGKKKNKPPQKNNALK 
KKKNYFEGEGESLLLL 


3607 


17508 


A 


3630 • 


416 


1 


SQLGYSGVRDPLEEATCLFSDLKIiHAGR 
TTPLFQAVRQGHLI LQRFLQP FA* L CPA 
PRGGVYRGRQASLS CGGLHPVRASWPLC 
LPTQFSAMEGAPPXiALLPRSSS ISDYCA 
SNEQGSVGIGPSEPCTGYNLLVCHLL 


3608 


17509 


A 


3631 


275 


83 


RWGS CYWLAGLQLLGSSDS PALASQSA 
GITGVSHCTQPKNVI *KI SHLTVAGPAA 
DKTPQTPR 


3609 


17510 


A 


3632 


186 


3 


FFFFFFLFFFFFFFFFFFFFFFFLFFFF 
F FFP F FP FFFLVCQVPLLCNL * HLVYLF 
YMVL 


3610 


17511 


A 


3633 


3 


235 


G 1 1 S FLL I R* * YARADANTAAI RAIL YN 
RIGDIGFIIALA*FILHSNS*DPQQIAI 
LPHPSDSKKYLYLSHIQLKKA 


3611 


17512 


A 


3634 


1 


428 


RTLKTTLFNPDGG*DPILYQHLF*FFGH 
PEVYI I ILPRFR+ ISHIVTYYSGKNEPF 
GYISMV*AMISLGCLGVIA*AHHIFTV* 
IDVDTRAYLTFATIMIAI PTAVRALS * L 
APLHRSN I K* SAAKL * ALRL I FL FTVGG 
QT 


3612 


17513 


A 


3635 


2 


129 


PVI YSTI FAGTLITALSSH* FFT*VGLE 
INMLAFI PVLGFWD 


3613 


17514 


A 


3636 


2 


422 


DR* LFSTNHKDIGTLYLLYGA* AVVLGT 
ALNLLIRAELGQPGNLLGNDHIYNVIVT 
AHAFGI ILFIVIPII IGGFGN*LDPLI I 
GAPDMAFPRINNISC*LLPPSLIiLLLAS 
AIMEAGARTX3 *TAYP PuAGNYSHPGAS V 


3614 


17515 


A 


3637 


14 


479 


QNCKINNFPMYFLPHQSE FPRKSCKQQN 
T I CFRLSNQNTMKKTTENHALL W IRKWQ 
T * KQQCKSNSGYTPSTPSRAKKKKKKKK 

LKKL 














3615 


17516 


A 


3638 


2 


449 


1 1 1 1 CFVGLCALVLiTLTTKLTNANILDA 
QEVETG*AILPAIMLVIIALPCLRILYI 
TDEVNDPSLTIK* IGHQW Y * AYE YTD YG 
GLMFNAY I LPPLFLKPGDLRLLDGDNRV 
VLPNEAPIRIVMTSQDVMHS *AVRTLSL 
RTDAIPGRL 


3616 


17517 


A 


3639 


347 


98 


HFFLGGGKGGGFS PFPPQKFFFFLRGFF 
FRfif?nr;PWT?*p p pick - cjffq KTrpnnvp , *TT> 

PKKKKKFFFFPRGFWGPPGI FFKGPPP 


3617 


17518 


A 


3640 


2 


426 


DR*LFSTNHKDIVTLYLLFGA*AGGLCT 
ALSLLIRAELGQPGNLLGNDHIYNVXVT 
AHAFVIIFFIGIPIIIGGCGN*LVPLII 
GAPDMAFPRINNI S F* LLP PSLLLLLAF 
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AI VRAGAGTG * TVYP PLAGNYS HPGACA 
D 


3618 


17519 


A 


3641 


94 


294 


GRGGGGVAGGRGAMWSKGRQRGRNGPVR 
GREGGRKSHFRKEG*KEGRKEPPPPPKK 
KSRGEKNQREG 


3619 


17520 


A 


3642 


1 


405 


RTRGFDLERFVLFDVAQQMVESGYVCEG 
DHKTMANAI IDRVSLIKRKREQRQLVRE 
EQB KKKQEE SSLKQQVEQSS AS QTG I KQ 
LPFADTGI PTASTTSASVSTQVEPEE PE 
AD*HEQLQYQQPSLSVLSDGTVD 


3620 


17521 


A 


3643 


34 


405 


HLLGRKQNKAI FLEAEKKKKKKKKKKKG 
GPIiKKMPGGAKNNRGGEKKI FSP*GGVK 
KTPLGIFEKKPYFGGGKKWANPPKKIKT 
IiKEKKKF*RGKGGKKPENPWGEKDESSS 
PK 


3621 


17522 


A 


3644 


430 


81 


EFAQLSPPQIKGFPSPPPNFLPPSLFGF 
FSPFSP* KFFFSLKGFI FVGGFFPFFS P 
PKKSFFSKNPQLVFISPPFKKKIFLFPP 
PLNFGPPRVFFKRPPPFFFFFFF 


3622 


17523 


A 


3645 


399 


2 


AFFFTKKKKEFFPPFPFKNFFFFSRVFF 
FLGGFFHFFPPQKKIFFLKI PRGFFLTP 
PKKKKI FFFFPR* FLAPPGFFL*APPPF 
FFFFFFFFFFFF F FFFFFS PCS WSCHQV 
FPHASNRIHNPSNSYPLQQY 


3623 


17524 


A 


3646 


49 


326 


KKKKKKKKKKRGGPFKKTPGGAQKNPGG 
BKKFFFIi*GGHKKPPRGI FKKKPFFGGG 
KNWTTPPQKNKAFGGKKKFLGGKGEKPP 
PKPRVKKN 


3624 


17525 


A 


3647 


327 


94 


QKKKKKFFPPPPPKNFFFSPRGFFFGGG 
GGPFSPPQKKGFFSKNPGGVFLPPPKKK 
KIFFFPPGGFGAPPGFF*RGPP 


3625 


17526 


A 


3648 


2 


301 


TSSVASTFIMSLFPTTIFMCLDQEGI I W 
N*H*ATTQTTQIiYLSFKLDYFSIIFIPV 
ALFVTWPIIEVSL*YINSDPNINQFFKY 
LLIFLITILILVTGC 


3626 


17527 


A 


3649 


2 


406 


STNHKDIGTLYLLFGA*AGILGTALNLL 
IRAELGQPGNLVGNDHIYNVTVTAHAFG 
I IFFIVT PI I IGGFGN*LVPL I IGAPDM 
BFARINNISF*IiI»PPSLLLIiLGCAIVEA 
GAGTG * TVYP P IiAGNYSHPGACV 


3627 


17528 


A 


3650 


3 


418 


HAYADAWVHFHS ITLLILGLLTNTLTI Y 
Q*WRDVSRESTYQGHHTPPVQKGLRYGI 
ILFITSEVFFFAGFF*AFYHSSLAPTPQ 
LGGHWPPTGITPLNTLEVPIiLNTSVLLA 
SGVS IT* AHHSLIENNRNQI IQALLI 


3628 


17529 


A 


3651 


526 


2 


WGGGEKGRAGGAGEETPHQESQRQGCAL 
PQEPPLYPPCPAIHPPPLPLGGTiTiTiFPS 
LPPLP*AAWFPHPGQPGRNLNGESPTTR 
EKRLWGPPJjPTPLPWPPTQP rPXjSRK IG 
CPPCPREVPTPSPPWAEPPPLPGPPSEI 
AHLTTVRRGAALANVNHEN I S DTDENKL 
NAFVTA 


3629 


17530 


A 


3652 


464 


297 


DYRCEPLLPASYIDVS 


3630 


17531 


A 


3653 


162 


467 


NSKKQQFVLLSE ITS YVYWLHKRDLDSH 
TI IMGDFNTPLSTLDRSARQKVNKDIQE 
LN*ALHQADLIDIYRTIiHPKSTEYTFFS 
APHCTYNFLFQSYFSSQF 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
OGIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
v^— wiuuiiiiijic, ix—Arginine, o— oenne, 
T=Threonine, V=Valine, 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
iiuueutiuc liiacruon 


3631 


17532 


A 


3654 


372 


1 


LMMLSIFSCVHLAFVCFLWRREMSGVSL 

SSCLCKRHLQI^PQVYTTMLGYLFLFLF 
LROSCSVADAGVOWHDHSSLOP * TPGLK 
RSNS PTLV* DLD 


3632 


17533 


A 


3655 


444 


341 


VGQDGLLDLLTS * SACLGLPKCWDYRRE 
PPRIiAY 


3633 


17534 


A 


3656 


466 


315 


VGHTGLEIiPTSSELPA*ASESAGITGVS 
HCTQQSGCAFCYSTMI PTKPYTC 


3634 


17535 


A 


3657 


40 


276 


EFDTAHAFVTIFFIVIPIIIGGFGN*LV 
PLI I GAPDMAF PR T NNI S F * IjLPP .LT, 
LLASAI VE KKKNEKKKKTGAPLV 


3635 


17S16 


A 




1S1 


470 


FTTLPVTSASLPl^KTTiAWPGMVAHaCNP 
STLGG*GGRIT 


3636 


17537 


A 


3659 


40 


182 


GIRKENHIxNIiGGRGRIEPGSHHCTPAWA 
TK*DSVKIKKTRIQKTPLI 


1617 










*K)1 


x inoivli x xs i Jr\ji*j v/\n-r\ in xai x/VjvtVjvjCjvj X 
A*GQEFQPSLRN 


3638 


17539 


A 


3661 


145 


437 


YFSIjSLCIjLELMNS VI xxFFFxjVxjERGFx" 

PPPT.lTI7MfV!!PWn7fi* T.KTDPPPr'T . VTil71?r*T 

TLPKSWNNRPPPPRGGNFLDF*KKRGFN 
YVGRVDLKLRTSG 


3639 


17540 


A 


3662 


255 


451 


LLSFNPFFFFFFFGKGGPPSliPPGGKGG 

GTiNPT*TTGPGGfrR1\T<5PPOPPGPPfl*Pr2 

PTPHPGLFF 


3640 


17541 


A 


3663 


185 


66 


Plxl^PLRK*LxT^FRTOVxxW^PDWSAjyM 
QSRLMATSAEF 


3641 


17542 


A 


3664 


42 


193 


FQLLTRLRQENHLNAGGTSCSEPRSHHC 
TPA*VTETQSQKQKVKVLGFSL 




i / j*tj 


TV 
■** 


J ODD 


ZlO 


o 
JL 


ITQlfrt"KT'QT?T.nQP\7KrPr r KrQVC , T T T7T DnPl/' 
lw IS.VJO r ixtloK V IN Kv?IN SJ\JCiJjuifljluAaV 

WRYCPAWS * TPGLKQSPHFNI PKCWDYR 

REEFHHPGLWYEFE 


1641 






jOOO 






GP T»W Q iVMT PG KT. TTTT .r*If Tn&ID AT.PWTCCJD 
V7tvXJVY Owl X ±r V7IVXJXV. X XJV. CvJL L/VY JriUxlS Vlanflf Jr 

LEGSUDRSLVSKVWHKVTYKPRNPDQFP 
YRDT*LELVLDPPPPTHSG 


1644 






16/\7 




<> 


QOTiT GR AF AP GMT ."P PP Q T .P V T WITT C r\\7W 
WYTPVLPSSCDYRHTPPRPANFCIFSRD 
GVSPC* PGWSRTPD 


3645 


17546 


A 


3668 


104 


334 


TVlxFPPCFIJ^IDFCMVLDFFSNFFVLN 
YF*NYRRVQKMYEEFPFFLFFFLIFSFS 
FLFFLSTFNFFLCPLCDLWYH 


3646 


17547 


A 


3669 


369 


260 


DGLDLLTL* STHLSLPKCWDYRHEPPCP 
ANMPSFKN 


3647 


17548 


A 


3670 


111 


342 


ngvsffppkkgxsiwangnfp^ppkpgg:* 
rnfla* ppqkfgmagpppppqpvgiagp 

PPHAREILEFFFFFFKKKGV 


3648 


17549 


A 


3671 


377 


3 


FAIIGGFIH* FPVFPGYAIAQTYAKIHF 
T 1 1 FIGVNLAFFPQHFLGLSGMRRRYSD 
YPDAFTT*NILSSVGSFISLTAVILIIF 
MI*EAFASKRKVLIVEAPSINLE*LYGC 
PPPSHTLEDPVY 


3649 


17550 


A 


3672 


39 


348 


AKIAPLNS S LDERMR VS KKKKKGHGLTP 
KPFLKKKl^KNFPLSQLEGVGKKNQKPF 
G FKKGLELAP FNGKGNGENPNTWRWG *T 
MSFFFFF*DSHPFIQAGVQWRBLGSLQP 
RPPRFKRF 


3650 


17551 


A 


3673 


327 


3 


S KD P VFF KVF KKNP PTPN I G VF P FAL PL 
KRGQFQPLFNPKGFFFFFFKPVQLGRGK 
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OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=G!titamine. R— Arf inine. S=Serine« 
T=Threonine, V=Valine, 
W=^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possibIe 
nucleotide deletion, V=possible 
nucleotide insertion 














ISPQFFFKKRFWG*TKSFFFFF*DSHPF 
IQAGVQWRDLGSLQPRPPRFKRF 


JUJ 1 


l / jja 


A 


^674 


^79 


267 


EPRLSPCTPAWATE * DSIS KQNKTKKPK 
LALNKKKMF 




l IjjO 


A 


jO/J 






FCRDRVSLYCPVRS * TPGFKGSS CLDLP 
SSWDYRHAPPGLAKIGGLHVFCLPCCGS 
LLVCWS SHCSIiAHQELLLENKHTDVWMG 
GWMDR 




1 /jjh 




JO/O 


•3 
■J 




PVYKNKNKRPDAVAHTCNPS TLGGRGGQ 
ITRSGDRDHPG*HL 


3654 


17555 


A 


3677 


291 


30 


YSLVLLI FFF YCFS IS YFI SLMNFCHNL 
YYFLLLDFGLVCS YFPS ILWHNEIYFIG 
SMFFIFSVTV* I FFCWEMS CNIYKLHQE 
MF 


3655 


17556 


A 


3678 


1 


298 


RDFSASASQVRGTTGMHRHAQLI FVFRV 
CVCVCVCVCERBRERERDGVLLCRPDCC 
SVAHT*VQGRDLGSLQPPPPGFKRFFCL 
KLPSSWGHRCWDAP 


3656 


17557 


A 


3679 


20 


376 


VPP PHLANFLNF I LLFVDTGS YYVAWAG 
FKLLAP CNLSASNS * SVG I TGVSHHAIiL 
LLALKCNQLiFQFVI FILLHAVCIYDFIM 
VRFSLQCFPQFYDFPFCFIIiFVSSKAQF 
CFLRLPY 


36S7 

JOJ / 


1 / JJO 


A 




110 

I Jv 


334 


NTTDYFKLP* ILSGTR*DMYKQPL*HIiN 
TVRBYTS KYGWRPGWAHACNPS TLGGQ 
GBRITRSRDQDQPCQHGET 




175SQ 
j tjjy 


j± 


JQOl 


17tf 


511 


LTMMLQMKISVI CCLLRNIFWPGALAHA 
CNPSTLGS *GGRI ARS RDLENS PSLLGS 
QPSS 


3659 


17560 


A 


3682 


448 


248 


AKDLNTFFTKE*TPLANKHMKKGSARFT 
YRNVQ I QTT*RCLGT PTRVTGV* MRNS A 
RHR*GCGGLV 


3660 


17561 


A 


36S3 


486 


176 


PAPINPPQIiRSLVQRSGIRFC*AGQVGF 
KIiLVSSDPPTLASQSAGITGVSHQTWP * 
NYS *NSGMPTHPRQAS S WRLREQWYHVS 
LLPCAYYKNGRQNSLLI 


3661 


17562 


A 


3684 


238 


27 


NKHILCYVGQEMGMGWGSHSHGWKPHFK 
S FQVCPSAVAHACNPSTLGG*GGQITLV 
QPGKHGKRRIPALF 


3662 


17563 


A 


3685 


1 


299 


LITQTCHVGCLKIKCI *TSKKKKKKKKK 
KKKXKARGPLKKKGPLKTPEKPPGGPFK 
APPFWGGPPPRFFFGGGGAPPPVFLKKK 
KKNSPLWGGGGFFFR 


3663 


17564 


A 


3686 


494 


91 


PLGKRFPGGDIALTKLSRPWAWAPPGIP 
PPLGGPGGGVP*VTPFKPTPFRPGEPPIi 
PLKNQTFFGVPGPPPYFPPLGGLGPBIP 
FTPGGPGS IKLAPPPSPPPGAPKRNFFP 
KKKKEVMKMLCFHFNLLVQNI F 


3664 


17565 


A 


3687 


17 


249 


YSRFGRHFGHDPGLYFQ ILRRLRWENHL 
NPGGRGCSKL*SHHCTPAWATE*DPVS * 
KKNVTTIAKRWKQLTCPLADG 


jDOj 


1 /JUD 


j± 






461 


FWLWVSSVLKHMNWPGVVTHACNPNTLG 
GQGE * SA*ARV*DQPG*HGETCS AKEK 


3666 


17567 


A 


3689 


149 


422 


ASGRQGPDRGCWGGQAPRHPARGAKPAA 
DMGMSGQGYTKQAAP *GDPSQRAGVRLP 
REPTSRKASGGSGGHSRASGGAVPGQRV 
AQPSPAR 


3667 


17568 


A 


3690 


1 


415 


GDPAGGGDPILYQHLF* ffghpevyili 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PbenyiaIanine, 
G=dycine, H=Histidine, l=Isoleucine, 
K=Lysine ) l>Leucine, M=Methionine, 
N=»Asparagine, P=Proline, 

C^=CZ 111 ITi I n f RsArmnina Czarina 

vjiu tain iii »v— Arginine, o— oerine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 

iiucicuiiuc ucictiun^ \ — possiDie 
nucleotide insertion 


- 






• 






LPGFGI ISHI VTYYSGKKEPFGYIGMV* 
AMISIGFLRFIB*AHHIFTVGIDVDTRA 
YFTS ATI I IAI PTG VKVFS * LATLHGS N 


3668 


17569 


A 


3691 


1 


393 


GTRKYSFLHDSQTSFCLSNSIPTPSNME 
ETTQQKSNLELLRISLLLIESWIiEPVRFL 
KSMFANNTjVYDTSDSDDYHLIiKDL*BGI 
QTLMGRLEDGTRRTGHILKQTYSKFDTN 


3669 


17570 


A 


3692 


402 


2 


FLTQRWGAPLP I P PFWEAP FGPSP* APG 
FKPPLPPRGNPPFS * KTQI YPALGARPV 
IPPSLGGGGGES FLPPGGGFP * PRLFPC 
PPPRGAKPNFFFQKKKKKCLFCKAPQAQ 
HRSACLHEOiPLGNLLKLCPR 


3670 


1 / J / 1 


a 

A 




J 




TlTVTl?l?/Tl'PTT.Tn f PT TV T TTTJ * f~* ~KT<~\ ngDT T\Y 

lr ilf roTIljll?rXaAXXTR*GNQPEKLN 

AGTYFLFYTLGGSLTI^:i^ 

SUSIILLLTLTGQELSNS*ANNLI*LAYT 

IVLAAVLLKLGGYGI IRLTL I LNPLTKH 
IAYP 


3671 


17572 


A 


3694 


452 


208 


INKSARHGSMNL*SQLLRRLREVCLSPG 
TOKKTCQHLYDHRPSAIiEATFRIjSH 


3672 


17573 


A 


3695 


464 


146 


LKQTRATKGNLFLP * KPKLTPPGGFRPP 
FQPFWKPRPLGPPKSGVLDPPGPQGKTP 
FVLKKPNLLGGVI PTLFFPIiSGGLGRKI 
PSPRGGGGAKKRKP PRQKKKKK 


JU ID 




TV 




*> 


455 


irPDT.oi?ODRr t T/ocpoiiPi'unpnpgTT t tv 
Vlx<xjKI2IKFiUxLjVi>£> IoWTEDEDFSXIjXjA 

ALEKFEQLTLDGHNLPSLVCVITGKGPL 

DRVVQPT»TWnVlIT?rk'lJTnvr ,, T , DtJT T77M7TYV 

PLl^SAEPGVCIJaTSSSGIJ>LPMKVVD 
MFGCRLPVCAVNFKC!IiHELVKHDENGLV 
FEDS*ELAAHLEKL 


3674 


17575 


A 


3697 


173 


51 


TFISWFGEQWFGCMEKFFSGr)F*DFG 

A P TTfifi * Q PT T 


3675 


17576 


A 


3698 


380 


26 


KQRTOFFLFFFKGTGAFWFFFPPFFNKK 

RKQTKKK*KTNNKQNKKRETDIFGVEDE 
TWRQRWGI/3GGRVANSSAEMGKGNSTTR 
PSWNS 


3676 


17577 


A 


3699 


11 


414 


RIiWTAILIQTP * S FTGAVI LI IAHGLT 
SSLLFCLANSOOKICKKKIC 


3677 


17578 


A 


3700 


1 


401 


SSSLT^NNNKSFLNQFVTFNEMWILYD 
lW**PAQWLIX2EAPKHFPKPNLrX}KKVM 
FAVWWSAAGLIHYCFLTPGETI TSEKYA 

A* SHITQLKLQKLN* LGYEVL 


3678 


17570 


A 


3 /l/l 


yyj 




TP PKKKITKI PPPP P IT T T? P P PTf WW DP D 

PFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FPFFFSIYIDFNIiCICDNL*LQKTLVFL 
E 


3679 


17580 


A 


3702 


407 


371 


SFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFLWYKDI I I KGKNEQNV* CS 


3680 


17581 


A 


3703 


36 


409 


LLKCEGIWKKEGKKKKKRREKKGGPFFR 
VLIiGGPTFRGGAHKlvS LLFRG VFFNLGG 
FFFEGPFFLGGALLGPPPPPIFTPIiGKK 
KIFKGKRGPPPLIFLFF*IRGPPPPPSP 
PSTKPPPPPPPTP 
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peptide 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G!utamic Acid, {^Phenylalanine, 
G=Glycine, H-Histidine, Hsoleudne, 
KNLysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q := Glutamine, R=Arginine, S = Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 


3681 


17582 


A 


3704 


56 


415 


QAPWGKKKKKGKEKKKKKKKKKKKKKRG 
GPLKKNPGGGPTMGGGEKKYFSPWGGEK 
KNPGGNLGKKTPLGGGGNWAPPPPKKKG 
AGGKKKI *RGGGGKTPLIGGGKKNGGPP 
RTSPR 


3682 


17583 


A 


3705 


108 


1 


NLAK* IQ* *IQTIMHHDQVSFIPGIQGW 
LNILKSIN 


3683 


17584 


A 


3706 


425 


52 


PFPH* * *L*KVDPIPLPPQNFIiHPSNNI 
YFSPLSPKKFFFSLQPFFFFGEIFPIFP 
PPKKNFFS*NPNLVFISPPLKKKIFFSP 
PPPNFAPPKNLFKS PPPFFFFFFFFFFF 
FSGKNTLYWVDK 


3684 


17585 


A 


3707 


117 


421 


VLLVIQAEYLLPEVFGTRSVSDFGFFRF 
WNICI ILSG*ASLI * KSBI *NALKNISS 
FKKKF I FILE PRSHS VTQAGVQWQDHGS 
LQPQTSGLKLFSHLSLP 


3685 


17586 


A 


3708 


308 


3 


RSPPWFFFGFFFFVLFFFFFFVFQLFFS 
KYNITLYIKKKTETFTETNDNGNATCQN 
LWDTAKAI PREKFIAISTYI *KVGKHQI 
NNLMKHLKEIEKQKQTK 


3686 


17587 


A 


3709 


94 


420 


KKKKKKKKKGGGAFKKKPGGAKI *RGKE 
KKNF FLKGGGKKKKVGNFGKKP YFGGGK 
KREKTPKKN*SLKGKKKFLKGKGGKTPP 
KPWA*KKWSS 


3687 


17588 


A 


3710 


413 


184 


LGECLSLRSKKLWSGPAGGGVCL* FQQV 
RRRKWEDHLSPGIRG*RALSSHRSIPAW 
TAEQDPVSRGLVWGLPGGSW 


3688 


17589 


A 


3711 


3 


104 


GGCSELRSCHCTPAWRQS*TSSQRKKKK 
KKKMY 


3689 


17590 


A 


3712 


2 


393 


GLFSTNHKDIGTIjYLIiFGA*AGVLGTAI* 
SLLIRAELGQPGSLLGNDHIYNVIVTAH 
AFVIIFFIGIPIIIGGCXa?*IiVPLIIGA 
PDMAFPRIlTOISL*LLPPSIiIiLIiLASAI 
VEAGAGTG*TVYPPLAGN 


3690 


17591 


A 


3713 


424 


1 


FFFFFFLRRSLTWQAGVQWRNLGSL*A 
RPRV 


3691 


17592 


A 


3714 


2 


130 


GRVGGRVG*NLKLYSQRLKKKKKKKKKK 
KKKKKKKKKKGGGPF 


3692 


17593 


A 


3715 


3 


197 


FHHVGQDGIiNLLTL*STRLGFPKC*DYR 
CEPPRPAAFFFFFLKKKKTLAPKKFKTL 
DLTTRERS 




i / oy* 








497 


RSPPSGFAWPGPAGPWEAGAAAAGQPLG 
ARPHSPCTGGCRSAGTPP* PAGPAGPPP 
HAPAAAERKPRGAEQSQRRVQETGPPPT 
RAESNPSGLRGREVPGSAGCWSGSQESD 
FGAGCPAVT*GTYPYRIAGQVSSRAPHP 
AESRGCTAGLS IGPDGRQ 


3694 


17595 


A 


3717 


40 


379 


AITSPRSPPAVHGSCLARTHPNSPQCLP 
LSGTLLLPSSSLSLSHPATAAGE * IWQA 

SPGGALKKKNPGGPNPPGGGKNT SSPPR 
G 














"3695 


17596 


A 


3718 


445 


286 


KKKK* METRSCYVGQSGIjEIiLASSDPAT 
LASQSVGITGVSHQAQPVNKLPRVQ 


36% 


17597 


A 


3719 


15 


383 


GYFLBASCLLHPLLHPMYQNCGHVACLA 
AKENGKFMQAACGQLNS PALS LKNGRLS 
AAYGGSDCVYLFPMSKMVLGTQ*Al^rKY 
TLNGPGAVAHACOTSTLGG*GGRNLRSG 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionbe, 
N=Asparagine, P=Proline, 
V£=uiutamine, K=»Arguiine, 2>==oenne, 
i — inreonine, v— vanne, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














VGDQPGQHGET 






A 


3720 


14 


472 


RS CARNPGIjTHASAQYQTP LFV* SVXiI t 
AVLLLLSLPALAAGITILLTDRNLNTTP 
FDPAGGGDP ILYQHLF * FFGHPEAYILI 
LPGCAI I IHIVTYYSGKKEPFGYIGRV* 
AMI blvj i IiGt X VRAHHIFTVGIDRDTRA 
YFTSATIIIAIPT 


3698 


17599 


A 


3721 


3 


298 


EAAIKYFLTQATAS I ILLIAILFNNILS 
GQ*TITNTTNQYSSLriIMAIAIKLGIA 
PFHF*VPEVTQGTPLTSGLLLLT*QKLA 

Tl TO T TVAT O HOT XTT7" 


3699 


17600 


A 


3722 


86 


505 


AGAVTYIVTYYSL*KKDSFWIHSVWC*A 
MRVNWE S * GliSV* AHH I FTVG I DVDTRA 
I r TSAT X I XAX PTGVKVFS * XiATXjHGSN 
MK*SAAVL*AfiGFIFLFTAGGLTGIVLA 
NSSLDIVLHOTYYWAHFHYVLS IGAVL 


3700 


17601 


A 


3723 


432 


48 


FKTSFI1PFLLPSFCPSFPSFSLFPFSFS 
CFLSCPSSLSFLFFFIiLSFHPFFFPTLI* 
XSF*VFIjF 


3701 


17602 


A 


3724 


2 


406 


HHIFTSR I DVDTRAY FTSAT 1 1 1 AI PTG 

T ft/T f PC Ik fill TT#*^OlWf1Ut9^ » i< T ^ « f T *»* 

VKVF5 * XjATIjHGSNMK* SAAAL* ALGI I 
FLFTVSGLTGIVIiANSSIjDIGLHDTHYV 
VAHLHYVLS IGAAFAI IGGFIH* FPLFS 
SYTLDQTYAIIHFTMIFIGVNLT 


3702 


17603 


A 


3725 


174 


158 


KFFGGGFPHFFPPQKKGFFPKNPPRGFF 
T PPLLGKNFFFP P PVNLGPPRGF F * GP 


3703 


17604 


A 


3726 


2 


392 


L FS TNHNDMG TL YLIiFGA* AGGLGTALS 
LLIRASLGQPGSIiIjGNDHMYWVIVTGHA 
FGIIFFIVIPIIIGGFGNRLGPLIIGAP 
DMARPRIWISC*VLPPSIIJJjLASAIA 
EGGAGTG+TAYPPLAGNY 


J7U4 


17605 


A 


3727 


48 


474 


KKKKKKKKKTKKKKKKKKKKKKTRGPPP 

KKTPAGPPPPPPREKKNPPPEGGQKKPP 

PAH*KKPPPPRGGQQPPPPPPQKNPPPE 

KQKTWGAEGGTPP PKPPRKKNGSQQIAD 

FJb'oNAAttJNKHJrXyKFFFQSQC 

PP 


3705 


17606 


A 


3728 


387 


53 


PPPKKGVLPSPPNFYTPPK*GVFSPLPP 
+KFFFSPKGLIFWGGGGPNFPPPKKKFF 
SKKPRGGFFFPP*KKKILISPPREKLGP 


3706 


17607 


A 


3729 


1 


397 


PTRPLQPLSPEFKRFSCRTSPDVAW*VS 
LPSPPPSGRS FFVFL* VESFCVWIKKRF 
TMLARLVLDS *PSDPPAWASQSAGITGA 

qT,T?n«3TiT»VTV*n7Q TnTy2T.T.DWK»T.TOVT/ 
OioICnOJUUX 11 V_JC O XyiJUiJLiriSIijl^Z V 

QLLIVYPRIHHLYLSVPHLH 


3707 


17608 


A 


3730 


407 


0 


TLKHTKKKKKKKKKKRGGLLKKTLGGPK 

ILGGEKKGKKPPKKKKPLGEKKNFKGEK 
GKKNLKMFGVIKLGTSSKKIIF 


3708 


17609 


A 


3731 


3 


200 


GGCS ELRS CRCTLAWVTE * DSVS LKKKK 
KKKGPFKVLRGKGPSGNKREGAKKKIQG 
KPCPFWESF 


3709 


17610 


A 


3732 


3 


380 


TNHKDIGTLFLLFGA*AGVLGTALSLIjI 
RAELGQPGNLLGNDHIYNGIVTAHAFVI 
IFFIVIPIIIGGFGN*LVPLIIGAPDMA 
FPRINNISF*LLPPSLLLIiLASAIVEAG 
AGTG*TSYPPLAGN 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteiney D^Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M==Methionine, 
N=Asparagine, P^Proline, 
Q=GJutamine, R=Arginine, S=Sertne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3710 


17611 


A 


3733 


6 


394 


AMISIGFLGVIG*AHHIPTV*IDVNARA 
YCTSATIIIAIPTGGKVFS*I*ATLHGSN 
MK* SAAVL* ALRFI FLFTVSGLTGIVLA 
NSSLDIVLHDTYYVRAHFHYVLSIGAGF 
AI IGGF IH* FPLFSGYT 


3711 


17612 


A 


3734 


27 


238 


GIPGSSHASAHIG*IIAVLPHYLNITII, 
NLTIDI I LTTTAFT J J iNLNS STTTLLLS 
RT*NKKKKKRGGPF 


3712 


17613 


A 


3735 


402 


249 


ASAHASAPLFV* S VLITAVLI.LLSLPVL 
AAGITIIjLTDRNLNTTFFDP 


3713 


17614 


A 


3736 


100 


430 


VTWPKS DS YEE VKI KLE PRQTDLRALGF 
CLSFTGKKKK* KKKKKKKKKKKKKKKKD 
KRGG 


3714 


17615 


A 


j i j f 




306 


FliSSVFSLVQGVLFSSLCLPRNLPDMMQ 
LWSEIFNK*LYLQEIiSILGVY 


3715 


17616 


A 


3738 


279 


3 


PGHCFYNFNLNFYFQI FFQNVLANSGSL 
LFLYQTLILSFNIFYILYLIIQISIVFQ 
SVIL* F I FSNS YS WWYGFLVCLMTFEGK 
KQTCAHV 


3716 


17617 


A 


3739 


268 


388 


HFFFFEMEADSVAKAGV*WYDVGIjQQPP 
PPGVKRYS*LSL 


3717 


17618 


A 


3740 


395 


153 


GWLMRFKERCGLQNIKV*GEAASADAEA 
AAS*PEDLAKIADEGSYT*QWIFYVGET 
AFYWKKMPSRTFIAREKSMPGFKL 


3718 


17619 


A 


3741 


271 


386 


TCPPPIiKLWVYELGLli*RKNMRQGAVAH 
ACNPSTLKG*GGRISRSGDR 


3719 


17620 


A 


3742 


406 


190 


QIjLKRIjRQENPWRWGG*GCS KPRSPPRP 
QAWVTERDFVSKSKTKTNQPKQNGERKV 
FFWFFFLRKPKVQFEI 


3720 


17621 


A 


3743 


272 


408 


MGLQACANS PL* YVD IAI PCNNKGAHS V 
GLMWWMLAREVLRMRGT 


3721 


17622 


A 


3744 


211 


407 


KGKRNI LFGPVKNI FCDTQFTSCVNFFR 
EEVSLDCPGWSQTPALK*SSCLGLPTCW 
D*RRKQPRL 


3722 


17623 


A 


3745 




189 

107 


CVSQDGLDLLTS *STR3jGLPKCWDYRKC 
WTNQLFVNGQKS LNS FVCYS FMVLDTTG 
FKEIVMLWHLVPIiF 


3723 


17624 


A 


3746 


120 


362 


LKA*EISWLNGTIQTHQEKPEVQNQVIL 
EENIVFLGPQDRI FECQAVTAELEPEKK 
AIVAGCSDSHLYPSTLGGQDGWIT 


3724 


17625 


A 


3747 


401 


233 


NSGGRGCSEPRSLPCTPAWVTDRDPISK 
NKNKTNHQ*QKKCKR*TGKVIFMTRSIL 


3725 


17626 


A 


3748 


353 


109 


LGLANFCI FSRDGVFPCWPGWSRTPDLR 
* SAHLSLPKCWEYRA* ATMPGLLMCFLI 
KALDFQGCVGRWGGGSWTDIKLCQM 


3726 


17627 


A 


3749 


412 


221 


HGLKECLVYPKWCQNSNISLQGIHFSEV 
AM*QRAHNHGIHWFYHDSFPRKIWINWA 
RRGGSCL 


3727 


17628 


A 


3750 


1 


355 


FHHVSQDGLDLLTS * S PRIiGLP KCWD YR 
REPLRPAENTVYVHFSSWSFTNHSGKIN 
* TLDQWKEQLPGWL * I LRGVFFFNPS IW 
NS *QKKAS I YVFPLCWKRDI FLYHFPVS 
GTFRGF 


3728 


17629 


A 


3751 


423 


125 


QVCEKVL ISLITEMQ IKTTVRYHLTPVK 
MAFRQKTDDNKCWRGYGEKGTPVHCWWK 
CKWQPL*RTAWRFFK*RKIEIiPYDPAT 
PLLGI * P KERKSGQ V 


3729 


17630 


A 


3752 


3 


171 


FQPIAFLYANSEQSGKEESTS I YSSYEE 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoieucine, 
K-Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P^Proline, 
\£— vnu uminc, iv— /\ rgi n me, y^enne, 
T=Threoninp V=ValinA 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possib!e 

niirlpnHHft rf<*fotfnn \^nnccihlp 

nucleotide insertion 














YLGINLTKEMKDL* KENYKTLMEKIELD 


J /JU 


1 /Oj I 










RHE PRRPALTRFLFVF LYLQRLDHL I R I 
NS FF I MS VNGMFTFVVLIOjFNIj I HA 


3731 


17632 


A 


3754 


3 


197 


GLELLTL*SACLGLPKCWDYRHDPLCPA 
OYIiS FLYAINKLFSSTYIiE IHNRT.VCCT 
VTLLIYKP 


3732 


1 /Ojj 






7 




PRPAW 


3733 


17634 


A 


3756 


103 


2 


FRPGMVAHACNLSTLGG*GGGITRSGVQ 
DWPGQ 


3734 


17635 


A 


3757 


394 


244* 


ASTHPANFFCRNGVLLCCPGWSQMPGLK 


3735 


11636 


A 


3758 


341 


2 


AK IAPLHS S LGMKRDS VS KKKELKKLNK 
QKPNNPLKK*AKDMNRHFAKGIQGANEH 
KIKCSTSTOLREMQIKTTVRSLLI * VRM 

nllAAoin X9V*n w AIUUaXUb7XiJXX 1 WnCiU 

K 


3736 


17637 


A 


3759 


2 


322 


PPRPTRPLNNTINQMETICMYTKCiTiI I 

HIS* 1 1j r r^c>i-iJtlL7 1 1j _L xvJ. Lrrl L JJjjrLrViVa luN 1\. 

FKRI KI IQS AFFKHNG IKLENDKRKI TG 
KHPNTWKLRKHTSKPMSQEMFF 


3737 


17638 


A 


3760 


444 


422 


VRLPHRPS P P PDPRVRFHHVGQACLKLL 
LP 


3738 


17639 


A 


3761 


266 


1 


GGGVPLGPVlPPLRPPQRVGFQAIiAPQP 
GPGSKIF* IFFGPQFKNRVFFKBRNLVF 

l?PPT.T?OT.T.Pl?l?'n , T?l?*TVIVQT nU'DT'CICT E* 
1? rrJbroJjliirr c c ^IAjV JljL.rlgVyrl jijgj 

CSGM 


j ijy 


1 7AA(\ 


* 

A 




/U 




HQ TTG VT * B T?r , T?WT?l7T.T VT7MC 

ilo X 1 o xJL /\r iwur rbrur Vr rJuXrUSrlE* 
SHYVAQADHEIiPASWAPPTSASQYAGIS 

RSLAPSFFAQAGAQWRDLGS 


3740 


17641 


A 


3763 


229 


390 


DI PYSFCFFFFFWKGS FVFVPQAGGKGR 


3741 


17642 


A 


3764 


119 


1 


QTSCRPGAVAHTYNPSTLGG*GVDHLRS 
GVRNQPDQHGE 


3742 


17643 


A 


3765 


400 


200 


ITAYCSLDLIiQPHFLFFSRDKVSLCCPV 
WSRTPELK* S PCLSLPECWn* S * ATASG 
LKHFIKSLPL 


3743 


17644 


A 


3766 


376 


44 


TQP I FLGPGFQAP PGPMG * PNFLF* EKK 
LVFFPKKPKTAPAGGGGPKFPLPRKPKQ 
KKGPHPGSQNFQKPRPKFPPRPPTWGKK 
RE FFSOKKKKKKRERL7VKI TDTiNT 


3744 


17645 


A 


3767 


403 


3 


DHFFFKGPPSQKGEFPLFLKTFPSPLGG 
FG* ISLPPPPINFPPPPKKRIFPGPLSF 
FFFKKKKTYPAGKGrjLFFPFPICKPGGGF 
KRWPGPPKKFFFPPPGFWACSPPFFFFF 
FFGAKVSLVAOAGARSRSRTS 


3745 


17646 


A 


3768 


407 


177 


FHHVSQDGLDLLTS*SARLSLPKC*DYR 
HDPPRLARCIKNFCRGWRCNILRKSPLS 
LPPSAGELRAPFLDFLEDAO 


3746 


17647 


A 


3769 


292 


391 


RPGWAHTCNPSTLGG*GGQITRSGVQD 
QTGQY 


3747 


17648 


A 


3770 


395 


198 


LDMGFHHVGQAGPEUjTSSDLPASASQS 
AAITGMSHRAQPY*VISMCQQCYKHFIR 
IPFNLHNNV 


3748 


17649 


A 


3771 


1 


241 


RTRGRTRGRTRGLLST^HTGRYQKHPRIR 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, (^Phenylalanine, 
G=GIycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














TPQIFSTGTTKAEAGELLEPGGGACSEL 
RSCHCSPARATRVKLHVKKFKKIK* KHP 
RIRTPQIFSTGTTKAEAGELLEPGGGAC 
SELRSCHCSPARATRVKLHVKKFKKIK 


3749 


17650 


A 


3772 


112 


406 


FFFFFFFFFFFFFFQGEGVPPI*KPPPP 
K*FMFPKKKI,PPPPPPALKLFFFSPPPP 
FFFFF 


3750 


17651 


A 


3773 


24 


197 


IFSVEMGFCHVGQCDLDLLGSSNPPASV 
S * SAGVIGV SHHAWPLVKLFVKRNKLSY 

V ! 


3751 


17652 


A 


3774 


85 


201 


WFFHLI*FVNKRIKLEKKKKKKKKKKK 
KKKKINWKKKKK 


3752 


17653 


A 


3775 


1 


382 


WENHLSPGSRGSSEP*SHYCTPA* VTE * 
DS VS KKKKPNIiQEAEKRLTIaLL I KKAI F 
KNRGS PCFSIKWAGFKKIKQGWFFVATR 
MGDRSRGTPNLGENFLKSFLSFSIKIQR 
FYYLGLKNSPSRNIY 


3753 


17654 


A 


3776 


1 


181 


IKMATVTKATYRFCVIPSKLPMLLFTKL 
ETKI LKLIWNLERA* IP KAILS *KNKMG 
LSNS 


3754 


17655 


A 


3777 


70 


385 


LNKMQTKTFDSDTHTSICFTLHFINTKL 
QFIFLLAIYLLFVNFPFIFINFFEEPGF 
CFDDLSSCFLFY*CIP*FYYFFSAFFRL 
HLFFFYRFLSLLTIoDLFSFLI 


3755 


17656 


A 


3778 


2 


406 


KDDRNTEHQVHPNASDQEDEAFDPYENF 
FEDSDSPTKSSSTEPSPHIHPVDIQMTI 
FHCADNFARQYILAKLAKKKKKKKKKKK 
KKKKKDSSSPGGGPKWSAPPPPPRGEKN 
FFFFGPPLI FSGGGF*NGGGGGP 


3756 


17657 


A 


3779 


403 


176 


FLFP ILFFFLRQS FTLVTQTGGQQRDLG 
SMQPPPPRFK* FNEGKGKRKLWGGA 


3757 


17658 


A 


3780 


3 


248 


GLNCyTQIiRKILAYSSITHIG*IIAVLPY 
NPNITILNLTIYI I LTTT ACKKKKKKKK 
KKKKKKKKKKKKKKKKKKKKGGGGA 


3758 


17659 


A 


3781 


3 


330 


VDRLRTGVQDHPGKHSESPSLPKKKKKR 
* KILPKRGGVYLNPHFFERLRGENNLTP 
GGEGQRETESPPCLPAGGKNKIFV*KIQ 
KKKRGGPFKGNKFFPRGGGKKIFF 


3759 


17660 


A 


3782 


399 


187 


FPPPAKGDFLPSHLFGLPPUFSPFPVtfK 
PRPRNLILGPP+KKFYLPPPRP*TWFL* 
NGPPPFFFFFFFLDF 


3760 


17661 


A 


3783 


392 


11 


KiUiKvWLPDtlPAV PPLGlr PKJsMKAG 1 * 
RAICTLMFIAALLTIEK*WKQPNWIiLVD 
K 


3761 


17662 


A 


3784 


324 


80 


PGPFFFFPPPAKRGFFPPPFFLVPPGFF 
PPPLF*TPPPDFFFIjAPIjKKFFFS PPPA 
LNFFFFRGPPPFFFFFFFFFFFFFF 


3762 


17663 


A 


3785 


354 


24 


PPNKIFFSPPPQKKYFPYKTPPPTFFFL 
SSFFFFFFFFFFFFFFFFFFFFLC*MVH 
F I VKYRXCKRGTR I I LHTPTLRQLVLCC 
LSDVLLGCCIFIYVTMCDLCAPKIiG 


3763 


17664 


A 


3786 


3 


369 


SCRGLVWVYGELLLHPCRGNQISRVMQK 
KKKKKFGAS P PRECzAP PGE PP * RGP * TQ 
KEPPPLKK*APPPGPGGGRPPPMGLGGG 
PAPPPFFWPKAFFKKGPFSGQEFGFPLK 
GLFFWKFPF 


3764 


17665 


A 


3787 


1 


125 


FRHRFS RDGLDLLI F* SARLGLPKCWDY 
RREPPRPAHSIII 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoteucine, 
KHLysine, ^Leucine, MHVfethionine, 
N=Asparagine, P^Proline, 
Q=OIutamine, K=Arginine, S=Senne, 
T=Threonine, V=Valine, 
w-iryptopnan, Y=Tyrosine, 
X=Unicnown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


3765 


17666 


A 


3788 


•427 


251 


LTPKGRGCSEPR* CHCTPAWVTEGDSVS 
KKKQAKNRCS LRMKYE RNB FTS S PC 


3766 


17667 


A 


3789 


250 


2 


GFPPVEKIKKNPFPIF*KKPGFFFFPNP 
FF* I SGGKGKI FFLGPPP FS FFS PFFF * 
DRVLLCHSGRSAVAQSQIjTATSTFRR 


3767 


17668 


A 


3790 


412 


3 


GSTP ITTPFWGVGGALPPGGGVPPPPAP 
PGGTPFFFKKKKKKKFQNFPRPRCSRYN 
GGLGRKNFFTLEAKGS INPNFS PAPPPG 
GQNKTPFPKKKKKSR*MTLQSFPRPGTA 
PGAAM*PVSPPEPCGPPAPALSTK 


3768 


17669 


A 


3791 


10 


187 


GCSEIr*I»IHCTPAWVHSETLSQKKKGRG 
KGGFKKNGIPCFGKNKGGKRFGKSFGHF 
STG 


3769 


17670 


A 


3792 


1 


331 


FRVPLGRLSRGEHQQQ*QQQPPPPPPPP 
GPLRPLAGPSRKGS FKIRLSRLFRTKS C 
NGGSGGGDGTGKRPSGELAASAASLTDM 
GGSAGRELDAGR* DRPGAGRQTS FSW 


J / /v 

• 


17671 


A 


3793 


1 


431 


FRKGS ARCRAPKPAGADGVHWGHHRAS P 
QTAQAPPVTCSARGSWMDFSHASGPQDR 
DLHSPSPVCPRGWAPGTHSPTGASGA*G 
RTPVGGGK*VAGVQQRIHPGLAPG*LVIi 
PGIAVHr TKKPGS KGTITPGTVAHACMP 
NTLG 


3771 


17672 


A 


3794 


342 


48 


KRENMPGVAYNPSTLGGCNPS I VGGFCN 
RVCNPSALGGCNPS FLGGFCNHVCNPS P 
LGG*GGRISRSAAQDHPRQHGEKMSPQK 
I QKI S QAC F ARVG 


3772 


17673 


A 


3795 


2 


189 


LNPGGRGCSEPRSCHCTQAWATE * DSVS 
KKKKKKGSRFQNTI PSALPGFPVSGELI 
LGGPNL 


•J //J 


17674 


A 


3796 


1 


401 


IiNPGCKGCSEPRS CHCTPVWVSE * DPI S 
QKKKKKKKKK 


J 1 /*! 


l /0/5 


A 


3797 


279 


3 


I LAKVYPMVNSFLE KIH YFLLVSLCCFL 
QHVLYYLTFKTTETGSRSVAQAGVQWHD 
RSSL * PLPHGLKRSFFLSLPRS WDYMHT 
PPCUATE 


3775 


17676 


A 


3798 


229 


394 


VFVKSIiLCQPGMVAHACNPSTLGG*GGR 
ITMSGD* DHPGKHGEVSTYNTKKKKGG 


3776 


17677 


A 


3799 


380 


206 


AGHM * S * LLRRLRQEDPLSPGVQGCSEP 
* S * PCPPAQVTVRI S LKKKRLHERDKS S 
AS 


0 1 1 1 


1 /O/o 


A 


3800 


450 


3 


IYLiLRGRHSLHYSRPSFYFSNYRGACSY 
MRPPQQATLLGRLNQGITREGPAIAHSP 
AVRPRLSLAHAHVRRLCLPRPGAAPEFP 
AFFLPCLLVAQAGVRWRDLGSLQPPPPR 
FKKi? SLJjbbPSS WDDRR.PPPLSS * FAFL 
VEMGFRRV 


3778 


17679 


A 


3801 


301 


10 


KKQPPKKKKRCFPPLFPPPKGFKGSPPF 
FFLTPPPLKNLPWTLKALPQIFLRTFFF 
r 1? ciSWjorb v l\iA(jVIiWHDlAjSIiQP*TP 
GFKHMPSCPVLLC 


3779 


17680 


A 


3802 


420 


51 


SASSPPASFFFFPPPKKKIFSHPFFFFF 

P P FF FP P P F F* PP P PPP 1 T? PFP PTCTTirK'T T? 
17 k i- k c it k l jrrr rr tr tr *^ t> in, is I\ 1 Jc 

PPPPQKIFFFLKPPPPFFFFFFFFFFFF 
FFFCGFVNCQFSVLI 


3780 


1768t 


A 


3803 


253 


73 


QTTPSKRYSYVLNCCNN* ITTCKRIKLD 
P FIjTLDLKIN* KS FADLNVRAKTI DTI K 
KVKG 
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peptide 
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Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, J=IsoIeucine, 
K=Lysine, L/=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q^vylutamme, k— Arginine, a— oerine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknotvn, *=$top codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


3781 


17682 


A 


3804 


385 


55 


AGTRPVFAAIiAAAAPAPRLLPLAPHGSA 
ATGF P P P FCFHCjEGIiRGAP AA5 S£> P P JUA. 
HRSQTPNWVDCGESPIPDPQFPLL*NGP 
PALCLPQGLETVNVESLALCFTSHGN 


3782 


17683 


A 


3805 


191 


2 


QKHLKALNFLPRCTCVOTCTCVCVCVNV 
TF*ACKICTCSDFPGSPRKKYLEFGTKI 

TVTTmPXT 

INIVDFN 


3783 


17684 


A 


3806 


1 


367 


KLRQVT YLEDLPT 1 1 DEGS YTKQWI FNV 
N*TAFYWKKMPLKTSVTRBEKSMPGFKA 
SKDRIiTVF i/jANVVSUr KXtKFHlPJ. xJNoJS 
NPKAIQNDAKSTLPMLYKRNNKACIAAH 
LFTAWFTEYF 


3784 


17685 


A 


3807 


3 


145 


LGPGGGGCS E PRS CHCTPAWATRGKLCL 
QEKKKKR*RJjDPEAfivAvSRGRAIAJA2P 
GQQEGNS VS RKKKKKGRDKVS LCCPAGF 


3785 


17686 


A 


3808 


362 


86 


NFPJ? JUJoPPrr FFliWt»lri\y V VfPWUSVj' 

TPPGPPCEIPLPPKKKSKSPPGVWGPPF 
FPPPWGGGAPKFFLPPKWGFPQTQICPIi 
PFGLGEKN 


3786 


17687 


A 


•3809 


245 


377 


FGYMPS FF ETQSRSVTQAGVQWHDIiGSIi 
QPIiPPGFK*FSCLGIiP 


3787 


17688 


A 


3810 


3 


268 


KRNFLNLIKRRYEKPIANFVFNSKXVNT 
FHLRSSTS*GYPLLPLLLNILKVIiPNEI 
RQQKEI KBtUCNKGKKTISVYSIMIiSANT 
EKPK 


3788 


17689 


A 


3811 


2 


388 


SAENCLNPGGRGCGDIiRWCHCTPAWVTE 
RDCVSKTNKKETEKYMRN* TGNMLTI ID 
AGRRYMRCHQTIFTFFFFWRGSLIFLAP 
PPSvwwi.FijPnFFlSl^JUUxVKJL IIP 1 x 
FCFLRTDGFLPCGPGGF 


3789 


17690 


A 


3812 


292 


136 


PPTVIARKGKPRPKLSSEPWQFSAEQLG 
LPSFLCFFVCVRCCCLLLFLLLFCRDKV 
S ICCPGWS *TFGJuKQ5onIjo r PNCWDCK 
CEPLCRAI F* FFCLCTLLLFAFVS AAVL 


3790 


17691 


A 


3813 


359 


52 


KKGRFFFFEKNPFGDLVPKKKKKKTPPP 
PNPKKKKRGAPPFFFFFFFFRH*VPLCC 
LGWFRTPGLRGSSCVGLPKPWDYRHKPIi 
PLiTKDD X xvXCSJjP F V s 


3791 


17692 


A 


3814 


382 


1 


GGPGMEAPLGYHI KTPFPLKPQKNNPGG 

UCfUUr V V f f f ruu " lW>jXdr£&Litr JrrcyiCr IrJjX 

QKGPPPPPLGGKREPPPQKKKKKRVTAM 
PPHYI PNRNTDMCS C 


3792 


17693 


A 


3815 


472 


340 


TGFHRVSQDGLDLLNS * SAGLGLPKCWD 
YRRGLFI I I FNMGEWS 


3793 


17694 


A 


3816 


247 


378 


PPPPGGGXGXNLGLLKPLPPG*REIPPP 
PPRGGGNPGGAPPPPG 


3794 


17695 


A 


3817 


25 


465 


TPPLGvj * GKRr PRGPGFS PP PPPK.GNPP 
FF*KFKKKPRPGGGAP*FPPPGGGRPKN 
PLPPGPKGSH*PKFPPPPPPGETKRKPP 
PPKKKKKKRKKEKKKEI *AYKAYICFDC 

FijlTjroUJLiRolJlVl JjAJ? V JLijrliFKf C 1 U 
LSPR 


3795 


17696 


A 


3818 


475 


147 


PPKG*RHPPGPPRFRRGPGNPPPHRFFF 
VLRTPGYSFPF*KKRGPPPTFFFTLPGP 
PGFLKGP PSPGGGGQKI PFPRGGGGPLK 
* IPPFPPPPGKKKDFFFLTKKKKKKP 


37% 


17697 


A 


3819 


461 


57 


TNQKKAAP P P LL KKKF FF F FKKHTGGAP 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N»Asparagine, P=Proline, 
v^uiu limine, iv— Aigmine, o = oenne, 
^Threonine, V=Va!ine, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 

iiucieuuiic inocriiun 














PGEKKKRGGGEKKKRGVYRKGPRAPSRG 
CAPPLYYRGGLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFHVKNI LFVF* DYI VT IADL 

Tn Q T .P WWTJPOT?T .T .P P 


3797 


17698 


A 


3820 


1 


418 


HAYHI * KPSP * PLAGALSTLRMTSAIiAM 
*FHFHSITLLILGLLTTTLTIYQ*WCDG 
TPESTYQGHHTPPVQKGLRYGI ILFI TS 
QAFFFAGFF*AFYHSSLAPTPQLGGHWP 

^TY^TTPT.TtfPT.FVPT.T.N*rC.TrT T AQritfe t 


3798 


17699 


A 


3821 


385 


1 


LQFPPFSPLLFGVQRPFFSSFPSCPWWG 
PVPSKLLPIiLQILGRGFYPQKSLLVSLG 
FFFFFFLIIKKHRNEVLLCCPGWSRTPE 
LK*SSHLGTPKCSDYRCEPPHPPDPWDI 
oc. wp ft .n.Q'TO'BYtiT .nv 


3799 


17700 


A 


3822 


72 


424 


KKKKKKKKKFFPRKKGAPPAPPIiFGKAG 

KKKKRGGP*KKKKLKAGGGEKKIFFKGA 
QKKNPGAGFKKGGRGKNRGAPQKKGGEG 

sss 


3800 


17701 


A 


3823 


154 


2 


GHHTQHTEPQRLNNSYRYAELKHLSNTK 
nnp t * KurairnMNT? upq in?n t nwi 


3801 


17702 


A 


3824 


422 


212 


MGSSNPSPTASWVDEPTGICQHT*FYVE 
MGFCHVSKAGLGLLGLRDPPPSASQSAG 
VAGVSHCTRLWIIL 


3802 


17703 


A 


3825 


2 


417 


KRVNTRGFPPPPFFKTPPPKIFMGPQKK 


3803 


17704 


A 


3826 


423 


65 


AOGRGKYFKSGGRVKPFCAVDFPPSGKK 
CPCPMPKAS PGPVS FNKGGVAPATKDFC 
LYSIYNK 


3804 


17705 




Dai. I 






FFFT*F FTC FT iFPF PF FFPP F FFFFFPP FF 

FFFFFFFFSGA*KIFIIATIHY 


3805 


17706 


A 


3828 


425 


316 


QDGLDLLTS * SARLSLPKS WDYRHE PPH 
PAQTFFKQ | 


3806 


17707 


A 


3829 


3 


169 


LSNS * ANNLI * LAYTIAFI VKI PLYGLH 
t,*LPKAHVEAPIAGS IVLAAVLLKLGG 


JOU/ 


1 / /Uo 






O 




TrtPr¥TT!Piiprnpr , cirRTi^Di\rirD\j'CDx>r2r , i7 " 

GRKFLLFKGGGFQKFPPLP PGWGAKTPF 

LKPPPmCKKKERICSKKEKEKKRNPGPP 

VSPSSWSPVP*APSEQRSGPQLPPSRGD 

*WRLPSPPRPFRDCVRRSATRGNSPPC 

RGSOGP 


3808 


17709 


A 


3831 


45 


387 


NFGVKVFS *LATLHGSNMK* S AAVL*AL 
RYIFVPTVGKKKKNKKKKKKKKKBCRGGT 
PPGGEKRKFFSPKKKKKKGPLKNPOPK1C 
NFFWGKPQKKKGGKTPFWGGGKKAPKKT 
P 


3809 






JOJ/. 


HOj 


Zoo 


APCILMATFFQFEVLFSEFLETIVKSVL 
LH 


3810 


17711 


A 


3833 


294 


90 


FFFLFFFFFFLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFF*AGIYFI 
TCAPPPAYPRS 


3811 


17712 


A 


3834 


427 


37 


SYHFGRLRRADHEVRRIi*NRKGRPGAVT 
HACNPITLGG*GGRIMRSED*DH 


3812 


17713 


A 


3835 


499 


53 


AKLMYFQSRENVAKRSYRRNQKASMVHP 
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C=Cysteine, D=Aspartic Add, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, PHProline, 

T^hreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

A U'"» l IW»»'«y ^Jm/U UIUVUj / — UVNIVIB 

nucleotide deletion, V=possible 
nucleotide insertion 














NQEPAVI AGQGTIALBVLNQVK* LSS YH 
f*RT.t»VGMYHGCTiLVSSVTPFC!NIFSGIiK 

\ n i mi v Vj 1 A X HV3\ il 11 I » UWV X £ 4. VnXf JL.JL O NJUIX 

VH 


3813 


17714 


A 


3836 


304 


3 


RLKJ TKTNNNQTE INS * YI KDLNARAET 
IRLLEENIGGKPHDIRFAMIFWI*HLKK 
TKNIYKLDYTKILDFCISKDTINKVKR* 


3814 


17715 


A 


3837 ! 


475 


135 

* 


MEGLFCPAPMVHPFIPPPLGG*GGGSPG 
PR TOTP PGYPG R PPFFFTTj K"KT. PRRGAV 

FFFPPPLGGLGPKNVFTPGAGASLEPGW 
APSPPPGGPP*NPFPKKKKKKRLKIGWH 
F 


3815 


17716 


A 


3838 


415 


245 


ETGFHHVGQDGLDLLTS * S ACLS LPKCW 
DHRREPPHPDPQVSFIRALFPFMRAPLS 


3816 


17717 


A 


3839 


460 


413 


WDYIKLTGFCTAKETRVKRQPTEWDKTS 
AKYISDKGLKTRLHKEQ* KMTNLI *NWV 
K*LGRYSH 


3817 


17718 


A 


3840 


2 


470 


QGLLTSQTGWPGRDAPHLPDGVTAGPRR 
SPHLRRWAAGQTRSSLPRCDGGREEALL 
TS * VGWRPGRDAPHFPDWAARQRGSSHP 
P RW AAPORRSSI»PRRGSGRAEAAISNSA 
SLQRVSKVSVSCRPGG 


3818 


17719 


A 


3841 


54 


332 


QFGFHHVDQAAPELLTSDHLPSSASQSA 
OTTGVq WRAP P ^TjAHSFTHSIiAL * PTPA 
HSLIHWLTHSLTHSLSFPCSFSSPPSHT 
HSLARPLGV 


3819 


17720 


A 


3842 


208 


468 


ENPTPPPLWCFALGVL*EDGFSLYSLSA 
CYYEWHLFSVSLEFHTVS 


3820 


17721 


A 


3843 


406 


Z 


pqo c c o c c q qp pAAFFLfiCPGVFPSPPF 
ItKPAPGFFFWGPKKKNFFS PPRGVKFFF 
FRGAPPIiFFFFFFFFFF*DGVLLCHPGW 
SAAHAS 


*J Oil 

3821 


17722 


A 


lOA A 


11 f\ 

370 




VGT J .EPRGOGCS ELRSHHSTPAWATE * D 
PILKQNETEQK 


3822 


17723 


A 


3845 


369 


73 


LRQDNGMNPGGRACSEPRSHHCTPAWAT 
E*DS ISKKRKSONSVFTESYFASEHCYS 
LTGGGVAI LMEGPTGPDGAI FMCHLRNL 
CRCSPILPLALQCDH 


3823 


17724 


A 


3846 


198 


388 


FRKS INVRYHVNRINLKNHT 1 1 P IDAQK 
P * EKLQQHLTMKYS I IMNSRKLSQPALR 
NLF*KPS 


3824 


17725 


A 


3847 


391 


20 


GFPPPRVF*HGPANFFLGPPKKKFILPP 
PR* *TWFLIiKGPPLFFFFFFF*DGVSLC 
RPGWSAVS KN* QTNKKS 1 1 LP I RGKIjNL 
FVHSMLIKGFTCDLLIFKNRKVKESTKT 
IDHIFSRQLMA 


3825 


17726 


A 


3848 


227 


2 


KRRNPFHKAIAAIDSSDGPR*SKLKNFW 
RGFS IADAI KI FRDS WEVKI S I LTGFWK 
KW T TjTTiMDDFRGFNRTRGR 


3826 


17727 


A 


3849 


373 


2 


SSPQKYWGGPGFFPPPGFKTPPPPFFFW 
APTKKKIFFGAGRENFFFLRGPPPPIIF 

I *DIKTHVTLLHSVGFRVLVYKTRliYFT 
LYHSFLKNELG 


3827 


17728 


A 


3850 


394 


2 


EACSVTQAGVQWRNLGSLQAPRRQSFCL 
RGSLLSPQH*EDRT*HIGGTQQTSMNE* 



785 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycme, H=Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
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X^Un known, ^Stop cod on, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














MNLYLQ WLCPLL YQLCS TS YT * E VI QLN 
IAFCVCLVGFFCFFVFLFFETASQSFA* 
AGVQWCHHHS LQPQLPKL 


3828' 


17729 


A 


3851 


3 


164 


NHLNLKGRGCSDLRLCHHTLAWATEGDS 
X S KTKTVQ* RJ&JLiQNN I PHE HTCKNIj 


3829 


17730 


A 


3852 


35 


390 


NFVSNS FFFFFFGNGAPRGPRGGGQGPN 
U3*GEPPPPG* KQFSFIjTQPRGGNNGGG 
PPPRVNFGFIiRKNGVPPGGPGGPQP PDL 
GTGPPPPPEGSPPPGPQFSRTKRTAPGQ 


3830 


17731 


A 


3853 


6 


437 


PNK*KTPGLPPPPFFYFFPPPPPFIiSFT, 
KPFSFPPPPPPPLFFFLPPP 


3831 


17732 


A 


3854 


1 


236 


PTRPLRWIiRQBIWUjNPEGQGCGEPRSCY 

vhtpr watt? ♦tic? TgiriryyprjOPT vptvpt *t» 

SGGGQGLFFLRGPQKSTSGAAF 


3832 


17733 


A 


3855 


52 


351 


tvgimepqttdvyvklhkctdfllpdyi 


























* LTJjvj P P PPPIi XI FFF 


3833 


17734 


A 


3856 


72 


420 


NTYGKKKKKKGKKKKKKKRGGREKEKLW 
EAGGGGTTRGL* KENSGGNMGKPRI KKK 
GGKARGAPLGVSSTQLL*RGEGEKCLSP 
GGGGCS EHKTTPIiAx PGIK5KKKETQSGK 
EKK 


3834 


17735 


A 


3857 


410 


107 


SSSSS PIiARYFLGVAPGFPLPrLFKTGP 
RI r FJjGP P* KTF 5 IiPTPGR* T* S P* KGP 
PFFFFCRDRVLIiW * PGWSEMCGLKQSSC 
LCLQKCWRYRYKPPQIiA 


3835 


17736 


A. 


3858 


419 


239 


TTRSQLLLFLVEMGFHHAGQAGLKLLTL 
K* SAjuJjP K.CWD 1 JUvESli x LtADA* GFYNI* 
EDPL 


3836 


17737 


A 


3859 


262 


i 


TAAQEFNFGGPIKKIFRCQPGEVKLDPF 
JVKJrlrr r vc c r Lit r r r Lilfrr w KKVLiPtA_P 
GWS*TPEFK*STCLGLPK*WDYRHEPPN 


3837 


17738 


A 


3860 


357 


133 


NTPPPGAGRNKENPLFPKKKPRGVGPLF 

P PPP K"an*PPPWT,VPft PIT* PPT7I7P DDODrJ 
ir tririvcx-iTk.^ i. OXLVM JU 1 c\jtr]\ C c V T if if if if if \J 

GKNTKKKPPPQKKKKKKSL 


•JO-DO 


1 / fflZf 




jooi 




j / J 


Tr »s poc? * ot?p p p p p p p nnfiT.R & p p p p va 
LFFVFWGKRGVS P 


3839 


17740 


A 


3862 


3 


482 


YITDEVNDPSLTIKS IGHQW Y* TYEYTD 
YGGLI FNS YILP PLFIiEPGDVRFI* 


3840 


17741 


A 




i 
i 




LANSNYERTHSR 1 1 T IiSOGriOTIiT. PTi T A 
F* * LIiAS IiANLAKEG 


3841 


17742 


A 


3864 


289 


372 


NHLVKKWAKDLNRHFS * ED IQ IAHRHMR 


3842 


17743 


A 


3865 


266 


31 


KICFI FS PPQKKS VFFPKVRFPQKRKP P 
PALLFSAYFFFEAE S SSVTQAG WQ * CN 

t ysc pnp P P PRTT TTP V C PT ^CTT/^n 


3843 


17744 


A 


3866 


1 


395 


NTSDFCIKPRAYNESEHHWDMVRRHL*G 
ETEAMVKAVDSLTNLNPVTWVKTIGNST 

TatJPVT.TT/5CT,AQT,T.\7WPVTr>nT .opnc 
JLnlMI; VulliuwjnoujjvV X£vX l<^yiuu(i/o 

DQREGAMMTMAVLSKRKGG 


3844 


17745 


A 


3867 


137 


2 


APLIKKHECLRAVAPACNFSTLGGRGGE 
ITR*GV* DQPDQHGETP 


3845 


17746 


A 


3868 


1 


108 


ENRLNPGGRRCSEPRSCYCTPAWATERD 
SMS*INNK 


3846 


17747 


A 


3869 


198 


302 


XiLNINHNYGGV* GCS EL * SHHCTLAWAA 
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C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Pbenyla!anine, 
G=Glycine, H=Histidine, Wsoleucinc, 
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nucleotide deletion, \=possible 
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ASLHPKKKKKJUiKKKSVKSQGAFYSHjA 
MAILNYIRBKKLFLKWK*LLNINHNYGG 
VLEKKKP*SYTYNLKKSQNLPQLL 


3847 


17748 


A 


3870 


237 


419 


KEIQSKQGLRAICIPMFIAASLMIDKG* 
RQPKCPWREE * INKLWYKHLMAYHSALK 
RNEIL 


3848 


17749 


A 


3871 


239 


232 


KAFLHTRVKRG IQPGAVAHACNPSALGG 
RGGR I TRSGD RDNPA*HGFS WGKCLGI E 
LVGHMFNFLRNCQTVFQSGYI I YILPMP 
LDS 


3849 


17750 


A 


3872 


417 


219 


CHVAQADLKLLSSSDPPASASQSTGIAG 
I SHRAW PRGTNP IHKGSTLMT * LPPQRP 
HLLTLSPRGF 


3850 


17751 


A 


3873 


45 


414 


NILFFFFFWKKSNLVPQGEGQGQSLG*W 
NPPPRGLRGFFNLTPPRTGNKGGDPPPQ 
ENFGF+ RKGGVTQGGGGGS KTPAIjGGPP 
GLTPQRGGNNGQDPRPPAKKFGIiGKKKK 
NRQLPFIiGFFR 


3851 


17752 


A 


3874 


278 


2 


LISI YVS I IYLYHSGFQLSKISPPQGQL 
AYLS I YLS VCHFLI YEIGVITMPASKID 
K*DLIKIKSFCTAKETIIRANR*PTEWE 
KXrAXYSS 


3852 


17753 


A 


3875 


233 


379 


QCD YFNELLRME I LGPGAVAHACNPNTL 
GGRSG* ITRSGV*DQPGQHGE 


3853 


17754 


A 


3876 


6 


247 


ML ITSAS YPEDLAKI I DEGG YNKSQI FN 
VDETALN * KKTPS KTFTARDKS MPGFKA 
S KDRLTLIJjGANFKLKPMNDIjI L I 


3854 


17755 


A 


3877 


411 


3 


AKIPPLTSSQGHRTRVCQKKREREREKE 
KEEGRKERERERKKGRKKERKEERKGS * 
VRMLFLDKVDFRSQKI ITDKEGHYIMIR 
E *MHQ *DI IIiNNYI SNNRALKYMKQKLT 
ELIGRVGRPGRSGTVPGRRNGVQ 


3855 


17756 


A 


3878 


174 


375 


SFQQ*TAELEIKNTTSFTLTPPKLKYLG 
INLTKYVQDLYK 


3856 


17757 


A 


3879 


1 


106 


FHRVSRDGIjNLLTS * SARLGLPKCWDYR 
REPPS PA 


3857 


17758 


A 


3880 


383 


24 


FAI^NFPQGGAFSGPLTLSPQVGLFFAP 
GPPPPGGGFSCPLAQSFFRIFPQKSGPP 
ALSFSPLFKKNPVVFFIiRTQPLLPljFPQ 
PRVFFFPPFFFFF*DRVSLCCPG*SAVA 
QYGSLQP 


3858 


17759 


A 


3881 


400 


71 


RIRQENRLNPGGGCYSEPRSRPRTTAWV 
TG*DS XoKN^KNK*NJjSFIYFYYYFLR 
WSLTLSPKL 


3859 


17760 


A 


3882 


282 


3 


IFNFCGYIVSVYIYEVQEIF*YKHAMCN 

»TT_TTT7mvT^T7C' T nCO TVPT fWrtPHTVTT T TT 

NH1 VJsWLy VJs X PSS X I SLCY KQSWYXIJjV 
IIiKYAG WAQ WLTP VT PALWEAEAGGSRS 
QEMETILA 


3860 


17761 


A 


3883 


1 


175 


LYKNFFKIiAEHGGVYQILRRIiRQEDCIjS 
PGG*GCSEFKSCHCTPAWVTQQDPVSKK 
FF 


3861 


17762 


A 


3884 


415 


233 


SQDGLDLLTP*SARLGLPKCWHYRHEPP 
ALPS 


3862 


17763 


A 


3885 


237 


367 


GPVLGPGMVARTCNPRILGGRGRWIT*G 
RE FETNLANMAKPCL 


3863 


17764 


A 


3886 


359 


108 


TGQGGGGPPPPPLGRLKKKNGVNPEGKN 
SNKPKIiAPSPPPGGKRKPPSKKKKKKKG 
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nucleotide deletion, \=possib!e 
nucleotide insertion 














G * HLPMLPRLGSNS WAHDMI LP PYP PK 


->00*t 


1 T7£^ 




loon 

Joo7 


245 


"2 "7*7 

377 


JL KN IS WFGAY AYACNPTTXiRG*GGWI 
KRS RV* DHPDMVKPHL 


OoOO 


1 / /OO 


7V 
A 


Jooo 


406 


2 


SLNIFFFFFFFFFFFFFFFFFKPGRVFP 
LKNFFFFPFKGCRIPFPPIFPLWLKVFF 
GPTPL*NTPRG*KEGFCPPPFFFFFFFFF 
*GRVLLCRPGWNAVAQSRLTATSAHAS 


3866 


17767 


A 


3889 


3 


114 


EFHCVSQDGLDLPTS *SAKLGLPKCWDY 

DD 7BOPDA 


3867 


17768 


A 


3890 


395 


1 


MVLIS * PCDPPTSASQSAETTGISHRAR 
QFPMLLSQ * LVCHSVTEVFL I FLLKL FY 
IYDILLYI IYNYILLI IYYLFLFLLFFE 
TGSCSVIQTGVSSTITAHYSLKSHPSGL 

wd o out ot n c p^nvnnn tvp 
KKo oHJbb LiPS SQD YRCAPT 


3868 


17769 


A 


3891 


2 


189 


GRVGERGGTLL* SQLLRR*ENHLNLGGK 
oCo E P RliHHCTP S WATEQDS I S KKKRS F 
IKALHM 


3869 


17770 


A 


3892 


410 


258 


GC*GCSEI*SRPCTPAWATE*EPVAQKE 
KKJUUU4KEIUCKHVIjATLAI s s l icl 


3870 


17771 


A 


3893 


244 


412 


GIBEACGNFRVIVLKSSLALEGEEPSGR 
DSS * EASGGRARWLTPVI PALWEAEAGG 


3871 


Mill 


A 


3894 


208 


2 


RISPPITPFPPWVTKKTKG* PGFFFQKN 
*KSFFFFLRWSLALVPQAGAQ*IiDLGSL 
LPPPPGYKPFSC 


3872 


17773 


A 


3895 


407 


18 


PLIiGHPPKKAGAPLiIFNLPRRPK*KPEF 
PPPPLSPTQPPPPPPPPICPAWLPPPGV 
PSSFKG*MGGPP*RPEV*GPINPFWAPS 
LPVWPQNPIPFPKKKKWGEIHCMVCPLS 
FNETFS FFKMESLRDGK 


JO ID 


17774 


i\ 


3896 


82 


401 


KGFIiKNSPKGEGLFLFSFPFFFLFLATW 
LWPRLKCSGAI IARCSL*LLGSSNPSDP 
PTSTFGEAGITAAGHAQFPPPDSPPPKT 
KAHHVTQFGIiKjjPS *ssppas a 


3874 


17775 


A 


3897 


263 


396 


VWWPGGLG*Y*TPPIiI/3GQGGRITRSGD 
QDHPG * RGKPVSTKNT 


JO / J 


1 / / /o 


A 


3898 


2 


386 


KTGIiELIiGLSNP PASASQSAGI IGMSHC 
AWPNLFLYWNHHLYPHKK*VWPGTVAHA 
YNPSTLGG*GRRIA*AQEFKTSL 


3876 


17777 


A 


3899 


140 


33 


IFFFHL*GEIGFTMNMLDIGGGFTGTEF 
QLEEVNFSVNDINKIIAFFVT*GVDV*S 


3877 


17778 


A 


3900 


377 


2 


TPPKENGGLPKIKVPYFRGNKFKKFPLC 
C £>l\ V r* lr 1 o L TWtt V JjJVWUryFr 1 J. \r e LiKjvJNr 

FKTPKKGVSHYNFPPQPKS * GVFFFS KK 
KNFYPFFWEEQKKQRGPNAFLFFLKKKK 

P B. A B.P PTP DDTP P 


3878 


17779 


A 


3901 


2 


123 


RVSQDGLNLLTS+STCLGLLKCWDYRHE 
PPCPALRCVFYG 


387Q 

JO/7 


1 / /oil 




,jyu2 


loo 


3 


VAHACSPSILRGQGSWISRSGVQNQLGQ 
HVETP 


3880 


17781 


A 


3903 


422 


238 


ILHVQLPSSWDSKRKPFFFFL*R*GFTM 
LPKTiLLNS *GS SDPIiASAC^Rl^TTRMq 
HYTWP 1 


3881 


17782 


A 


3904 


414 


1 


STHLGLPKCWDYRHEPLHPASMSYL* * I 
FFIYLFTHWSVSLTARMQFAEDRDLIFL 
LHCYISSDSNSTSWL*YILVFVHSPWL 
VTTIALILGFYVFVFVFLFFFETGSRSV 
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AQAGVQW*GLSSLQPSIPPGFK*FSC 


3882 


17783 


A 


3905 


396 


236 


DRVSLIxSPRLECNGTISAHCNLCLLGSS 
LLCSALWQKQPYTMYKGMEMAVCL 


3883 


17784 


A 


3906 


377 


57 


RGLVFFFQNPQA* KNLGRGRVFFPSGAQ 
KNPQKGGWAFPGRPPPRFGGRPRGPGAR 

GVPLQWVGGNSGLFFSKKKHKL 


3884 


17785 


A 


3907 


341 


3 


iKc a t r oir-LAjiviSAjr i?o.rwjrr AX»»jXir±<.v r Jr 
LPPFLNPGPGIYFWGPIKKKKTFPPPGV 
KFGSFKRAPPFFFFLMFIAALFMWKTW 
irnpPPPQVQE* ♦WKT.QYTi+TTRYPPQTTC 


3885 


17786 


A 


3908 


3 


162 


MMRKAI RGHL * NNPALEKLL PH IRGNVG 
FVFTKEDMMFQKPALLQHHSQYI 


3886 


17787 


A 


3909 


1 


421 


PTRP PTRTRWGFTMLVRLLLN * KHSQAG 
AVAHACNPSTLGGQGRQ ITRTGVQEQPD 

yHvjlSU F fa ttx KoK X rl\j 


3887 


17788 


A 


3910 


2 


424 


PRVRTSPGLALSGLTGLKFS KNQKKKGK 
KPQTTKTQKKKKS PKKKKTPKKKKGGAP 

LuvKACI ITMAKIsvJfiJN JTrr r ~vjjt1\jRJW>IFVjAK. 
FKKGGGGKTRGTPKKKAVGKTPFFRGGE 
KKKES PSQFYKKI KKKVFGGKKI FP 


3888 


17789 


A 


3911 


3 


236 


GFCHVGQADLELLATDDLPASAPQSAGI 

Q/TT.QWr'ntJPV* PWVMQQT.tTQT.TiHTPPQT. 

HRVKU2KNGLRKTRKLYMKNS 


3889 


17790 


A 


3912 


304 


2 • 


VAKKGKSKPGGLKKNPPKIGLGYPQGGP 

r>Kin T UT? 17 T TDTTMPWHlVPT *nr2f2f2PP 
UMU-L WifiJvr rcij 1 JrAJWKWartJr Li * crtjtdijir*' 

QTjPKTTP (2PPP PGWSNPPPP FFFFFFLRO 
SCSVTQTGVQWRDLEL 


3890 


17791 




3913 


91 


1AA 


WAAPVfYSPRTT.KVWPfTAP^ WPPfl Q WAHP 

PPTWPEPSGTTWAPEGTVVTIiGPPQELS 
LRRGVSYHCPPP*PLIWGRKWPPPIjPCL 


jo91 


177y2 


7\ 

A 




32o 




JfCSWRnWP VAVAHVfWP ^TTiGfJ+GfJR 

IVjLjInlVJVj r¥ Cr ,/vrV VAQ V v-IXT U1JJVJV3 VjvjCv. 


3892 


17793 


A 


3915 


16 


179 


GGGCSELR*RHCTSAWVTRARLSQKKEK 
KKKGPRKGEVATGSVWTRGELKSWAL 


3893 


17794 


A 


3916 


229 


46 


AGPELIiASSDPLTSASQSSRITGIPFLT 
GL.YSKDHI IETANI SYTNA*AL* ITLRT 


3894 


17795 


A 


3917 


42 


383 


AFPCFVIGRVISKGKECi'LYFFLFLFFF 

GLKETPLPHLLEEGKFWAPPPPPVYFFV 
F**KRDFTLVARWSNRLPSREKGLDYR 
p 


3895 


17796 


A 


3918 


402 


286 


PPPPF*TPPPPKIFGAKKKKKNSPPPPG 
ESFSF*KRPPP 


3896 


17797 


A 


3919 


349 


380 


KKSPFPQ*RNFGSLQPPFPGLKPSSPLS 
AQ 


3897 


17798 


A 


3920 


28 


327 


KTTLLNQATI IQAKAGTWGHPP PLS PRE 
KKKKKKK3CKNGGGGP * KKKNFSGGGGEK 
















moo 


j± 




A1A 


i 
i 


NFPKVSGPKLMSPSPPHFFFPKKSFPPP 
PLFLEKKNGRFS PPFPEKKV* F * KKPPF 
FFSPKKORFPPKVLKVGGKKRVLKNLG 
PGLLS KKGRGTP P P P PL FFFFFF * DGVS 
UXPGWSAVVRSQLTVTSASQVQVOE 


3899 


17800 


A 


3922 


1 


169 


LRSCKCTPSWATE*DSVSKKKKKKTLKN 
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LSPQKLNRAGKIFIiRVFPLEKKGQTQLL 

Xl L\-T^\J A\ X> L. XJXv VI r l ipinni X \£ 1 1 1 ,1 


3900 


17801 


A 


3923 


263 


74 


LLSVWRIPCAQKJ^RKKRYECECFIilVF 
upnr.c EENS Q V T J? FKP TIj CVC VCTVCVCV 
CVCVW* VKNFRKS * *SSDRSKTKTIKKH 
SHSYLFFLSFFCAQGILQTERSHLLPHL 
EVSLFKE 


J/vl 


1 / OUZ 


A. 




j 


TOO 


YRHEPPHPAPFCD 


3902 


i /ovj 


A. 






in i 


NTj P Ii PGATD YPAS GGGGCNEMRS CHCTP 
AWVTGAKPHLRKK* KKKRQMLKTISKEM 
GNFNMVYILDKITK*FLYFKQLILSFLM 
GELGVK ' 


3903 


17804 ■ 


A 


3926 


110 


1 


QLGAVAHACNPS T * GGQGQAEHLRSG I Q 
DOPGKHGK 


3904 


17805 


A 


3927 


180 


368 


EINFITNYYIILFFNRQGlxALSPRL*SG 
GAI ITHGTLKHLDSSEPPTSDS * VAGTS 
GTYHHA 


3905 


17806 


A 


3928 


92 


2 


VRPGAVAHACNPSTLGGGBG+ITRSGDR 
DH 


3906 


17807 


A 


3929 


2 


146 


HIxNLGGGSCNBLRIxHCCTPAWATE*DPA 
SKKKKS FKITSAFTCOIVML 


3907 


17808 


A 


3930 


221 


3 


KERAI WGTELNMVS KIFPI* LKQRKNL * 
PPNI KCYNNLLRP * LGPGTVAHACNPST 
LGSRGGRITRSGDQDH 


J/VO 


1 tC\J7 


A 




i 
l 


1ft7 
1 IM, 


RG1? G PK K P <5 <?HNPT P A * VT KR D <5 T T 

UUKuw Ct c O OxxXi l>» X. IT V 1 lUU/O XO All X 

NNKIVK 


3909 


17810 


A 


3932 


351 


69 


GQGGG FRFIQPPSSRGKRFFPPPPPP I W 
G*RIGPPPRANFFFIiFF*KRGFFLFGRE 
IFFFPP PGVPP PWP PONFG FOGGAPP P P 
PPFFFFFFF 


3910 


17811 


A 


3933 


401 


94 


RGPPPIxNPPLWEKNHAGPPKVGG*KPPP 
PTKKKPIjFS * KTKI FPPGGP PPT. TPP PP 
GGKPKKFLYPPKFRFREPKKGPFPPPRG 
TKKKTPPPKKKKKKKKVS 


39H 


17812 


A 


3934 


184 


2 


LKEFFPSRNSRSVLHTSNF*HLFI*LI I 
YFFBIESYSVTIiAGVQWSDIiSSLQPRPP 
GLKQ 


3912 


17813 


A 


3935 


2 


393 


KIL I KVE I EGTYRKI TNAI CEKPTAU IV 
PNRKKLKAFPLRNGTR*GCPVS PVLFNI 
TLEVIARVIYSEK* IKGTKIQISIATME 
NSKKFPQKTENKTTICSSNSTTGCISKR 
KEMHISKRHLHSPVYCNT 


3913 


17814 


A 


3936 


291 


2 


FPPPPQTRGPKSSSRAHSPPGPGPGSTT 
GAR* PFRPLIxLVRASLRPPLGLPTPTGP 
TGGMQRPEA* RTPATDKPQNSRRGP*GA 
WGRPGCRCRGGA 


3914 


17815 


A 


3937 


382 


1 


QWTAQTWSVQNELLLFCLKPHPPPSLLP 
PSSCPGPIiliGTPWTPAFPSHPTSTLKAH 
PIGSGPSSPLLLTSPRMPPEIQTTATG* 
PASSLTPTVCSQHGNRFFKKGVRAGRSG 
SHLE SOHFEGPEAGG 


3915 


17816 


A 


3938 


398 


2 


PPPIiLLNKIGAKKKAFFCPS PFGPRKI P 
CGQKTKKSVFPFRVP *DKSTFDCFLCPI 
VYVKKQIWLAPLHCVFSGRVAQGSKKSP 
LCLSLPFFFFEMESHPVTQAGVQWRDFR 
SPQPPPPGPKPFSCLGLPSS 


3916 


17817 


A 


3939 


240 


385 


KIxLiFLFLFIYFF*DGVSLCCPGWSAVVQ 
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Amino acid sequence (A-AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possibte 
nucleotide deletion, V=possible 
nucleotide insertion 














TLLTBPSAIPGAGGTPVSPI 


3917 


17818 


A 


3940 


230 


381 


LISSSYREYIiAKDLRFSSKQS * WLCSKE 
FRGIjGAVAHVCNP s tlggqgrw 


3918 


17819 


A 


3941 


1 


107 


CWPVWS*TPDLKQSTCLGLPKCWDYRQA 
QPHRNLL 


3919 


17820 


A 


3942 


2 


95 


DLLTS* SACLGLPKCWDYRRKPPRPALF 
AFS 


J7iU 




A 




163 


404 


CRHKAQWLPQLPGHLLLSEKTLKYSS PK 
GLRSQH*KLLFLFCFG*TESRSVTQAGV 
QWRNLGSLQPLAAGSKRFSCLSLL 


3921 


17822 


A 


3944 


3 


200 


GLDLLTS *SAHLGLPKCWDYRREPPRLA 
YFTFI KKKKKDGCELLNLFPNS CMDTAH 
VHPAHLTGG 


3922 


17823 


A 


3945 


423 


2 


LFFFFSEV* SHS VAQAGVQW YNLGS LHS 
S 


3923 


17824 


A 


3946 


209 


413 


TTCIEQCNKTRRLGAVAYACNPSTLRG* 
GRW I TRSGVRDRPDQHGETTKNTNI STT 
KKSTKNTKjCSQA 


3924 


17825 


A 


3947 


425 


3 


SPSQGLRDKTP I S LGQSTWGKGQMWAQR 
QQT*TSWPDGSEESSRSPSTFEQQISER 
SSSAKGQTNSSSGSLNSVSPDWETPPSR 
SQQTPHTGELQL&SGRCSSGMNLPEEGT 
DSNLCCSATHAS 


3925 


17826 


A 


3948 


299 


3 


ASQTTFREGCFLFFSFFISVCIiFVCLFE 
MEFHSCRPGWMECNGDRARLHLKTTITT 
TIEIVKDDTNKWKHIH*RLPNIKMSILP 
KVIDIIiFNAIPIKI 


3926 


17827 


A 


3949 


331 


374 


NAPKSERASKAF*AILEKNKKKJjPLD1W 
P WHPHGH I FKCVAP PHCLGWDLS AVP A 
SHALGFPQPQNSAWHVARSKSAASCLGT 
AVTSAGSGVLGTPNKMPQKVRERARLFW 
R 


3927 


17828 


A 


3950 


439 


69 


TNQKVWVPPKEKGLFKKKQLRPRVGGPA 
FHSPPLEGQGS * FP *GQRFKPRPPPWVK 
PLFSKNPKNYPGGGAPPFFPPSPGG*GG 
KFPPTPKAKVPFIQISPPPAQAGGPTQV 
PFPKKKKKKSH 


3928 


17829 


A 


3951 


422 


3 


CFPALGIGPPHPPKRVG*KGGTPPPGLF 
FGKFLKEKNFGKNPGPGQGGPFFPPKKQ 
RGPFNNGRPGKVKKKGAPGPGVPKKKPP 
RVPPPPGFFGNPPGAPPKFFFSPRGGED 
QKRKNFFFFFFLRWS FTNEAQAGVQWR 


3929 


17830 


A 


3952 


223 


3 


AliFFWGRGTLPRVKGLNPKGGGGKFFFF 
FFF*ETESHSVAQAGHjQWHDLG*LQPPP 
SRFKPFSCLRLPGS SIiS 


3930 


17831 


A 


3953 


1 


388 


E KGVS PGGRTCSEQRWCHCTP VWATERD 
SVSKKKGMSNNFSLILRSWNNKGQGGTY 
FQQKFI YPCFLKI KGHS YINCKI FKK+ * 
NLSLHVHVTDVNVRIILQKLDCQIHWVD 
FNFELN I LQCQGLGGCK 


3931 


17832 


A 


3954 


1 


396 


KLRPKDQDTWPTQQSAIWEFVPPSVYF 
VKSISEILFFFFLKKNFFLGPQLKGRGQ 
I LVKGNPCPGGKKKFP APPPQRGGKKGG 
PPPPQFIFVFLKKTGV* KIGPGGV* TPA 
LGIAPPFFPKGGE * RGGPPP 


3932 


17833 


A 


3955 


2 


199 


PGGGGCGEPKTRGCTPAWATERDSVSKI 
NK*KLKLKNNSWLYAKSLASSPTGR*EE 
WGCFTNEHI 
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Amino acid sequence (A-AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Hist id in e, I=Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Profine, 
0=Glutamine. Rr=Arf>minp ^=S«»rin** 
T^hreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 

niirlpnfiHe insprtfrm 

IIUUCUUUV IllOwl nun 


3933 


17834 


A 


3956 


393 


2 


KBPPIjFPPGAPPWPIaSGPFSVLEKPKCG 
GGWPG INTPPGGGGFPQGSIiWAGDFP * 
PLFSPQIPFFKKQKFFFFLPRGGLWAPF 
FPPFGGPLFSQNF*lCrGF*GGFFFFFFG 
DGVS LCYPGWS AVAPHAS 


3934 


17835 


A 


3957 


278 


390 


LVYI FKI RTCW PGAVAHACNPGTLGG* G 
GRVTRSGDQ 


3935 


17836 


A 


3958 


385 


311 


GPKNPPASASQKGETTBV*HHAWLIFFL 
FFVEIiWSSHVAOAGLKLPASSDOS * TL Y 

I 


3936 


17837 


A 


3959 


410 


119 


SPPPRFKTPPPEFIFGAPKKKFFSPPPR 
PLNLILLKGPPLFFFFLPKAFKPHINFV 
FPPOKKKNV* IWLYFPPPKICENES*;* GIT 
HFALHCNPSCISK 


3937 


17838 


A 


3960 


77 


400 


TALPLKPKKKKKKKKKKKKRGGPLKKKK 

jl i\rvrvjLiumuii r nunc iviuvc ivwis v cvavj 

ERKKPGGNKKKTFGKKPPFPRGEKKKKP 
QA 


3938 


17839 


A 


3961 


2 


194 


CL*SQLLGRLRWEDHLSSGN*GCSEP* S 
PHCTPAWVTE * DPVSRKRKKYPWNQTIi 
RGFLSPAH 


3939 


17840 


A 


3962 


52 


409 


NS KTS SQEKKRR.VI RLS LRSRVGDDTKV 
NTRRWGVFGWHLKNXiPTTLVLKKMEKIL 

GGTITDLIVF* I VSKPNAAGLSLNENVN 
AKFNDLI 

jTXiVL KM UX1 ■*» 


3940 


17841 


A 


3963 


382 


96 


LSSQLLREAGERNE*TGWLQQPLQMQIH 
KNP C I E PRSRYCTPAWATEODPVSKNKO 
TNKQTKNSLHL 


3941 


17842 


A 


3965 


3 


169 


DAWELRSHRCPPAW*QSETPSQKKKKRG 
KKKI FRFYLFFNKRKKFLNGGNLGVI P 

******** *>* *> *J-4* *- XVXVlVX\JvVL *J1i W Wll *AJ V JL JC 


3942 


17843 


A 


3966 


1 


121 


AGFHRVSSDGLDLLTS *SARLGLPKCWD 
YWCEPPHLAENS 


3943 


17X44. 










PP FR I YWGRVLT PIj Fr iR V1CPK P'PKPWGF 
PFCLFFFFFFEMESRSVAQA*MQWRDLD 
SLHPPPPAFK 


3944 


17R4S 

1 /0*T«/ 






4.71 

*r f I 




AGILGVSHSTW * EAPNSLAITFLSSHLS 
PRQP*THLGC 


3945 


17846 


A 


3969 


214 


2 


EGQPECQLQAGDWVSLWNLVCSGLEVRA 
WREAPRLEWSFCLFFVSEM*SHSVT*DR 
VQWHNIGSLQPLPP 


3946 


17847 


A 


3970 


53 


398 


GDLPKKQQQQKPIiNFCFQLLLYARLFGP 
IFLV*HLATYNNKHLLECVGKVMINCQA 
D I LNKME I S KVGAQKKKKKKKKKKKRGG 
PLKKKKFIPRGGEENFFFLGAPKKKTGG 
GFK 


3947 


17848 


A 


3971 


396 


160 


KRERIKEKEDNKEEKDKEKKASGKKERK 
KEKB I RDRESKEES SNLLLQEELS LCCP 
S PGHPPNSSL *DP YFATGNSIiRC 


3948 


17849 


A 


3972 


416 


3 


FP FPRVPEGVPGGEGVFP PGFFGKSGVP 
FPRPRGGGAPI *KPPPFPGGPPNFFF* K 
KTGPPFSPPQGAPPAKRAPPPFFGGGGG 
KIFTPLKKGGLFFFGPPKKGVFFGPKKT 
LFFFQKKKKKKKGRPPRSHASAHAS 


3949 


17850 


A 


3973 


232 


381 


YKTFSLIF*LFFFRDRVSFCHPDRSAW 
LS*FTVASISWAQGIXiLAQPI J E 


3950 


17851 


A 


3974 


394 


141 


CFFPPPPKRGFFPTPIFGGPPGFFFPPF 
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C=Cysteine, D=Aspartic Acid, 
Er=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, k= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

O — fHitffimiTiP ft= A roininp ^=S*»rint* 
vjiuia iu mc, jcv giiimc, o — oci me, 

^Threonine, V=Valine, 

X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 

DiICicQUUC lIlMsrUUI* 














F*NPPPKIFFGAPKKKKIFPPPGGKNFF 
FT .V dX> PPFFFFFFFFFFFFFFFFFFEDG 


3951 


17852 


A 


3975 


286 


422 


RKQLISPHSQDNSKQKEQSWRHHATCLQ 
TILQGYSNQNS + * APIAKTILS KKNKAG 
aTTT.PVFKLYYKATVTKTAWYWDOK 


3952 


17853 


A 


3976 


3 


116 


GFHRVGQNGFDLLTS * SACLGLPKCWDY 
RHEPPRPAR 


3953 


17854 


A 


3977 


223 


399 


LLPPPPGIK*FFYLSLPSNWDYRRLPPR 
PVN 


3954 


17855 


A 


3978 


227 


422 


^jp iro" LXC JtoJj vUXjv? vouj uiAJHDyiiiW 
LKGSSLFS FPKCWDYRQ* ATTPGLPVYF 
FPFPVFTFF 


3955 


17856 


A 


3979 


519 


313 


DCLKKKKXKTSVCVCVOTCVCVCV*NHL 
SSLGLFPEPVGLN 


3956 


17857 


A 


.3980 


217 


414 


IYIFCGDGVSLCCPGWLKQSFCIiSLPKC 
*DYRHVPPRPTSFHLLTNIGVLQFLKLQ 
AlJuor Xlir XxLi 


3957 


17858 


A 


3981 


2 


409 


KLTFAV*DY*EKIjLIiSLASIiTSQAAFI»F 

QFYAKS*CVTYE*KKKNLLSPPTGDKPN 
*f2P CrsTVTf TfYTtrV* QRTCFFVKTiFRTGS 
HHVAQAGLELLSSRDPPTSASQST 


3958 


17859 


A 


3982 


234 


416 


" SVLPSTPLLNKL* KSKKKKKKKKKKKKK 


3959 


17860 


A 


3983 


414 


291 


GGRACSEPRSCPCTPAWATBRDSV* KNK 
QPNKQKNKKDLL 


3960 


17861 


A 


3984 


192 


415 


TOANKl^NIXlEMDRFLETQNLLRLNNBE 
PDGFTGEFYQTFKEKLIP 


3961 


17862 


A 


3985 


403 


145 


AFFFFFFPPGEKGDFSPTVLFGGPPGFS 

O DT5\JT?Tr , TT> DPNT7FT/ZP PTfrfPCVQ FPPPRP 

KNWFP* KGPPP FFFFFFFFFFWKLGIEV 
IS 


3962 


17863 


A 


3986 


255 


425 


nr^DTywr' qtct crqafiT ^V5FYY yyhppitft <5 
RPGV*QRGEWKNPGVPNLFALSTSPPGR 
W 


3963 


17864 


A 


3987 


247 


2 


EEAVRVSNLYTKLDE EQ * WKKQIiNYVE 
RLKK* ELFLQDLYRKVPLKI * I F* VKNL 
«q FWPD AVAHACNPSTLGGRGGW I T 

worn tri/A v nnnvii rwi *wv3*\ww •» j. j. 


3964 


17865 


A 


3988 


1 


394 


HHHTWI IFKFFCKDGVSLCCLSWARTPG 
LKRSSCFRLTKCWDYRHEQLCPAFKHFF 
TiY*HWFMSTEKMOE *HKNS* YPS PTVPK 
GVTFYYISYICFIILILSAITYKFVHTY 
IVSLNHVCYSNEAPLSUJI 


3965 


17866 


A 


3989 


208 


402 


ETDRRLKSGGSLSKTLIKCMTCFWFRD 
GVSLCCPGWIQTPGLK*SSCISLPSSWD 
YPHMPPCPA 


3966 


17867 


A 


3990 


2 


408 


THRPYHATPTxTjPFYTNNLIKGGKKKKK 
KGGKKKK3CGGPPFKKKHFSPPGGEKNFF 
FYGPQKKKSRGRV* KRGEGKNPGAPQNK 
PliGKNPLFPGGEKKKKPRFAS * PPPPPF 
S FGKKRAP PTKYPG AGLHP 


3967 


17868 


A 


3991 


81 


406 


KGGLFLL.PRGKGKGKIWVKGNLGLKNKK 
NFPA* PSEEGEKKGGPHQAGKILKFGGK 
RGU3KGGKTGQKLKPGGIPPPGP*KVGE 
KKEKPPAPGQKLFFKKRKGQKKNF 
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Amino acid sequence (A=A1anine 
OCysteine, D=Aspartic Acid, 
E=€lutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
O^GIutamine. R=An*inine. S=5Iprinp 
Threonine, V==Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


3968 


17869 


A 


3992 


396 


64 


YSSPPPTKKTDDSDASRGPFFFFPPPEK 
RGFFPNDFFWVPPGFFPPPGF*TPPPVF 
FFWAP*KKIFFPPPRE*KFFFLKGPPPF 
FFFFFFFFFEHLQVLLIC 


3969 


17870 


A 


3993 


367 


193 


KLWEKKKILWKKG*GE IKS CHCTPAW AT 
RVKLCLTKKKKKKKKKGGEGEIFKKKSF 
S FLGKQKKKRGPAFF 


3970 


17871 


A 


3994 


188 


2 


KRANTLPDFECTYYKAIVLKTLWYWRKDK 
HRSMKQN*ESNEVELIjYHNIHRNKLKMD 
HRbKCQ 


3971 


17872 


A 


3995 


2 


147 


OENHLNPGG*GCS ELRS *HYT P AWATF * 
DSVSKKNYI * *TSHIFSGDF 


3972 


17873 


A 


3996 


23 


329 


RNSFIRSAFSNVLNKVVLFLTSEKKKKK 
KKKKKKKKGGRKREYLTRRGAGElIKrfnrF 
YK*KKKKOTCG*KKKGWGRKREREKKRL 
GGKKKFPPFEEEGKKKPG 


3973 


17874 


A 


3997 


164 


392 


KPLGRAGLVPCLCS S FTGFLM* KTEPCS 
WQAGVQRGHLCSLQPQSPMFKRFSLLS 
LPSSWDYRVIiLRHPSWSAVV 


3974 


17875 


A 


3998 


396 


3 


FFFFFPPGGKWGISDSPQLFFSGVKNFS 
PPPPPVNWGLRVWPPPQKNFFFFF*KKV 
FSFFWGGPPNPPFFFSLLLGGPPPPPI F 
SGEKI PPSLFFFFFFFEMES CSVAQAGV 
R* RHPGPLOPPPPWLKOF 


3975 


17876 


A 


3999 


383 


2 


VPSPEKVGIKKIQPLDPFF*FLGDCQRK 
KKSPS *RPPPPPEGKKLN* IRLMGPPFN 
SKEKFF * KF* AFKPFFFPPKGFR T *C;FF 
SRAHSFPNQRALFWFFFFEMESHSITQA 
SMQWHDLRSLQPPPL 


3976 


17877 


A 


4000 


213 


1 


RGRQNKTLFLTKPNQTKPNQTKPNQTKP 
TT FL* S VI E I RVKKICXTCTTRKTTKNPTT 
SLTRSLGRYKSPTRP 


3977 


17878 


A 


4001 


412 


129 


GSIjSS PRLECSGTI PAHCSLNIiPGS S <5 P 

^tM^A^W^ ^ A-> ■ ■iiiV^fc^^J A •A- 47XlI4VWl4ilij47 VW O W *T 

CTSAPRYAGPE*MGQACMPPCIiPVNFFF 
WGGGGIIiVETRSC* VAQADLELLS FSDP 
PASASKVLEL 


3978 


17879 


A 


4002 


411 


293 


RDTRCLPP * LDNFVFLVEMGFHHVGQAG 
I1ELH1TSSEEIA 


3979 


17880 


A 


4003 


46 


392 


QDFFFFFFFIiGKGAPPPPGKKGGTPFPK 
KKTKGGGPRGPPLEPPPLGGPGPPEPGG 
GDLKKPWPPQGTPPL*KKPKMGGGGGGP 
P*TPGTGGGGEKNPLTPGGGPKNGPPPP 
PPG 


3980 


17881 


A 


4004 


360 


I 


KCIiGFQFFFRFWNICI I FTSQhS IASQV 
QRSEIQNAPMSIFFEHHVSFKQFQILEH 
FGFWIFILGMLSTYKTLILLKLN*RFNA 
MPIWIPWIFMEIDIOliILKPTWKYKEPR 
RVKAILRN 


3981 


17882 ■ 


A 


4005 


152 


408 


NKFVSHFYWMGNVLSRYPQFT*QFSFCH 
VRLPSVF*YLYTRI^ILIFFAHKAFFLD 
AKYVOTCVCVCTCVCVCVYFGIGEGNIiP 
S 


3982 


17883 


A 


4006 


411 


67 


YLPPPSSPTPGVFFFFPPPKKEFFPWPF 
ILGTPRFFPFPPF* KPPPKFFFLGPKKK 
K* FSPPRGKKIFFFKRAPPFFFFFFFFF 
FFFFIiLGGSLS FMYFHIKTYK 


3983 


17884 


A 


4007 


217 


2 


PLFFFFKHLKAYYSDVIHISFAILRYliL 
F*DRVSIiCHSGWSAVARSWIjPVAGQNTF 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutatnic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=ProKne, 
0=f»lu£amine R^Artrifiinp S^Serine. 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














ITRQEAP YLI*AI PRV 


3984 


17885 


A 


4008 


235 


406 


GRHRVSSNP IS I * RRRGLSLLPKLECSG 
VS IAHCSFDLPG* SDPSAS ASRLLGRLR 
Q 


3985 


17886 


A 


4009 


833 


237 


SLDNRARPCLVSFIFKKGRSTAERCGAP 
GTTS P PFPRLSERGHP 1 1 PS *RGWERKS 

SPGTFPGNGGSDLVPVPTAPIPIEPQIC 
QATCAVMKLS FDEEYRRAMNE LGE CPRS 
SGKPSSSPRTGGGSSAAKGTQAGSSELG 
LGIGARRGTDSLESHSTSHEWTVTLGPL 
SP 


3986 


17887 




Ant a 


Z/O 


o 

L 


VEVKGNNS FSEGS ITLI PNPDKOPKETT 
DQPLMSTYAKIESPGKNPSHDG*LIFDK 
GAKIIQ*VKNSIFNKWCWDNWDFHVKTM 
NPHFTLI 


3987 


17888 


A 


4011 


188 


27 


REKFQIYTIKNDKGSITTDTTER*KIIR 
DYYEHLYAHKLENLEEMHKLIGIPP 


3988 


17889 


A 


4012 


1 . 


202 


CLHQKVIISN*L*ATTQTTQLSLSFKLD 
YFSIIIIPVALWTWAIIKFSI**YINSP 


3989 


17890 


A 


4013 


1 


403 


CLDQEVIISN*H*ATTQTTQLSLSFKLD 
YFSIIFIPVALFVTWSIIEFSIi*YIHSP 
PKKKKKKKKKKKKKKRGGAPLKNSRGGP 
NFWGGGKNI FFFFGGGDKKPPRAFWKKP 
LPLGGGNLGPPPPOKFTPWGKK 


3990 


17891 


A 


4014 


67 


439 


TELI I FYIFFETTLIPTLAI ITR*GNQP 
ERLNAGTYFLFYTLVGSLPLLIALIYTH 
NTLGSLNILIiLTLTAQBLSNS * ANNLI * 
LAYTIAFWRKKTKKKKGGGLLKDPWGGQ 
ILAGREKIKFFP 


3991 


17892 


A 


4015 


3 


260 


LIVPTIILLPIjT*IiSKKHI I * IINTTTHS 
T.TIS I IPLLFFNOINNNLFSCSPTFSSD 

F 














3992 


17893 


A 


4016 


258 


3 

i— 


LKKIFSPPTINLFFPPFPLKKFFFPLSL 
* FFLGVFSPFFPPPKKGFFPKI PPGVFF 
SPPFKKKKFFFPPPFYFAPPRVFFKGPP 


3993 


17894 


A 


4017 


287 


2 


KKQKNFIFFFKNIiSFFGRGAFFPPLFPP 
ILRGKGGDFFYPRGLKPALGNQ*NPFSR 
F* IFFFFFFFLRRSLAPVTQARVQWHDP 
GSLQPSPSGFK 


3994 


17895 


A 


4018 


228 


487 


SYDVKQEKPNGI * ASKYWAYVFHSFFLS 
FFFFFFKKKNFFFCPQGGGQGPNLSLRE 
PSPPGVKKLFGLNLSKSWE*QNCPPPPI 
IF 


3995 


17896 


A 


4019 


35 


228 


EELNHIiNQGGEGCSEQRSHHCTPAWATE 
*DSSQKKNKFWYIHPCNRIVTTTNTPMT 
ESPKHYAE 


3996 


17897 


A 


4020 


2 


402 


ARGNIjNTTFFDPAGGGDP ILYQHLF * FF 
GHPEVYILILPGFGI ISHIGTYYCGKKE 
PFGYIGMG*AMISIGFLGFIV*AHHIFT 
VGIDVDTRAYFTSATII I AIPTGV*VFS 
*LATLHGSNMK*SAAVL*ALG 


3997 


17898 


A 


4021 


1 


420 


TRGATELIILDILVETTLI PTLAI ITR* 
GGQP*RIiNAGTYFLFYTLVGSLSLLMGL 
I YTHKTLG* VNI LLLTLTAQEL * NS * AN 
NLI * IAYTIAFIGKI PFYGLQL* LPKAH 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E>=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=*Proline, 

Into m l n RsA roSnina Q— Q ar t nA 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














v Ejnr X_rUjrO XUXirlttVJXJXJlvxOk7 X VjX XKxj X JLi 


3998 


17899 


A 


4022 


409 


3 


LEI^CLIICNNNNPFFYQIVTCDEKWI 
LYENW * LPAQWLH* EEASKHS PRPNLHE 
KKIMVPVW*SADGLIHYSFLNPCETITS 
E*YAQQINVMH*KLQGLQLTFINRKGPI 


3999 


17900 


A 


4023 


464 


374 


HLPPWEDKARRHLPDAR1TILAFLASRN 
VRNKFFSLQATQFLVLYASSTKITKPLK 
TNQKKRLKKSBQSLRNMWTPIKRTWICI 

V VJf JL JL H Kj rUlt Vjllt rJjMJLlt. X, V IS IS X V Jr y X Jc MXl 

VKDIK*KVQ 


4000 


17901 


A 


4024 


58 

i 


323 


RHKLLCEHVKKCS AVLI IREMQTKTTLK 
YHFLLVT IAKI FNL I HF * RS VGETL I CQ 

WAJaKil i\Ko w f r WKK1 x Xj X JKXi x VL. VUW 

YTHS 


4001 


17902 


A 


4025 


480 


150 


RNSLLRRIoRQEDGMSFGCQGCSEL*LHH 

KTDKSNGKKGPDFSRKNPPKNSPPPPWQ 
RSTAFRGRAAKSCPTH 


4002 


17903 


A 


4026 


204 


1 


KKKFGALFFFFMQGLTLLPRLEYR*WCD 
HGSL* P*PPGSSDPPTSVSQIAGTMGVH 

WT7RT.D GrSI/VT.Cir 
CULf JUHouV VXjOlX. 


4003 


17904 


A 


4027 


116 


472 


MPPLPKPVSGPRPHSCSPLGGPQLSTPL 
ruirKPAfa Lib Lit 1 X b &G X VJGC X xAl liWAVW 
YLVLQMLGIiSLFMA*RGAGSSCSAHKKW 
WLGLASPSC*TFFKNRTKGRARWLTPVI 


4004 


17905 


A 


4028 


355 


444 


VSYKKK* S RPGAVAHACNPS TLGG* GGR 
IT 


4005 


17906 


A 


4029 


410 


181 


ASIILLL^VFFNNILSGQ*TITNTTNQY 
SSLII IMAIAIKLGIAPFHF* VPEVTQG 
TPLTSGLIiLLT*QKIiAPCF 




T7Q/V7 




4U3U 


49 


An a 
474 


f AKAAr CHX Vx Y x SGKivEPFGY xgmv*g 
MISIGFLGFIV*AHHIFTVGIDVDTRAY 

J?TQ AT T T T A T PTYJVTfVPQ * T.1TT .UP CXTM 
r ion! 111/u.r XWV AVTd " JjAXXullLrfiWin 

K*SAAVL*AD3FIFLFTVGGLTGIVLXN 
SSMI VLHDTYYVXAHFHYVl^ IGAVFA 
II 


4007 


17908 


A 


4031 


274 


427 


FFF*EMESHSVTQAGLLGRNHSLLVPQT 
PGLKRSSLLSLLSNQYYRSGPHT 


4008 


17909 


A 


4032 


205 


393 


TWGGFKNPS PGEG * I KKMHD FNEMFLQN 
CWKILFFGGGFFTGQI FPRAAPPINKNI 
IiVWGGGK 


4009 


17910 


A 


4033 


2 


386 


SEPLSRKKLYLSIIiISLQISLIITFTAT 
ELI I F Y I F FETTL I PTLAT I TR * GNDPE 
RJ^AGTYFLFx^VGSLPLLlJailYTHN 
TLGSLNILLLTXTAQELKKKKKKKKKKK 
KNIGGPPLRAPSGAPN 


4010 


1701 1 


A 




-a 
J 




GT.FI1AMHYS PDASTAFS <? T AH T TPDVNY 
G* IIRYLHANGASI FFICLFLHIGRGL 


4011 


17912 


A 


4035 


3 


317 


HASADAWAFIGYVLP*GQISF*GATVIT 

IN uuDnX t X XVj ILIDVy x uvjl 0 v Uor XXJ 

TRFFTFHFILPFI UALAALHLLFLHET 
GSNNPLGITSQKKKKKGGPF 


4012 


17913 


A 


4036 


446 


194 


KKKNQPGGPPPPPAFPVGGLIKKRFPPT 
PGKKFKFR * KFFP PKGGFFVRDPPPKPG 
CWEREKKKKPIxLNTDKWVILEGSEAGLF 


4013 


17914 


A 


4037 


213 


1 


GMAKVKARFLPQKRGQRVLNPPWWGKGF 
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K=Lysine, L=Leucine, MHMethionine, 
N=Asparagine, P=Proline, 
V^oiutamine, K=Arginine, o — oerine, 
T=Threonine, V=Valine, 

U/=Tn/ntAnhfl n V=Xvrneirn» 
W — * ry j) u*m, i — jijfruMiic, 

X=Un known, *^=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














ir iNOv7 1 jSJIi jl c r err l^rtv jjij^.^r or^ortv^A. 
QS * LTTASTFWAQAI 


4014 


17915 


A 


4038 


307 


36 


PJCGFFFSKCXSEFKKLPGPPIjGGWYLKLI 
FGQKKFFHPKFPGFPG* IFLPPPKIFFI 
FSFEMEFCSCCPGWSAMGRPRLTATSAC 


4015 


17916 


A 


4039 


2 


372 


NDTSLTVKS IGHQWE * TYE YTD YRGLVF 
NCYILPPLFLEPGDLRLUDVDNRVILPI 

IPGRLNQTTFTGTRPEVYYGQCSEICGA 
HRRSMLIVLELI 


4016 


17917 


A 


4040 


107 


279 


IjLLLFFFFWRKSLAIoAPRWECRGKI *gh 
CKLRLP S P CHS P A* AS PEAGTTGWTYGC 

TP 

r 


4017 


17918 


A 


4041 


3 


351 


GSHYIAQAGLKNPGSSHLPASAAHSAGI 
TGVSHHAWLKIFFKESVSDYSPHLNSRK 
NAVYVSFFFFWKKSFVFVPQAGGKGLNL 
GSLKFSPLRIiKQFSCLTLPRS *EYGLAP 
irr'Jrv 


4018 


17919 


A 


4042 


1 


385 


RPRRPD IE * QERRTQEVLQAVAKKVKEE 
SQLPGTGGPKNVLQPVPRAKAERPKKQA 
EASGLKKETDVVLKVDAQEAiCTEPFTQG 

ViSfVIfYPTDP CPPTTa PHWI'^' CT*Q QPT ,VT 
rJ^^J\i JL ±)ra&E JSi^MJry v icoi* do&JLiv JL 

TCQAETLAGVKSQEMV 


4019 


17920 


A 


4043 


3 


282 


TAILIQTP *SFTGAVILIIAHGLTSSLL 
FCLANSNYERTHSRI I ILSQGLQTLLPL 
IAF* * LLAS LANLALP PTIl^LGELSVL 


4020 


17921 


A 


4044 


2 


407 


GTRLEIMSRNHGIFPKrLEIFKDNEFEE 
PYREALPTLKLRDSLYFGIEPEEHVSG* 
ESLEESCFATPTSKIDEVXKYYLIRDG* 
VSDDSVKQYTSRDHIiAKHFQVPDFKFEG 
KDHKBVI LH * RDL VCGVMDERSR 


4021 


17922 


A 


4045 


4 


380 


IVNWijI 1 IKUAWntSAJUJXNJUN 1 1 c fUr/ibb 
GDPII,YQHLF*FLGHPEGYILIIjPGFGI 
ISHIVTYYSGKKEPFGYIGMG*AMISIG 
FLGFIV*AHHIFTVGIDVDTRAYFTSAT 
I IIAIPTGVKVFS * IiATLHGSNMK*SAT 
t t t a t "PTnvrnrp*? 


4022 


17923 


A 


4046 


2 


94 


ADRNLNTTFFYPAGGGDPILYQHIiF*FF 
GHPEGYI L I LPGFG I ISHIATYYSGKKE 
PFGYIGMV*AMIS IGFLGLI V*AHHIFT 
VGIDVDTRAYFTSATINIAIPTGVKVFS 
*LATLHGSNMK*SIPTPILIFRPP 


4023 


17924 


A 


4047 


376 


138 


LLARQRQENGVNPGGGACGEPRSRHCTP 
AWATE*DSVFLLWREALLTQTAPFNRLQ 
LLALPSEFFHFRILIVLQPHHSQ 


4024 


17925 


A 


4048 


145 


226 


EPIQACSSRRVCEPIQACSSRRVCVCVT 
PSRHALPGCVCV*EPIQACSSRRVCVCV 

RAHPGMLFPACV*AHPGMIaFPACVCVCD 
PIQACSSRLCVCV*ERAHPRLLF*ACVC 
VCESPSRHALPGVCTCTOAHPGMLFPVC 
VCESPSRHALPGVCVSPSRHALPGVCVC 
V 


4025 


17926 


A 


4049 


2 


414 


ADRNLNTTF FDPA* GVDPILYQHLL* FF 
GHPEVYILILPGIjGI ishi vtyysgkke 
PFGYIGMV* AMIS IGFLRVIV* AHHIFT 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucieouae insertion 














VGIDVDTRAYFTSATI I IAIPTGVKVFS 
* LATLHGSNMK* S AAVL * TLGFI FL 






7\ 
r\ 




4/Z 


J.J.O 


ujituiuvatnLxr oiujcivnvayAmrijViAjfi 
SCSEPKLCPCPPAKKTKDYPVSKKKKKF 
IWI INRTKI PRKRKKNKAYHKVKYF 


4027 


17928 


j\ 
A 


a net 

4051 


238 


3 


ODUnQWADT.DDflT.DT.T'T QVTWT.PTTTT.P 

NILYLYFETGSRYVAQVGLKLIiCSSEPP 
PSTS*STEITGMSHGARPHNIP 


4028 


17929 


A 


4052 


225 


172 


IARYPFHIQSCTNMPSPNKS ITSKEEDN 
GPGAVAHACNPSTIjGGQGGQ ITKSGVRD 
HPG*HGLV 


4029 


17930 


A 


4053 


471 


30 


GPGUX JvX SGAJjr PXiNK VKKVPV VrPRPP 
LIFPQKGVFNQNGGDPGGGFWPGETPGR 
VPPPPGVFFVKSPFLSLPFPPPPV*KTA 
VKTGFPYRPCGVPPRGLVFPKKKKSVLN 
RPDLSMGPQPE*SLPLPGEPSVSHAGCL 


4030 


17931 


A 


4054 


425 


41 


NWGSGTLCPQKLGVFPPPPPGEGGVPGA 
PPPARLFFFFF* KKGFSPVGRGGFKFLP 
PKNPPPLPPQKVGIPEGGPRAGPIFSPP 
Ui-ir r r rJjl wbLil PorKljisUb^iAlLiAriL. 1 
PAWETEQDS ISKQTKK 


4031 


17932 


A 


4055 


312 


1 


IKNARAHLLPQGSPPPLIPIWIIETIS 
LLIQPIALTVRLAANITAGHLIMHLIGS 
AALAISTIILP*TRIMFTIIIERTILEI 
AVALIQAYVFTLWSLYLHA 


4032 


17933 


A 


4056 


648 


123 


DPDPVPSARGCVPSGAPGRGAQLKKAVTD 
(jALiA^ijrPLXjL^UaLiForbijCaltiCaPAVCi 
LSVNSERDGRLGEVSINAEFTVAAFLKG 
TQSRRGINGMPLPQREPSSRYCLGLGGE 
SSAQGTSGLTGQRV*QNSDLPELASGDD 
JWjy Ko o yuvjA v oyo o fKvjrvy J. f Jr ba f r*A 
ALFNFL 


4033 


17934 


A 


4057 


135 


464 


QHNSRLMQQAKKGVTVLARVIDLDYEDE 
ISLLLHNGGKEE *AWNTGEPLGCLLVLP 
CPVINVNGKLQRHNPGRTTNGPDTSGMK 
VWVTPAGTKPQPAEVLAEGKENKBCG 


4034 


17935 


A 


4058 


3 


575 


LRSRPIiHIILSCGLLVTPRSLSPPPPPQ 

LRI EGS KTL*NCC * LYHGC * I S * IHHGG 
IiFDVAAKNI IHEEVEKYDKQ YRGKELLG 
FATYKTFEIIVHQYIQ*LVEPALSMLQK 
AMET I QQAF INVANKHFGE F FNLNQTVQ 
STIEDIKVKHTAOAEYMIOLOVR 


4035 


17936 


A 


4059 


389 


3 


TFKGPKVFRYPTFFLCPPPRGFFSKNIW 
TPPPPPFFGVFPEREKIFFFLKGALPIiF 
SFRVTPFFGLKRGFFLSPTPPFFFPPFF 
WPPPFFFFFFFFFPLRRSFALVTQAGVQ 
WRDLG*LOPT,PPGFKR 


4036 


17937 


A 


4060 


397 


1 


KRRGNFFFKKGISLFI*TPPRGPLWVFF 
F*KAPKGFCAKFFPFFYPPLPKENGKTH 
RQKTPFPHFCQIFKNGFWVLHAFSLGDF 
PARVWGPPFKRFFKGI FKEFPP YGSPP P 
KGPKKKKRTAARDLELADAW 


4037 


17938 


A 


4061 


38 


410 


AWISI * ERPFTLVRVFKNFLAISGLLFS 
YFRINFLYVQIHIHTFFRKLFVLSYVFR 
FTDVKFFFYILPFFPPHFLFGWICSYAA 
L*FFIYFCFtiPFFFNGPTSP*IGLFLLI 
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FFFKKPTFGFGG 


4038 


17939 


A 


4062 


405 


81 


GGANVFPQKSPPFGNKKGGHFFSYKILN 
NPGHP P FLKKKKNAPGGWGNP WFS P P W K 
GAL*PKK3CNPPKKKKKSPPPKGGKKKNP 
FPPKKKKKKKKA*GDNFDMGTGN 












239 


RPSFFFFPPPKKSCFSTFFFFLSPGFFF 
PPPFFFTPPPFFFFFPPKKKKYFFPPPG 
KKFFFF*PPPPSLFFFFFFFFFVFFFFF 
FFFFFFFFFFFFF*GIiTSPHKQIGI»VLA 
FLLALSKITHAS I PVPVSS PS KKKKKKK 
KKKKKKKKKKK 


4040 


17941 


A 


4064 


95 


241 


SPCRSPHRWVNS*ANNLI*LAYTIAFIV 
KI PLYGLHL * LPKAHVEAP IAGS IVLAA 
VLLKLGGYGI IRLTLILNPLTKHIAYPF 
LVLSL 


4041 


17942 


A 


4065 


28 


288 


Y FD I FVEARS P YVFQVGLE LLGSSNP PA 
SASQSVGITGGSHCAQP*VTILFTTCTL 
HVNG*I*CIKFLIHNIPILVISLQYIFG 
FFL 


4042 


17943 


A 


4066 


338 


134 


PGGGCSEPRSGHYSHCTPAWQ * SETPSK 
KNKKKKITYFSVKNWGIRVFIEIIIiNSF 
TTVRP 


4043 


17944 


A 


4067 


26 


365 


WVSLMTSALAMRVDFYFITLLILGLLTS 
TLTIYQ* WRDVTRESTYQGHHTPPVQKG 
LRYGIILFITSNDCFFAGFF*AFYHSSL 
APTPQLGGHWPPTGITPIiNPLEVPLLNT 
S 


4044 


17945 


A 


4068 


491 


174 


TQLKTH* GSTTDNRTEVRVE PRVRTNYK 
DLLKFL* S KGYDFESETETET I AKLVKY 
MYDNRESQDTS FTTLVERVIQQLVLSHI 
FKDNYANINAEKLSGN 


WfJ 




A 


4069 




402 


NVSKGLRKVP STQ * VIN I S CRPG WARA 
CGPSTLGG*GG* ITRSGVRDQPGQHGET 
PSLGPGAV 


4046 


17947 


A 


4070 


202 


I 


DS AIALQ PRQ * EQNSVS KKKI YRTIDLR 
SEYFARWSFTLAARAGVQWRNLGSLQPP 
PPGFKRFSHAS 


4047 


17948 


A 


4071 


412 


143 


FLGAGV* PRLGPKG*TPFLLKNQKLIGH 
GGGALYS QYFGGWGRRNS FNPGGKGFTN 
QNFPPSIiQTWGKKGVSFSKKKKKERKTR 
KWGGE 


4048 


17949 


A 


4072 


2 


324 


RRGNFO!F*SDRVSSCCPSWS*SPGLKR 
SSCLSLPKCWDYRFEPLYLARLVLMRCY 
STHNTYIMYCQMQRLFQGHRYLSVSTSS 
ATRP YLDNFFFFCKNKVMP CCQG 


4049 


17950 


A 


4073 


2 


326 


RRGRMLPRT PGRPGNDAQQ PQKHSLVD * 
LFIYLFYFLWTGSHSVAQAGGQWRNHGS 
LQP* PPSLKGSSRQSAGRVKGVSHCAWL 
LLDFLSFGKPYTYKKVDKVVSFEA 


4050 


17951 


A 


4074 


422 


116 


EIKQEKNPGPFFFFPPPAKRGFFPTPLI 
WGPPGFPPSPFLKRRQGGFFLGPL*KGN 
PPVFPPPKVFFFWAGPPFFFSFKKKFFF 
LVPQSTFIHFIFHICYLL 


4051 


17952 


A 


4075 


3 


294 


VFCHVGQAGIiELLDS SNLPSVASQSAGI 
TDTSHLT*PDLSFCCKTKFD 


4052 


17953 


A 


4076 


60 


389 


psaqlgstkhihmavqp*pppspellhl 
pklmlcphsahtprpppnpgtvctyspr 
sspafpqegpssgeglgglvslfssppn 
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E=G!utamic Acid, F=Phenylatanine, 
G=GIycine, H=Histidine, I^lsoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
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nucleotide deletion, V=possible 
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HVHPAPTSPESGRGRGGRVPREGAYV 


4053 


17954 


A 


4077 


93 


385 


KPLFAPGVLFGGPGEPKREKKGGAQGGP 
KKFSPPPPPFS FPPPKTLWVAPLFQETQ 
GIiRLCPPFFF*RGKGALF * KRGGGRFKP 

P P P P PflT .T /ZV PTP 


4054 


17955 


A 


4078 


2 


365 


RKVEGRVS*DEDLKLTELLRYYMLNI EA 
AKDLLYRRTKALIDYDNSNKALDKARFK 
SKDLKSDCAHPRDC*RA*APYFIjFAKNE 

T . WPTTlPnP HT?T /"mtTDT. TVfiD *PT /TT.O T O 
Uw v L\jCi\it\rtc UJUvc JLi X VuKXXj X AilCX Jtr 

DHTSLPLSL 


4055 


17956 


A 


4079 


6 


298 


PLAIMGGFFTLAETNRTPFDIiAEGESEL 
VSGFNIEYAGGPFALFFIAEYTNIII IN 

XXjX X X lr XAj 1 X XUJWfKc w Xjor oiji: X I tt 

PPSPYPTPWSTST ! 


4056 


17957 


A 


4080 


414 


154 


PIPATREGKAGNSLNPEGEGCNKRRSRH 
CTPAWQEGKTPSQKNKKVLLIAIiKV* S I 
FHLMES*KYLKPGFSLDLITVIPTYFI)L 
FRC 


4057 


17958 


A 


4081 


347 


1 


VLKPRPGNI I FS PKKKKNLPPP PGRNYF 
FFFPPPPFLFFFFFFFFFFFFFFFFFFF 
FFFFFFFF*RGGLLLFRIiKIQLPPTPHP 
YLSNTLRGWNPGEGSCWRSEGGRGEDCL 


4058 


17959 


A 


4082 


1 


354 


STFI ISLFPTTI FMCLDQEVI ISN*H*A 
TTQTTQLSLS FKLD YFS I I KKKKKKKKK 
KKKKKKKKRGGGGLKKKKFNGGGGGKKF 

KGGGEK 




1 /5/OU 


A 


Woo 




OJ 


lI r I?T7DVT?l7t?f2r , DD WCl?et3Dni?T TOOm* /"Mr 
P BE xr SsJiiC c\a\^,tfts.ur l?oJrIrv»JrXiX lr irfXiyv 

LEFPIKKKLSPPPPPLKNFFFYTPPPFF 
















4060 


17961 


A 


4084 


1 


122 


FHRFGRDGLDLLTS *SARLGLPKCWDYR 
REPPRPAFLPQP 


4061 


17962 


A 


4085 


223 


401 


NGKLVSWroFCFCFFEM*SRCVAQARV 
YRC 


4062 


17963 


A 


4086 


160 


408 


ALPPRRGGGGAPSFNFCLIiPPGP*GKKG 

GHGGRGYQVRRRVGPPGPGGRRQKIiNEG 
APPPPRLGGKAAP 


4063 


17964 


A 


4087 


261 


81 


YOIDGRI FCQKLESSHYSPGAVAHACNP 
S PIjGG * GGO I TP SfiVO A TH"P QMP PRVWrt 


4064 


17965 


A 


4088 


428 


140 


QPLKNPPLGGRAPPFLRGKKLPPPPPQG 
E PP P F * KKKKKKRGGGGP PFK P P P PRRR 
PKKSL*PQKGGGKKKKKSPPPPPQKKKK 
NPPPKKKKKKKK 


4065 


17966 


A 


4089 


1 


205 


FRLRQENCU^PGGGGCSEIiRSHYCTPAW 
HMPNTS I QQLP I 


4066 


17967 


A 


4090 


154 


410 


ITGCSI IKYFYRAMREREKSEVGYFPSS 
SIOTPLRHRLSDGVLDSRITSLSTHSPA 
PFIKSKNRPGTVAHACNPSTIiGG*GGRL 
TK 


4067 


17968 


A 


4091 


428 


67 


LPHPPPPPPPLSSSSSSSSSSSSSSFPP 
PPPLLPGQGRTLTRPIjPAPAGQLGSRES 
CGAPPSSLGASLHVA*NADGRQ*RAALS 
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X FrioFXi Xvr Q xr.fxrvy» ivoivtvCi y u LiJjJjula- o 


4068 


17969 


A 


4092 


431 


3 


APTSNKGSTIPILQIRKLRATPCSSIGP 
o dctt T»\nrr ct *cvT T DnnWT UWQTTrtTT? 

OJro XXixr VJ\jjJS1j"0 V LiLttzv3\i W Aid r of \—rLV 

FPRPKCSNRPRYTLPVPKLPSSGLPPRP 
SPSQQVTAFPPLTSRKEPSSPSTPASDA 
FPFPPAPVSARRQEVLCES IPSVTMHB 


4069 


17970 


A 


4093 


151 


1 


£s XMPMW Ml) A v An/Vv-N h*U X LAjKs w vnorW A JL J\. 

SGVRDPPDQHGETSSPPKTQKL 


4070 


17971 


A 


4094 


35 


363 


LQSKI YCTFFFFLERGFFPS PRVGRPGA 
HPGLGAKTTGVKGNP P P* PPKGGDPGGV 
P PPPGCFFWFFLGKGGFFLLPGGG* NFG 
KKKKPLVWPPKGGBijKJaCJl rGrrri 


4071 


17972 


A 


4095 


376 


1 


LGKSPPFIiGAPFFLKKGSLLGPIiLGKGF 
TPP * IiGGPHPLEKGPLIiGQWNPNQKVWK 
KRGFPQKGGENPPFFFFFFFFRDGASLC 
CPGWPQTPGLKQSSNLSr.PSSWDYRCLS 
KSLFKKNTETLAS 


4072 


17973 


A 


4096 


65 


446 


PFFLFFLNKGPLFTPKRQGKPRNFPT*T 
PCPRGKKNPPPPPP*GGEKRGGPPPPGK 
FFFFFGERGFPGGGVLFLDLGTPPLGPL 
KGGEKRGRPPGPGPKKPFL* KRKKNSPP 
TGGERGRGGKKHCLP 


4073 


17974 


A 


4097 


34 


387 


IKJfiAUEl* VJjTKIUACVt'VUUMiSt'ijtjtjPHK 

LHTRIGSSPSTTCCTPACPCSSW*HGAS 
VIPSQTAYEDWFITLYNVLyrSI.PVLIM 
GLLDQVGASHKQGHFWTDENALEKSQAN 
VSMHSSGVYFFLIKPYYNSVVRHQGSWL 
CFLFCNKERGF 


4074 


17975 


A 


4098 


80 


422 


ILHVLYTCI ILCFVLQS* ILWFVLQNYI 
LHVLYTCIII^l^tVLYTCIIYMCYIHV+ 
FCILALFAGLS*DSP YHESL* * SFFMEA 
D I SLCRCv ITDQIiN F I Co.LilA2.ti* oJURMr 
QF 


4075 


17976 


A 


4099 


74 


402 


I YIiLSTHLHiraFS WIS ITFDIS FFFFFF 
LKKESLFFPQRGGGGGNU3LLKPPPPGE 
SPFPPPI FRGGGKKGPPPPPWVT FFFF* 


4076 


17977 


A 


4100 


3 


335 


DAWASAWVTEYDSVSKKKKKKRGGGGKK 
KQFPKKIGPRGSLFKPRGGEKPLSLKTP 
PPPFF* F*KPPFKKGGAEPWBPFLKPPF 
CLKDRGGAPKIYFFPNPPGAPRGAAFIK 


4077 


17978 


A 


4101 


406 


1 


RFPKPFPPKPKMSKWALFK*KGFFPPRE 
LLIGPPPPPPKKKKFFPTLFFAKVLPPR 

QKKPTRGFPALFKKKKRSSLVIREIKRK 
PTMRYHLTP I RMAVFHKS KNNRC 


4078 


17979 


A 


4102 


2 


183 


VNPGGGACSSPR* CLCTPAWVTERDSVS 
SFAF 


4079 


17980 


A 


4103 


119 


391 


DNLQCFFFFFWKRGLLFLPRGGGFKSTG 
PFSFGVPGTPPPPPPGEGGLTPKPPPPG 
* ILFFLEKGDFPLLARW*NRQKKKNPP 
PSPPKGG 


4080. 


17981 


A 


4104 


413 


64 


GFKPPPPKDPPPPPPQKGGFMGWRPPPR 
PFAPIiFSPPPPRGGGGIFP*GGGGGAKK 
FHIWGPFG*KKPPPGGG *NSGGQ KNPPG 
GGGKNPPQKKKKKKQPTTANIWFLCI.EI 
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nucieoiiue aetecion, v— possiDie 
nucleotide insertion 














LSGM 


4081 


17982 


A 


4105 


2 


196 


RGRVGKVTilSLFNTDLLWLLCILICFHK 
TO*T /"TOFT"* + fTJOniwrvf^ jr^x r^T7*^ n m Trr*T/~rir 

CVCERERD 




1 *1Q21 

I fyoj 


j\ 


A 1 A£ 

4JU0 


1 


335 


rJjVB\L * rHPVGIiAGIjELIiTIiGDPPR*ST 
LGDP KCWDYRCE P LD PAS VS FFLCLKLF 
IFVLLKLFFFFKPPLYSKLTRGRDNNYN 
YFPPPCTPGPLPHWRADKS I IGGNRRLL 




WQQA 


±\ 


Aim 


l 


14o 


KKKEPPKFFLAGFSGGGKKTP 




i ^QC^ 

1 f70J 




A 1 AD 

4108 


331 


41 


(jJjNFLiAQKr SGRFS P PPPGRVPLGPKPF 
FFLGKPPILEGFPGP*KVFFSPILPPKB 
VLTFFS PPGEKIGPPFSGGACSGFPPKG 
AKKKKKKRPWSN 




i /yoo 


TV 


4I(Jy 


137 


3 


l KKKKW I NKG PGAVAHTCNP S TLajG * S G 
RITRSGDRDHPGQHGE 


4086 


17987 


A 


4110 


3 


269 


GFHRVSQDGIi YLLTS *SACLRLPKCWDY 
RHEPPHPGFLGFFCFFFFFRDSVQPGQQ 
5KNPSHKKKKDEGPF WKGAGKI FLFKPN 
SSAL 


4087 


17988 


A 


4111 


196 


330 


' GAIARGRSMCDC I *GS VGHVL *NDPTLN 
CIIiDQDTJj YGS I M I NEYYBRASDAWALL 
PDEEQLPEGDQTVIA* * A*DMSGGQQHR 
VSLARAVYSGADVYLIiDDP 


4088 


17989 


A 


4112 


426 


241 


LLKRVRHEILFTIRGKGFSEPRSPPCTP 
AWATBRISVSHKEKKRKVLVRP * VDLSN 

opt cyiTT 
SSIjSVXj 


4089 


17990 


A 


4113 


394 


1 


GPGVFSPSGCLYHARAQYFWPHKKKKYF 
PHPGQKNSVFLKGRPIiPGFGFLLIFFFF 
LVEMGSCCIAQAGLELPTSNNPLTSASQ 
kae i kdvs hr5q * ffvflfbtgs cs vtq 
A*VQWCNHSSPRTPGIjKQF 


4090 


17991 


A 


4114 


328 


2 


KKRAHPIWPDTGIEVPNSPSPNIRFLGA 
PE£>Fr FKKNJjPQGDFIjEKPNPCYFFFQG 
VTTPPFPLFPPPPKIFFFFF*DGVSLCR 
PGWSAVARSQLTATTHLPGSSDSP 


4091 


17992 


A 


4115 


220 


3 


FKKGEPGDPPPVWFGLGWVWVWVGFFFF 
ERHS FTLVTQVGVQWCNFSSLQPPPRGL 

VCUCUT Or DCC4UVDIT 


4092 


17993 


A 


4116 


342 


2 


PINFLYLSFNPNQV*RSPNFSKILHFCF 
KILSFFYTLAFLPFSSQLIRQSFFRVYL 
INTICFLQDLYYLYILNLYLILFLFFHL 

ASRLLDFCLFFVFETVLIiCHPGWSTVAQ 

e 
o 


4093 


17994 


A 


4117 


124 


352 


TSGFT*TRMKSKIPT*RSFI IDLLEAKM 
KEKFLKAARETQLITYRGTS IQMPVDFS 
WRKTBAKAQWNLMFEVLKEKNCETRSLH 
PATMSFRNEGKTKMFSDERKLRDSVTLP 

JjJUs 


4094 


17995 


A 


4118 


11 


387 


KTGKLLATS APGDG IMVMVETFCLWS F 
FLSVSKKKKKNPIiPQIKNPLPKPCWVFK 
RAPTPNH*KAFPGPPKTPP*KQPKKQIP 
CRPPKKGPFCI *PNPGINPSPLFFEfCKN 
KGKAPPFKNSKQK 


4095 


17996 


A 


4119 


2 


407 


NTFQDQSGSSSNREPLLRCRDARRDLEL 
AIGGVLRAEQQIKDNLRE VKAQIHSCI S 
RHLECLRSREVWLYEQVDLIYQLKEETL 
QQQAQQLYSLLGQFN* LTHQLECTQNKD 
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OCysteine, D=Aspartic Acid, 
E>=G!utaraic Acid, ^Phenylalanine, 
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nucleotide insertion 














LANQVS VCLERLGS LTLKPEDST 


4096 


17997 


A 


4120 


157 


2 


LTGSHSVT*AGVQWCKHGSLLPRPLGSK 
RSSSLNFLYSWDHROGPPHSANF 


4097 


17998 


A 


4121 


3 


168 


IALKSGRKVDIRELGHWCPGWS*TPEIi 
NQSTCLGLPRCWDYRREPPWI1AI1NMF 


4098 


17999 


A 


4122 


2 


376 


ETGFHCVSQDGLCLLTS *STRLGLPKCW 
DYKMREPPRPADGFINIRDFNPLLRTST 
CCCHVKKDIFASPFTMIVSFLRPPOPCR 
TDPIQNTLPLVIMSPVALLGCDSFSAFP 
VLMNLPFSSDCTK 


4099 


18000 


A 


4123 


11 


221 


GTQLLLRIjRGEDIiLNLGRGGCTEPRLHH 
CAP FWETEPDPV* EKKQAAKAKNSLLTA 
ALLKTQVKKDSVAW 


4100 


18001 


A 


4124 


2 


125 


AI I KKTKSNRWW* GCR* RGMLVHCFWKC 
KLVQPL* KAAWRL 


4101 


18002 


A 


419^ 


*rZ,J 


159 


TPVI LALWEAKAGRSLE PRS * RSAWATW 
QNPISTKKYKK*GGRITSAWGG*GCNGL 
*WCRSTPASATETPSQRKKKK*EKNLFT 
LILKVTK 


4102 


18003 


A 


4126 


3 


99 


CQAGLELLTL * SDS LRL PKCWDYRCE P P 
RPA 


4103 


18004 


A 


4127 


396 


58 


ENCNPLSWSGGI ISGLLLVSGYSNKFPS 
FLVCLL P VIGP PENTAG PS L VPGTKWHD 
HSTL*TSQTPGPSLVPGTKWHDHSAL*T 
SQTPGPKLSSRLSLPSSWDYRHKTPYPA 
H 


4104 


18005 


A 


41951 


si 


S4Q 


LGOS YLLIiLRKCFS PNFOVGDT.DT <3 YTM 

iegitattspesrgctlwposskhtt.pt 

ETSPSVYPLSENVEGTAPP*AHQSFMSP 
PSWGGS PLNLFFGGGGFEKEQS PLKKKS 
FTLYPLGPPSEGEGHF*PSPLCFPPGFQ 
SPPKTGI PSGDELGFF*TT PGPDKKK 


4105 


18006 


A 


4129 


97 


362 


RK* SACLSLPKRWDYRHEPPRPACS C 


4106 


18007 


A 


4130 


376 


279 


DQDDLDLLTS * STHLGISKCWDYRHEPP 
HRAS 


4107 


18008 


A 


4131 


133 


344 


IIFFFF*NFVLQAGGQGGNLGSIiKPPPP 
GLKQFLGITFGRSWDHGPTS FTRANFCI 
FRKKRVLLCCPGWS 


4108 


18009 


A 


4132 


164 


3 


MNEMSYESSLLDHLLKRQ*NQCCLGSSE 
PSPPRLKQFSHLSLPSSWNYRKTSK 


4109 


1801O 


A 


4133 


355 


2 


GKKQNRKTGNFKTHS AS PP P P PKERS S S 
RATEQSWMENDFDEMREEGFRRSNYPBL 
REDIQTKGKEVANFEKNLEECITRIPNT 
EKCLKELMELKTKARELREECRSIiRS* C 
DQLEE 


4110 


18011 


A 


4134 


193 


357 


HHCNPS LCQNDLFWHLVL S P PTG VQ WYN 
HGSLQP*PPGLE*SSCLSLPGS*EYRH 


4111 


18012 


A 


4135 


357 


104 


LRRL* YENHLNPEGGGCSEPNLIiRCTPA 
WATEQDS I SKI 1 1 1 1 *CNFLH* QNMQK3 
RQNPKTNSSKILTE IKISLKGNLRELRE 


4112 


18013 


A 


4136 


179 


1K1 

JO 1 


DTSLHIIONRLSISKKKL*TLYFIY*DR 
VPLCCPGWSAWQS * FTAALTSLKRSSC 
LSLLSNWDCRC 


4113 


18014 


A 


4137 


89 


318 


CLEISCTK1GQDLYTSNYKTLLGKVCKC 
PNK*NTTFMDQAEVSILKISFLPRLIYR 
LRIIPVRIPAGIFVEILKFG 


4114 


18015 


A 


4138 


315 


60 


KKLPAE IHAPLAEEATD*ABLKPAGETH~ 
AEVQAPTEQTPAAEAATT I AEAS VKVQP 
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PPAEEASLAKFPAE IQPPADQERGSED I 


4115 


18016 


A 


4139 


377 


51 


GFTSPFPED*RIPGLFSCIGWTLQSFHF 

(UiJLvw v s C wr *wr EVE cyolu> V J_ii_A_V_.tr vj 

WSQTLGLKPSSCLSLSNSWVYRCAPLVP 
GMPIF*LSCFLAIGFLEFVIYIGY 


4116 


18017 


A 


4140 


413 


175 


NFGIFCRDGVLLCCPGWS*TPRriKPSSC 
LPKCWDYRREPRTVPGLPFLFLNLRKDG 

XTq\7T,Tr T YT .TTVTT T TO T TVOD T .T ATT . 


4117 


18018 


A 


4141 


1 


394 


EIFVTKRWSGLGWAHASNPSTLGG*GG 
QITRSGV*EQPDQHGSPERRGLKERRGL 
MVMKATFLGLRHHLKHIVGHRELRDLHN 
NSKKKISSHPLSTPHVPGTVLKPCRLIL 

f^FKTTHOKPVPK'PPMCIPQVP 


4118 


18019 


A 


4142 


22 


376 


LVNKGKTIFFFLKRGWLPPKGKGRGIN 

HNPGEI* *FFGKKEA*QCGPGGSGIPGP 
RRPSGIiTT^OKGGTYGREPSPtPCJPTPrRN 

xvxvxr o vj xi x XJ\^fv\_rv7 x X VjivCi £*o xr Xi trVjtr X r^nvi 

PRLTKT 


4119 


18020 


A 


4143 


161 


1 


PIPAKYEHFCFFVSLF*DRVSLCHPG*S 
AW*SQLTTASTFQGSSDPLTSATP 


4120 


J OUZ 1 






lo 




HLKK* K* VELKSPSVTHT 


4121 


18022 


A 


4145 


383 


109 


LFFYKIQNYTmGGPCLLTPFLQRVKQK 

VFQKRLKKPKPKPKPKPNWRGLFTjPPFS 
KRPKPKI 


4122 


18023 


A 


4146 


374 


164 


QCQLLRSLRWEDCLS WGG*GCSEP* SCP 
CIPAWVTARSCLQINKSLPAEIVIPSLY 


4123 


18024 


A 


4147 


25 


375 


RKKKALFFCQGGSQPPPSNIjMDPPPPGE 
KKISWLNLFPKKEKKKF*KKK3CFF*KTO 

KGKGFLFAPPPKKPIKKSPHFKIPKKQK 
NFWQR 


4124 


18025 


A 


4148 


376 


202 


HYNS KVFPGGPKRS FL FFLKALS F FFAP 
APIPFWIxHSKIFFFFF*DRVSLCHPGWK 
LR 


4125 


18026 


A 


4149 


3 


144 


LFCPDWTPTPGLKQSSHLGIiPQCWDYRH 
♦VTVPGIjPISFVFFFFPP 


4126 


18027 


A 


4150 


3 


188 


QMSQLLSSI^PBDHMNLGVEGCGKIjSL 
HHCTLA*MTE*DS IS * KKII.KinCPF*KTf 
HSFLY 


4127 


1 OUAO 


A 


4151 




J JO 


GWARWLT PGI PALWEAKAI KS LEDLN A * 
A*QTYAPS*EESLHPIMNTLIRDPWVT 
SFRVTViyrSTIAPJIWLFCWDI.rjQAS IGH 
ILILGPG 


4128 


18029 


A 


4152 


162 


1 


TVWYWHKNKLSNQ WNKI + FLB LDP I TYK 
HLVYDKTG I SNHWVKBALVNTOCPAG 


4129 


18030 


A 


4153 


369 


20 


FQLLARQRQENGVNPGRGAFG * PKSRPC 
P PAWGTK * DS VS KKKJ? KKKS CE FVHT? TT. 
KLEEAYHSLGSNTAQRLFVSCFHKKILC 
LS IS I PDPYLQFKMWPVPPSKPL YQIDW 
VGLL 


4130 


18031 


A 


4154 


117 


2 


QIFFFFFFFKMESHS IAQAGVQWRHLGS 
L*PLPPRFKR 


4131 


18032 


A 


4155 


369 


217 


FFSPPPFFKPPPPPKKIFPP*KKKNPPP 
PKKKKI FFFFFFFFFFFFFFQTP 


4132 


18033 


A 


4156 


214 


283 


ASPSEGQGLSSGP*GLIN*SDKPSVTHS 
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PKPHCETA*GGTSFLSSS6SSPQAPKSP- 
HHGPDSPLSDSIPMVISPPEPPPIiRAKG 
CVOG P PGAS WWGTT 


4133 


18034 


A 


4157 


319 


712 


QDFVS KQHSYCLKNLKSSSLIQVAMNRA 
QVCLISSSKSGERHLYLIKVSRDKISDS 
NDOBS ANCDAKGNNFIC IMLYKI WLN* K 
RLMK*HWTAI* LKI YFSNKMKFSNLNLS 
SVINS FDVKKERLNANTLN 


4134 


18035 


A 


4158 


183 


369 


IiKRKQSQRGEVNFLKVTQPECXJKAGILF 
IYYFFEMESHSVGQTGVQWCNLG*LQPI* 
PPESSD 


4135 


18036 


A 


4159 


190 


336 


DPISIKNFWPGTVYFLIETGFLHVG*AG 
TiRTjPTSGDTjSTIiASOSAGITG 


4136 


18037 


A . 


4160 


10 


390 


QGILLPCFSMSSEERGRIYSNTFSFFFF 
FWKGSPINPQGGSQAKKPR*REPSPSGL 
TPQRPGNGGPPPPPGQNFFFKKKGG*PG 
GPGRA*TPGPRGTP PLGPPKAGNKRGGP 
RGRAKNFLKNQNGPT 


4137 


18038 


A 


4161 


53 


376 


YliFAFFFFSKREAWQGGQPGMEGAKKKF 
MAP* PSKGGGGEFSGLTPPNQWNPRAQP 
PPRGELILILEKKRPAPNAQPGPENLGP 
REPS P IiAPOKGGNKSENPWP P P P 


4138 


18039 


A 


4162 


1 


356 


GVFILVSFNFWAFFLSFFFLGKQILVLM 
PQPESQGQNLAPQNPPPPG* RGFPALTP 
PRTGNKGMAPQAPQFFGFLKKGGFSHGG 
OGGFKTPTLGNIiP PIAPOGLGNNGLGPW 
PPPFFF 


4139 


18040 


A 


4163 


345 


26 


APGFKPAIiGPPGGPFFSKKKSKNLPGLR 
GSTLQAVGPTFIiGG *GKRSLKP PGG*AS 
RGPGWGPSPPPGGPN* IPVSEKKEREGIC 
KKKGREKKKETFSSNLLIFLSF 


4140 


18041 


A 


4164 


18 


284 


TLQGTSGI FEGNFFFFLGTGISLYCPGW 
RAWGGDHGS LQL* PPGSSSPASTPLGRS 
WDYRHVPPGWAS FFFFKTLPNGVTQAGL 
ELWG 


4141 


18042 


A 


4165 


377 


30 


FSGKKKFFQLGWGCGPPPLIPPLWGV* P 
GQPPRVGGFNPPLPPWKNPVFFKKPKQP 
GVGGASPYSPFLGKFRPRIPFTLGPEGS 
G* PNFPPF PPPWAPKKNFSPKKKKKFYN 
CLW 


4142 


18043 


A 


4166 


305 


1 


E SRQVLTS RLTLLS IHEFNYLLNNSTYK 
HITS LP I N I RQ YGQDLKTLKKETKDLNK 
*GNIPCLGIGRINIVKMSLLSKLIYKFK 
AMPVKIPGELFLRNQQA 


4143 


.18044 


A 


4167 


394 


78 


EGKKQGRKKGRREGS * EGRREREKTR* K 
AEKEGMKEGGREGKKEGRKEGRRKGRKE 
GRKEEGQKGGRTDGGWWVLRSLOTSQLI 
SFASGPKWREASSLVWASIiCL 


A 1 A A 


1 Qf\A 

1oU4j 




4101$ 


l 


All 
4j1 


CNTCVR* CAYV3CV* MCGERLCW CVCS CN 
RVCCV+VERVCVLVCMSVCWCVCCCARQ 
CVSVCHSRIESSRPSSGPVAPSKCQTHL 
YTjPKLSAP PAAGLWDLPSEATASIiSGSI. 
CQPSCPRTRSFATLVSSMYHRPVTGEtiQ 
SIT 


4145 


18046 


A 


4169 


3 


423 


YNAREIEAAAGRDHATALQPGYRVRPCL 
KKKVLKLGQAWWLA+ P*NFFFETGSHSV 
PRLECSGVISACCSLNIiPGIV 


4146 


18047 


A 


4170 


118 


402 


Q I CTCTP KP KMKVKXI TMLSNNMRYLYI 
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UUtlVVllulf IIU^I III/ 11 














HLTQDI * DLDIDL*NTADMNLKDLNK*R 
EIHCLRI * RLNI I KTS I FLKLPYRVHAI 
PFKI SARSFVB 


4147 


18048 


A 


4171 


404. 




RGCGEPW *HHCTPAWVTE* DliVSKMKMlf 
QNDNLSHLSLA 


4148 


18049 


A 


4172 


274 


431 


ESLVGQRNWVAKYF YR I TAPGRAQRLMP 
AIPAPWEAEVGGSQGLEIETLL*NNSSR 
PGAEAHACNPSTLGGRGGRITRSRD*DP 
PGQH 


4149 


18050 


A 


4173 


24 


366 


FLCLFHTCVLAFPVQQLCSVGWGCKVEN 
ADREEKQPCDSQARSEPRLCRRGFNRAP 
NINTRHE KLFPVCLF*TGSHS VTQAGVQ 
GCGHSSPQPQLPGPKRSSHLSIiRSSWDT 
GM 


4150 


18051 


A 


4174 


397 


1 


TEPVLANETWGPWDRGPLGLGGLGPENT 
DNSHAVGADQLLKEPPFPLSTRSIMBPM 
SLNAWLDS PHREMQAGTPLSLCGDTYET 
OVM * S WG I RG I I.POOPWKMGAJ3T .^T ,Tl Q 
LSLSFAHTIiKHKHTHTHTLY 


4151 


180*52 


A 


*rl / J 


on 


1R7 
JO / 


NPPPPPPKDRGKKANAPPPGGIWYLKKN 
GFFP I GOGGP EPP P PRGPP P PAP PKGGE 
KRGGPPHRPGKKLLK 


4152 


18053 


A 


4176 


398 


280 


RRLKWEDPLNPRV*GCSKP*SHRCTPAW 
VTE*DPVSNFF 


4153 


18054 




4177 


4. J J 




IjFNFF FI.VKMGS RYVAOAGT.K P T . A*RN"P 
PASAS PRAGITGGSHHTQP 


4154 


18055 


A 


4178 


366 


48 


PWASPG I SLS FSFLRTKPTVKVR* YRVG 
PQRQPCPSRWAPPACPFSMFPPAWVSPP 
GSAASRPPRHAGSQVGATAS PPPAQGLG 


4155 


18056 


A 


4179 


44 


413 


GDGVNSAFFFFFFIjEKKVWF I P PGGGPK 
PEFGFRAP PPPGVK3CTFGFTPT.PTRR* P 

PLPFPPGKGWFFKKNGGS PMWPGGV* PS 
DPKGAPPPGPPKGGE*RGBPPWVGWNFA 
LLKERKPLFKE 


4156 


18057 


A 








HEVS LCCPDWS * T PGLKYS S CliGIiPNCW 
DHRCRNHGPGKTQLiEQGHRQVAGIiSR 


4157 


18058 


A 


4181 


85 


408 


YTCSIjR I TODHFFFFFFFLGKXI F FFP P 
GWGAGGQILTSPKTPPGVKGIPWLNPPR 
GGGPGGPPPPPTLFLGF*KKRGFPGGGQ 
KPRPKGDPPPLPPKGGGLKGGPTP 


4158 


18059 


A 


4182 


1 


149 


NHLNIi*GRGCSEPRIiHHCTPAWVTE*DS 
VLKKKSYIHLGQNPTVRKVLL 


4159 


18060 


A 


4183 


310 


3 


ATAPSQFIRnj*RNHCHYIEVQSEAASA 
NLETAVNNPENLGKIMDEGG*MKKQIWK 
EMTSRIFIVEKSMPGFKTSKAQALILGA 
NIAVNFRLKPMLIYHLEN 


4160 


18061 


A 


4184 


1 


359 


PTRPLVLDRDRPPFFFFLGSGKKKNPGV 
FLYFGGGWTPPKKKKNSLGWGGGGIiKKG 
GGGVFFKPIiPFIjGGGPFIjKN^GGVDKK'K 
RAPPFFTKTPL* KKPPPPFPPKI FKARG 
FW*KMGEPFFFCPPPPSFLKKGPPPKKB 
GV*KKPPPPPFLTPPPPNQGNFFFFWGG 
STPPQNI KKPLGFFSSQTPKKKKRAADR 
DLELADAW 


4161 


18062 


A 


4185 


1 


362 


HFTKYLFISC*K* *LLCWMQ0CCKKGIP 
MNTNVIQEKATSLYGSLKHKEGEGPKAG 
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EFNAS KGWLDNFRKSFGFKKVKITGEAA 
SANQKAADKFPDAIKKI IEEKGYLPEQI 
FNAGESAL 


4162 


18063 


A 


4186 


366 


47 


AALFLVDPPGRGGVKGPPPPPGPFFVFF 
QRRGVPPLSPGGFFFRPPNSPPPGPPKG 
GGSGGGPRGPA* THGVFLKKNPPNCGAG 
KKKGFFIiKKKKKKPVFWFSVK 


4163 


18064 


A 


4187 


394 


112 


KNTKNLKISWAWWHVSMVSAPNEAETGG 
STEGTGCSKL* SCHCTAWVTVRETLS PR 
KFFKKRANYLTFRI *KVCSLYHRKTKNS 
FKKHTQGRKF 


4164 


18065 


A 


4188 


230 


372 


KPMFLPFNPAIIiT>LGIYIiKEYKSF*HKD 
TCMRMFITALFTITKTWNQ 


4165 


18066 


A 


4189 


149 


2 


NRSLRFMRFKEGSHLPP I * VQGKAAS AD 
VEAAASYPEGLGKTINKGGST 


4166 


18067 


A 


4190 


2 


159 


MHQPWLLRRPKQENCLNPGGGGCSEPRS 
HHCIPA*AIERDSLSKKKKKGGAF 


4167 


18068 


A 


4191 


180 


415 


LGFLQT.T.LFVGRSMFIFAIiELSHGWLGS 
SLSF* PVCVKQSSHLSLSSS WDHKHMSP 
YPANFCI FCRDKTLPCCLGWAQ 


4168 


18069 


A 


4192 


424 


41 


PPLLRAPPKjGLI**PKNSFPPGAPP*IPF 
PPKKIKIFKFTPGGGGAPLFLLPRQVKA 
EGSLPPRVFGPPPFFFPPLPSPIjGAKPN 
PFFFFKKKKKKEWIjHKSSLFYSAHFVLC 
LKYShSFFYQSVKTK 


4169 


18070 


A 


4193 


105 


424 


EliKRLVIKLIRGI PEKGKAQCKEIQKLA 
QEVKGEIFMEIGSLKKKQ*KIQETLOTL 
LKMQNALESLSNRIEQVEERNSELEDKV 
FKLTQSKKDPSQIKKKILYNV 


4170 


18071 


A 


4194 


3 


240 


LCLQSQLLGRLRQENPLNIiRI *GCNEP * 
LHHCTPAQVTERDPVSKENKGSFIPMKI 
GKLVSLTLNRMCNCKNSEKENV 


4171 


18072 


A 


4195 


385 


1 


KMIILTKKMEIKHKDEKELQKTEVDLKI 
ENNTTRARLTNNIKXKPPESEKTNTKNH 
TK3CSL*DQDTKKETRPQKKKNPFKSR*V 
RHWRNILNLESRNRGKKNPS GVQKKRDS 
VPTLITQTQETWWWCF 


4172 


18073 


A 


4196 


3 


272 


LFYLIMALKHKSSDVGNSNIYAKEKLES 
ASFKERI*KSYVEVAKIYGKNEYCIYEI 
VKKEK* IMHSI YRVQYYLQFQTSLEVLE 
RIPHG 


4173 


18074 


A 


4197 


236 


379 


GGRFKGSNFTSPGLQGNSFFMGPPKLNS 
RAGV* QRREGKNPGVPQFNR 


4174 


18075 


A 


4198 


375 


1 


NFKIPAPPPQNKGPLRVGTPPFLIFSPP 
GSPPKSFPKAKWARRDRPF*NPGETLQE 
G*VFFSGRFFFFFFFRDRVLLCHPGWSA 
VAQSRLTTSSDSRVHTIIiLVGRGCSEPR 
SHHCNPAWVTRVK 


4175 


18076 


A 


4199 


1 


270 


PSRTGPQI PRRPTRSNCPNQFLGF * GCG 
EPRSRLCTPAWATEQDSISKKERHGUiV 
FLKGFIHYIRLLINLFAARGRKRVLFAL 
GGREMW 


4176 


18077 


A 


4200 


388 


1 


PPGIINLFDPGL* PPPPWVMGPPPS+KL 
ISPQKKKNKVPPPVPPINIFSGPPPPTL 
VFWVLFFVSUCRQLIL* *KSGVSAASLV 
RHDGAPRLTS SAGKGHAEPTFQPRPTHH 
RNACSFVLIHIQSYKVF 


4177 


18078 


A 


4201 


298 


158 


FKI VINFQFLTP I FAPSEGGRGG*GRGE 
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LRS CHCTPAWVTE * EPI * KRKI KGKKRT 
RHP ^PP DMT .A 


4178 


18079 


A 


4202 


128 


358 


KLMI PPS PGQP PRPGGRWGGHPPARPAA 
PSGR*GAPLPGRPY*EKRIPSARTPPRL 
GGWPSMSLRTGHDDNGGVWE 






A 






i 
l 


LLPGF 


4180 


IRAKI 


j\ 






-* 

JL 


A-fK. Iritis. A flatrrT rl" VAAauMrr fc'KjenjLH.ir 

KGGSPPPPPFPPPGGKKGGKKKNNPPGG 
* KPWGGFFFFP P PF * TPPKKRGLFLKKK 


4181 


18082 


A 


4205 


3 


127 


GFHRVSQNGLYLLTL*STRLHLPKC*DY 
RSEPLCLACFLFL 


4182 


18083 


A 


4206 


1 


216 


GFRRVGRDGLDLLTS *S ACLGLPKCWDC 
GHMHRAVWVRWATATL 


4183 


18084 


A 


4207 


427 


0 


LFFTTNFFFWARVFFLPPFF*NPPPGFF 
FLPHKKKKNFPPPPGFFFFFFYAPPPFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
rv VJJ&.X VvoU 


4184 
• 


18085 


A 


4208 


2 


404 


PRVRLGIWCKERIiFLCYFFFFLERGFCF 
LAQGGGPGGNFG*RAPPPPGEKKIPGPP 
PRGGGEKGPREPGRKIFGFLKKKGGGPF 
GPGGVGPPKKGPPPPKPPKGWGYGGKPP 
AyKKuCir r r oivtU<Ar t'i'NWVNG 


4185 


18086 


A 


4209 


40 


412 


PLFCLSEKPRYFEYAIYLMLSLKFLESC 
IMSVNSSAVSASVYY* W* KIEREPLTSW 

KFEGLTLSGFKTHYKTTI ITTVWY*HKD 
TQIDQWNRIESS 


4186 


18087 


A 


4210 


219 


392 


HFFFLNYNFLGKGVS FCPQAEGQGRNLG 
♦IiHPPPPGLKQFFOjTLIiRNWNHRLVPP 
pp 


4187 


18088 


A 


4211 


409 


2 


LEPJCTVFFSPPFIFFFAPVFFLSPFFYT 
PPPLYIFCPPKKKKI FPPPPGKIFFFFK 
GPPP I FFFFFFFFFFFFFFWS * FF IMYQ 
IVFNISYNLIKVDSHLYLFKDKKILLCT 


4188 


18089 


A 


4212 


24 


399 


ADAFSTTNLHGLGPDLFTPTTQLYINLG 
MAILL*AGAVNIGYRSKIKNALAHFLPQ 

LTAIITAGHLLMHLIGSASLAGSTISLP 
STLTVFTILILLT 


4189 


18090 


A 


4213 


411 


197 


SWLTAPSFKRFSCLSLLSS*DYRLPVPH 
QSVGFTAPKVLGLQT 


4190 


18091 


A 


4214 


218 


126 


KEPEPLLFRSSWSVGAIDLKNVMRRVLI 
KSCTPGWAHTCNPSTLGGRGGQITRSG 

IVFE 


4191 


18092 


A 


4215 


1 


332 


MAPSLCLSNKKRGFIGPGFAGAPKHPGR 

^n/^tipPPV'T.AP'PTf JfYSPDtf" PPT VOPT ITT V 

TPPPKPFFFLSPKGGALKTEEEP*TKKK 
GGAEKKPPPGEKRPLGPFCPKKPNWG 


4192 


18093 


A 


4216 


1 


185 


KLYLS I LISLQISLI ITFTATELI I FYI 
F FETTL I PTLAI ITR * GNQ PAN FLYLLV 
SFHEN 


4193 


18094 


A 


4217 


1 


388 


LRFCWETLFPLPYNPFSFHFSFLFFSFS 
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FFFFLGGKFCFCPQPGGGGPNLNIiRDPP 
PRG* KEF * APPPPRGGENQGSPPPPPNG 
GGF* KKGGUtHGGGGGFKPPPQKKPPPL 
ASQRGRNKGGAPPLLKF 


4194 


18095 


A 


4218 


3 


195 


VIYSTIFAGTLITALSSH*FFT*VGLEI 
NMLAFI PVLTKKINPRSTEAAI KYFLTQ 
ATASHPA 


4195 


18096 


A 


4219 


261 


1 


EPRRRERKERKEKKKGATPSRICMKRER 
EBRFS * ERRERGEEGERGEKEERERGER 
ERERSEEERERERERPKN*MRLNGRTRR 
TRG 


4196 


18097 


A 


4220 


5 


375 


DEMLHLKFTYILN+TLKDTIIPKVNENL 
YN*DFLNSKVQGTPPPPRPSRPPSSPSP 
DPPPGPPPAGAGRRARRPGRRGSPPGGP 
PAPP PAPPPRAGRAAGGRGRPPARPGGR 
GPAARPAGGGG 




1QAQD 




4ZZ1 


ZJO 


ZD/ 


GGGALKKKIYCGGGGGNFFF*SPPPSFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFL 
IYFIFILIF 


4198 


18099 


A 


4222 


127 


330 


KKKKKKKKKKKKKKKGGGPFKKKKFLPR 
GGGKNFFF *GGQKKKLGGGVKKKGGGKK 
PGAKKKKKRGEK 


4199 


18100 


A 


4223 


1 


377 


RRFHIiRRENLQEKQTWMGI FFFFGVAGP 
PGPGGKSFSPPSFFF*GKNTFFTPPPGG 
GGQNLVGENSNSNPPPPRDPPPSGPSEK 
I*KKTPPPFLVGPNTKNKKHFGGGGSPK 
KKTLLPPPPLDPF 


**ZUU 


ioiUl 




*IZZ*t 


Zz J 


** j i 


IRKTGPLGFGGQNNPLPKV* PPPGLTFF 
QKGPPPPPFPPFQKPRFLKFYPKREGPP 
PPKVGLTPPPGLL 


4201 


18102 


A 


4225 


173 


2 


RCKLHCLFPTLAI ITR*GNQPERLNAGT 
YFLFYTLVGSLPLLIAIIYTHNTLGPTR 
p 


4202 


18103 


A 


4226 


11 


377 


FLTWFICISLFGRLMGI*FFWFFCEL 
FFHAFFSLFLKTDI IFSIDLLEI I IYAH 
VCHICCNLIFSLYFVFYFNGSLKYIKC* 
NLFAI ILDFADVS FLIFMFRTGKRQNDN 
QFSLKILIGF 


4203 


18104 


A 


4227 


1 


412 


KNSKVKNATDLLKNASESPHSRIDEAEE 
RISELEDRLFENTKSEETK*KRIKK*SM 
PTDLENSLKRANLRVIGLKEKVGKEL*V 
GS FFKDI I * QNVPNLEKDNIQAQEAYTT 
LSRFNPKTTSRHXiAIKLSKVKDKER | 


4204 


18105 


A 


4228 


373 


163 


IGVFLGGAPFFFFFFFFFFPQGPKRGGG 
KGGASGKTG *GGGGLKLGGGKQG V7PPFF 
FLRTWPTFTRAPGL 


4205 


18106 


A 


4229 


392 


3 


PPC*NRAPGFNFGGPFKKIYSSPPPRGK 
FGSLKGPPPFFFSAPPGGPKFSKGGGPL 
FFGAPDF+GGFGGKNYFWGQGFP*PHSP 
PPPFSWGPKRRAPFSKKKKKKRRRGWGE 
GEGQGSNHT.TtKHGYFQI 


/to rue 


ioin7 




4ZJU 


Ol 1 

zl 1 




YALIWLYC II1NNYALHSVFFWCCFCFI1T 
FKFIYVFLNLLFLFAF*LVLCLLKVQNH 
SGSLS IVS YFPLLEACRI FPLFPLF*NL 
♦MICT'NMVVLYFKQLCSALSVFLVLFIjF 
FNF* IYLCIFKFIVFVCFLRRNPTLAQA 
GVQWHDLGA 


4207 


18108 


A 


4231 


69 


386 


KRIFFFLGPRGGGKGKMtfVNGNPPPKGK 
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nucleotide 
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SEQ ID 
NO: of 
peptide . 
sequence 


M 

eth 

od 


SEQBD 
NO: in 
USSN 
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26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
dun no itviu 
residue of 
peptide 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue oi 
peptide 

bCtjjUClILC 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glu tamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proiine, 
V^vjiutamine, k— Arginine, e>=5>enne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
a— un Known, — otop coaon, /—possible 
nucleotide deletion, \=possib!e 
oucieo tiae insertion 














GNFSA* PPQRGGPKGPGPKNKVIFGFKK 
KRGFYGGQQPPNLGKELAGPPKGGEKRG 
KTHPGGKFYAFLGGGLFSKKRN 


4208 












SPRGEKK 


4209 


18110 


A 


4233 


442 


112 


LLGRLRWEDGLS PQGRSCSE P * S CHCTP 
AWATQ*DPVSNTNNKNPKTPQNACSSIjNP 
SLPWTAAVWRTIiGEHCNGREALPARAKN 
AiKAFVTil I Hrl 1 CJQT VKT I S CAL L PH 


4210 


18111 


A 


4234 


415 


58 


SLQKGHFPPFQELDPQNPGRSGWS PVLP 
TTGLFFSLDFIRFT*FMAGVLVGYKHCW 

T^WTPIV AT T7T T T T?T T3T\T)\TCJT nonnr T TTTT1T 

iiW 1 AALt V J_ili Xj FIjRDKVSLCFPGL ZH 1 LG 
LKRSSCLSIiIiSSWSYRHTIPHPAST*FF 


4211 


18112 


A 


4235 


406 


168 


HGETPSFLKIQKLAGRGGTLL*SQLLGR 
WRQGNHLNWGGRSSVYIYTEKHTVHTQQ 
SVTTPFMSMSQTLAYSPLSLKSS 


4212 


18113 


A 


4236 


247 


397 


TVCFSVYIYTELLPPQVK* FPCLTLPSS 
WJLJ 1Kb V PFJjPAz FCI FS RDGV S 


4213 


18114 


A 


4237 


36 


405 


RLYCFIKGLNVKNKTFRIVFFFFLETNF 
PFGPQGGGEGANSGFPBPLALGVKKGPR 
PPPPGGGE*GAGPPGPGNFGLLKKKGVP 
RGGGGGPKPPTQRFPPGRP PQKKKNFFG 
PALiCiiCKKVFF F 


4214 


18115 


A 


4238 


407 


3 


KKKKKLTRPGGGGP FS PLPKRVRQKKGG 
NPGGGFSKKKKSPPPPPPRGKKKKPFSK 
KBOKRFLSGNNTGKNRLGKDPQS* PAGNA 
KKlU^GWEKAIiARGRAPGMTDRKPQHSGL 
DGSQQSGTGQEPGNSGPVPKVH 


4215 


18116 


A 


4239 


144 


405 


PTVKLVLLYLVLLKVAVSKNLR* *GGLT 
V*QQRQILWKRILSOTEFLYLRQSKMMH 
UVLi * FTVNTIjKSIjKSQQAR wltpvi pal 
WEA 


4216 


18117 


A 


4240 


369 


40 


PLLQGTPASWPRWGRAGDRPRKEG * LHP 
RETRVPPALHHQPAPAGVSLTSPGPPFW 
LS LQRLRGHSSSDS LPAVCQ YSGS WRE E 

KAA/VKAtrAlj 1 FA1CTL. 1 1 Co VLTWL 


4217 . 


18118 


A 


4241 


397 


1 


ISKSLFFPIPFWKKKKKGFSPPLNFIiGG 

FKIFKKYFPFPRGFKTPPKKKIFLNPFF 
KKKGGGGGALFFFFFFETGSRSVTQAGV 
KWHDHGSLQP*PPRLK*SSDP 


4218 


18119 


A 




9^A 




AD P PCS P T.P.Q A T lOTnun * WtJ 
rtunruo r ucoAr jcj X \70no JLn\^ i Vj vy rUM 

YGSP*PLTPGLKLSSCLSLPNCWDYR 


4219 


18120 


A 


4243 


434 


104 


KQPTFQSTTGGSPSAGRPPPPRAKKKKP 
PFF*KKKKKSGNIGGPLFPPPPGGEEKK 
NLFPRKGEGSHKTDSSPPH*RGGKKKNL 
CPKKKiCKKKAWCWrjYRRTfPT.r'PATVP 


4220 


18121 


A 


4244 


2 


425 ; 


FVFKI ILKNLSE IQENLQFSKI KKTIHD 
LNKKFNKETGNITKNQTE ILELKNSMSE 
IKNTVKSFNNRLNQAEERISELEDRSFE 
MTQADK3CRKKGEESLQDVLYTMK*TNIL 
WS FQKEKRKEKGLQNILNE I VAENFPSL 


4221 


18122 


A 


4245 


12 


424 


IYYKTLNCT*IF* FGGALACFHFSYPFY 
IHYVGFVPSPVYLFSHVIISVWTSGYLL 
YSLGCNP ILSLFI FLLKLFHFGYLTI QV 
GIYVWILFVCLFLSNSFINI * FTYHAI P 
IPIYIPLKVYNSMFYSIFTTK. 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQfl) 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Vf=oiutamine, K— Argimne, o— oenne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
A— unknown, — 2>top codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


4222 


18123 


A 


4246 


215 


425 


LSIQISQIRNLIYIiFINIMGKILRAKRK 
*GTSDAWNQTI FLRPRTVAHSCNPSTL 


4223 


18124 


A 


4247 


413 


2 


WBVESFSLGKAFMWGCATENPGPPHPKF 
LLFIMKAPPP WRVGFQDCFNPKDGRFI * 
PRTPPCPRARGAKKNSVSKKKKIKIWKR 
WPPTPLPPPGNGGTRKFPSSPPPPPPPT 
TPKITPPGGERQL * PQRAEGAGDTK 


4224 


18125 


A 


4248 


61 


281 


ITWKDFDSFS VLRVSKAFHLALPAF IGN 
C I KCH I KMG I YPEEKFKNIHKxL YLE I D 
VCGH*TWKKI FFLFFFLRQRFTLFAQAG 
GQWLDIiSLLQPPPPGFKRFSGLSFLSK* 
DFDSFSVLRVSKAFHLALPAFIGNCIKC 
HIKMGIYPEEKFKNIHKYLiIjEIDVCGH 
IRLGKKFFFCFFF 


4225 


18126 


A 


4249 


23 


13 


RWHRKPFLPLAATSCAWPRSVTPSVTS 
GSADGDGQSLAVGMTLATVGGLELLNS 


4226 


18127 


A 


4250 


182 


342 


KGFFFLPPGGGGGGEF* FNEPPPPRVKG 


4227 


18128 


A 


4251 


2 


219 


PSLRKMQKLARHGGTLLWSQLFGRLRRE 
DGDP VS KNTKFFFFFF 


4228 


18129 


A 


4252 


223 


324 


LGAVAHICNPSTLGGRGRRITR*GVRDQ 
Arvj%jri.VxE» 


4229 


18130 


A 


4253 


165 


336 


ESDAQGHQVAARVSHVLBKDAIiLVFRSL 
CKLAMKPLGEGPPDPK*ADSSSWPPSVQ 


4230 


18131 


A 


4254 


3 


233 


ETAFCHVGQAGLELLTSSDPPASGLQHA 
GI TGPATKSAP * WS * VSGPHLGAFDSLA 


4231 


18132 

- 


A 


4255 


394 


44 


AKIFPPGVKKFFASTPPGGGKKRGPPPP- 

RPWPPKRWGFKGEAPPPPPKLNLFFPQG 
GGGKNFFGKKKGAPTGGPPFFFFEMEFH 


4232 


18133 


A 


4256 


152 


2 


YWSPFYCRSCC IRQGTVAHTCNPSTLVG 


4233 


18134 


A 


4257 


404 


190 


AQLIRGWRQ*DHLSLGGGGCNBPSSCHC 
TPAWITEPNQSLSQKTQKQKMLIPN*YG 
♦TVSPPKSHPELOFS 


4234 


18135 


A 


4258 


288 


407 


ENCRPRAVAHACNPS * ENCRPRAVAHAC 
NPSAFGGRGEQ I PRSGEQEQPVQHG 






7\ 
I\ 




406 


in 

loJ 


onu V y lilrvjDflVJu 1 liv c UIUU^A" Vvjolsvt 

PTRYSPI FGGVGPQNGLNPQGGGSKNPK 
LNPLLPPWGQKGVFFSKKKKKKERD 


4236 


18137 


A 


4260 


101 


276 


LTS I VPS LWEAEGAKLRGLGRPACRESS 
LCHCTLPWVTG*NLVSKKKKKRGGAIiGF 
FF 


4237 


18138 


A 


4261 


2 


414 


WVAATPNLVFVGRGNCQDWFTGPFLTRG 
GWGGVLFPPVKRVIKGLVAPFSGPIPIR 
KNHQKVGGFLEPNKTNFKB *NFVENKNT 
RRG I inrMVPKKKKUOKKT PT.P POT PMPM 
QTWGDFDFFFFAKKKKI INS FLGWA 


4238 


18139 


A 


4262 


337 


382 


RDARFLHCSLV*LIT*LQKEFFFVGQAG 
VQWHYFGFLQPLPSGFNQFFCLSLLSRW 
HYRGPPPSLGKFLVGWLVGWLVFETKFW 
FCWPGWRAlTGFWVPATYTFRVQAIFIi 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ.ED 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, HHHistidine, t=IsoIeucine, 
K=Lysine, L/=Leucine, MNVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginme, S=Senne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 




18140 


TV 
A 


4263 


406 


184 


QLLGRLRQENCHLGGEGCS * LRS P PCI S 
AWATKLDPVSEKPKTKNQKPTGLGGLNQ 
GPASLLKKIRKLGKKRPG 


4z*HJ 


1 O 1 A 1 

18141 


A 


4264 


233 


440 


LPVHHGWGS LRKLT I LGKGEANPS FFTW 
GQQGKAKQKGQKQPGAGAHACNSSPLGG 
QGR* ITRSGVQDQ 


4241 


18142 


A 


4265 


2 


230 


GARL*SHLLVRLRKENHLNLKGKNCREIi 
KWPHCTPAWGTKKDSAAKNKNQPLPPPK 
G VKS P PQREHGL I VTCMFVQ 


4242 


18143 


A 


4266 


1 


349 


HKTNIHFKIiVMVSNVFTFFFFGKGFSFC 
APGGRAGPLFGLVETPPGREKAFFGPNP 
PRRGE * RAMAPCPGKI LVF * KKTGFPRG 
LNFWPGGPPPFGLQRGGNNGGNPWPGPE 
FFWF 


4243 


18144 


A 


4267 


2 


375 


RSEQPAGSASSGNEGLSTRASGCGGCTG 
S PNSAS PPALCS I SRRALAAFPRGRARD 
LQPAMPEPPTHSVGSCAAKPPR*APPPA 
PRRP VP STTQGLRS AGAQRGTGRHLHLQ 
PQCRIHWVKPAG 


4244 


18145 


A 


4268 


377 


3 


TPAWMTERDCIWRRRTSAPGGSWPSGPV 
PSPGAQ* RP PSQGLGI»WWAAAAAPRC*T 
APGPRPPPHGPGSPQGASPPTRPPRCRP 
HP RAGS AGP TGAT P PGSTQGQRRRHSHQ 
LPGHPGHRVALG 


4245 


18146 


A 


4269 


1 


294 


LEDWGGRGARAHYDGPSItEPESDHYRLR 
LGQYHGDAGDSLSWHNDKPFSTVDRDRD 
S YSGNWALYQRGGCWTHACAQ* I LDAVY 
ATGPKYQVHCEALH 


4246 


18147 


A 


4270 


2 


218 


TGRI IILSQGLQTLLPLIAF* *LLASLA 
NLALP PTINLLGELS VLVTTFS * SNITL 
LQKKKKKKKKKKKI FF 


4247 


18148 


A 


4271 


32 


443 


LHSDVDQLAGLVF PGC P WPLAS PARRAP 
AGPWPRRAAAPPS *DAPAPRIiAVSAGSP 
AWPPPST*GLPAPAAWASPASASPTSA 
RSRS * ASPTARRCRPGPGTAAPTSASRQ 
CQWRP * PCQRSPPSGTCSSAAAAPT | 


4248 


18149 


A 


4272 


434 


57 


HTiSFPPASAAPKPIiPQAASLWSSPPSIV 
IiAQVPPMNTDPCPQPSA* PASELS PEMP 
PARPQAPPENWSHP *GCGAQTVAPLVPG 
ATPAHRPAS PSPVTLPVLTAGGHCFYLY 
TYQIFLKFHYIKR 


4249 


18150 


A 


4273 


288 


424 


GLSLSVAQAWQWCDLVSL* PPPPRVKQ 
FSCLSPPRIWDY*HPPP / 


4250 


18151 


A 


4274 


1 


429 


JNTRCaAAG P PQMPHP PRASAFPENPCGRK 
N* GQVSGP*ASGS S PVKWAGPAGTWRKG 
GIiGWGPTRVRGRGPRRRPGASSGYARQQ 

PDPUDntTDDCDDDDT OimAPTlT r»TV*»/VT»0 
larFvjUf KrB fro t'KKK.ijia V fALiVbCFGQ 1 o 

GTRAGLAVWPQIGLCLKAQGARSCPRDH 
SSD 


4251 


18152 


A 


4275 


253 


3 


PHQFNAVI YFPSPVSRGCPNNALYS PFV 
IWLFCFVKTGVSVTQAGV*WHDHSSLQ 
P*TAGLKRFSSLTILSSWDYKSTPPCI 






A 




AAA 


? 

■j 


PMVSOGGVCAK*OARAT PRFfi T PRE nn 
TQWREVQQKYNMGLPVDFDQYNELHLP 
AVTLKTFLRELREPLLTFDLYPHVVGFL 
NIDESQRVPATLQVLQTLPEESYQVIjRF 
QTAFLVQ I S AHSDQNKMTNTC I 


4253 


18154 


A 


4277 


373 


3 


DGVSLCPPGWSAVAQSWIjTAVL*AIiGFI 
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NO: of 
nucleotide 
sequence 
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NO: of 
peptide 
sequence 


M 
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SEQID 
NO: in 
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beginning 

nucleotide 

location 

correspond 

big to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

nentide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 
Er=Glutamic Acid, F=PhenylaIantne, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 

>c viuumuiwvj av glume, O OCI IIIC, 

•^Threonine, V=Valine, 

W=Tryptopban, Y=Tyrosine, 

X=Un known, *=Stop codon, /^possible 

nucleotide deletion V=nn«ihlf* 
nucleotide insertion 














FLFP VGGLTG I VLANS SLD I VLHDTYYV 
VAPFPYVLS IGAVFAI IGGFIH* FPLFS 
GYTLGQTYAKIHFPIIFIGVNLSFFPQH 
NLGLSGMRRCI 


4254 


18155 


A 


4278 


71 


176 


AGGRGCNESRSHHCTPAWATE *DSVKEK 
KKGLFFW 


4255 


18156 


A 


4279 


2 


379 


SPFVPLQSSLGNKSETPSHGRKKKRRKR 
KRTDENQP*KHPLRARPGPGAPRAQGGA 
QGSGGGRRAGPCRKLVHRGHTGNPKREP 
GLI POGEGRST.G T FTP qNGCPTSCP CP<?RfiP 
PPPRPKQGRGRMQN 


4256 


18157 


A 


4280 


2 


338 


aDMPRRKLDPLSGRNTLGFVCVVWAQTS 
GLK*SSCIiCLPKCWDYRHEPLRPVVJVYF 
KLWHPSPWKPGFYFSVFFPFYLPLCTTA 
slfcscIjPPCCSp VTI.VGS e<? QriiWPan 


4257 


18158 


A 


4281 


407 


1 


KERGVATGHTAERGRADNB E KRGEVANK 
RGGREVOPEARRMTTTATGTTAATPnAP 
T * TAATTATAPVTRT AAP ATTAS STRTL 
RTPRRARTAS VL WARCRS RWSATWRASR 
TCDCRTASHSRPTATRSRTARMA 


4258 


18159 


A 


4282 


367 


225 


PCDSACLGLPKCWDYRREPPRPAGVLFL 
* AS FS * QS KEVYVCKLTHA 


4259 


1X160 


A 


ham 






NTCGLOS VCRGTOPGGPKP O PPHPPFP W 
DLGDEQESSKEGRPGREGVQGQGLLGAT 
PGAQGSH*GL*AEAAMSSGHALGPGQVP 
LTPLSPS PPNPHRRPRAGRASRQSREST 
EAQRAVPSQGAAPGWETDWGSSHQWQPC 
QAQQEGRTGR 


4260 


18161 


A 


4284 


436 


22 


CHVTGTQP IKVS WAKDSRK IRSGGKYQI 
SYTiENSAHIiTVT.KVDPrGDSGOVTr'YAVM 
EVGKDSCTAQLNIKERLIPPSFTKRLSE 
TVEETQGNTSFKLEGRVPGSQPITVAWYK 
NNIEIQPTYN*EITFKNNCIAAARRI 


4261 


18162 


A 


4285 


301 


403 


LSIS*PCDIiPALVSQSAGITGMSHHARP 
RVKSLI 


4262 


18163 


A 


4286 


380 


2 


AQVLYSSREQERRQDLPEQVIQAEGE*V 
KASACQLTFEDEEAMESGPAALDKDFQC 
TRKHHFAEVQGSPRCMSSRYIjVDGCPKT 
FAEAQNVCS * CCEANLVS IHAFTFILRI 
QWCTSTVNQAQVCI 


4263 


18164 


A 








YI YEGSIMEEPRKPVPKGPLGLHCPGKF 
QG* RNS YNHHAVRVGTRCAPEG VKDLTS 
SLQSVITKPEQNIQEIiMKHFKKEKSEAE 
NHIRTLKAESLEBKNMAKIHRGQIiEKLK 
SQCDRLTEELTQNENENKKLKLKYQCLK 
DQLEEREDV 


4264 


18165 


A 


4288 


52 


400 


LDLYFFYRQCLALS PRLQRSS AILAHCN 
LKLLGSGDPPTPASQSKEITSMRYHI * P 
NL*NSMC*NCVFDTTHRIGENIWESYI * 

SNLN 


4265 


18166 


A 








EL * SHHCTPVWVTEGD P VSKKRINI CKQ 
NLNEKTNLTVDTPRHS FTHKI KSLNHFQ 
MFL 


4266 


18167 


A 


4290 


3 


142 


YLGVHRLS IDGLDLLT * *SARLGLPKCW 
DYWRDPPRPANFYFILFK 


4267 


18168 


A 


4291 


324 


19 


EMS KRHEQ KFHRRE LQMAN KHVKRT S LA 
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nucleotide 

location 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OrCysteine, D=Aspartic Acid, 
E>=Glutamtc Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I-lsoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
A— un Known, otop coaon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














J.K1SMM ETTMJt I X HTP I QMA* VRNNDTS 
QVQWHTLQVASPLVAEVGGWNEARNLRL 
QIVKMVPMNSHCTPAWVT 


4268 


18169 


A 


4292 


2 


204 


IHFCRDRDSLCCPGWPRTAELKQSALLG 
LPKCWDYRH* ATKPGLQSVFLLGLKI FI* 
I LLVRGVDR YL 




16170 


TV 


4293 


427 


3 


RRKTVTVPRTGERGI ISNEDNGLCKAVG 
NIPGSPLOIVSKIiNlMKHAPGGHAARFC 
IWTESAFRKLDELYGTWRRAASLKSSYN 
LPMHKMINPDLSRILKSPEIQRALRAPR 
KKIHNRVLKKNPVLRPL*RIKLTSRHKR 
K 


4270 


18171 


A 


4294 


24 


372 


FICPLQDYVICSDVTSCWKKILATTGR 
RLRDDSVQFSFFFFGKGGSLLAPRVKEQ 
GGDLG * WNPRP PRLREF PGLALPRCWNN 
GLAPPPPLIIjVFIjEKRGFPLAGKMGLNL 
LHSR 


4271 


18172 


A 


4295 


1 


332 


IHSS YFFPQS * FFGTINTSDKPLRILIK 
EKKREGTN*HIMHEPWKFGIDPEDIIKV 
KKKYYKQLCTH KFDNLE EMVHFLKKKT I 
HLI*NR*FE*LYKYYRN 


4272 


18173 


A 


4296 


407 


30 


WSV I YGEN * E S LGWG AHP SHVANT R VTG 
LR YLFS YGRHALDMLES SQDNMRS WWVS 
QMSSE IDVDNLGHI SLCNAVQ* IRNLNP 
GLKTSKIELKFKELHKS KDKAGSBVTKK 
EFIEVYHELYAVG 


4273 


18174 


A 


4297 


350 


1 


YKTWMNKEKW * YVGYNIEQEQKIjALKT 
TVLDEWYTLPDGRI IKVGGERFEAPEAL 
FQPRL INVEGVG VDEXiLYSTIQAADVGT 
RSEFYKHIVLSGGSTMYPGLPSRMEREL 
KQHV 


4274 


18175 


A 


4298 


623 


1 


SRRGCAATCDGS ITAWPQRQAAQGKPS V 
HS KLEAEAKPTPGDHAASESTGFSCLGP 
GGVHKTAHAHARNMPGDSNTGSGQSPAG 
RRWEARGSAPRHHPTQPDWTHPDALRQA 
MARNPASS F*S F* RCCTAASPPVPTPPS 
PVLMRGPVPGGGGGQGKIRPLQEEAPPP 
SSPWLSR* PQAGTPSSPAVSSLYHGGL 
SPTGRQDRWGR 


HZ /D 


lol /O 


7A 
J\ 


4299 


469 


3 


PVRNCLGARFKVSGRAAHHAL* QSASTL 
QGDPRTKRQAI SA* S SAFDVQDVSHVTL 
PFYPKRAQSKDLIKEAILDNDFMKNLEIi 
J. VUUyi x Jr V £• xtaKJJoRX VKEVDVG 

SCVYVMEDGKVGVTKQGVKLCPVGPGKV 
YEELAILYICRRPCI 


4276 


18177 


A 


4300 


506 


321 


KPLSLLKIQKIiAGHGGSCL*LQLiIiRRVR 
QENCLNLGGEGCSGLPKFWDYRCEPPCP 
DCWPS 


4277 


lol /o 


A 
n 


43U1 


AC A 

454 


1 


NCjblvr I ib(jiU^xvrLF*NTRRPTL»FTWG 
KTN*GERPRSSVYSCSSWRLSSSSLGRS 
+NSVGSLNI ILATLTAQELSNS * ANNLI 
. * LAYT IAF I VKI PLYGLHL* LHKAHVEV 
PIAGS IVLAPVLLKIiGGYGT TRLTT.TT.N 
PLTKHIAYPLY 


4278 


18179 


A 


4302 


3 


463 


AVSLPLPLPRFGRLPPLRVGWCPLRCH 
ILRTQRLEGAPSR* QTGAKQAGVWEELR 
SRLSLGPELGRIRPYSGDPGQVTDSSGA 
LASSLERGCYPAALLVGEGEGEMYIiGPY 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/5154 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
pepuuc 

iclj uuitc 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
uing to 
last amino 

aUu 

9 CjIUUC vl 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Add, 
E=GIutaroic Acid, ^Phenylalanine, 
G^GIycine, H=His±idine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine ? P=Proline, 

f\—{~llt\4-<i mino DsAiviinmA Carina 

v^vjiuiaiTiine, iv— Arginine, o^senne, 
T=Threonine, V=Valine, 

W=:TrvntA nVi o n YzTvrActn^ 

1 In Ifnnwn *2^tftnrnrfnn /=nnccihlp 

A VlllUJV'f IJ| ^/WU lUUVll] / f_/ i/JolulC 

nucleotide dpletinn V=nnc<;tb!e 

IIUUvVIMIG UCIvUUU) i^UU3jll/lw 

nucleotide insertion 














HPT * DGLKRLS LAKPPGGETRLTRSGHT 
FWGEGIPPSAVCL 


A97Q 


ioiou 








9 


FFFFFSETESCSVTQAGV*WHDLGSLY 


4280 


18181 


A 


4304 


194 


3 


FFFFFSETESCSVTQAGV*WHDLGSLY 


4281 


18182 


A 


4306 


517 


254 


WBETAFRLIGKTGIGLVTSGDLAASAS 
QRGGMTGVSHOTCUCEFVLMNI IVMSSGL 

TT7T.D T? Q Pm?V * fJQ *f2T . Q WT .P Q PT .ZVT .MTfl 

LRN 






A 


4JU7 




3 


K" T VMVn T DT .T-I T DTJDT A7WT.NT XIT .T.TVTT.iVT 

SSLAVYSIL*SG*ASNSNYALIGALRAV 
AQTISYEVTLAIILLSTLIISGSFNLST 
LITTIiV 


4283 


18184 


A 


4308 


234 




inrj?T?i?i?ppT RnnfiPT r P r rTW<;ppT.prz*p'K 
SPAPPPPRGGIKGLAPPPNLFFFLDKR 


4284 


18185 


A 


4309 


403 


100 


SGGQKAVGPPWAAPPGYKQNTNKLGPLP 
QPSTEGGAFWPTQGPKPTGLPPLPG*LN 

DDWD 1 ? P P*WP P P P D PT7NTTP C VC* DHTOQ P 

TPLCGPPPKNPRLFFFF 


4285 


18186 


A 


4310 


399 


85 


WKPIPLAPEWRGVIRFRALPPCRPLHCP 

TT.WT.TA OQ^nunPT* VWWf217T .l/TZVGT.QV 

FLERESCSFTQVECSGTKLALRSIiELLG 
VSDPS I SAFQRAGI AGVSHHA 


4286 


18187 


A 


4311 


230 


2 


WTEEDTRRCLVLFSFFLS FFLFCFVLRQ 
RPACCPGWSAVTQSQLTVASTSLAQAKR 

Q QWT.Q PPflP Q + TiVP HTD/V T 


4287 


18188 


A 


4312 


1 


420 


NTSWGVGEIiSLIVIVINMLLPYVWLPKG 
KFFFPQNSRFPSAPPHSSPPGLRSDFSH 
cnni .pi?wt ,T?\rr xinx .pt .dp p p dp a cjttjvp 

RPLGTQGCP*LAWYIHLISASYQKANAA 
pnT.Qf TT.nnPTPfl'rrnnTT.T t?tfftt .pt.q 


4288 


18189 


A 


4313 


403 


207 


ESS EG + LNPLAHLAMRYKGCPFKDVREK 

SEFILKS IQVRKS ILNTRMSLLQIiDFTC 

R YG/R CVPPY 
i\iox\v> vur x 


4289 


18190 


A 


4314 


469 


328 


TSEGGGFNELRLCHCTPVWLTK* DSVS I 
SQRIRKKNLKTQSLVVRCP 


4290 


18191 


A 


4315 


1 


212 


NTLPGVEVLKRLRWEDSLNP*GQGCREP 
* S CHCITAWATEKDP VSKKKKKKKNPGG 
FRPHSLSQPFGPPK 


4291 


18192 


TV 


4316 


63 


559 


QMT .T PUnPfiUD A VPTTPQC T»fVlP VPC PT 

SSATPKPIPS*AACPPPDCALAAEVRAL 
PAAGRPRFSEACLTPQPNPCVLWPCLWD 
HSPTSCPTGSS*APHSCAFLPVPRLGGG 
MTTVHPTVHIiP THS S LTPAWE * PS PPAC 
LTAPRPTXMCLLSVPPSAYSSSSPS 


4292 


18193 


A 


4317 


135 


405 


PAPPS PHPKPOSVSCVIPPS *PAPPSPH 
PKPDSVS<^PPSHW^PSGLPEMTLLB 
PRGPCPPEVPPTSPFVG*AWWPHPPARR 


4293 


18194 


A 


4318 


413 


1 


LYLVGRA* SFHRS FPGQERIHPGGKPYD 

nrFrTl'RTF'T QT.V^ TRPHMT.TtfPnriVPYTr 

V^I\-CiV-.vjiJ x x lOU V o XVjXvTli 1XJ x IlXvVjVJ V JT X XV 

CKVCGKAFD YRS LFR I HERSH PGEKPYE 
CKQCGKAFSCSSYIRIHERTHTGDEPYE 
CKQCGKAFSCCKYIRINERTHTGGV 


4294 


18195 


A 


4319 


479 


342 


GVGVDNTOILVl^TinPWVlxDSAIRRR 
* DNTVNI LFIjTADVTSR 


4295 


18196 


A 


4320 


400 


26 


HSEAVTTVRSHHSPVGHIHSLNAPPTVA 
LTP*Y^KPLNIFV*ARAQKIQGQFLTHT 
IPHILTJ.VITRDTSRACVCVCVCVCVCV 
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NO: of 
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SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 
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SEQ ID 
NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

ftminn arid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

UlJIg III 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Lr=Leucine, M»Metbionine, 
N=Asparagine, P^Proline, 

\jriuiaiuine, iv— /vrgin me, o— oenne, 
T=Threonine, V=Valine, 

. X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possib!e 
nucleotide insertion 














CVYAYIWVCMCVYIYVCVYIYICNYTFI 

tr X XJ V flUXr J. IN IU1 X 


4296 


18197 


A 


4321 


370 


1 


SDRGQGKAGQATRLGSRQTGQDEDKGTE 
KSIPSWDKGPVNNEFGKSVNVSSNLVTQ 
EPS PEETSTKRS I KQNSNPVKKEKSCKC 

NECEKAFSRNV 


4297 


18198 


A 


4322 


342 


82 


GLGLTTNWLLTGKSAGRGCAPSSGGLWIi 
AVGDSVYYHRETEGGKPSPGRPCC*PS* 

AfST.APPQDPPPnPPWADfiCfVT UVDPT T 

TAH 


4298 


18199 


A 


4323 


284 


3 


ITKKFPRVIGGGCFS PFFMGLSKKNGLF 
PEGALS I SLI FGPAI PPGGAQVTSSAKE 
KKKV1£RPGWAHACNPSTLGG*GGRIK 


4299 


18200 


A 


4324 


365 : 


3 


QVE VYWENHP FRLEES I YQ* CRLEGAT 
SVAGEQISEYNISMRASDGGSPPLSTET 

tlx Xixtiv A.LJ±DiU£iir xr xr ir rllxoxoAx x irriW 

NPRGASIFSVTAQDPDSNNNAR ITYALT 
EDTLQGVY 


4300 


I OJLU t 










ro F LtJKU w ottXrliLKxxtS L^erfflx oAJJ iNLtH. 

LPGSSYPPTSPSQVAGTAGPSMSLQEGR 
ASHDITSSSRS IGAKELLRPADHPQCI 


4301 


18202 


A 


4326 


307 


95 


FLFLLKDKILLSPRIiECSGTIIAHCSLK 
LLDSS YPPAVAS * VAGTSGMCHYTWLRW 
i\lNriJjoir\Jxyov.olSx 


4302 


18203 


A 


4327 


371 


2 


WAPNHISPTPRSGSTTRIWAPTPPSNSP 
RPCQDPGHRTDPWYPEEEFLLTNPDPPR 
APASWSFPFQEKRLHFPALPCP*HLDSS 
MI^OTLFSFSFPPTSIRPSQTNHSKGTP 


4303 


18204 


A 


4328 


409 


1 


RlxLxEARQPEIiEMAAL I FFTLKYKHVERE 
QKYHQLQDEYFTS AWLTL ILAALFGLV 
YLLIFTOSVVVLxxIiLVLCICFLVACVLY 

T.WTTP VOPtrpcrr.T totptvt .t*t TrTin/T 
unxinvyLr trv^v^xix li^XKi vXA»xr XV VXi 

IYSVAQGCWG*LPWAWSSKPNLY 


4304 




A 




zoz 


& 


T iHO A YTtVT .T) T TfiUHNR TTDT5VQ PTTWPOTin 

FFFLVFS PRDSLTLLPRIiECSGTVMAHC 
HLDLLGSGDPPTSAS * I *GTTGAYHHTQ 
HV 


4305 


18206 


A 


4330 


1 


399 


NTF* FLKGMGLWHVFTl^xNXiKKIiYVSN 
LG IGHTR YATTGNV 


4306 


18207 


A 


4331 


408 


3 


SSVGIHVRTHTGEKPYECKHCGKAFSCH 
SSlxREHVRTHSGEKPYECNQCGKAFSHA 
QYFQKHVRSHSGVTCPYECTECGKAYSCS 
SSLRVHVRTHTGERPYECKOCfiKTFEYT. 
ASLQAHVRTHAGA* IYKYSGHV 


4307 


18208 


A 


4332 


110 

X 1 V 


426 


GLSPGTWSDMTGGPAVTAFPCTLTGTHA 
GHIIIDGIDIAKLPlxHTLRSRLSIlLQD 
PVLFSGTIR*APPPxxRPT0POAGSVPSD 

x * xjx w w * ^ *v * *x x x au\4 X VpnwJw' » t wlv 

LEHKEEGVGGDQDPRGVSCSC 


4308 


18209 


A 


4333 


133 


1 


EPCCPSALAAPEVLGPEKYDKSCDMWSL 
GVFMYlUVPSPPPLY 


4309 


18210 


A 


4334 


165 


1 


I ILTFFKKNERKVVKAVQYW* KNR* VDQ 
WNRIASPEIDPHE*SQLISDKGAKECI 


4310 


18211 


A 


4335 


42 


440 


SARRAGDPARGAPSRNNASLPDPRELTG 
RPAGGLQ*GNCPAPPTPQLPSPSVSRPV 
SGRGPPPPPSFGDPRANRPQNPKGAKW 



816 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
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NO: of 
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sequence 


M 
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NO: in 
USSN 
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26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 

pcpuuc 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
uing to 
last amino 

residue of 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glufcamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
V^^vyiuiaminc, iv Arginiuc, o — oenne, 
^Threonine, V=Valine, 

X=Unknown, *=Stop codon,/=possible 

llUClCUvIUC UCICUUII, \ pU-JOIl'IC 

nucleotide insertion 














LTVQMGEPGE IRRAACPQLS WAMTFPGS 
T.pF PKRTGPO IOKSPPSRRGG 


4311 


18212 


A 


4336 


1 


160 


NTCGGGAQL* SQLLRRLMWEDLNSGSQG 
CSBLRSHHCTAA* ETKGDPLS INEY 


4312 


18213 


A 


4337 


236 


1 


QRSLLCYIMEIRTVAVRIVAIKGVESEV 
YLAMSEEGTVYAKKECNEYGIF*ELVI J E 

QMWTVAttAVT.PPVrVTF^KVQ 

Onl 1 1 IiinnVuK.rVUUJ.JiOIV.VO j 


4313 


18214 


A 


4338 


408 


3 


CQSSVSKKERTNGAQNPGAAKQGNNELR 
DSTEQFQEYYRQRLRYHQHLEQKEQQRH 
I *QQMLLEGGVNQEDGRDQQQNRNEQFL 
NRS IQKLGELNIGMIX3LGNEVSALIQQC 
xrnqyr!M^QMfST.-*VTJQ ittytppwv 

INoOivoWooiNvjij^ VNOC LT JUrJrllV 


4314 


18215 


A 


4339 


363 


1 


RSQSSLPKSFKRKI S WSATKGVPAGTS 
DT* GVQPG WQ * RWGASTATTQ KKPS I S I 
ATESLKSLIPDIKPLAGQEAVVDLHADD 
SRI SEDETERNGDDKTHDKGLKI CRTW 


4315 


18216 


A 


4340 


1 


353 


DVFLDTLARPLRHSCNFFLCYLL*DFFK 
FIPQFFYWMFYFDYI FISRRNFYFLLLT 
HFFTLPLFLVIRIQCLHLFLFFSFFFFF 
FGKGNPFLPPGWRARAQFWVNGSPPLRV 
NAT.T.P 


4316 


18217 


A 


4341 


376 


3 


LELREGGFIiPHIADKVREERS PALLDDR 
urns fW2r*Di? Tnvn /pdttqmt.tpt.arwm* t 

CEHCLEQCFSTSL* PIEIRTLH*D*GCV 
CM(^YVCTCVCVHMCMCAYICICVARMA 
KRPLES IHTSCT 


4317 


18218 


A 


4342 


158 


2 


LAFFFFCETQSCCQWHDLSSLQPLPQRF 
R*FSCLSLPSSWDKKHTSPHPTCI 


4318 


18219 


A 


4343 


145 


2 


I PGEQ WFDYMNKFFSGDF * DFGASLTQ 
AVHTVPNV* SVI PCHPPRV 


4319 


18220 


A 


4344 


1 


434 


KbijlrKAiAXCiXKX !UUiUJJUi\JjiriwJ\Ko 
Sa^IiPKGWDYRHEPPCLARFIFWQIFI 
AYKLR* KHYKSRQRSLPSRGSC* CACDT 
FVSDPSHWGTMAQ I LGGWITLGQVQRSC 
LADQPLLPLLPPTLAPCLP *NDVLLFFF 
TQSQ 


4320 


18221 


A 


4345 


4 


475 


TfWQHP ^T.TTHT.nP PT .PP2\T/'JRT , r r PVRA'RP 

LLLTOIIC^GGLILRMGAINRCLKHPIjA 
RDTPVCLIiAVT ^^QW.qfiy.qpT.T .NHT .T.QG 
LPGLVRAGRGRREAGSKDGGS CLGEAGS 
GIPVCGDKEPTN* CSPFSPLQESGEGGR 
PRGGBASLQGCRWRANG 


4321 


18222 


A 


4346 


313 


442 


DSTLNF FF FF*TE S YFVAQAGVQGHNLS 
StiKPPPPGFKRFSCL 


4322 


i (Km 




A1 AH 

434/ 


43 / 


33o 


AP VOR F YGD YT AVN PHT.FSLN T URCCOG 

RNWDPAQj^RTTQGLTALIitiSLKKCPMI 

RYQLSSEAAKRLAECVKQVI TKEYELFB 

FRRTEVP PLLLI LDRCDDAITPLLNQWT 

YQAMVHELIX5IimiWIDLSRVPGCMRPL 

* P THPRTT iR RS TRT .T .TfiVTP <? WTM Zi 
t\.XXi-IT\3 i. Iiiriiwxivuijj; rJrOOOif JLi lr\ 


4323 


18224 


A 


4348 


409 


118 


GLHDASFVAAIPSTQWYFFFEETASEF 
DFFEIUiHTSRVARVCKNDVGGEKLLQKK 
WTTFLKAQLLCTQ PGQL P FNVI RKAVLL 
PADSPTAPHIYAVFTSQW*AAGPSGRWS 
TSSAAAFRRPRHSCRL 


4324 


18225 


A 


4349 


2 


418 


YTLPDFPHHPHRLSHCRLLQSRTASTYQ 
SPGACTPISQGHPGORSSPCSSSSNPAC 
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nucle- 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidtne, I=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N«Asparagine, PHProline, 
v viuuiiiiiiic, iv— Arginine, o— oerine, 
Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 

Y=l Tnl^nnWtl *— Q>r>rt t*nAnwt 1 — nnocSklA 
j\. isuiviiu wii^ — -oiop COQOII, /— *p055IOie 

nucleotide deletion, ^possible 

niidenf iH<* i n cprf inn 
uuuduuut iiioci uvu 














*SSSSCGSTGFRCPGSAHPGCCSYVIi*M 
GR*R5SOAPRAS P I AOfiPP GAPAfiP anw 

PALPSPLP*GAPPPPSAASSPGL*SPA 


4325 


18226 


A 


4350 


401 


276 


DPLNVGGGGYS * PRLCHCSPTWVTEGDL 
VSKTNTHTNKKHL 


4326 


18227 


A 


4351 


446 


277 


AQDFVLIKDGBVTGGICFRMFPSQGFTE 


4327 


18228 


A 


4352 


408 


3 


PKSSLFTSRPNGKSWSTKRPAISLAYSN 
ENAQGSWNGDQDAGPLLVENPRDYSTEL 

RQ* PLCQLSPHRGIGASBLGHAPQ*ELP 
. ALLLGPIRHBCEAGPPHDTLPV 


4328 


18229 


A 


4353 


62 


280 


VRIGCLTS VPEASSCGLPTTPGCC* PLG 
TLPCRSRSRTRSFRSRSRTQSRSRPKTY 

QPflPP PP QP Qp c DTOD 


4329 


18230 


A 


4354 


364 


3 


PVGEEGKSLVPKSPVEEKGKSPVSQSPV 
EEKAQYRVPKS PVEEAKSKAEVGKGDQK. 
EE * EKEVKQAPKEDKVEKKEEKPKHVPB 
KKKAESPVKEEAVAEWTIAKSVKVQLE 
KETNEECI 


4330 


18231 


A 


4355 


3 


558 


LiGPTLLWRRGKVPKANSPIPTSLRRTLP 
iCAo FC»i*bKk3Piir WRQAGSTPMTIjPRSTP 
SMSPML*MVWPPTAVPVP*KPLMWAPPA 
PLVLLVTILTE IQEPATPAPLTQF* KPT 
SLMVSRPVCPWQGPRTTRSTLCATTIA 
foxlft. 1 i-iK.Jrvj.Uo ill or-UWQTJLiS LUjL»EGQ 

ASTiPKG*NTS ITLPSVS 


4331 


18232 


A 


4357 


258 


1 


LGFHDPGPAGWSQPHGGRSAQAQ* P PQP 

OT\7 N DDTCT.7iT J vX/'GTMM3T DD/vnoonr«nin 
tr± V r* K. I oLiALI V o UJUr JjJr Jr vjlxtio iCirvriHP 

FRLPHTSFSTPGLSAVCLIVHPAARVSR 
CI 


4332 


1R7V* 




tuo 


0/7 




riPnT-rtnwv^iMV^\7unr»\/iTOt>ooxxoT tdo 
uxvujjvjjjis xVjn viioL/ V r KJr oorlFJj J. r o 

SLIGSGCQIFAVSLIVI IVAMIEHLYTE 

♦VPTLNCIMSLLFLNSFNHWNLLSQS 


4333 




A. 






Z 


♦ARDDEDEDDDDDDDDEDEEEDDEDEDD 
DDCI 


4334 




A 




oo 


z 


GSRTDTG*RGSRTDTGGRGSRTGTGGRG 
SRTGTGGV 


4335 


18236 


A 


4361 


2 


250 


ELRDEGkASSAKQRliKCAGLQKFGERAF 
KAWAVARLS QR F P KAEFAE VS KL VTDLT 
KVHTECCHGDLFGCDEESAGLAKYIM 


4336 


18237 


A 


4362 


169 


456 


EQLLCAGQGQLS CEPLHL PS CP VGPQTP 
AHRPGRMAPGPPSFLSPGSPCGLGLAIP 
CSCQPWVAGVPDHHLFPSPMGRAGLSPS 
ERSAGEGVT PWP 


4337 


18238 


A 


*TJOJ 


1 
1 


dlQ 


KSDWLS FTLE I VIMEVQGLKS VAPNRI 
VYCTMEVEGEKLQTDQAEASRPQWGTQG 
DFTTTHPRPVVKVKLFTESTGVIiALEDK 
EI/3RVILYPTSNS SKSAELHRMWPKN 


4338 


18239 


A 


4364 


375 


448 


. PGAVAHACNPSTLGGRSGRIMRAG 


4339 


18240 


A 


4365 


1 


444 


DFLTNHYLHFLRIAGSQLTGLGTAVQLY 
SAYEENNRTFLIiAAVKRNHNQYVNPSGV 
ATFFES I KEILLRQSGVKVBSVDHDSCV 
HGPCQNGRSCLRKLAVSSVLKSRESLPV 
IIVANEPLQPFLCKCLPGYSDSWCEIDI 
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nucleotide deletion, V=possible 
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DEWLPFPW 


4340 


18241 


A 


4366 


52 


402 


CPQEVWFHSFCHCTSAFMKDLYFLCLVR 
PXSDLKEDKVTRHDGASSDGHLiAHI FRHA 
AKELFNEDVEEVTYRALRCGAVS TACL C 
LWLARVHGVPVWI CAOGGSMDGVYTSQN 
QKDFF 


4341 


18242 


A 


4367 


119 


247 


WVLLS FQLHGVAVLGIALI AMGEEIGAE 
MALRTFGHLLRYGE P 


4342 


18243 


A 


4368 


2 


430 


GGT WTX3QPGFHGDS IiSKAPGMNSLEQG 
MVGLNIGDVSSSAVKTVGSWSSVALTG 
VLSGNGGTNVNMPVSKPTSWAAIASKPA 
KPQPIMKTKSGLPPSPIKHNMDIGTWDN 
KGPVPKAPVPQQAPSTQACPQSQQVGQT 
LP 


4343 


18244 


A 


4369 


I 


406 


ETSTPEGEAGPIQRIiDIPVENPVESKNI 
FLGAPLI ICHVTDKRSPLYDI SATDLAN 
QDLEVIVILEGLGETTGISTRARTFYIA 

i iriT Ar.irti Tr^ ntt /i t taint inAtiT>r\nti ■ ^tmrmn 

EEIQWGHRFVS IETEEQRPDS VDYSKFG 
KTDLVTTSRCNARELDEKPSILI 


4344 


18245 


A 


4370 


2 


284 


GTGTLCDLTALLSARYDGVRTCILPCWK 
TISTIPVAIiLIHYHQNQASCGKRAIILE 
TRQHRLFCADPKEQWVKDAKPHLDRQAA 
ALTRNGGTFE 


4345 


18246 


A 


4371 


3 


184 


EDYNILLSIMKGTTRPINKEIEALNDTV 
DQLELTDLCRTLT PT IAE YSF FS SAP KV 
LIQP 


4346 


18247 


A 


4372 


44 


304 


GLPRS IGCQGLPRLPEWPTWVGGCHRGL 
IMPSVPTQVSERPLMFLLDTPGVLAPRI 
ESVBTGLKLALCGEPGIjGLGLGPGPLPP 
HL 


4347 


18248 


A 


4373 


1 


422 


LHLFNPAASAPSRSLFSGPILDPLSRAL 
GVGPGICGLASSPGISEGWDQIRSWTHP 

pdpdhlsgfcrsqvymhpsls ppstmil 
sggtalkppysafpgmqplemvkpqsgs 

PYQPMSGNQALVYEGQLSQAAGIiSASQM 


4348 


18249 


A 


4374 


2 


423 


nsgthtpglledls kngrlpe I klpvng 
csdledsftilqskdlkqeplddptcid 

TSETSLSNQNKLFSDINIiNDQEWQELID 

elantvpeddiqdlfnedfebkkepefs 
qpatetplsqesasvksdpshspfahvs 


4349 


18250 


A 


4375 


292 


429 


LKISVFPSSTEISLQQRLEYIARAILTA 
KSSTGIAS I SADGESLRE 


4350 


18251 


A 


4376 


544 


32 


ALHACACLGLSPEACFLHLLQLTGTSPG 
P VDGQGLDNQGFRIJjKAI LGVS PAPEPV 
HGASEHNADTGVS VARACVLPTS PTLAR 
TAAAKDKS P PLAGIjTMGPSREPGGSCLP 
LPSGLLSSPTRLALPRRPPSVSVWQDAG 
SSGSMPTEPLVQCGHRFYGTLVSWAVMV 
LK 


4351 


18252 


A 


4377 


378 


232 


PAKR^EELLLHDTRCWLNGGAMPEARH 
PRTGASALHVAAAKGYI EVMR 


4352 


18253 


A 


4378 


291 


1 


QMLSVD VAS RYRAPST YVLNS LKEGMDG 
iiHtriib L-bb FLUBPS VAMNMQTAGIjEMGI 
CDGHFRQNVGOGYVLKPDFLRDIQSSLH 
PEKPISPFKGQSL 


4353 


18254 


A 


4379 


171 


458 


RGPLFLTLHIVNSLQGNNFRREYIVTQG 
PLPGTKDDFWKMVWEQNVHNIVMVTQCV 
EKGRVSKQLS WHQS FAFCHLDRI LPHHQ 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
O ag Gluta m ine. R—Ar^inin** Q = <jp. r ir»t» 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














SETPQNNVGIM 


4354 


18255 


A 


4380 


214 


410 


LDHLPTHPSPTGE I E VKWFEAVQTGLPM 
CILGAFFGPIRLGAQSLQVLDS ELI PWA 
VQNGRIAPC 


4355 


18256 


A 


4381 


2 


74 


IGDSGVGKSNLLSRLTRNEFNLGL 


4356 


18257 


A 


4382 


290 


119 


IATVSLNIVKMPRLPKAIYRFNAIPIKI 
PMTFFTEIDKTILKCMWNHKI I PGNSPA 
H 


4357 


18258 


A 


4383 


2 


423 


LERVCWIKDI WAVTGENMEVMKSI IQK 
YQHKR I S LVEAGVTRHRS I FNGLKALAE 
DQINSKLSKPEWI IHDAVRPFVEEGVL 
LKWTAAKEHGAAGAIRPLVSTWSPSA 
DGCLD YS LERARHRASEMPO AFLFDV T Y 


4358 


18259 


A 


4384 


356 


1 


WYPCCLIiASLI CVFRGLFYLSIiCLVLRV 
PVPLCLQFPGS LLGSLLLFSLLRQGLAL 
WPRLECSSAIRAHCSLELGSNDSPASAS 
QSAGITGMSQCAWPLSLLSVS PRVPGLS 
SVPLIF 


4359 


18260 


A 


4385 


1 


125 


SGSKKTLRSSIiEEPTILDIEKFHRESFF 
YTHIiINFSGKRYC 


4360 


18261 


A 


4386 


53 


215 


WQLYWTTWCQKISRNQGLAHVPRMECSS 
MIIAHCNLKLLGSSDPPASASOIDPQ 


4361 


18262 


A 


4387 


2 


390 


PSLAARVLARGYGNFSVFVLWLRKNDRM 
HGECAPNVSVAVSTSHTTI IGGGIRGGG 
GGGYGSGGSSYGSGGGSYGSGGGDGGGR 
GSYGSGGGNYGSGCAGSGHGSYCSGSSS 
GGYRRGSGGFGSOTSGG 


4362 


18263 


A 


4388 


2 


441 


TLQIiAFAINQEPSDAKMLQMVLKGSEGA 
TVNQGPLEVAQVFLAEIPADPKLYRHHN 
KLRLCFNE FIMRCGEAVEKMICRIiITVnO 
MEYQQELKKNYNKLKENLRPMIERKIPE 
IiVKPIF 


4363 


18264 


A 


4390 


242 


24 


KLGNFLGGCVKES P FVPOGGMO WPD WGS 
LQHPPSRFKQFSCPTLPGNWNFRNAPLR 
SGKFFFS PVFLVETGY 


4364 


18265 


A 


4391 


1 


244 


VDQQMRQNIiFTTGAHHLQQANIQFRTDI 
ARTEYLSNADERLRWQASSLPADDIiCTE 
DAIMLKRFTRYELGCQGEAWEECX5W 


4365 


18266 


A 


4392 


2 


146 


LDLLS QPCRAVYI FAKKNDI PFELRI VD 
LIKGRSSLGFGEPKSQEGDR 


4366 


18267 


A 


4393 


204 


489 


GAASEHP I PKGESHGTSKAS SLCADAGP 
AGHFASVYNPLAWTVTTrVTLTVGFPGV 
RVTDEAGHPVPSQVSGIQRCLQGMKAPP 
VTVTVSGEKGS 


4367 


18268 


A 


4394 


2 


171 


LSDGFQPSRSIIFASWSAGDFGSVGATE 
WLEVFFI I PSHIGHELVGLGCCPEVMFL 


4368 


18269 


A 


4395 


2 


197 


WLELLKFYTLDCALEEYVICVRIQDILT 
RENNNWPKRRIAI EGEMI CCI SFVEYFI 
CATHYL I QC 


4369 

• 


18270 


A 


4396 


2 


285 


TVDFPFGPYKCGOEYMTKVLECLOOKUN 
GILES PTGTGKTLCLLCTTLAWREHLRD 
GISARKIAERAQGELFPDRALSSWGNAA 
AAAGDPIGPS 


4370 


18271 


A 


4397 


366 


461 


TSNFS CSL VLEEEASD YLELDT I KNLVK 
KYSQ 


4371 


18272 


A 


4399 


157 


3 


DVLRNNFARS AVL YL F I DRVTLWNAPRL 
ECSGAI IAHCSLKLLGSNDPPTS 
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peptide 


Amino acid sequence (A- Alanine 
C=Cysteine» D=Aspartic Acid, 
E=Glutamic Acid, F=PbenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucrne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 

0=CwI ii tannine R=sA roininf* C=lQartno 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Uo known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 

UUWIIwUUC IIIoCI HUH 


4372 


18273 


A 


4400 


2 


343 


DIDFKYHFDFFSVNFNEELVALYGGSLQ 
KQTKFVHECIKTILKLYKGQEFAPKSVA 
I IGHSMGGLVARALLTLKNFKHDLINLL 
ITQATPHVAPVMPLDRFITGSTGWLKDH 
6 


4373 


18274 


A 


4401 


?14 


410 


HGELSPVTDFRBS CCRQYEMGECTRGGF 
CNFMHLRPISONIiORO 


4374 


18275 


A 


4402 


3 


227 


LTQVS PQMTGHAGLNTAQAGGMAKVSEL 
KHFQYFLPNRGLSLRVMRKMGPRGPLLF 
FLLGSPRATHAWKRQISF 


4375 


18276 


A 


4403 


158 


313 


NRDE FCHVAQAGIiELLGSSS PPAATSQS 
ACITGVSYHVRPTLSNKFGCSSIi 


4376 


18277 


A 


4404 


3 


628 


HCIREGGQDVPSNKDVTSLDWNSEGTLL 
ATGSYDGFARIWTKDGNLASTLGQHKGP 
T PAT.TTWTTir^niWTT.QanvnTrTTT Twnaw 

X C HLirtn W MVwCiN r 4.XiOrtuVUA.i ill WU/Vtl 

TGBAKQQFPFHSAPALDVDWQSNNTFAS 

NAIKWDP TGNLLAS CSDDMTLKI WSMKQ 
DNCVHDLQAHNKE I YTIKWSPTGPGTNN 
PNANLMLASASL 


4377 


18278 


A 


4405 


35 


180 


MCVDYHYFFCLLGPNNNNPQTSAVRTPT 
QTNGSNVPFKPRGREFSFGK 


4178 

*rJ lO 


ioz,/ y 






i 


'Wo 


DQTYILYIIiYVIKGPSWDTNLSGQNGVT 
VQNLLGELYGKTGLNQEWGLIPYISSLF 
RKKVDVLAEACTOLLSDQKQLTVGLSDE 
PREKIIYAPLPPKELTKLIYEASGQDII 
IGVLTQ 


417Q 






AAIY7 


i 
i 


AO ft 


PSGIIKVFTSNPTPAVLCFRVKNISRLE 
QILPNAQLVFSDPSQCDSNTKDFWMNMQ 
AVTVYTjKKT .SEONP AA5? YYNVDVT iTCVOV 

SLNGIQSTPLISrLATYWKCSASTTDLRVD 
YK 


4380 


18281 


A 


4408 


174 


383 


KNSWPGAVAHACNPSTLGGKGRQITRSIi 
TGLHDLYLTTNLT 


4381 


18282 


A 


4409 


i 
i 


fcOO 


LMAEKDSLDPS FTHAMOLLTAGKGASRV 
PLGRPAI PGMSGPGFTOLASRPLGTEAV 
GSSGLILIFGSLDSTFPPQPGVRGRISR 
LQWRPEGDFAIP 


4382 


18283 


A 


4410 


292 


422 


LVHSSCFQVSSKILELARKQRMNTDIRR 
NT rcTIMTSEDFLDA 


4383 


18284 


A 


4411 


412 


238 


FFSRHRVSQWDQAGLEIiLTLSDLSALAS 
HSAEITGLNHHAQPLTTNFNKYCWISLQ 
VH 


4384 


18285 


A 


4412 


1 


405 


VTSSDKS LlWUMI&VKSrFtmiKNEKFS 
NVFGFLSOKARNLOAOYDRRRGMD I fTOM 
KNFVSQELKGLKQEHRIiLSLHIGACESI 
MKKKTKQDFQELIKTEHAIjIiBGFNIRES 
TS YIEEHIDRQVS P IESLRLMCIi 


4385 


18286 


A 


4414 


2 


149 


GFHHVGQAGLEVLTSDNLPVSASQSAGI 
TGVSHCAQltRSDGFKKRS SPA 


4386 


18287 


A 


4415 


1 


366 


DICHRKLGVECPSHASINRLWQWSSI 
TAS LRFEGP LNVDL I E FQTNLVP YPRIH 
FPMTAFAP IVSADKAYHEQFSVSDITTA 
CFESSNQLVKCDPRLGKYMACCLLYRGD 
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Amino acid sequence (A=A)antne 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
€=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Vr^vviutamine, K— Argimne, S=Senne, 
i— inreonine, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known , *=Stop codon, /= possible 
nucleotide deletion, \=possible 
nucieoiiae insertion 














WPKEVNAAI 


4387 


IOAOO 


A 


AA 1< 


i 
t 


104 


cck.kv~c.fio x vicj^u 1 UJLi J. Vf IbVL 1 Al il iAK 

GITWVLVYRTDPFGTLAGSEGPGGRY 


4388 


18289 


A 


4417 


2 


141 


DIAIPCNNKGAHSVGLMWWMLAREVLRM 
RGTISREHPWDLTSSVMG 




1 OxVU 




A A 1 Q 


IT/ 

136 


1 


EQTKAFGQLHKVLGMD P LPS km pkkpkn 
ENPVDYTVQI PPRDPMQ 


4390 


18291 


A 


4419 


I 


280 


GNYCKYYGYRGPSCEDGRLRVLKPEWFR 
GRDVLYLGCNVGHLTLS IACKRGPSRMV 
GLDIDSRLIHSARQNIRHYLSEELRLLP 
QTLEGDPGA 






A 


4420 


3 


446 


WSTSQVRQNYHRDSQAAIIIRQISLELY 

ASYVYLSMPYYFDRDDVALKNFAKYFLH 

QSHEDRDHADKLMKLQNQRGGRSIiLQDM 

RKPDCTDWESALNAMEWALHLDKIW 

IxLELHKLTTDKNDPDLSDYIETHYLNEQ 

VTAINELG 


4392 


18293 


A 


4421 


6 


182 


DPSFCSEPRSCHCTPTWRTEQDSISEKK 
KKRNILKNWAKNYITLESEKERKSKQRS 
RR 




1 OOfl/l 

lozV4 


TV 

A 


4422 


1 


456 


GP I SYWVMPKKRQAIiVB FEDVLGACNA 
VNYAADNQIYIAGHPAFVNYSTSQKISR 
LGDSDDSRSVNSVLLFTILNPrYSITTD 
VLYTI CNPCGP VQRI VI FRKNGVQAMVE 
FTDSVQSAQRAKASLNGADIYSGCCTLKI 
EYAKPTRLNGSK 


A1QA 






4423 


2 


257 


FHHVGQAGLKLLISVDPPALDSQSAEIT 
GVSHCAQPVLCILNQLCAKAVLSPDSHP 
YPPKSGRHEGQVREQKMSPCSYIYFKRT 
V 


4395 


18296 


A 


4424 


106 


259 


CCCFKYFFS1WKICFYRDLKPENILLNE 
DMHIQITDPGTAKVLSPESKQGV 


4396 


18297 


A 


4425 


3 


364 


MDEIEKYQEVEEDQDPSCPRLSRELLDE 
KEPEVLQDSLDRCYSTPSGYLELPDLGQ 
P YSSAVYS LEEQ YLGLALDVDSEYLTVK 
VISLHLVFQIGVIFLFQVPLTDPRDVIA 
AGRTYGRI 


4397 


18298 


A 


4426 


2 


241 


HHVWLICFVFFVCVETGFCHVAQVGLER 
LGSSDLPTSVSLSAGITDVNHWAQSKPS 
PSKGTGMNRP IGLRVGKGVWSRL 




lazy?? 




/MOT 


z 


A1Q i 

4Jo 


1 KijaGFNCPGVMNPGKCKMGIMPGHIHK 
KGRIGIVSRSGTLTYEAVHQTTQVGLGQ 
SLCVGIGGDPFNGTDFIDCLEI FLNDSA 
TEG 1 1 L I AE IGGNAQEN AAE FL KQHNS S 
PNSKPVESFIAALTSSTGRKMGHAGQLL 


4399 


18300 


A 


4428 


34 ! 


427 


AGMLPAVGSADEEEDPAEEDCPELVPME 
TTHSEEEEKSGMAKI PVTIITGYLSAG 
KTTIiLNYILTEQHSKRVAVILNEFGEGS 
AUlsro IjAVS FGGEIj iEERLEL IKACLCC 
SVKGNGLIAIENLMRQKGE 


4400 


18301 


A 


4429 


330 


8 


YSDRISHWIDMCPSKYIFRGPVHPPHIS 
TFLHTAHNLVARGNYENMS PHDEEKGAY 
QRKEKCCDDPMAGAARIHFLCFFEMGFR 
SVSQAGVPWPHITSMQPQPPRIK 


4401 


18302 


A 


4430 


446 


222 


LTSSDPPASAPQSAGITGVSHRPQPIWG 
VS IQL YFSGCPDTPGQGRLTREHCDGQD 
GEQGGPCSESVAQFCHDWS 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
B=Glutamic Acid, F=Pheny (alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, MNMethionine, 
N=Asparagine, P=Proline, 

0=r21ntaininf> R= A rninino C— Coring 

V£-— \jiu utitjiiic, rv rirgimiic, o — oermv* 
T=Threonine, V=Valine, 
W-Tryptopban, Y=Tyrosine, 

A — UUIUIUWII, OtUp COilUU, / — pusoiuic 

nucleotide deletion, \=possib!e 
nucleotide insertion 


4402 


18303 


A 


4431 


3 


*>A "7 

243 


z\p im q ani7\7<? wppd FAfwrn .t? n t?<?t ,nx ,ra 

/UvEiXJOinlJXr VOX: JC rUV V Hi Ht\t.JT £>±jUUE>ri. 

DGQPLTPGTSQKDNNFILPRLCIRKFTS 


4403 


18304 


A 


4432 


433 


0 


PPPSPPPPPPPPPPPPPPPPPPPPPPPP 

PPPPPPPPPPPPPPPPPPPPPPPPPPPP 

PPPPPPPPPPPPPPPSPSSPSSSS 


4404 


18305 


A 


4433 


25 


155 


&PUTVH PCTir .PT.P rCT DZV WfcTP PTT PVWKTf 

KKKVQQQGGWPGGLTPVSYRFGRLT 


4405 


18306 


A 


4434 


2 


329 


VRD I KEKLCYVAVDFEQMATVE FSSS L 
EECSYELPDGQVITIGNERFRCPEAtilQP 
SFIX3MES03MHETTLNSIMKCDVDIRKD 

Ulnu 1 YUXuOl XrJX £r\JX^ilJ<wl£lXNXaX 


4406 


18307 


A 


4435 


2 


374 


WVTFISLLFLFSSADSRGGFRRDAHTSE 
VAHRLKDLPEKNFKALLLIAFAHYLHQC 
PFEDHVEIJJNEVTEFAKTCVADESAENC 
IX^SIjOTLFGDTLCTVATIjRETYGDMADC 


4407 


18308 


A 


4436 


2 


352 


KVPvRIDSRLAELDREIRNPQISCQLIYS 
PSLFLGAFFLFFFFGFFFPPPYIiIiAPKN 
QFNPPPLCTGGPVPRPKPPIPPWEPPRP 
APPLFSPQWDLGFQIAPFLWVINGGGPS 


4408 


18309 


A 


4437 


2 


369 


QSCETLFHSWKDVEVCSSALSCLSQVSV 
HliQGLESLFILPGMEVEQRDSQMALVES 

T.JTVT7P/3P T Q JTA MZ\ mJTTWTfTWT. T .TQntJH 

GGNQMLDEGFRBDFSEQMEIAIRAILCA 
IQNLEERKNE 


4409 


18310 


A 


4438 


2 


231 


SDTSRPQLPCPAARERFPDGPLSLRPLP 
FFLSQVYNEQIHDLLEPKGPLAIREDPD 
vr'wnrnPT ciPun\/uriWAP\7P 

I\JL» V V VyvjJUo r XlU V WUWAKVVj 


4410 


18311 


A 


4439 


1 


365 


MAHAMEEVKKCLGPDMMDDICHEQFLEL 
S YLNGVPE P SRGRGVP VRGRRAAP PPPP 
VPRGRGVGP PRWAIiVRGTPVRGAI TRGA 
TVTRGVPPPPTVRGAPAPRALTAVMQRI 

Xr XjOxt Xt riii XV 


4411 


18312 


A 


4440 


92 


510 


LAALPDGGQELHI PRSRAGPAGELS WAV 
VGTGALSSLSILFQMARCRPWPPLRPNS 
PLQGWIFNWPWLVAIPASLFSGFLSDH 
T.TwnfJYpa ttvr kt jwnt^MRTdfJT,*; vfat. 

illliyw X IUiJL X V x\ I\. i^VjI «3 V FiAJJ 

CLGHTSSFCESWFASASIGLQTFNHR 


4412 


18313 


A 


4441 


2 


266 


FPPHLPELGKWECPWHQCDECSSAAVSF 
CEFSPHSFCKDHEKGALVPSALEGRLCC 

EVKE 


4413 


18314 


A 


4442 


2 


279 


KGAELVFLPSNTRSLMYPIiDQGVVRTFK 
AHYAGYSMERIVSAMEENPGREKIMKKL 
LKLSSPKEIPAGEKLCPNDAHDFTELDD 

A V inDHflri 


4414 


18315 


A 


4443 


2 


371 


KWVTFISLLFLFSATDSRGVSRLDAHTS 
EVAHFJETOiySEENFKALEAIAFAQYLQP 
WPFEHHVEIiGNDVTESAKTCGADESADN 
CD KSLHTLFGOTLCWATLRETYGEMAD 
CCSQPEPCEKG 


4415 


18316 


A 


4444 


2 


372 


WVTF I S LL YL, FS S G YS RGVFRRDAHKS E 
VAHRFKDLGQENFKALVL IAFAQ YLQQG 
S !^DHVrUil4NEVTEFAETCVADES SDDC 
DKSMTLFGDTlXITVAiTaRF/rYGH^Arc 
CAQQEPERNEC 
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SEQ ID 
NO: of 
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SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, MHMethionine, 
N=Asparagine, P=Proline, 
V£— vriuLdinine, i\— Argimne, o-^oenne, 
Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nuueuiiuc aeiciiun, \ — posSlDie 
nucleotide insertion 


4416 


18317 


A 


4445 


1 


283 


FRKNAEDILTOBGLRATMKHELEAAQKK 
noun b jjJLUC J. triN ± V^JVKJ. W JJjSs r VK5AGVC 

WRSTPDPVOjGITSGGCRTAQI pacsfl 


4417 


18318 


A 


4446 


173 


2 


KKKKC^YPBGHGVFHALVCVCVCVCVC 

VCVCIQVNPRNPCFISWINLSVQVSLLD 
Y 


4418 


18319 


A 


4447 


104 


221 


S CHLHYSIjGDKSETLS knhhyhhhhhnh 


4419 


18320 


A 


4448 


241 


1 


WVSWDKAPSWVRGLGSGSRGGVWGGGSP 

PHP\7TnVJ2 PQ PT .PTV2P CT7KJBMP 3V VFTYD n 

LCKPVSAFGAAGATVGPQGLGGRC 


4420 


18321 


A 


4449 


229 


349 


WKRSHSFGLDWAHACNPSTFGGQGGRI 
MRSGVQDQPGQY 


4421 






A ASCI 


/O 


1 BO 


SHWWSRFQKV 


4422 


18323 


A 


4451 


40 


359 


SGEPLFPFFFSLARIiI PGVCFVEMHTRV 
RLLIGIiKIWEKKVPQVSTPTLVBVSRNL 
RKVGS KCCKHPEAKRMPCABHYLS WLN 

yi-i^ V Lin BtJS. i f VoUKV 1 J. e»£> 


4423 


18324 


A 


4452 


2 


371 


KFQN AIiLVRYTKKVPQGSTPT I VKVSRK 
LGKVGI KCCKHSEAKGMPCAEDYIiS VVL 
NU V C V iiHEKTEAS DRVTKCCTB SLVNRR 
PCFSALEDYETYVPKECNABTFTFHADI 
CTLYENERLFQ 


4424 


18325 


A 


4453 


118 


369 


GRYILLLKTKKQTANNNIKTPQYLSNMS 
K3CFRHSEFFFFYLKWSLALQRCNIjGSIjQ 


4425 


18326 


A 


4454 

i 


364 


3 


GHLSLQRLLLPFVWLCPAPRGGAYRGRQ 
ASr^CGGLHPVRASRLLCLPNQACTMAG 
APPPASLLPCSLISDCCASKQRDSVGVG 
PSEPCAGYNLLVCRFLSPSEKRSPSAGV 

MP PC P PPT. 


4426 


18327 


A 


4455 


1 


379 


AAFTBCCQAADRAACLLPKLDELRDEGK 
ASSARQRLKCASLQKFGERAFKAWAVAR 
LSQRFPKAEFAEVAKLVTDLTKVHTECC 
HGDLLECMDRSDLAKYICENQDSISSK 

T .PPfVPlTDTiT J7TTGTJ 


4427 


18328 


A, 


*r*t-JO 


i 
i 


JUi 


T ■P PAP «5 T .VP PMP a pf^ncriT .pnT.TTT .T.wav 
CQGDGVSYEETTI PRPSAYHNLFGLPLI 
SRREAEVVLTSRELDSLALNQSTGLPTL 
TLPRGTTCLP PALLP YLROFP P T WWTn 
DDLRSWEA 


4428 


18329 


A 


4457 


250 


330 


EWWTHLWLNEGFASWIEYVCVDHCFPE 


4429 


18330 


A 


4459 


96 


319 


YLTI PFSQAMENCKDARLTTS I GAYNFN 

nj\Lil lain J-^iIvtrV3Jj 1 X Ajt V LT<U v pivLlwub 

S FLFFMPLFLS YCQVS IH 


4430 


18331 


A 


4460 


50 


339 


NSQTLKLSNVQCLKTRALiFFFPKGI YPQ 
IKTLPPCPTSGPDPYVRVNLLPKGKGPC 
LKKISGNRKTLKPLFDKTLKFLVPMEKV 

C AD £r J-lii Vr V IMM X 


4431 


18332 


A 


4461 


2 


357 


GYDGVKRWTRDGD I FN KELLL I P I HLEV 
HWSLISVDVRRRTITYFDSQRTLNRRCP 
KHIAKYLQAEAVKKDRLDFHQGWKGYFK 
MYCKHXALSQPFS FTQQDMPKLRRQ I YK 
ELCHCKF 


4432 


18333 


A 


4462 


182 


325 


IRLSSWLCDCASCQVYEFRVKESSI IAP 
APAEDVDTPPRKKKRKHRL 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

oullDu dUU 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

residue of 

LfCgJUllC 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eacine, 
K=Lysine, L=Leucine, M=Methionine, . 
N=Asparagine, P=Proline, 

T=Threonine, V=Valine, 

W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stop codon, /^possible 

UUUCUUUC UclcLlUU, \ — |HKK>IUlC 

nucleotide insertion 


4433 


18334 


A 


4463 


1 


338 


SRHSPGPTPQPDCSLRTGQRSVQVSDTS 
SCSQLSSSSGSSSSSVAPAAGTWVLQAS 
QCSLTKACRQPPIVFLPKLVYDMVESTD 
SSGLPKAAS TjTjPSPS VMW AS <5 PR PT J .<? K 


4434 


18335 


A 


4464 


2 


343 


IIDPFLGAYLKDEVLKIMPVQKQTRAGQ 
RTRPKAFVAIGDYNGHVGLGVKCSKEVA 
TAIRGAI ILAKXiS IVPVRRGYWGNKIGK 

PHTVPCKVTGRCGSVLVRLI PAPRGTGI 

uc 
wo 


4435 


18336 


A 


4465 


I 


371 


INSTLKMS FVGENSGVKMGS EDWEKDE P 
QCCLEEIFPALASSLSSASAEKGSPILI* 

LMKLAAQKESARRPFIFYRAQVGSWNML 
ESAAHPGWPIC 


4436 


18337 


A 


4466 


246 


12 


LCFTPLPSYILFFFETKSHSASPKLECS 
GVI TATSEVI LLGGGGCS ELRS CYCT PT 

WUTlTT.nci TTTT1TPP TTRlTPimT.V' 


4437 


18338 


A 


4467 


2 


355 


WVTFISLLPIiFSSAYSRGVFRRDAHTSE 
VAHRFKDLGEENFKALALIAFAQYLQQC 
PFEDHVKLMJEVTEFAKTCVADESAENC 
DKSLHTLFGDKLCTVATLRETYGEMADC 


4438 


18339 


A 


4468 


2 


361 


EDHVKLVNEVTEFGKTCVADESAENCDK 
SLHTLFGDKLCSVATIRETYGEMADCCA 
KHEPERNECFLQHKDDNPNLPRLVRPEV 
DEMCTAFHDTDETFLKKYLYEIARRHPY 


4439 


18340 


A 


4469 


1 


373 


SSQPSSPSESVSGTVSVSPSSLPSSPCL 

SPRPHPSVSHAHPHRASALVNCPWGPAP 
TS AP VS F YTTPOS PALTCTGOAS PHPHPTTP 
FPTPSPMQSQKK 


44 Aft 


1 OOtk 


A 


447ft 




J/1 


WSAWQSQLTAAVTSQA 


4441 


18342 


A 


4471 


345 


201 


AWMTAHIiFMAWFTEYFKSTAQKKKFVFK 
TLLLIDNFDPLWDSKEGKL 






j± 




i 


tAA 


ITGLNHHTWPFSVI LTYTYY 


4443 


18344 


A 


4473 


3 


333 


KNHLEASIGEHPFFPDHSPSCPHLLSSS 
NYH5LSCFLGGFFSETGSAPSPRLECSG 
VlAAHCSLKLl^STSPLTSASivVAGTTC 
MCWLYSPAVSPRPSPDLTCTSQLENI 


4444 


1 OJ*t J 


A 


4474 


^oo 


i 
i 


TSAVNRIS IKSQADQEATEQKLRDYSHQ 
GIQILAWQKFE YKPGKVTMDPEDCI LQL 
AKKKEEEEEEEEEEEEBBEEEEEEEEED 
QEEEEEEGEEEE 


4445 


18346 


A 


4475 


3 


124 


PRLECSGMIMAHCSLDLLDSSDPPTLAS 
QVAGTTKKISLH 


4446 


18347 


A 


4476 


196 


2 


SCYEHPGRIVYGFLFLRQDLAVTQAGVP 
WHBHGSLQPLPPELKRFSGTSHLLSSQG 
YRRAQPRR 


4447 


18348 


A 


4477 




IOa 


lxLERNKK LEGS LOOMYSTNF FOUDF T 1? V 
RAXHVLAiiTPRGPLGLCVPLCNLE 


4448 


18349 


A 


4478 


167 


2 


ACX3ISPFSLSSSCFGHVRHASLPLAFHH 
DOGFPEASPAMLPVQPSQLNLFIKYPV 


4449 


18350 


A 


4479 


195 


26 


WVSMLDIHDNGPHEVAVICFVCFLSQSL 
ALLPRIjECSGVISAHCNLQPGRQSQTPS 


4450 


18351 


A 


4480 


175 


1 


NPJSATKCDWGEKRTSQELNLFFWFFVCL 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Atanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, P=Pheny!alanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N«Asparagine, P=Proline, 
Q=Olutaraine, R=Arginine I S=Senne, 
T=Threonine, V=Valine, 
w— tryptophan, Y=iyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














FEMESHSVAQAGVQWRNPGSLKPLPPRF 


4451 


18352 


A 


4481 


122 


357 


YHAFLYQVTOJLKREHKRFRCSLLI I FLG 

fl7DirT/*QTr»T WIsTT OOVTrtTDDVtn" DTtlP 

GLGAVAHACNPSTLGGQGEWIT 




iOJJJ 


A 


44oZ 


i m 

137 


2 


r lb YIjKXIjEISMGPHxVPQAGIjEIjL 
PPDSAYQSTEIIGMSHH 




1 ftl^/l 
1oJj4 


ex 


a a an 
44 &J 


1A1 

34/ 


208 


IsQAGIjEIaLiTo Sli PAS AS QS AG I TGAGHH 
TRPRIYFLKKYLAFVU3S 








AAOA 

44o4 


411 


268 


RLiN RRGGGCS E P RS CRCT PAWATEQNS I 
SKKNQQKTTKLPLCCRLILL 


4455 


18356 


A 


4485 


2 


102 


PCLLKMQQLAGliGGMHLCSQLLRRLRCF 
T.iHSYP 


4456 


18357 


A 


4486 


1 


113 


HSAHSTHAGHAGHTSLPKCWDYRSEPPR 

T TV T/PT urnnv 

L«AKr JLiNTRY 


4457 


18358 


A 


4487 


163 


353 


QVAATLNNLAVLYGKRGKYKEAEPLCKR 
ALE I RE KVQKKGAVVFFE I FVLCVCGF F 
FFFKRDP 




loj Dy 


2V 


44 o© 


>IAO 

4Uo 


JU7 


KljConPKinlJLSAAWLiI isKDS VblvKNrCN 

KSPKKQ 


4459 


18360 


A 


4489 


337 


179 


YSINFALILIFLFYU3VLVWAAIINYHR 
ioHGLNNKHLL I M I WMVGS PRLS CG 


4460 


18361 


A 


4490 


229 


345 


LENETLGRARWLTP VI PALWEAEASRSR 
GQEIETIIiANM 


4461 


18362 


A 


4491 


180 


365 


E3^NLKTLVALDMSFYFILFLFYFILFYF " 

ILFYFLRWSFALVAQAGVQWRYLGSLQP 

LPPRFK 


4*K)Z 




A 


4492 


269 


385 


GNIjQl>rLElwFYKIQHPFMI KTLNKLVID 
RTYLNIIKAIY 


4463 


18364 . 


A 


4493 


103 


336 


Ql^VSLIlS^FLQAFKIiQIDCXjHTQIHFW 

D^\KTT fT T DDTT TVPODOMT IT pnPT T?/^C«/*» 

KyiNliLJjijPP i JjP^I PKKNJj 1 J_iSPSl_iii,LSG 

TIWAHRKIiRL PGSHHS PASATR 




JojdD 


I\ 


4494 


305 


379 


x X KX JSNF vj^VVAHTCNPSTliGGRG 


4465 


18366 


A 


4495 


47 


224 


ETATKTQRRWETALVPQHPHRPLAPVTW 
VKTTGNSTIANFLIilLVYLSFLFLVYSC 
IQQ 


440O 


18367 


A 


4496 


3 


129 


XTIjwPGGRGCSELiRSCHCTPAWATERDS 
VSKKKKKFPPPPF 


a a tin 
440/ 


J 8368 


A 


4497 


396 


112 


LPELEFEAAVSHDCASELrQLGHQSKTLF 
QKKINRI INNNNNSHIAISLLGI YPKEF 
in\fiTnp n TrTTJM t t zv ct .t?t t vttd owrkoir 

CuniJl SUKJJ X\-0 JL JrrIX lAOLf 1 X V AJtvytvJ PXv, 

CLATNELVYRM 


44651 


15 Joy 


TV 


/I /IOC 

44^0 


i 


Dz 


XioijPvjri>y 1 WWKAXil V PA1 WhuAJfi VvjtjSPll. 

PRRSRLQVAMITPLHSSCGRRG 


446Q 


ioj /U 


5\ 


/4<100 

44W 


yZ 


Z 


VTDDn^nr r r r UA/"" r NTr>c , T T T onD/VDT'pifexnn) 
KXCPWi V V 1 rlAv_N r o 1 iAioKUGPXIKJbVVK 

DQ 


' 4470 






4jUU 


116 


1 


AJjlAGMUXiHAyijl r lix IriiVfc. 1 Rr uHiiGQ 
AGLELLTLWS 


4471 


1 01*70 


A 


45U1 


oo 
99 


1 


Uo X P VHRWWJvCKXiVy PLiWK IVWo r LiRKV 
KIDNS 




1Q*1*71 
IOJ /J 


A 


4502 


140 


288 


KUrHi VAQAGIjEixlXjFSNijPALTSQSAG 
ITGASHHNQPTSRFLKEKKIG 


4473 


18374 


A 


4503 


330 


181 


FFFFLFFFFFFFFFFFFXXFFFFFFFFF 


4474 


18375 


A 


4504 


171 


2 


VETVSKESSLRICLQDIRYFFFETKPHS 
IAQAAVQWHYQGS LQPQPLRLKQSSRLS 


_4475 


18376 


A 


4505 


345 


74 


SVERTCHSPKPLMLFLPLVFQARDDILN 
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ed end 
nude- | 
otide 
location 
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peptide 

sequence 


Amino acid sequence (A=A(anine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Acid, ^Phenylalanine, 
G=G!ycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, MHMethionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R^Arginine, S^Serine, 
T=Threonine t V=Valine, 
W=Tryptopnan, Y=Tyrosine, 
X^Un known, *=otop codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














VjottP Vor JarxAGrSr UvaryHy J.yrvjfc'rivisn. 
KHKPGFLEACHPYQTAEPASALTLDFPA 


4476 


18377 


A 


4506 


268 


102 


KNIRELWPGAVAHACNPSTPGGRGRWIT 

DCrTMJTWT "OTCflTTWTT S*\r\V(~lWT) PT.TQ 


4477 


18378 


A 


4507 


1 


67 




4478 


18379 


A 


4508 


339 


180 


ANFFVTFEEMGFHRVAOTGLEVL6SSDL 

DTT TkGTTCAPTTY^TOVCVP'Df'VrKPPT? 
F±JjAoKiiAljX Xuio lb VKJfy 12* la 


4479 


18380 


A 


4509 


174 


3 


NEPKVVIYIIPGDPLFFLLLLFFETESH 
YVTQAGVQWHDLGSLQPLPPRLQRFSCV 


4480 


18381 


A 


4510 


84 


3 


AHHI FT VGID VDTKAX rToAT 1 X LAX 


4481 


18382 


A 


4511 


127 


2 


LLIKIHCWPGWAHACNPSTLGVRGRWI 
TRSGVQDQPEQYG 


4482 


18383 


A 


4512 


109 


345 


EIFLIQGHEGSCLCFLFGSFMCFVATGF 
PHVTWLILTLSNLPTS SYS RAG I TGVSH 
HARANNLFCFGSMPSHGITGLNF 


4483 


18384 


A 


4513 


290 


13 


RDLRPTPQKGPKNHTAPNPLQRAQTHVK 
KGRGPTPPTPTPS KDPGPHPPLRKPKPW 
GRVWGfi KJGvKN KNGF IUJKjKG JGnJUUnJvKJs 

RKIQHSRF 


4484 


18385 


A 


4514 


234 


362 


DGDVLSKFVNFFSFKWEGTPLIDGRRL 
KYKLNGDFIjr LiRIjKIj 


4485 


18386 


A 


4515 


190 


330 


CLALQSECFRRWS FAL IAQAG VQWRDLS 
SPQPLPPVFKRFSSLHPFE 


4486 


18387 


A 


4516 


2 


199 


^CVCVCVCVC V C V C 1 r* V b'VLiS KJK.WG FFFli 
CPPGGPKTLGPKESPPLAPKNRGNSGLT 
PPPPPTFFF 


4487 


18388 


A 


4517 


94 


339 


FPNKKKiiKPPQ I F F P PQKRP r I FF F P P P 
PGGKNFFFFX3KFFFNPREKGPPKNLG 


4488 


18389 


A 


4518 


95 


3 


LYLGPGAVAHACNPSTLGGRGGQITRSG 
VQ 


4489 


18390 


A 


4519 


14 


121 


LIFFGRNRISLCCPVWSQTPGLKPSSWL 
SLPKCSD 


4490 


18391 


A 


4520 


207 


2 


lbKiiAJLiGi vKilrlJWriVJ.niUoJ.Jur I XLir X 
SIALPPRLECNGTIMAHCSLHLPGSSNP 
PASASQVAGTTS 


4491 


18392 


A 


4521 


169 


1 


ANLTQVGSYYSLSLHTLIQVTFFKNIRA 

VKJrG 1 V xUlAUvJrJK X lA3X3trLAyV9 X 1 KoLriJX 


4492 


18393 


A 


4522 


2 


82 


GFILCVCVCVCVCVCVCVCVLYFI FF 


4493 


18394 


A 


4523 


300 


3 


LFLPPSPLPSQIPGGQKPLPLIFPPRVP 
KKFIPPNFWAPPLISPRPFWVRFLGPPT 

/"VTTMVTJT r»T5T*'0'Dt>t>DDT?C!T i" 1 17XTTTC QXTTJI? 

\2Gr JN * FJ-iGfc'G.fc'lr Jclrri/r olJGJclU^ JrWirr 

FIKKKFSPPPKKQK 


4494 


18395 


A 


4524 


348 


94 


JvPGrli GG KG Py N Ir r IrGGJj KJrvxN Jf JjJN FGGG 

GSPEPOTPPWPPPRGAKKKTRPQKDXKKK 
N 


4495 


18396 


A 


4525 


313 


161 


KGHJrl^JulwKJS VJN J.J^^ 


4496 


18397 


A 


4526 


118 


3 


FFFFLRGS FALYAQAWQWRYLSS LQAL 
.PPOFKRF ^PP 

•Jtr Xr\JC XVkVi tjv^c 


4497 


18398 


A 


4527 


153 


2 


KJjAINKMKKW PGWAHACNPS TLGGQGG 
WITRSGAQDQPSHYGETPSLPK 


4498 


18399 


A 


4528 


258 


2 


GGQKRGLPGFIKKPPFRKIFLKKIiGEGP 
GKNSI*S KFFFFFF FETESHCLTQAEVQW 
LhTLSSQQNKPPRFKQFLCLNHPSSWDYR 
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correspond 
Ing to first 

o mi no ac* iH 
aiiiiuv «wu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Add, 
{^Glutamic Acid, ^Phenylalanine, 
G=Glycine, H^Histidine, Wsoleucine, 
K=Lysine, IHLeucine, ^Methionine, 
N^Asparagine, P=»Proline, 
v— vriuiarnine, iv— Argmine, o = oenne, 

K A II 1 VUlllUC, T~~v<iiiiie, 

W=0Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 

11UL.ICVIIUC IlLbcI nun 














R 


4499 


18400 


A 


4529 


296 


39 


SIRI INIQILLGFFETGCHS VAQAHLKL 
LGSNDPPASASESTEITEMQHCTRLSIQ 

xr v/irrxowxiUio i iiiif^f i, < » j k 1 u ti »YiVv.i 
K 


4500 


IftdM 

lOWl 


j\ 






in 
j f 


LGSNDPPASASESTEITEMQQCTRLSIIi 
IFVAFPYSNNKLSTKKESILIKLLLYNR 
K 


4501 


18402 


A 


4531 


3 


138 


GLWNGSCLCVCVCI FMGMGACLVCICTC 

Jj X V*W- V tr V W X UIjvJHIA^K 


4502 


18403 


A 


4532 


380 


2 


GGPPKKRESAKAPGGRNPQGGGREKTPP 
PKKKRPPQBCGGGKKTPPGGGGGPPRGGG 
JSAJ\f r FJUCaiyLCHRGGRGGGGPPVVTPK 
EGPPFFFFFFFFFFFFSLPFFFFFCFFI 
FILFFPNIVINGFY 


4503 


18404 


A 


4533 


3 


191 


LSKITHAS I PVPVSKKKKKKKKEKKKKK 
FSRGLE 


4504 




& 


ASIA 


j /u 


/to 


noi\i3Tr\7C aurzf if cine t~* hit r~*T r* vt%t none 

NGVKGYPSPKSSPLFQAPLGGQGFPKPR 
GNGGTPPFFPKKKKKKCALGDGTPTSGC 

ct\i\t3 n antra Xs^JL/wv-.VjV^WVjorv V irxv 


4505 


18406 


A 


4535 


147 


1 


KGPPLFFFFFFFFFFFFFFFFFFFFFFF 
FFFFSFYFYIIYLFFLSFLRL 


4506 


18407 


A 


4536 


198 


I 


FFFFFFFFFFFFFFFDSFIFIFLYFNFI 


4507 


18408 


A 


4537 


2 


356 


QNRTTGNSKTQSAS PPPKECSAS PATEQ 
owraisxvL/ir jj-K-LiK-K ki-tK i k s im y.s k i i i< ^, p 7 q 
TKGKEVENFE KHLHE C I TT I TNTE KCLQ 
ELTELKTTARELRDECRSLTSRRDQLAE 
RVSALE 


4508 


1JUOQ 

10W7 








ZZu 


GITGMSHHVQPKYC 


4509 


18410 


A 


4539 


308 


2 


FS ISCPHDTPAAASQSAGITGVSHCARP 

TT?QT7Tr*l?17T?Tr , TMf^T WT T dtyT^ottmvo 
xrariLrf r i^xro^XlviljijKlvji/oVWxif 

HGl^CFGLCCTLESYIRKLSHMKKLSIL 

ITSREKCKCILQINIGDR 


4510 


18411 


A 


4540 


2 


194 


IKQ WSTALS PRLE CSGT ITAHCS LKLP 
GSASASVVARTTGTCHHAQLIYFSETGS 
HYVDVDAA 


4511 


18412 


A 


4541 


101 


2 


MEENLGNTIQDIGRGKDFTSKTPKAMAT 
KAKID 


4512 


18413 


A 


4542 


i 
* 




AEEAEAEAEAEEEEEEEEEEEEEEE IKY 


4513 


18414 


A 


4543 


3 


183 


KIQILFFVSSKTRLRPSAWHPRFVFCFL 
RWSIiALAAOAGVOWHHT, q «3T /)P T.T .PTJRK 
VDAA 


4514 


18415 






i 
i 


IIS 


EEAKKKTKNKDK 


4515 


18416 


A 




ILrf 


9 


CMCSHK 


4516 


18417 


A 


4546 


5 


108 


FKRLSCRSFPSSWDYRREPPRPANFCIL 
SRDGIS 


4517 


18418 


A 


4547 


131 


1 


RPRRPQVGLQDATS P 1 1 EELITFHDHAL 
IIIFLICFLVLYALF 


4518 


18419 


A 


4548 


156 


3 


KGGQGMSCLSRSPSALLKFECRVISSYW 
AWWRAPWPATRGAEAGESLEP 
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mi4»1o4ttif1 a ificarfiivn 

nuticuuuc liiocriiuii 


4519 


18420 


A 


4549 


2 


340 


LNPGSAGCRELRSHHCTPAWTTERGSVS 
KKKKKKKKKKKKKKKKKKKTFFGGGPLW 

DIVOC JJJ_JVXL>II7 r I Ei .fVaWTVTJ/Ai.i C C IT LI CxLH ex* vj XT 

QLKTWGKKAFFPRKGGGPLPTPPQGFYQG 
A 


4520 


18421 


A 


4550 


165 


335 


PHYLPRRSIIHRPFKSHSPEPGAVAHAC 
NPS TLGGRGGWTTSGOEFETS TiTMMFKS 
L 


4521 


18422 | 


A 


4551 


123 


3 


GRVDRLSCLSLLNSWDYGKLPLHItANFC 
IFSRDGVLSSW 


4*i99 








i 
J 




FGYVGMDVSYDPVRL 


4591 


1R494 






0/ 


i#54 


AAGTASAAVPSLEAQTPAAFRPTPPGLA 
PSSPPAPGPPAPTGKVCGRHFLRSESSA 
SGPLRALL 


4524 




A 


4S54 


114 
1 it 


n 


LTWAPHS L IBT I RNOI IOAIiL TTTT.T.dT. 
YFTLLQRSE 


4525 


18426 


A 


4555 


91 


3 


TTTTYIAJCATACVCVCVCACVCPCVCVC 
M 


4526 


18427 


A 


4556 


239 


364 


CI KI T FS RPGAWHACNPS TLGGQGGR I 
AMS RDRDHPGQQGK 


4527 


18428 


A 


4557 


250 


1 


STEHVKQLRNEFMKISVILIFLEILLYI 
KVHLKLCTLTTDHTtTiTiGIKSLKILFRPG 
AVAHACNPSTLGGRGGRI PRSGDRDHS 




1R49Q 


A, 


455R 


z 


on 


F 


4599 


18410 

lOHJu 




4550 




1Q4 


KKKLRADLILKPKKKNFLKKREWKKREK 
GPSDLKKNTPBTKLNLVQKKFQIFF 


4530 


18431 


A 


4560 


93 


2 


GIISAHCNLCXPGSBDSPASASQVAGIT 
GQ 


4531 


18432 


A 


4561 


125 


4 


GAYILYLNKKSKS IRPGMVAHACNLSTL 
GGRGGRITRSGD 


4532 


18433 


A 


4562 


3 


84 


RSCHCTPAWATERETLSPKKKKKKKF 


4533 


1X414 






145 




GFYHVVPGGVHOS PCIjSTjP KYWNYHPK P 
PPLAS IGFFFFFFETQSRS 


4534 


18435 


A 


4564 


125 


388 


IPJCPQILYSPQSSENMQHLHISVIaMGVT 
GKRGKFMDIIxDVTERCRFSLKKCTGIQE 
DLNKWEGIPCLWI IRIOTVKMAVFSKLI 
CRFS 


4535 


18436 


A 


4565 


59 


325 


VARKGNLVSFLVIFYPLFDIEEVIEPLW 
VLISSFDEGEKVELEFFFFFFFKKKSPF 
VTQAEFQWPNLSSLKPQPPRLKQFFCLG 
LLIT 


4536 


18437 


A 




196 


1 


PE KE FRRIi VT KL I REAP EKGEAOC FCF T O 
KSIQEVKGEIFKE IDRIKKKQLKHQETL 
DILLVMQNALESLSNRIEQVEERNSELE 
DKVFDLTQSNKDKAKRIRKYEQS 


4537 


18438 


A 


4567 


333 


209 


EQAFDPYGTLSQGIFSPKFHEPFHKLMA 
RPAVALPFOVNTjIC 


4538 


18439 


A 


4568 


125 


2 


HNQSMSRS IDCSDPILGHCNLCLPGSSD 
SPASASQVAGTTG 


4539 


18440 


A 


4569 


212 


2 


P I CLSGKKC YGQNVRVI KCHMWKHNHRL 
GVWHACNPNSLVGQGGRFMRSGVLEQH 
EOHACTPSLLKTLK 


4540 


18441 


A 


4570 


142 


14 


GRVDRSNPGRFLS TSNSSL YERP REIRP 
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TSO APS PVNP T T <5 T 

J- uyxlTO IT V IN IT J, JLO 1 


4541 


18442 


A 


4571 


2 


419 


ARDS FLHDSQTS FCFSDS I PTPSNMEET 

QQKSNIiELLRISLLLIESRIiEPVRYIjRS 
MPZVNWT.tTtmTQnQnnvTJT t irnr x?x?r* y r\*v 

ViF J-UMn ±J V tUJ JL Z>XJk>LJLJ X H 1 ti f ftJL 'jjPi KKj X \2 J- 

LMGRLEDGSRRTGQ ILKQTY S KCDTNSH 
NHDALLTNYGLLYCFRKDMDKVETPLR 


4542 


18443 


A 


4572 


3 


194 


HEGRLQSQLLRRLRQKNCLNTGGGGCSE 
JrKorinor JrAWv 1 £*xlor XoUJlSXuUriiFxS 
HPEPFLL 


4543 


18444 


A 


4573 


2 


286 


CRDGLTYNDFLILPGYIDFTADHVDLTS 
ALTKRITLKTPLVYSPMDTVTEAVMAIA 
nflu iwVa x ur A tltUM L. 1 Jr c! r ilANlWKKVNK. 
YEQGFITDPW 


4544 


18445 


A 


4574 


44 


325 


RQWLQEEAYITKEQKYSFLHDSQTSLCF 
SDSIPTSSNMEETQQKSNLEI1I1HISLLL 
IESRLEPVRFLRSTFTNNLVYDTSDSDD 


4545 


18446 


A 


4575 


374 


33 


LRLARRSGASLCGSRSGGPPTCANICPR 
YAPLI PP CRTS LLEAQGLVMWGPD FRV 
KLjV iobJLRKrKYQPPPPGFNRFPCLSLP 
SNWDYRHAPPRLANFNSIiFFFFFLVETG 
Y 


4546 


18447 


A 


4576 


400 


225 


PI FFWFFLFVLFLVETRItCHWQAGLE 
LLGSSNLPISASQSAK1TGMSHPAWPRA 

Of 


4547 


18448 


A 


4577 


2 


372 


ARETI FCFSNS I PAPWNMDETRQKSNLQ 
LLRISLLIIiLYRTRLTLPCITQLHILQT 
X JWINiUCLbhir Hbbo 1 CXjPIjISCTIjYVSL 
TGLDMKLCHSTLLKQNSALMSLLKIQVN 


4548 


18449 


A 


4578 


349 


174 


GCGEPRSHHCTPAWATRAKRCLKKXLKL 
QLKNWKESKHQPLQNFSHLTTSTINTQ 
EN 


4549 


18450 


A 


4579 


3 


124 


HEI1LNPEGEGCSEPRSCHCTI1AWVTRAR 
LRLKKKKKKKKN 


4550 


18451 






iol 


i 


MCCVYPQOPCOT T M2VPT7VKTT T_IT nopnup 

PASASQVAGNAGARHYAWLI FVLLVETG 
hCSC 


4551 


18452 


A 


4581 


227 


3 


HRKRNRCVS RTALLFS LGDGVS PCLKI I 
TRSGVPJ3QPGQHGETPSC 


4552 


18453 


A 


4SR2 


257 


9 


PPRAGPVG3 PRYSRPLGGPGRQI PRSGD 
PGPPGLPRGNPVPLKKNPKRSLLILPAR 
A 


4553 


18454 


A 


4583 


3 


94 


HE IAMAFL I LTERK I LG YIOLPJCG PMUV 
GP 


4554 


18455 


A 


4584 


345 


50 


IYIYICIFLNRNRVSLHCPGWSKTPGLK 
QSSHLGLPKCWDYSHEPLCPAKTSHFKA 

YLQS YS PAP YNNKN 


4555 


18456 


A 


4585 


338 


19 


FQLSRRFRPKNGGNPGSGACSEPRLGPC 
PSAWVTKRDSVSKKKKEYHTLSGLSTTG 
I YFSQ FWMIiGVQGASRFGVWGGAVSWS I 
DGAVLLRPHMAEGARQLSGLGF 


4556 


18457 


A 


4586 


2 


224 


ARGVGRAGLELQTSGDPPASASRGAGIT 
GVSHRVQLLGDRAKFSLKKKKRGI I LPG 
E FYS PLKKRFTLNTPDNP 
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4557 


18458 


A 


4587 


344 


168 


PPGTVVHTYNPSTLGGQAGRIGSLGVQD 
KPGQHRQGIiSLGVEDKPGQHSETPSVQS 
NP 


4558 


18459 


A 


4588 


212 


328 


GLAVAQAGLELLSSSNPPASASQSAGIT 
GVSHHTQPYF 


4559 


18460 


A 


4589 


110 


323 


LSNQKNQLYNPNLLIYCVVRKLKSSYDK 
QAEVQGSKLGSLQALPHGFTPLSGLNLP 
SSWEYTRPPPRPTNF 


4560 


18461 


A 


4590 


2 


127 


ARGCSEPRSRHCTPAWATEQVS KKKKKG 
REKTFGQFFFWVI 


4561 


18462 


A 


4591 


211 


2 


ARGSLEAGSSTPAWATINQSTNKKTDKT 
ALYWKKMPSRTYKARQKSMPGFKSSKDR 
LTLFLGAKV5CLV 


4562 


18463 


A 


4592 


3 


120 


HEKTKTNNP I KKWAKDMNRQLS KEDNDN 
KINGWPGAVF 




1R4£4 

1 0*T\Jt 




45Q1 


Joo 


977 


GHVGODGLEIiLTSDAPPSLASOSAGITG 
VSHRAQPKWUiFQHRSPCLPEAAGK 


4564 

tJvrt 


1K465 

J 0*HJ J 


A 


4504 


1 
l 


riAQ 


PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PPPPPPPPPPPPPPPPPPPPPPPPPPPP 
PLPPPPPPPPPSSPSSSPPNPPATTSVA 
C 


TJUJ 




j\ 


4505 


o 

■L 


lift 
1 JO 


ARAARGFRHVGOAGliELLTSGD P PASAS 
RSAGITGVSHHTQPTNF 


4566 


18467 

iCrtv/ 




4506 


115 


9 


ALKPNYSSTSMNIKARLPQYNINNLNLA 
PLYSSTSNQINI I VI ILINKENNHTLFP 
CLVSNSRLKQSSHLGLPKCWDYRRELLC 
LACDKFFISYSAPAIQLLSC 


4567 

•t JO f 


1R46R 

IOt\JO 


A 


4507 


1 
l 




GTSVDOGSLELLTSGDPPASASOSAEIT 
GVSHRTQPPI/TI IQLYLGYC 


4568 


18469 


A 


4598 


293 


3 


GGTPWPRKKLGQRKFPHRPPGGSGNPDF 
SIiLFFFFLROSFALVAOAGVDWCDLGSI» 
QPPPPVNPFLLRKQFGFSTSCFPILPAS 
GSFFGREASASC 


4569 


lot /u 


A 


4500 


174 




RSVAPFLKSARPQTYRKEPTPDTGSPQP 
PPPRFKRFSCLTRLSSWDYRHAPPHPAS 

c 


4570 


18471 


A 


4600 


59 


252 


LAPFGHELGTSESAHQTDNRHRLEGYQK 
RLDASGLERASYPIAAEFKVGGRGCSSQ 
PLTLCGYP 


4571 


18472 


A 


4601 


233 


366 


IHYLPDAVAHACNPSTLRGQGGRITRSR 
DRDYPGQHGETPFLLK 


4572 


18473 


A 


4602 


182 


325 


LFVFCLFLRQCXSVAQAGVQWHDHCSLY 
PQPPSLKQSSRLGLPSSWDY 


4573 


18474 


A 


4603 


202 


2 


GGSSSGLLRPCAVLGLKQSSYLSLVRRW 
DHNSWLKRSTHLGLPKGWDYRREPPHLA 
CILKFSINLV 


4574 


18475 


A 


4604 


317 


2 


FRU3PQVFFSPPGAGFFFCGPVWGPGPF 
SRSLQLLTFGAKFFFCLSLPESFGAPLR 
GP PSG FMAHC FF FLVLLFFFFFFRDS VL 
LCCTGWSAVAIHRRDPTTARA 


4575 


18476 


A 


4605 


233 


71 


FFVLLVEIEDQLNEKKQEGKIRKKKRMK 
RNKQSLQKIWDYVERPNLAGHSGSRL 


4576 


18477 


A 


4606 


114 


1 


HPRPARVSSCCPNWSAMVWSRIiAATSAS 
RVQAILQSRA 


4577 


18478 


A 


4607 


348 


3 


LKFQNFRGVVHGARCPSFFGGLGRGIPF 
TPEVEVLVNLGPAPALPFWPPSQTLFSQ 
NQKKNQKKKQERKREKERKEERKKKKER 
KKKEKERERKEGRKKGQEKKRKITAKSL 



831 



WO 01/64835 



PCT/US01/04927 



SEQD> 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

resiuue 01 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny(alanine, 
(^Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, MMMethionine, 
N^Asparagine, F=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
i— inreonme, v—vanne, 
W=Tryptophan, Y=Tyrosine, 
A— un Known, ^otop couon,/=possiDle 
nucleotide deletion, Y=possible 
nucleotide insertion 
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*¥J Jo 
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i 


GI I IAHCNLKLLGPSDPPTSVEKTKKLV 


id /y 


IftAftfi 

lo*fov 


a. 




1*71 

1/1 


334 


nouccT KTT13VCD etrartunrnunMT mot r\ 
PPPPGFQRFLCFSLPSSWDYRCAPP 




184511 
10401 


a 


ill 


l 


164 


AK^o Kuril W£>Ar LLrl VUyA(jJjn*LaIj xSRPA 

LASQSAGITGISHRAQPPLNTLYSMS 


4581 


18482 


A 


4611 


2 


164 


ARB PRGHIWSSFCHVGQAGLELLTSRPA 

uA&U&Abrl IvjI^HKAvJP PI>i> 1 LjIJ£dMS 


4582 


18483 


A 


4612 


180 


334 


CCLMGSSYLLPRKPNTIiRTAGFLQQRKS 
DIKPGM v VHACW PS TLGGQGGW I 


4583 


18484 


A 


4613 


189 


1 


SGFSRWENGLKCKGQKGGQSLVLLPIR 
VF F F P F FEMKSHS FAQARVQ WGNLG S LQ 
PJjPJjGJFK 


4584 


18485 


A 


4614 


216 


358 


KSTPRLGTVAHTCKPSTLGGQGKRITRS 
(jDKUnoUIjHJxGTPRXiSKNI 


4585 


18486 


A 


4615 


179 


393 


EKQEEEGSWGWAKTEDGVALMLSEAIGST 
CWI KAFIjTDAETFTrHAD I CTTjS EKERQ 
IKKQTALVELVKHKP 


4586 


18487 


A 


4616 


2 


126 


ARVQRYLFLFFTLDGPLPDPSMIRGSVP 
NQMMPRiTPQSGK 


4587 


18488 


A 


4617 


338 


181 


RVGEAGLEIiLAIiSDIiPASDSQSAGITDI 
SHHTRPGSPFLEHHXiYPRYRISGK 


4588 


18489 


A 


4618 


3 


197 


HEKRPEVQNSTVEFIASSDYMLRPPLPL 
PFCFFLFFTKIKMGVSPCCPGSCQTPSL 
KQSSCIiNV 


4589 


18490 


A 


4619 


, 1 


339 


GTRTFGSGNGPTKPDLLQELRVATERGL 
VXVNCTHC1XJGAVTTDYAAGMAMAGAGV 
ISGFDMTSEAALAKLSYVLGQPGIjSLDV 
rkelltkdiirgemtppsveerrpslqgn 
T 


4590 


18491 


A 


4620 


3 


306 


LEPGDGGCSELKSCHCTPAWVTERDSIS 
KKKKKKKKGGPLKGTIiGGPKFNGGWQRK 
IFSQKGGQKKPNLGILGRNLNFGGEKNW 
NKFLTKI KVLREKKNF 


4591 


18492 


A 


4621 


397 


29 


NTKEKIIiSARKNNQVFPPHTQKKNFFPK 
KRKNFREGIPPNFPPPKKSPPQKNHPPQ 
iUU^PiusKKKI IARPPKKRAPPGAFKKK 
PPPFFFLSSLCISYSFIYFFSLINLFFF 


4592 


18493 


A 


4622 


2 


235 


VSLCHPGGTISTHYNFCZLPGSRDPPTSA 
SAYALLI FVFFVETGFPLVAQAGPKLQG 


4593 


18494 


A 


4623 


209 


324 


ILRVLWSGTVAHACNPSTLGARGRQIMR 


4594 


18495 


A 


4624 


52 


287 


NPFSFIiFFSFLSFFLFFFFLEKGPRRGA 
xvj-iVi ivu rvir n.Jtr r*j\j\^r>jvr > ^kjoOiViviN r\. 1 v iv.r 
FSPLKGPRGPPRGKGQTTPKGP 










1 AO 


1 


TUCODtfC T CT.CT G tl*PLI r nJM l TJM*IJ , l , LI , l*C' T V 

lnorrVoiobdJjbalhlnlhlKlhlbli 
RVREIR 








4626 


J ZD 


1 


UKtl V pytl 1 AW r r V VJLiXBr'Rir lixlV(jyAo-u 
KLLTSGNPLVPLV 


4597 


18498 


A 


4627 


3 


167 


ERLRQQNCTNPRSEGCSEPGS PHCTPAA 
WATEQDSVSKKEKEKRKKEKKKKIW 


4598 


18499 


A 


4628 


135 


3 


AASTLSRGTITAQGNLCLPV S SNS PAS A 
SRVAGITGARHDTWL 


4599 


18500 


A 


4629 


320 


210 


W FLG 1 1 FRRVENAES VNIDLTYD IQS FT 
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1451 dllllUU 

acid 

residue of 

peptide 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Acid, ^Phenylalanine, 
G=Gtycine, H-Histidine, l=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
vr=<jiutamine, K— Arginme, o— oerme, 
i — inreomne, v — vaune, 
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X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \— possible 
nucleotide insertion 














DTGKTPSS 


4600 


18501 


A 


4630 


98 


326 


SSPTIaAFQFQYFFSCATKEAVKLSFPKS 
LRLSSCLTLVSGWDHRCAPP 


4601 


18502 


A 


4631 


275 


2 


RVKLIiLFYMRQSCSVIQVESSSTVNGSL 

ANFFLFFWRAKGLAILPVHSNLVNTESP 
LISIKSN 


4oU2 


18503 


A 


4632 


1 OA 

180 


1 


isXjLj ijjxr 1 1> c J. r J\I*ix<J<ijiJtAwxiruJ Wo y 1 f 
ELNQFSCLGLPKCWDFRHEPPHPVKEFK 

PC 1 LTD 

rinn 


4603 


18504 


A 


4633 


234 


5 


KPLRDKPLRSFSNIiPKKNLNFFWPFFPQ 

ITT Wf?F PPTT? C! WrUTO A^ll/'AWT.nT. Q 
cvX ri <Jf r C C C C C a X CiOuU V X V^/rVXjL/iJO 

SLQPPSPGLRQFLCLSLSSS 




law; 


a 
j\ 




1 /l 




TRLECSGVIMAYCSLDLLGSGNPPTSAS 


4605 


18506 


A 


4635 


324 


211 


LRNIIFMPVTMNPGMYTGGCMCVCVCVC 


4606 


18507 


A 


4636 


95 


1 


MRQS INIiSPRLECSGTI PAHCNLCLPGS 
SNS 


40U/ 


loeno 

185U8 


A 


4o3/ 


234 


l 


IYKVWKNSPNYSSLTHHINNIKPSFTRE 

IN XUTir IflUJf XXIXHJJl.nX.Irl/XX X 


4608 


18509 


A 


4638 


2 


243 


WGKV1TODEVGGEALGRLLVVYPWTHMYF 

pjrnrcA wvT ^ pr*T. fft . t pp ht.it pttq <? t w t 
aif uat i loruur r uir jrnur c vooxm 

VSS FYLLFYNFP 1 1 FFTLFCHFFL 


4609 


18510 


A 


4639 


118 


2 


GRVDLQCSGAI S AHCKLHLQRS RHS PAS 
iwyvwui/u x 


4610 


18511 


A 


4640 


50 


168 






KKKKKKKKKKN 


4611 


18512 


A 


4641 


335 


87 


GIFLRDRNLALLPRIiECSGTI IAQCNLE 
LLSPKDS PAS PS PVARTTGVQPRPDTLT 
t.t?t?t»t vtp wriMT p<?nnnr* 

UP CtXlX X XivAAUl'JXr O^VJiJU 


4612 


18513 


A 


4642 


383 


2 


KFKKKKFAGGKAPARLSPPLWGPKGGGP 
PRAGVLSPPGPPGETPFFIiKNPKFPGHG 
cinvi'o ire d tptj^it 1 t .7^phc t it v/^/ira ttotd 

OO FKlVo f If rvaOJj RVxEtlNoXt 1 KJ1 Auur J.JM f 

NIAPPAFPPKGQNPGPFPKK3CKKKKNRS 


4613 


18514 


A 


4643 


3 


205 


LKRFFCFSLGGQGCSEPRSCQCI PAWET 
VKPKGPLGAGI 


4614 


18515 


A 


4644 


149 


1 


FFPIKADLVFFFFFLRRNLADSQGAVQW 
HDLGSIjOAPPPGFMPLSCLS l 


4615 


18516 


A 


4645 


474 


344 


AIHEHYLPESSASGDCDEDLQAACTILV 
NALMDFHQRIiAAHTV 


4616 


18517 


A 


4646 


165 


357 


liCNALSHLLPQCTPTPLISILCI IDTIS 

T.T.TOP T TW A VP Ti <3 A WT T A r*WT.T .MOT TT T. 

STLSICTL 




1Q<1 Q 




AC AH 


2 




2i PmTNTiT PfifWlPQRT.PT.PUPTPT'&iaT 

RVNCVSQKKKKNQPKPKPNQTKPNQTRK 

f5 Q PT5PPT ,T .WT .1TVT Q pp IfG T .WDM T T .tfTAia 
KVTFRNWIiTPE IPPFWKMEBKGS FDPRI 




1 OJ17 




WHO 


o J j 


15/ 


LTSGDPPASASQSAGITGVSHHALPPLV 
QFPSHSLTPTPLFC 


4619 


18520 


A 


4649 


236 


324 


GQVMWPGMMAHACOTSTIjGGRSRWIVRS 
R 


4620 


18521 


A 


4650 


225 


313 


GQVMWPGMMAHACNPSTLGGRGRWIVRS 
R 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Add, F=Pbenylalanine ) 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=Proline, 

Osl^liitaminp RsAroininp C— C Ar S n a 
V£ vjri umiumcj iv fvi giiiinc, 4j— oerincj 
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W=Trvnfonhfln Y=Tvrn<?inp 
X=Un known, *=Stop cod on, /^possible 
mirleotirlp deletion \=nnQcihlp. 
nucleotide insertion 


4621 


18522 


A 


4651 


240 


76 


FCVHAHTHTHTPS FQASGSSEHMHTYTH 
PSGS PEHTHTYTHTT FLASRSAE PQLP 


4622 


18523 


A 


4652 


273 


379 


KQLAAQCVRPLS PSQAIAS PVTCTYMPR 
WPEVTEE 


4623 


18524 


A 


4653 




IRQ 


G xA/xtOAGAOCRNLGS T iOPT PP PCSPKY") P<? 
CKSLYSIMICYSKDCTMSTTLGIRANNR 
LHL 


4624 


18525 


A 


4654 


168 


346 


DRICRASSYTIHLML»yFVFLIFLFYWYI 

YPTjFFVT.PHPTT PT TT.PY«?PTTT,T.T.T.T,V 
x ±r xjx l v ur ixx iirx x ur xoci i inn » x 

FYN 


4625 


18526 


A 








WKRTFPPPFFI/RTC?SfiPAfiRPPTT.r.T.T7T. 
* * s\xv xx x x x x x x,i n x viuucruiUrL X xxLjxJx Xt 

YSIX3POTRVLIRVFLFLSSSRCYYYYLF 
IXSLSLLFLFFYIMYIxI^IRIjFFFVIjSGIj 
llllssyyrfi ifff 

■ ill 11 UJI )U X X l\X X. X. X X L 


4626 


18527 


A 


4656 


50 


186 


SACIiGIiPKCWDYRHEALRPAMYIxLTGSS 
YVAOAGLNLLGSADPPE 


4627 


18528 


A 


4657 


131 


366 


DHFVASSESVKEEGLFFVFVFFFFIFIF 

FFHiYICFFYFKFIFIIIiLYIIIIFIFI 

FYFFYTjYYTjIIjT PFTiT PP I FTO 
x xx v x xj x x xjxxj x x ruirr ic X\j 


4628 


18529 


A 


4658 


290 


155 


LEYIVEKGFHQDDKASPELLTSRYLSDS 
ASOSAGI TGHRAWPTJCG 


4629 


18530 


A 


4659 


163 


372 


LRLFALFAFLFMFFFYFLFLLYVFCFVF 

FLFFFPTFPTjYYPFL.YPPT.YPT.PQT .VPT, 
x xjx c s r xxxxjxxxx xj irruir xj\_<d xj v x xj 

FFICFIFFWFFVFF 


4630 


18531 


A 


4660 


227 


378 


FIFFSNCTSTALVHPLFYIiSRLVLSKNR 
NKFIGRARWLTPVFPALWEAEA 


4631 


18532 


A 


4661 


3 


141 


RSRHCTPAWQQSEILSQKERIOCERKKKK 


4632 


18533 


A 


4662 


346 


3 


TFKFLGGVCPFFSPPKKRVFSKNSPGGF 
FLPPFLGENFFFAPPVNFGPPGGFFKGG 
PPFFFFFFFFFFFFFFFFFFFFFFFFSQ 

ILIKIAL 


4633 


18534 


A 


4663 


302 


3 


GDINRRSHSPFVWSKPFAPCFMSSLDVF 
YKSSDFFYFFFETEFHSCCPGGISAYCN 
LCLPGSSNS PASASOVAQTTOVRHHTOTj 
I FASWETFKITYKN 


4634 


18535 


A 


4664 


2 


76 


RxxAPTPQLGGHWPPTGIxTIiNPIxEV 


4635 


18536 


A 


4665 


1 


210 


SPTRSPxCPTPPYYQTTIJaCPPTiaCiaCKK 
KKKKKKKKKKKKmGGAPlJ^ PGGPKI 
NRGKKKKFFFLKGG 














4636 


18537 


A 


4666 


2 


356 


GSRPGGGKRTARRWTRPWRXxRGACIjGTG 
MAFAFHHQVI YKI KSFLRKKKIQ^ 
KZKaCiaaGCKKKiaCKKKK^ 
GGFIiKKLWGGKFFMGGGKIPPPPIPGGK 
KKMWGG 


4637 


18538 


A 


4667 


1 


323 


RTRGIFCYDVCVESGCMIQIjLIiLCIjKK 

NPGKKKKNFSPKRGPKKTPPGNFEKKPF 
FGGGPNGAPPPQKNKTPKEKKKF 














4638 


18539 


A 


4668 


167 


446 


ELNKGLCKPKKKKKKGGGRSQFKPTRPK 

* VXvi\X Vi VtvXvXXX>JL VLVt^^XXVAVlVXV 

KS 


4639 


18540 


A 


4669 


321 


52 


GVFSPFS PKKFFFS PKGLNFWGGFAP IF 
SPPKKRFFSKNPQGGFKYPPIjRGKILIW 
GPRENWGPPGGFLKGAPIjFFFFFFFFFF 
FFFFWG 


4640 


18541 


A 


4670 


82 


359 


NAYGI I LI KGNEDTKTVKWKSS CQLANT 
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W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














WEFHIRALLGFNCFFLIFKRRGLTLLPR 
IiDCNGVI I IHNNLESLGLSNPPMLASQV 
ARTTGLCY 


4641 


18542 


A 


4671 


249 


60 


LliIFIYrFYFFLFFFLFFFFFFFFFFFF 
IjFVQVHLPVHLPCYDLS plykcvgvlvk 
CPFKYT 


4642 


18543 


A 


4672 


2 


319 


AAASTLVFVFSFFWVLGFFFFFFFFFWG 
PPKKILKKKGGGGGGGGGGPRGGPRGGG 
KRNPGGQKKGIGGANGGQKKERNFQGGG 
PPRGEKRGGGGGPGNPKGGGLK 








A £.11 
40/ J 


J 




HESFNPVI ISQLERGEDPWVTJDRKGAKK 
SQGLWSDYSDNLKYDHTTACTQQDSLSC 
PWECETKGESQNTOLSPKPLISEQTVIL 
GKTPLGRIDQENNETKQSFCLSPNSVDH 
REVQVLSQSMPLTPHQAVPSGERPYMCV 
ECGKCFGRS 


*HJ*t*T 






H-0 /** 


A AC 
*f*fO 


IRQ 
toy 


S ISIjSTLVNVCQIiISIjSDPASGFVDHVC 
CI FT FYS INFALILI FLFYLGVLVWAAI 
INYHRLHGLNTKHLLIMIWMVGSPRIiSC 
G 


4645 


18546 


A 


4675 


26 


170 


HLWFLVI VAHWASSRQI ISHWKNCTRH 
DCPVCLPLKNASDKRNQQSK 


4646 


18547 


A 


4676 


3 


378 


HEDEFDKMGNQHQALLEAMEQQS ISLAK 
AG WC I LPARTS I IAVAHS FAGIiYLI FN 
TVSWFFFFSNFIFLILSLFNYFCIS I Y 
ILFLIIFIFIFISSLIVFFCLISFRFLL 
FIFFSALSRIFIY 


4647 


18548 


A 


4677 


18 


242 


PIGIiAIiSPRLECSGVIIAHLLGSNNPPT 
SGSRVAGTTGIHHHAQPHCFNCI FPMSS 
EVECLFICLliLIHILSRA 


AGAR 




A 


467K 


zoo 


2 


PKYRWVPEKPTTASENAE IPSERQKEG I 
KLTIRISSRKKKPDSPPKVLEPETKQEK 
PEKEEEKTNVGRTLRRSPRISRPPAKVA 
QIRDPKAHKKK 


4649 


18550 


A 


4679 


340 


208 


SVGQAGLELLTSGDPPAXASQSAGITGL 
SQCYPPDSVPETFLPL 


4650 


18551 


A 


4680 


4 


379 


EENTG INLHDLGFGNGFLDMAPKAQATT 
AKKKKKKKKKIKIKSFLFLGGLFKKGKK 
TPPKGEKNFGNFKFWGGDLTPKI FGDPG 
ASTKKKKPFLKGGRGLKKTFFKKKKKRG 
PHPLESPSPFSVF 


4651 


18552 


A 


4681 


1 


350 


SRFSGAKEAKMEKKITGYTTVDISQWHR 
KEHFEAFQS VAQCTYNQTVQLDI TAFLK 
TVKKNKHKFYPAFIHIIiARLMNAHPEFR 
MAMKDGELVTWDSVHPCYTVFHEQTETF 
SSLW 


4652 


18553 


A 


4682 


365 


45 


SPKWFVASYDPYAGPGRPTRPDNSRDET 
GSHSVSESSSGSDSKRESTSSDREANEP 
SHRASPEPEPPPTNKWQLDNWLNKVN 


4653 


18554 


A 


4683 


238 


372 


SATPLI SS FS FLFFFFFFFREGVWLCHP 
GWSAVSRDCTTAAVWPG 




18S^ 

\OJJJ 




*HJ6*f 


J 




KTKES REAVE KEFEPLLNWMKDRALKDK 
I EKAWSQRLTES PCAL VASQ YGWS GNM 
ERIMKAQAYQTGKDISTNYYASQKKTFE 
INPRHPLIRDMLRRIKEDEDDKTVLDLA 
VGLFETATLRSGYLLPDTKAYGDRIERM 
LRIiSLNIDPDARG 


4655 


18556 


A 


4686 


410 


0 


FFFFFFFFFFFLLGKGGFGIIKFL 
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nucle- 
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Jinn <4*<t 

aing 10 
last amino 
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residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny1alanine, 
G=dycine, H=Histidine, I=IsoIeucine, 
KHLysine, L=Leucine, MHVfethionine, 
N=Asparagine, P^Proline, 
v==\ji Diamine, tv^Arginme, o-^oerine, 
T=Threonine, V=Valine, 

»» — 1 rypiupn^iij 1 — jjrosine, 
X=Unknown, *=Stop codon, /-possible 
iiucicuuuc ueitsiiun, \ — possioie 
nucleotide insertion 


4656 


18557 


A 


4687 


151 


2 


IFFFFFDRVQWHKLGSLQPLPPRFQQFS 
CLSLPSSWEYRHAPPRVRPRV 


* 


lOJJO 




AAQQ 

4ooo 


j4z 


ioi 


SPGFKRFSWAEVGGLHKPRSQRLQ 


40 jo 






AAQQ 

HOoy 


IZo 


4U4 


ALYFI FLEMGS YS VTQAGVQWHSIiGSLQ 
PPQWSYVLTFLWHPSSWGYFVQNHLTPC 
FDGKVPWV 


4659 


18560 


A 


4690 


2 


128 


GESLFYI VS PRDWVAKERDQDDHIDWL 


4660 


18561 


A 


4691 


407 


2 


YFFFPLPRMKIFFFPHQKGFFFGGCPII 
SPPPNEVFFSPKEGGFFFSPPKKRVFFH 

MPDPf T7PPPPtlPT?MOPODT>Ii'PL'li'L l PgT Tf 

NPFJ&GJr rPPFPrrMSF r r r r r r c SLF 
FFFFFFFFFFFFFFFFFFFFFFFFFSFR 
1 1 LLNQ KKNLS ACVELRS CCTFM 


4661 


18562 


A 


4692 


1 


176 


SRRLNPGGRGCSEPRSHHCTSTWSTERD 
SVSKKKKKRFFALKFLGEANVKPTIGGQ 
IvN 


AAA") 
4©0x 


1 Q<A1 
10 JO 5 


a 
a 


4693 


1 


403 


0 KaEIoKKIjGIjIjKQTV SQ WNAKEKFIiK 
EVKSATPMNTRMIRKRNSLIADMEKVLV 
VWIEDQTSRNI PLSQSLIQNKALTLFNS 

HFHNI KAQGEAASADVEAAAS Y 


4uOJ 


1 QCAA 

IojOh 




AAQA 

40V4 


J 4,3 


42D 


WP17T . C T .T>T .fJ Q l?T? AT TMVPH17 ITT ^V*4T .1? 


4664 


18565 


A . 


4695 


395 


2 


KNKGLFFFFFLRWS FTLVTQASNVTTAV 
CJN v I XliAtlvJiNijmjiroooJNoFAOAoKVAO 
ITGSCHQG 


AAA* 


1 Q<AA 




4696 


429 


0 


KKARG 


AAAA 
4000 


1 9^ An 
J 5 jo/ 


A 


ACXS1 

4oy/ 


371 


217 


KKIEYQHISLYKKPSKRCAIiFIjI 


AAfn 
t *OOf 


Jojoo 


T\ 
A 


4oyo 


41j 


jy 


KRGGGLEKKKKKGFFAPYPPH 




IOJO7 


7V 
A 


/I /COO 


ill 1 


A 

u 


T.tTVTT?DD/ttfibvt.GDGTlDDT)QDDDT3nVDD 
J-iJVIvXf rrouKV liolroirr' if ir if tr ir tr ir tr C irif 

P 


4669 


18570 


A 


4700 


371 


245 


TLGDPPASASQSAGITGVSHHAQPLFGI 
FIYKFTYICRKTRT 






j± 


4/U1 


4IU 


U 


KQCWFTQKKKKKKKKKKKKKKKKKKKKK 






DRG 


4671 


18572 


A 


4702 


340 


402 


P FAGGG YRLGAAPE EESAYVA 


HD/Z 


lOJ / J 


7V 
A 


4/UJ 


loo 


L 


TGTiPlfrWF'YPMHDDr'DfiHTT T TI?T,PT?T1? 
X 0 JJ IT D*\~Tt E» x rvririf if ^.c ATI 1 XjIj JL V llEE ±S 

VDMGSCHVAQVALEIiLGSSDPPTSASQS 
AGIIGM 


4673 


18574 


A 


4704 


423 


34 






isjmm v ijvjVjin x JMi v vj<jAy± JJV7 1 lU\J_i\-.i7 0 r r 
YRVF 


4674 

• 


18575 


A 


4705 


2 


.297 


PRVREREKEMASMKEEFTRLKEALEKSE 

DNLADAEERCI^LIKNKIQLEAKVKEMN 
ERT.EDEPEMNP *?T.P 


4675 


18576 


A 


4706 


2 


179 


QENGMNPGGRACCE PRLRYCTS S WATER 
DSVS KKNEKKR S AS LRR FD I L FGALTGR 
KAS 


4676 


18577 


A 


4707 


432 


578 


PRIVCFQELRKPSYAEICQRTSKEPPSS 
PLQPSSSSSSSS 


4677 


18578 


A 


4708 


419 


35 


GSFFFFFLKDQPKKGRKKEGCPPPPPKK 
IIYPPPATKWGGGGGKKPPPPKKRGPPK 
KNRGRKKKPPKKKKKKFPPPPKKRAPQK 
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Predicted 

beginning 

nucleotide 

location 

correspond 
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UCUUUlf 
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nucle- 
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Ulllg IV 

last amino 
acid 

residue of 

nentide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leudne, M=Methionine, 
N=Asparagine, P^Proline, 

T=Threonine, V=Valine, 
W=Trvntonhan Y— Tvrnsin^ 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion l=nn<i<;ihlf» 
nucleotide insertion 














KKKXPPPPIFFYIiYFFKNFIFFFFFFFY 
FFPFFFFKFIFNLKKF 


4678 


18579 


A 


4709 


147 


31 


NFFFFFFFFFXFXFFFFFXXXKFFLFFF 
* FFFLKLKIFF 


4679 


18580 


A 


4710 


3 


229 


HASAHASDQPERLNAGTYFLFYTLVGSL 
PliLIALIYTHNTLGSLNILLLTTiTAQEL 
S KKKKKGGAVLKNP VJGAQ S 


4680 


18581 


A 


4711 


I 


158 


PTRPPTRPRSCSELRSHHCTPAWVTBRL 
OuKKKKKKFPNLSTCFKILGILKG 


4681 


18582 


A 


4712 


163 


404 


KKKKKKIQCKKKKKKKKKKKKXKKKKKKP 
KKKKN1000CKKKKKK3CKKHPSRGGAQKK 


4682 


18583 


A 


4713 


3 


378 


FEEPVYI KSRQKRKESNPPKLVSSQPHG 
KKPGGAKNKPC^EKTO^FLKGf^KKNPW 

XVXVXT \J\JLt\ IVJ^I i\Xv7vJ i— t XV-TA-L>I L JL JJXVvjvJTVJ EXXVLv JET VM 

GFFKKKTFFGGGKIGAKPPKKKKSLKKK 
KKFLRGKGGKKT 






j± 


*+/ it 


L 


ZUo 


VSNPAVSVPHTjPFSVYKSSPMASMTF*? K 
KKKPWGAQNKPG 














4684 


18585 


A 


4715 


290 


56 


SAPPPIFFIiFFFIiFLFIFFIFFFFFKQI 
LEKKGGPPPFFLFFFFIFFFFFFYRMGC 
DRWWFILIOGTFHRKOKPVLKV 


4685 


18586 


A 


4716 


50 


424 


GGFKIKFLFTQFIiFIiLFSPSMFIiISKSP 
aylwossprdvpmvvfgktnfiivsieck: 
KBGiyFFCIPFVPFGFPHPKILCSIiliPF 
LAFLRRS LALL PRLECSGAI LTHCNPCL 
PGSSNSHAWA 


4686 


18587 


A 


4717 


161 


2 


GRPGPADFRVRPQLLQRFLFIYLFTEME 
SCSVTQAGVQWCNLGSLQPIiPPGIjQ 


4687 


18588 


A 


4718 


2 


115 


vyttamaieeypeawgsgvevlvsvlvg 
iamevgstrp 


4688 


18589 


A 


4719 


1 


416 


GNQGGGYGGGYDNYGGGNYGSGNPSSSS 

ssssssssssssssssssssssssssss 
sssssssssssssssssss 










i 




fiPn<3fiT<?QTRQFITTTT.PAAAnnVYVRTVR 
IGNVSTSHLLILDDSVEMEIRPRFPLFG 
GWKTHYXVGYNItPSYEYIiYNLGDOYATjir 
MRFVDHVFDEQVIDSLTVKI ILPEGAKN 
IEIDSPYEISRAPDELHYTYLDTFGRPV 
IVAYKKNLVEQ 


4690 


18591 


A 


4721 


3 


173 


DAWAGFHYVGQAGLEIiLTSSHLPASASQ 
SAG ITDVSHRTQIiDVPFSFSLS PLPHQC 


4691 


18592 


A 


4722 


d* 




IiKPGGGGCSEPRSCHCSTPAWRQS ETPS 
QKKKKGKWGLERIGWGGSTTNNGAKNQR 
GEKFGKKGGF 


4692 


18593 


A 


4723 


2 


183 


LKPGGGGCSEPRSCHCSTPAWRQSETPS 
QKKKKGKWGLEIICWGGSPTNKRATNSP 
VEKF 


4693 


18594 


A 


4724 


1 


395 


QDATSPIIEELITFHDHALIIIFLICFL 
DLHALSLTLPSALT 


4694 


18595 


A 


4725 


388 


3 


SFSPLEENLGNTIQDIGRGKDFTSiCrPK 
AMATKAKID 


4695 


18596 


A 


4726 


223 


44 


WIiFFFXPLFFFFFWGGIIFFLLKKLYIF 
FFFMSKI FFFFFFFFFFFFFFFFFFFFF 
FFFF 


"4696 


18597 


A 


4727 


41 


374 


KKKKKGKKKKKGGGGGGNPKKKKGKMEK 
PPGGNKRGEGGKKKNFLKKKGGGLFKGK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etfa 

od 


SEQID 
NO: in 

ussn 

09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny!aIanine, 
OGIycine, H=Histidine, I-Isoleucine, 
K*Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=|j|utamine, K^Argimne, o— oerine, 
T=0Threonine, V=Valine, 
w— i ryptopnan, y— tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














KPPKKKNGGRKPWKKKNFKKKKKKAPFF 
IiGPKI FKKKKKRGLKKKLGFFSQSVGP 


a can 




^ 


4/2o 


323 


69 


AGFQLFLQPPPPGSKQFGRLNLLGNKDL 


4698 


18599 


A 


4729 


3 


197 


CriAIJ^HLSRQRQYALPLLGLPGCPHIHT 
rlorl J. aAautiX Hz lb PiiAHKHAHlnTHMH 
TLPYTTHCI 


4oyy 


looUU 


7V 


4/iU 


309 


AA1 

443 


VIjX JUo trKXuSUoorUH XAnxoijiLLil^jooDH 

PTSASQSGGI IGMSYHA 


4700 


18601 


A 


4731 


3 


172 


DAWAFFNGPGVSLCCTGWSRTPQLKRST 

/IT OT Olf f'OTTVVDDTPDT CDAT5VPT povtrr T 

t^oJbFlvUWJJxKFlSrijoP/^ x VXjIi 


4701 


18602 


A 


4732 


98 


232 


QTVFLLIWLLALCSPGVLQTVKRFLEFR 
CIFSIiGIVTFICLYRN 


4702 


18603 


A 


4733 


238 


1 


LGPPPPSPLGPPPFLVIFRGFSFPKTPL 
FFSPFFFGPPPFFFFFFFFFFIRRSFAL 
VTQAGVQWRDLGSLQPIiPPGFKR 


4703 


18604 


A 


4734 


121 


424 


LIQGHRWI YFSQLRKVRGRQACDSTLL 
GHLPKWRRMFPFLSFTVAGLEPTSHYR 
MFVDVVIjVDQHHWRYQTGKWVQ CGKAEG 
TMPGAR 


4/U4 


18605 


A 


4735 


319 


8 


np pqkkkkki ffp pkkktp pkki kknp p 
PFTIFFFFFFFFFFFFFFFFFFFFFFFF 

PPVPAUTTSTD/tPPT TTTTTT Li»T T"C*tTT DT H W 

Fr xFQHVWFCFBI IHVIFFIiIEVXjFIiAF 
HVGDS IADAWSTKNSNRTK 




ioOUo 


7\ 
/i 


All C 


297 


442 


xQCjr rGwrRKIGr r rRGLKWGGGGNPK 
LRFLKPGQLVGRGKTKKNQG 


4706 


18607 


A 


4737 


430 


0 


CVIiGSKDKKKKKKKKKKKKKKKKKKK^ 


4707 


18608 


A 


4738 


137 


1 


INYYFFLKQGFPLLPRLECSGAI iahcs 
LEFPGSSDPRPSASQVA 


4/Uo 




A 


4739 


188 


3 


I b FKALiN PKKEVGP 1 FPPPKKKVP PQNP 
PGGFFTP PHKEKNFSLPPPVNLGP PKDP 
LKRPP 


4709 


18610 


A 


4740 


188 


2 


TSLPKCWDYRHEPPCPAHTIiLIFLFFIF 
VDMGS CHVAQVALELLGSSDPPTSASQS 


4710 


18611 


A 


4741 


410 


i 


RRPGGGFCPPPPPKIFFSPPPPPFFVGV 
LPQTPPPPKKFFFLNPPPGFFFPPPKKK 

lTTT?t?17GT3'DT?I3 , Ol3DOT DODI3DTinm7PDOD 

Mrr rolrirc If br irJr J. r ifr frrFjk'Pr ttrr 

FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
GKDRVSLCCSDWSRTPGPKPSSLL 


All 1 


loOlZ 




4/4Z 


22j 


AAA 

444 


S FDEAALLFFQNDTYWPGTVAHACNPST 


4712 


18613 


A 


4743 


2 


452 


VSNPAVSVPHLPFSVYKSSPMASMTFSK 

WTTKTP 
IV-XViVLNIx 














4713 


18614 


A 


4744 


3 


144 


IiAAVLLKLGGYGI IRLTLILNPLTKHIA 
YPILLIiSLMPDIITGFSS 


4714 


18615 


A 


4745 


206 


2 


IQNPQTERVNSTSKLKISSKDTINRAKK 

r\Tiiurr»T»n3"T7Tr^wTJTT?*^ , Tru , r TOT3n/rvtrn»r #^\t 
yPNUWiSV lv_-KJ\ri J. e vjivr»Li±oKM x lUSLiQij 

NNKKQIVPPTRP 


4715 


18616 


A 


4746 


523 


178 


RHRRFFMDIEREQVKEQQRQKEQKKDIE 
KIKDKREQECYAAEQRILRMNFHEDPYS 
GEKLSEILAQLQLQEIKGTREKQQREKE 
YLRYVEALRAQIQEKMQLYNITLP PLC 


4716 


18617 


A 


4747 


3 


518 


ENRLNLEGGGCGE PRS RHCTPTWAT 
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SEQID 
NO: of 
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sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

tng to first 

a till II U aUU 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

oCv| Uvllvv 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Gluta mic Acid, F=PhenylaJanme, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leurine, M=Methionine, 
N=Asparagine, PMProline, 

rV=T^l ritflminp R=A rcrininA .^=Q<>ri nf* 

\£ \JlUUtUIIIlC, l\T"flLI gllllUC, iJ^OCI HIC, 

T=Tbreonine, V=Valine, 
W==Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 

II UUICUIIUC IlliCi UUII 


4717 


18618 


A 


4748 


491 


145 


LCHVGQAGLELLTSSDLPTLASRSAGIA 
GVSHRTRPHSHFYFYFLYSSSFFFFFFL 
AQRGNRAPRGKKT P FGLATLN I KSNGGV 
FWFLFLGFGFLKRE PTFALGLKDRGGNL 

v 


4718 


18619 


A 


4749 


6 


362 


NKLPNGTEFLSSLVPNTFFFSFLFFPFF 
FFFFFWGKGGPPRGGGAKFYPLEFPPPG 
VPPPPKGGKKGGPPPPTNFLFFKKEGGS 
PFGPGGVKTPGPKGFPPPSPQRGGKKGQ 
WAQKPP 


4719 


18620 


A 


4750 


380 


3 


S LLLAFALLCL P WLQEAGAVQTVPLSRL 

FDHAMLQAHRAHHIiAI DTYQEFSETYI P 

KDQKYS FVHDS QTS FCFSDS I PTPSNME 

EPQHKSNIiEIiliRISLLLIESWLEPVRFL 

RSMFANNIjVYD PS 
iwnr rum u v iAJiro 


4720 


18621 


A 


4751 


1 


345 


LFYI FKKTWSLFLCFLWMMCVFI FFFFL 
LFVFFLCLGGILVSFIPPGGEKREKKKR 
RGGEKKKKKKNVVFKKKKKNPWGGGGGL 
KKQRARGGKKKKKKGGGGSGGGKKKQS P 
LRG 


4721 


18622 


A 


4752 


1 


407 


QEFEETYIPKDQKYSFIjHDSQTSFCFSD 
S I PTPSNME ETQQKSNLELLRI SLLLI E 
SWLEPVRFIiRSMFANNLVYDTSDSDDYH 
LliKDLEEGIQTLMGVRVAPGVTNPGTPL 
ASRAGGEKYCCPLFSNKARDOEN 


4722 


18623 


A 


4753 


357 


1 


PLFFFKRNLFFFFFFTKKKEKKAPVFFF 
FPPPSREWGGFFFRVGGVNTQFLHPFFF 
SONTSVFFSTLFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFLFYCFIYFSR 
DRGLTLL 


4723 


18624 


A 


4754 


1 


207 


LGLAGVEEEVASRLNAGGRGCSKPRSHD 
WT P AWAME ODSMS KKTKKfEYORROLIHL 
QSSTSGFLYVHEQ 


4724 


18625 


A 


4755 


2 


336 


HEERERERBRERERERERERERERERES 
RAOFRAR KNSQAPHPRVCETIiWTEEGGS 
FYPLTEKDFAQCERAHFCVLGVSHPPGW 
6LCKKMNPECERATHSLHGDADISGVL 


4725 


18626 


A 


4756 


230 


446 


VGFSSLFSRYCTILISASSLTKELCEKK 
RS FLF C FAFWLLGFLRQSLALS PRLEYN 
GAI SAHCNLHLPGS SD 


4726 


18627 


A 


4757 


3 


390 


QTSFCFSDSGPTPFNMEETQQKSNLELL 
RISLLLIESWLEPVRFLRSMFANNLVYD 
TSDSDDYHLLKDLEEGIQTLMGRLEDGS 
RRTGQILKQTYSKFDTNSHNHDALLKNY 
GLLYCFRKDMDXVETFL 


4727 


18628 


A 


4758 


234 


387 


LAKIFLSFSSATGWIENRPQSPATGRTP 
VFVSPIPPPLPPPFPFAFVT 


4728 


18629 


A 


4759 


396 


0 


I YTHTHTHTHTHTY I YMYVFMI I 


479Q 




A 


Alffi 
*t/OU 


1 /O 




MVIPLFNKDTQHI I LFWGYVLIFFFLKR 
NFVFVAQAGGQGRYFGSLKPPPPRLKPF 
SCLSLLGTWDYRR 


4730 


18631 


A 


4761 


419 


208 


IYPPPPPWEPKGGSFIaKKKKKKKKPGPG 
VPPFIPPPLGGQAGGFPNSKIQIPPAPQ 
GKPLFFQKITQNYLA 


4731 


18632 


A 


4762 


158 


2 


GNQUJIPPPPSGSAFCFLFFLRRSLALVT 
QARVQWHDLGSLQPSPSGFKRFSC 


4732 


18633 


A 


4763 


489 


398 


AGFEIiLTSSDPPASASQSVGITGMSHCT 
QP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

alllllJU add 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
uing to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
v=vjiutamine, K— Arginine, S=aertne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Vn known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


4733 


18634 


A 


4764 


420 


1 


FFFFFLGEI FPPGGKGGVFPPPPLKKFF 

UCDDriT PITT /V^O^BXTLf i mTtwm ti9t 

co trKfjuifi* IAjGGv? PN F FP PKKRVF FKKP 
RAGFFPPPGVLKRGPGPGFLTPPQKEKN 
FFFPPPGVLGPPKGFFKGGPPFFFFFFF 

r r c JilvIUUUSJUuUjj; ivi KJ^LAjC^T V RKA 


4734 


18635 


A 


4765 


167 


3 


I YFFFFLKKI FFYKKKINKLKKKKKKKK 


4735 


18636 


A 


4766 


424 


222 


FQNIHKKNFPSPPLLKTGPAPLFKKPPK 
FFFFFFFFFFF 


4736 


too J 1 


j± 


** /Of 




All 

42J 


V X xw W AKKN KJM K.WUW PviAV Abo CN P S TL» 

GGPGQANPLRSGVQDHPGQHGETLSLLK 


4737 


18638 


A 


4768 


363 


0 


KTTTSQARGPTSVIPAAQETEAGDSSNP 
crvrcvcccecoon 


4738 


18639 


A 


4769 


51 


169 




4739 


18640 


A 


4770 


188 


3 


ISPKALNPRREVGPIFPPPKKKVPPQNP 
PGGr rTPPQKJS KNFTLP PP VNIjGPPRDP 
LKRPP 


4740 




j± 


*»//l 


i 


141 


DELGSLQKKKRGGPF 


4741 


18642 


A 


4772 


363 


452 


RLECSGMIIXHCSLNLPGSSDSPTSASQ 

lift. 


4742 


18643 


A 


4773 


121 


1 


RPRRPVAQAGVQWHNLSSLQPIiPPGFKQ 
FFHLSLLSSWDY 




1 9.ZAA 
IoOh** 


7\ 
A 


ATI A 


1 1 
J i 


226 


Mlb bQIiSI 1 PTPTQEQGLKiCJCKKiv^^ 














KiOCKIO<KKKKKKXKKBCKKGGG 
GAKFLGGEKKNFFFF 


4744 


18645 


A 


4775 | 


161 


1 


KPPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFLFFFFLKKSWRP 
WGLKSAPAL 


4745 


18646 


A 


4776 


394 


67 


SPPRGEGGVAPPPPPKNFFFPRAVFFLG 
GGGGPMPPPQKRGFFPKTPGGVFKPPPK 

nfTtrv? V T? n dd/"* wv* o d m? v wr* t% n rs r» ts w o 
i^Kjr J\lf rrr f JAjlSti\3 f f fcMi r r Kxi Jr Ir PP r r r 

FFFFFFFFFFFFFFFFCFFFFFFFF 


4746 


i out / 


A 


Ann 


*> 




«vtir ovjrlJ\xllNl\AirljProvlilJ>iy I INS 
KKT 














4747 


18648 


A 


Ann 


456 


0 


FFYKKKKKKKKKKKKKKKKKKKKSSS 


4748 


18649 


A 


4119 


438 


0 










nn 


4749 


18650 


A 


4780 


58 


330 


S P FT QLKKKKKKKKKKKKKKKKKKKKKK 
KKKKKK3CKKKKKKKKKK3CKXKKKKKK^ 
KXKKKKKKKKKKKPQQKGGEKKKNPGEL 
GKKNCGGGGVLVGKPHKEKKRGGGKKKN 


4750 


18651 


A 


4781 


435 


73 


FFPPPPRGKFFFKKPPGGKLFSPPELGV 
FFPPFPbKIFFFPSTPFFFWGGFPPFSP 
PPKKVFFPKFPPGFFFPPPLKKKFFFSP 

FFFFFFFSP 


4751 


18652 


A 


4782 


420 


109 


IjFKKFFSPTEREGVFPPQPPKNFFFSPN 
PLF FGGGGAQNFPPQKKNFFPKNPPGFF 
FSPPKKKKKFFS PPGKILAPPGYFFKGP 
PPFFFFFFFFFFFFFFFFFF 


4752 


18653 


A 


4783 


230 


2 


LGLWGAIiYSPFLGGLGGRKFLLPWRPR 
VPLTPISPPPSRLGGPPKKKKSQAWWHA 
PWPTTREAEAGGSLEPRRL 
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SEQfl) 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

Am in a nrirl 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
\£— vriuimuiuc, i\ /\.rginine, o- oerine, 
T=Threonine, V=Valine, 
W=TrvntoDhan Y— Tvrn«inA 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


4753 


18654 


A 


47R4 


971 

All 


44fi 


CGPSTLGGRGGQ IMRSGVRDKP YRHGET 


4754 


1 OUJJ 


A 


47RS 




ZOO 


KKYG 1 1 1 PCS FFLS LFLKTiOGTi APT . PP T . 
ECS WFAAHRGLGLLGWS DPP KSAFLVA 
GITGMHHHAWLI FKVQFSNLMGP WS YTW 
SAVD 


4755 


18656 


A 


4786 


216 


435 


HP^GDSSLKKKKKKKKKKKKKKKKKKKK 
KKKKKIWEFWKGPFLWGGKKRKSGGG 


4756 


18657 


A 


4787 


249 


437 


MNNBVVPLKTKTSRPDVVAHACNPSTLG 
KYLAWSV 


4757 


18658 


A 


4788 


107 


368 


AHGSLDFLPGGPKKSHQWFPKPKDGKPP 
PF<5KifPT.TfKTrTPGinrf3ra^PTP<^&pQinr 

GGYPRGGDPGGPFFFKKPGQKKKTPFLL 
RRD 


"/JO 


IOOJ7 


A 


47R0 
*r/oy 


i 
i 




PTI?PPTTNNTRPAi?PFYYrTTPT.VT .KT.TA 
LAOTFJ^LLTTLDLNxTjTNKIJCI KSPLC 
AKKKKKKKIKKKKKKKEJEG ' 


4759 


18660 


A 


4790 


2 


216 


SGRNSRVDGRQEAACRDYQSSLEDLTFN 
SKPHINMIiTILAEENLPFAKEIVSLIEA 
OTAKVT? IHPAAS YYF 


4760 


18661 


A 


4791 


389 


30 


BKKSPPQKKKKKFFPPPPQKKFFSPPNQ 

PPPKKKXXIFPPPQKKGPPPiaLKKPPP 
PFFFFFFFFFFFFFPFFFFYFSt»FFSGG 
FSYKlFIi 


4761 


18662 


A 




i Vt 


J 


LFPGFFMPGVKGEKGISFPP FFFFETR <3 
HSVIQARGQWRRLGSLHPLPPGFKR 


4762 


18663 


A 


4m 


550 


413 


AEMGFHHVGQAGIiELLTSSDPPAIjASQS 
agitgtshraophrpic 


4763 


18664 


A 


4794 


456 


352 


RSQDGLYLLTS S FACLGLP KCWDYRRE P 
PCLAVFF 


4764 


18665 


A 


4795 


242 


3 


KPFSGGGGAFQTAFFITFGKKFWSRLG 
LLKKKVFLKFLPQAFFFFFFETESHSVA 
QAGVQWRSLGSMQPSPPGFKRLS 


4765 


lOUVJU 


A 








G F C PVFFTV<? T.CP PG ZV WP <3 P T .T & T 

SASRVQAI ILPQPPE 


4766 


lowO / 


A 


47Q7 






TCFPFNnGTGGPFPPPPPJfNFPPQT.SrDPT? 

FLGGFSPFFPPPKKIFFSKIPPGFFFSP 
PLKKKIFFFPPRLILAPPRFFFKAPPPF 
FFFFFFFFFFFFFFRPI 


4767 


1 OUVJO 


A 


479R 


1 
1 


IZAJ 


I LIKIiSSTWEGIOAGKETiREOHG THCI^M 
TLLFS FAQAVACAE 


4768 


18669 


A 


4799 


01 

Oi 




I LFFFFFFPGGGTP KKSOVFFKWPOMP T 
GKPTRMGKGEKPPPPRGKKKKKGGPPTL 
GKNKNPVPGWPGAFLPPLEEKKKPFPGP 
PPIiKKKK 


4769 


18670 


A 


4800 ; 


1 


99 


GCSEPRSRHCTLSWATRAKLHLI KKKKK 
KKKGA 


4770 


18671 


A 


4801 


238 


5 


NHQNS VQNYFKNLRI FQLI YSLSMNLPR 
SS FKLKS MTHfl T S F FT .F<3 F FR T H Q uqu 
AQDRIKWHDLGSLQPQPPGLK 


4771 


18672 


A 


4802 


166 


62 


I FLFS FFLFLPJ3RVLLCYVGWSVMVQST 
QTPGLK 


4772 


18673 


A 


4803 


102 


6 


AASITUS PS VAHAGVQWCDLGS LKPGSR 
RFQ 


4773 


18674 


A 


4804 


17 


374 


FGSRAATLFFFFFFLGRGGFFLGPRGPK 
KKTPGGEGFTRFPDPGWGKGPBKKTPPP 



841 
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SEQU) 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, {^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methk>nine, 
N=Asparagine, PHProline, 
v/=v*iutamine, KFArginine, o— oenne, 
T=Threonioe, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop cod on, /—possible 
nucleotide deletion, V=possible 
nucleotide insertion 














WGGKGG P KKMNL WGAGKI KFLE I KGKS F 
FSFKLKN 


4774 


J OO/J 


A 




so 


1 / j 


LKKKTGGGPN 


4775 


lOO/O 




HOUO 






C QfUVT.YQ P A DT Y VCVUYVT.TVD OVT> T TTTC 
OOU V XjX OXsAcrJi irV^VTyluXf K^ViJlIvlo 

HLPQEAE 


477A 


loO / / 




HO\Jf 




i 
i 


j/CTfl WT/2*PM 2VXJ C /"WD CTT /JTAn^TTD C 
A*5 X-MmX/Ij X PXHktto CAM ro X XAjtjyiJVjVi A J- Kb 

GVRDQPGKHGETP 


4777 


18678 


A 


4808 


319 


406 


LYKSYSSWP1WVAHTCTPNTLGGQGGQI 

1 


4778 


18679 


A 


4809 


236 


351 


GFWPAPGVHACOTSILGGQGGRVTGPKIj 


4770 


loOoU 


a 


45IU 


1 /J 




T7TIT "DT rPDOTT TOT T SMUIKTVT T VT TAVDIi 

FVAQAGVQWHNLSSLLPPPPRFKRFSCL 
R 


4780 


18681 


A 


4811 


2 


94 


TRILKVWQKRKCS VKNGFLTISHGTVS I 
LF 




loooz 


TV 

A 


481/ 


358 


2 


ecv r r SETESCS VAQAGVQWRDLGSL 


4782 


J8683 


A 


4813 


362 


3 


YSEVCIISLQQAHDSI^VINFHLAECVI 
QLPVNLREIiPGAVAHS PLGSELCFSDSG 
SVPSHTRSFPKAAIRLTANHRRGCTESG 
WPFSLWLFERCMWPGTVAHTCNPSTLG 
AQGGQIT 


4783 


18684 


A 


4814 


341 


1 


WNQKRAHITKSILSQIOTCAGGIMLPDFK 
L YYKATVTKTAWYWYQNRD I DQ WNRTE P 
SEIMPHIxTJYLIFDKPDKNKKWGKDSLF 
NKWCWEIWIiATCRKLEIjDPFTjT PYTKIN 

s i 


4784 


18685 


A 


4815 


2 


283 


VYTGBEILQKKEITIRDQEAPYLLRNLS 
DHTVAISSSTTTJ3CS0\NGVPBPQITWFK 

sssssssss 




loooo 


TV 

A 


AO 1 C 

4816 


266 


3 


AAb aIjQAHRAHQIiAIDTYQEFEETYIPK 
DQKYSFLHDSQSLRQPLHTHTGSVGTRT 
LAASLSSGRTAHTRTHMHTQHTRKRTAH 
TNA 


**/oO 


J 000/ 




4oi / 


Jo J 


I 


& r r v rroii 1 Js£>Ko lO JcLEi? i>0 xXoAHCN 
LRLPDS SNS PASA 


4787 


18688 


A 


4818 


87 


372 


ASYKKDKYCMIPLIRVIKFPQTERKKWL 

ETEPPSVTQAGEQWCDLSSLKPLPHRLE 
RVSCTj5? T.P <?CJW 


4788 


18689 


A 


4819 


5 


175 


FAFTiTI PLSRLFDNAMLRARRLYQLAYD 

s 


47RQ 


IOOtV/ 




4oZU 




jj i 


r j? aj: x? X? &c r jvr r rrrrrrr Ar X7 r A.r aj? r 
XFFXFFFFFXF 


4790 


18691 


A 


4821 


2 


155 


IFLlxLIESWLEPVRFLKSMFANNLVYDT 
Q HQ nn YWT .T .TTHT ."RPflTnTT MAP T . 


4791 


18692 


A 


4822 


90 


2 


LFFFFFETESPSVTQAGVQWHDLGSLQP 
P 


4792 


18693 


A 




947 


3 


KVKGS PSHRRAMVAKGNLSLHERMKS PG 
ISNYMGNSKILFSYFiOTC^PGTVAQAYN 
PSTLGRGGWITRSGVRDQPGQYSET 


4793 


18694 


A 


4824 


93 


407 


QHP KKKKKWGGRGPLKNKEKVFKHKTFN 
FFSRQKKKKKK3CKKKKKKKKKKKKKGGG 
GLTjKKFWGGPKFPGGKKKFFFFFGGGKK 
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NPPGVFLKKNFFWGGEKKKKK 


4794 j 


18695 


A 


4826 


2 


129 


TTAHCSLNLPGS S DPPTS RS LVAGTTHV 
CHNAWVNFVFCRRG 


4795 | 


18696 


A 


4827 


342 


140 


GGFFFFPPPPFFFKFPLFIiPPPPPPGPS 
H\in£X\JC\YVPTXr\z P PPOT TjPPTjPP P AP 

PPETDNPFFFS 


4796 


18697 


A 


4828 


1 


z3y 


RGKGKKKIGKGGFQRKFKGPNPHGRFLR 
ESHNSPKKGFPKKFFGNPPPGTSKKVPL 
PH 


4797 


18698 


A 


4830 


125 


1 


T .PPfiPR P T .TCT ,<> T • 


4798 


18699 


A 


4831 


1 


323 


ARGEREREREREREREREREREREPPPL 
QRCJGQ E Q YT.KS AWREPIiliLERBPIiFFFG 
GKMGNLFPPPPPPVGRGYTGGGLITGGP 
FWWGECVSHTHHTHTHTHTYMSV 


4799 




A 


Ann 
453Z 


3 


707 


TRRKRERRRERERERKRTIj 


45UU 


1C7H1 

lo/Ul 






i 
i 


IJU 


ARERERE REVERE REREREREREKKKKN 
HSRGGGPGPQFPIiGGGVKKFGGGV 


4501 


lo/Uz 




4534 


1 
1 




ARGERERERERERERERERERERERERE 
RERGAGGGPT 


AQfYJ 

4olTz 


15 /U3 


a 


453 J 


L 




HEERERERERERERERERERAAR 


4803 


18704 


A 


4836 


2 


73 


PEERERERERERERERERERETAR 


4804 


18705 


A 


4837 


146 


2 


FGLVYSISQFCLFFVRRSIALVTQVGVQ 
WRNLGSLQALPPGFTAFSCL 


4805 


18706 


A 


4838 


354 


U 


zr it c if tr trxr ir zr t ctrtrcc^Otririrci^tf trOlr 

SSSPP 


4806 


18707 


A 


4839 


62 


358 


uuVfuJAXldirnv3Vyv»odiijri viiirojjwiror xr x 
FSTKSPSVPPARALWPPDSRWALQVGHL 
GLTPTTRQESAQAPRRKLALLEGSFQGQ 


4807 


18708 


• "h. 

A 


4540 


Ml 


13 


R PGRP PPE FTP FSCLSLPGS WDCRRP P P 
RPANLLYF 


4808 


18709 


A 


4841 


3 


221 


LVMGFVGFSSKPS P I YGGLAL I VSGWG 
PVT T TiNFGC5GYMG1j TVFLI YLGGMMVGF 
GCILDFSLNENVGLNF 


4809 


18710 


A 


4842 


3 


84 


CHCTPAWTANLDS I KKKKKKRKKFKN 


4810 


18711 


A 


4843 


33 


230 


LTVTISKAAEVLKHKKKKKKKKKKKKKP 
KKKTPKKNPNPPPQKGELGKKIFFFFWE 
p PTtPfiinrp p 


4811 


18712 


A 


4844 


3 


368 


HELNPRGRSCGE PRPCHCTPAWATRAKV 

KETPWSPQNFEKKGPRRFSPRGKNPPLA 
IiRGKKKGGKSLSGKIFPPFPRGKTPPWA 
FMGTQKGEYS 


/lorn 


15/13 


ri 


A9A^ 
4543 


103 




HHGHS I S KFLLTVS S PKS FNQLRPDAVA 
HACNASTLGGRGGW ITRSGVQDQLV 


AO t*> 


18714 




AO AC 

4o4o 


357 


OIA 
Z34 


KGGGEPRS CHCPPAWATIAKLLFKKKKT 
KPKKNPKKQEKTFPYNLNRVCY 


4814 


18715 


A 


4847 


189 


349 


KFSRFSGQKALLLKD I LRLGTAAHTCNP 
STLGGQGRWIMRCQVFKTSIiAKMVK 




15/ JO 




4545 


J 


OS 


HENFYCPGS ALGRNEGNI FPNPEATFVK 
EM 


4816 


18717 


A 


4849 


2 


325 


DSHTI IMGDFNTPLAILDRSTRQKVNED 
IQDLNSALHQADLIDIYKILHPKSTEYT 
FFSAPHRTYSKI DHKI YSQGNHETDCRY 
GKKGGGWKDERFPDVDCEKFKSQ 
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4817 


18718 


A 


4850 


241 


348 


SFPSQTQASEQLTKSKQGLAIiRTLPPVQ 
FPPPPPRP 


4818 


18719 


A 


4851 


2 


176 


LVETG FCHVGOAGLETLLTSGE CWD YP C F 
PPSPAYCFLHTIVTIYPLLCLAFFSYLC 
DC 


4819 


18720 


A 


4852 


2 


191 


VGRVGLELLAS SDLPTSGS PS AGITGMS 
HCPWPNFLLI FIGYFFTVI/rPCFYV<;ftfi 
MTVPLWL 


4820 


18721 


A 


4853 


67 


376 


IKCTTPNIKKKKKKKKKKKKKKKKKKGG ' 
ALKKKPWGGQKKTGEKKKKFFSKRGGKK 
KRGPGRVLKKGEREKIFPWNFLKKNFFW 
GGGNLGQ P P P KKLRALGKK 


4821 


18722 


A 


4854 


278 


363 


I WPD WAHTCNPS TLGGQGR WTMRSG VR 


4822 


18723 


A 


4855 


3 




RGCS E PHTiR YPT P AWZiTRfin Q A Q V V WW 

KKTQI PWEGPGGGVSPFKPHFFKGPGGK 
KPWGQKFKNPPPQWEKPLFFFKKKKTKF 
SPGGGGPP 


4823 


18724 


A 


4857 


382 


224 


PVIQLIGGVRQEKNFNDGVKGFNEPKS P 
PCP PTWVAKONS LSKKKKK1C TGKK 


4824 


18725 


A 


4858 


392 


102 


FFFFFFLRQGLSWQARVQWCFCSNLGL 
LGSSNPPTSASLSCFLNKKEYRWEKVTN 
KV 


4825 


18726 


A 


4859 


3 


111 


FHRISQEGLILLTLGSAHLGLPECWDYR 
REPPRPA 


4826 


18727 


A 


4860 


282 


83 


AQIYKVQNWGSSGNKCICVCVCVCVCVC 
VCIYIYMLOISRFKYKCIVGCMNIDSYV 


4827 


18728 


A 


4861 


3 


417 


NHSNLGGRGCSDPRSRHCTPAWVTERDS 

iw*vivivsvx>av*\-Evl\.x x ruxxCvILxlIVXiXlJJb tfsLfij 

KKNENSGFPLNPNFFFGGKGKKATLGPIi 
GLVLRGVPFFRGGETONGENOICPGTTTj f 
KFEKHFWVGPPPLEKISPHKGKQTQK 


4828 


18729 


A 


4862 


1 


88 


MEEMYKKAHAAILENPVYEKKPKKKKKK 
K 


4829 


18730 


A 


4863 


274 


369 


VSLCCAGWRAAVRSQITAALISRAQAIL 
PPQP 


4830 


18731 


A 


4864 


37 


444 


DPRVRSEKKIPFKILLLIDNAPSHPKAL 
MEI YEE INVI FMPANTTS IIiOPMDflfiVT 
SxTKSYYIJOTFHKALAAMDSDVSDGSG 
QSKLKTFWKGFTILDAIKNIRDSWEEVK 
LS TLTGVWKKL I PTL XDDYEG FKT 


4831 


18732 


A 


4865 


341 


210 


VYMCHVGLLYHQPVIYIRNRIEASEITP 
HIYNYLIFDKPSHQF 


4832 


18733 


A 


4866 


206 


412 


QIXI*SQACTLARGNAKGFLKY IHRNNVS 
MPSVAGHTWGPDQLVKGQGETCPATIHF 
SDSPS LKYTMYPM 


4833 


18734 


A 


4867 


248 


78 


SHFEAGLAAQLTVSWFPQGTPVFVHAGP 
FANIAHGNSSIIADRIALKLVGPEGFVG 


4834 


18735 


A 


4868 


637 


2 


EFDGRWPRQI VS S IGLCRYGGRI DCCWG 
WARQSWGQCQPVCQPRCKHGE CIGPNKC 
v( 'H unY2in?f r rnjrinpuT dsdt tv^p gtwi d 

Hl*flxrV7 X riVJis. X v^XN ^UdiT X rAr XjXJ^OO E*v 

LFQPIiDHQATSLPSRDUJECXjLKPRPCK 
HROTNTYGSx^CYCx^GYMLMPDGSCSS 
ALTCSMANCQYGCDWKGQIRCQCPSPG 
LQLAPDGRTCVDVDECATGRASCPRFRQ 
CVNTFGSYICKCHKG 


4835 


18736 


A 


4869 


276 


426 


VSFFLFFFFWGANPPFVPQAGGQGGFLG 
SLNPLPPGLKHFFCLTPPSSGN 
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nucleotide deletion, \=possible 
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4836 


18737 


A 


4870 


100 


485 


EERNREMTERCSVTQAGVQWHDLGSLQP 
SP PGFKRFS CLSLPSSWDYRIiECDGMNL 
AHGKLRLPGS SESPASASRVAGITGMCH 

PRRQEVEHNTPSLVPI 






71 
A 


4o/i 


Tiff 

/io 


Af\Q 
4Uo 


RVT.TT.nT. A. WP DCSIST XI AWWWMHOnRZVT/SP 

ATDSKEPPEELCPDVLYRTARTLHGQET 
YTPRJ. I IiMDIiHYDVS LGAL VLTAAG YRA 
SSCSQSVAVCTGSVIiEARA 


4838 


18739 


A 


4872 


392 


246 


RGGCSELRSCHCTPAWATRVKLHLKKKE 
KRKCKI RPEGNE I LVNNAKG 




1 O JH\J 




AQ.TX 


l 




GGGGYSEPRSCHSTPAWTTRAKLRLLKK 
KKKKKGGVCFGVGFLGPRPKGGFF 


AH Aft 


lo/41 


j± 


AWIA 
HO /*r 




A 


AHG I S I YASMLAMS I PSRRNVS FRRAGQ 
VKAVFPVSVVFGPLHRHHLGEHFLFFLR 
QSPTLVAQAGVQWCHLGSPQPLPPA 


4841 


18742 


A 


4875 


72 


2 


CPPAAPDEDSTTNITKKQKWTVE 


4842 


18743 


A 


4876 


375 


221 


GRLRQENRLNPAGGGCGBPRSHYCTPAW 
VTEQDSISNICIYINIFSWHGQ 


4843 


18744 


A 


4877 


151 


2 


RAPFFFFFFFFVAQAGVQWRVLGSLQAP 
PPOPTPPRfT.^TiP*? cjWTJYPP P 


4844 


18745 


A 


4878 


401 


70 


PHRREGGCVPPPPPKNFFFSPGGYFFWG 
GGGQNPPPPKGGFFPKTPPGFFFSPPQK 

KTfT PPPPPPPPRPPPC3PPT.P PPPPPPPP 

FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFF 


4845 


18746 


A 


4879 


4 


152 


LPYSTPSTPFRAKKKKKKKKKKKKKKKK 
KR 














4846 


18747 


A 


4880 


267 


i 
1 
















FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFF FFFFFFFFFFFRTRGRTRGRTRGRT 
RGR 


4847 


18748 


A 


4881 


2 


408 


LQDATSPIIEELITFHDHALIIIFLICF 
LVLYALFLTLTTKKKKKKKKKKKKKKKK 

PKKLGGEKKIFFFFKGGKKKTRGGVGKK 
KIFLGGGNFGPP PPKKKKPRGKI 














4848 


18749 


A 


4882 


413 


69 


IiPPPPPGFKHFWAPPPPGRGVPGPPPPP 
RVNFVFLGKKGVSPFGPGGPRPLFPNPG 
GVGAGDPLDPRGQGFHGPKFMPCPPARG 
TKREPPFPKKKKKEKEEKKBPERKBKKK 
EI 


4849 


18750 


A 


4883 


46 


394 


KQYQVSVKLLFVTQNLGTKKKKKKKKKK 
KKKKKGGALKKKPGGGQKKPGGKXKNFF 
LKRGGK3CKPPGIFEKKTLFGGGENWGTP 
PQKKKPPGKKKKILKGEGGKKTLYFGRG 
KNFS 


4850 


18751 


A 


4884 


399 


3 


FFFKKFFSPNEFWFFFPPFPLKIFFFPP 
RLFFFVIGGLAQFSPPPKKVFFSKFPRGF 
FFPPPLGKNFFFFPPGYFWPPPGFFLRA 
PPLFFFFFFFFFFFFFFFSVSIRNAIAI 
P^GTMP^RRLAMGMLLRRG 


4851 


18752 


A 


4885 


391 


47 


PHFFWGGVFPPPPPKKKKFFFLPQKGGG 
GGAPKKKKKKKKFLPPNFSGGFFFFSPP 
GGPPRFFFFIFFFFFFFFFFFFFFFFFF 
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FFFFFFPPFFFLSKMFTFILKYTLKQNE 
r i 


4852 


18753 


A 


4886 


32 


379 


LLPVKKKKEKKGE KKKKKRKKKKKKKKK 

SFSHRGGLKKPLFFLGRTYFLGGGGLWP 
HPPPKKIAGGAKNI ILRGWGEQKPLLFP 
PGEK 


4853 


18754 


A 


4887 


178 


33 


IFKRKWFFFRFFFFFFLYYFFFKFFFF 

FFFFFFFFFFFNTr'HWPKT.V 


4854 


18755 


A 


4888 


238 


3 


KPFNPFKKKKIGPQKKKKKFPPFLKINP 
PKYKTQNINIYKYIKIFFFFFFVRGSDS 

VAF.»FVf) UnUT jfJ QT .If P A<5 PfSFlf 
vnoruivynnnjjooiiJUrrurvir jv. 


4855 


18756 


A 


4889 


93 


1 


DGILLLSPRLKCSGTISAHCNLCPPGSS 
NSR 


4856 


18757 


A 


4890 


6 


219 


I EKAHKALGTVPGS TF VR YHHYHHHNHY 
HHHCHHHHPHQDNSEWEKAQSLE PDKLF 


4857 


18758 


A 


4891 


1 


364 


TSSRSRAGRPMDFGFFFKKKGHREPPGG 
VEKKKKNWGSRKTPPSGVKNFPGFDPPK 
MWNKGGPPTPPVNFFGFKKKRGFHKGLK 

LGGKFQGKR 


4858 


18759 


A 


4892 


387 


81 


GVFSPPRGGPPPGGKTPPFFPRGGPPCG 
PGGRGPIFPPLSHKKFFFSPPPPLLGGG 
t^irKo/vr irFluJ Vr tsJLrtrturict WGAr XxCK. 
SFFLKKGGFFGPPRGFF 


4859 


18760 


A 


4893 


213 


14 


KTPPPFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFMLE VMFLVNRRGKI CRVPFTFFNL 

or. 
oJj 


4860 


18761 


A 


4894 


384 


10 


PLPPFI PPPPPPPPLFGGGVCPYFSPPP 

UT/^GDQDDOnnOOQDDDDDDDOTDDDDO 

rlLuo r o r f f buKr o r IrrrirFrto 1 r'trirt'r' 

PPFFPPPPPFVSGAPPPPPRSCLAPPFF 
FFFPPPFFFFFFFFFFFFLPFFFIiKKSD 
FIFSLQMKYFMLI 




lo/oZ 


A 




Z4U « 




T.TTTPZVT.FFIfTFWT WT /»AUllUiPlJDGTT 

GGQGGQ I MRS 


4862 


18763 


A 


4896 


339 


3 


GVGWHLFFPPQKKFFFFFQVTCASCAAK 
SS KESHKGIiHLNIYLPKIFFYRPPPPFF 

FFFFFFFPFFFT . ffVPP P FFF pp p p pp p p 

FFFLWREG YNQHFRGMGP IGRVGGRV 


4863 


18764 


A 


4897 


56 


335 


TCFLYKLKKKKKKKKKKKKKKKKKK3CJCK 

GGEKKNLWGFFEKKP FFGGGKI CPTPPQ 
KKKPPGEKK 




lO /OJ 






n 
I 




KKKKKKKKKKKKKGGGPLKKNPGGAPNN 
PGGEKKNSPP 


4865 


18766 


A 


4899 


74 


250 


INICSEKLPKKKKKKKXKKKKKKKKKKK 
KKKGKKI KRGLNSOTS tiLKKRE TCKKTCRG 
KI 




io tO/ 


j± 


AQ{\[\ 


131 




KYHFHKNYFFSI/LYFSONVSOI^S PDCiPL 
PQLPLP YINS S ATRVFFGHDRRPADG 


4867 


18768 


A 


4901 


1 


364 


LNI/3GRGCSEPRLRYCTPAWATEQDSAS 
QKKKKKKNPNF 


4868 


18769 


A 


4902 


1 


96 


GGGGCGELKSCHCTPAWVAEQDSVSKKK 
KKRG 


4869 


18770 


A 


4903 


3 


225 


GFHRVGQDGFDLPTS WTARLGLPRGWDW 
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RREPPRPAVWAGLQIiLTSSDPPASDSQR 

n/— T'l'U 1 CUT *PT VC*UT DT7 


4870 


18771 


A 


4904 


384 


194 


HWVGRPFFNLWGQKNPPPPPPKRIjEYQG 
WGPPPPPPFPFFKKTPIFSQVFNPPPFF 
FFFFFFG 


4871 


18772 


A 


4905 


3 


387 


TXITPILLTLFLITQLKIIiNTNYHLPLP 


4872 


18773 


A 


4906 


1 


327 


PTRPIEPIPENPKFSVPPITPHPKBCKKK 














KTWGGPKKRGGGKKIFFFLRGGKKKPRG 
EFLKKTSFWGGENIiATTPPKKKKARKKK 
KNF 


4873 


18774 


A 


4907 


134 


2 


ALYSLKVKWPGAVAHACNPNTLGGQGEW 
IRRSGVRDOPGOHSET 


4874 


18775 


A 


4908 


59 


449 


ATRHYTIMIiAYTSVGI PITAWIiSKQAI 
KGARFQVRRTTENYDRQQtilHKGGGRPP 
GCHSTHTVLLPPVTWVKTTGNSTVANFV 
TjTTiVCLSFTjIjLVYRCIOOLORDSNOREG 
AMMMWVXjSKRKGGYAGK 


4875 


Jo/yo 


A 


4yiw 


JJ/ 


i 

j 


LKTAWATRGNP PLYKKNTO I SGARGEGP 
PIPLIGGGLSQKNFFTPGGENSINPDPP 
PSPPPGAKKETPPPGKKKKKKPTTKELG 
KDWTLELWDSDSQRAGKLAEVPGESSP 














a one 

4876 


1 0*7*7*7 

18777 


7\ 


4ifIU 


1 A 

14 


lOZ 


AESGAKRETAFPFAPOOGATDKRLNIIY 
AAKKHMKTCLPSLAIREMQIK 


4877 


18778 


A 


4911 


331 


2 


IiPLLAPKEGQGKILGYKKPPPPGIiPPFF 
GSTPPRNGEKGGPPPPRGNFWVFKKNGG 

*iifi j JL XT XT i\iiVJiJi\wVJ XT XT XT XT IvVJll 4 f 1 V JL A V*\A,» VJXJ 

FPRLGGRGPFPGPQKPPPRPPQKGGKQG 
GNPAAGPFFFFFETEFHSCCPGWSA 


4878 


18779 


A 


4912 


116 


316 


ACAHAHTLFFFFFLKKKPPFVPLLGGKG 
GNLGYLKLWLPGLKKVCLTPPRTGNYGG 
APPHRIIFCFL 


4879 


18780 


A 


4913 


93 


16 


SPPGWPGTVAHACHPSTLGGRGGRIT 


4880 


18781 


A 


4914 


316 


2 


KPRPGNPLRARVFNPPGPPGETPFFPKN 
PKFTRGGRGGPIYNPFPGRVRPKNFLYP 
RGSRFHWPNS I PCPFAWGTKPNFFSKKK 

iVVlJlwl 11*1 XT A^I^J *A JET \<# XT A nflW A XVXX1 W XVLV&V 

KKKRKKKEKLCFSASVPDAQ 


4881 


18782 


A 


4915 


3 


289 


TSCNPSTLGGRGGRITRVGVRGQPDQHG 
ET PS LLKNY I Y I Y I CVDVYLYVCMYVGA 
YIYICICGYIYIWRYLSLTSEHTWrRAP 
GQYRVTYFFCG 


4882 


18783 


A 


4916 


1 


222 


ARGERERERERERERERESRHQGGGAGP 
PTHLWGEEGWLSSPQTGVREKNAPIWG 
CAAPVLAPRVGRTPFGVD 


4883 


18784 


A 


4918 


1 


281 


ARGERERERERERERERERERERERERE 
RERERERERGAPPPPPRGGVRERGTPCP 
SARGARALSPTISCGDPHRGPKIYRGVG 
VHTQRYFSVGSSLCINTHT • 


A9.9A 
4354 




A 




l 


/.yy 


ARGERERERERERERERERERVFVEKGD 
MVTLPAGIYHRFGGGEKKYTKAVRVCVR 
KPGWRAHNRPADHFGARGPHVKFLAQTV 
YE 


4885 


18786 


A 


4920 


797 


910 


NTMQPGVVTHTCNPSTXiGGQGGRITGSG 
VQDQPGQHGE 


4886 


18787 


A 


4921 


2 


67 


LEERERERERERERERERERERSS 


4887 


18788 


A 


4922 


1 


698 


TLLliAELGTICDPYRSCSISEDSGLSTA 
FT 1AHELGHVFNMPHDDNNKKKKRGGPF 
KGPNFSPPVRASKNFFFGPPNLNSWARF 
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miflAn'H flA incArttnn 
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LTPGEGKTPGVPPFNPFATKAEL 


48S8 


18789 


A 


4923 


323 


2 


LVCRGTIHKFRCVPHLTGRRFEHGVTDC 

X XXir l\.LJr\.i.Xi i it\\J j.Eii'LtrLJ E ITXCIlX/IJWWfvrJvx 

QNLYLDNLEATGLYQVPLSAAQPGDVLL 
CCFGS S VPNHAAIYCGDGBLLHH 










i 
J 




T-TR R3\ T iHT .RYYTVr.MRPMirR TTRT . T TT/TTU T(2 Q 
riXa&rlXlIXlJlT X XM?l<3 VJIXxXvxVJa J-lX JL vf niuo 

DEHQD I DVAI LTALLKGKPLRTFLFVRP 
FILCMYMCVYVCICVCI 


4890 


18791 


A„ 


4925 


371 


169 


HTQS I SVVLVBRRFHHSGQENPLNPGGR 
GC^ELRLCHCTPAWATRAKSRLKKTKQT 


4891 


18792 


A 


4926 


2 


352 


ARAARAGRI IKELFFFFF FGNIiKKKKGL 

FWWPGGVKNPDLGNPPPWPPKGGGITGL 

TPRPGPMGVIFKKFGKKGSPSPTGPKSL 

GPREFFGLALQRGGDU3LNHRGPPFFFG 
trr.TTTM 

VJJOXFl 


4892 


18793 


A 


4927 


2 


379 


ARANMEETQQKSNLELVRISLLLIKTGL 
air VltxxjKorlr IxwJMJjV XiJloiJoJJiJxxlJjliA. 

DLQEG I QTIiMGRLENGRRRTGQ II/TQTY 

JIVJcl/X 1M Ol^lNXllyJTLjJlJfU.N XX\XliJl\^C X JVL/riXs 

KVETFLRMEQCRSV 


4893 


18794 

JO / 7*T 


A 




i fj 




I LGDLFPPAWLHP FLFLLLPFS PPSLAV 
TEVAVQWRNLGLKQPLPPGF 


AQQA 












VT^RT.RPPfT>f3WQPQT.RT.'NrAQar*T QRT/KV 1 
V JUrtXir f v^L»lrvjWO Jr JXJaXJlN^OA^jjJgXJJxv^ 

WDYRCEPLCSVS 




lO 








i in 


ftRUa * A.onnv_ x it «nrti'ioii x vo ivi\j\j\oovjt 
IiFFLPRLV 


4896 

TO/U 


lojy / 






i 


i 


KYVAPCPPT.PT.vr^prRpqrFPPPyrun 

jvx v/*Jr V^tvLrXjJr Xj Vijjr^JvttoUP xrl* fr \~.£\U 

CKFPESFSEATMFFLQPAEPRA 






j± 




z 


1 j j 


ar nrvr •wpnfW 1 qt?t .p<jt?wpto a w aTRp n 

nRUUiul rOVaVJVgO &XJ twtmv> X riinHl HxvL/ 

SISKKKKOTQNIWGNLKKKWNLK 


4898 


18799 


A 


4933 


144 


249 


ETLYPQPPQSLNASCVCVCVCVCVCVCV 
CLCVCV 


4899 


18800 


A 


4934 


2 


153 


ARESFVEKGFRHVGQVGLKLLASGNPPN 


4900 


18801 


A 


4935 


1 


355 


GTSQEFEGRKDRxxASPLETGNAGAGRER 
LABVSTCPSGASKPLQTPRPGGQEGTPG 

rdRRRPMPWQZVnPQPNrPAUFRQTPnDQT TC 

TALLVACSIFHIHEDPILHDVLISRCMS 
IKTSHR 


4901 


18802 


A 


4936 


112 


358 


NIIKFSKAFFFSFYIFILINSVGLASTF 

1 InVjvJ Jj i_J 1VL Of iuvjXliXXr^rr JVX INXJiAiry 

RGILRTGTFKEDPDDPEGNKVSSLLQ 


4902 


18803 


A 


4937 


1 


258 


GTS PYVTQQFFSGWGRQGLTLS PKLEHS 
ELRLOTCTOSWVTERDSISKKXKKKLFY 
LAGPT FFPFGGGNFNLALGRIxLKRGGGE 
KK 


4903 


18804 


A 


4938 


380 


1 


FFFFFSETEPYSVAQAGVQWC 


4904 


JOoUJ 








t 


SLCGNLFYVIRFFDMESCSVTQAGVLWC 
NIGS SOALTPG LAP FS F I S TjPSTPONSR 

P 


4905 


18806 


A 


4941 


10 


173 


KYIEYDTNKWKHNSCSQIGRINFVKMSL 
PPKAIYTFSVFPIKIPSVYLPVHIiSI 


4906 


18807 


A 


4942 


144 


3 


DKCIRRPGTVTHTCNSSTLGDQGGRVMR 
SGVRDQQGQHSETPSVRV 


4907 


18808 


A 


4944 


484 


274 


PSSQRS PRTARM I ALSTRPTTRSMKFCH 
VAQAGLKLLGS SDL PAS AS QNVG I S GVS 
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HHTWPRFYYliS 


4908 


18809 


A 


4945 


103 


412 


KKKKKKKIGCKKKKKKKKKKKGGGPIjKKN 
PGGAKKKRGRKKKIFFFKRGEKKNPGGI 
FEKKPFFGGGKNWAPPPKKNKPFGEKKK 
FLRGKGGKNPPIPPGKKKL 


4909 


18810 


A 


4946 


309 


81 


KFFFFLKGL I FLGGFCPIFPPPKKI FFF 
RVFFKGPPPFFFFFFFFFF 


4910 


18811 


A 


4947 


18 


264 


NYTQR FKVE YDNN I 1 NQLDLTD I YR I LY 
PTVEYTFFKNPHGTLCRTDHMLGHNTGL 
NKFKRTECTQSMSSDHNGIKYHSAP 


4911 


18812 


A 


4948 


418 


293 


QLLRRVRQENCFNPGGGGCSBPRSPPCP 
PTWVTKQVFFSKP 


4912 


18813 


A 


4949 


3 


252 


DSAKHLGLKHVVYCXjLENVKRLTDGKIiE 
VPHFDSKGEVEEYFWSIGIPMTSVRVAA 
YFENFLAAWRPVKASDGDYYTLGKTK 


4913 


18814 


A 


4950 


434 


270 


KRGFPPKTPPGFKKPPPKKKKKIFPPPP 
KIGPPQKILKKPPPPFFFFFFFFFFF 


4914 


18815 


A 


4951 


399 


1 


VGFSLFPPPQKKGFSPKPPRGFLFPPPK 
GKKKIFPPPGKIGPPQGFFKRPPPLFFF 
FFFFFFFFFFFFFFGPPSVTLLFLVTIL 
LQIQTTALGRKSWGKGFRIPKPVCPVPA 
QQTRQGAKHLLGQWE INSCA 


4915 


18816 


A 


4952 


235 


56 


FTWPPPKKKKKNPPGGGGPVPPPWGGK 
NKKIFLPPKFKGPFSFGAPLPFPPGQKK 
KPL 


4916 


18817 


A 


4953 


13 


194 


SIKESLQISIjIITFTATELIIFYIFFET 
GGAF 














4917 


18818 


A 


4954 


177 


1 


PPQKKKKNIFSPPGKIGPPQGFFKRGPP 

lffffffffffffffffrfegbcsdldg 

RV 


4918 


18819 


A 


4955 


450 


0 


LPNKADKKKKKKKKKKXKKKKKKKAR | 


4919 


18820 


A 


4956 


29 


158 


amivplysslgdffkkxekkkkkkxkkk 
kkkkkkkkkkggggf 


4920 


18821 


A 


4957 


397 


0 


PSSPPPPS PPPSPPPPPPPPPPPPPPPS 
SPPPAPPPPPPPSPPP 


4921 


18822 


A 


4958 


257 


397 


ffflsftfvaqagxqgxxygsldppppg 
fnlfscpslpriwnytapp 


4922 


18823 


A 


4959 


419 


266 


rfhhdgqdglelltssdlpasasqsagi 

IGVSHRAQS I KSLAPQ YWYFPS F 


4923 


18824 


A 


4960 


300 


190 


FFFFFFFFFIVAFFFFFLFCFLFFFFFF 
FFXIFNNIF 


4924 


18825 


A 


4961 


3 


179 


DAWVLPFI IAALAALHLIiFLHETGSNNP 
LGITSHSDKITFHPYYTrKDALGLLIiFIi 
LS 


4925 


18826 


A 


4962 


2 


182 


RVNAKDSKWLTPIiHRAVAS CSEEAVQVL 
LKHSADVNARDKNWQTPLHXAAANKAVK 
CAKL 


4926 


18827 


A 


4963 


273 


440 


KIHIQHCWWEYKLLQSLWKAAWHFLKEL 
KAELPFNPAIPLLDIYPEEYKSVYIiKDT 


4927 


18828 


A 


4964 


179 


3 


SLPFCCNLTPKKKLFFKFCRKNGFFFFK 
MESHSVTQAGVQWCNLNSLQPCPHGLKQ 
FS 


4928 


18829 


A 


4965 


268 


364 


TVCVTELLTSGDRPASASQSARITGLSH 
HAQP 


4929 


18830 


A 


4966 


374 


2 


QNFFIiKKKKKKRVGPGVPPFNPPPFWGQ 
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GGGFPRS ILGNP I KPLFFLNS P KIS PPL 
WPGPFFPLPWGVWAGKNPLPRGGGFPLT 
KFPPLPSSLGKKIQCFRFKKKKKKKGQAR 
WLTPAI PTPRE A 


4930 


1S831 


A 


4967 


79 


346 


DKI FSNISCGI VIDLLAI YNWDAYATRL 
GIYKHWDFII IDKDTSSCRLSFSSYPRF 
LESLDDFYILSSGLILIjQTTNSVFNKTL 
LKQVI 


4931 


18832 


A 


4968 


3 


93 


FLHVGQAGLGL PTS GDP PASAS QSVGLQ 
A 


4932 


18833 


A 


4969 


158 


1 


SKSNFAYIFPI KDNGI/TGKKT INRWTK 
NWANDLNRHFS KVNIQMAKRYWKK 


4933 


18834 


A 


4970 


2 


266 


VGQAGVELLSSSDPPASAYEGAGIPGVS 
HHTWPKHFFPALFVACISSLVNCLFQLF 
ACISIGIaSFFFFFFREGDLGNLLKIAGL 
TFLG 


4934 


18835 


A 


4971 


3 


96 


GVRHHAHLIFVFLVETGFHHVGQDGLNF 
L 


4935 


18836 


A 


4972 


253 


348 


NFI FLLF I YFEMESHS VAEAGLQWHCLS 
SLQP 


4936 


18837 


A 


4973 


326 


2 


PMEIPQHKLSLWPGFAISVSYFERKLLF 
S ADVS YKYLiRNETVLE FMTALCQRTGIiS 
CFTQTCE KQL IGLI VLTRYNNRTYS IDD 
IDWSVKPTHTFLKRDGTEITYVDY 


4937 


18838 


A 


4974 


3 


244 


DASLVFKVAETANEEEVKKMCMYKYPGN 
KCGREGGREKKREEERREGRKEGREGLR 
EGETDEEREEGEEGLS YS PFKNSI 


4VJo 


18839 


A 


4975 


6 


79 


THFSLTI TSLQPED IGTYYCQQYD I 


4939 


18840 


A 


4976 


15 


339 


PGEAGNCLNPGGGSCSEPRSRHCTPAWA 
TERNS VSKKKKKKKKKKKI PRGRGLPPV 
SHPFWKAGGADWFDLGTLEPPWPTGETP 
VFKKKKKLTGGGGAPLVAPTWGGG 


4940 


18841 


A 


4977 


204 


275 


I IEVDPDTKEMLKLLDFGSLSNLQ 


4941 


18842 


A 


4978 


3 


352 


RRPSPHGLVGAVSVGGAGVMAVETLSPD 
WEFDRVDDGSQKIHAEVQLKNYGKFLEE 
YTSQLRR I EDALDDS IGDAWDSNTjE PMA 
WRRLPYEQSSVLELUCTENKVLNRWTG 
YAGL 




18843 


A 


4979 


219 


3 


AASTMAMSFEWPWQYRFPPFFTLQPNVD 
TRQKQLAAWCSLVliSFSRIiHKQSSMTIJfl 
KSKLTS bo 1 1 vMXiDC 


4943 


18844 


A 


4980 


142 


3 


FQGFFFLRDKVLLCCPGWSRTPELKRSS 
pt f*x nrflunvnnn twit ~k 


4944 


18845 


A 


4981 


351 


2 


FFFFFSETESRSVAQAGVQWRDLD 


4945 


18846 


A 


4982 


133 


3 


SFFFFLFETECCSVPQAGVQWCDliHPIiQ 
PLPPGFRRFSCLSLP 


4946 


18847 


A 


4983 


3 


161 


LGSLHDTANTLWPRLERSRTIMAHYSLD 

tr f~i_n r— i nnnnmn ■! ntiiri i j~tn_i ji 

LPGSSDPPTSASHVRGTTGMRSTRP 


4947 


18848 


A 


4984 


363 


3 


GPSNKKRGRSKKAHVLAASVEQATENFL 
EKGDKIAKESQFLKEELVAAVEDVRKQG 
DLMKAAAGE FADDP CS SVKRGNMVRAAR 
ALLSAVTRLLILADMADVYKLLVQLKVV 

T7rY2TT.WT. 

CiLAjlliAJj 


4948 


18849 


A 


4985 


101 


1 


LQVCYGRRWMNSREYGAWKQQVESKNM 
PFQDA 


4949 


18850 


A 


4986 


341 


1 


PPPMS PPDVLKTS PVADAAGWVDVDKET 
LQHQRYPNVFGIGDCTNLHTSKTAAAVA 
AQSG ILDRP ISVTMKNTQTPS KKYDGYTS 
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CPLVTGYKRVILAEFDYKAEPLETFPFD 
Q 


4950 


18851 


A 


4987 


86 


1 


KCSGTISAHCNVRLLGSSNSGPSASRAI 


4951 


18852 


A 


4988 


124 


3 


LPWLCSKFETKTFQFQIYYKAKGVKTVW 
YSHKDRNI DQWN 


495z 


18853 




4989 


305 


150 


OTT .T .TJVT? VQ LTG7V TTOni/PPnPT /POTtTTPCTT jCi 

S PDHFS KFFNQ YFFKI GFDYWKN 


4!03 


iooj4 


A 


xhnn 


4LA 
64 


2/f A 

34U 


NPIjKKKKXKKKKPLSKGGGGKKTPLKKK 
kkpppkgf 


4954 


18855 


A 


4991 


3 


158 


PSLVQTRIiRHAGQAGIiKLLTSTDSPASA 
SQSAGTTGVSHHAMPKDCSIjNSN 


4955 


18856 


A 


4992 


116 


66 


FFFFFFFFFFFFFFFXE 


4:00 


lotD/ 




4yy3 


3 


1 A*7 


{VlAriWT.T PUT A AT.IJP A ITUT^ Q PPVP Q T.P 

PAWPTQ 


4:0/ 


loo JO 


TV 




1 

1 


i as 


KT .DP T.AP HrJT lYPTnCKT^P KfYR TfPPKTvTP M 
KKVRGTW 


4958 


18859 


A 


4995 


110 


3 


RRIGRYKTVFLCTQLEILMARYPLPPSP 


4959 


18860 


A 


4996 


109 


3 


DEVSILCPRLECNGTILAHCNRRLPGSS 


4960 


18861 


A 


4997 


1 


148 


ACCPFCTIYLLPMFMIIKAPLMGTSKKK 
KKKKKKKKXKKKKKKKKKKKKKKKKKKK 
K 


4yoi 


JOOOZ 


A 


4yyo 


3 


tin 


V X J-lS^V70JMX> VEiXJ VnfiUNotU XXX VXjVLAjV^ 

SKKTSTRP 


4?Oi 


1 oo^o 
I ooo J 


TV 

A 




3 


330 


D T TXTTPrt 7A T? VX/Q TAP P WMT T C P1?T .T .T .If 7A 
r XAJJlvVxAGi X V£*AKr«WPiX £lxU\^LxJjJ\A 

HKKAIRRA1WTFGYIAQAIGPHDVLAT 
IJ^NNLKGHERC^VCTTVAIAIGAETCS 
PFTALPALMNEYRVPELNVQNGVLK 


4*03 


looo4 


TV 

A 


5UUU 


3 


1 1 A 


VT T /V2QMT1VPT A/HPnUGP PT r V t T\7T .TTTYir* 

O JVEVX w X XV t 


4964 


18865 


A 


5001 


25 


329 


NSRRRRNIX^SPNLCQTFTFMTLPYIjPE 

NI KQCE INYVKKFQS FQDHKLKI S KEES 
KILKKAQKDGFIJIETLLN 


4965 


18866 


A 


5002 


278 


345 


edeegywdgevddbkdeeelgg 


AQAA 


IRQ 67 
lOOO / 


A, 


JVvrj 




J 


VRI IISGTOKKKK3CKGKlxPKNroPKVTP 
DPER 


4967 


18868 


A 


5004 


18 


350 


VSHECLIFHIQNVIiTGLVISIiSCPSVPS 
HTHKHTHIHTHTHTHTHTHTFS 


4968 


18869 


A 


5005 


26 


147 


KEEVPJ4ALFANDMIVYLENPFVSAPYVL 
KLI SCFCKGRLG 




loo/U 




JUUO 




■i 
l 


55 AS R VAGI IGKHHHTRIiIFVFLVETGO 4 ? 




too/ 1 




snn7 

JUUf 




*j 
j 


RARKPIiLW VLKKI WS P WI YKKKTRINFO 
KEKKSPFFSRPFFLFKKGGFLPPGVFFS 
RGFIjKKI PPFFFFFFETESRSVTQFGVW 
SVQWCNLGLLQPEPPGSKQlxLC!RGLPSS 
WDYRCLSTRP 


4071 


lOo/Z 


A 


soar 


1 




I RI liSKI KNALTHFLPOGTPT PI. T P T TjF 


4972 


100/ J 


A 


5009 


218 


91 

7J 


FFFFFFFFYFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFF 


4973 


18874 


A 


5010 


3 


250 


RPRRRRI^QENHLtNLiGGEGCSESRSRHC 
TLTWATOQDSVSKKKKKRVFPPPHPRGK 
FFWAGGPTFFPPLKTVPFPGGGKPNF 


4974 


18875 


A 


5011 


336 


0 


SSSSPPSSSSSPPPPPPPPPPPPPPP 
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lacf amino 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 

mic lent ill ft incArfinn 


4975 


18876 


A 


5012 


2 


80 


SNQNGTGGES I YGEKFEDENFHYKVK 


4976 


18877 


A 


Jul J 






QSPSHLEU^NLCRHINIRVDPTGLREGM 
AQVNES 


4977 


18878 


A 






1 
1 


F P P FF cftf cj r cvan arrvnwr? vr /1 c t .n a 
PPPGFM 


4978 


18879 


A 


5015 


193 


32 


PLL I LCS ERLCHCT PAWAI ERDSVfi KTN 
KTKQQQQQQQNQGTKCMPLALLGOI 


4979 


18880 


A 


5016 


354 


235 


CGEIGMLLHCWWECKLVRPLWTNLFLRD 
OSTLTVGSK3 3 F 


4980 


18881 


A 


5017 




7 


RT iTiVfJTCT jMFT.HOFO C ^flTT 2VTnnP 


4981 


18882 


A 


5018 


76 


1 


I F FLS Q VS P LS KED AGE YECHASNS 


4982 


18883 


A 


5019 


68 


356 


YFGGVGGFFFFFFFFFWFGPSGGVFFVG 
VQAIFFFFFFFLGGFFFLVRDYFFVAVF 
wwvjjuiiirirevr r r VVr\.r Vrr rr rvSWKRK 
NIFYGWVFLFCW 




J 0004 


A 




1 
1 


'TOO 

.Zoo 


MQNFKNLVGYHNS I TEENLPQLGANENL 
ESQSGNFSWF I FFN&DRKRGMVLLLPN 
NEMTILKPRTSV 


4984 


1000 J 


A 






I 


V Vfll JS 1 MNo X lUr WlWit> IviArl rUN PSTli 

GGQGGQITS PQKFKTS 


4985 


18886 


A 


5022 


2 


337 


RRSDPNFKNRLRERRKKQKLAKERAGLS 
KLPDLKDAEAAQKFFLEKI QLGEKLLAQ 

QTLPPPVFQMLLTKLPTISQRIVSAHSI, 


4986 


1 0007 
1 000 / 


A 




1 


IOC 


TTTVSOD^KTT T OCtlDOT TXT DJlcmTOr r^iitTTY 

V LXjCEAN uuSoxuwii VIjRAET 1 0 LGBHP 
CDRGEQVTLFLFNDCLE IARKRRKVIGT 

FP C PTTf"2ATT3 DDfiCT .KTJTXIT.M'DT - COT WTi 

LDIIETEDCHNAFALLVRPPTEQA 


4987 


18888 


A 


5024 


135 


2 


ATMFLNS KVS KYSGYLLGFHECREKGWM 
TWDGERDPSPGILQLQ 


4988 


18889 


A 


5025 


2 


94 


KTATKLIGGXHYDSXNIKAIRDGLLARR 

WIT. 


4989 


18890 


A 


5026 


2 


362 


QELERSMAQRCVCTLALVAMLIiLVFPTV 
MKEALKVFPTWSTS FIQHEWEBYSHL 
WEVPPFSG 


4990 


18891 


A 




ill 


OA 
Ort 


AGITGMSHRTQPMVHtiY 


4991 : 


18892 


A - 


5028 


366 


0 


SGRSKKKKKXKKKKKKKKKKXKKKKKKK 
KKKKKKRGGR 


4992 


18893 


A 


5029 


231 


68 


AGMGSRALPKPLLMSHSSLKAVBIjPDS f 

C PPT.P C T .T T.rtP TWKJPP. T jTZCT fZOtZ 


4993 


18894 


A 


5030 


2 


366 


LNLGGRGCSELRS CHCTPAWVTETP FKK 
K"jnn7PHf!fs vFvnrvr .t.dpp pmp twp p d 

QGERTCPGWGAPVHTGLLPRRGQAFTQP 
PSAFKGFFHPTNICS S S EAKKKKKKPRVG 

■C will. 111 1* X J.' JW »J O !JX^£V£\>XvXVLV[\Jr X\ V \ J 

RKKNVGYIL 


4994 


18895 


A 


5031 


241 


368 


QVERNFKSQSGAEAHICDLSTLGGCGGQ 
ITRSGVHDQPCQHG 


4995 


18896 


A 


5032 


279 


396 


DGGMWPGTVAHACNPGTLGGQGRWITRS 
GVRDQPGQHGE 


4996 


18897 


A 


5033 


173 


398 


SNESSl^VNPFFFFFFLKTNFSFCPPGG 
RQGPNFSLLDPPPPKVKEIFCLTPPKRW 
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EYRPAPPPPRNFGFLIKTG 


4997 


18898 


A 


5034 


2 


143 


SNP PALTSQSMGITS MTRHALP IACFLV 

17\nTT .T?PT2MFCTIQ T.^H & \7\1C\ 
C V C LlC C JUTlCiO ti»-5 Lui\tnJ2i V \£ 


499S 


18899 


A 


5035 


350 


2 


QEISSVQTSTQLFNGMTVKARATTREVM 

a TVT T ITTl TX7TPT. T T CiT .D^ltfVPT JdCl T TVT? 

SGKRVGVAVRLFIFITQKSFIFLFSFIiT 

r.rT ,fT AUITHNnPT.T .T .T VP T T, T aMA*FT,MT» 

TKRK 


4999 


18900 


A 


5036 


1 


100 


HECGSSSQRTLSVQEAAAYLKVSNEIRI 
LIAIF 


5000 


18901 


A 


5037 


247 


334 


uyiUloAVViii 1 1AV tt^SlAjlSXxLxxxrxftJNAfl 

S 


5001 


18902 


A 


5038 


2 


296 


DKAPMLKVIVNSLKNMINTFVPSGKVMQ 
i/\mT7VT twit Ty^wT?Dr2Dm?i?'piunmT7i7M7 r P 

DIPLGKVHLEALKKKVTKFFYKFPLRCD 
IHTAKCTGLRYTAQ 


5002 


18903 


A 


5039 


152 


335 


RPIWSYYLIiFFFLFWGLYLDVNKRIFTF 

XXjoxSIIX r URUUW %{lwJUSviuU± X JSJvITxjIKM 

TRISE 


5003 


18904 


A 


5040 


336 


1 


GGLTISSLLKEKEGSEVAKFTLEQLCIiI 

LFDRINLSGDMGTFS PVISSS IHLLVQD 
VDAACDPAMVAMSKMQCQNVQHVGVKSS 


5004 


18905 


A 


5041 


3 


204 


xxN JjouvjVTt^o Ciirivo \JH— 1 iro n V Xxalvl iTUjxV 

KKKKKKRGKNPLKNGGKENFKFLQILVN 

PRIM QT.TSTJT AV 
fiMNO XiEiXM XXf* V 


5005 


18906 


A 


5042 


382 


148 


WCNHRGPRSRKKRSEGSTKSRRLGATIR 

FIV X mi XKX V*i4XVI^lr£ti-IX»3fVJQ».rV,v» V i luIUl 

THTHlTIIYIEniTrOTCIYVKTH 


5006 


18907 


A 


5043 


137 


1 


RPRRRYMKKFSTSLI IKKMKIKITMKYH 
LSHLI P VKMAITKKTKD 


5007 


18908 


A 


5044 


198 


34 


KPASRFCHVGQADIxELTjTSSDLPASASQ 

Q "Afl T TYSVCTJnQn PKTT7T TT •f'T.'Vrtr'TTVUT?!? 


5008 


18909 


A 


5045 


11 


357 


LLTYPl lMFKlliK I Y KKEKQJLiI IKQiSK 
GQEKVKELRQFKEHRKAKDSALQS IESK 
mt TrT.*TTvrpT.PCQnHRTnTMRinnrPTrKni.2JkP 

rll irVLi lIN£OJtCC»o VCiijXVi X^atC^u^J^^X^J^tri/\l^. 

GPYWKFKYSQRWKSPPGPFQWGWGPRKN 
FFV 














PQVHLMPSLYHFRFLQVDTKDLLRASAD 
LIHRGIT i 


5010 


18911 


A 


5047 


1 


196 


Kx^VT^SPMKAAPRNFSCAPSIxSIiLPF 
S FRKrrrriiTSNTADD FAY FDLCKMEPC 
SMVLFFVFC 


^ni i 






jU^o 


IRA 




AOAVLELLDSSDIiPPSASOSAGISGVSH 
CTQPDSIF 




lay 13 


r\ 




1 




MHGEATVADTEAAAGYPEDLAKITDKGV 
YTOOOIliNGDEIAFCWOKI PCRI FtARE 
Q AVPGCNAS KARLTVLLW ANAAGD 


jyjii 


1 OA1 A 

18914 






1 




KEKKKNPVEKSAKVI 




l071 J 










RS F IP SASASASTLCLKCSQNGQPGAVA 
HACNPSTLGGQSRQLTRSGVQDQSGQY 


5015 


18916 


A 


5052 


192 


341 


AKRVKRNNFFFFFLKQFCSVPQAGGRGP 
DPGSUCPLPPGLKGFSCPTPLN 


5016 


18917 


A 


5053 


340 


1 


KRIPDKPQKELRRLATKLIREAPEKGQA 
QGKE IHKS I QEAKGE I FKAIDRI KKS QF 
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DRQEALDILLVMQNALESLSNRIEQVBE 

RNSELEDKVFDLTRSSKAKGKRIRNYDH 
3 


5017 


18918 


A 


5054 


2 


383 


VVKVATQPADNPIJDVLSRKLHLGPNVGR 
DVPRLSLPGKLVPPSSTGSHFFMLGIGD 
IVMPOLLLCFVLRYDNYKKQASGDSCGA 
PGPANI SGRMQKVSYFHCTLIGYFVGLL 


5018 


18919 


A 


5055 


2 


383 


GSVLS KKGD YLKYHYNAS L LDGTLLDS T 
WNLGKTYNIVLGSGQVVLGMDMGLKEMC 
VGEKRTVI IPPHLGYGEAGVDGEVPGSA 
VLMPDIELLELVAGLPEGYMFIWNGEVS 

fCfl l_i.r Ci IS X U1\LAja\sa V 


5019 


18920 


A 


5056 


2 


383 


AVIDEVRTGTYRQLFHPEQLITGKEDAA 
NNYARGHYTIGKENIDLVLDRI RKLADQ 
CTGLQGFLVFHSFGGGTGSGFTSLLMER 
LSVDYGKKSKLEFS I YPAPQVFTAWE P 
YNFILTTHTTLEHSD 




10Z7Z I 


A 






45Z 


lUjNoJjKlARQAyfnfiAAX Dili rRTGTYRQLF 
HPEQLITGQEDAANNYARGHYTIGKENI 
DLGVDR I RKLADQSTGLQGFLVFHS FGG 
GTCSGFTSLLMERLSVDYGKKSKLEFFI 
YPAPHVFTAVVEPYNFILTTHTTLEHSD 


5021 


18922 


A 


5058 


2 


385 


AVIDEVRTGTYRQLFHPEQLITGKEDAA 
NNYARGHYTIGKE I IDLVLDRI PKLADQ 
CTGLQGFLVFHS FGGGTGSGFTSLLMER 
LS VDYGKKSKLE FY I YPEPHVCTAWE P 

VMT7TT TPU'I'MT WGT\r* 


5022 


18923 


A 


5059 


3 


390 


GDAANNYARGHYT IGKE 1 1 DLVLDRIRK 
LMERLSVDYGKKSKLEFSIYPAPQVSTA 

WEPVKr CJTT .T w t*WT w PT .PUQnr»a TTMVnivTP TV 

I YD I CRRNLDI ERPTYT 


5023. 




A 


JUUU 


IvJ 


j 


LPRAT 


5024 


18925 


A 


5061 


262 


412 


KASPFRTGTAFGNGKTSDYLLLGNFGYT 
FGGITGCLKAGLETSYWTWFTH 


5025 


18926 


A 


5062 


293 


406 


VIIGSIFEVIWAVIKPGTSFGISVLRAL 


5026 


18927 


A 


5063 


419 


15 


WEEEGPLPfCKKKGGFSMKRGNI IMGPFL 
KRPPBKKPPPPPRKINPFFFFFKRKGPP 
PPRVKGKGAIKGSRNPPLFGSPDFFQPA 
PQKSGAPGGPPPPPVNKNQFFFFFFLVE 
TGFHHVTQAGPELLSSSSPPTI 


5027 


18928 


A 


5064 


2 


ill) 


GLTNLF I FCRDRILLCCPGWSQTPSLKQ 
SSHLSLPKPWDYMYEPLCXYFLYPWFCL 
SLSLS FPLSHT YFPG 


5028 


18929 


A 


5065 


259 


36 


SQPHWDYRASRAEE IATFLEVWLQAIFV 
CCLFLRQSPFIAQAGVQWHYLKSLQPPP 


5029 


18930 


A 


5066 


2 


404 


GKGAPTTSLISVAVTKI IAKVLEDNKLP 
GAICSLTCGGANIGTAMAKDERVNLLSF 
TGSTQVGKQVGLMVQERFGRSLLELGGN 
NAI IAFEDADLSLI VPSALFAAVGTAGQ 
RCTTARRLFIHES IHDEWNRL 


5030 


18931 


A 


5067 1 


1 


400 ! 


GENMITGTSQADCAVLNVAAGVGEFEAG 
I S KNGRTREHALLA YTLGVKQL IDGVNK 
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MDSTEPPYSQKRYEEIVKEVSTYIKKIG 
YDPDTIAFVPISGWNGDKMLEPRANIPL 
VTGWKATPIDCDASGTTLLDA 


5031 


18932 


A 


5068 


41 


392 


GSPHHPCAHIBRKKKPYNSNIGFYTKRN 
ALRVAEVWMDDYKSHVYIAWNIiPLENPG 
IDIGDVSERRAIiRKRLKCKNFQWYLDHV 
YPBMRRYNNTVAYGELRNNKAKDVCLDQ 
GPLEN 


5032 


18933 


A 


5069 


3 


395 


GTPTRPHILLQALFIRAMLPCPDYDTDT 
KTGLDQ ALKI CQAMLDEAANQGWIjVTVL 
NNTNLI^AIQGRWVKDSSLLTLPYIEN 
HHLHLFKKRKP IMKGPHAKSRTS IECLP 
ELIHACGGKDHVFRSLVQQ 


5033 


18934 


A 


5070 


3 


405 


PRASEVCGFSCHITCVNKAPTTCPVPPE 
QTKGPLGIDPQKGIGTAYEGHVRIPKPA 
GVKKGWQRALAI VCDFKLFLYD IAEGKA 
SQPSWISQVIDMRDEEFSVSSVLASDV 
IHASRKDIPCI FRVTASQLSAS 


5034 


18935 


A 


5071 


3 


393 


ITRQE F IDGI LASNFPTTT I EMTVGAD I 
FDREGDG YID YYE FGAALHPNKDAYRPT 
SDAPKTFHQGTRQVAQCICAKRFLVEHI 
GENKYRFFLCNHFGDSHQMRLVRILLST 
VMVLDGGGWMALDQFLTT 


5035 


18936 


A 


5072 


3 


394 


ITRHEFIDGIIiASKFPTTKIKMTVGADI 
FDRDGDGYIDYYEFVAALHPNKDAYRPT 
SDAYKIEDEVTRQVAQCKCAKRLLABHI 
GENKYRFFLGNQFGDSHQLRLVRILRST 
VMVP VGGGWMALDE FLVND 


5036 


18937 


A 


5073 


1 


393 


GEDAANNYSRGHYTIGKEI IDLGLDRIC 
KLADQCTGLQGFLVFHRFGGGTGSGVTS 
LLMEHLSPDYGPKSKLEFSIYPAPQVFT 
AVHEPYNSMLTTHTTLEHSDCAFMGDNE 
AIYDICRTNLDIERTTYTN 


JVJ / 


toy jo 


A 


S074 






LGLHS AWRDDKIGYNPDTVS FVP ISGWN 
GDNMLEPSANMPWFKGWKVTRNDGNASG 
rTLLEALDCIIxPPTRATDKPLRLPLQDV 
YKIGGIGTVPVGRVETGVUCPGMVATFA 
PVNDTTEVKSVEMHHEALSEALPGDNVG 
FNVKNVSV 


5038 


18939 


A 


5075 


1 


390 


GVSMAVSLVIIYQYWRNMPDPHNLPIV 
AGWKKYPLFFGTAVFAFEG I RWLPLEN 
QMKESKRFPQAIiNIGMGIGTTLYVTLAT 
LGYMCFHDEIKGSITUSLPQDVWVYQSV 
KILYS FGI FVTYS IQFYV 


5039 


18940 


A 


5076 


330 


462 


VNFFFREGGTESPS IAQARVPWCSLNSL 
QPLPPGFKRFSCLSLP 


5040 


18941 


A 


5077 


1 


403 


GGWGVEAEDFEYAPDVEPLEPTIiSNI IE 
QRSLKWI FLGGKGGVGKTTCS CSLAVQL 
SKGRESVLIIFTOPAHNISHAFDQKVSK 
VPTKVKGYDNLFAMEIDPSLGVAELPDK 
FFEEDNMLSMGKKMMQEAMSAF 


5041 


18942 


A 


5078 


348 


1 


RPQGKNIERWQGGEVLIIQLLRIAKSVK 
NVQLLWKTQ FL I KLHILLFYDPR I PFVS 
NYPKEKKTYVHKKTYAQIFISLPRTFLF 
LRQSLNMLPRQDIiSS 


5042 


18943 


A 


5079 


248 


3 


PGQVLEREHTCRRP WAVVNVC ICLAGPA 
KPNREDVSS SQEESLQLNS I PPTPTLTS 
TAVKSRQPLWGLKEMEEEDGSELDF 
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G=Glycine, H=Histidine, I=IsoIeucine, 
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nucleotide deletion, \=possible 
nucleotide insertion 


5043 


18944 


A 


5080 


3 


192 


GDGSELRSCHCTPAWLTERDTSSQKKKK 
xvciivxvx n vi it f\ir & li r c c xjxjxjEiIVjSJctSJIn v^J\.xr 

NMKKVL 


5044 


18945 


A 


5081 




171 


CFCTjGPVTTVTRQT J.K'DOT rvx .t'i?dwt nn " 

v»c v.i/jr vvj v v»t*OJL»lJxVfc'yxA..r xAarlTxr YttiK^lJ 

AROKFRS VTiVR ATVTTT .TiPT .l/TT? CTD "D 


5045 


18946 


A 


5082 


94 


1 


DFWPGKVAHACNPSTLGGRGGWITRSGV 
QDT 


5046 


18947 


A 


5083 


142 


3 


FKRFS FIiSLPSS WGRKIA 


5047 


18948 


A 


5084 


257 


177 
it/ 




5048 


18949 


A 


5085 


143 


1 


GGESHSDTQAGVQWCYLGSTQPPLTLTF 


5049 


18950 


A 


5086 


3 


87 


HVGQAGLVLLASCDPPSLASQSDGITV 


5050 


18951 


A 


5087 

JvO / 


j / J 




T.WQT.COUTRTDCinfnTVUMT T?7\ nv»fT>T nmr 
lxtxo JjoyrtJLA 1 Ki> JsJSi»Ar 1M Ur Ab \ j 1 J_»S P V 

STNWVGFNPISFEGLMLI I FFFFF 


5051 


18952 


A 


5088 


40 


203 


IFLPLLLVSFFFYILTWGFTFFWGLTLI 

Mr)T.T.NT7?P , T«V^!W'C/3TQC , 0!iJBV^oon/^c"T r* 
"i-'-UJJiv r c c oivovjx o con 


5052 


18953 


A 


5089 


1 


145 


^MRVPAQLLGLLLO^LSGARCDIQMTQ 
S PSS LSASVGDREDGDVDAA 


5053 


18954 


A 


5090 


146 


328 


FMPFPLLESLEIFSSKLFNPPNLFFFFE 
1oSC^VAQIX3VQWYDIjGSLQPPSPGFKR 
LSCLS 


5054 


18955 


A 


5091 


338 


1 


PNLPSVQLPPTTSCTPI1IGDI1I1VRRAHF 
AGWKHQGINSPSLPMAPREGMEAPRNPF 
HVHX k JjCIjIMGFHIiAVSKLySLFYYyyY 
YFS RWS FTLVAQAGVPWRDLGS LQPLPP 


5055 


18956 


A 




X 

1 


52.1 


F f li I F P X r PS S PKNKNP P PPHYNPP P P P 

FRTPPPPPYPPLSPPKSPPPPPRVDPPL 

PYSIFPRPNLISPPPYSPFYliLPQIiQAF 
p 

XT 


5056 


18957 


A 


5093 - 


133 


3 


AQTCTPSTQINSKWITDLNVKGKTIKLL " 


5057 


18958 


A 


5094 


170 


350 


AGGQGGNFYSLQPNPPWIRESSHSTIiPK 
FWNYRHTPTGPPNFGFFFLKMGFGLVAQ 
x iru 


5058 


18959 


A 


5095 


133 


274 


RDII IFLESGIKGYFLFFSDKITSELVS 
PT iYiTiirNwirTDVT?nT ruTTm/^ 

rvX. VJU MN V¥ 1\X-K1\J£ V7J_lLfl5 V Aj 


5059 


18960 


A 


5096 


2 


188 


REMQ I KTTVRYHLTTARMVLi I KKSEKNR 

CWHGCSEKGTItLHCWWECKnLQPLWKIG 
WRVDAA 


5060 


18961 


A 


5097 


235 


81 


FPKKKKAQIKKPQAAQSLNIjSLSLYIYA 
ft y ththrhthtoththty t tt t q 


5061 


18962 


A 


5098 


355 


169 


KIlxLLITDSCAHISCRFSECMKYFNFFF " 

HPHSVYVTDDKIRLLEEQLPHVFSNKME 

PFKVCN 


5062 


18963 


A 


5099 


160 


1 


FSASYFNSKKKLIFLLLKLRWNFFVCFF 
GMESHSVAQAGVQWOxIjGSxjQPLPS 


5063 


18964 


A 


5100 


jot 


151 


GITGVSHYASQEFLKKEFHSAHLIPLQG 

TPVDfiPTTZxfiPYQ T? a T. 
x ^- v \£ x tr x o Xj/i-Lt 


5064 


18965 


A 


5101 


205 


3 


AASPGVQWPDSRLLQPRPFGVKQFFPLS 
FPSSWGYRGAAPPPGHFFFFFPSRDEIS 
VOLPGWSPTPE 


5065 


18966 


A 


5102 


349 


207 


DGASLCCPDWGLELLSSSGLPALAtiQNP 
EITGVGHHTQPWTLSLLIY 


5066 [ 


18967 


A 


5103 


425 


0 


CEFFKNKDFCFLSl<XK3CKKKxaciCXKl^ 
KKKQSSSQTSYS 
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nucleotide insertion 


5067 


18968 


A 


5104 


437 


184 


LFLFFFPPPKKTCSQKTPRSVYSPPPKK 
KKKNSPPPLKIiGPPQEIFKTPPPPIFFF 

WWMWtPQ WWWfCfZfTB PP FT CT .WQ PPT. 


5068 


18969 


A 


5105 


401 


10 


MFSPPFGGPRGGVPLRPGFKNPPGPKGK 
TPFFLKNQNYLGPPLYSPFLERLNKKKA 
FTLEAKGPNNPNSSRSPPPGAQKKNPPF 
KKKKKKKRKI I FMPQAHPAVYVVCSGNQ 

Lie* JL X J: c* X o A XouJrljSdXrX 


5069 


18970 


A 


5106 


413 


2 


PNKKPSPPPFSPPHJCTPSSPSPPPPPNK 

PTPTP P D PNTnTNTT? O D DOP D P VTTtfNP PtfYP 

PPPQKKTPPPPKKKKHPPPPHKKTPPPP 
HTKNPPPPPPLPPSLSIPPLFPLYINSP 


5070 


18971 


A 


5107 


269 


115 


GFFFFFFFSFFFFFFFFFFFFFFFFFFF 
rrrrrri ru^uni x jl u i\x xuo x oin 


5071 


18972 


A 


5108 


238 


4 


QWLPLVDRIWILLFFDSLNLKMFPQYQ 
IGl^OKVKNNTTFYFLFSFFLRWSFTL 
\/Tn7i<*ivnwr*nT^iPT j^ppppp ptt 

v iy/ivj vyw i~xjxiV? ir xj^xr tr Jr jt r^r t\. 


JV 12. 




A 


DIUV 




A 
U 


cf pop aQ PTf Tf TTTTIf TTKTCK ICTHf KKTCfCKTf TClf 
KKKKKfCKKKSSK 


CAT} 


15:7/4 




Jl lU 


1 
1 


2.HJ 


VSKKKKS FRAJuDI FFFNGKKI LGTLRGK 
NQKPPREVSSGNYFHFKGLFGPCYK 


5074 


18975 


A 


5111 


78 


4 


LGMVAHACNPSTLGGRGGRITRSGV 


Dv/j 


1 QQ*7£ 


A 


jIIZ 


o 

z 




NFL 


Jv/O 


1 80*7*7 
loV// 


a 


till 




o 


T YKTIVT iR PfWT ,R T .T .TWnNRWT .P TP BPT 

RIIIPSQDVLHS 


5077 


18978 


A 


5114 


132 


2 


DDI^SLQPIiPPRFKRFCCLSLLDSWDYR 
pppT.ppaTnpapaAP 


*?A7R 
Jv/O 


1ftO*7G 




Jl ID 


l 




SAGITGMSHRAWPSLFSILS 


5079 


18980 


A 


5116 


213 


3 


GAVGFSSGCFLLYLYALIiGPFPPLPLLQ 
IiKF I OTP<^VAHVCNPSTLGGRGGR I TR 
SGVRDQPDQHDET 


5080 


18981 


A 


5117 


407 


0 


LGWAVWFTPVIPTLREVKAGGSPEVRS 


5081 


18982 


A 


5118 


215 


414 


KIilPKEIRNNGGSQQRKRAQU5KIiEQTN 
PRRGKGVFFF 


DUoZ 






<1 1 Q 

ji iy 


A(\ 
4U 


*V>£ 


K3CGIKPQKK1KGFFQGK3^^ 

LIQGFFQKKKOTSKKKISQKIOCKNGE^P 

LGKGFCPKKKT 


5083 


18984 


A 


5120 


99 


2 


GRSTRHSPAHTHTHTLRHTGTIjIxH^ 
RHSP 




1O70J 


7\ 


j i^i 


j 


919 


RT,PT.r t Hf t TPTMA f rPVPT ^QKKKFKKKKK 

NTVGKKKGl^GKi^POTKl^ 

FLFS KFFKTRGTTI I 


5085 


18986 


A 


5122 


262 


351 


IT^TIxAHAOTPSTI^RGGVJII'xRSGVQ 
DQ 


JVJOO 


10701 




J I jLj 


j4U 


n 


KKKKISOAWWWVPWPATWEAEVG ! 


5087 


18988 


A 


5124 




2 


LMPI I PALWEAEVGGAPGQEFQTSLANM 
VK 


5088 


18989 


A 


5125 


310 


68 


LSPimCLGFWGRVIYPPPPPKVSPPVFG 
APPPPBKKKKPPPPPPKSSIK10>TEKKI 
CLAASGKGIxEVCFCKKPLIQSYFL 


5089 


18990 


A 


5126 


343 


131 


SRPRRPGIiELIjTS DDLPAPATQS AGITG 
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VSNHTQIWPISVGDLMNTGVVIPENRIV 
SELLREIHFLRICL 


5090 


18991 


A 


5127 


2 


145 


FSLLGSSVGRLEGTGTISTHCTLPASRD 
SPASNYRLTPPCPADFLYF 


5091 


18992 


A 


5128 


1 


130 


SRSVTRLECSGTISAHCNLPGSSDSPAS 
DYRLTPPCPADFLYF 


5092 


18993 


A 


5129 


1 


338 


VVRKEGIRFEKEKSKDFKNHVIKYLETL 
LYSQQQVCKIiWVTFI 


5093 


18994 


A 


5130 


346 


3 


DFFFFHPILFFFFFKKTLFFSPPEGNGG 
HTLFLKNTPPRGRIFFFIiPPIiLLWGGWA 
PPPPPKKNFFFFFFFFFFFFFFFFFFFF 
FFFFFFFMPPTQSLFKDQEGPVQGGEEA 
PV 


5094 


18995 


A 


5131 


3 


114 


FLHIGQAGLELLTSDDPPASASQSAGIT 
GMSHHAQP 


5095 


18996 


A 


5132 


292 


180 


AASTDS IEGHGASIiPSKKTPSEEDFETI 
KLISNGAYG 


5096 


18997 


A 


5133 


338 


216 


RGENRLPPGGRGFSGPKSHFCPPAWATE 
RDSLSKKKKNFV 


5097 


18998 


A 


5134 


337 


3 


I WLGNGKGIIRNNQI FSNKEAGIYILY 
HGNPWSGNHIFKGRAAGIAVNENGKGL 
ITENVI RENQWGG VD I RRGG I P VLRSTL 
ICFGYSDGVWGDEGKGLIEGNPISAN 


5098 


18999 


A 


5135 


227 


343 


NYVKEKLI PTWNWMVS IMDSTEAQLPYG 
SALTSWDPGQ 


5099 


19000 


A 


5136 


166 


375 


ATFVSLGVFCSAVILLYFKNMMKLDLSL 
TPHTTINSKWIKRLHTRPEAIKFLEVNI 
GKKFFDIGHEIIIi 


5100 


19001 


A 


5137 


90 


2 


KTTLWPGAVAHACNPS ILGGRGGRITRS 
G 


5101 


19002 


A 


5138 


213 


379 


PTFQSVGETGSLLKMHILGPGAVAHACN 
PSTLGGRGGRITRSGVQDQPGQYGETP 


5102 


19003 


A 


5139 


384 


2 


FFFFFFLSVMESCSATQAGVQWCDLSS 


5103 


19004 


A 


5140 


41 


317 


TMSRDRPSDKTWTYNRSNWMPDDGAPF 
RYSFSALKDRHNAVEVNWIDPNNGWETA 
TELVEDTQAIARYGRNVTKMDAFGCTSR 
GQAHRAGL 


5104 


19005 


A 


5141 


29 


523 


VKAEAAYKKADDIWNLRKDDYFVNDEAR 
ARYWDDREKARLALEAARKKAEQQTQQD 
KNAQQQSDTEASRLKYTEEAQKAYERLQ 
TP LEKYTARQ E ELNKALKDGKI LQAD YN 
TLMAAAXKDYEATLKKPKQSSVKVSAGD 
RQEDSAHAALLTLQAELRTLBKHAG 


5105 


19006 


A 


5142 


432 


144 


GGFLPKAFLPPKKKKGAFFPPPGKGAI P 
PLFWGGWTVFLIPEKNPPRGFPPPKRGE 
KPVGGPFFRIFAPPPQKKIGGGGFFLER 
FFFWDWGKTGP 


5106 


19007 


A 


5143 


403 


6 


FFFFKKQIGPRGGGPPQKSPPLGGGGRG 
GSPRPGVLTPLGPQGKNLFFLKKKKKLG 
GGGGPRNPNFLGGGPGKSLYPGGQRFQG 
PKILPFFPPRGKKKKIFFPKKKKKKSHR 
KLIQTLILQTGKLMKMLWRT 


5107 


19008 


A 


5144 


115 


427 


KKKKKKKKKKKKKKKGGGPKKKKNFSPG 
GGKKKFFFNGAKK3CKPRAPVKKTGEKKK 
GGKKKKKCFEKNPFFGGGKKKKKKCSSS 
YPCPKKQ 


5108 


19009 


A 


5145 


2 


79 


GCSEPRSCHCTPAWVTEGDS ISKKK 


5109 


19010 


A 


5146 


317 


444 


IHQPVCVAHAYNPSTLGGRGGHITRSGD 
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RDHPGQHGETPYPL 


5110 


19011 


A 


5147 


3 


413 


SFYRGFI PVLQTVTVLI IGDPSLHGDAW 
SWMEFFLTVIFIALWMLPLIGLTIWNA 
IWFQDIADLAPBVSWTKPHPFPSVSQI I 
ADMLFPLLLQALFLIQGIAVSLFPIHLV 
GQLDSLIiHMTLLYSLYCSEYRLVIN 


5111 


19012 


A 


5148 


409 


247 


QTKFRHVAQAGPHFLGSSDVPTSASQSV 
GITGMSLHAQHQHFFTASMGRAFGEN 


5112 


19013 


A 


5149 


284 


1 


AEASRAHMEAHARTQGTPGS KAEGVGP V 
BEYRIWRRPGPDRAHASDSEESGEPGGP 
PPHPANFVFLVETGFFRVGQAGLEFPTS 
GDPAASASPV 


5113 


19014 


A 


5150 


439 


120 


FGSPRLECSGVISAHCNLRLLGSSDSPA 
SASHVAGI TARPNARI IKAFGAPSTCTV 
VWERHEQQTCIQRSDLPAGRQKKTGPKN 
RPGLVAFPFNASPLGGPGGRIA 


5114 


19015 


A 


5151 


415 


0 


FFFFFFFFFFFFLPFVQSFYIYIYMYI 


5115 


19016 


A 


5152 


67 


179 


ICIiSLFLTVHVCVCVCVCVCVCVCVCEM 
VFGVSIPCN 


j i k\j 


19017 


A 


SI 51 


280 


2 


KTPFFLNLKEKTRAPFFLKKTFFKKKPF 
KIFFSKKRNFFFFPPPGKRPPTLFKKBCF 

ffffqkkknfffspkkkkkkkkkkgrrs 
rsrtsprv 














5117 


19018 


A 


5154 


296 


406 


DRVLLCHSSCSEVAESRLTAASTSWAQA 
ILLPQPPEW 


5118 


19019 


A 


5155 


114 


402 


VWRIKASVEKKKKKKKKKKQKKKKKKKR 


51 19 

J 1 17 


XP\J£.\J 


A 




1 10 


1 


QRFLLTLKSLILRLGAVAHACNPSTLGG 
QGGRIMRS 


5120 


19021 


A 


5157 


382 


95 


SSSSPHFLTPSQLRLFFPFFPLKIFFFP 
KAFNFCGGVFPIFSPPKKKFFFQNSQVG 
FKNPPQKEKNFSFPPPVKFGPPKGFFKR 


5121 


19022 


A 


5158 


2 


350 


TLQPGRQSETPFQKKKKKKKKKKKKKKK 
KKKKKKRGGGFLKNLGGGPKIWGVEKFF 
FFFFGGGYKKPLGFFIiEKIFFLGGGIWA 
PPPPKKISFWGKKKFFWGGGGKTPLFFC 
GGKK 


5122 


19Q23 


A 


5159 


1 


208 


PTRTLYLVKLGLMGPABIFLSSFQEIRL 
IDYDKMVDHRGVRWAFGQWAGYAGKYA 
RAYITHFLNWLGE 


5123 


19024 


A 


5160 


1 


208 


PTRPLYLLKLSLMGPVEIFLSSFQEIRL 
IDYEKMVDHRGVRWAFGQWAGVAGKYA 
RAYITHFLNWLGE 


5124 


19025 


A 


5161 


419 


25 


AQKKKKKKKKKKKKKGGGPLKKNFGGGK 
NKRGKKKKNFFLKRGGKKKTGGILEKKN 
FFGGGKKGEKPPQKKKALKEKKKFLRGK 
GGKKPLNWGGKKKW 


5125 


19026 


A 


5162 


475 


150 


KPRTSGTARVPGDPPPRCLDDKRLPSRP 
VIFVFLAETECDRVGQDGLQLLTSGDHP 
HLTFPNCYNYRREPPRPTSSGFQHDIET 
LHYRRGKADRFKAHLPK 


5126 


19027 


A 


5163 


237 


409 


FHNQKTYLDSALCTLVYFWFVLRDGIF 
LLPQLECSGTVMAHCNLKLIiGSRSPPAL 
A 


5127 


19028 


A 


5164 


373 


476 


LGFILLSFLETRSHSVAQAEVQWCDHGL 
LQPQTP 


5128 


19029 


A 


5165 


374 


132 


IFYSPFLYYH>n^KYHKIKNLKIKHMWL 
DAVAHACNPSTLGGRGGRITRLGDRDHP 
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GQHDETPSLLFLWTFASSLSGFPIj 


5129 


19030 


A 


5166 


332 


2 


RPQGLANCFPGALPPTKSEGFPGQSPGF 
QIWGGKAFFLRGFPtfFPQVESKGAVS PP 
CKVPPPGLRPFSAPTPPKNGDKRGPPPG 
RATGFFFFFVFLVETGFHHVSNDGLY 


5130 


19031 


A 


5167 


238 


1 


GGFTGTGGKYEPAGIVHRGEFVFTKEAT 
SRIGVGNIiYRLMRGYATGGYVGTPGSMA 
DSRSQASGTFEQNNHGSAGESRG 


5131 


19032 


A 


5168 


1 


89 


FFFFKTESRSVAQAGVQWCTLGSLQPPL 
P 


5132 


19033 


A 


5169 


3 


127 


DEDFSDFDEKADDEDFVPSDAS PPKTKT 
SPKQVSYLIWVLS 


5133 


19034 


A 


5170 


20 


254 


IAPPARIRHEERERERERERERERERER 
ERERERERDTRAEWRARLFFFFFFFLPA 
DQCFFIKPSGQRFLPPGGGGVP 


5134 


19035 


A 


5171 


36 


479 


TXGXDNKKDLL IS VGDLVDRGAENAECL 
ELITFPWFRAVRGNHEQMMIDGLSERGN 
VNHWLLNGGGWVFNLDYDKE I LAKALAH 
TADBLPIiIIELVSGDKKYVT CHADYPFD 
E YEFGKPVDHHQVI WNRERI SNSQDG IV 
KEIKGADT 


5135 


19036 


A 


5172 


278 


448 


RFLCPLLHPFFSSETKSLTLLPRIX3CSG 
VIIAHCSLKLLGSSNPPTSASRVAGMTG 
S 


5136 


19037 


A 


5173 


205 


1 


HARLIFVFLVETGFHCVGQAGLELLTSG 
DPPSASQSETPSPLKILKSAGCGGANST 
ALASGLLFSLGP 


5137 


19038 


A 


5174 


409 


194 


FFFLRWSIiALSARLVQWRDLGSRQAPPP 
GFTPFSCIiSLLSISFQPSATYMRFLNTI 
TIVEYSFAYFPISQP 


5138 


19039 


A 


5176 


339 


472 


AKFTCISSNIKIiSNTRPGTVAHACGPST 
LGGRGGQITGSGDGDH 


5139 


19040 


A 


5177 


426 


148 


GGRGPPPLSHPFGGGGGGGPLRAGGQKN 
PGPKGETPFFGQTQKKNRGGATPPFSQK 
LGGEKHKNSFTPGKENSIKFDFPPAPPT 
WGEKKNFFF 


5140 


19041 


A 


5178 


328 


474 


NOTE YNKNRLGAVAHACNPS TLGGRG RW 
ITRSRDRISP 


5141 


19042 


A 


5179 


448 


330 


ETGFCHVGQAGLELLGSSDLPASASQSA 
GIIGVSPHAWQ 


5142 


19043 


A 


5180 


259 


14 


LRYKAILCSWIRTTNIVKMAELPKVIYI 
FNAIP IFKIPDMFCRIARF ILKCMWNVK 
VSTIDKTI FKKENRVGIPPRVLMW 


5143 


19044 


A 


5181 


2 


423 


KFYATLVRYVGDRKNLVCREMSMALLSN 
LAQGDALAARAI AVQKGS IGNLI S FLED 
GVTMAQYQQSQHNLMHMQP PPLEPPSVD 
MMCRAAKALLAMARVDENRSEFLLHEGR 
LLDIS I S AVLNSIiVASVI CDVLFQ IGQL 


5144 


19045 


A 


5182 


182 


312 


FFIQVGFHHVAQAGLKLPSSSSPPHLAS 
QSAGITGVSHCAQPG 


5145 


19046 


A 


5183 


301 


472 


GI PFFFFFLRRS FALVAQDGVQWRDLGS 
PQPPPPGLKQFFRNSTAWTKGLQPFPRL 


5146 


19047 


A 


5184 


216 


413 


KPXXKKKKKKKKKKKKKKKKKKXGGAVK 
KKGPQKTPVKPPGGFFFAPPLGVSPPPG 
VFFAGGGAPP 


5147 


19048 


A 


5185 


357 


465 


IKPTKI LAPGPGAVAHACNPSTLGGQGG 
WITRSGD 


5148 


19049 


A 


5186 


467 


0 


FFFFFFFFFRFLFFFFFSEKK 
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nucleotide deletion, possible 
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5149 


19050 


A 


5187 


346 


3 


'RFIiPLGQGGVEILTPRSTCIiGPPKWWDY 
KGEPIiRPAQKCFFFKKKRRRKKVQNSVA 
VYMVYHSMCGENKLRWRWWGI FTHTHTH 
THTHTQKKQNILSRTHKKLVITTGSBEG 
TL 


5150 


19051 


A 


5188 


11 


109 


GETP S LLKVQNI XWAWWHMP VI PAMWEA 
BARES 


5151 


19052 


A 


5189 


3 


151 


QLIiRRPRQENHLNPGSRGCSEPRLHHCT 
PAWATVQDS ISNTNNNNNKCP 


5152 


19053 


A 


5190 


2 


373 


AVAADKGVPLYRHIANIAGNPDLILPVP 
AFSVINGGSHAGNKLAMQEFMILPVGAS 
SFKEAMRIGAEVYHHLRGVI KAKYGKDA 
TNVGDEGGFGPI ILDNNDALELVKTAIQ 
AAGYPDKGAIGR 


5153 


19054 


A 


5191 


55 


421 


ARVADVCES MKEHLLVLVERAKYIPGFC 
ELPLDDQGALVTAHAGBHLLIjGATKRSM 
VFEDVIiLLGNDYIVPSHCPEIiAEMSRVS 
XRILDELVLPFQELHIDDNEYAYLKAII 
FFDPDKETEA 


5154 


19055 


A 


5192 


163 


466 


TCFLGSQSAGITSVSHCPSREVFFLKLI 
HWRQGGQVALLVATPHS PCCPQYRLAPI 
PRARHDFACASLIFACILLVHVLLMPRV 
SAGRGVGVRPAGIQAGR 


5155 


19056 


A 


5193 


470 


0 


ECIiWPGTVAHASNPTTLGGRGGRIMK 


5156 


19057 


A 


5194 


491 


281 


RGIiALLPMGQCSGMMLAHGSLDLLiDSSD 
PPASASQSVGIMGVSHHAWPSLVNISFV 
CLIHRSPKTKPEGR 


5H7 


19058 


A 


5195 


3 


222 


LSFHSLHRCLYMLGTTSBNVSPFSLLEL 
LSRLATLUGDYCGSLSEGTISRNVALVY 
ELLDEVLRLESRCVAQA 


5158 


19059 


A 


5196 


20 


191 


STWWNSSRGGGCSEPRSHHCTPAWVTER 
DSVSKTKKKINNKPALALPAVHPTSHPG 
S 


5159 


19060 


A 


5197 


460 


350 


RRLRRENHLNPGGGGCSELRSSHCTPAW 
VTERGPSD 


5160 


19061 


A 


5198 


174 


41 


SLGELLTECLFETKSHSVTQAGVQWRHP 
GSIiQPPPQGFKRIPPH 


5161 


19062 


A 


5199 


131 


454 


FEYFKNRKPFFFFFFFGKGVSFCAPGGK 
AGDPFKLREPPPPRVKGFFGPPPPSGGN 
NGPPPPPRVFFWFFKKRGGSPFWPGGVL 
TPGGGPPPPAPQRGGINGLDPPAR 


5162 ■ 


19063 


A 


5200 


168 


51 


TVKTPIRPGAVAHACNLSTLGGRGMWIT 
RSGARAQPNR 


5163 


19064 


A 


5201 


414 


497 


LLFXXXXDNGS IYNPEVLDITEETLHSR 


5164 


19065 


A 


5202 


381 


614 


VQPGAPPEWVALSTCPSAAPEGSQQPYI 
PPTFCFFPVKWLLSVTWLYLFIYFLSWI 
SVAQAGVQWCNHGSLQPRPSRL 


5165 


19066 


A 


5203 


377 


485 


PKEPGPQPCAPQPQPHAPQPQPRAPQPQ 
PCAPQPQP 


5166 


19067 


A 


5204 


54 


434 


MRTPENLELTNPQEFGSSWAAVECPDTL 
DPRDMCVLNPLREPFAKKECSIIiLSEVF 
EICHPWDVTWFYSNCLTDTCGCSQGGD 
CECFCASVSAYAHQCCQHGVAVDWRTPR 
LCRECPRQSPEGTE 


5167 


19068 


A 


5205 


201 


418 


GDAGAHAVLRLLPGHPGTSHLPFS SLTT 
SPLGLKRVTGGFNSKNRCDARTYCYLLP 
TFAFAHKDRDVQDETY 


5168 


19069 


A 


5206 


27 


177 


NSFHHAGQSGLELLTSSDPPASASQSAG 
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ITGVSHRAQPKAGTFLSPFHR 


5169 


19070 


A 


5207 


6 


1175 


DLGPHTPAWTRPKREDLVYQSTVRLPEV 
RISDNGPYECHVGIYDRATREKVVLASG 
NI FLNVMAPPTS IEWAADTPAPFSRYQ 
AQNFTLVCIVSGGKPAPMVYFKRDGEPI 
DAVFLSEPPAASSGPLQDSRPFRSLLHR 
DUDDTKMQKSLSLUDAENRGGRPYTERP 
SRGLTPDPNILLQPTTENIPBTVVSREF 
PRWVHSAEPTYFLRHSRTPSSDGTVEVR 
ALLTWTLNPQIDNEALFSCEVKHPALSM 
PMQAEVTLVAPKGPKIVMTPSRARVGDT 
VRILVHGFQNEVFPEPMFTWTRVGSRLL 
DGSAElFDGKELVLERVPAELNGSMYRCT 
AQMPLGS TDTHTRL I VFENPNI PRGTED 
SNGSIGPTGARLTLVLALTVILELT 


5170 


19071 


A 


5208 . 


401 


3 


FFFFFSETESHSVAQAGVQWHHLGSLQG 
PPPGF 


5171 


19072 


A 


5209 


402 


229 


GFFKKGPPFFFFFFFFFFFFFFFKMVWL 
RLSGSKVEWVWGSQKLYMNFGCCRGRSW 
FT [ 


5172 


19073 


A 


5210 


198 


2 


PPQAKILSSSSPPIRPPPKKGFFPKNPQ 
VGFYSAPHKEKTFTLPAPVKFGPPKDPL 
KRPPP 


5173 


19074 


A 


5211 


516 


222 


GSTLRLTQRSMPAS S STMAPSS KRLLLP 
RPERSVPAAAGTAGHHEASRNCGRGGAG 
ADEGPATKGDSHPKPGYCRAAHPSAAPW 
PPGPEXNFMRVG 


5174 


19075 


A 


5212 


103 


203 


DKVSLRCPGWPQTPGLKQSSCLSLPKCW 
DYKCE 


5175 


19076 


A 


5213 


409 


249 


RRNQVGGPRQLNPGKRI KSPGIDPYI YG 
QLIFDKAVRAIWWRKGSSFLQKAIiE 


5176 


19077 


A 


5214 


V 


170 


ERKII/3YI QliRKGPNWGPYGLLQPFAD 
AIKLFTKEPLKPATSPYPLQSPAFPLKP 


5177 


19078 


A 


5215 


270 


464 


GQLNKNVWGWDPGKVTFRDGVSLYCLHW 
HAVAIHGHRHSVPQPQTPGFKQSCLRLP 
SSWDYMPLY 


5178 


19079 


A 


5216 


156 


248 


PDKSYI ICTCTCVCVCVCVCLCVWVCME 
CP 


5179 


19080 


A 


5217 


412 


2 


PPPQIFFFSPPHFFSPPPKGGFFPPPPP 
KIFFFPPPLFFFWGFFPPFSPPPKKFFF 
PKSPPFFFFPPPKKKKFFFFPPLFFFPP 
PGFFLTPPPP FFFFFFFFFFFFFFFFFF 
FRMGCDRWHGEFWILRDGFD 


5180 


19081 


A 


5218 


165 


I 


Q VKYHNKKTHNP I K I G KRS EQTFHE KDT 
QMANKYMKRSPTSLRIKGIQIKTIMRY 


5181 


19082 


A 


5219 


408 


88 


PSSSPHFFSPPPLGGVFPPFPFKIFFFP 
PGIiFFFWGVFPLFSPPPKRVFFPNSPPG 
FFFPPLFGKKFFFSPPPFFWPPPGFFLS 
PPPLFFFFFFFFFFFFFFFF 


5182 


19083 


A 


5220 


3 


259 


NLLLL I VP I L IAMAFLMLTERKI 1X3 Y I Q 
LKKKKKRK3CKKKKKKKKKKKRGGAFKKK 
PRGGQNFPGEKKKKFFFLREVKKKPRGN 
F 


5183 


19084 


A 


5221 


10 


479 


KLMPAESDGRHRERERERERERBRERER 
ESSSARARCVSLSLDRYKNRVYMKVAEA 
LCECRLLAYISQAPTQMFFLLRLINI IH 
AHTLTQENDMCLHTTLE I LTLARQRERV 
PLYLRVLQRMEHKKKHPGFVLNNSHNLV 
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RLWQQLYLHRDRDS I C 


5184 


19085 


A 


5222 


142 


56 


KFQPGAVAHTCNPSALGGWGGQI KRSGV 


5185 


19086 


A 


5223 


14 


Alt 

471 


T»S01CETn.PVTO TVMTDAGRPHSEAAYTL 
G PLLCRGDQS FWNSASFNTETS YXiHFPA 
FHGELTADVCFFFKTTVSSGVFMENLGI 
TDFIRIEIaRGKIATLDKSQGTHYIiAIKA 
LTQKKKKKKTRG 


5186 


19087 


A 


5224 


A OA 


rv 
U 


PPT? FPP P PFPPT ■ T VAOFYFIi 


5187 


19088 


A 


5225 


16 


422 


VRRTARIRHBGKPYECNACGKAFNRSAH 
LTEMQRTHTGEKPYVCKECGKTFSRSTH 
LTEHLKIHS CVKP YQCNECQKLFCYRTS 

LTKHKRIHTRERPYQCKKKKKKK 


5188 


19089 


A 


5226 


99 


436 


GHPSFE I PATMTAAPAGFPPQVPWEDVR 
YLFDEIMYGGHITDDGDCKLCRVYIiEEF 
MNPSLLAALSGTHA05EKPRLPANSHVS 
EPSWKWII^PQSSLQMTJAIiANILTATS 


5189 


19090 


A 


5227 


473 


9 


PnTTaTOWT.aWTlPTVnPPPWPQRPCO T A 

DIRQVEASTRYLGTALYVJIAAS INIKPG 
HDYYFYIRSVNTVGKSAFVEAVGRASDD 
AEGYLDFFKGKIAESHDGKELXiEKVELT 
EDIASRIiEEFSKEVTKDASDKWNAMWAVK 
IEQTKVGERDVTSG 


5190 


19091 


A 


5228 


1 


473 


zr tr ir X UxvXi irDi \_ lAUKIM OnK V 1 i J. u«.vjy *r no 

FAD YTLGPLLCCGDKS FWNSASFNTETS 
VT ,HF P AFHGFXTADVCFFFKTTVSSGVF 
MENLGI TDFIRIELRGRLATLDKSQGTH 
YLAIKAIjTQKKKKKTRGGAGPPFPLIGS 
RITIHGPPFNNAAMREK 


5191 


19092 


A 


5229 


170 


25 


CTPLLGPGTVAHACNLSTLGDQSGCIMR 
SGVRDQIiRQHDETPSLIiKRI 


5192 


19093 


A 


5230 


465 


0 


VVLYPPQKAQKKKKKKKKKKKKXKKKTK 
ARG 


5193 


19094 


A 


5231 




JO 




5194 


19095 


A 


5232 


79 


451 


AGSNLQEHRGLRPESPFNPGGGGCREPR 
PJeHCPPAWAKPJOCLPiKKKKHGRNQEVA 
QGPJiGPFFPGLGPEGFLPRRPGSRKRRK 
GRNVPG P FXiKGPGAVLFSRHRRGGADIiR 
PPRKGPRGGGGG 


5195 


19096 


TV 
A 


5233 




40/ 


VOOOORAOEOOOOHPVIiHLOPOO IMOLQ 
QQQQRQISQQPYPQQPPHPFSQQQQQQQ 
QAHPHQFSQQQLQFPQQQLHPPQQLHRP 

QQQLQPFQQQHALQQQFHQLQQHQLQQQ 
QIAQLQQQHSLIiQQQQQQQIQQQQIiQRM 
HQQ 


5196 


19097 


A 


5234 


194 


45 


Mir^FFNFKIHFFFFLRQSIiALVAQAG 
VKWCNVGSQQPPPPGFKRNST 


5197 


19098 


A 


5235 


204 


475 


AS ITVHLWYVEKCYSQSTEKVTPMPSSS 
WFGPQPDAQLKHGLRPGAVAHACNPSTM 
GGRGGRITSSGDRAHP 


5198 


19099 


A 


5236 


59 


357 


NHWRKI YCKVYNRKKI RHNTTNTTKN YQ 
NFQRNRTKSSCTIFRIQSHYNKHIKKHI 
QRKRQSIFOTIxFKTQMI^LVWIFKVVIV 
ILTKIKTPKTLKING 


5199 


19100 


A 


5237 


447 


166 


EliAPWQTQYNNMNAVNLFFFWIKAPGI 
PTPSFASRRTHSPFFSFLFFLFFFIiGKK 
GFPLCGPGGPKSLNLKTPPLGPPKGGK 
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5200 


19101 


A 


5238 


34 


296 


LSVAEFLAFWQTQYNNMNAVKLFFAWIK 
VPGIPTPSFASRRTHSPFFSFLFFLFFF 
LGKNGFSPCGPEGPKFLNLITPPLGPPK 
GWB 


5201 


19102 


A 


5239 


340 


472 


SLNETHVRSPAHHRPGATAHACNPITLG 
GEDGLIMRSGDQDHPG 


5202 


19103 


A 1 


5240 


460 


0 


PKKKI FTI YFFYFFFFFFFFFFFFFL 


5203 


19104 


A 


5241 


161 


46 


SFYSTHRKPCHWPGSVAHACNPSTLGGR 
GGRITRSRDR 


5204 


19105 


A 


5242 


175 


36 


LKYYL I FI YLLSFFFLRQSRSVGQAGVQ 
WRDLSSLQPPNSTALQYR 


5205 


19106 


A 


5243 


11 


442 


DTIRWGLPTLGSKSTTNBKKREKRRKKK 
EQQQSEANELRNLAFKKIPQKSSHAVCN 
AQHDLPLSNPVQKDSREENWQEWRQRDE 
QLTSEMFEADIiEKALLLSKLBYEEHKKE 
YEDAENTSTQS KVMNKKDKRKKHQGKDR 
PLTV 


5206 


19107 


A 


5244 


459 


333 


FLRVTQAGLKLLSSSDPPTIASPKCWDY 
RHBPLCPAQWSVS 


5207 


19108 


A 


5245 


229 


436 


FNSTWRKRGLGGFIHLNLIQTSFFAHL 
VVSLTYVIjCVSFFVCLFLRQSHFVTQAG 
VQWHNLGSLQPLP 


5208 


19109 


A 


5246 


227 


30 


VFFARWVFLRQDLDLSSRLECSDVILVH 
CNVRLAGSRGTPS S VSKVAGTTGMLYHT 
WFFYEFHR 


5209 


19110 


A 


5247 


173 


442 


FLGSSSKAAITLYYCQYMEFFFFFFFFE 
KNFFFSPPGGGEGPNFILLEFPPPGLKK 
ISCPTPPRGWDYGVAPPPLVNFWFFKKN 
GVPPPW 


5210 


19111 


A 


5248 


255 


358 


TGPGTVAHTCDPSTSGGRGGQITMSGDR 
DHPGQH 


5211 


19112 


A 


5249 


3 


130 


QPQLAAPS IS WAPTSASQVAGTTGVCHH 
AWL I FLFWVESRRG ] 


5212 


19113 


A 


5250 


90 


1 


STRLEKNGTMSAQCNLRLTGSSNSPNSA 
SR 


5213 


19114 


A 


5251 


358 


447 


SFISYRRRSPSPYYSRYRSRSRSRSYTP 


5214 


19115 


A 


5252 


55 


202 


CWNTSMRGPIWEAEAGESLEPGGRRSCG 
EARTCHCI PAWWRAKLVDAA 


5215 


19116 


A 


5253 


330 


3 


EEMGFSHVGQGGFKIIASGDTPAWAFQK 
GGISGVSPRAWAGFI IFIWNFSGGPQTG 
FPFYPFRGIFFPRAKINYIRVLNPKGLC 
QKGVFFFFFSGFPHYRCRLCQI SV 


5216 


19117 


A 


5255 


175 


3 


MYLRFPLGLSLSERPGMVAHACNPCTYW 
ESKAGGSLSLGVQDQPREHGGTLCLQKS 
K 


5217 


19118 


A 


5256 


263 


1 


RPRRRFNRDKTALLQEMPFRTFTALGKL 
ISDFKDSKDGLTLYFVETASRSWLAGL 
ELLTSSDPPTSASQSTGITGVSHFSQPG 
IiAF 


5218 


19119 


A 


5257 


1 


103 


CHVSQAGLELLAS SDPPASASQSVGITG 
VSHRTW 


6219 


19120 


A 


5258 


128 


2 


IHKWRDCKMVQPLWKRAWQFLMNLNTHL 
S CDPTVLLLDILTQ 


5220 


19121 


A 


5259 


381 


2 


FFFFFSEMESPSVAQAGVQWCGLGSLQA 
PSPGFT 


5221 


19122 


A 


5260 


2 


173 


ENYLNMGGGGCS E PSLHHCS PALAIERlT - 

SVSTTTTTTKQQQQNNQLGLFSYPFNPL 

I 
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G=Glycine, H-Histidine, Wsofeucine, 
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nucleotide insertion 


5222 


19)23 


A 


5261 


1 


175 


IKGNQIGKKEV3CLFLFTDDMILYMENPK 
DSAKNYYKQSKTFCLP WIiGRS CKRTCP 
LA 


5223 


19124 


A 


5262 


2 


145 


QVSLKLLGSSDPPALTSQGAGIMGVSHR 
TRPRNTDFKNNTQKSKNTQ 


5224 


19125 


A 


5263 


182 


3 


STTSFLNFFLTEMVSHYVSQAGLKVLGS 
SDPPSSASHSAGITGMSHHTRPQQPLLN 
LKV 


5225 


19126 


A 


5264 


183 


369 


WNSLDFFFFFPSFFRGYSSMGKVQDAFI 
FYRRFIDKSKPRANTWGSIGGLYQQQIH 
PMDALR 


5226 


19127 


A 


5265 


329 


207 


RGCSELRSRHCTPVWVTEQDSVSQKRKK 
KSPCHLQLGTSQ 


5227 


19128 


A 


5266 


177 


13 


TLTILLNFFFLRQSYSVAQAGVQWHNLN 
SIiKPPPPGFKRFSCIjSLPEFQQGHNIR 


5228 


19129 


A 


5267 


474 


122 


FICSPHPGGFLTLGPKLNILGPSQPLFP 
IiRGGSKRQGIPFYSYKIKGPPGAGPPGS 
ARYSNPFGGYGGRVPIiPPKFPDPPGEHK 
PRFFLKKKKKGHQVPTSKYLPEYLFAYY 
NASLE 


5229 


19130 


A 


5268 


55 


311 


GIQTFGKNVVVAGRSKNVGMPIAMLLHT 
DGEHERPGGDATVTIAHRYTPKEQLKIH 
TQIADIIIVAAGSLSrSHAGVQWRNHGS 
h 


5230 


19131 


A 


5269 


238 


133 


NKN I WVQKKCVFY I YLPLICVCVCVCVC 
VCVCVCV 


5231 


19132 


A 


5270 


336 


517 


PVLYILEHSPYHAS I YQLALKKYQSRPG 
AVAHACDPNTLGGRGGRITRSGDRDHPG 
QRGE 


5232 


19133 


A 


5271 


262 


467 


VLSMRPR IHGSAAREEDEHP YELLLTAE 
TKKWLVDGKTKGTFPTTPGRTNSKGIF 
KVCDPEWKGKMS 


5233 


19134 


A 


5272 


213 


2 


MKIYINVYILLFSLKKRQGPTIiSPRIiEC 
SGMX IVHRSLKLVGSSDPPALTSRIAWL 
IGARHTPDCSVCKF 


5234 


19135 


A 


5273 


284 


105 


YSCSYSYFDEPVELRNSSFFRWNHSSDS 
YWKKKSSKDTEPVLKPPGYSARYECKTV 
GSS 


5235 


19136 


A 


5274 


290 


449 


IFYFSGRVRAKLSAPLAGMGNAKADSRG 
RSRTKMVSQSQRMYCLSALLYLSLC 


5236 


19137 


A 


5275 


248 


423 


NKYNITLMTKLEKOTTKiCENYRPISLLN 
IDMKILSKILQNQIQQYIQSI IHHDQVK 
LL 


5237 


19138 


A 


5276 


425 


30 


TRRGMPHRQGSPRKAPVSVRRQRVREEC 
GREPLLWFLQSQLLGRLRQETHLNWGGR 
GCSEPRSCHCI PAWATRANS I FCNFQAS 
SVEVRRSARKKLFSDILKRHNTITWRVS 
GLLLVDSYFGRLATPVRTQ 


5238 


19139 


A 


5277 


25 


185 


SLFSVABFAFNAYKVQTEKEEKEEARSK 
YKEAKESFQRFIiENHEKMTSTTRYK 


5239 


19140 


A 


5278 


263 


483 


VFTWFFQLFCVYKLIFSFLTECRSVAQ 
AAHAGAHWHNLGSLQPPLSGFKQFSCLS 
LPEFPRMYSVLSPSP 


5240 


19141 


A 


5279 


25 


252 


LETLGIiMTLVQCGG I PGFCHVGQAGLKL 
LTSGDLPASASQGAGIAGMSHHTRPILL 
HFIYFILLYHPPCLAYFI IF 


5241 


19142 


A 


5280 


326 


203 


IETESCYVAQDGLE LLGS SNP PTSASQS 
AGITGVSHHTWTT 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L== Leu cine, M=Methionine, 
N=Asparagine, P=*Prolme, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y^OTyrosine, 
X=Unknown, *=Stop codon, /^possible- 
nucleotide deletion, ^possible 
nucleotide insertion 


5242 


19143 


A 


5281 


210 


459 


NKNNKKFNEKKKKNYKKWARELYRHFSK 
DDIHVANRYMKRYPTSLIIRKIQIKTSM 
RYHLTPIRIATIPR 


5243 


19144 


A 


5282 


262 


454 


LLFTDYHEGMIiHSVKLLFCFLGWSFTLA 
IQAGGHWRDLGSLCPLLPRLKQSSCLNL 
LSSWDYKR 


5244 


19145 


A 


5283 


451 


303 


GTEFHHVGAGLELLGSSDLPVLASKSAG 
I IGMGHHAQPKQYS FVILTVQ 


5245 


19146 


A 


5284 


217 


489 


PVSISIILRCIiSLPTPGSRVLCMLEGMD 
GISLGWVPLPSLPVPIiHPLDIiSLPVCRQ 
VGGTKTG WRYVGETDFAKGEWCGVE LD 
KPLGKND 


5246 


19147 


A 


5285 


247 


99 


FFFFFFFFFLYGFFFCFFFFFFFFFFFF 
FFFLKNKTQPYFIYKRNILSM 


5247 


19148 


A 


5286 


2 


117 


PRVRPRVRPRVRPRVRPRVRPRVRKKKK 
KKKKKKKKKKGGGF 


5248 


19149 


A 


5287 


121 


3 


KFWPGAVAHACNPS PLGGRGGW I TRSGD 
RDHPGQHSETP 


5249 


19150 


A 


5288 


416 


48 


IGKPGSFFPPPPSPRGAPPPPPKIFFLP 
PPPFFFRAGFPLFPPPPQIFFFPKIPPP 
FFFTPPLKKKFFFPPPPIFESPIiGLFFF 
PPPPSLFSFFFFFFFFFFFFFFFFFFFF 
FFFFFFIiGVR 


5250 


19151 


A 


5289 


503 


0 


FFFFFFFFS PFFF FFS SS FFFFFFFFFF 


5251 


19152 


A 


5290 


70 


236 


IFCNSQLTSPHKHQKKKKKKKKKKKKKK 
KKKKKKKKKKKIOGCGGGDFKKKIGGAK 


5252 


19153 


A 


5291 


443 


287 


QFTKKKKKKKKRK3CKKNKKKKTKQKKTI 
IPKPPLF 


5253 


19154 


A 


5292 


334 


115 


KNPNALF FFFF FVDR I SLLP RQWHNLGS 
LQPLP PGFTQFSCLS LPNSWDYRCEPLR, 
PALLFFSSKSKIHIKPL 


5254 


19155 


A 


5293 


413 


3 


SSSPPIFFPPQKKKIFSPPPPKKFFFPP 
KTFFFLRGFFQIFPPPKKNFFPKKPQNF 
FFYPPKKKKIFFFPPPKIFPPPKIFLKT 
PPPFFFFFFFFFFFFFFFFFFFFFFLVS 
LEKGF PDAWADAWADAW 


5255 


19156 


A 


5294 


291 


11 


IQPYIKRLMQHDRVGIiSSEKQSWFRIYI 
YINVPGFQNKGFI I IGSSQKAQKMIFFF 
LDRVLLCHPGRSTWQSQPTAALKNWAQ 
KTLPPQPSE 


5256 


19157 


A 


5295 


2 


401 


NPRSTEAAI KYFLTQATAS 1 1 LL IAILF 
NNILSGQKKKKKKKKKKKKKKKKGGGPK 
KKKIGGGKKKPGGKKKIFFFIRGEKKNP 
RGEFEKKTFFWGGEKRANPPPKKKTLGE 
KKKILGGGGGKKFFFFVGKKK 


5257 


19158 


A 


5296 


49 


363 


RLQKKNKEKKKKKKiCKKKKKKKKKNKRK 
KKKKKKKRRNGGGGKLLWKGGGQ ILTKE 
KILILVAVS YTCNINNLGRQRKWI IGGG 
KSEPNYEFKRNIGAGRGGSLL 


5258 


19159 


A 


5297 


100 


323 


E YLNTCVRI CHSHAHFLPPS YVS FALLP 
RFLFKVVLIiYMQVFGEGTEAVKKSIiEGI 
FDDIVPDGKVKINVCFLF 


5259 


19160 


A 


5298 


595 


0 


VCVCVCMCACICIC 


5260 


19161 


A 


5299 


412 | 


141 


TVDTSYSEKKIIiFQILLLIDNGPGHLRA 
LMEMYKEINWSMPANTTSIMQLMNQGV 
ISIFKSSYLRNAFCKAIAAIHSDSFDES 
GQSNLL 


5261 j 


19162 


A 


5300 


216 


2 


SQDWLRKFCYHPRVFQRQPIQKLMGLFF 
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correspon 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D==Aspartic Acid, 
EXrtutamic Acid, F=PhenylaIanine, 
G=Glycine, H-Histidine, I-Isoleucine, 
K=Lysine, Lr=Leucine,M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R= Arginine, S=Serine, 
T=Threonine, V=Vaiine, 
W^Tryptophan, Y=Tyrosine, 
X=Ud known, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FCLPNMVKNGVFFFFEMEGRSWQAEVQ 
WHDNSPLQPKPPGLK 


5262 


19163 


A 


5301 


368 


3 


IFFFPQGKEGGGLFYIFFPPKKKBFFFK 
KKI FFFYPPPKKXKKI IRAAEKRGPPLF 
FLKKTRPNFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFLVREK 
VFNSTISTQ 


5263 


19164 


A 


5302 


389 


2 


AMAQASGSCLCSRIiLVDGFIPWGVFRYF 
IVBAGS CYVDQGGIiAIIiASS YPTTLVSQ 
SAGMTGIiSHRARPPLNGPPPGIjTPSI»SM 
VPKAOTFSLLEKPTGLTQALRITLANVS 
NAPGQTRSGLEGWGCFF 


5264 


19165 


A 


5303 


84 


333 


FHDCCRPERSRILRNFLLSSAGIQSIAD 
AYRLALPQGLLYGPTNF AP I INHVARFA 
AHAAHQGTASVRWGGSGSGGNQGLP 


5265 


19166 


A 


5304 


485 


93 


PPLFRRFSPPPPPKFFFSPRAFFFGQQF 
FSFFSPPKKSFFFKNPPGFFFFPPLKKK 
KIFFLPPLFLAPPKDSFLAPPPFFFFFF 
FFFFFFFFFFFFFFFFFFFFSRENI FFI 
III F I YLFRDRFLFCCLG 


5266 


19167 


A 


5305 


157 


2 


PPPPFFFFFFFFFVWWVITLNVELEPSF 
SPNTESQIGPEEAMERLQENRVE 


5267 


19168 


A 


5306 


I 


133 


TQESIMDLENDKQQLDERLKKKKKRPAV 
LKNPWGGQSLPGMARE 


5268 


19169 


A 


5307 


239 


2 


PPRNWGFFSPLSPIiKSSSPPKAFNFWGG 
VGPNFPPPKKRFFSKNPPGGFFYPPLKE 
KNIFSLPPVNLGPPRVFFKGPPP 


5269 


19170 


A 


5308 


454 


116 


FRLPS SSNSPAPASRVAG ISGFRCVGRA 
GLQLLTASDPPASASQGAGIADGVSFTQ 
CSMVPRLECSGM I S ACYNIiHLP AACLGP 
PKCRDCSLCLAATPSGK 


5270 


19171 


A 


5309 


123 


1 


CSVS KWYP iAFLKN I IMLWEAEAGGSRG 
QEIKTILANTVKP 


5271 


19172 


A 


5310 


153 


33 


WPGWAHTCNPPPLGGR5GLITRSGVQP 
QPGQHSESPSI 


5272 


19173 


A 


5311 


350 


3 


NSRNLILTQEHSPEEKNEFLFSLPLQSV 
SMNTTHSPLLSNSLPHFI IS FLPTRFLV 
GPSPTRSFPPAQSPDRISFSSRLECSGT 
ITAHRSLNLPGSSDPLISVSQVAGTTCM 
CHH 


5273 


19174 


A 


5312 


1 


392 


RTRGRTRGICKS ITI IHHINRTNEKKNH 
MIISVDAESAPLQIHSAKLKK 


5274 


19175 


A 


5313 


1 


239 


LKTSFHHVGQAVLEFLASSDLPAIASQS 
AGITCMSHCAQPYIS INFIPAAETQQSS 
PELVPLPASQKTPSFSTPLFSLP 


5275 


19176 


A 


5314 


345 


237 


LYFYIYFLKSACVI ILSTLCVCVCVCVC 
VCVCLCM 


5276 


19177 


A 


5315 


268 


412 


GMISSNNSSDSFGNCUIPGGKGCCEPGS 
YHYPPAWATDTFSKKKKKKL 


5277 


19178 


A 


5316 


112 


224 


PWMLENELGLHAS YLAMSTPLS PVEIEC 
ASKK1FTFC 


5278 


19179 


A 


5317 


377 


139 


KPPPRIFYU5PQKJCKKLFYPPPFKNCFF 
LGPPPPPIiFMIFFFFFFFFFFFFFFFFF 
PQWVMFLFFLLSASNQNSPSWSP 


5279 


19180 


A 


5318 


216 


385 


RGVLLCCPAWSQTPGLKGSSCLSLPKCW 
D YKHELPRP ATS CNS ILFFNLQGAFMYH 


5280 


19181 


A 


5319 


376 


3 


RIDFGGPKKKILIiPPPPAVKIVSLKGPP 
LFFFFLNSFFAPAGGQWGVFQLISSGDP 



867 



WO 01/64835 



PCTAJS01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

Ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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X— Unknown, *=Stop codon, /—possible 
nucleotide deletion, V=possible 
nucleotide insertion 














PPPAPESVRIPGVSFCAGVPQGFFFFFL 
NSSVAQAGVQWRDLGSLQPLPPRSKRVS 
CLSHPSSWDYSH 


5281 


19182 


A 


5320 


377 


287 


ELLTSSDLPAIiASQSAGITSMSHHTQPG 
SF 


5282 


19183 


A 


5321 


381 


65 


SSSPSSRTSGPFFFFSPPKKKKNFQKKK 
NFLYPPFFFPPPFYKNPPKKKLKGPKKK 
KKISPPPPKKNLSLKNPPPFFFFFFFFF 
FFFFF 


5283 


19184 


A 


5322 


132 


396 


EGGFVLKIIREGIGPHFLGLEEEGSKFH 
EHI FWEKHRESFPKQGPIPHFMELGTWG 
LSKNPYFRVKQRVEHIEGFKNFFNEKKD 
FLKE 


5284 


19185 


A 


5323 


366 


1 


GFLKKGVPFFPPGKRGGGNQTPPKPFVL 
GGKQPPPPPPLKKKNPKGGPPQQKKIFF 
FLTRGGVSLFGPGGGFPFFFQPPKKPPP 
GGFPFFFLKKPPPPPLFFFFFFFFETGF 
CSCRPGRTRG 


5285 


19186 


A 


5324 


131 


1 


DGALSPRUBWCSGTILAHCSLRLPGSSA 
SCVSASRVAGITGVC 


5286 


19187 


A 


5325 


112 


1 


DGILLLLPRLESNGAISAHCNtiHLPSTS 
ESTASAFLS 


5287 


19188 


A 


5326 


404 


18 


PVSPHGKNHEKHTVTCVGSTSGAFLHGE 
EHCHGNTHHP FE PSNPQTACQSQANR I R 
HGNS FKNLWRS WAWWRTP WSATQDAEM 
GRPLEPRGLS LAWATQQDPSLKQKQKQK 
KPTHLWSYILGHHQLP 


5288 


19189 


A 


5327 


109 


3 


KCVLT P VI RAL WEAETGGS RGQ DIET I L 
SDTVKPR 


5289 


19190 


A 


5328 


1 


117 


ETGFCCVAQSGLQIiLDSSNLLALVSQSA 
GVTGMSHRSRP 


5290 


19191 


A 


5329 


3 


120 


DAWVAAEAEGQNDT IEEPNKVQKRKRGI 
NDNVPAGQAH 


5291 


19192 


A 


5330 


105 


2 


RSGWAHI CNPSTLGGQGRWITRSGVQD 
QSVQDG 


5292 


19193 


A 


5331 


204 


411 


SKKKQKKKKKKKKKKKKKKKKKQDKKG 


5293 


19194 


A 


5332 


185 


2 


FFFETESHS ITQAGVQWRDLSSLQPLPP 
GFKSLPSSWDYRCLPPCPANFCIFSRDR 
VSLCW 


5294 


19195 


A 


5333 


561 


325 


WSTLHSFPIWLLNTGYYFKEIiTGFFFFL 
TVALSPRLECSDTI IAHYSLRLLGSRDP 
PTLVFQSTKITSVSHCTQPSVRF 


5295 


19196 


A 


5334 


3 


96 


EGKAAFSQEKSRRVKEENPE IAVSASTI 
PE 


5296 


19197 


A 


5335 


375 


228 


LSPWIiECSGTILAHCNLRPLGSSNSPGV 
FGFYFFLTLILGSGVHVQLC 


5297 


19198 


A 


5336 


403 


0 


NISVPASKHISPFPKKKKKKKKKKKKKK 
KKSQKKKSRG 


5298 


19199 


A 


5337 


397 


80 


IIITSKKKKKKKKKKKKXKKKK 


5299 


19200 


A 


5338 


406 


34 


KRTLKDSKKKRKEKKKK3CRKKKKKKKNR 
EKKKKPKXK3CKKKKRGGAQKKKNNHPAG 
GKAYFFFLGGTKKKRGGGDTKPLGEKNS 
PAQTKKLGGNNPPFVERGKKKKTPGI 


5300 


19201 


A 


5339 


410 


0 


KKQKKKKKKDAPGGGGGGRKKKGGPQKK 
KKGSHLKTPKKEKTGGA 


5301 


19202 


A 


5340 


306 


392 


IQYKILPQKENDWDKIQGQLAIPVSSLN 
F 
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Predict- 
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nucle- 
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last amino 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGlycine, ENHistidine, I=Isoleucine, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, ^Proline, 
Q=GIntamine, R=Arginine, S=Serine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


5302 


19203 


A 


5341 


3 


173 


TFTATELI I FYI FFETTLI PTLAI ITRG 
GNQPERI^AGTYFLFYTLVGEGSLLARS 
R 


5303 


19204 


A 


5342 


247 


426 


GRAWWFTPVTTPLWEAEAGVQWRDLSSL 
QPLLPGFKRFSCLRLPNNWDYRHVPPRP 
ANFL 




1920S 


A 


5343 


3 


149 


SWFLHFGEAGLELLTSGDPPALALPKCW 
DYRLVSNSWPEVILQPQPPK 


5305 


19206 


A 


5344 


464 


147 


LRRLRWENRLRPGGGGCSEPRSHRCTLA 
RVTESIaSRVQCQRDPRRIiAQTSRRPGAS 
S FPFAGPAFSAVNSTYQRAIiP PAQGAPW 
TPRLPLGPHGTTSS TSS YLPV 


5306 


19207 


A 


5345 


228 


1 


VIRLEETYFLLFFVLFLFLFFRQTRSGS 
FAQAGVQSSHLSLPS SWDHRYTPHLANF 
CIFLQKQSIAMLPRLVSNTK 


51(Y7 




A 




2 


106 


FVLELIjGSNNSPSSASESAGITGVSHCXj 
QPIiFNF 


5308 


19209 


A 


5347 


140 


436 


FLCDMRVLYVTFFFFGKGVSFFPPGWRA 
GGEFGLIENPPPGFMPFFLLNLPKKGEIi 
RAPPPAP IFFFFFFEKRGFTRLTQEGLN 
FWTWBPPPLTLPRGG 




1091 A 


A 


Jj'tO 


1 


465 


LESRRFEDGTRVPGSTISWDPPEEAVCP 
FSDLQIiRAGRTTTLFKAVRQGHSSIiQRI 
IiI*PFVWQCPAPRGGVYRGRQASLSCSGL 
HPVRASWLLCFPSEAWAMSGTPPPASLP 
PCNLISDCCASNEQGSVGIGPSEPGVEY 
NLLLRHLLKPEEKRS 


5310 


19211 


A 


5349 


183 


58 


ERDYEEGPGAVACACIPSHiGGRGGWIT 
RSGDRDHPDEIVDP 


5311 


19212 


A 


5350 


288 


404 


I EXNGMI IiAHCaSf FQLPGSSNS PASASPV 
AVITGMNHYAR 






A 




413 


305 


AEAGLKPLSSTYLRASASQSAGITGVSH 
HTQPLLFF 


5313 


19214 


A 


5352 


91 


356 


TVPKRNTFSPKMVAITTAGTVSTSVLAL 
GNRESALPTRLGWALCCFIFWGGRGHRV 
PLOIPGWSTVARSRLTATSTSQVQAILL 
PQPP 


5314 


19215 


A 


5353 


2 


410 


FVQWQFLKTGSDRSHKAVESLGYEKVEK 
AMTVNYSFRSLNTKGKLELLGKISVNLY 
AKWKHFREKGRRYTNTLGPTATAAFYST 
EKTEFLPWEQTCKPFIDSFNKYLLPGA 
ADHACNPSTIiGGRGGKITRSGDRD 


5315 


19216 


A 


5354 


3 


288 


SLDDDLLKLLLPLMLQYSDEFVQSAYIjS 
RRLAYFCARRLSLLLSDSPNLLAAHSPH 
MMIGPNNSSIGAPSPGPPGPGMSPVQLA 
FSDFLSCAQH 


5316 


19217 


A 


5355 


187 


3 


KKILWQTPGVNIiFSFKGAPLFFFFLRRV 
LLCCPGWSAVARSPLTATSASRVQAILL 
PQPPK 


5317 


19218 


A 


5356 


383 


261 


GYKINGQKSQAFLYTNNRQTESRLWYCY 
DS PQRLIHLS IK 


5318 


19219 


A 


5357 


367 


241 


FFFRHVAQAGLQLLGSNNPPASPSQSVR 
VTGVSHCTQPLCFV 


5319 


19220 


A 


5358 


217 


1 


SCDRENTGLKLLSILLLIHCETDTRISL 
LCISLSIKREIWLGAVARACNPSTLGGR 
GGRITRSEDRDPGETP 


5320 


19221 


A 


5359 


255 


412 


TQbCRKRNAFFFLATNHSQKGQSRWGTV 
AHACNPSTLGGRGGR I TRS GDGDH 
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Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine t ENHistidine, Hsoleucine, 
K?=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=?Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


5321 


19222 


A 


5360 


391 


271 


RSCHIAQAGIiKLLASSNPPASASQMAEI 
TGVSHHLTPICC 


5322 


19223 


A 


5361 


232 


397 


EGLKEPAGLFTMEYYTARKRNKIMPFW 
TWMQLEAIIDGESTQEQKTKFCMFSPV 


5323 


19224 


A 


5362 


213 


392 


LFIKVEGKFFLRSINILNESLLFIiSSNG 
PGLLKKLGVWPG WAHACN P S TLRGQGG 
RTA 


5324 


19225 


A 


5363 


306 


418 


SDGITGTLPLASQE SAWEDLLYVLVGV 
DGRYVSAQP 


5325 


19226 


A 


5364 


347 


48 


KKRWLKPGGGKNPPKKGGKKGGTPPFFQ 
KH3FFFPPFFFFLEKRDLSPQAGGKGEK 
KXPPQPWKAEGKKNPWGNPPQRGGYKGN 
PTRPPPQGFFFFFFL 


5326 


19227 


A 


5365 


412 


172 


PDSNLSFGAHYQASLLiGSFSLMPLLLYS 
KCSGQWD VARSAECTS KSSQNKERKKER 
KKERKKERKKERKKTRKQKKNPLT 


5327 


19228 


A 


5366 


400 


296 


LVEGGFHHVGRAGLELPTSSDPPTSASQ 
NLGLQA 


5328 


19229 


A 


5367 


162 


1 


APSSLDETPTLLKKTLFGPGLVAGICNP 
STLRGRGGRITRSGVRDQPDQHGETP 


5329 


19230 


A 


5368 


152 


403 


TTVLCKLVQPLVfKTVWRFLKELEVDLPF 
NPAVPLLGIYPTEKKSYEKGTCTHVFIA 
AQFQIARIWNQSKCPPINEQKKKKGAPV 


5330 


19231 


A 


5369 


170 


367 


I VFE ILVI S YFKS FPQKTTEWNTFFFHP 
FKVFNWPSTVAHACNPSTLGGRGRQITR 
SGDRDHPAR 


5331 


19232 


A 


5370 


396 


1 


QMCVRAHTFAGGRARTHTHTHTHTHTHS 
WATPRVSHNHCVCVCVCVCVC 


5332 


19233 


A 


5371 


139 


3 


PKLECGAS 1 1 AHCS LELLGS KD P P PS AF 
QVAETTGCAPPCLAFFF 


5333 


19234 


A 


5372 


1 


341 

• 


PLINILSHLILLCRNYFSFVKFCIRHTS 
IiLYWISLFHVLLCWS FLTDRRAHNAS 
VTYGFVL PVTFKNS PLPGAVAHACNPST 
LGGRGGQIMRSGVQDQPGQARSSRPALP 
T 


5334 


19235 


A 


5373 


280 


449 


ICHNLQLCPSKDTIEKVKRQPIEWEKIF 
ANHISHRGLSS 


5335 


19236 


A 


5374 


170 


1 


QKRGFSMGTPMVS IFRPQDLTPLGFPKA 
WDPRVNPWGPWPCPLFFKGNPQLFQAPG 


5336 


19237 


A 


5375 


462 


165 


PPTRFLHWQTGLELPTSGDLPASASQG 
AGITGVSHHAQPRGHVFDVLLEYSQQGV 
FTLLSFLGFSVFFEPLFC 


5337 


19238 


A 


5376 


3 


451 


PRAKLGTRRGLRNLDTIGVAVDLILLFR 
ELRVEQESLLTAFPCGLQFLRNIASRHE 
DSQSIGWVHAFPPPVWPCLSLPPPHTGS 
SSSITLPPLLPHRPHPPPHSHPPMATLS 
RPPYPPPPHPPPHPLPPSLSPPALSPPP 
PPPPPPPSL 


5338 


19239 


A 


5377 


460 


300 


KVLLCCPRLECRGAIMAHHSLELLGSSN 
PPTSASQSAGVTGVSHCRSPWEFL 


5339 


19240 


A 


5378 


452 


191 


DSRFHHWQADLEHPTSSDLHASASQSA 
VTTGVSHRTRPDIAFAPSISSYFSCVAE 
ISLLHLSESFPKQFL 


5340 


19241 


A 


5379 


240 


395 


LATSSVANLRNTFYLKLKSSVLGVEAHS 
CNPSTLGGQGVRITKSGVRDQPGQ 


5341 


19242 


A 


5380 


210 


411 


TSVNSAPNSGQKASVEECFGQVTKVKIQ 
NVFU3WAQWLTPVI PALWE AGAGGS RDQ 
EIDTILANTVR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K^Lysine, L=Leucine, MhMethionine, 
N=Asparagioe, P=Proline, 
Q=GIutamme, R=Areinine, S=Serme, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5342 


19243 


A 


5381 


406 


300 


FHYVGQPGLELLTLGS t clgi*s kcwd yr 
REPPHQA 


5343 


19244 


A 


5382 


364 


479 


INDGISKISLFLSHLKDQAGQVWWLTPV 
IPALWEAEAG 




19245 


A 


5383 


182 


395 


HHLRGSAASFLKSAQEPTDFRHTGFKRF 
SCLSLPSSWDYRCPPPRRGGGVLCLPGW 
RTWQSWLTTTSISQ 


5345 


19246 


A 


5384 


37 


478 


GGIHSYPVICLFILMGSSVSIiPFSSVAC 
LLAWLLPSFLPSLLPSFLPSFIiPSLTGR 
AFL 


5346 


19247 


A 


5385 


201 


458 


WNPPFLSPRKWAARADTEWGVGKTAQEP 
GLSSRLQAYEERQRHWQRERKALREDCA 
AYAQCAQRAQQLLQLQVFQLQQEKRQLQ 
DD 


JJt/ 




A 


5386 


206 


407 


FFSPGGRAGKKFGFMEPPPPGFKKTLWL 
NPPGKWKQRAPPPIPVYFLEFKKKGGWP 
NMVQVGLNPRP 


5348 


19249 


A 


5387 


468 


0 


NFFPPRPAPPFFFFFFFVAQAGVQWRVL 
GSLQAPPPGFTPFSCLSLPSSWDYRRPA 
RVSPHC 


5349 


19250 


A 


5388 


136 


2 


IiFFFFLFETGFCSVPRLBCSGTIIAHCS 
LKLPGSCNSPTSVSQV 


5350 


19251 


A 


5389 


397 


0 


ILYPSQKKKKKKKKKKKKKKKKKKSQ 


5351 


19252 


A 


5390 


299 


3 


AQPLAPFPGSFILMYFFMLTJiFGIPLLY 
MEVIMGQWLHVDNIRVWKQLVPWLCSMS 
YAHSLVNEGPSPRPSPISLCGIjQVSAHR 
VHPQVCASVSLYNS 


5352 


19253 


A 


5391 


264 


202 


FFFFFFFFFFFFFFFFSBNKI 


5353 


19254 


A 


5392 


2 


150 


PRVRFL I ETEFCHVGQAGLKLLTSVDPP 
TSASQSAGITGVCHLVRPEKS 


5354 


19255 


A 


5393 


382 


0 


NSRAQAWWLMPVISPHWBAEVGGSLEPT 
S 


5355 


19256 


A 


5394 


395 


239 


SLPSSWDYRHAPQCWANFIFLVETGFSS 
VILCS FRLGGFGLSLNILCSLKSL 


5356 


19257 


A 


5395 


24 


265 


LLPHRKSRNYRTRPTRSLRQKNGMNIiGG 
GACSESRSHHCTPVWATERDSVSKKKIiF 
TIELVYVFLKKNSKGEPLPKVLGL 


5357 


19258 


A 


5396 


174 


118 


FFFFFFFFFFFFFFFFRSS 


5358 


19259 


A 


5397 


406 


134 


PSAPSSPPPPPLIWGPPGFFPPPLFKNP 
PPKFFFGAPKKKIYLPPPRALNFVLLKG 
PPPFFFFFFFFFSFF 


5359 


19260 


A 


5398 


294 


64 


IFFFFFFFFFFFFFFFFWEloHILWGALF 
LLKSRIPPGKKKMTCKIFLCTPLTGYIP 
LLKCFQKQNKKSQSQNTELKS 


5360 


19261 


A 


5399 


1 


211 


LLLPTPYHHINQDSSNKYHETKKKKKKK 
BCKKKKKKKKKKKKKKKKKKKKRGEKKKK 
KKKKGGGGGYKKKR 


5361 


19262 


A 


5400 


305 


1 


SPPRPP FliliP FLFFLPSFFFKAPHTINH 
FFSPPPLFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFLKLYLFYKMHRINYTSNI 
LKINI FVFSSPGSGGRR 


5362 


19263 


A 


5401 


416 


3 


BNTERDRGRQRREDRSRAETQETNRAGQ 
QESTPPPQKKCNTYSIPSSSSSSPTPQK 
GKKGGPPGGPPPPAGEKGGPKKKKRGPQ 
KKKIRPPPPGKKKPPPKTPPLFFFFFFF 
FFFFFHKPQTFIVGVNTYILLRSVS 


5363 


19264 


A 


5402 


154 


416 


KKKKKKKKKKKKKKKKKKKK^^ 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Gtutamic Acid, ^Phenylalanine, 
OGlycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine,V«=Valine, j 
W=»Tryptophan, Y=Tyrosine, 
X=Unknown, *^=Stop codon, £=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKKKKKKDSGGGAF 


5364 


19265 


A 


5403 


327 


2 


PPPQKFFLGGVFFFPPFFYNPPPPFFFF 
SPKKKKYFSPPRGKIFFFFFPPPPPFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
LRWSLALVAQAGVWWLNHGSLQSL 


5365 


19266 


A 


5404 


372 


181 


YKGLAHTRCSIRNFLNFILIIEMGSRFV 
AQAGLEPLGSSDPPTSASQSDGITGVSR 
RTRPINIC 


5366 


19267 


A 


5405 


110 


I 


PVPVSSTEVKKLVLWPGTVVHACNPKTS 
GGQGGRIT 


5367 


19268 


A 


5406 


2 


401 


GFHLTTYGNRKQMQRHVREASARPLQGT 
PPPGHALEWTLENVLLAAPRPRKAQVLF 
AIVASETSSWDREKLWTLSIiEAKCKGIT 
LFVLALGPGVGTHELAELAELVSAPSEQ 
HLLRLQGVSKPEVNYAQGFTR 


5368 


19269 


A 


5407 


3 


181 


QLILFHDKLSPKLLSWGNVDILINFFLS 
HCSPERARRLQAKMMTNLVMAKDRLQIiL 
GIN 


5369 


19270 


A 


5408 


371 


222 


SVEMGFHHVAQPSLEPLGSSDPPASASQ 
SAEITGVCHCARPNHDSYRRS | 


5370 


19271 


A 


5409 


204 


416 


NQVGEWVXiRKGTIjGDLIMSMLFHFLVVR 
SKDEKNTKIWPGAVAHTCNPSTLGGRGG 
QITRSGVRDQPGRHG 


5371 


19272 


A 


5410 


236 


308 


AGRPGTVTHTCNPSTLGGQGGRIT 


5372 


19273 


A 


5411 


375 


216 


GCSEPRWCHCTTPAWVTVQDSPPKKBKK 
RKEKKKKLLTTENSKKALKRNGYL 


5373 


19274 


A 


5412 


1 


381 


RFHHVAQAGIiNIiLGSNDLSASASQSAGI 
TGMSPRTQP 


5374 


19275 


A 


5413 


144 


3 


RPPP IFFFLAKKLRALFFFFHS VIQAGV 
QWRNLGSLQPLPPGFKQF 


5375 


19276 


A 


5414 


412 


32 


LFSKKKKKKKKKKKKKKKKKKKK 


5376 


19277 


A 


5415 


414 


2 


NNNSFVFFSKPIFLKTVGGAPPPSPIYN 
PIPIFLYIYPPHKWGGVYIPTHMYGGGD 
IWGYMYTPPPPPLFIYFFPPPPFYIEKK 
KKTVFCF FAQGGVRLCLKKKKNKDRGWG 
DTMTMGHGGRLEMVANVCGLKRYW 


5377 


19278 


A 


5416 


1 


240 


GGEGC3EPRSCHCTPAWATERDS XSKKK 
KKKGGPFKRTHFSSPGLPSFFFFFGAPK 
FNSRARFLTPREGKNPGLPPFYPA 


5378 


19279 


A 


5417 


2 


63 


LCLKNNNNNNNNNNNNNNND 


5379 


19280 


A 


5418 


322 


1 


WPLCFSRCSKKRGSPLGFPFSPPLGQSK 
KFVRFACKTPSAFKPFPSLS FFQRWAGP 
PPFLQGLGELPKGLFFQQCFFFFFFLRW 
SLTLVAQAGVQWHDPGSLQPPPL 


5380 


19281 


A 


5419 


1 


392 


FRMAGSYPEGAPAILADKRQQFGSRFLS 
D PARVFHHNAWDNVEWSEEQAAAAERKV 
QENS IQRVCQEKQVDYEINAHKYWNDFY 
KIHENGFFKDRHWLFTEFPELAPSQNQN 
HLKDWFLENKSEAPSQKK 


5381 


19282 


A 


5420 


273 


91 


RWYIVLDNLIFILENICCETSLKIYLGP 
GWARACNLSTLGGQGGQITRSGVRDQQ 
GQHGE 


5382 


19283 


A 


5421 


1 


149 


GGGGCS E PRLRRCTPAWATERDS VSKKK 
NLKKFFCPGLGLYQKQNTWTL • 


5383 


19284 


A " 


5422 


381 


179 


S SCLGLP KS WDYRCEPLCPARTALNLKT 
I KKS KDNRRWRGCGEKEMLMHCWWECDM 
VWLCVPTQISC 
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M 
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Predicted 

beginning 

nucleotide 

location 

correspond 
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amino nuii 

residaeof 
peptide 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

IDJIIUC Ul 

peptide 

CP/1 11PI11*P 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 

0=fwlll famine D=Ara!niiiA .Q=C»rino 
\^ viiu uiiuilic, IX. rVigllllllC, O — OCilllc, 

T^Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

Y— ¥ T n k n nwn ^s^tnn rnHnn A=nrxcihic» 
W U IJ1VJI w IT JJ, —OlO(/ LUUIIIl, /-"JIU5SIUJC 

nucleotide deletion, V=possibIe 

mif*lpftfiHp incortinn 

II ULIvvUUC I Ho CI uvu 


5384 


19285 


A 


5423 


195 


369 


CSIiSPIKECKLHISGLQDSSISLVNLGL 
AIRNIRNKPGVVAHS CNPSTLGGOAGWI 
T 


5385 


19286 


A 


5424 


253 


414 


KKXKXKKKKKKKIUCK^^ 
KKKKKXXRGGAAKKKPHSPAPAERDL 


5386 


19287 


A 


5425 


160 


2 


NLKGSAPFSKKTSPFFLVKENPLFFFFF 


J JO/ 




A 






A7#C 


ASSS FFFFFFIjGKGGS TPP PK!GGGBGTTN 
FFLWKPGAPGGGGPPPPTPRGGGKKGPP 
P PGOKT FLGMF FEKKRFHHGGGGGFKLR 
GKKKPPPIAPKRVKNK3GNPGGGQPLKF 
KKTLPEKMGLKK 


DJ oo 




A 




0V 


1 
1 


KWPGTVAHACNPS TTjGGRGG 


JJ07 


1Q9Qfi 


A 




0 / 


947 


KKKNPOKKKKKKKKKKKKKKKKKNKKKA 
PAGAGDLSLYYPRAGRNNLFFNYLP 


5390 


19291 


A 


5429 


209 


405 


KbNLVNFSSLSSLKAMISAILLKLSVPI 
FPMKIVTLSTLNSGSWPGAVAHACDPSA 
LGGRGGRIT 


5391 


19292 


A 


5430 


124 


237 


VMNAFVCVCVCVCS C I CVCVCVCVYFLF 


5392 


19293 


A 


5431 


409 


3 


FFFFFSETESRSVTQAGVQWRDLGS 




10904 






970 


4/)9 


MKT I FHL IR ITG WAO WLTP VTPAL WEAE 
TGGSRGQEFETTVKPC 


5394 


19295 


A 


5433 


107 


277 


YNFNNHRYREAIQKWDEALQLTPNDATL 
YEMKSOVI IMKNFTFIjMKIjC YVCSVSEO 


5395 


19296 


A 


5434 


95 


1 


RNRKNNGQSGTVAHACNPSTIjGGRDGWI 
TRS 




10907 






A19 




KKERWDLAMLLRVGYSGHRQVHCTDHGT 
LQPQPF 






a 

•rl 




*>*7*> 
Z/O 


HOO 


DT.PnPNimCLPT.T.PT .T.VPT /3T7TiT?3i DT.T JWT. 

GHGPKRARLPIFFPQEAYFITFRLVFAE 
KT<BYRGPLTMCIiAPR 

XVCW3 X XvVJriJ X ITU ■ 1 irvrR 


5398 


19299 


A 


5437 


179 


2 


GISPLKGGTPSLKKJGCKAKDFNRRVSKE 
DIQFTNKHMBCRCITLL IIKQMQ IKITOR 
YLP 




I7JVA/ 


A 




j 


1 DO 


TPLYSOILT SLROEHYFKAEGRGCNETjT 
S YHCTPAWAKDGDSVS KNKQNT 


5400 


10101 


A 




47 


171 


slqlflqteerkdsddeksdrnrpwwrk 
rfvsampkgnfiky 


5401 


19302 


A 


5440 


3 


122 


TTLTLTSLIPPILTTLVNPNKKNPPKLI 
RILVLFQTTEK 


5402 


19303 


A 


5441 


390 


489 


KYKSYSLQPPPPKFKHFSCVTLPKHWDY 
RREPP 


5403 


19304 


A 


5442 


86 


275 


RQEFETSQQGETPCIxlxKIQKIxAGRAHLN 

PGGRG cs e prlcdctaawvtragls q I k 

KIKKKKK 


5404 


19305 


A 


5443 


424 


166 


CFFVSPKAKSQGWLGSPPPPPPGLKHF 
SAPTPRGLFLFFFFPRGVPPLGKGGSKS 
PPPGVFPPWPPQKGGDSGGYPPPRAKIF 
LF 


5405 


19306 


A 


5444 


356 


3 


AFFWFTOPFPPGFKGFFCPHPLSGISKG 
GPPPRGNFFFFGKAHFFS PFFKFLPPI F 
LPFLAPQKVGFPGFGPPPKPSQGVFLFF 
FFFFFSFLFFFFFETESSSVAQAGVQWR 
DPDAW 


5406 


19307 


A 


5445 


198 


86 


PWPLGAQAILLPQSSCLNLPKCWDYRRE 
PPHTALLNS • ! 
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M 
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od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 
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nucleotide 
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correspond 

ing to first 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=<;iutamic Acid, F=Pheny (alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 


5407 


19308 


A 


5446 


198 


86 


PWPLGAQAILLPQSSCLNLPKCWDYRRE 
PPHTALLNS 


54nx 


19309 


A 


5447 


143 


3 


TMIYFYFYFFLSQSLAIdjPSLEGNGTIS 
AHCNLRLPGSSDSSADAW 


5409 


19310 


A 


5448 


324 


1 


KKNSATS FFS VBP FSTGGVLANKGALTQ 
EKILPCFKIKKIRKRVGAVAPARNHFLF 
FSPFFKEGLQGIVYFFFFFFEIiESRSVT 
QAGVQWRNLCSPQPLPPGFKRTRG 


5410 


19311 


A 


5449 


403 


0 


LFFFFFFFFFFFFPKENH 


5411 


19312 


A 


5450 


114 


409 


LNIWTIiKYLISKHIiLIVCLPDTILSSLC 
LIKYI GLQGKVLI FFFKQFSAIDKLGQN 
IAWGKFGFAHYSLLTKKWKLFGNITQE 
QNMIVTGGLSWWND 






A 


5451 


192 


406 


RNFFLGHLKYVKRFEQIGWGRDFMFKTP 
KARATKNKMDKGDL I KLKS FCPAKETTF 
RGTGHPTKGEKI FAT 


5413 


19314 


A 


5452 


254 


48 


NAPGWLCVHYALQFIDKSVKNSVCWLGA 
VAHACNPSTIjGGOGGWITRSGVRHQPGQ 
YG ETLRTRASTR 


5414 


1G/U5 


A 


5453 


\ 


288 


PLLFILIPSPFSFPHPFCGPIKFILPRE 

FPRGGKKKI FFFWAQKKKIRGGVLKTGG 
RKKPGGKKNKSG 


5415 


19316 


A 


5454 


285 


415 


TSGCCFVFLVLKLS I FCKGKVLNAIEDN 
GLKNSTFTYFTSDHG 


5416 


19317 


A 


5455 


372 


125 


ITPAFSFSPMGKKGVFPPPKFFGFPPGF 
PPPPFFKPPPGIIFH3PKKKKFFSPPPR 
VYFFFFKRPPPFFFFFFFFFFFFFFF 


5417 


10/HX 

17JIO 


A 


5456 


3 


159 


AHASAHAYDTIKDALGLLLFILSLMTLT 
LFSPDLLGDQKKKKKKKKKRGGP 


541ft 


10A1Q 


A 


5457 


109 


392 


NIIIIKRSTSLFLRPIFPPHLFPFILYL 
FISLRTLLFFFFFKREFCFLPQVGGQGP 
FLNSLKPLPRGIiMQFSCLTLLRSGNYGP 
PPPPPINFFF 


541Q 


19120 


A 


5458 


243 


402 


FLKIGFIHKYLKEFVWGILNIVFNRFNS 
HERAPHAIVFVNGTTIEGHVEKCYW 


5420 


19321 


A 


5459 


394 


105 


IPPPSPRPGGPGAAPPPGGGGSTTPPPK 
KKPPFFPKNKNQPGGGGGAPPS PPPRGL 
GPKIPLTPGTKVFKETKPPPFPPPRGAP 
PKFLSKKKKKET 


5421 


19322 


A 


5460 


56 


379 


VCFSPSIYGAWLNKSPLRSHGSSSTSG 
WGGLPS PFPAQRLPLASQVQEHNGHVFA 
SYQVSIPQSCEQCLSYIWLMDKAIiLCSG 
EWLPHQAPKPSMSPTTPHCSRGLQ 


5422 


19323 


A 


5461 


184 


420 


FFFFFLNNNLFFFFFFFFFFFFFFFFLG 
RN 


5423 


19324 


A 


5462 


295 


384 


IiGAVAQARNPITLGGRGGWITRSGVQDQ 
PG 


5424 


19325 


A 


5463 


413 


257 


FLNPGGRGCSEPKSHKSTLAWVTERDLI 
SKKKKKE KCWI PQRMPERGQWS PD 


5425 


19326 


A 


5464 


2 


181 


SL YCSGWSQTPGLKQSSCLS FPKCWDYR 
CELPRPVYVLLYKTSFPKRYVFGIHLCF 
MSQ 


5426 


19327 


A 


5465 


I 


229 


TGSCYVAQANTiELLSSSDASTSVSQNAG 
TTGVSHHVQPYTS S FSQVPPLRAQSAAT 
ETTS I LWVQKLRLQWR CSV 


5427 


19328 


A 


5466 


1 


368 


KERETKKEREEERRMRRRREGRKEKGRK 
ERKKEREKERKKREEKKRNNLGQSAGIT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
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sequence 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
OCysteine, D=Aspartic Acid, 
E^Glutamic Acid, F=PhenyIalanine, 
O&ycine, H-Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valiue, 
W=Tryptophan, Y=Tyrostne, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, V=possibIe 
nucleotide insertion 














GMSHHTRPKSHNVLSKFTNLCWAAFKAI 
LGHMWGRIiDKLNVEQCFSTELSAIiPSII 
ATSHMRLWVI 


5428 


19329 


A 


5467 


183 


3 


VSLCYGEKFSPQMNESQIiYFRPGAVAHA 
CNPSTLGGRGRQITRSGDRNHPGQHGET 
GRV 


5429 


19330 


A 


5468 


1 


128 


GGAGYS ELRLCHCTRAWATRAKLHLKKK 
KKKGPLRALKKIGG 


5430 


19331 


A 


5469 


1 


155 


NKFPVLISNVINI IGMCHCTPAWATERD 
SVS KRKKKEKKRS KLARHSVLCL 


5431 


19332 


A 


5470 


113 


1 


FFFFFFQKTSSRLGTVAHACNPSTLGGR 
GGRITRSGD 


5432 


19333 


A 


5471 


407 


281 


NGMIPGGRACSELRSRHCTPAWATEQDS 
ISKKQNKSKNPGLK 


5433 


19334 


A 


5472 


1 


177 


GGGCSEPRSRHCPPAWATERDSYLHKMK - 
QSKTKGERMEVACYCI I ILFMLTDADEV 
LMA 


5434 


19335 


A 


5473 


76 


267 


KERREEKEKEKRKRSERKGRKKEGRRDR 
EKEKEERKDERREGEEGEEGRKEKKEKD 
IKRRGWP 


5435 


19336 


A 


5474 


133 


2 


LCRMVVLFCFEAGSHSVTQAGVQWCX5LN 
SLQPLPPGFKQFSCL 


5436 


19337 


A 


5475 


5 


394 


YGVGTKGFRAGVKEFQALFLLITVHSQL 
CHIEGTRHHSRHI I WVDFINQIENGTF 
FPLVLSNGCIWLLIVYLFWNI FTRQEGQ 
FFYGKLSHYS VFVFCFLRRS FALAQAG V 
RWCDLGSLQPLPPGFKRF 


5437 


19338 


A 


5476 


163 


2 


SEKPWVVALTLMTFSHFYFFIYFYYIFI 
FQMESCS ITQAGVQWRDLNSLQPIiP 


5438 


19339 


A 


5477 


111 


2 


FLFLKQDLPGAVAHACNPSTLGGRGRQI 
TRSGDRDH 


5439 


19340 


A 


5478 


232 


0 


HLVWRMESGSVAHAAVQWCDLSSLQPPP 
PRFKPFPSS 


5440 


19341 


A 


5479 


388 


225 


ALFSSKKKXKRKKERKRKKEKKKKKESM 
QRKGGHLPRVCCX3ASMVSPGHKLSL 


5441 


19342 


A 


5480 


150 


2 


HRVAVIYEVFFVCLFFYFLFLRQSLTLV 
TQPGVQWRNLS KLQ PPHPGFK 


5442 


19343 


A 


5481 


156 


395 


DSVCTCVLHTCHSSGLSOGRSCIJiCSLG 
LPNSNCPCVOTCTCVCVCVCVCVCVSFC 
PLHQGVOTCVCVCVCVCVFIiSAHFTKGC 
VCVCVCVCVCVCFSPPI 


5443 


19344 


A 


5482 


393 


115 


RACPDCRVRFPHTSPPCLPCGPEAEPGP 
GPALRELVQPLPGQIjQPPFGMPPSGIjGA 
APS PAPPACAWTRP PHLHPSS FS SSVPQ 
ISSIiFLCF 


5444 


19345 


A 


5483 


392 


2 


FFFFFSETQFRSCCPRLECNGAISVHCN 
LRLPGSNNSP 


5445 


19346 


A 


5484 


381 


282 


PVPWGQAIFLPPPPPYIiGIPGPPPPPGL 
FLFP 


5446 


19347 


A 


5485 


407 


3 


FFFFFFLRRSLTVAQAGVRWCDIiGSLQA 
PPPGFT 


5447 


19348 


A 


5486 


2 


417 


QENHLNPGGVGCSEPRSCHCTPAWVTEY 
DKKKKK 


5448 


19349 


A 


5487 


3 


413 


VYLHPSLQPSLSSLVPDQALSHDIWLKK 
KKKKKKKKKKKRGGISPPSPVGGVGPRE 
TWGWG PGANLFQRGGGVRE L FSQKKKG 
EKGQRNPEKKPPHKRGPGGPKEKFIFFE 
KKKKKGGKPFGKKTKPPPPRVNPGG 
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ding to 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylatanine, 
G=Glycine, H=Histidine > Msoleucine, 
K=Lysine, L= Leu cine, M=Meth ionine, 
N=Asparagine, P=ProIine, 
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T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *«=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


5449 


19350 


A 


5488 


229 


411 


IHGPGFKNGGRGKTI/jIjPNLMALKKIPF 
SPGGYKPKNP 


5450 


19351 


A 


5489 


190 


50 


YHHHHHCHHHHHHQCHHRHHHDHHyHGD 
NDYS SHHSRSTCHVAVTV 


5451 


19352 


A 


5490 


615 


402 


RWCLTLSPRLECSGMISAHYNLHLPGSK 
QLSCLSLPSSWDYRCPPPGKTTFSRGIR 
DALKPRKPKRINLI 


5452 


19353 


A 


5491 


338 


53 


KGPPFFFFCKKGGLFCSNWFLTPGGKWF 
SPPTPPRSGGKRVPPEGGGKGFSLFFFV 
EGGS CHI AQAGLKLI/3S SDL VTSASQSA 
GIMGKS PHAWP 


5453 


19354 


A 


5492 


266 


412 


NPNFFERKSCSVSQAGVHRRELNSLKAA 
PSGYTPASCVSLPSSWDYRRP 


5454 


19355 


A 


5493 


285 


412 


RDGVS ALS PGLE CGGM I IAHCNLELLGS 
SDPSALASQSTWNY 


5455 


19356 


A 


5494 


1 


1350 


MGFMTGICIBDNWGPSPESGSVLTVGG 
VRIQMLDRCHTAHCPVI PGLLRGQLWTE 
NLAERNSHDRRPFPTWCSHFAQDLWPE 
QSIKDSYQKVILRKFEKOGHGNLHFKKG 
CES VDECKLHKRGYNGLNQCLTTTQS KI 
FQCGKYVKVFHQFSNSKRHKRRHTEKKP 
LKYIEGDKAFNQSSTHTTHKKIDTGEKP 
YKCEECGKAFNRSSHLTTHKITHTREKP 
YKCEECGKVFKYFSSFTTHKKIHSGEKP 
YICEECGKAFMYPYTLTTHKI IHTGEQP 
YKCKECDKAFNHPATLSSHKKIHTGEKP 
YTCDKCGKAFISSSILSKHEKIHTGEKP 
YKCEECGKAFTRSSHLTMHKI IHTGEKP 
YKCEECGKAFTWSAGLHKHRRTHTGEKP 
YKCEECGKAYTTSSNLTEHKTTHTGBKP 
YKCKECGKAFNWSSDLNKHKRIHIGQKP 
RT 


5456 


19357 


A 


5495 


4 


146 


PPTRPTCYYIiS FYV5 YPTG YKEDTRMVN 
KHMKRCSTSLVYLLWINFY 


5457 


19358 


A 


5496 


5 


388 


CGCFFFS PPPKiCKVFPPPFFFFAPGFFP 
PPLFFTPPPQFFFWAPKKKKNS PPPPGK 
KIFFLKGPPPP 


5458 


19359 


A 


5497 


231 


163 


FFFFFFFFFFFFFFFFXXTNKWF 


5459 


19360 


A 


5498 


163 


2 


S VKFTLGKVSQFLKKHTPAGWAHTVI PA 
LWEAEAGGSRGQE I ETILTNTVKPH 


5460 


19361 


A 


5499 


229 


388 


RTHLQASSDLFCLYSLKTGSCCVAYAGV 
QRHDIiSSLQPPPPRFKRSSCLRPPE 


5461 


19362 


A 


5500 


405 


289 


HVAQAQLKLLAS SDP PTPASQSAGVTSV 
SHCTHPRCLFV 


5462 


19363 


A 


5501 


286 


375 


GDFVRFSFWPGGGAHSCNPSTLGGRGGQ 
VT 


5463 


19364 


A 


5502 


20 


374 


ILQFQNFWIVLFKNFYHFGKFLIYILNW 
FSGLFWYYFSELSCFSLSSKINILNLLS 
GILNVFFSISGVRKKIFLIKRYCWLGTV 
THACEPSTLGGQGRQITRSGDRDHPGQR 
WKIQN 


5464 


19365 


A 


5503 


1 


185 


FHHIGQAGLILLTSSNLPASASQKAGIT 
GMSHCTQPI IIFLYSLFLQLSFSWQDFV 
MRKKF 


5465 


19366 


A 


5504 


202 


402 


NSFSSTAKLREKYRQFLYTPIPTGTQPL 
PLLTFCTRPDTVAHACNHSTLGGQGGQI 
TRSGACNQPGQ 


5466 


19367 


A 


5505 


118 


7 


I KKENNWPGAVAHACNPS I LGGRDGRIT 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Le urine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaroine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














RSGDRDHPG 


5467 


19368 


A 


5506 


205 


422 


ANHFTS IRMAI INNNKQTBNNKCWQGQG 
EWGTLVHCWWECKMVWMLRQIVWSFLKK 
LNIQLSRDPATLLLAI 


5468 


19369 


A 


5507 


3 


425 


RSCHCTPAWATERDSVSQKKKKKKKKK 


5469 


19370 


A 


5508 


3 


86 


SKKLQVLLNCMTEIYYQFKKDKAERSK 


5470 


19371 


A 


5509 


395 


108 


GVLBCNDTITAHCSIiDLVGSSDPPTSAS 
QVTGTTGTCHHAQLGLPKCWDDRHEPRL 
PAWNKTLRRKNWHSIitiPVVPAIQSLEVG 
GSLRPRSLRLE 


5471 


19372 


A 


5510 


421 


59 


FFFFFFLRRSLAVS QAGVQWWDLGSLQA 
LQA 


5472 


19373 


A 


5511 


168 


3 


EICKMCFPVPFGVFFLWIVFVFVFVFVL 
RWRMESLSVSQAGVQWCNLSSLQPPP 


5473 


19374 


A 


5512 


446 


149 


FFFFFFFCFFBTESCSVARLESSGMISA 
HSSHNLCI 


5474 


19375 


A 


5513 


74 


398 


DRIIDAGPKGNYARFMNHCCQPNCETQK 
WSVNGDTRVGLFALSDIKAGKNHFRILQ 
LTSEFQGFCFLQTASSDYNFKHFYMYRS 
LELSSFTFLCNHHHHPSPEIiFSSS 


5475 


19376 


A 


5514 


73 


335 


EGGSLFCPGWSAGALFWLTAPSASRGWG 
GAPGVPWGBVGQKNCWNPGGGGCNELG 
LHHC I PP WGTGKTLS P KKKKKKGATDMG 
GAS 


5476 


19377 


A 


5515 


416 


0 


WIjGPGAVAHTCNPTTLGGRGRWITRSRD 


5477 


19378 


A 


5516 


242 


423 


AGSPIjIHQGGQAI iltqnptpgwgtmvf 
QPVLRPVKGtiTOANHVTSSPVASQPIIF 
.TTAG 


5478 


19379 


A 


5517 


2 


151 


GRVGFVGMGSHCVAQAGLEUjGSGGPPT 
SASQSAI IAGVSHRAWPYFPS 


5479 


19380 


A 


5518 


312 


405 


GQVRPGAVAHACSPNTLGGRGRITRSGV 
PDQ 


5480 


19381 


A 


5519 


3 


387 


ACSGPRSCHCTPAWATHQDSVS KKKRKK 
KREKPFLLEKKRGPTLTWGEPKKGIPLG 
ERNGPPSHFKKGPPLAPKPPGGNLHPEI 
AQGPGSPTDGNKKGDLNPTPVFYKFKTP 
EQAINWAAPPFPSEK 


5481 


19382 


A 


5520 


292 


140 


KKGPPRARPGGGGSPPIiFFFFFFFFFFF 
FFFFFWESTLKKRELHRSDLCV 


5482 


19383 


A 


5521 


1 


399 


KKWLFLSSAQIPTKLSRRRRNRETWPKE 
RS KINLHKP ILKKQRHELPDKE FKIAL I 
KMLYELKVTMHEQNEKINKETENTKKKK 
KKKENPPIiFGTGKNQSPPBKFQMGGQPP 
PPPPQJCKKPPPPNRFFVFSPP 


5483 


19384 


A 


5523 


316 


399 


MLLYWPGTVAYACNPSTLGGRGGR ITRL 


5484 


19385 


A 


5524 


400 


0 


FFFFFFFCFFFFFLFFFS 


5485 


19386 


A 


5525 


3 


429 


TTNIGRAMGATTS I VGSDTSQAERPGGT 
TWSPGASSTSQSSRPGTSVTPDSSAStf 
SETVTTKEFSGTTAISRTSHTGTPAASG 
GQATGSLTATTX^VAPGTTVAPGSSNTEA 
TTSVGEKKKDQKKKKASSP 


5486 


19387 


A 


5526 


135 


3 


GGKICTLIKQSVIQESWPGTVAHACNTS 
TLGGRGGW IMKSGDR 


5487 


19388 


A 


5527 


105 


5 


KKKKKKKS GRWPGAVADACNPNI LGGQG 
GQITI 


5488 


19389 


A 


5528 


411 


31 


GQRVPLSRGVAKKKKFLWNPGGGAGKNG 
GKNFLKKKPPGAPPPSGGPKKGPGPPPP 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














KKPI* I L.VS PPPRKGGAHNMHFRQKKSGN 
KGGEGSTKGGKEKIFLGKFVQKNFFFFF 
DPEFHSCCLGWSAMP 


5489 


19390 


A 


5529 


411 


142 


PRFSFLFSPPRKKEVFPPPKILGGPRFF 
PPPRFLKPPPHFFFWGPKKKKKFPPPPG 
EKFFFFKGAPPPFFFFFFFFFFFFFFF 


5490 


19391 


A 


5530 


174 


401 


QVQPQQKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKAS PGGGG KKKKK 
ERGGGGERKNI I WVPLLFPP 


5491 


19392 


A 


5531 


410 


0 


QPQS KKKKKKKKKKKKKKKKKKKKKKAS 
QKKKKKKKKKKKKKKSSGGGFFKEV 


5492 


19393 


A 


5532 


383 


145 


ggfffnppakkgsfpGphnlggppvfsp 
ppvfktpprniffgapkkkkffpppgak 
kyffkkapppfffffffffffff 


5493 


19394 


A 


5533 


418 


328 


KTGFRHVGQAGLKLIiASSDIiPAIiASQSA 
GL 


5494 


19395 


A 


5534 


169 


3 


IILIIHPCSHVPRRETLGLKACTFKNLF 
WPGMVAHACNPSTSGGRERR I TRSGDR 


5495 


19396 


A 


5535 


1 


254 


PTRPCKPPQTSPNSGKLSPSSNGCMNTL 
HISSTNTVGEVIEALLKKFLVTESPAKF 
ALYKRCHREDQVYACKLSDREHPLYLHD 


5496 


19397 


A 


5536 


43 


404 


QGGLTVLPRLVLNSWPDKRSSRLGLPEC 
WDYRYEPPHQAN 


5497 


19398 


A 


5538 


2 


212 


KENHLNPGGGGCSELRSCRCTPAWATER 
NSVSKKKKKNFRGGGLKGKGPGFYPLEK 
KFPGVGFFPKTNLV 


5498 


19399 


A 


5539 


2 


387 


I FQEHKNCGEMSE I EAKVKYVKLARSIiR 
TYGVSFFLVKEKMKGKNKLVPRLLGITK 
DS VMRVDEKTKEVLQE W PLTTVKRWAAS 
PKSFTLDFGEYQESYYSVQTTBGEQISQ 
LIAGYIDIILKKKKKKD j 


5499 


19400 


A' 


5540 


317 


407 


GLSHHVFFEEGKLKKPRIKDKDKKVPEP 
DN 


5500 


19401 


A 


5541 


120 


3 


NFFFFFLRRSFALSPRLECSGTISAHCN 
LHLPGSSDSP 


5501 


19402 


A 


5542 


430 


0 


QKAQSNGPEKQEKRGVIQNFKRTLSKKK 


5502 


19403 


A 


5543 


172 


2 


FIFNFLHIFSFWFITKFGCFIVLFIiRQS 
LALLPRPGVQWRDLGSLQPPPPRVRPRV 


5503 


19404 


A 


5544 


149 


2 


EKDIQELNSALHQADLIDIYRTIiHRKST 
F FS APHRTYS KIDH I VGS KAL 


5504 


19405 


A 


5545 


84 


276 


TLKKRPKEIRNQGGSPQRKRAQLGKLQQ 
KYPVLNFKAPFFGERGDYYKS FFKTCLD 
NLPRRGKG 


5505 


19406 


A 


5546 


2 


914 


RPAWHEEQKSKNEPEOTEDKEGCQMDKE 
PS AVKKKPKPTNPVE I KEELKSTSPASE 
KADPGAVKDKASPEPEKDFSEKAKPS PH 
PIKDKLKGKDETDSPTVHLGLDSDSESE 
LVIDLGEDHSGREGRKNKKEPKEPSPKQ 
D WGKTP PSTTVGSHS P PETP VLTRS s A 
QTSAAGATATTSTSSTVTVTAPAPAATG 
SPVKKQRPLLPKETAPAVQRWWNSSSK 
FQTSSQKWYMQDSSSSSSSSSPQNQQQQ 
PQS SQGTRYHTRQAVKAVQQKE ITQS PS 
TSTITLVTSTQSSPLVTSSGSMST 


5506 


19407 


A 


5547 


2 


244 


TRLIFVFLVETGFRRVGQAGLKLLTSGY 
VGTSASQRMTAHLCIiPRCWDYRCEQLHP 
AQRFFKRMLSVAVDAALIjHTSGGM 


5507 _J 


19408 


A 


5548 


406 


1 


FFFFFSETESCS ISQARVQWCNLSSLKP 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5508 


19409 


A 


5549 


417 


43 


IRPARKGCSSGLFGAPIAGGGAPPPPPP 
RQTSPPPPHTKGGGGGPHKPPPKKKRPP 
PKKKGGGKKNPPKKKKKNTPPPPKSAPP 
PAAKTDPPPPLTVPPFIiPPYIFTKKIPF 
LKSYFFFFY 


5509 


19410 


A 


5550 


3 


136 


QPRFPTLTNINEWNPTLIiNPIKRIiAAGS 
LFAGFLITNNISYHSP 


5510 


19411 


A 


5551 


75 


194 


NKEIjSRNKKKKKKKKKKKKKKKKKRGGG 
LKKKNGGGAKT 


5511 
j j i & 


19412 


A 


5552 


343 


493 


1 1 FS FFFFF FLKGS FVFAPQAGGHGGNF 
GPLKPPPDGLKEFSCLTLLSSW 


5512 


19413 


A 


5553 


148 


3 


DAVSLFCPGSLEHSDTI IAHYS PDLLGS 
SSLSASTSRVAGITGICHHA 


5513 


19414 


A 


5554 


407 


252 


KKKFLKKLAGQGGMRLQSLLIiRRIiRQED 
HSSSGVQSCSELWGGHLIYSKNR 


5514 


19415 


A 


5555 


3 


231 


HEFVFLVETRFHHVGRAGLELLISSYPL 
ASAFREAKARREDLLSPGVKDKPGQYRD 
TLTLKKKKKSFFIYLGECW 


5515 


19416 


A 


5556 


3 


131 


SWVS WEESKDAKELVGYYI EAS VAGSG 
KWEPCNNNPVKGSR 


5516 


19417 


A 


5557 


3 


131 


SVWSWEESKDAKELVGYYIEAS VAGSG 
KWEPCNNNPVKGSR 


5517 


19418 


A 


5558 


462 


255 


RSHHSAQAGLELQASSDPPALTSQSAGV 
TNVSLRAQPIIACSSVEVSSRCQVCLHV 
RRGKCPRAESYR 


5518 


19419 


A 


5559 


445 


29 


KSSNILILQKVENGDLSNKILKITDFGL 
AREWHRTTKMSAAGTYAWMAPEVIRASM 
FS KGS DVWS YGVLLWELLTGEVP FRG I D 
GLAVAYGVAMNKLALPIPSTCPEPFAKL 
MEDCWNPDPHSRPSFTNILASCQAWR 


5519 


19420 


A 


5560 


225 


475 


GAVESSCTEEHMWI PEKAVEGSLTPTPE 
SRSVAHAGVQWHNVSSIjQPPPPGFRRFS 
CLSLPSSWDYTSAGEGCAASYRSKVLK 


5520 


19421 


A 


5561 


248 


142 


NKNIWVQKKCVFYIYLPFICVCVCVCVC 
VCVCVCV 


5521 


19422 


A 


5562 


262 


456 


NFPLKTTY FLKKKKKKXKKKKKKKKKKK 














KKKKKASSPPPIiIKRGPKN 


5522 


19423 


A 


5563 


314 


444 


LKSMFCFVLFCFFE IESRSVAQAGVQWC 
HLSPLQPPPPGFKRS 


5523 


19424 


A 


5564 


220 


486 


LNSSHPESPLPLFGLVLQFFYPSNPKS F 
NKKNLEIRMLLTVDRGPNHPDSLFFVHS 
NVDTVFLPQEGPSFLQPLGQGITEAFEH 
NQAYV 


5524 


19425 


A 


5565 


176 


492 


CTKTAVAS KAR YLQALLYALAS VT I AAT 
ATRIiAS I SGAS P I FRS FP YLQPTAPGLV 
LPS I L I LVDVGLRKEKGPDTVAHACGPG 
TlJGGRGGRITRSGVRDRPDRH 


5595 




A 


5567 


24 


470 


PMDXKXPXXXETPLNGGAGENQDDMFAK 
LKEKLFNEINK.IPLPPWALIAIAGVAGL 
LLLTCCFCICKKCCCKKKKNKKEKGKGM 
KNRMNMKDMKGGQLPQDDDDAKTGLTEG 
EGEGEEEKEPENLGKLQFSLDYDFQANQ 
LTVGVLQAA 


5526 


19427 


A 


5568 


45 


485 


GXDGRLLRGHNQYAYDGLDYIALNEDLS 
OraAADTVAQITQRKWEAAREAEQLTAY 
LEX3I»CVEWLRTHLENGKETIiQRADPPKT 
HVTHHPVS DHEATLRCWALGFYPAEITL 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
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TWQRDGEDQTHDTE OVETR PAGDRT FQ K 
WAAVEVP 


5527 


19428 


A 


5569 


221 


422 


FFFLTGRVRQSLADIiEQKIBVHSMPPFE 
SLALSPTLYCRGAILGATSASQVAGITN 
ICHHAQPSYLF 


5528 


19429 


A 


5570 


401 


477 


IFSRDiCDVAMLPRrVIiNSWAQVILL 


5529 


19430 


A 


5571 


102 


36 


EVENNDD I SHHHHHHHHHHST 


5530 


19431 


A 


5572 


106 


407 


AGRQGVVLEGCRPAAWPTLLTPPATSSR 
GASRPPTATCPTPDPAPATTTWQPWPPR 
PLWAGVQWHDLGSSQPPPPGIKQFSCLS 
CPS S WDYRGRRAVCHH 


5531 


19432 


A 


5574 


253 


363 


YHSNQCVGLRKWPGAWHACNPSTLGGR 
GGWIMRSGA 


5532 


19433 


A 


5575 


1 


132 


MECHYVAQAGLELPGSSDLPASASQSTG 
ITGVSHCAWPGFFFFF 


5533 


19434 


A 


5576 


18 


378 


TPGGGGCI EPRWHHCTPAWATRAKIjS YK 
KKKKKKK 


5534 


19435 


A 


5577 


21 


232 


ILRCTTI ETLLKVQDNHLNPGGGSCSEP 
RSHHCTPTWARERDSVSKKKKKGGAALW 
DPGGGQPSAGSQTR 


5535 


19436 


A 


5578 


377 


2 


LS L I QT I TLCLGAI TTLFAAVCAVTQND 
IKKIEAFSTSSQIGLIIVTIGINQPHLA 
FLHICTHAFFKAI IiFMCSGS I IHNLNNE 
QDIRKIGGLLKTIPLTSTSLTIGSLALA 
GI PFLTGFYRTRG 


5536 


19437 


A 


5579 


2 


227 


IESWLEPVRFLRSMFANNLVYDTSDSDD 
YHLLKDLEEGIQTLMGRLEDGSRRTGQI 
LKQTYS KFDTNSHNHVDAA 


5537 


19438 


A 


5580 


113 


3 


NFPFFFFIiRQSLAVAQAGVQWCDLGSLQ 
PPPPWFKQ 


5538 


19439 


A 


5581 


184 


3 


IMPPHSRLS PTARFCLQKKTNNNNTP IE 
MASIQCPGWSQTPGLKPPSCFVLPKCWD 
YRCE 


5539 


19440 


A 


5582 


381 


3 


FFFFIiGDRAPPGGEKNPDGGGLKTPGKP 
PGGWGVS PPPQKKKMPPPNFFPPGKKKS 
FGGGGPIFFPGKNFPPPTQKKKRGGGGF 
FFFFKKPGGFFFFFFPFFFFFSSFFLGK 
SFIKISFKKRRRG 


5540 


19441 


A 


5583 


314 


389 


FSCLSLLSSWDYRCVLPHLGNFIiYF 


5541 


19442 


A 


5584 


131 


1 


FPLWTSEQGVGRNKQTYVTWQADCKENA 
GGDYYWTFFPQPTFV 


5542 


19443 


A 


5585 


484 


3 


MTE I LLKQAMVG I VGS IGSAIGGAVGGG 
ASASGGTAIQAAAAKFHFATGGFTGTGG 
KYEPAGIVHRGEFVFTKEATSRIGVGNL 
YRIiMRGYATGGYVGTPGSMADSRSQASG 
TFEQNNHWTNNDGTNGQIGPAALKAVY 
DMARKGARDSNS AGDQ I FHH 


5543 


19444 


A 


5586 


136 


48 


NTMQPGWTHTCNPSTJjGGQGGRrTGSG 
R 


5544 


19445 


A 


5587 


306 


479 


DS FFPGVGS AFLYQVAAWTTKYFKKR V 
ALSTDIARSEMRLTFIiLAPFTKFLIDVC 
DW 


5545 


19446 


A 


5588 


201 


2 


ILETGFHHVAQAGLNLLASSNLPASTSQ 
SAGITGVNHHAWPS FLIQGTKHS FKNSK 
PVFHIKLSKN 


5546 


19447 


A 


5589 


30 


477 


TPSLARGVYPCWLYTGFLHDFLFSPKIE 
ERFMDVGVLRATEPSNSIiFAI I YSTPIE 
LAIKSWHI IGDGLS IELVAVERGNRTTI 
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ISRLPEFEKSS LSDQSLVTLTSGYFAVY 
RVKLTAKKLEGIHEGAFQITTDYEILTI 
PVKARNALC 


5547 


19448 


A 


5590 


208 


91 


KGHFWVTGKKKVPCPPPRFFFFFFLDGI 
SLCYPGWSAW 


5548 


19449 


A 


5591 


325 


467 


ADLCSPCDILQLDFRHIRKTVDTLLALG 
EKAPPPTSALRSRDLISFCF 


5549 


19450 


A 


5592 


332 


403 


I^LCVXHTHTHTHTHXHTKiil PI Y 


5550 


19451 


A 


5593 


292 


26 


FLLDMEGAEAGAELGLYHCLLCAWETPS 
RLAVLQHLRTPAHRDAQAQRRIiQLLQNG 
PTTEEGLAALQS I LS FSHGQLRTPGEEE 
GTGE 


5551 


19452 


A 


5594 


387 


51 


LKKIFLGRGQGVSLFFSPPLGGPGGRFL 
GLE I GALFGLPGKTPPFPKI KKNYPGLW 
GGALFPLPLEGLGGKMAFPRKGGAPFSQ 
KNPPPPPLGGKTKPPPPLKKKKKKKVL 


5552 


19453 


A 


5595 


88 


362 


AETS PGQGS YQDGLRRPGFAEGP PAGRN 
PRRKKTQTDRRGGSAGSQPQACADGVRL 
CTRLCK^SGAISAHCNLGLPGSSDCPASA 
SRVAGIT 


5553 


19454 


A 


5596 


92 


366 


DQPWPGQLPGRSPEAGIRGGSTSRKKPQ 
BEENSDRS PGRQRGI PASGVRGRCAALY 
QAAVQWCHLGSLQPLPPGFKRLSCLSIiP 
SSWDYSH 


5554 


19455 


A 


5597 


3 


138 


TRPRTRGVGQAGLELLTSSDPPTLASQS 
AGITGVSHRARPQSSS 


5555 


19456 


A 


5598 


150 


3 


SCVIYSWMKFFSFUWSFTLVTQAGVQW 
CDLSSLQPPPPRFKLFSCPG 


5556 


19457 


A 


5599 


148 


3 


TQHIiNLAKKSQITPGMVAHSVAHAGRQW 
CDLGSPKPLPPGLNRASCLP 


5557 


19458 


A 


5600 


195 


1 


MVYFKKFCGFFL FVRSRIACSVAQVGMQ 
WCNHSSLQPRTPGLKQSSHFS FPSRWNN 
RHMQPCPGA 


5558 


19459 


A 


5601 


228 


415 


VNVS I PKDVFLL III ILFFFGP APQAGG 
RGCNIiNSLLPWPPGLKQFS CLTLPRSWN 
YGLPPP 


5559 


19460 


A 


5602 


405 


2 


AATGVDRHAQRTQAGLPLKTPQTPCPNF 
TSKARGTPSAQHPSCTRQRHQASNPTQR 
RRRRPQAEHS IRTDAHG PGLPASGHMAG 
ERPTRKAQSRDVSVLRQRLALLPRIiECT 
SAISAHCSQNLTGSGNPPVSCL 


5560 


19461 


A 


5603 


3 


236 


RQENGMNPGGGACSEPRSRHCTPAWATE 
RNSVSKKKGI KIKGHCSSAMHHQYIiSPF 
GWRQVCSHCVCPKPWLLVRGH 


5561 


19462 


A 


5604 


184 


2 


RPFLRIQGKIRMFSGSRFIPNFLILETW 
SHSVPQARVQWCHRSSLQPKPLVFKSSS 
YIiLP 


5562 


19463 


A 


5605 


261 


2 


GWFLKGPPLFLGPKKPGGIiGFFQKKLVL 
QKKNPPFGIGPPKNLKKTPLFFFFFLRR 
SLALSPRLECSGTI SAHCNLRI PRTSNS 
PA 


5563 


19464 


A 


5606 


134 


2 


GS PFKNLVFFLGQS LTLSPRLECSGTIS 
AHCKLRLPGPHQSPAS 


5564 


19465 


A 


5607 


3 


269 


RQESHSNPGGGGCGEPRSRHCTPAWLAT 
ERDSVSKKKKRKRNFLNFIGKKBCKPCLP 
KPKLTKKKKWGGITALVLKLYKRAIAPK 
TGGY 


5565 


19466 


A 


5608 


36 


380 


PPFXXPPGGETFGGTPRGPPPKKKKSPR 
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KRGGGGKKKPPRGAPPPKHLLGGKNRGG 
GTPPPPRKTPPLNPKKKRGGGKRGVFFP 
PKGGI FQKRGFFFSRGGGGGAPPPHP P P 
GGG 


5566 


19467 


A 


5609 


3 


411 


SSRSRAAALFYFFFFSRRGQKEKEPFLG 
GKTPPVFVFFPGGVGKKNRGFPPPPPAG 
KGPFSGGGGRBQTPIWSPPPPPRGKWGK 
PPKP VTPNFNFLGRVGFLGKNPS PGGVC 
PPPKTPI FFFGRPPPGGFFWGGGG 


5567 


19468 


A 


5610 


364 


483 


ISWVLFFIaFLKWS FVLVAQAGBQWRGLG 
SLQPLPPGLKGF 


5568 


19469 


A 


5611 


399 


285 


GGCSEPKSRPCTPAWGTERDFVSKKKNK 
NNYLVKKSN 


5569 


19470 


A 


5612 


408 


3 


KEVFSFPPGLEHGGVFYSLPPLPPRVKG 
FSPPPPPGGGGPRGPPPPPGYFLLFPKK 
GGFPPLPGGFSSPQIWPPRPPQKVGVPG 
GAPRPGPNFFFFFFRESFPFIiARGFPKK 
KKKTCWNVTISHSASVQSSVLL 


5570 


19471 


A 


5613 


1 


112 


IjAQhvffiiIjNts iaspadssellmevhg - 

NGKRPSPER 


5571 


19472 


A 


5614 


2 


389 


FVREGERKKEKRKKEKKRKEGTKEKGKE 
EGRKERKKEKVRKKTVDDRRRPQVTFQA 
VNLESEGAPGNCGVTAPPGGHIWLKQVE 
VQTHLENRCHRLKATVTTTTRPGAEPRS 
PPFIITCGEIMNVFSDM 


5572 


19473 


A 


5615 


385 


3 


KPPPGDPGKTPFFLKSQKITGGGGGAPL 
FPPPQKVKAKKRGYPRRQSFKRARLGPC 
PPSRGGKKKAGFKTNPPTKKKNQTKKPM 
KAKTS KLIiQENRSGPGTVAHACNPS TLG 
GQGGWIMRSGDRDTK 


5573 


19474 


A 


5616 


218 


402 


RS YS FSQI»VGTVT KARKPESSCWLCAPN 
RKNAFWPGAVAHTCNPSTLGGRGGRIMR 
SGDGD 


5574 < 


19475 


A 


5617 


2 


235 


KSFGCASRLQMHGRTHTGEKPYKCKQCG 
KAFGCPSNIiRRHGRTHTGBKPYKCNQCG 
KVFRCSSQLQVHGRAHCIDTP 


5575 


19476 


A 


5618 


347 


421 


IYLXXKKKKKKKKKKKKKKQKKKKK 


5576 


19477 


A 


5619 


92 


1 


ENLALLPRLECNGT I LAHCNLRLPGS S D 
SP 


5577 


19478 


A 


5620 


3 


402 


APPPSFFFKKGLIMAISPTLGAIKLINF 
LFFFKKGKKXKKPPLEKKKKKKKKKGGA 
PLKKKKFSPPGGGRKIFFKGAPKKISRA 
RVKKQRRGEKTRGNPKKKRGKKKRPFPN 
TVSR 


5578 


19479 


A 


5621 


420 


3 


HPPPQGSANFFPPPRKIGINGPPPPAGI 
IFVFLECRGFPQVGQGGFKIRPPNKPPP 
PPPKNLGLQKKSPPPPPIFFFFFIiWRWG 
LMTIiPRLVSNSWLKQSSHLGLLKHWDYM 
SEAPHLAKN^LKEYEWSLSFHVEGS 


5579 


19480 


A 


5622 


246 


132 


EVAPHLLGCISPTMCVCI^CVCVCVCV 
CVCVCLCSW 


5580 


19481 


A 


5623 


219 


424 


ILNVMVLGIGTFGRLLGLEGRALTNGI K 
ALIRRDMGPGAVAHTCNPNI LGGREGW I 
MRSGDGDHPGQH 


5581 


19482 


A 


5624 


32 


454 


GLEKVHSMTXDRNSRLSVQLRDWDGGGE 
LLQFSVHLDGEDTAYSLQLTAPVAGQLG 
ATTVPPSGLSVPFSTWDQDHDLRRDKNC 
AKSLSGGWWFGTCSHSNLNGQYFRS IPQ 
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E=Glu tannic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleurine, 
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QRQKLKKGIPWKTWRGRYYPLQATTMIiI 
Q 


5582 


19483 


A 


5625 


166 


2 


LFGLVKLTALKGLKMILPSQVWWLTPVI 
PTLWEAKAGGS QGQEFETS LANTVKP 


5583 


19484 


A 


5626 


120 


406 


GEAATQENLAELRPEPEIiIiSPSTVLSRE 
PELPSPSTVLSREPELPSPSTVLSRKPD 
U^PSTVLSRKPDLLSPSTVLSRKPDLL 
SPSTVLSRKPDLLSPSTVLSRKPDLLSP 
STVLSRKPD 


5584 


19485 


A 


5627 


324 


16 


LSPPPPLFKTAPRPQKKNPPPKKItKPAP 
PKKKNIiFFFFFFFFIjVFFFIWCSWRLAE 
WNETIiGECDMRWGOAVGTGTSRELRPGG 
SRSQPRRLAGKELGPHCAP 


5585 


19486 


A 


5628 


378 


72 


GRGGERPPPPQKSGAKKKPGVLNFFPQR 
GKRSGPPGPPGFFKIFSQEAPGFFSSFF 
WGRSLALFPSWGARGGFLVPANLGPPAG 
APKGNPPSKKKKEKIED 


5586 


19487 


A 


5629 


3 


204 


QESCLNMGGRGCSEPRSCHCTPAWATEK 
DSVSKK3CKKWKFGGFYEVIGNFLHLFPK 
WSYPPSFRIF 


5587 


19488 


A 


5630 


177 


1 


SFHEMRFWKLI FI IRIYVHFERRSCPVP 
QSRVQWHDLGSU2PLPLGFKRFSCLGLL 
SNW 


5588 


19489 


A 


5631 


154 


3 


KWFQMGS RS VTQ AGARQCDHS S LRP PT P 
QLKRSSCLSFPGGWDYRHVPPC 


5589 


19490 


A 


5632 


239 


17 


EGVRNYLILQPRSLCRLNCS FFPNREKE 
IiCHHSSFSSPLAAAKESQGEKGRLLSQD 
EGLLLWKVFVEDVEINS 


5590 


19491 


A 


5633 


2 


110 


GCSEPRSHHCIPAWVTEGDSVSKKQPPP 
QKKKCIAG 


5591 


19492 


A 


5634 


433 


0 


FFFFFFFFFFLKKIPPPEGGGF 


5592 


19493 


A 


5635 


306 


433 


FSIS IERSSSGRAQLLTPVMPALWEAEA 
GRSRSQEIKTILAN 


5593 


19494 


A 


5636 


2 


285 


QSRTSVI FNSEKAERGEETTEEKFEASR 
GWLVKFKERSCIiHNI KVQS EAISADVEA 
AASDPEDLAKIIDEDGYTKQHIFGQALC 
LMSNPTTLGG 


5594 


19495 


A 


5637 


160 


1 


LYCGKIYWKFIRLNFFLSQGLTLSPKL 
ECTGTITAHCSLKLISPSNPPARAA 


5595 


19496 


A 


5638 


3 


261 


KIQIiNPGSQGCSGLSIjCHCS PAWATBTL 
SQKKKKKNPPFGFKI KGEIEPLREFGTK 
KKKGGFLKKFPPHPLGFFLEPFTWGGFW 
N 


5596 


19497 


A 


5639 


2 


121 


SELRSCHCTPAWRQSKTTSHKKKKNIKK 
KNKFFLPFREN 


5597 


19498 


A 


5640 


370 


180 


VEMEFCHVDQSGIiELLISGDTPTSASHN 
AGITGVSHRTWQPSFFSFFKERRNLKSV 
TSLNVSP 


5598 


19499 


A 


5641 


254 


1 


FYIIiRKVFVFLF PAVS YGKGTYFAVDAS- 
YS AKDTYS KPDSNGRKHMYVVR VLTGVF 
TKGRAGLVTPPPKNPHNPTDLFDSVTNN 


5599 


19500 


A 


5642 


292 


106 


NFVILFYLFKQGSHYVAQAGLEPLSSSD 
PLATASQRAGITSVSHRAQTKQAETYW 
PFSLFT 


5600 


19501 


A 


5643 


3 


100 


DTFFEVEVEMEKEVCRDLMCTSPKDEEG 
FIiLI 


5601 


19502 


A 


5644 


1 


282 


EKVPBAKRLYGKRGDPFYEAQENHNLIG 
V7UWFLECLFCDVKLQYAVPI ISQQGEV 
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STGQ P WRTMVVQVFCFGLFWFS FYLSYV 
LEVDQETKAN 


5602 


19503 


A 


5645 


142 


67 


NKNRPGTVAHACYPNTI/3GQGGQIT 


5603 


19504 


A 


5646 


350 


213 


GLELLTSSDPPASASQSAGITSMSHRPW 
PPLASRLDLCCGTLYLI 


5604 


19505 


A 


5647 


3 


180 


EHNAERAFLAQAHLGPGAPCTPGFPSFI 
PTQFPPFRSSGIiPNIPVQTISRAAABKL 
FG 


5605 


19506 


A 


5648 


366 


1 


RSWFTTYKERSLLHIIKVQREAASADIE 
TAAS YSEDLAKITHESGYPKQQI FNVDE 
TAFYWKMI PLSTC I AREEKS I SGFKAS K 
DRLTLWLGANAIGDIKLKPMIVYHFKTP 
RALKNYASSL 


5606 


19507 


A 


5649 


1 


355 


QIQSSQSGKGUjLTSFIDEETKXiFFFGN 
KVWFCI PPGPGGKLCFKKKKI FGLYPGG 
GGFWGPQTPPLAPTFPPGFPPPPGPNFS 
PRGP FPKMGPAIiRGPGKKKKPPEKTKPA 
KGGTGP 


5607 


19508 


A 


5650 


348 


124 


RAPPVFFWVFLGQFPKKKSTNSPRGGGP 
PRSFQPFSLPPFFRNVFKKKPPPFFFEG 
GRGPPAPPFKNP FFFFSRIi 


5608 


19509 


A 


5651 


194 


43 


KKKFFFFFYEMESHPVTQAAVQWHHLGS 
LQPPTPTFERFSCIjTLSIiIAIP 


5609 


19510 


A 


5652 


27 


244 


KKQQSTLFLFYFLSFILFFFLVKTGSHY 
AAQAGLELRASSNPNTSVSLPKSWDYGH 
EHTVPSPTWFLAAKTP 


5610 


19511 


A 


5653 


38 


429 


GLFLSCLFFFFLEKGVPLTPPGATGENP 
APKKKGTGQKLLKNRDFFLKKAMFGKQK 
QKKFFFAPTI TGDQRAPRVFKGPGAQKE 
PFAQKKKERGGGKKTRAAPNKKKKPGAF 
FSUCKTPLWAGGGPPPFF 


5611 


19512 


A 


5654 


391 


68 


NLYPHGGVPGVS PGKGSLPKAPNPLGGI 
PPSGEPPPliFPQGFFPCLKPRKRLGKAP 
GPFFFFFLAILLGWFQISNTDSCVQVFI 
IQTIWUFTTIIRGCHLHFTFLF 


5612 


19513 


A 


5655 


159 


34 


GARFLFFSPVLSPLIEHCVCVCVCVCVC 
VCVCVCVCVLCAOTCT 


5613 


19514 


A 


5656 


88 


388 


SIFCFIYVFFLRQSFTLVAQAGVQWRDL 
GPLQS PPQKKKGKETFSFLGENRYLQPK 
NFPTFLTQVCFFEKI PPNGLINGKRPKW 
I PGCE FLKTLWGPW 


5614 


19515 


A 


5657 


176 


371 


HFYFCFSDINLAAEPKVNRGKAGVKRSA 
AEMYGS VTEHPS PS PLLRSGTLLFITAL 
CPSVGIFSF 


5615 


19516 


A 


5658 


3 


113 


CSEPRSRHCTPAWA'TERDS VS KKQQTKN 
LSYKKIHW 


5616 


19517 


A 


5659 


409 


67 


SSSNKPPTRPVFILPFFGHGVPRLPPGK 
KPPGPPPGPPPPQVEQMYGRKGGFALNL 
PPRRQNKNMLYSPELGEK 


5617 


19518 


A 


5660 


2 


377 


GRISCSRNTPLRHRSLMLIVCGLRMPLS 
G FLP WVGQVFLAL I PVNPQPS S LGVRT I 
MDMLRCCRDFIYGQFWGQFMARLWGACS 
QQYTEFFRFKIIFISSLLFNIVLEVLAR 
AIRQEKELNGMLQ 


5618 


19519 


A 


5661 


256 


381 


NFNTGQAWWLMPVILAIWEAEAGESRGQ 
EIETILANTVKPHL 


5619 


19520 


A 


5662 


19 


313 


IiYSSLAIIiGFLPLLINFI 1 1 1 1 FFFFER 
DPGQQNQNFVSLKKRETPPPPFSLGVPG 
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SEQ ID 
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nucleotide 
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M 
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beginning 

nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

oentide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Hsoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G)utaraine, R—Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














KRSLIiPPPPGPKHAGPPVIiPAGLWGGPE 
PPTWGRGAKQKREI 




10/591 


A 




/o 




ETCRKRKETRWNLKKVCLLKMSLQGEKE 
ERKREKYRRTEMKRQKNSEKQRGREKGK 
KSKYQRRRKKEKXGENQTKPKPKVEESS 
SRQ 


5621 


19522 


A 


5664 


159 


28 


DTSTLPTPTGFKRFFCLSLPSKWDYRRP 
PICLPNFFFFLWTGY 


5622 


19523 


A 


5665 


350 


151 


IRCLCPDKFLAFCRDGVLLCCSGWSRTP 
GLKRSPCPSLPKSWDHRLCLADIYIYIF 
LKDTSTIMSF 


5623 


19524 


A 


5666 


394 


248 


LTPGGGGCGKLRLRHCTPAWATRANLPL 
KTKKKRKRFLFTLAETSKVY 


5624 


19525 


A 


5667 


1 


194 


RLNPGGRGCSELRSRHC I PAWATRTKLH 
LKKKKKGDQGFRGDHTKNIWLFPRVFLP 
LHSLNTLK 


5625 


19526 


A 


5668 


2 


312 


RGQNSARGIII IAILKKVCVYIYRCGIiT 
LSPKLKCRGTI IAPCSLELMGSCDPSTS 
ASQIARTADMHHHTRLIIASLTFCLLKK 
KKNVFCGTYNHCPPKGFYL 


5626 


19527 


A 


5669 


1 


245 


RLNPGGGGCGE PRSSHCTPVWATRVKLS 
IjKKKKKKRPPFFFFGIRQPPQGGPFQTL 


5627 


19528 


A 


5670 


1 


135 


ARTPSPSIiSESSEDEKPTKKHKKGKALR 
LKRRFWWLMSALPCIH 


5628 


19529 


A 


5671 


97 


2 


ICKFTHVWPGWAHACSPSTLGGRGGRIT 
RLN 


DOZ7 




A 






171 


VFSPPPPFiCTPPPPPKKIPPPKKKKTPP 
PKKKKLFFFFFFFFFFFFFFGTRIVTDN 
ILSGNRV 






A 




t*o / 


13 


SPGGGGGFAPYWMFFLDFERQSCSVAQA 
GVQWHNLSSLQPSPPRFYFFSRDGVKNR 
PVPRYAPPSPKGISLRPYK 


JOJ 1 




A 






14 


ANFLKAHI VSTLGLQ I IWSLLQLLNFAI 
VFYFLRQS FALVAQAGVQWRDIGSLQPL 
PPRFKRFACQRFSWEAEVAVS 


5632 


19533 


A 


5675 


2 


249 


SHPTHSHPQHLPLTPTWNWSSTPTOFIF 
RKAPPVFPHWQHHRAGPGTSLIPTDAPQ 
EASPPPPYPSSPSSTPTIPRETFLRQ 


5633 


19534 


A 


5676 


115 


241 


SSTLGGQGS ITLAQELRSCHCTPAWATG 
QDS IS KKKKTLLGP 






A 


5677 


460 


2 


RLGVTGRWQKLKPQERMRAWREWGKGPK 
ALPASWEGPTVNLLWKHEHRAWVSCHCD 
SGFCAWCQGLGSEQSGLPSLIPSWDGLV 
LTSPPNPTGPPPPLKTRSVLRGDDVLLP 
CDQPSNLARALWLLNGSMGLSDGQGGYR 
VGVDGLLVTDAQ 


5635 


19536 


A 


5678 


2 


346 


ENCLNSGG EGCS KLRLCHCTPAWVTERD 
SITKEKKKKNFKPVLE KFGEI FFLKGWK 
FFQLNPLKSSKEMEFFNITLPLEPCFNG 
NKPFWAAKI WTLNFFQTQKGS I FKWGGR 
KK 


5636 


19537 


A 


5679 


181 


4 


PKEVRTLFGRFFFFFHMESHSWQAGVQ 
WCNFSSPQPPPPGFQKFSCLSIiPISWNS 
RHM 


5637 


19538 


A 


5680 


549 


410 


F FVQTD FCHVAQAGLQLLGSSN P PAS AS 
PKYWDYRREPVCWPSSGF 


5638 


19539 


A 


5681 


ill 


403 


SRQINGFFWSRTKVAIGQFLVGPSVKI 
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ing to first 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny!alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LNLeucine, M=Methionine, 
N=Asparagine, P=ProHne, 
0 = Glutamine R=A.rainine S=Serinp 
T=Threonine, V=Valine, 
W=Tryptophan, Y«Tyrosine, 

X— Unknown *=5\tnn rndnn /=nn«ihle 

nucleotide deletion, V=possible 
nucleotide insertion 














FENNNEICMIKS IQSY I VLNHKHLKVYS 
SNI WQLKNILLRGGAVAHACNPS 1LGGR 
GRR I TRSRDRDHP 


5639 


1 7JHV 


A 




IRQ 


1 i 


NNNRKASWKKKDRVTjLYCPGCNWCDTjGS 
LQPLPPGFKRFSCIjKLPKCWDYKHBPLH 
PAPHV 


5640 


19541 


A 


5683 


2 


227 


IHVCVCVRVCVCILREDSAQPARCVCVC 
VCVCVCVCVCLDPRRELSPTSQLESSCH 
PLYVPSGGSRINRPYPGGC 


5641 


19542 


A 


5684 


318 


421 


DVPCFPVAFCCDCKFPBAS PAMLNCESM 
KPLSFI 


5642 


19543 


A 


5685 


213 


398 


SLCCLLFCCCYFETESRSWQAGVQWHD 
HSSPQPGPPRLKSSSHLSLWRHDHSSPQ 
PGPPRI* 


5643 


19544 


A 


5686 


398 


1 


SFPTSSTYSSSVOGGGOGAHSYKSCTAP 
TAQPHDRPLTAS S SLAPGQRVQNLHAYQ 
SGRLSYDQQQQQQQQQQQQQQALQSRHH 

AOKTT.HYDNT » AK YOH YGOOGOflYf'OPn A. 
AVRTPEQYYQTFSPSSSHRE 


5644 


19545 


A 


5687 


2 


288 


SDRPRSPSFSFPSSPAGVGRASSIFPIL 
SAILLLLGGVCVAASRVYKSKRNIILGA 

VCVCVRARARA 


5645 




A 


JvOO 


j I** 


o 
Z 


CP PG <30r ^RTGDGGGGR <5 FVGGHGGnW P 

GAPQRGRYLQPLLMCGAS PHASPRQPLA 
ILAAGVKFRVLRHEAGRPLRItLMQINPL 
P YSRWHTYRLPS CGCLTCI 


5646 


19547 


A 


5689 


122 


3 


MGGGEFSKKMNINGO^VVAHACNPSTIiG 
GRGRQITRSGV 


5647 


1QS4R 
I j**o 


A 


SfiQft 

JOTU 


144 


i 


EAVKKYTCPTWAGAVAHACNP ^TLGGRG 
GWITRSRGRDNNGKHGEIPR 


5648 




A 




Oil 




r/5 VFPP FFFI»RGEGI\S P P P PGGi\VKNO P 
FFPPGQKGPPGTPERGFQGFLGPKFFPP 
IjPPQGPraiFl^KKKiaaaCIPICAIxAVL 
TYQHFCYRTGQERVDGLREL 


5649 


19550 


A 


5692 


12 


147 


YERRFCHIGQAGIxELLTSGNPPASASRG 
AE ITGVS YHALMRNIS 


5650 


19551 


A 


5693 


183 


395 


GKRFGFIiHPPPPGFGGFFRPPPPKTEGK 
RAPPPPPRFFFFIX3GGGFSGGGPRGVLF 
LEHENP P PGAPKGVG 


5651 


19552 


A 


5694 


177 


283 


THFI^TDGPJlWSIxASLPSSGYGlNTPS 
STVSVSA 


5652 


19553 


A 


5695 


266 


460 


LSQRKYAFFFFFFFFGKKI FFSSPGGEI 
PAQFFLTEPSPFWVKGFSPLPLPKTWDY 
IGRDPPPLG 


5653 


19554 


A 


5696 


1 


412 


MQPLKVNlxlxEQKNI^IxEFVAPFTWIHCP 
SYWSIAYNVYFSQYSFB^CTMFITVN 
IPSKKKKKKKKKKXKIOitKKKKSSSPGSP 
GYAPKKAPKNKKKK 


5654 


19555 


A 


5697 


379 


0 


FFFPSQKKKKKKKKKKKK 


5655 


19556 


A 


5698 


1 


187 


PTRP PTRFRFENGVNLGGGGCGEPRLCH 
CTPAWVTEPJDSVSKKKKKKKXAVGGPWF 
FQKKKV 


5656 


19557 


A 


5699 


374 


0 


DFLMKKKKKKKKKKKKKKKKKKKKRGG 


5657 


19558 


A 


5700 


219 


413 


PFCPQGGGGGPNFVFK3PPPPGLKEIPP 
PPPRGGGKKKPPPPPRGGFFFFKKKGVS 
PGGPGGSPP 


5658 


19559 


A 


5701 


381 


249 


FHRAGQDGLDIjIiAPGS APLGLP KCWDYR 
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nucle- 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D-Aspartic Acid, 
E=Glu tamic Acid, ^Phenylalanine, 
G=Glycine, HNBttstidine, I=Isoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Clutamine, R—Arginine, S^Serine, 
T=Threonine, V=Va!ine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknovro, *=Stop cod on, /possible 
nucleotide deletion, V=possib!e 
nucleotide insertion 














REPPRPAFFFFFFF 


5659 


19560 


A 


5702 


382 


197 


LANFC I FSRDG I LPCC PGWSGTPGLKQS 
AHLCLPKCWDYTHKPPCPAPQVIKPSIN 
ILIVW 


5660 


19561 


A 


5703 


1 


381 


ERGSKHPPRGLKHIRGRTCFVLKGCMIQ 
LCWPPGQGKKKKEGGGEKKKDKKREEE 
GREK3CKNGNQGERVKKKKSKRKKKKRSS 
SS 


5661 


19562 


A 


5704 


291 


167 


NGPPFFFFFLVKMGFCHVGQAGLEFLTP 
GDPPALASRLLGL 


5662 


19563 


A 


5705 


367 


23 


KNPCPPPPPEFFLAPKKKGPPPRKKTRF 
VPPPLFLIKGPPFKRGRGPPPGGFPPLF 
CPLGGKRGGVFLSPRVGKPRGKGGNPLF 
PKKKKKKKRRRRKKEGKKGRKERQGYIH 
TV 


5663 


19564 


A 


5706 


411 


116 


FLKRWGSHYAAQAGUSLLDS S DIiPVS AS 
QSAGTTGVTHCSRLSYSALACQLCQPVW 
RIPWRGRGVGRETSFRGWGRRYGRAWSS 
SGGGAESLPGLSPG 


5664 


19565 


A 


5707 


162 


2 


SYCFPNSDDFIPFQPLPHTWFLWPRPFF 
IFKTAHGWARWLTPVIPALWEAEAG 


5665 


19566 


A 


5708 


1 


416 


LPLLYKIPPPPAILFLFIKDSPPPPPPP 
PFFFFFFPPP 


5666 


19567 


A 


5709 


403 


142 


LSKEKKWGWPRGFSPRIFEKPPPPKIFF 
APPKKKIFPPPPGEKKFFFKGAPPPFFF 
FFFSFFFFFFDCYNSSTWNSAWHIRCS I 
SIC 


5667 


19568 


A 


5710 


339 


461 


PSTFILFILRRSFALVAQGGVRWRHLGS 
PQPPPPGFKRVPA 


5668 


19569 


A 


5711 


408 


42 


TGARIIFSGPPKKTYPSPPRAFIILSFK 
RAAPQLIYFYLNLFAYKRLVKLTTVIMR 
RRPTFFLLLFKTFLFVEPGFCHVAQAGL 
QFLGSSTCX3SIiPQCWDYRCEPQHLAKRS 
TFIKSESAV 


5669 


19570 


A 


5712 


277 


9 


GVCVOTGGVWPPMGLQTPSAPWGLSLA 
PPLGTLCSVQWLAESIHLCICQALVEPL 
RRQLYQVPVRKLLLASSIVSGFGDSIWN 
GSYLR 


5670 


19571 


A 


5713 


411 


0 


FLQILLFLIDIAFFLVFFFFFWKK 


5671 


19572 


A 


5714 


205 


400 


HYLIQSSSSMASESVQLDWMKIPLTWP 
GAVAHTCSPSTLGGRGGRITRSGDGDHP 
GQHDEAPSL 


5672 


19573 


A 


5715 


1 


89 


FFFF KTES RSVAQAGVQWCTLGS LQP P L 
P 


5673 


19574 


A 


5716 


176 


392 


GVHLVTiEWKLSMVCAEDDELFPLCSWKK 
FFFFLFLRQSRSVTQAGVQGRMFGSLQP 
LPPGLKRISCLTRPSS 


5674 


19575 


A 


5717 


1 


89 


FFFFKTESRSVAQAGVQWCTLGSLQPPL 
P 


5675 


19576 


A 


5718 


306 


42 


GSGLRROUiNSVKCLNTLECILLGIYLR 
QIKICVYTKTCTCKFI VTLFI IAKKYEY 
PSVHQLINKMWHIQI IFLVIKRNEGCS3G 
SCL 


5676 


19577 


A 


5719 


1 


263 


PTRPGRFFL ILGDRGCS ELKS CVCTPAW 
VTEGDS XS KQNKKQVELRSPAAHLLGGL 
VPNRPQASTRIGDPCTKKLLELISKFSK 
AAA 


5677 


19578 


A 


5720 


3 


278 


NQEILKNVESSRTVQPHFLEFLLSLGWS 
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M 
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ing to first 
Amino arid 
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peptide 


Predict- 
ed end 
nucle- 
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correspon 
ding to 
last amino 
acid 

rpciHiif* of 

peptide 
cAn ma nee 


Amino acid sequence (A=Alanine 
C=Cysteine, D^Aspartic Acid, 
E=GI u tarn ic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lyslne, L=Leucine, M=Methionine, 
N= Asparagine, P=ProIine, 

T=Threonine, V-Valine, 
W«Tryptophan, Y=Tyrosine, 
X— Unknown *=Stoncorfon /=no«ihlp 
nucleotide deletion, \=possibte 

niir.l^nfirip incprtinn 














VDVGRHPGWTGHVSTSWS INCCDDGEGS 
QQEEVISSEDIGAS IFNGQKKSMHI I KQ 
IKYFLSH 


5678 


19579 


A 


5721 


127 


3 


KKKKKKEEEBQGEGRRRRRKKKKEKEKK 
EKEEDGEGKEKGE 


5679 


19580 


A 


5722 


904. 


Of. j 


VTSRS SLFSTPGI KRASCLGLPKCWD YR 
PQPPHIA 


5680 


19581 


A 


5723 


3 


261 


QIVRQWKYNDDDDDQLFYTRVYLDPGLR 
VGKAEEWAHPAALWEWRPO AEWDT.GS KG 
GCKEKNVCLGVVAHACHPRLLGGLGRRI 
A 


5681 


19582 


A 




114. 


jDo 


PPPPPPPXLLFFVSAVPPSPSPSLSFPP 
HRPCYSPPSPTLSSLPPPLLFPPPPSSP 
PPPPRPSAPSPPPSSPFFS 


5682 


19583 


A 


5726 


248 


470 


IiLNVQNGIRNVCSCPHYTPFHKVVKYLS 
VFTiTRvn ^nr.RPDT iHry; Qna a t rpm ttt 

vrjjxov yoOyiMu(yUv«UUOUnnl JCitwlXIl 

PGREKVQILHLALIYFLV 


5683. 


19584 


A 


5727 


1 


281 


NQHNRGFFDSCDGFFTNYNWREEHLERM 
LGQAGERRADVYLGVDVFARWNWGGRF 
DTDKGGGGFLS KGQRPSAS PGTS FPHGP 
SIPVCTSKE 


JUOf 




TV 


j/Zo 




3 


CrkTOTT TT.rtDHTTT.DCCCTTiT TQTXTTVC 

dU 1 it 1 Jj X LuJrCiS x 1 Jjiroirtj XAL» 1 o jli VlAjo 
GNPVSVEVQSVTDGVKVKVSRVPDGVAE 
YSWELKLPTLRQRLFRCVS IRENDDGT 
YAITAVQHVPEKEAIVDNGAHFDGEQSG 
TVNGVTPPAVQHLTAEVTADSGEYQVLA 


5685 


19586 


A 


5729 


70 


465 


SQCSPHHSVLCQKVQDLESGTLQSNWKK 
x x c rii/jZiix vv.vv<vLv ^.vurv* vol iv* v xvlv 
YVCLCVCVCVTV CMCVCLY I KGLLFLIiA 
KKETAGSFPHSQLTCLGVPCAFTIHPVP 
FLPQCRSHCVSGGRFWPGVL 


5686 


19587 


A 


5730 


281 


427 


SWLTAAPNFQLQMESRSVAQAGAQWHDL 
GSLQAPPPGPQSAETQRQMIV 


5687 


19588 


A 






IQd 


TKFHHVGHAGIiEIiVTSGDLiTjS sasos ar 
ITOMHHCSWPASVLSMNRPSGSTRCLRN 
GIFGPLKDFKPNDLTVQL 


5688 


19589 


A 


5732 


229 


2 


PGGAPPLVRSGFPGWGGAPPFVFVPNGE 
PPKNFF F F FDGVSLCKPGWS AVAGS OT.T 

ASST3RVHAILLPQPGIVA 


5689 


19590 


A 


5733 


435 


138 


PPRYPPPLWGAPPHLSRGGGSFTPLTPQ 
GKPRFFLKIKINPGGGPPPYSPPFGGGG 
PRNFFS PGAKGPLVLISPLFFPPGDQKK 
NSFFKKKKKERERE 


5690 


19591 


A 


5734 


113 


409 


KDVCGGVLTGLSGVLTSPEYPNNYPNSM 
ECHWVIRAAGPAHVKLVFVDFQVEGNEE 
CTYDYVAVLGGSSPTPVHQYCGS ISAPY 
LFFLAKNIMSYSLPN 


5691 


19592 


A 


5735 


3 


122 


GGWQVQFVSEPCSLS INGVI FGLTSTDL 
LFHLGAEEISR 


5692 


19593 


A 


5736 


14 


428 


FLALRWGHIRLPASGPRDTATLFSTLDT 
QlxLOTLWGKDETGFYVSKALVHTGVAL 
VPRGLTLAPADGPTTDEVTLQVSGEREG 
SPSTAVRYPSGSVALPSQWLLIGHHELP 
PVLHTTMLRVHPTLGSGTAETRP PEN 


5693 


19594 


A 


5737 


415 


1 


QHINTLGQFWTTKAFIiPRMLEIiQNGHIV 
CLNSVLALSAIPGAIDYCTSKASAFAFM 
ESLTLGLLDCPGVSATTVLPFHTSTEMF 



888 



WO 01/64835 



PCT/US01/04927 



SEQ1D 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ10 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 
Dentide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=»Aspartic Acid, 
E=Gtutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Try ptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QGMRVRF PNLFPP LKP ETVARRTVEAVQ 
LTQALLLLPWTMHALVTLKS I LPQAA 


5694 


19595 


A 


5738 


335 


152 


GV^GFHHWQAGLEFLSSSDPPASASQ 
SA3ITGVSHQASPFFLPLSGITLPLSIY 
SSGSD 




i nerve 

1 Sow 


*\ 








IDASDFGECSQAAINDNWIVYELLEEM 
VDNGFPLATESNILKBLIKPPTIIiRSVV 
NSITGSSNAGDTLPTGQLSNIPWRRAGV 
KYTNNEAYFDAAEEIDALIDKSRSTVFA 
EIQGVIDACliKLSGMPDLSLS FMNPTLL 
DD 


5696 


19597 


A 


5740 


429 


325 


NPGGRGCSELRSHPCTPAWVTEQDAILG 
GRSGRIT 


5697 


19598 


A 


5741 


344 


423 


FWPGAVAHACNPSTLGGRGGRITRSR 


5698 


19599 


A 


5742 


431 


103 


RGPFFPRPRQKKGGSPGGGPPQNKKNGP 
PLHTGGAKGGSFFKKKKKKKVPKKGPGL 
FFPG PQG PRKG P VGAP P VL I PNRLGP PR 
IPPGWEPGRQIFFLAPVFFFPSKPK 


jt>yy 


to Ann 






j 1 


774. 


EEMGFYHAEQADFELLSSNDPAAIiAFQG 
AGIASMSHCSWPGKNFNMFCCIWKMKRL 
AIHWECIA 


5700 


19601 


A 


5744 


217 


400 


HSVMGGSPSFFFFFFFFPGEKGLFFFCP 
GGPNLGKGNPPPPGKKGFPPPPPPKGGN 
NGAPP 


5701 


19602 


TV 




A 




DGLEFPGRRFRGQRCSSPPRRGRGQVEA 
LLTSQTGRRGRGAPHVS DDGRRGRDAPH 
FLDGLVAGKRRSSLP I WDGGRAETLLTF 
QTGQPGRGAPHVPDDGRPGREAPHFPDG 
VAAGQRLQSRHFGGQGRRLGGGGCIEPR 
SRHCTPAWAP 


5702 


19603 


A 


5746 


381 


212 


P RGGGGGRYS P P PGGGGRGNS FY PGGGG 
LNKPHFGPCPPPGGPKMKPPPQKKKKSL 


5703 


19604 


A 


5747 


334 


432 


MFCENYKE IAWPGAVAHSCNPNTLGGRG 
RHITC 


5704 


19605 


A 


5748 


190 


1 


IPPKKGGLNFFSPPTPKKGVKSKPPQGK 
VFFFLRRSFTLVTQHGVQWRDLGSPQPP 
APGSKHE 


J ruJ 


iy\J\f\J 


A 


5749 


425 


297 


ESAFHHVAQAGLELLGS CDLP VSASQS V 
GVTGMSHHTRLYCS 


5706 


19607 


A 


5750 


425 


0 


KKKQRKKERKKERKKERKKERKKE KKRT 
KSDE 


5707 


19608 


A 


5751 


136 


3 


AQSSRPAWL I FLFLVETGFHNVGQAGLE 
LLASGDLPTSQVPRRP 


5708 


19609 


A 


5752 


3 


237 


GCSEPRSRHCTPAWVTEGDSVSKTNKQK 
TQRGLKS IALFPAIHSLLPWIKIAMPI 
LAMDIFPHPIVEFSYHVWEQM 


5709 


19610 


A 


5753 


1 


358 


GDRGCS ELKS CMC I PAWVTEQDLPS KKK 
KKKKGGKKTRNGPQTCGGDLPPPPLGGG 
KNFPKGETFWFHSGKKKRFFSLGKGEKC 
FFIPRAPKGPGKTLGGPNRGGGAKKKLK 
RPPGPVY 


j / 1U 








9 


191 


QGCSELCS YHCT PAWETRARPVS KKKKP 
KTKNQTKKNRLGKKPPNLI FPPFQKKKK 
WVQIPGK 


5711 


19612 


A 


5755 


42 


408 


KHAAP PAS hS LSLLLHHGQKRAC F P FAF 
CRDCEFLEGSPAMLPVQPAKLTPRSTPP 
HP CS LENDS AHFFN 


5712 - 


19613 


A 


5756 


179 


410 


LQAFLRPSPPVLFLLYADLSNQNLWPGM 
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VARACNPTTLGGRGGRIT 


5713 


19614 


A 


5757 


166 


2 


GRPGPADFRVRPQLLQRFLFIYLFTEME 
SCSVTQAGVQWCNLGSIiQPLPPGLPD 


5714 


19615 


A 


5758 


334 


3 


EEKVFGLPSP ILLS KGQGGAASGPPCP I 
SPRAMGPQKKGWETMVF ITKAPPS FGFV 
LVKQS PGTG PVPRNFF FFLRRSLALS PR 
AGVQWRDLGSMQAPPPGFTPFSCLNL 


5715 


19616 


A 


5759 


373 


470 


SSCWSLRNSSPGAVAHACNPNTLGGRGG 
RITR 


5716 


19617 


A 


5760 


210 


384 


EALDTTI FFFFFFETKFCFVPQAGGQGQ 
NLGSLGAPPTGLTPFSCLTLRICTWDCGP 
PP 


5717 


19618 


A 


5761 


257 


400 


SLGS EYTWVCFCVFETESRS VGQVGVQW 
CNLSSLQPPPPGFKRFSCLR 


5718 


19619 


A 


5762 


400 


2 


ARAVGS VGEDPGLGTGPS PARETWEWAV 
VCRACRGSQLLWGKIiFILRHWDLRPCVIiA 
RGNGTKDOjKQLWSQCGAGMGRGPFQGL 
KGKPVGCLCSSREVFYLFLYLFRDRVSL 
CHPGWNAVKQTRLTAASASQ 


5719 


19620 


A 


5763 


34 


440 


RVRAPLGWEEGNYF VLRAKPRSGKRS CL 
PFLRCGHQEPLLWGISTASPLRHLVGAS 
QICGVLFLCFQVSLKPAGSWKWIGNNG 
ERKAQRLCPLRATRVEGI FCFRFVFLRW 
SFTLVAQAGVQWRDLGSLQPPPL 


5720 


19621 


A 


5764 


297 


468 


LVGAGVPTGCGQRNGWNSHKSVQAQWL 
MPVIPALWEAEAGGSRSQBIETILANMV 
T 


5721 


19622 


A 


5765 


i 


205 


GFKLTDS FASGAKAI LRSQLLGRLRQEN 
RWNLGGGGCWGLKV/HNCP PAWAIEGNLL 
FKKKKKKGGGTF 


5722 


19623 


A 


5766 


307 


1 


KERDPPPlVFGKTFFFFFFKRNFCFFPP 
KEGQGG I LGPRHP F FSRLS FLGRGVPRG 
FPPGPRNFFFFWILVKMGVPHVGQADFK 
LiLTSGDIiNKTASQNAGIK 


5723 


19624 


A 


5767 


2 


319 


LLGNGEELSRFP FQRSSQAWVFESSAG I 
FGGKAGATGLEKLTGSFQQLTGHPDPRT 
PEELGDPEKSSASEKTRGLQEANGVPEI 
LKAVTYTQAWPKEAKVDPITP 


5724 


19625 


A 


5768 


787 


1118 


EAAWRELEAERAQLQSQLQREQEELLAR 
RKAEKEQLSEE IAALQQEHDEGLLLAES 
EKQQVREPWRGLCCSLSCSSSGAGPCSA 
TWQLGALGQATALLGASVSLPAGGED 


5725 


19626 


A 


5769 


2 


452 


NGAGTMSVSLEADENPFAQGALKSEDCF 
I LDHGKDGKI FVW KGKQANTEERKAALK 
TASDFITKMDYPKQTQVSVLPEGGETPL 
FKQFFKNWRDPDQTDGLGLTYLSSHIAN 
VERVPFDGGNLDTFTAMAAQHGMDDDGT 
GQKQIWRIEG 


5726 


19627 


A 


5770 


2 


130 


GTQGKI VDLVKELDRDTVFALVNYI FFK 
GKWERPFEVKDWKS 


5727 


19628 


A 


5771 


422 


8 


HIARWGARAPQGGWALLKAELCLNPAPP~ 

PIGARTAPLWPGCCFSLRPVFLLLFPFF 

SLFIQSFRILYPQPWPLQGRRSTTLSRM 

GTTDHVIVLASTNRADILDGALMRPGRL 

DRHVFIDNPTLQERRKCMRRLGASRN 


5728 


19629 


A 


5772 


432 


3 


YSLNLKKRWFVLTHNSVHYYKSSEKSAL 
KLGTWINSLCS WPPDEKI FKETEYWN 
VAVYGRKHCYRLYTKLLSEASRWSSAIQ 
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NVTDTKAPIDTPTQQLIQDLKENCMTSH 
WEQ I YKRKKI LRYTHHPCIAAAVEDHD 
ND 


5729 


19630 


A . 


5773 


20 


432 


SRAAALLEAVTETLFYYEVAEKIWSNRA 
NRQCADCGSSRPDWAAVNLGWICKQCA 
GQHRALGSGISKVQSLKLDTSVWSNEIV 
QLFIVLGNDRANRFWAGTLP PGEGLHPD 
ATPGPRGEFISRKYRLGLFRKPHPQ 




IQJttl 

I/OJl 


A 




All 




RGAFQTLQKKDLQAFVLSSVQLQVLTGS 
CFKLRTVHNIPVPSNKVDDECPGLYGFL 
HVIVHPAKGFKQSATIiSCAIiEVDSFGYF 
VSKAKTRVFRDTAEPKWDEEFEIELEGS 
PSLRIIjWYEEYNNKSKVNNLYSAAVGFS 
AWQ 


5731 


19632 


A 


5775 


133 


436 


MLISLPHPSTHLTSLPFCADFGTRQTTG 
AASHVLTIiSAHSSVCSCPPLMGEATTSR 
AQDLPADEHTAFAHELEAPALSKGQGLS 
ILRALTGHLPLGQEVNL 


5732 


19633 


A 


5776 


397 


58 


KGTISIGKTjEEKLRGAARQAIiADAIIEF 
QLLPASLCTEDTPTGMQVKRSLGSRECH 
RSKQDSVAGERAKVGFRGSTSENYYIRT 
TYSQLSVSLSGHHPGATLAAVNGCSCSP 
T 


5733 


19634 


A 


5777 


1 


324 


MRRCVSAGGVELEEYYPAPIiDMVRSMLE 
GSIDPTQYEDTLREMFTIHAYVGFTMDK 
LVQNIARQVSRAGVGPALRTAEMALPSF 
HS PLGRGPWS TANI*SCFFL I ETK 


5734 


19635 


A 


5778 


1 


398 


RLGSRPSLHDQSPLELRSEIQELKGDVK 
KTVKLPQTEPLCAIQDAEGAIHEVKAAC 
REEIQSNAVRSARWLFETRSLDVINQDP 
SQVRL IRGI SLEEGARPDVSATRWI FDT 
QSLDAIREILVDEKDFPPSP 


5735 


19636 


A 


5779 


281 


12 


VACNLGWRVRTPTPASYPPCVPQLSVRG 
KNRKELLGGFXRNIVKSADBALITGMSG 
LKEVDDFFEHERTNLLEYHTLYCGRSED 
HATSR 


5736 


19637 


A 


"5780 


406 


23 


DSQATGPGISRGRRISSPSSLCTRFSTS 
IVKCSPLEHVLKYLDPPPPPTRCSRPAS 
IiRSSPKHCVQQQFPIACRPGAAAPPPPP 
AQCPNPGPAAASARHEPKWGSRTEGGPA 
RARRREGTGRRKKRR 


5737 


19638 


A 


5781 


415 


18 


TSQAGDDLLHGSLLRRGPEMGYLPGYDP 
DPTLATPPAGQTLAVPSLPRATEPGTGP 
LTTAVTPNGVRGAG P SAPELLTPP PGTT 
APPPPSPASPGPPLGPEGGEEEPPPTI I 
TTTTVNTTVTS PMYCGRSEG 


5738 


19639 


A 


5782 


168 


432 


GMRRQTGLDQQSRGVGKNWRVDFFFPSS 
LSFFFFFFGKESYFVLKPGIHGLKPGYL 
EPLSPGLKEFSRLSPPKGGHLKCRPPSP 
IiFFF 


5739 


19640 


A 


5783 


1 


423 


TDDDLNWLDHSRTFREQGVDBNETIiLLR 
RKFFYSDQNVDSRDPVQLNLLYVQARDD 
ILNGSHP VS FEKACE FGGFQAQ IQFGPH 
VEHKHKPGFLDLKEFLPKEYIKQRGAEK 
RI FQEHKNCE KI S ERSAKV 


5740 


19641 


A 


5784 


273 


400 


DSVVLLMLSPGVCPHVQVWPISCTLGAT 
FGYVAGLVT S PLW I 


5741 


19642 


A 


5785 


445 


28 


DTNRAPNHME IKGSGGTEAKVWGADS ID 
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KEAAGDETMEARTMEAEATGAKPTGVBA 
TGAKVTETKPTGAEVREMBSTEEEANME 
SKE*TGAQATDTBTTGVEAMQVEATBCTKA 
EEAEMQAYGVGAGQAEPPVTGTVLRPL 


5742 


19643 


A 


5786 


380 


2 


QRQQESPEASSLHILERQVQELQQLLVE 
SQEEKESLGREVESLQSRLSLLENERGN 
TSYDVTTLQDEEGELPDLPGAEVLLSRQ 
LSPSAQEHLASLQEQVAVLTRQNQELME 
XVQIMENFEKDBCI 


5743 


19644 


A 


5787 


663 


2 


FGVERRATRTIRIiTRTVLDLYS FLAGVS 
ENLRHATQDDASRTRAPGLSSQHPKPDT 
TVSGDTETGQSPGVFNTKESGMKDIKSB 
LAEVKDTLKNKSDKLEELDGKVKGYEGQ 
LRQLQEAAQGPTVTMTTNELYQAYVDSK 
IDALREELMBGMDRKLADLKNSCEYKLT 
GLQQQCDDYGSSYLGVIELIGEKETSLR 
KEINNLRARLQBPSAQANCCDSVY 


5744 


19645 


A 


5788 


398 


1 


TRCYHS ALRYGGS FAELGPRIGRLQQAQ 
LWNFHTGSCQHRAKVLPPLEQVWNLLHL 
EYKRNYGAKRGGPPVKRAAEPPWQPVP 
PAALSGPSGEEGLSPGGKRRRGCNSEQT 
GLPPGLPLPPPPLPPPPPPV 


5745 


19646 


A 


5789 


116 


371 


RHPKIWGLRGPPGLTLEKRGKGEEPLDP 
RSWGFKEAETPPRPFTRGKKGTPRLKKK 
P KERKTRERGGKKEGKKKGKKNEGJCTNP 
h 


5746 


19647 


A 


5790 


406 


284 


RRGFFHVDQAGLELPTSGDPPTLASQSK 
PGPIiCPALLHTF 


5747 


19648 


A 


5791 


200 


3 


GDALIYMEPEKQVMSRSSDECVVALCDQ 
WLVS YSKKI PLVNLLTFDLKKNIS ICPL 
PGTWIMEMY 


5748 


19649 


A 


5792 


372 


278 


LQIQSPAGQSRCGGFLVREDFVXrTAAHC 
WGR 


5749 


19650 


A 


5793 


1 


167 


VAPPKLDPHSGRVRAQAGLELLTLCDPP 
ASSSQTAAITGASHHTRQVLGSYKALS 


5750 


19651 


A 


5795 


112 


3 


SSWGKVGAHAGEYGAEALERMFLSFPTT 
KTYFPHCI 


5751 


19652 


A 


5797 


356 


238 


FLFCNNGVYSLEHFRCLAENAGDDAFVK 
DVTVLQNTDGR 


5752 


19653 


A 


5798 


379 


60 


VKIRHCHTMVSMSLLKNIRLILEEEQVE 
GNYS FCVLDNQNLQQLWDWDHRS LTI KA 
GKMYFAFNPRLCVSEIYRMEEVTGTKGR 
QSKGDINTRSNGERASCE 


5753 


19654 


A 


5799 


377 


3 


KVAFPGEGRQGESWSAFQGHTRWSDLK 
WAVYEPDLMVTSSVGTYIYIWDI KDTRK 
LTVALPVVAGASQVKWNKKNANCIATSH 
DGDVRIWDKRKPSTAVEYLAAHLSKIHG 
LDWHPDS EHNLY 


5754 


19655 


A 


5800 


143 


316 


CPTSPPRLPYTEPFGGVSGLTVEQFRKI 
NGFPNAPWGWGGBDDDLWNRYLCRSFGV 
F 


5755 


19656 


A 


5801 


390 


2 


LFFFFFFFIiRQSLTLVPQAGVRWRDLSS 
LQPPPPGFKRFSCLCV 


5756 


19657 


A 


5802 


342 


465 


VIFFFKLKQGLTLSPRLECSGTIITHCG 
LDLLGS S ES PTFT 


5757 


19658 


A 


5803 


1 


378 


VYSSEFENIKEEYRGRGFPTICYFEKGR 
FLIQYDNYGSTAEDIVEWLKNPQPPQPQ 
VPETPWADEGGSVYHLTDEDFDQFVKEH 
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S SVLVMFHAPWCGHCKKMNP E FE KASEA 
LHGEADSSGVLAAV 


5758 


19659 


A 


5804 


152 


1 


KPEIFFFESRSVTQTGVQWCNFGSIjQPP 
PSGFKLFS CLSLPS SWDYTAHV 


5759 


19660 


A 


5805 


207 


31 


LYIjQLLQQTASSGNLNTLSSLHPMGGKC 
FTAAEEQQHPGQQTQLTTVLGVWHRFCE 
NIF 


5760 


19661 


A 


5806 


199 


2 


GSECSCLLSRGGEMHVSLSSFKVTFSCD 
AIIHFSFLFFWRQEFPSVQAGVQWRDIiG 
SLQPPPPGV 


5761 


19662 


A 


5807 


385 


239 


QDQPGI^KMCVCVCVCVCVCVCVCVIIC 
KS CIRLS YMKM P INFGLKNL 


5762 


19663 


A 


5808 


39 


184 


GRIIIKFFGEMRSHHAAQTGLELLGSSD 
PPTWASQSAAITGVSLRIRP 


5763 


19664 


A 


5809 


376 


2 


WAEPTSFHVCITSYTQFFRGLTAFTRVR 
W KCLVIDEMQRVKGMTERHWEAVFTLQS 
QQRLLLIDSPLHNTFLELWTMVHFLVPG 
ISRPYLSSPLRAPSEESQDYYHKWIRL 
HRVTQPF I LRSV 


5764 


19665 


A 


5810 


271 


120 


FLCFIGKTSERGSFSLYSRDTGLPGCQV 
SLMIKKSNEMVFKHINKGSINR 


5765 


19666 


A 


5811 


367 


3 


DKSFKEWHKMLEDANAPTEERDERIAGL 
YEE I ERDLMLLGAS AVEDKLQEGV I ETV 
TSLSLANIKIWVLTGDKQETAINIGYAC 
NMLTDDMNDVFVIAGNNAVEVREELRKA 
KQNLFGRCI 


5766 


19667 


A 


5812 


294 


37 


FRGENPDDGVRGS P PED YRLRQVAS SLF 
RGEHHSRGGTGRIiASLFSSLEPQIQPVY 
VPVPKVSHWAFFPNSSLBPLTYKAI FLP 
TA 


5767 


19668 


A 


5813 


184 


3 


I IS YLKNYLLFSPLQRQNFVLLPRLECS 
GT I TAHCNLKLVGSGDP PAPAS QVARTT 
GMRH 


5768 


19669 


A 


5814 


468 


1 


DDMEYEGSVSVFTQVPRKSVNKGIiDYTIi 
MVAGEFGLGKSTLVNSLFVSDLYRDRKL 
LGAEERIMQTVEI S KHAVDIEEKGVRLR 
LTIVDTPGFGDAVISTECGKRRAMY 


5769 


19670 


A 


5815 


24 


314 


QAPPGPKCFNVNLCFSLSLSKGEPEFHY 
IAGAHGNEVLGRBLLLLLVQFVCQEYLA 
RNARIVHLVEETRI HVLPSLNPDGYEKA 
YEGVMALAHLTQ 


5770 


19671 


A 


5816 


474 


3 


TTLRKGYSRHKGLLHS IRQEVQATPLEG 
IIVSGQMSTMNLSSTLWPSPKRVRRLCI 
GRTLRSLRIRPCKRTPLQTRRMPQKMRA 
LYGDPGDSPPLRSRAIPASSPGSTGSFR 
ARPPHPLHPPPHLTREGEPPLLPVLQPR 
RGPPLPIiHSQASRPIV 


5771 


19672 


A 


5817 


511 


389 


GMYCTEGREWPTFRNEIEIEBDHCGRV 
SASIILTHCSNR 


5772 


19673 


A 


5818 


510 


2 


PLS CEVPNPLF PVPGVRM I QHH I EACAQ 
VRGMQEIVLIGFYQPDEPIiTQFIEAAQQ 
EFILPVRYLQEFAPLGTGGGLYHFRDQI 
IiAGSPEAFFVLNADVCSGFPLSAMLEAH 
RRQRHPFLLLGTTANRTQSLNYGCIVEN 
PQTHEVLHYVEKPSCIAAAIEDQASSRQ 
R 


5773 


19674 


A 


5819 


282 


152 


FSLFLFLLQSERHFIMQWCEATQCPDT 
RVSVRLYEKSVLHIiS 
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5774 


19675 


A 


5820 


347 


1 


EPHPVTPPAVGTSRYNPLIMMDRNRSAV 
TPPSHPPQQPSSMQTGMNPSAMQGPSPP 
PPPPSYMHIPRYSTNPITVTVSQNLPSG 
QTVPRALQILPQIPSNLYGSPGS I YIRQ 
TKV 


5775 


19676 


A 


5821 


482 


13 


IYEKLITGCYNILANHADPNSGLDESIL 
EECLQYLEKQLESSQARKAMEEFFSDSG 
E IiVQ I MMATANENIiS AKFCNR VLKFFT K 
LFQLTEKSPNPSLLHLCGSLAQLACVEP 
VRLQAWLTRMTTS 


5776 


19677 


A 


5822 


608 


7 


FRFAEKWEGMFI IVNS IT I KIHSKAFH 
AS FELWQLQGYS VNPNWQQSDLRLTRIT 
DPCRGEVLTFKE ITWQTLRIEADATDNG 
DQDPVTTPLRLITWQGRIQIAIiKRRTKD 
CWISSKIJ^FIiIiDDLLWVLTDSQLKAMM 
KYAESLSEAMEKSAHQRKSIAPEPVQIT 
PPAPSAQQSWAQAFGGSQGNSNSSSSPV 
LRPL 


5777 


19678 


A 


5823 


328 


3 


AEVASEDCSLPCFLAVWNRI IEPVAAMR 
KEADMIiRLFPEYLKGEELFGLTVHAVLR 
IAESLPGVESCQNYLFRYGRHPLMELPL 
MINPSGCARSEHKILTHYKRPSLY 


5778 


19679 


A 


5824 


69 


308 


TS VPS CVRCRY 1 1 LRTS S ALTN I LGTHS 
NSSFFHASSSALHSCCFPFFSWLQTLDI 
NVKAPALMTKAWPEMEKRGYRE 


5779 


19680 


A 


5825 


424 


2 


LQRAFS ESI RKLRGYQGADRKQ I YHRRC 
S FANHSVRPSADEKCNSNFFEQRHGGSH 
QSSKWTPVGPAPSTSQSQKRSSGLQSGH 
SSQRTSAGSSSGTNSSGQRHDRESYNNS 
GSSSRKKGQHGSEHSKSRSSSPGKPQTV 


5780 


19681 


A 


5826 


107 


2 


S S LTAG VRMG VPAQ S TQGTVNGS S PQMS 
GTAALTS 


5781 


19682 


A 


5827 


2 


437 


FPTEDSRTSKESMSEADRAQKMDGESEE 
EQESVDTGEEEEGGDESDLSSESSIKKK 
FLKRKGMTDSPWIKPARKRRRRSRKKPS 
GALGSESDKSSAGSAEHIGPCDSTGDME 
VSSGLPGSRRPENPVFLCIWVAVRAICP 
GFIiWS 


5782 


19683 


A 


5828 


487 


22 


MPEPVFPLSHFRQF IAAX KLQFQARLSR 
CVRDLVRSLAAPNHDTLRMLFQHLCRVI 
EHGEQNRMSVQSVAIVFGPTLLRPEVEE 
TSMPMTMVFQNRWELIIiQQCADI FPPH 


5783 


19684 


A 


5829 


2 


871 


RGICSQRWRREGSQSRGPGLVITS PSGS 
LVTTASSAQTFPISAPMI VSALPPGSQA 
LQWPDLSKKVASTLTEEGGGGGGGGGS 
VAPKPPRGRKKKRMLESGLPEMNDPYVL 
SPEDDDDHQKDGKTYRCRMCSLTFYSKS 
EMQIHSKSHTETKPHKCPHCSKTFANSS 
YLAQHIRIHSGAKPYSCNFCEKSFRQLS 
HLQQHTRIHSKMHTETIKPHKCPHCSKT 
FANTSYLAQHLRIHS GAKP YNCS YCQKA 
FRQLSHLQQHTRIHTGDRPYKCAHPGCE 
KAFTQLSNLQ 


5784 


19685 


A 


5830 


463 


1 


LPESRLPSPHKREEGSRARVIMTSYPIE 
PHERRKGS LAD WATLKQ KKLEEMTRTE 
QEDSSCMEKLLSKDWKEKMERLNTSELL 
GEIKGTPESLAEKERQLSTMITQLISLR 
EQIiLAAHDEQKKIiAASQIEKQRQQMDLA 
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RQQQEQIARHQQLY 


5785 


19686 


A 


5831 


462 


2 


MI LFDSS LYFLCGWYIiSNIj I LGTYGVRK 
PWYFPFTASYWKSVGFLVEKRQYFLSSS 
LFFFNENFDNKGSSLQNREGELEGSAPG 
VTLVSVTKEYEGHKAWQDLSLTFYRDQ 
ITALLGTNGAGKTTI ISMLTCLHPPTSG 
TIIINGKNLQTVY 


5786 


19687 


A 


5832 


396 


2 


VPSTPASKRKGIVPRCDIFIYDRGGALP 
KDIHVAGILFASAWMNVWPLLSAIEFHS 
AWAMGMGVNFFVASTHHVSLNMSGSGRY 
APNGRKVYHYDMKTEMGELLLS EVDSHP 
hS S LAY PTADNWNA YATL Y 


5787 


19688 


A 


5833 


384 


137 


E E I CL>N P G AGGCS E P RLWHCT PAW AT KR 
DSSKKKEKNCLSKNKKEKEKEMFACVCV 
CVLFS I YAFPSKAQELSNPCKHYGNF 


5788 


19689 


A 


5834 


171 


2 


FVEGINISGNFYRNKLKYIAFLRKRMNT 
NPSRGPYHFRAPSRI FWRTVRGMLPHMY 


5789 


19690 


A 


5835 


278 


404 


S VL I KFEKKTLKYKAI WPGAVAHACNP S 
TLGGRSRGITRSGE 


5790 


19691 


A 


5836 


417 


3 


SKGKEGRCEWQYDLPSGTHAVFQFFLKQ 
QGRVPGIiTQAVQ VAKMS LI DLAGS ERAS 
STHAKGERLREGANINRS LliALINVLNA 
LADAK3GRBCTHVPYRDSKLTRLLKDSLGG 
NCRTVMIAAISPSSMTNEDTYSTHV 


5791 


19692 


A 


5837 


409 


230 


AGRAYCYNGMCLTYQEQCQQIjWGPGKAL 
LGSTALPSPPLPSQPPLAbRVVWNLGFK 
WHL 


5792 


19693 


A 


5838 


3 


409 


LRSVPCKDYLTQNHYITSPLSEEEAAFP 
tiAYVMVIHKDFDTFERLFRAIYMPQNVY 
CVHVDEKAPAEYKESVRQLLS CFQNAFI 
ASfCTESWYAGISRLiQADIiNCLKDLVAS 
EWWKYVTNTCGQDFPLTTTRPV 


5793 


19694 


A 


5839 


385 


174 


GLAVEIGSRRIAEDGLELLASSDPPTSA 
SQSAGSTGVSHHAWPDTVSSVYWCFLCC 
TKQWVGALSIVSL 


5794 


19695 


A 


5840 


260 


2 


PLPRYTAACSRLL VQ YKflAFRQVQGSE I 
SSlDEFCaiKFRLDCPLAMERIKEDRPIT 
IKDDKGNI^CIADWSFYITVMDKLRQ 
CI 


5795 


19696 


A 


5842 


299 


3 


FSGIKYI CI IFLFVCFFLYIiIiGIGS PYV 
AQAGLTLLGS SDPPPSASQSPGI IGVSH 
YVQPIHTVAQLSPSS IFRTFSS FQTETV 
PIKHTSHSPSPSLY 


5796 


19697 


A 


5843 


391 


2 


APHAP AFLRARGEPQDP LSHPRVP AVSA 
NCRMWKHLPVHSSPTPRLTPLWKLQARW 
LLPQIiVYLQGWGSYSLLRPAALISMVLL 
AREFLYPAKMSVSEVCSSGLSSPLLEQH 
KTNLIFYASGDICSANV 


5797 


19698 


A 


5844 


479 


59 


FVGME PGVGHASLAMHGLHIRSLGK I KL 
RKRKCTLF FNTQE KS ARRRGHLLGEN I Y 
LLLFAlAIiRILNCLLVQTSFVPDEYVJQS 
LEVSHHMVFNYGYLPWEWTERLRSyAYP 
LICAS IYKVLHLLGKDSVQMLVSGNKSS 


5798 


19699 


A 


5845 


406 


548 


S VUTK-fb rcli ESHP VTQAGVQWHNIiCSij 
QPLPCRSKQFLCLRLPSSW 


5799 


19700 


A 


5846 


2 


394 


TLCHRAKDPTVHHESGRIMYLNGLCFIjM 
GPAQLTQRLSVSRQGEC 


5800 


19701 


A 5847 


472 


201 


LS ILSDTSLCLGRFFTYENGCAYFHEEE 
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REGLARICRLSIHSRYBDFWDGFNVLY 
NKKPVIYRSAPARPGIX3QSLCNQVMWYQ 
VAILNK 


5801 


19702 


A 


5848 


127 


3 


S SNVKAAWGKVGAHAGB YGAEALERMFL 
SFPTTKTYFPRCI 


5802 


29703 


A 


5849 


238 


1 j 


MABTHPNPTGLPIiAPALPCLAGSATHPC 
LFSPEISLDVDADRDGWEKNNPKKVPS 
FQGRHHPEPRASEDTAPDPAGMY 


5803 


19704 


A 


5850 


423 


2 


LSPLGNWSSANQKWFDSVKPSRKDLFA 
VDTQVGTVPSLTAGEQGFGDGGKGRAGE 
QATQAAGD WS SNMWGSDG I LSHSAPLTV 
PQPLLTGPQGPCLCRPLPNQQTSFPALL 
PFSTQLCSMCPGPASARPPPLIiLKPTMY 


5804 


19705 


A 


5851 


326 


682 


PSYSLLSLLQVKNEVEKLPRQQRKESMK 
QKMEEHTQKKQLLVSPWPQLGSAGAGAR 
PGMVTRPPLCLCP I PAPQI PRS BYLNLA 
SSFMAPLSPPPCSLHRLSCLHLEVRNS I 
SGYPNCI 


5805 


19706 


A 


5852 


263 


30 


HEKTDDERGPGQSARSGAITKPPGPPLP 
IEPHETTPEHPAPSGTIPEPPLPVELHE 
TTPQHPVPSGTIPEPPYLLSQ 


5806 


19707 


A 


5853 


389 


2 


GLPTQREKFGASMRTRMTIQS IAWTSF 
VCLVGEGNNVQGFRAESRCWRYDPRHNR 
WFQIQSLQQEHADLSVCWGRYIYAVAG 
RGYHNDLNAVERYD PATNS WAYVAPLKR 
RCMPTKAKRWRGRCISP 


5807 


19708 


A 


5854 


379 


2 


GRSLRYSGSCSGEENSTNNSAGQSRAVI 
AAAARRRGNSHNEYYYEEAEHERRVRKR 
RARLWAVEEAFTHIKRLQEEEQKNPRE 
VMDPREAAQAIFASMARAMQKYLRAAKQ 
QNYNTMESILQHC 


5808 


19709 


A 


5855 


374 


2 


SDAGAP VN I YE FRHRPQCLEDTKP AFVK 
ADHADEVRFVFGGAFLKGDIVMFEGATE 
EEKLLSRKMMKYWATFARAGNPNGNDLS 
LWP AYNLTEQ YLQLDLNMS LGQRLKB PR 
VEIWNSTTPPCI 


5809 


19710 


A 


5856 


516 


18 


PYECKECGKAFNCGSSLVQHERIHTGEK 
PYECKECGKAFSRGCHRTQHQKIHRGET 
PIKCKECGKAFSWGSSLVKHERVHTNEK 
SYECKDCGKAFGSGYQLSVHQRFHTGEK 
LYQRKEFGKTFTHGS KLVHERTHSNDKP 
YKNKECX5EAFLWTTYSNEKC IASAK 


5810 


19711 


A 


5857 


392 


3 


CERGMGGS PRALGRHWTS FLKLRLNCS V 
PGDSTFYFDVLQALTGPVNLHGRSALFG 
VFTTQTNS I PGS AVCAFYLDE I ERG PEG 
KFKEQRSIiDGAWTPVSEDRVPSPRPGSC 
AGVGGAAKFTSSRDIiLY 


5811 


19712 


A 


5858 


391 


3 


ARRTTGMVPKAGGGKGRRGAVFRSYIRE 
IEBLRSKLVESEAMNESLRRSLSRASDR 
SPYSLGASPAAPAFGGSPAS SMEDASEV 
IRRAKQDLERLKKKEVRQRRKS PEKEAF 
KKRAKLQQENSEETNGE 


5812 


19713 


A 


5859 


409 


3 


SLPGEGDPWWAGQEELIjFVQEGKLSLPV 
AERVLLRIACRYDPRSNSWAEIAPMKNC 
REHFVLGAMEEYLYAVGGRNELRQVLPT 
VERYCPKKNKWTFVQSFDRSLSCHAGYV 
ADGLLWISGGVTNTAQYQNRQCI 


5813 


19714 


A 


5860 


405 


135 


NLS RLS LCRQ PMTLVTWVLLF YD S EEKA 
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nucleotide insertion 














FWLLVALWKRKL PD YYNTRWGE CPRVL 
PRLGLQVPAGSMRVAALPRPSSRLHVGS 
LGVRG 


5814 


19715 


A 


5861 


389 


2 


YYSELEEQLTDEFHAEliNRVPLKRLDLI 
FVTFQDSRMAKRVRKDYKYVQCGVQPQQ 
SS VTTIVKSYYWRVTMAPHPKDI IWKHL 
S VRRFFWWARF I AINTFLFFLFFFLTTP 
AIIMNTIDMYNVTRPCI 


5815 


19716 


A 


5862 


525 


0 


SHLPESERIHHTVGQKEQVLMDTSKTRP 
NNDVPEPPMPIADQVSNDDRPEGSVEDE 
EKKESSLPKSFKRKI SWSTKGVPAGNS 
DTEGGQPGRKRRWGASTATTQKKPS I S I 
TTESLKSLI PDI KPLAGQEAWDLHADD 
SRISEDETESKGDDGTHDKGLTIC 


5816 


I97I7 


A 


5863 


399 


2 


ATRRNRNRVPSGMTRTNVREMIAAVGPG 
PSPYPLPPPPEGTSS IEYSNQGNTCQGH 
GNFDFPHGNPGGTSMNDFMHGPPQLSHP 
PDMPNNMAAI^KPISHPMQETMPHAGSS 
DQPHP SMQQDKARNT PQPMY 


5817 


19718 


A 


5864 


3 


714 


RRPFIALCLSNVAFMLPWQFAQFILFTQ 
IASLFPMYWGYIEPSKFQKIIYMNMIS 
VTLSFILMFGNSMYLSSYYSSSLLMTWA 
I ILKRNE IQKLGVSKLNCWLIQGSAWWC 
GTIILKFLTSKILGVSDHICLSDLIAAG 
ILRYTDFDTLKYTCS PEFDFMEKATLL I 
YTKTLIiLPWMVITCFIFKKTVGDISRV 
IATOVYLRKQLIiEHSELAFHTLQLLAFT 
ALAILILRLKLVL 


5818 


19719 


A 


5865 


423 


1 


APPVSTAVAQSNSSEEEAREVGSPAQEF 
KYQKSLPPRFC3RQQQQQQQEQLYKMQHW 
QPVYPPPSHPQRTFYPHHPQMLGFDPRW 
MMMPSYMDPRITPTRTPVDFYPSALHPS 
GLMKPMMPQESLNGTGCRSEDQNCVPPC 
I 


5819 


19720 


A 


5S66 


497 


2 


AVGAGQKGGRGGGGRELMPPQKIWGGGA 
FTNNAHVGPLKIMPRLIRTGMLEEIHLE 
PGWQGLFPCVDELSDIHTRFLSQLLER 
RRQALCPGSTRNFVTHRLGDLLISQFSG 
HSAEQMCKTYSEFCSRHSKALKLYKELY 
ARDKRFQQFIRKVYCGRCRGSQQGR 


5820 


19721 


A 


5867 


382 


74 


LALSPRLKCSGQI IAHCSFDLLGSSDPP 
PSASRVAGTTGARRHARLPYSLRHYHFV 
LRLFNPSKHSKLKHFHPILIRNPGKVGS 
YSFQMSDLLQSRRARQQS 


5821 


19722 


A 


5869 


378 


3 


SFSRSANLISHQRIHTGEKPFQCAECGK 
SFSRSPNLIAHQRSHTGEKPYSCPECX3K 
SFGNRSSLNTHQGIRSGEKPYECKECGE 
SFSYKSNLIRHQRILTGEKPYKCTDCGQ 
RFSQSSALITPV 


5822 


19723 


A 


5870 


375 


3 


IHVIVFNQATGHVMAKRVFDTYSPHEDE 
AMVLFLNMVAPGRVL I CTVKDEGS FHLK 
DTAKALLRSLGSQAGPALGWRDTWAFVG 
RKGEAECHWADTELNRRRRRFCS KVEGY 
GSVCSCKDPTV 


5823 


19724 


A 


5871 


373 


1 


QPEEVSGALSPPSASAYVKLVLLGLIMC 
VSLAGNAILSIJ^VLKERALHKAPYYFLL 
DLCLADGIRSAVCFPFVLASVRHGSSWT 
FSALSCKIVAFMAVLFCFHAAFMLFCIS 
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nucleotide deletion, ^possible 
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VTRNMAIAHHLY 


5824 


19725 


A 


5872 


382 


2 


KSFSQKSQLI IHLRTHTGERPFECPE CG 
KAFREKSTVIIHYRTHTGEKPYECNECG 
KAFTQKSNLIVHQKTHTGEKTYECZAKCX3 
ESFIQKLDLIIHHSTHTGKKPHECNECK 
KTFSDKSTLIIPHV 


5825 


19726 


A 


5873 


370 


3 


MGRVRAQNISGVMSGPQKMLMQSQFPTQ - " 
GQQGFCEGKEP YQAMSQNMGNTQDMFS P 
DQSSMTMSNVGATRLSHMPLiP PASNP PG 
TVHSAPNRGLGRRRSDLTISINQMGSPG 
IGHLNSTTCI 


5826 


19727 


A 


5874 


362 


2 


GGKFIiVIiGNLPSKLEESMVQYYRLVTAA 
SLVRGQISEYNISLRASDGGSPPLSTET 
HITLHVIGINDNPPTFPHLSYSACIPEN 
NPRGAS I FSVTAQDPDSNNNAR ITYALT 
EDTLQGVY 


5827 


19728 


A 


5875 


369 


1 


RI RPRPTARLASARTLHEVS LQES IRYA 
PGDAVEKWLNDLLCLDCLNIARIVSVCP 
IiPEACDLSYVNRDTLFWCHKASEVFLQR 
IjMALYVASRFKNSPNDIiQMLSDAPAHRL 
LNKCIiLCPPPV 


5828 


19729 


A 


5876 


119 


39 


VIQDYTTPPNEEIjSRDLVNKLKPYMR 


5829 


19730 


A 


5877 


461 


2 


RRGWTSRRKPKEDPSGAAVPEMPKKSSK 
IASFIPKGGKLNSAKKEPMAPSHSGIPK 
PGMKSMPGKSPSAPAPSKEGERSRSGKL 
SSGLPQQKPQltDGRHSSSSSSLASSEGK 
GPGGTTLNHSISSQTVSGSVGTTQTTGS 
NANSVQLPQPLCI 


5830 


19731 


A 


5878 


503 


3 


PSPQVPGECPSPKKLGARAAFTTPDPAP 
LSPQSRVASSGSEQTEEQGSSRNSFQED 
GSGMKDVPSWLKSIxRLHKYAALFSQMSY 
EEMMTLTEQHLESQNVTKGARHKI ALS I 
QKLRERQSVLKSLEKDVLEGGlJLRNAIiQ 
ELQQI I IT P IKAYS VLQATVAAATLY 


5831 


19732 


A 


5879 


421 


34 


LVHKVLSASNVLVDAEGTVKltoYSlSK 
RLADI CKEDVFEQTRVRFSVNAMHYRTG 
NKGDVWRLGLLMLSLSQGQBCAEYPVAI 
PSGLPGGFQDFLKKCVCLDDKKSMESPA 
WETKQNKS PANVLRPL 


5832 


19733 


A 


5880 


379 


3 


AQGIiLIiAWPNLNRQHCPCSCSHNIiSTF I 
QNWKS PGLSLKTRAPGQGS VLSSSDLPR 
AVLTGTFAVMSVMVGSVTESUVPQALNG 
SMINETARDAARVQVASTLSVLVGLFQV 
WNSQEYVPPTTCI 


5833 


19734 


A 


5881 


429 


1 


SLVSYMDTESTAEKLGKELGLQAKELSA 
VHSSHHE XGVNDSNLFSLEMREPLESSN 
TKSFHSAVEIKNRSQHELPCFQHHGIDT 
PTSLQKRSACCPPSLLRLEETASSSQDG 
DPQIWDDLPFSESLNKFLTVLESEIAVT 
QCI 


5834 


19735 


A 


5882 


436 


3 


WSLVSYMDKKSTAEKLGKELGLQAKELS 
AVHSSHHEIGVNDSNLFSLEMREPLESS 
NAKSFHSAVEIKNRSQHELPCFQHHGID 
TPTSLQKRSACCPPSLLRTiEETASSSQD 
GDPQMWDDLPFSESLIKFKTVLESEIAV 
TQCI 


5835 


19736 


A 


5883 


195 


353 


DS YS YVRS TAPAVAYDS KQYYQQPTATA 
AAVAAAAQPQPSVAETYYQTGGFS 
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X=Un known, *=Stop codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


5836 


19737 


A 


5884 


432 


I 


LSAKVPVASVSDQAAAMHLSQCPKNLAT 
SLAELRTASQKAHKLOGPMEIDSALNTV 
QTLKNEIjQDAKMAAVESQIiKPLPGBTIiE 
KCAQDLGSTSKAVGSSMAQLLTCAAQGN 
EHYTGVAARKTAQAIiKTLAQAARGVAAS 
TTDP 


5837 


19738 


A 


5885 


313 


1 


GKSPHLLIIiQTSPIiSKAPQPLIIiLLLYW 
SWVGWS VTAASRETEAGI QVS SEE PGP 
RNVSPHSRLQDETNLRLEAENNLAAYRQ 
VRBVQGRDREGRKGLMATLY 


5838 




A 


5886 


432 


3 


GSGFCSPIIjCLWWVEEAVGI IRLHIjWLR 
ELGCWHSFSLPGTYSLEFLSTCTQVLPy 
SWSGLLFASGNYLSLYLVPSRAETDSRG 
SQPLPVARHGVGACKCKPRQTRPGLS PQ 
VCVEKLMPLSSFCSAFQQNTYNKQPMSR 
PV 


5839 


19740 


A 


5887 


417 


2 


ASLMVAPDYAE I SPLAMPAPSSGWCTP 
IMSTSSSEAMSTPLMLAPDSGELSPILM 
QDMNPGVMSTQPVPAPSFEAMSPLQITD 
EDTEAMS KVLMTALASGB I SSLLMSGTD 
SEAIS SLIMS AVASGGTSPQPTSTLY 


5840 


19741 


A 


5888 


420 


3 


KFDLIKKLDLKTMSSYDIiDIEASDGGGL 
SGKCSVSVKVLDVNDNFPELS ISSLTSP 
I P ENS PETEVALFRIRDRDSGENGKM I C 
S IQDDVLFKLKPSVENFYRLVTEGALDR 
ETRAEYNITITITDMGTPRLKTEQSV 


5841 


19742 


A 


5889 


321 


414 


FNMRIWPGAGAHACNPSTLGGRGGRITR 
SAD 


5842 


19743 


A 


5890 


415 


1 


PGLPGTSVKGI PASKQS PHESPRTLHLK 
TSPI IQQLGLYLSHTAIRYHPQETLKEF 
VQLVCPDAGQQAGQVGFLNPNGSSQDKV 
HNPFLPTPMLPPPPPPPMARPVPLPVPD 
TKPPTTSTEGGADYPTSPTYSTPSLY 


5843 


19744 


A 


5891 


193 


3 


KPSSKVWSRRDPFLLSSVEQPIKDAVIT 
VPVFFNQAERRAVLQAARMAGLK^QLI 
NDNTACI 


5844 


19745 


A 


5892 


412 


2 


KHQVSDNKDSFYVSLYPDFSSLSRAILD 
WQFFKWKTVTVVYDDSAGLIRLQELIK 
APSRYNLRIiKIRQLPVDTKDAKPLLKEM 
KRGKEFHVIFDCSHEMAAGILKQALAMG 
MMTE YYHYI FTTLDLFMiD VE PS V 


5845 


19746 


A 


5893 


413 


3 


ELLLCSNTSCCRCFCVECLEVLVGTGTA 
AEAKLQEPWSCYMCLPQRCHGVLRRRKD 
WNVRLQAFFTSDTGLEYEAPKLYPAI PA 
ARRRP I R VLSLFDG IATG YLVLKELG I K 
VGKYVASEVCEES IAVGTVKHEWR 


5846 


19747 


A 


5894 • 


424 


3 


CSGRREPSVRGVQCKGDQGHHSARMAPS 
EAPGTRSCTPSHGQNTAAEATPAQKTPA 
KVVYVFSTEMANKAAKAVLKGQVETMVS 
FHIQNISSSKTERSTAPLNTQISAIiRND 
PKPLPQQPPAPASQDQNYSQNTRLQPCI 


5847 


19748 


A 


5895 


400 


1 


ASVQNPALRLVTREEFAIMQTPAGELYD 
KSI IQSAQQDS IKKANMKRENKAYSFKE 
QI IELELKEVSTRRHLRKVAAELGGLKL 
VAFPTAHMLFTWNI IjRPMAHS S DAPDQC 
LVHWHTFPSPLLHS FFHPQLY 


584$ 


19749 


A 


5896 


421 


3 


VSSIQMGAVNLIiGGGLDSLLGSDLGGGI 
AGSPAVGQSFIPSSVQATCAPSPTPAW 
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E=GIutaraic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, P^Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














S SGLNDLS ELSTW IGMAHGG YVAPKAVW 
LPAVKAKGLESSGTFTRRQGHIYMAMDF 
TNKALQHMTDSAIQFTKNSFGVIPSCI 


5849 


19750 


A 


5897 


384 


1 


PRASRPIHEAGIVPLPMRTHTPKKPPTC" 
EVAGVQPSRNVKVKIHMRKHTGERPYSR 
QSCSVRFVHSYDLKNHMHLHTGDRPYEC 
HLCHKAFAKEDHLQRHLKGQ I CLEVRT R 
RRRKDDAP PH YPPPCI 


5850 


19751 


A 


5898 


429 


2 


QDIAAWQSLTQVLTPESWRKANIMTBPQ 
KSQERYKGIYVKEKLYRRARHDESLNWT 
SCDHHESQECKGEDPGRHPNCGKNLGMK 
STVEQHHVVHVLPQPFTCNNCGVAFADD 
TDPRAHPSTHLGEKSYKCDQYGKILSQS 
LY 


5851 


19752 


A 


5899 


424 


1 


GmLQALAAHDAGSRAHVLLSLSQQDGI 
EQHMD FDS RYTLLELFAETTS S EEHCMA 
FEGIHLPQIPGKLLFSLVKRYLCVTSLL 
DQLNSSPELGAGDQSSPCATREKSRGQR 
ELEFSMAVGNLISELVRSMGWARNLSEH 
V 


5852 


19753 


A 


5900 


135 


459 


YEMLKAEHVISYVCSENSLMVVVNMRFC 
LIGLIHIQEFYFFEMEPHRAIQAGVQWC 
DLCSLQPPPPGLKRCSCIPPSSSRDYRC 
APPYP 


5853 


19754 


A 


5901 


418 


10 


GRSPKPGDRLWBSVHLSFPGPADSLSGG 
SAPASSYEPSQRSSFS SNRSQRGSTSTR 
NSSQKGSS VCjS I KQKGKRELYMEKLQBH 
LIKAKAFTIKKKFSQNLVAQLWYFVKCV 
YFGLSAYQIPRGYPTRVLGNFLTK 


5854 


19755 


A 


5902 


402 


2 


GYRHPELEIKSVDGFQGPBKQPVTLSLV 
RSNRKGEVGFPVEDRRINVDVTRARRLV 
TVTCDYRTVSSHAFLKPLVBYFTQHGEV 
RTAFEYLDDIVPDNYSHEDSQGSSQAAT 
RPQGPGTSTRTKKQRQEGGMY 


5855 


19756 


A 


5903 


346 


1 


GICPLSHIKMTDARCQIYEMKGGEKSPK 
DTGKE PGHSEAKTGPPQVLAGVPAQPEA 
PQPGPNTTAAPVDSGPKAGLAPETTETP 
AGASETAQATDLSLS PGGES KANCS PED 
PMY 


5856 


19757 


A 


5904 


139 


212 


EVENNDDISHHHHHHHHHHSNSKS 


5857 


19758 


A 


5905 


2 


345 


PCGCWGRCALLLISAAAKAKSKCGPTFL 
PCASGIHCI IGRFRSKGFEDCPDGSDEB 
NCTANPMLCSTARYHCKNGLCIDKSFIC 
DGQNNCQDNSDEESCESSQGVGYVCGFK 
KA 


5858 


19759 


A 


5906 


432 


17 


PQTTPHRTFGGGKAAWLLAVGGQFLLC 
WS PYFS FHLYVALS AQPISTGQVES WT 
WIGYFCFTSNPFFYGCI^RQIRGELSKQ 
FVCFFKPAPEEEIiRLPSREGS IEENFLQ 
FLQGTGCP YKSWVSRPLPC I AAALRV 


5859 


19760 


A 


5907 


407 


1 


P YACGECGKS YRQS S S LVSHRR IHSGVR 
PHHCDECGEFFSRKYDLLIHQRVHSGER 
PYKCSECGKS FSHSCSLIAHQRIHTGMR 
PYECSECGISFIHSCSLITQQRVHLGTR 
PYMCSECGKSFSQSCRLIKHRSV 


5860 


19761 


A 


5908 


419 


3 


VLAKKTI ITKSARDCHEFGNIIiHLSTNL 
VASIQRPDKHESFGNNMVDNLDLFSRSS 
AENKYDNGCAKLF FHTEYEKTNPGMKPY 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














G YKECGKGLRRKRGIiSLHQRI KNGEKPF 
ECTACRKTFSKKSHLIVHWRTHTGVY 


5861 


19762 


A 


5909 


441 


57 


KVMGIFKSSLFQALVDIQEFYEVTIiINS 
QKSCEQKIEEANQVLQKWBKTSLIiAPCH 
DRIiQKSSELQSLIGQHSSLETTDGPQPW 
LFPRGSEGHRGKTYDHCHS IGLQIAVDQ 
RKMLHVLSKIKKIRV 


5862 


19763 


A 


5910 


3 


317 


CLUVILWRRWGSRGSRQRRLQATPSWEST 
KTADGQGDRRESGSQRHCSSPHPQSKST 
TTSSVPAAREGAQEAAQGGPRKGQTSRP 
QVSTRQCAMPPARHRKLENDA 


5863 


19764 


A 


5911 


183 


1 


QNSDSKDSLNRVASRDHAKPNLTCHVSP 
AIQGTGS ISESS I PSVSDTSTPRRSRRQ 
LPPCI 


5864 


19765 


A 


5912 


395 


1 


KriQFNICVKVPHRFSNSNKDKIRYAGD 
KTFKCKECGKSFHVLSRLTQHKRIHTGE 
NPYTCEECGKAFNWSSILTKHKRIHARE 
KFYKCEECGKGFTRS5HLTKHKRIHTGE 
KPYICEKCGKAFNQSSTPV 


5865 


19766 


A 


5913 


449 


2 


FGSHLEKEDEKKQELVDKAI KPS I EATL 
ESIQRKLQYKRAESSRPEDIKDMTKAQI 
ANEKVALQKAIiLYYESIHGRPVTKNERQ 
VMKPLYDR YRLVKQ ILSRANTI P I IEEE 
EGSEADSWKPDFMVNKKTDFSARMVKR 
PLGGSARPD 


5866 


19767 


A 


5914 


464 


23 


SAHMTETRSKSFDYGSEiSIjTGPSAPAPV 
APPARVAPPERRKCFLVRQASLSRPPET 
ELEVAPKGRQESEEPQPSSSKPSAKSSL 
SQISSAATSHGGPPGGKGPGQDRPPLGP 
TVPYTEALPVFHHPVAQTHKHEKPYLPP 
LYCGRCG 


5867 


19768 


A 


5915 


366 


1 


LVSFINFFTSVLATLWFAVLGFKANIM 
l^KCVVENAEKILGYLNTNVLSRDLIPP 
HVNFSHLTTKDYMEMYNVIMTVKEDQFS 
ALGLDPdiLEDELDKSVQGTGLAFIAFN 
EAMTHSPACI 


5868 


J9769 


A 


5916 


246 


366 


TSVKEQQMPGAVAHACNPSTLEGQGGRI 
TRSGVRDQPCQH 


5869 


19770 


A 


5917 


400 


3 


NDNAPEFYQSVYKVTVLENAFNGTLVIK 
LNATDPDDGTNGD I VYSFRRP VWPAWY 
AFTINPNNGEIRTKGKLDFEEKKLYEIS 
VEAVDKGNI PMAGHCTLLVEVLDVNDNA 
PEVTITSIiSLPIREDTQPCI 


5870 


19771 


A 


5918 


410 


200 


CTPPQPVKCRFLNRDRVSSCCPGWSQIP 
GIRRSSYLSIiPECWDLRPVILLFCFLSH 
ISSKQPYFLPPSYR 


5871 


19772 


A 


5919 


214 


407 


MYFYIDRASLLSPRLECIWTIAHCSLE 
LLCSRDLPASASQSAGITGIRHHTWLKT 
HFYSSFKT 


5872 


19773 


A 


5920 


476 


39 


VPYGLFVGSTGRLGLPYLRGTSHPLRGC 
LHAAALNGRRLVQPI/TRNKHEGCAEEFS 
ANDDVALGFCGSHSLAALPAWGTQDEGT 
LEFTLTTQSWQAPLAFQAAGWHGDFIHV 
D I FEGHL WS MVE KGQGTVLLLNS VPVTD 
AQPHM 


5873 


19774 


A 


5921 


430 


2 


SLRPSSTGPSPSGGLSEEPAAKDIiDNRM 
PGLVGQEVGSGEGPRTSS PLFNKAVFLR 
PSSSTMILFETTKSGPALGKAVSEGAEE 
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C=Cysteine, D=Aspartic Acid, 
£=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
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^ ■«^»»" uiiuiiivj a% rxi cliff iiv, *J Uvi life, 

^Threonine, V=Valine, 
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X— Un known *=Stnn mrlnn A=nn«ihlA 
nucleotide deletion, \=possible 
nucleotide insertion 














AKLGVSGSRPEVAAKPALPTQKPAGTLP 
RSAPLSQOTKPPVPQEEAGQDHPPSKAS 
RC 


5874 


19775 


A 


5923 


411 


2 


^LKVDVEALENSAGATYIRKKGGKVTG 
DSQ P KEQGQG VLKKKKKKKKGMV P KNYD 
PKVTPDPERWLPMRERSYYRGRKKGKKK 
DQIGKGTQGATAGASSELDASKTVSSPP 
TSPRPGS AATVSAKTSNI T P PPT.Y 


5875 


19776 


A 


5924 


402 


2 


LQSGAIMNKFYQPHEAHI PYLLQIiFIDY 
NLYGMNL INLAAVKFRKARRKSNTLHAT 

ILEGVEPQSTCKLEVDAVAADILNRLD I 
EAQIGGMPGLQAIWEDEKQRR 


5876 


19777 


A 


5925 


215 


403 


AETSTYFFPPLERQFPFAPQLGGHGRNIi 
PSPTFFC 


5877 


19778 


A 


5926 


320 


1 


PGPvPJDRWIWGQGECLEYLLASMTQWCER 
EISS IAPGELCCLLLSFLPQE5CQNYVR 
VL IYAGRKVFMCGTNAFS PMCTSRQVGN 

T J5PT*PP.VT M/^VAPr*DVnDPTTPX7 


5878 


19779 


A 


5927 


400 


1 


WATDGGLPLLAS s atvs valqdvndne 
PQFQRTFYNASIiPEGTQPGTCFLQVTAT 
DADSGPFGLIiSYSLGAGLGSSGSPPFRI 

VDGGGLKSMVYVKVFLSDEMY 


5879 


19780 


A 


5928 


398 


62 


HNTEVLVGESVTLECSATGHPPPRISWT 
RGDRTPLPVDPRVNITPSGGLYIQNWQ 
GDSGEYACSATNNI DSVHATAFI IVQGG 

YPQ 9 VPPPf2TiflPTT/PPPT.fcP 2V 2i TPTHau 


5880 


19781 


A 


5929 


403 


1 


LASDGGDPVLSGTSRXCVKVLDANDNAP 
VFTQPEYRIS I PENTLVGTR I LTVTATD 
ADEGYYAQWYFLEKSPGETSJEVFEIiKS 
TSGELTI IKDLDYEDATFHEIDIEAQDG 

PGUjTPJMCVT VTV7 .TWNTYN JVTJV 


5881 


19782 


A 


5930 


408 


3 


VASIXMDPVI^GTSPJCCVKVIjEIANDNAP 
VFTQPEYRIS IPENTLVGSR ILTVTATD 
ADEGYYAQWYFLEKSPGETSEVFELKS 
S SGELTI I KDLDYEDATFHE IDT EADDG 
PGLLTPJUO^rVTVLDVNDNANV 


5882 


19783 


A 


5931 


126 


391 


PVLTCAHFHFSAGAGGSYFMISRSLGPE 
FGGAEGIiCLYLATTFATAMYILGAIQIL 
LVSQAEPWSTLAMI KQFLWTPNRRLMVL 
I PTC 


5883 


19784 


A 


5932 


436 


359 


NKEWLPGSKVPEKS INDVKNTSGLL 


5884 


19785 


A 


5933 


365 


3 


LIIGMLTAIIGDIiASHFGCTIGLKDSVT 
AWFVAFGTSVPDTFASKAAALQDVYAD 
ASIG1WTGSNAVNVFLGIGLAWSVAAIY 
WALQGQE FHVS AGTLAFSVTLFT I FAFV 
CISVLLYR 


5885 


19786 


A 


5934 


394 


3 


FQYFSRTDRVLKHERMCHENHDKKLNRC 
AlKGGLLTSEEDSVFSTSPKDNSIiPKKK 
RQITBKP^SGMDKESALGKCDIiKKVKND 
YSPLYSSSTKVKDEYMVAEYAVEMPHSS 
VGGSHLE YASGE IHPPCI 


5886 


19787 


A 


5935 


393 


1 


REDLIAGIDEFLDEVIVLPPGEWDPNIR 
IEPPKKVPSADKRJCSVFSLAELGQMNGS 
VGGGGGAPGGGNGGGGGGGSGGGAGSGG 
AGGTS SGDDGEMPAMHEIGEEL I WTGRF 
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residue of 
peptide 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny (alanine, 
G=GIycine, H-Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methtonine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R— Arginine, S == Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possibte 
nucleotide insertion 














PGGLCLDIKRKLPWFPSCI 


5887 


19788 


A 


5936 


410 


126 


PICTYKKAGRARWLTPVIPSSWDYRHQA 
QLIFVFLVETGFHRVAQAGPELMTSGDP 
LAYNFLCSYPNEVFRSQASASDLQFVLR 
NLBPPDAGNV 


JOOO 


1 0*700 




JlO / 




1 
1 


DATFHEI DI EAODGPGLLTRAKVIVTVL 
DVNDNAPEFYMTSATSSVSEDSLPGTI I 
GLFNVHDRDSGQNAFTTCSLPEDLPFKL 
EKSVDNYYRLVTTRALDREQFSFYNITL 
TAKDGGNPSLSTDAHILLQVADINDNAP 
CI 


3ooV 










5 


ADPTKGLLRNGSVCVRAPGGVSOGWSVW 
LKNSKQACLPSGGIPSLNNGTFSPPKQW 
SKESKAEQAESKRVPLPEGCPSGAASDL 
OSKHLPKTAKPASOEHVRCSAIGTGESP 
KESAQLSGAS P KES PSRGPAPPQENKW 
SPW 


5890 


19791 


A 


5939 


3 


191 


YS VCLWSQLIiRRLRQEDHLS PGGGGCTE 
PVVNTERLSKNKKTKKREREKQOGFHSM 


5891 


19792 


A 


5940 


399 


1 


QYSDDNDMSWKVTRAAAKCIiDAWSTRH 
EMIiPEFYKTVSPALIYRFKEREENVKAD 
VCHAHHSLFKQTSPVQSWLCGIiDAMDQG 
GTPLAMIiQSQVANIVKAVHKQMKEKSVK 
TRQCCFTLLTELVNVLLGAVY 


5892 


19793 


A 


5941 


411 


3 


SPCEGPRRFQCRSGKRVDGGKVCDVQRD 
CRDWSDELLKVWCGACLRPIiAGLSLLPS 
PSWYLGSRPSSAPCPDTFCSGPLPGFMC 
RPMASHGAFRPQASGLHLYKVLRACPSQ 
VLKNYVFSHKLGLSSFLPRSDHV 


5893 


19794 


A 


5943 


431 


26 


KAWGIPGDMGPPGITVRPGYNGLPGNP 
GVQGQRGEPGVGLPGLKGLPGLPGIPGS 
PEEKGS IGVPGVPGEHGAIGPPGHQGIR 
GEPGPPGLPGSVGSPGVPGIGPPGARGP 
PVGQGPPGLSGPLVIKGEVSRVR 


5894 


19795 


A 


5944 


396 


3 


PLPVELIRVPAFLDLFMQSLFKPGARIS 
QDLKHKYIHILAYAASVVETCKKNKRVS 
INKDELKSTSKAVETVHNLCCNENKGAS 
ELVAELSTLYQCI RFP WAMGVLKWVDW 
TVSEPRYFQLQTDHTPTV 


5895 


iy /yu 




jy*rj 






ERVTLAD ITWCTLiLWL YKQVLE PS FRQ 
AFPNTNRWFLTC INQPQFRAVLGEVKLC 
EKMAQFDAKKFAETQPKKDTPRKEKGSR 
EEKQKPQAERKEEKKAAAPAPEEEMDEC 
EQAMAAEPKAKDPFAHMNKSTFVLDDCI 
AAAL 


5896 


19797 


A 


5946 


414 


2 


ATPPVRCIRKEIRNWYVDIQPVQEPKAQ 
AOGNGHGIIIIAETSTGCLFAGSSLGKR 
GVNADKVAIEAAEMIiIjANLRHGGTVYEY 
KQDQh I VFMALANGVS 1 1 KTGPVTLHTQ 
TAILYDEQIVKVNCKENHSVDEQVY 


5897 


19798 


A 


5947 


411 


1 


EPCWRRIADSSVQTDDEDGESRYLLSR 
RRRARRSADCSVQTDDEDSAEWEQPVRR 
RRSRLPRHSDSGSDSKHDATASSSSAAA 
TVRAMSSVGIQTISDCSVQTEPDQLPRV 
S PAIHITAATDPKVEI VR YISAPCI 


5898 


19799 


A 


5948 


153 


409 


LPFFPILLPFISFFRYRVGLLSPKLKCS 
GVT IAHCS LELLGS SNPPTS ASRVI FLY 
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OCysteine, D=Aspartic Acid, 
E=G!utamic Acid, ^Phenylalanine, 
OGJycine, H=Htstidine, I=Isoleucine, 
K=Lysine, L=Leudne, ^Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W~Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














LSLCLFVFEMESCSVAQAGEQWRDIjGSL 
Q 


5899 


19800 


A 


5949 


21 


447 


RAAAMSSRNFRFLSSHRELYFGVDLPSG 
NLWREPADREQLCRAKAACVLTYDLVL 
EDPLEIiHKIRIHVLDTNDNSPLFPAGDV 
QLHI PEFLTPGARFTLPNAQDDDEGSNG 
ILSYSLSPSQHFRLDMGSRVDGSEYPEL 
VL 


5900 


19801 


A 


5950 


268 


19 


QDNWAYYCTVPBYCKTKLNIIiAWWLVIN 
YLSGDFYFFFI FFLDGVSLCHPGWSAVA 
RSQLTATSSSQVQGRFSCLSHCIAAAL 


5901 


19802 


A 


5951 


389 


3 


QQMDLARQQQEQIARQQQQLLQQQHKIN 
LLQQQIQVQGHMPPLMIPIFPHDQRTLA 
AAAAAQQGFLFPPGITYKPGDNYPVQFI 
PSTMAAAAASGLSPLQLQQLYAAQLASM 
QVS PGAKMPSTPQPMY 


5902 


19803 


A 


5952 


2308 


307 


RRRPARKKTWVRDGGPHQGL/YPQLPSA 
VLQPTQPGHGPRALGKGRELESWQGKRP 
GK/ GQTHRRKTRGTAS PAVYFSVEWGD\ 
SGGCPMTGKGAKQQRAGLGAAPNPELSG 
EQGTGRGRAQDSQGHGGGKGRRGQHPPG 
ND/ PRGVSGMGQES VHSPEARPTGGGTG 
/EMLGRrWRHQQGLVGGTGNLLETGSWS 
GSGSWGRGVVLRPQEGQVTTGMGLAGR/ 
YQPARQRAVELS PGAQGLRQRRGGWSGP 
PQETEQSGVPGG*GSWPLESHQGEQ\PA 
AGQKQSLQMLPGGFPCWS/SGMGGGPQQ 
LLESEGAGPSPGGGRHHGKGRVAVTTTP 
REGD\RGQSPGGHTLQLFPCPLWS *ETQ 
GQGSRTVQHQEWGGEGREGEGAGSDQSR 
ALGIA* I CPHTLRPTSMEPQSTAPAPGQ 
PPGPPSWGHRGHQGMGYGGRCPGQ/IQG 
RGRAQIjGSTVG*RDG/ qrg*mqgrdqqg 
PRSNGAGVGSCSHTSQK*IPPSSLCT*N 
SSHGPASG\QLWWSSPFIHSPGETNIPH 
TLTEPHSVPGWCWDTLRRHGAGQGHPGM 
ARSGTGEGQ/ QRGRY*ERGRRGRRQQKQ 
KKQGLKEPG*RAAPTLKGATRPLCRCLR 
KVQKPKQDGDVGS * LLKVFRAPGAIiGTK 
PQRTCRGPADFFHALSGLSNVHRSYCST 
KSGFGACM 


5903 


19804 


A 


5953 


461 


117 


HPRDI IGPVHQGLPPS PQPPPPCKQAPT 
TPSS\M * P * TNRHDTGLSGP VQVPGADW 
KPLNAP * LPELAFGEEPCRPGVPDPPYV 
WLPDSAPEHCVTWATS PGLSDLLS YPRR 
KQA 


5904 


19805 


A 


5954 


845 


610 


FFETGSY/ SVAQVGAQWCNFGSLQSQPP 
RLKRSCHLSPPSSWDYRGAPPRLARFFF 
FCI FYRNGVS PCCPGWS *TPELQ 


5905 


19806 


A 


5955 


1 


378 


KTPVSDR \ ATKCCS ESLGNRRPCFSALE 
AYETYVPKEFNAETFTIiHADICTLSDKE 
RQINKHTALAELVKHKPKARQEQLKAVM 
DDFADFVEKROVIADDKETCFVEEG * TLV 
AASQAALGLLHRIK • 


5906 


19807 


A 


5956 


723 


343 


GCQEGICPSPCPVIPATREAEAGESLEL 
RRRRLH*AEIVPLHS\KAQRSLCSSDFI 
RILVIFSGMFLVFTLAGALFLHQRRKYR 
SNKGESPVEPAEPCRYSCPREEEGSTIP 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G^Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^^Glutarnine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
XHUnknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IQEDYRKPEPACSP 










jj 




AQHFNPHNWPVT* LLPHFTNEGGQVTEA 
RCLKWQAR\CLVPMQKVPNAMPGT* * PS 
AEVTPPVCNQHHWPQVEGIPSS/APAFQ 
WPLTMGHPC\WESAPRQBASTS PRPGTA 
SHCPSAVAFQERGSSPCPPHHSDSGLCL 
LPRLLPAP 


5908 


19809 


A 


5958 


2 


413 


KKPDQ\AE\REHLCTSL\WSGRNTDKNG 
EELHGGKRVMBRLKKAVKIANQCMDPSL 
QMQLYIEILNRYIYFYEKEYDAGTIQVL 
NQLIQKIREDLPNLESSEEPEQIT*HFH 
NTLBHLRLRRESPESEGP IYEGLILYE 


cnnn 
5909 


j9o1U 




rnrn 


5/1 




WMC5JATFGSHLN*AWRIiRPVNSG/LIiM/ 
PSEGKAKKESTLMVGVTNPD\MKGKIRL 
LLHSDGKDEYWNAGHPD I LG \HPLTLQ 
CPI I KIKKKL+ * PKSGRTADGPDHSRIK 
V*VTPLGKEPTTDEVLAEGKENME*VTE 
TC* IDMN* VLQGQTWNLQR 


5910 


19811 


A 


5960 


390 


271 


GINFFSRN/ RSLTT * SRLV*NS WAQVI V 
Ii PWPS RVLGFQA 


5911 


19812 


A 


5961 


333 


922 


GLSCRVFGRTLSCPPCILRHLSEVETLQ 
TljQKKEIEDLYSRLGKQPPPGIGAPAAM 
IjSSRORRIiSKGS FPTSRRNSLORSEPPG 
PGIMRRNSLSGSSTGSQEQRASKGVTFA 
GDVGRM* IQNRSHVSPPHQGPPWSLCSQ 
NLMLSDQQN*ARKI PTLKG* KA\GGHGE 
CSS 1 I VKSQTYVNCLLCGGVS saayhlh 
L 


5912 


19813 


A 


5962 


705 


387 


CVAQTGVQ*HDLGSLQPLPPGFKQSSHL 
SLPSSWDYRRVTPRLPNF\ * FFVETDFC 
HI AQAVLQIJj\S SNNLPAS AS QS AGVTG 
VSHRARPTLTLDIQAEIPDLKPSG 


5913 


19814 


A 


5963 


425 


182 


ASNQ* DSVGVGPSEPGAGYNLLVRFLKC 
XLEKHSSWVGVTQFSRCHLSPLSLTRKG 
NSLTPCTSRVRQCLALLWLTHGSRTH 


5914 


19815 


A 


5964 


388 


14 


PPMYTQliCSIRSTQAFCFVFFSRDE/SIj 
ALLPRLVLNSWVQAILLPWPPKVQG*QA 
QWLTLVIPALWEAMAGGLLEPRSSRPAW 
ATTQDLISTKENKTKSLSASDAAELCVH 
WRUiKI 


5915 


19816 


A 


5965 


1 


373 


DDQRVKSVINLLLAAYTGDVSALRRFAL 
SAMDMEQRDYDSRTALHVSAAQGHVEW 
KFL\LEACKVNPFPKDRWNNTPMDETLH 
FGHHDVFKI LQB YQVQYTP * GDSDNGKE 
HHTVHKNLDGLS 


5916 


19817 


A 


5966 


3 


329 


HEETSRYE I YEI* IHVHPYAVKQS FDLEE 
YS LSQS TLEQVGHF * GFVSSMVYVYKTF 
PGSCRGSTALFSSLYFLPPFQVFLEIiSK 
EQELGDFE / EDFDPSVKWKLLPQEEP 


5917 


19818 


A 


5967 


2 


47 


MEDRRLVRGIPCPQHNARQCPAVPPGIQ 
AYGAAPFEDIiQVDFTEMSKCRGNKYLLV 
LGRTYSG * VEAYPTRTEKAREVTRVLLR 
DLLPRFELPLRIGSDNRPAFVADLVQKA 
AKILRI TWKLHAAYWPQSSGKVERMNQT 
IKNSLGKVCQETGLKWIQALPMVLFKIR 
CTTSKRAGYSPYEILYHRPPPlLRGIiPG 
TP*ELGEIELQRQLQAIjGKITQTI/YSP 
SKWPVSLFS PVHPLS PGDRVWI KNWNVA 
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C=Cysteine, D=Aspartic Acid, 
&=Clutamic Acid, ^Phenylalanine, 
0=Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, LHLeuctne, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R~Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/ s possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














SLCPLWKGPQTVVLSPPTAVKVE* I PSW 
IHHSHVKP*TA 


5918 


19819 


A 


5968 


2 


150 


KSQSI*QSSLLSYFEKLPQPPQPSA\TS 
TSKQDPPLATRLWLAEGSDDH 


5919 


19820 


A 


5969 


3 


371 


HEGKSGPMTGWIiVAVGBVAMKILLLCL 
CLILLRVRSCRRKAARAALGMEAADAVT 
D*SPDSRLLPD\PPHPVPPQSERPCFLC 
RLCMVCPLLDS PDPSFPCHPTSPQTLRA 
KDSAALDTLHV 


5920 


19821 


A 


5970 


396 


58 


YWVLS KGPVTCMLLNS VFS FQASSCLS F 
LINS*AP*PGIFLGMSVFILPFFVFVGQ 
AGLELLTSGDI*PDSASQ\ARITGVSHCA 
QPHTALSQYLEQCLAPSRQLIS VSL ILR 


5921 


19822 


A 


5971 


442 


120 


ICMKQGLALSPRLDSSSAFSNTVQPMPP 
RFKSSYGLSLP/RSWDYRCTPPHMARSG 
NGV* P \ FMVETGSW* AAQGGLEIiLSSSN 
IjPP /SAFQSAGITSKGHCTRPAVFLVC 


5922 


19823 


A 


5972 


459 


434 


MWPLPWFSISSPSSPPPT*VSSNNHPCA 
VPRAYPGADI PGPGWQRLPPPHRGLSGP 
PESCLLTTILCSKHPNGCPLTQNPRSIiP 
GPRPCLIYSTPSPPP*AQL/YAAPSP*L 
YTWREALIR 


5923 


19824 


A ! 


5973 


681 


1758 

i 


VANKCQIPDIKAKTYCICNTKEKRSYLK 
T*NIiHRGF\ I KKQIEEFNIGKRHLANMM 
GEDPETFTQEDIDRAIAYLFPSGLFEKR 
ARPVMKHPEQIFPRQRAIQWGEDGRPFH 
YLFYTGRQSYYSLMHDVYGMLLNLEKHQ 
SHLQAKSLLPEKTVTRDVIGSRWLIKEE 
LEEMLVEKLSDLDYMQFIRLLEKLLTSQ 
QGAAEEEFVORFRRSVTr.ES KKOTjTEPV 
QYDEQGMAFSKSEGKRKTAKAEAIVYKH 
GSGRIKVNGIDYQLYFPITQDREQLMFP 
FHFVTJRLGKHDVTCTVSGGGRSAQAGAI 
RLAMAKALCS FVTEDEVEWMRQAGLLTT 
DPRVRERKKPGQEGARRKFTWKKR 


5924 


19825 


A 


5974 


1232 


980 


SLSLSPRLiECSGVTAHCSIiRLPASSNSC 
FSASQVAGTIGACHHIWLVFVFIjVETGF 
HHVGQTGLELLT\ * VI CPPWPPKVLGLQ 
A 


5925 


19826 


A 


5975 


1 


259 


LTSYDYRRALPCLVN\FCVYFLKREELA 
LLPKLFWNS WA* ATLLP * PPKI FGLQAQ 
VAMLRLKFPACLPACLPAYLPPFLFSFI 
PP 


5926 


19827 


A 


5976 


24 


223 


PYIiTQISTNESSVCSHARAEFIiDHQNQL 
/WLGTVTYTCNPS/TLGG*GGRIT*GQE 
FKTSLGNTARPCL 


5927 


19828 


A 


5977 


395 


136 


GSWD YTHVPPRSANF/ LVETGFRHVA* A 
GliELLGNPPTSASQSAKITAMSHCTWPS 
SYY*CEYNARFGSLHKRSGMTIYIKSYK 
RQN 


5928 


19829 


A 


5978 


1 


785 


GTRLKSGTNTKKALQAVYSMMSWPDDVP 
PEGWNRTRHVIILMTD/GSEGTSIiLSQP 
PHLLRPACGP*VHL*HYTHGWGPECDS* 
LAVHLSCDPS*GILPKPCDQLSLTLPQL 
AHPAMCI PAFSQF I FLIS YPHGPVSS AG 
LHNMGGDPITVIDEIRDLLYIGKDRKNP 
REJJYLGE* PA*DPAPHFLRAWTLILPFY 
PSDVYVFGVGPLVNQVNINALASKKDNE 



906 



WO 01/64835 



PCT7US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

a min a nctH 
residue of 
nentide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 
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OCysteine, D=Aspartic Acid, 
E==G!utamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=ProIine> 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QHVFKVKDMENLED VFYQM IGRE IQGNK 
EPNSPQVPLK 










o 


419 


KAIAPIDSDSSDGSGQNKLKT*KEFI IL 
NAI KS I YDSWBEVKI STLTGVWKKLI P I 
LIEDFEVFK\ADLVEIARERELEVBGEL 
/LQSQDKT 


5930 


19831 


A 


5981 


1 


154 


KKC*MMLTWWCMPVG/PTPRAEAGGSPE 
TGNLRL*CTMIMSVNSHCTPAWAT 


5931 


19532 


TV 

A 




1/ 


31 1 


R Kf5 TjNFVEGCN I RliNKCRHTN FVPOT I F 
FFLLFIi*LGQC* *VWFSFFFFFFFCFV 
LFFVFVCWFVFFCFCFF/ GCLFWFLL 
/ YFYELFCIVI IMCYYLR* YV* IVYMLF 
FVCWIFVLYLCFHY*LFLFFFVFVFLLF 
IOGVFWFYELFCI VX IMCYYLR 














IGQKRASEDTT * GSAD/PKKSSAGPKRD 
ARQIYNPPSGKYSSNLGNFNYERSLQGK 
YRWPRLGHTRKS 


5933 


19834 


A 


5984 


368 


138 


STVGKMLSGQHPCYRE/ IL/REQKSQSM 
*QISLLSYFKKLPQPFQPSATTALISQQ 
PSASRQNPPPAKRL*LAEGSYDH 


5934 


19835 


A 


5985 


2 


155 


LANF* IYFC/RDKCLPVLPRLVSNSWPQ 
VILPPWPPKVLGLQGMSPCAPRHS 


5935 




A 








FTYYIjFFIiSSIiYLI ITYALFIiSSF*R* V 
VYHWVMVLLKVVICHHFLTLFFSFRSLI 
LYLFFILFM/YLLIFVLFFIQFSFYLFF 
FYLFYYLFFNDYCFDLIFFLCLIYYYHF 
QSYYFIS * FIYYLFFLSSLYLI ITYALF 
LSSFSI 


5936 


19837 


A 


5987 


81 


651 


KLQVAGCRTLSQIDVKFKPSVHSSLAVC 
LRANYFTSQPNVIMQVTLJjTRCVAHSKH 
LGTVSYSSWYI KLLFPKSHFI YLFILRR 
SLAPVAPGVQWLDLGSLQPPPPGPKRFS 
CLS /LPR * LDYRHPPPRLANF\ * FLVEK 
G FCHVGQAGLELLASSDLPTS ASQNAG I 
TGVTPAPS PQKATLNLILGGQFHI 


5937 


19838 


A 


5988 


301 


341 


FCFCPCKNRVSLW*WHAPPCPANF\*LF 
IYLFLVETGFSHVGQASLKLLTLDDLPA 
AASQSAGIPGMSHHDGLFL 


5938 


19839 


A 


5989 


71 


1252 


TKEQRSLRSCX2LGVRQQRSHRLKFGGGG 
APSARGVGGSWSLATRDGFVSTANMSRP 
VRNRKWDYSQFQESDDADEDYGRDSGP 
PTKKI RSS PREAKNKRRSGKNS QEDSED 
SEDKDVKTKKDDSHSAEDSEDEKEDHKN 
VRQQRQAASKAASKQREMLMEDVGSEEE 
QEEEDEAPFQEKDSGSDEDFLMEDDDDS 
DYGSSK/NEKQKDG*BVQT*KKRKBNAQ 
TQTKGYSDAKS SERQRESGS P \QASKAS 
KEKTPS PKEEDEE PES PPEKKTSTSPPP 
EKSGDEGSEDEAPSGED*K**WSGERFY 
*KKKEKKRKKREEKKEPT*DRTWFWLWL 
DSWGFQCFFPFVESNISLSLSFFFFFFK 
ANHCMCKCLSYLFVYWSLCQPS PFPMKA 
MSN 


5939 


19840 


A 


5990 


630 


316 


RWNVNSVA\QAG\IQWCDLGSTGNPRPP 
GF\K* FSCLSLLSSWDYPTHATMSSKFL 
YF*WRRGFTMLARLVSNWLVLNS*PSDP 
PASAAQSAGITGMSHRAQPIYLS 


5940 


19841 


A 


5991 


2 


182 


WQEQKAE/ CLRDWTTAMCTAAIiLTIAKG 
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X=Un known, *=^top codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














R\NIPCPMMKEWVRKMGPIYTFEYYSAV 
QKDK* RILLS CNNAGNS 


5941 


19842 


A 


5992 


79 


278 


EAK*RVNGCSKPTWHMYTYVTNLHTYV/ 
I KPAHCAHVP* KLKYKI I ITI IK* * RWG 
U I XJuPKliVbJNbWAUV J.LLPWPPKILiGIjQ 
A 








jyyi 


ZZ 


274 


AuUJirlKAW vjt x QPAM * ARLEMNTGPGDT / 
MPaMASQRAGITGVSH*GCPFFSLLNKT 
WRSH*VAQAGEQRLFRE*TPGLSCYSCS 
LPSN*DYTSAPLCPPYDT 




1Q9AA 


n 




34/ 


195 


I MHI i\H 1 H 1 HTHTHTHTHTH P \ E KHLKA 
PNEV* KSKNNV* I KP KQSQSCFS 




lyiyto 




CQQf 
J77J 


1 


122 


K x ciLial) ttualJJyl 1 X y IjHAIjJm&NNxQIjR* 
SSHVSIPST*GYRHVPSCPANF/SSLTM 
FPSLVSSCWAQVIHLPWPYKVLGL*ALK 
ENNYQLRTI IIDCTLNNNIF 


5945 


19846 


A 


5996 


365 


229 


RPPPCPANF \ * FLVETGFHHVGKAGLEL 


5946 


19847 


A 


5997 


357 


17 


ILVDKINNVWNLFQNNLWC * RGWGYK* 
DRSFHLLKTWaE*WVYEGSLY*SLSSY 
I Lir / C* RQQLlr^PRLVLNSWHQAILLP 
WPFSLLLCILEVFQNRELKKKNQGKPMT 
GS 


5947 


19848 


A 


5998 


1 


207 


PILYNLFQKI EAEGILPNSFYEASVTLL 

rjxmr , »VT~vT r T v D / TTTV+ t?TT t T+MTT xwirr 

SNQIQLYIKRITY 




iylyvy 


a 


5999 


603 


269 


EDR VLVLLPKAKSAV AWPRLTATSASW \ 
VQRSFHPSLPSRWDHRHVPPHSANFCCV 
CL*RQGLSLLPGLVLNSWaQVTLPPWP\ 
PKVLGI TSMSHHARPLS FYYGHFKYIQK 




iyoj\J 


2i 
Jn. 


OUWJ 




150 


U xKHAPliHiiAl cKK/ Ft* CRD/ RGIiSVLP 
GLVSNS WPQTVLPP * PP KALGLQA 


5950 


19851 


A 


6001 


365 


62 


AITAHCSLNLLGSSNPTASATRIAGTIG 
EAAMSYF * ISSTLTHFSYT/C* LHHVWL 
1 r tUTr r V JJo V x v AUAo LibLu-ib ooDFPAL 
VSQNARFTGVSHCTEPV 


5951 


19852 


A 


6002 


537 


236 


DRVSLCCPGWSAVARFQLTATSAFR\VQ 

♦ QQfT CT DQCr*FiVT? ClDDrt)lMT?T CCTD 
w oo*-Jooiat'oo^lJXKoAl^ir^irANr JbocTJK 

DEGGGLPGLPRLVSNSWAQAIHPPQPPS 
VWITGVINGTWPKIPLS 


5952 


tyojj 


A 








LSYFKQLPRPPTTSQQPSTWRQDPPPAK 
R/LRTYKGLDDR 


5953 


19854 


A 


6004 


327 


335 


NQSIFTFNNYPRPF/CCKFKFKM/TLHL 
WADKVAHTCNLSTLEGQGGRIA*GQEFE 


5954 


19855 


A 


6005 


2 


367 


WQFLKKLNTKLLYGPVF/ PYLGI YKQEK 
WXHVHAETCTQMFITVLFIIVPK*KQPK 

PDCMTTTJ* Trv*M*'I?T7XrMT 1? VD 7i T in?XTVT T 

\JJro rlrSrS ^ ± U * ri w c V xUtLLi jc* XJtCAXJRJSJNrvijX. 

HATWMNLRNTMLSE I SQTQITTCMNLLH 
KMFRKVTETES 


5955 


19856 


A 


6006 


1 


176 


PGFKQF FCLS FWTRGI FI EMGFHHVG * A 
GLGLLMSGDPPASASQSAGITG\VSHCA 
WTY 


5956 


19857 


A 


6007 


1 


289 


EVLTKI PIR/ FFVEIGKLIQDLKDTGPR 
IA*TMLTKKTEMGRIVH5DLVAYYIVAV 
IKMVWYYW*RDRHINQ/WERLEISEIDS 
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G=Glycine, H=Histidine, I=Isoleucine, 
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QNMFNWVFTKMQKLF 


5957 


19858 


A 


6008 


439 


223 


FFSPGSH/SIT*ARVQW/PHHSSLQPBP 
PSLKQSSHLSLPSSSDYRCAPPHPADLC 
RDWVSPCCPGWSQTLGLK 


5958 


19859 


A 


6009 


1 


252 


MVSIS* PRDPPAbASQSAGITGMSHHAQ 
LFFLN/ CIS/ELISSTVTPGLKESACIiG 
LPKCWHYRREPQHPACTVLYLQFQSDFQ 
LK 


5959 


19860 


A 


6010 


131 


514 


pwpyslsssgllssfen/vekwvpeiih 
\hslflgttidlrs\epcttekln\kdk 

YNPLTPQMADMMVCDLKVVE* LECSAFM 
WKRLK\VFDEALWAALEED*RSSCRKBT 
STLLQSTPLLVSYALPCTP 


5960 


19861 


A 


6011 


625 


1 A 1 

141 


PTVJQ P l?U*T r H'2Vf3\/TrWWMWrj Q T JTPHP PR T .IT 
Ci loo rcr v ia/w v pjinwnuo uLXXr\^cjri\SjJ\ 

* FSYFS LLLSS WD YRHVP PRPANLVYPL 

*RRf?PPSMTjPOAStiKF\LDSK* T PPCIiO 
I\.v3vJ xr it 01 imryrvij 1 ir» 1. .l it it v_j_is«/ 

PS/AKVLGLQVSHRPSPKVTFHQRAKBG 
D WSHWSRGKS I PDRGNSQCMHRS IPC 
VFEEQKGGQCYCSIESKEETSRI 


5961 


19862 




6012 


1 AO 




KHILS* PAVY/C* FLKNIK 


5962 


19863 


A 


6013 


363 


82 


VS VSLS PS LKTKVNYAVASAS FTIAKI W 
1COr.KCPSMDK"OMIC* TWHTHTMEYYSA/ 1 
KQKIMS YSTTWMNLEDIMIjSE I S I SQAY 
KGKHHMIYGI 


5963 


19864 


A 


6014 


329 


99 


HIFGNTYFFQVASSFIYLFRDSVSLCRP 
GWSAVA*SWFTATSG\FRPSACLGLPKC 
/WDYRREPPRPANVLFSMRVTH 








OUl J 






LAMLSRLVSNS WPQAI LLPLP P KVIiGLQ 
A 


5965 


19866 


A 


6016 


2 


308 


FWKAIAGIEGDSSDKCRQ/.SKLKTIWKK 
FISLDAMROTOTS*EKVKMSIFWKMIPT 
PRDDF/EGLKTS VEKRTEDWKTARQ * E 
LKVDRT* TDBELLLLDQQRK 


5966 


19867 


A 


6017 


*> 
2 


*07 


PRGFTP FSCZLSLPSS WDYRRPLPRPANF 
FYF**RRGFIMLARMVSIS*PRDIiPAIiA 
SQSAGITDVSHRRAERVISKQRIVSVMW 
K^LPEIHIPCII^LSLRFNHRSVAGLRN 
SLI VRMLS ILTHG 


5967 


19868 


A 


6018 


389 


142 


AHMLFAAQGDSS I PMLVAPLFTWKMWE 
QSKLPLADEWSGFW*HIHVTBY/YAIKR 
RB ILTHATTSLKALCYVT* ALTKGrRTL 










184 

JOT 




LNLPSSWDYRRAPSHSA/NFSYF*RDGG 
LTML\ PGWPQVIQMPRPLKAL *LQA 


5969 


19870 


A 


6020 


333 


84 


GL IDQEFATDTTKSTSLSLT* T/ IPSRC 
SLCLRGSSDSPASC/SQPPASVAGITDT 
CHHAHI I FVFLVETRVHHVGRANTEKPR 
LH 


jy/u 


IOQ01 


7V 




1 CifL 
IUO 


J DO 


DTISFFFFLKAPIEPVFIIFIFFYFFFI 
FFFI*FFFLLFLFLFIFYFVF/ IFFFIL 
LCLFFFFLVLLFYLIiFFF/SLFFLFCFV 
FCFFF* FCYFYFYLFFILFFFFFYIiALF 
I FFFFGSS FLFIVFLFIiFFLFCFVFCFF 
F 


5971 


19872 


A 


6022 


384 


18 


REKKPGPGTGKPPKKKGGPKFGPKGPKK 
RGE KKKKGG P /NS FF PRGGGTPGGNF * R 
GFPPENQKKRGRKKFFQGGKRGANPGIP 
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si inn t~f\ 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

— f~" til 4-o m ino U— A rAininA C-^Carina 

v^^vjiuLdinine, tv— Argmine, o — oenne, 
T=Threonine, V=Valine, 

W— Trvnfnnhnn V= r T vt-/vi i n a 

X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 

IIU1.ICUUUC fllSCriKliI 














PLLGPKAEGGFI PGKKKKKKKNSNKKSK 
QTKKALSRTSHS 


J7 14. 


IsOi O 


A 




q 
j 


lOI 


i-» v r\.VjI\.-L KonriO I W v IV1 W/ itr X iroVl v LriXl/ 

TYNPNPLGGQGRWIA*AQEPKTSLGNW 
KPHLYKKKI 


jy jo 




A 




JO** 


1 AH 


j. xinojjx xriiiriii r rv-LiNLxri l ±jo inn r tlx. Li 
CVCVYIYICLC\CIYVSVCLCVCVYTHI 
QYTY* INKHIC*RLEAK 


5974 






OUZO 


110. 


JUz 


YRHEPPCIiALFLIFVGT* *VYIFMGYRR 
Y/C*YRHAMCNNHIRVNMVSVGLiNF 


5975 


19876 


A 


6027 


99 


77 


REICTPTFTEAFSIEAKGWK* PKQPLTN 
VFHEISKSQKDKHYYI*FLSPTYVKIQA 


5976 


19877 


A 


6028 


2 


328 


TITYRGAKIRITSDSSSBTMQAREE*SE 

T T? + \7T.V / gyyVTU* DVTT VO/^tTT COVC IT 

if vujv/ niru\Jvlri w irixXxjXooKJLtorKoK. 
GEIK/YF*GKQN/LKEFVSSRPVLQGML 
KVVLQREGKLYRSETQTYKKKEKASEKE 


5977 


19878 


A 


6029 


8 


241 


GFAPLPRLECSNTNMAHHSLNLSGSSDS 
irAo/UMr bar \X]SlKrljnVAyAVb w rJjDb 
SNPPISPSPSVEITGVSHHARP 


5978 


19879 


A 


6030 


108 


362 


CCYLMVTEISAHQNPAKQGLA/HTAFP* 
KTFFLKVS WTGTVAHV* KPSTL/3G*GGR 

YP 


5979 


19880 


A 


6031 


1520 


77 


ASSSSSSSSVILSS *NIPNYNYIVIFM/ 
CL*IIF*AASSSSIYYVPGICYXYH\IL 
u V v_ V t_ A iv C 1 YS X C * Y I VYVS I 
YYIHY 


5980 


19881 


A 


6032 


405 


100 


EGFFLPPRGGGGKR/ PPGPliTPGGPGNP 
RPKPPQK^KPGPPPGGGPF*WVLLSPS 

T.P/3fifiVPGPf2B.\rQf2DPrrtrPDCf*CPni?CP 

TAVGTTVTFLS KKKKKQV 


5981 


19882 


A 


6033 


42 


309 


CDKFFHKASNHIYVSTYIWVFQICVCVC 
VWCWCILC*AY/CGCLGIMIWFLFVL 
*CI/CYCCY*SSLFSVIICIFLMCFVSY 

CilvJcvXr XXJT A 


5982 


19883 


A 


6034 


397 


2 


KGPPP*GGPDFF\*PRGGPPQKNPFFPP 
GGGPPPPKNGVGGPPPQKAPFFGPKRGA 
PQKGGPPGGGPPRGEKKSLCPPPNWENT 
PRGLKKGPPPFFFFFFFFFFFFFFFVLR 


5983 


19884 


A 


6035 


343 


2 


LWPPQGSFKTAAPFFFFFFIiFISDFITF 
HFLYM\YVFFIPLFLL*SFFCIK* IFYY 
VTF*FLQSFSILFFSWFL\WLL*SLLY 
S F * IiTP T YFRFT T/F * FO* DTPTT .PTTYVCj 

rp 


5984 


19885 


A 


6036 


319 


68 


SLGEICAPK*FFPPAQKKGPFQKFPSCK 
F *? PS PVFKTRPPJMCF * KGP P * GfCfCRYVZV 
NPGKI\GPPKGSFKRPPLFFFFFFFFFF 
F 


5985 


19886 


A 


6037 


419 


191 


RIHTlPITAIiFAVAKR/WQSKYPLVDE/ 
MDKIWYIHTMED*AAYGKKEIMSHARTW 
INLKNIMLS EISQSQKDKYFPF I 


5986 


19887 


A 


6038 


434 


401 


YGKNFSFSLKV* IFCEGLVLVFFPPKKK , 
/ FFFKNFHRWFPLYVFF*TGRGWFFK 
PPFLEKIFFFLTRVNLGPPRGFF*GAPF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFS 
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E=Glutamic Acid, F=Phenyf alanine, 
G=Glycine, H=Histidine, I=lsoIeucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glatamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














*NFY"FMN* I FTVI 


5987 


19888 


A 


6039 


12 


346 


SFSFVRRSLTVTQAGVQWCNIiGSLQPP \ 
LPPSFK*FSCLSLQSNWDYRCLPSCLSN 
FCIF/M*RRGFSMIARLVSNS*PQ/CDP 
PASASQSAGITGVSH/SHLV*NA*SFKT 
ESHF 


5988 


19889 


A 


6040 


275 


45 


KDVQHQWKIVQPIjWKQYGSIiKASTESPH 
DPSILLLGTYPRKMKT* IHIK\TCIRMF 
AEALFLI SKK* KLPKCLLADK 


5989 


19890 


A 


6041 


355 


1 


FGTI KNFFFYIFFFKNYF/ FFLYI IFFK 
KKVLFFF* KKKI FFLFIPNKKI FWVFFF 
FFFFFFFFFCPRFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFLVTRHLF 
EMRKGN 


5990 


19891 


A 


6042 


635 


209 


PLNI KKTFFFLRQSFTLVAQAGAQWHDF 
GSPQPPPPRFK*FSCLSLPRSWDHRRPP 
PRPANLVF\DFLVETGFHRVSQDGLDLL 
T/S/GDLPALASQSAGITGVSHCARPTIi 
EFLKVKEMIFCSTTVKAVLDHANFLILR 
FSIT 


5991 


19892 


A 


6043 


12 


348 


ESRYSSICLPFSSVYCSRIISLQLYSYV 
LLVYILFFF/CFLKREFGFCPPLGVQGP 
GLGSLNPPLLG*KQFSGLIFPGIGNTGL 
APPSPPNFGFFRKKGVSPCGPNRPAISK 
F 


5992 


19893 


A 


6044 


192 


1 

» 


IFHRFCT/HITLP/ TTALNKN* PWLGTM 
AYAYNPNTL * G * GRRIV* AQAFKSSLGN 
MTRPCLYKK 


5993 


19894 


A 


6045 


338 


42 


WKTAWW/ FLKKLD/MNYC*DPAILLLGI 
YPKELKAGT*TDIRTSMFTAELFAIAKR 
WKQNVDLKNLRQNECRQF I WAKVEGNWL 
PS WFGEWS I QHRLQAGF 


5994 


19895 


A 


6046 


300 


3 


RLDCHYLHSHTHTHTHTHNPAGIYTHH/ 
HCWNLHTHNPGGI LGVALDL*MQFLLNR 
GWSWGGEQMAWPCPQGHVMWLGDPGKMI 
LKPPLWVCQLQGSSYDG 


5995 


19896 ' 


A 


6047 


332 


3 


SLBSAGFT* YKMYFYYYYKIK\YIYKS I 
FIILFL*IFFFFDLFFIFFFFIFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFLKSP 
KRVI YFGFGTQNPQMFSLLTLKYVLC ! 


5996 


19897 


A 


6048 


2 


152 


KNLKIS WT* WLMPWSAT/ WGGSLKPGR 
LRLQ*AVIAPLHSSLGDRARPYL 


5997 


19898 


A 


6049 


1 


288 


P 1 1 1 YVLLLLFFETDSHS VT \ RL* CSGA 
FSTHCNLCLPGPGDSSASDS /RYAGVAP 
ACLF * PYRGFSGPATLAFRALDPGLPLH 
PGFSLQRPSCSRGG 


5998 


19899 


A 


6050 


358 


1 


FS I FS ILI FDFLGGFLKFFAFFYGWDFL 
RFPFFYR*PFGPKSSFIFPHVF*R\WIW 
FLIiLSSWFLKWSFDPQFFLKRFFFFFFF 
FPKG FFYKRVFFFPGFFFFSNLLIiSSQG 
KGVWLI 


5999 


19900 


A 


6051 


308 


45 


ADLS ARAL * TRRE WDD I F KVLKE K/NKG 
QPKILYPSKLSLINE/NEIKSFPDKQKL 
REFTTRLVL* EMLKGILHMEAQGQYLPS 
*KHTKV 


6000 


19901 


A 


6052 


2 


124 


I FCVLVETGFHWVAQAGL * LLS SGSPPA 
S/DLPKAGITCVSH*AGITCVSH 


6001 


19902 


A 


6053 


I 


330 


LC*PGWSR\FLTS*SACVGLPKCWDYRR 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, Msoleucine, 
K-Lysine, L=Leucine, M=Methtonine, 
N=Asparagine, P=Proline, 
V=ijiuiamine, K— Argmine, 5>=5>erine, 
T=Threonine,V=Valine, 
W=Tryptophan, Y=*Tyrosine, 
a— un Known, — otop cod on, /—possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














EPLHLVFFFFFFFFT* IYLKGFYLNPRK 
PSLWPNMVFFKRVLGEKI FRGFFPP 


6002 


19903 


A 


6054 


34 


293 


SPILFIiLCLRHQSGCHPSWTAVAQSELT 
AASNSWAQAILPHSWDYRHAPPH/LNLF 

LQA 


6003 


19904 


A 


6055 


319 


217 


YHFTPTGMAISKNRQKTGDNKCC*GCRE 
IUMJj V CC * WQ x KMVWIjIj/ WKxVWHSLKG 
IK/Ii*ELPYDPAIPIi*GIQRVSWKPIPH 
SSEG 


6004 


19905 


A 


6056 


341 


151 


VHAGGSPES/ REFETNLGNKARLHLYKK 
* ISQMWWCIPAVPATHAGPSLHFNLYYS 
PWLFQCDY 


6005 


19906 


A 


6057 


2 


558 


FFFLRWS FTL IAQAG VQWCNLS S LQPL P 
PRFKRFSCXjSLPSSWDYRHVPP \RIjANF 
FVFLVEMGFLHVGQASLELLASGDPPSG 
DPDIiR*LAFQSIiGITGVT/HHAGQIFLF 
/CLETESPSVAQVGVQWCNLGSIiQPLPP 
\G&1£ * r bCi/jijPSS WD \ YRHALTHPANF 
\SIFSRDGVSPSWPGWSQMPDLS 


6006 


19907 


A 


6058 


320 


21 


EICMTTNHPSVSNTIW/SKMTLASVYLT 
JjIiPI S aTSKTSGJjPASKE VNRYLGTCAI 
RYFVFYFIIMLFV*RQGLTMLPRLVSNS 
WAQVILPPWPPKVLGLQA 


6007 


19908 


A 


6059 


294 


50 


CYFSLSFGL*QFLSLNQSFVTLSSHCDR 
LJ x e E» * 0 Wo v J. / Pvj±GV z So * £• * TRGLAM 
MPRQVYNSWAQAILMPWLPKMLEVQT 


6008 


19909 


A 


6060 


616 


354 


ERVSPCHPGWSAVALSQPTAALTSW\VK 
0o5HIjoIj]jNSVTOYK.CMPPIjIjANF \ KFFC 
RNE/SLLMLPRLTSNPWAQVICPP*PPK 
VLGLHA 


6009 


19910 


A 


6061 


214 


11 


AASTRPSAWQPPLLGS EEPLCPATTPSG 
KLTlVv/ liH*fiRATWTMAVLWNRKGGKVG 
KRLRNRLVAMSV 


6010 


19911 


A 


6062 


349 


54 


QSKSAFPKKRINRISDSSEGYGQSKLKT 
FWKGFTILDAIKNICNSSEEVKIATLMG 

EVEPBSVTG*QQSHVQP 


6011 


19912 


A 


6063 


332 


3 


SQPASGQ I SKVTQSTLRCLKMS VQQVKS 

SYFYTHSIRSKIL/WPGMVAHTCNPSAL 
ESQGGCMT *GQHCETSMVNMVTPRVSRA 


6012 


19913 


A 


6064 


174 


416 


NENELFFCFFFXIFFFFXFFFFFFFFFF 

P PP P P PG P pV p P P T? 1? p P CM? I? p T? T? Y PT? P P 

FFFFFFFFFFFFFFFFFFFFFFFXF 


6013 


19914 


A 


6065 


826 


571 


DGSHSLA\RLECSGVI SAHCNLHLPGSS 
DSVASAS * VAGTTGT CHHARL I F/WFL 

LQA 


6014 


19915 


A 


6066 


1 


284 


GTSFFF*NRVLFCCPG*RAVPQ\SWLTA 
ALTS WAQGS S*NHRLKP/HMLS FFNFCK 
QGLTMLPKGALNSWAQAILQPQPPM*LG 
MACTYAWSLGP 


6015 


19916 


A 


6067 


36 


384 


VS KNKP IHLFLF FFS KTKSWS VT \RLEC 
SGVTSAHCYLRLPGSREGAASAL* LSAT 
TWLCKVALPLGEDLGALCRNPWGAGVKG 
SSRESPPLGEARGGELPWGQVRIRLWGR 
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GEPAP 


6016 


19917 


A 


6068 


393 


116 


PQPIFPRY*QNKDPNL*KTF*HKYMPKC 
PAK\LIIFIEKGFSRPGQPGL*HLSSSN 
LPALASQSAGISGINPGGRARFPLFFFF 
FFSIiLLLSF 


6017 


19918 


A 


6069 


53 


280 


FPRFCFC/ LLESESHSVA*ARVHWRRIiG 
PG/LKRFACLKLPSNWNYRHTPTHPANF 
CIFIRDQ(»ITr J *PSWS*T\PDIK 


6018 


19919 


A 


6070 


3 


377 


HEE PQGCATVPGE FFKFLWRHGL ITYPR 
IiVSNSWAQAALPPQ*METHIIFSIGRLH 
MVK\RQYWSGAVAHAYNPRLLGS *GGKA 
A*AQEFETSLGYVMRPCXHKNIiKNSPGT 
VAHPCGSSC 


6019 


19920 


A 


6071 


337 


3 


KESQAFFSKKKKKGFTKMV VFEQRCERG 
KGARQQSHLRKS I YV* EIASAKALRRHM 
PGVFQKQPGG / WPGVVAHT * NPRTLGGR 
CX5WIT*AQELETSIiGNPEKPCLYKKYKL 
V 


6020 


19921 


A 


6072 


1007 


628 


FFLIAHS PHSVAQAGVQWGDIiGS LQALP 
PGFMPFSCLSLPSSWDHRRLPPRLANF\ 
FIFLVATGFHRVSQMVLIS * PCDPPASA 
SQSAGTT/GARPKVF* IFVYMYFGVRKH 
SILMSMPQHDIiFGYKI 


6021 


19922 


A 


6073 


27 


364 


MDLPSVSIFDLRSLWADISRATSTLFWL 
LFSWNCFLHIiLSFNLFLSLGQK*VCCR* 
HIQSCVT*QWRY\WSGLVTHACNPSTFG 
SQGGWVASV*EFETSLCNMAKPCLYKKY 
E 


6022 


19923 


A 


6074 


2 


68 


ARACSHGTVALTAS CFPLQLDYKCLQYS 
FVCFFFFETKSNFVT\RIjKCNGPISVN* 
NFNLPGLTRSQA*ASREAGTTGTCYHA* 
♦ILLPSSTRL 


6023 


19924 


A 


6075 


330 


49 


KRRFALWQAGVQWRDLRSLQPLPPVFS 
CLSLPSSRDYRCPAS /RPS * FFLELLTS 
GDLPTSASQSAGIIGMSHCAQPPLIYFL 
FQIKVILFFS 


6024 


19925 


A 


6076 


2 


267 


ARGVEVKV^GKTQKFLNMIRRTLVHCWWK 
CKLIQPLF/ WRILPKVKEELPYHLAILP 
LGIYPKQMKSISL\KYICSYI * I VDHFV 
*AILPLGIYPKQMKSISLNIHMFLYIDC 
RSFCIGTFLLL 


6025 


19926 


A 


6077 


379 


236 


PPPPQTRETPPPGHVHKDPPPPHHPYPP 
PQEPQPRTPVPNTHPGEEQNHELFNSSL 
IiEPWW\YYSSLLPPEQA*EPHIH 


6026 


19927 


A 


6078 


2 


290 


RRSLPSSPLEDGRI>SIAPGLTWEPPSHP 
PPQ*RSPPPPKKKKKKKKSPPGPKF/HK 
FKTGKPEKPPRVL * RGHWPPFiCFF *LKP 
GLPFF*GTKPPKV 


6027 


19928 


A 


6079 


1241 


1487 


KMYCIFFVE*KPCLSCAIFIRLUCWS*P 
TVCNSISGGPPHG* IMPSIIL* FLFLET 
VFHSV\TRVGVQWPDHGSLQPGPPGLK* 
SSCLTASQSTQI IGVSHHAWPPS ITFEI 
FKRIAYSMSADGW* KSPPMKLDLGALKK 
KLSFARPG 


6028 


19929 


A 


6080 


333 


1 


LCHHVQLSFKFFVEMEVSLCGPGWPFK* 
FSHLGLPKCWDYRHDPPHIiAFFFSIiS\ * 
FFCRDRVSWLPRLVLKSWSQAI LLL* PP 
KVFFLFVCFLRRVS FHHAEP YAVSSC 
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Amino acid sequence (A~Aianine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I~Isoleucine, 
K-Lysinc, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=Threonine t V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X = Un known, *— Stop codon, /^possible 
nucleotide deletion, V=possible 

nn/* I nAf i rl a in cak*tiAn 

nucieoiiue insertion 


6029 


19930 


A 


6081 


2 


239 


ARGKY I YNRDKI SFCCPGWNAI I /RSLQ 
P*TSELKRSSHINRPNSWDYRCMPSHLA 
KTF * I FRRDGGLASLPRLGTGWLKS 


OVJU 


1 77 J I 


7V 


Ovoz 




1 Q*Z 
10 J 


Q2i^'T/ST.DK , ^ , / WnVPPHPT.fT.X7QD *\/T.MQ 
oAUxArX) trill./ WJ^IXrvtlr'ijL.Jj VaK* VlxNo 

♦AQAICLPWPPKVLGLQA 


6031 


19932 


A 


6083 


296 


2 


GERILEECIYARSHRAEITSLHCSLSVS 
K/HSVAQVWKWPDFCSMQSMPPRFK* P 
SCVSWSSWDYGLPPPCAGLHIWRRNVT 
LALQGLPTTSliALV 




19933 




6084 


A A1 

443 


1 


y KF K* * iv I VI WS o I SI^SRKMKKKX IiSV 
ITFFKT*GEEEEEKEKKRK\KEE*EK*K 
EPSQKKEVEEVEVEDERRGRK* KVDMEV 
KEEE KDKGEE KSGEEQ * KES AB AAVRE * 

A 


6033 


19934 


A 


6085 


328 


1 


SNIIiIFKAQVTKHIYSLBTCSPDYPGIS 
UAX KAJn r brL 1 VU* LUb'oNWAl KHJjPPCIi 
DNF* IFCRG\RSILCRLISNSWLQQVKL 
QEFPQFLGCHARFQGAGFLSHCLV 


fJYXA 


lOQIC 




oUoo 


299 


AC 

46 


A*S*LTVA\*TPVILSLRLK*SSHLNLP 
SSWD YRWSNS * PQGLLLPRSPKVHLLL 


Oil J J 


19936 


A 


oOo/ 


266 


AO 

48 


QKI/WLGTVAHTCNPSNCGGRSGRIT*G 
QEFKQSSHLSLPKCWDYRL 


01X5O 


199J/ 


jH. 


OUoo 


2 


185 


AKV I jyLuJr'KLiXoiNb ooi±J^XiKJW_W 

DYRCEPPCPAE I S F ISSGYLTRSGI AGS 
YCNSV 




19938 




/Aon 

6089 


520 


297 


XAi X KU rhHThS If P X S / WNYRCXiPSCPA/ W 
FFILVETN* FHHVGQAGFEHLTSGDLPT 
SASQ\SAEITGMTYRAQpAAGS 


6038 


19939 


A 


6090 


388 


187 


SQHLGKLSREDCL/RLIVPDQS \Gqhsk 

X ir oX»JSJv w r w iv w AW V n w HMlr V V FAXy oAh, 
VGGSLEPRKSKLQ 


6039 


19940 


A 


6091 


188 


329 


TbGLKRFFCLSIiR* FIYFFFRDRVLLYY 

PGWKAVGIHRRDYSMIjYSIjELLASKDS / 
crT.QT.T?csf5Kmvr , vwT twcit^m* r*vx>r\nj 

oLbJiuCobWU I V- V WXiXc r vrv-rl w UKKyDXi 

SIIPWQRSLIGQ 


6040 


19941 


A 


6092 


124 


342 


RPAWPTWQNPISAKNTNITWA*AARVAG 

LYPKI I ILYDKLSHYKCSHLQMRKALBT 
YSGIYSIiFTIYLV 


6041 


19942 


A 


6093 


169 


2 


SH*PFTSSTIRSIWT\GAVAHTCNPSSW 
GE *GGPMA* DO Kl* ETS r.VNMTP PHT ,VPP 
A 


6042 


19943 


A 


6094 


2 


324 


F FFLRWS L/ DS VAQAGVQWRDLGS LQAP 
PRGFTPFSCLSLPSSWDYRRPLPRPANF 

*PPGP , TMTiAPM\7* t ?TQ*PP'nr.PaT.3i 
CLE XSJWJC xriLuuU'ivOXO f rt i ji i i>a 

SQSAGITGMSHRARPAVYILTSTSN 


6043 


19944 


A 


6095 


76 


340 


LRKKFCFSVYPFIIiRWTFELLPIiNDFLM 
NFYFRIVFRFMKKLQR\WPGAVAHAYNP 

STKNM 


6044 


19945 


A 


6096 


349 


84 


FIiLSLF*DTISIiCHPGQSAWRTMTSQL 
KQSSHRSLLSSWDCRCVPPCPVNFFSFC 
RDE/SIxAMIxARLVSNLWDQALVAEAGVS 
VWARQE 


6045 


19946 


A 


6097 


346 


184 


LTMLPRLVSNS \GLK* SSHFGLPKCWD Y 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, Vpossible 
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RHEPPCLAKISLYITKFFHILSLPI 


6046 


19947 


A 


6098 


378 


250 


FKQFSCLSLPISWDYSREPPQMANS\*P 
LVETRFHHVYSYVP 


6047 


19948 


A 


6099 


3 


225 


GFKRFSCLSLLSSWDYR/HPAPYLANF \ 
* FLVEMGFHHIGQAGLELLTSGDPPTLA 
SQSAGITGVSHHAQPIil IF 


6048 


19949 


A 


6100 


1129 


457 


YCVSSKYRRIISFSLIA*LLI,TKLGS/V 
QKVGVQWRDLG* LQPPPPG\ FKLFSCLS 
LPSGW\DYGHVPPHPA*FCVF*WEWGVS 
PPWT .RT.V\ T jNS * PSGGSRP PS AS OSAGS 
LQGLI \HRARPEH* F *TS * PLGEGKTLD 
EIYIKTKTFYQFCIVCLKLH*RI»MEFAL 
VAQAGMQWHDLGSLQPLPPQFKWFSCLS 
LPSS * DYRYWLLYHPALELKEAPSFPBS 
PCTH 


6049 


19950 


A 


6101 


3 


296 


EI IQALLITIVLGLYRTLLQAS *YL*SA 
FTISDGIYGSTFLiAA\TGIiHGLHVIIGS 
TFLTICFIRQL IFHFTSKHHFGFEADA* 
YWHFVDED*LFLNG 


6050 


19951 


A 


6102 


1 


749 


RHBGGLFKTILLI *TSLTIGSIiAIiAGLL 
FFTGFYSKNHIIE\TANISYTNA*ALSI 
TLIATSLTSAYSTRIILLTLTGQPRFPT 
I/TNFKENN\PALLNP\ 1 1 YVAAGSLFAG 
Ft T TNNISPASPFOTTIPIiYLKIjTALAV 
TFLGLLTALDLNYLTNKLKIKSPIiCTFY 
FSN I LG FYPS I THRTI P Y\ LGLLTSQNL 
PLLIiLGLT*IiEKLIiP\NTISQHQISTSI 
ITSTKK\GILKFYFLSFFFPLILTLLLI 
T 


6051 


19952 


A 


6103 


377 


1 


SKAFGPPGF*APYGLKAHCFPPGFRRGV 
WAPSGFWARPPIGYPFGALIGAPVWPSP 
GGPPKPGPP\GPFGVPKLGGKRLGIGGP 
FPGSPGFLTTPGSKKKKSTKKGKPQKKI 
TPLDTSCSRTIVM 


6052 


19953 


A 


Z*1 Ait 

6104 


1 AA1 
1001 


DDI 


FFLRRS FTLVAO AGMOWCDLGS LOPP P P 

gfkqfscfgliisssfdyrhvpptlanfsv 
flvem\gfsmlarlvsts*pq/ cdppap 
asqsagi tgmshcaqlpkpi fnyl* aid 

SVTTS I S LARDWPNSS ENRLLGYMGKPS 
KFKRQEQKMECYRDLAPS 


6053 


19954 


A 


6105 


376 


117 


CSDWSAVSRSQLTAAPNS* \VKQSSCHL 
SI^SCWN/YRCQPRHLANFVLFCYVLF* 
RjWALAMLPILVLNCWPQAVFLPWPPKEM 
GLQA 


6054 


19955 


A 


6106 


689 


433 


LCHPGWSAWQSWLIATSPSLV\KPSSH 
IiSLLSSWDYRCTSPCPANFFVFFY*R*G 
STM* PRLLWNSWAQAILPPRPPKVLGLQ 
A 


6055 


19956 


A 


6107 


2 


176 


AREYPANFFVCLFFLILCRDG/DFTMLL 
RLI SNSWLKRSDRLSLPKW*DDRREPPH 
QAY 


6056 


19957 


A 


6108 


328 


69 


LYI FS FG* ERVLFCHPGWSAVAQSQLTA 
ASTFRTQVLSIjIGRWDNRHVPPHLADL/ 
SFYRDGVSPCCSG+SQTQAILLPWPPKV 
LGFQE 


6057 


19958 


A 


6109 


777 


329 


FLH^GSHSVAQAGVQWRDLGSLQPPPPG 
FKRFSCLSFPSSVfDYRRPRPRPA/NFFV 
FLVETGF\TILARLVSNS* PRDPPASAS 
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K=Lysine, L=Le urine, M=Methionine, 
N-Asparagine, P=ProIine, 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QSAGITGVTYPRPALTQNFSNLTFGEEN 
RKPNPEDSCVRYLLLS IRQRS PTALLHV 
ILFQQMIQLVL 


6058 


19959 


A 


6110 


345 


309 


ASL I FV* LVEMRFLHSGISLFNSGKNGP 
TAFSQDPPG\SSVSKKAIIIirNKIWK* 
IQHVTVIQIEVQWHNHVSIiQPLNPELNR 
FSRLSLLSSWDHRHMPPHSANF * NFKN 


6059 


19960 


A 


6111 


332 


3 


RKVLVCPPGLSAVAGFQVPARSAFQVHP 
LFLBAEVAVS PGCAPAFKPGPQGPF * KK 
KEEKRKKKGRRKEEE/EEEEEEEEEEEE 
EEEEEGEEGEEEEE FYFLRAGKVKS PRA 


6060 


19961 


A 


6112 


498 


191 


STDRVLLCQKARVQWCDPGSLKPPPPGF 
K* FLCLS LPKS WNYSHAPPCLANF\ *FL 
VETGFCHVGKAGLELLASSDPPALASQS 
VEITGVSHHAWPMQEYFQI 


6061 


19962 


A 


6113 


1381 


172 


RNIDSKAILSKKNKAGGITLPSFKLYYK 
ATVTKTA*YWHK/NH/DVD*WNRIENLE 
MRPHTYNF\LI FDEPDKNKQWAMDSLFN 
KSCWDNWPAIWKRLKMDAFLMPYTKINS 
RWIK/DLNVKPKTVKILEDNPGSTIQDI 
GTGKDFMTKTPKTIATEAKIHKWDLIKL 
KSCTAKETIKRVNRQLTEWEKIVANYAS 
EKGLKSS I YT* FKQI * KK\NHPIKKWAK 
DMNRHFSKEDICGQQSY/VKK/CPASLI 
IREMQIETTTRYHLTPVQMAI I KMSENN 
RCW*GCX3EKNMLIHCWWKCK/IjKLVQPL 
WKTVW/RIKTEIPFNPAISLLGIYPKE* 
KS/C/CYKDTCTRMFVAA\CPSVTDWIK 
KMWYIHTREYYTAI KRNKTDWAWWLMPV 
ILALCEAEVGGSLEVRSSRPAWPTW 


6062 


19963 


A 


6114 


3 


155 


HEKKI SQVWWCMPWP/ TILWWLR* EDS 
LSPRKLRLQ*SYDCTTVLPAWATE 


6063 


19964 


A 


6115 


47 


368 


STPTPSAFLCLAF/ YRQRWLCWPDWGA 
V\IQHTYSLPT*LS IIiKQD* PLNLLGT / 
WSSRRVAACPENQI YRRNGVS /MLPRLV 
LNSWTQVILLPWPPKV/LGLQA 


6064 


19965 


A 


6116 


1 


592 


RQRIFLI4ECGGAISAHCNLHLPGSSNSP 
ASVS * VAGTTGVRHHAQLIFVILVETEF 
HHVGQDGLDLlL/NIMIHPPRPPKVL*LQ 
G 


6065 


19966 


A 


6117 


1 


306 


LWKTLWQFVLKLNILLPYNPAIVPLGIY 
PNKMKIYVHTKTCTQICIGALFIIAKT* 
KQPTCSSIGEW/lKKIiWYIQTMEYYSTP 
ERNEPSSHENTWKNLKCI 


'6066 


19967 


A 


6118 


2 


284 


QTGVHWHDQ\NLL*PETPG\SRDSPGSA 
SQVAGTTGMHQHARLNFFSFSFFLSFFF 
FFSLGKTLGLGLEFFPRWLQMSGPKKIL 
RLYLQNPKNKG 


6067 


19968 


A 


6119 


2 


315 


SRVAGI TG VGHHARL I FVF / CFFFFFLE 
RDFIiFVPRVGFQGPNIiG*LKFPLPGIiPL 
FSGLTLPKTGD *GAPLQPRVNFWI F*KK 
G\V*YCGPGWFEILDLRGSPPF 


6068 


19969 


A 


6120 


42 


392 


LQWRNLCSLQPLPPVFKQFSCLRLLSS* 
DYR\HAPPRLG * FC I FS RRI VFLHGWVR 
AGSRNS * PQDGSAPPQASQtiAGITG \ VS 
HHHTWAPHFLFF KNSGCSTQNR * QVHKV 
GDKETS 


6069 


19970 


A 


6121 


2 


202 


VHLSDQNQPKCP*MIEWIKKMWHIDTME 
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W=Trvntnnhan Y=Tvro«ne. 

X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YYAAI KKDKFMS F \ ETITLGEETQEWKT 
KHRLLSLIRGS 


6070 


19971 


A 


6122 


355 


161 


FIXYLFFFFFFFFFFFFFLFFLSFLFFF 
XXXXXXXXXLXVFTGMIS*LWVCLYVYM 
NVYM*IYMF 


6071 


19972 


A 


6123 


1 


133 


LC*PGWSR\FLTS*SACVGLPKCWDYRR 
E PLHLVFFFFFFFFTI* 


6072 


19973 


A 


6124 


3 


212 


HPGLPSSWDYRCT* PCLftNF/SCCCC*R 
LGLTMLPRLASNSWTQVILLLWLPKLAQ 
NTSLINNFYIDQMLK 






j± 




J 30 


z 


ALLTHAKNKAL \ WLGAVAHD CNLSTLEG 
*GGWIT*G*EFNISLOTIVKHHLY 


6074 


19975 


A 


6126 


345 


34 


HPSTIHRVFCSL*S *SSPQARNNGSCLS 
SEHFGITGVNHHTWLLFFKKNY/CILIK 
T RTGS * YVADAGIiEIiIjSSSNPP I SGSES 
AGFTGMSHGIRPVS PSUCWS 


6075 


19976 


A 


6127 


2 


328 


araylplcffiiffi*vsviyfiwll*g 
li *lnkrylahpk\ y* lnvs ysvs fckt 

HFFPRHKEKAI IQLGWAHACNPSTLGG 
OC3GW IT*GPRFETI LTSMEKPRLYY 


6076 


19977 


A 


6128 


1 


341 


GTRKFSDYRGKPPHPAKNRNLLFSIFFY 
FETLSCSVLLCEVTSSQL/CNSFHFPDIi 
PSSNDPPSSASRAAETTGACHHAQMIF* 
FFVETGPDWS*TPG /LLAVLPPRSPRML 
ELQA 


6077 


19978 


A 


6129 


264 


2 


TAISENKVASRSEHRRHRML*LQVI/IjI 
alohr sordnlsh * kn* kkkmlgevaha 
CNPSTMGGQGGWI S *GQQFQTSLSNMVK 
PPRA 


6078 


19979 


A 


6130 


1392 


1157 


LSLPKCWNYRREPPCPALCCPGWS*TPR 
hK* S S PLGLPKCWDYR/R* TTAPGLFFI 
S KSSLRS PAYS CVTVCTLQLSHH 


6079 


19980 


A 


6131 


383 


108 


MRLCSQPHHHVDFSVLETESCCVTQVGV 
QWHDLSSLQSPPPRFKRFSSLK*LGAIJi 
T.r.pQ c urn YR CT P PH PAN /LCRF *HGPVC 
LYMFGEKVTP 


6080 


19981 


A 


6132 


307 


1 


LKTLPALCELESHSVSQWRNLSSLQPPP 
LRFKRLSYLSLPSS*DHSHP*AHQANF\ 
CIFSRDRETGFHHVGQENLIMIilVQQSS 
GLGFS I FATTjIiAVMQLV 


6081 


19982 


A 


6133 


351 


66 


RWSFTMIiHRJiIjLKS *LQ/CDPPASASQS 
AGMTGMSHHTRLHNLF*TANNLTQSHTK 
FYSFISLTPHFAINVTNYIFLYYTSINI 
VCIFLSFKFYARI 


6082 


19983 


A 


6134 


2118 


1392 


FFCKGK*IFIQCPDWGG/DLCV*QEKIQ 
KKTYHTGKDKYV*LF*KVKPITKI*RQI 
HTSGNTDAQLS YKENHDQGH * QAKNFKF 
FIiFFVWYLRRSL/NSVTOAGVOWRDLG 
SLQPPPPGFKQF/SCLSLPSSWDYRRPP 
PRPANFC/ 1 FE*RWGFTMLARMVSIS+P 
RNPPTSASQSAGITGVSHCTRQEFKVKT 
GYHFHLLPSGCLE^LFYVFYLYACLLYVS 
VYYPLRQQESRKISKIEMFLTTWH 


6083 


19984 


A 


6135 


383 


47 


HLFLNFYYYIiSHYNYIjYYFLLHLIPSSP 
KIFFFFT*KIIFFLIKFNKVF*NLSF*K 
KIF/IFSYTF*FWPP*NFF*KPPPFFFF 
FFFFFFFFFFFF*QTLFLGGCGYNTKLR 


6084 


19985 


A 


6136 


2 


196 


TRPS I IiVTAL YYLY I FTTTQWG * \LTHH 
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nucleotide deletion, ^possible 
nucleotide insertion 














TUMTaDOPTPPMrT Ml? TUT ODTT T T OT XT 

PDIITGVCS 


6085 


19986 


A 


6137 


700 


433 


FLRWS S ASVTRLE * SGAI S AHCI LCLLG 
oo ibrAJjAb * V Ali X Xvi X UnriAQLi I Evr Li 
VETGFLHVGQDGIiDLL/NLMlCPPRPPK 
VLVLQA | 


OUOO 


1 0OQ"J 


A 


/no 

6138 


377 


AAA 

298 


QMiCWK 1 tit) if irRYRVXiliC/HP W5AVA*LQ 
LTAASN\PGLK* SSCLSL/MLTMLVLNS 
WPQVILLPWPPKVQEL*AIAAYTIKMWV 


6087 


19988 


A 


6139 


I 


199 


LYTTYFVTKTLIjLTSLFL* irtg\yprf 
RYBQLIHr J L*KNFLPLTLALLI*HVS IP 
X X Xoo X FFyX 


6088 


19989 


A 


6140 


381 


28 


STPNRL IERQEDPPES *N*TNPPGLPRKT 
PSFTKNPKINLARWGGPENPPNPEEKPR 
KVP ES P KRKKP * TKIRPLPS S PGDPTKP 
P\LKKKKKKKKSVVPATQEAEMGGPLKA 
RSLRL 




19990 


A 


6141 


1 


209 


LKLXiASSDLSTLGSQSAGNIVGMSHCTQ 

PKI1WGILLKLVYRFNINPVKTPAGLF/ 

TKKKT*KTEIDKRILKFIWKCTEHRIAK 

QSLKRTKLEDFLLNFKTYYEATEI KTS * 

XKraiNPVIvrPAGxjFKKKKHEKQKL 

S 


6090 


19991 


A 


6142 


3 


282 


QG I FLTKR VKNF * RGKFKTL VKKNQGD P 
KKGKNPPGPKIGKNNFGKTPFWAKKI * K 
FNLI\PKKTPPFFFQKLKKTGVKFFWAP 


6091 


19992 


A 


6143 


382 


158 


FFP*DSLAVSAPGVQ\WHDLCSIi\RPLP 
PRFRRFWCLSLLSGWD\YRHVP\QYPAK 


6092 


19993 


A 


6144 


185 


316 


PCLANF* IFFCK/NGGFSMLPTLGLNSW 
AQG I LLPKPLKVLGLSA 


6093 


19994 


A 


6145 


489 


231 


SVA*AGVQWHDHS*LQPQ/PLGLKQSSC 
XiSLSNSWDYRHMAPHPTNFLKKFFVETG 
VLSCCPGWSSTPGIiKGSFCLNIiPKF 


6094 


19995 


A 


6146 


179 


1011 


GTILNPQSLPFKPFFFSTLIFLKTVLIiC 
CPGWVSNSSGSSCRSLPKCWDY*HEPPH 
SAL*TPPWPFP*LVKDTSIHSTQLLKAB 
IFSASLDAAPSPTPYIRTHPSSGLQRWL 
LDAARSPTPYIRMHPASRLQRWCSTCTIi 

J-iiroiro IrlXf A W AMX XAooLiXoPr r PliPP 

PAHFSQCRMTFCLFVL/CLFEMESRSVA 
QGWECSGAILGSL\QPPPPGFK/RGFSC 
LSLLSRW\DY\GHAPPCPG\NFCIFSRD 
G\VSLCWPGWSRTPDLWHPPRPPKALG 


6095 


19996 


A 


6147 


348 


3 


GYHKT INWHCCMS VFANLTCGFNAI P I K 
/ IPAKYFVDPDKLVLKFTWKGKRPRIAG 

W*W*NNRHNHWNQIGSPEIAPHKYSQLI 


6096 


19997 


A 


6148 


22 


228 


THRVALFVRT*NWKQPKCPSTGEWLNTL 
CYIHTIE Y* YYSAI KRNKLS IHVKT\WI 
DLKG I VLNENNQI 


6097 


19998 


A 


6149 


406 


279 


RSIQGSSHSHAS A/ S * VAGITGARHHTQ 
VI IVFLVETGVMALTY 


6098 


19999 


A 


6150 


406 


10 


MRYHFTPDERLKSKGLKLEYW*GSGETR 
SLIHCWLEYKMSHPLWK*/TVWLFPIiO, 
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C=Cysteine, D=Aspartic Acid, 
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K=Lysine, L=Le urine, M=Methionine T 
N=Asparagine, P=Proline, 
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Y=T In known *=Ston rod on. /^oossible 
nucleotide deletion, \=possible 
nucleotide insertion 














NKNVP YGPAISLLDIL* EK* KHMSTKNT 
YK* MLQALF I ITKNWLLS I KRRMTKQ IM 
VYSNRNYYSTIKNELPIQTRI 


6099 


20000 


A 


6152 


367 


I 


PTPWEAEAGGSFKPRR/LQ/CPN*PPMC 
SCT PAWAPGS P P VS KKKKMQ FHRRMGHV 
SSAWWPOC3ADVRNLRAGGYVRHRERFPG 
LPWAAGGRDEVTRCLTVCTACGEDHPGS 
HLAGPRGGGFP 


6100 


20001 


A 


6153 


441 


38 


ILNPGEAITSEKYVQ* FNEMYQKAKRL/ 
QVNRKG PVLLS YNA* PQVAQP VSQKLNK 
T/5HEVIiSHRPYSIiS LSPTDYHFFKC IN I 
FLKEKRFYNQQDAENAFQEFTES *SA/D 
FCTTRINTFISHWSKYVDYNRSYFD 


6101 


20002 


A 


6154 


308 


121 


GESFPAEKTFPKENDYRWPSFFQWSSVIj 
/CKIHGLLMLPRLV*NSWPQAILLPWPP 
KVLGLQA 


6102 


20003 






1 




LLDIT*GRCKPCPYK/ESIC/RMFIEAL 
FIIAKKLKLSKCGSAGE* INKV*DIYTT 
EYYSAI KSNKQLTHrri' W I KLKSITPSE 
*SQT*ETACFTMLIiIWHSRKGNIIVTEN 
RSMF 


6103 


20004 


A 


6156 


77 


415 


rLGLKPTSSSHLSIiSKCWDYRQEPPCPT 

OJCifU* rr rr c v3\xc>.f jrimuro jj rv^^i. r r v e» 

MG\FTMLPRLVLNSWPQAHLILP/ASAS 
QSVGI TGKSHRAP PQKLTFFF FLGGNFP 
FGPQAGCPGAESNYPEP * TPGLKAFFGL 
NLSKCLKYGDSPTAPGKLEIFFFKWGGV 
SPSCASRVLNPQPR 


6104 


20005 


A 


6157 


436 


206 


LWYIHTMEYYSALKKNAILIHTI\STWM 
SIiBNMLNEIKQEWHKRANI * FHIjYEI PR 
KE55RMEVNOGLGRRGTWEPTVY 


6105 


20006 


A 


6158 


416 


50 


RFFTHPPAGEFFSPPPPKNIFFPPPP+K 
FWGGGGKASPPQKKFPPIiPPPQNGVLCF 
PPHKKK* FFPPPEKTVEPGPIjHVGQRPP 
QS\PFLCYPPLSFFIjYFCYFI FFPFVFF 
HFLPFFFFFIR 


6106 


20007 


A 


6159 


443 


145 


FFFFS IRFLFWWLYQG* EYLCFFMT* 
* IGVLLFLFEICEFFFFLLVGILFGMFL 
/YFFATASPGLFFFCFFFIFFFFFFFFF 
FFFFFFFFFFFFFFFFFFCEKS 


6107 


20008 


A 


6160 


322 


2 


TQDKNRRHRARHI YKRARKSARD * RSLV 
HR*K*KVQKRNEY*PEGTLHTCIALSQG 
NNI*DA/HSQE*WITPVI*TLWEAEAGA 
SRGQEFKTSLANIVKPDAWANAW 


6108 


20009 


A 


6161 


421 


57 


FCEMKFTPGNNAVNIV+PTTKDLEY/WI 
NI/DKTVAGCEKTDSNYERSSTMGKML\ 
SNSIAWYTEIFRGRKSQLMWQISLFSFK 
KL*QPLQYSATIALIRQQPTTSRQDPSP 
AKRLRLPEGSGDH 


6309 


20010 


A 


6162 


333 


41 


GAWACHLHVGAALNAPTPGLSRFHEEPG 
WSCRTAGGGSRLELS VFFVQMGS CHVGQ 
VGFFCIFCR/ DRGLAMFSRLVSNS *TQG 
ICPPWPPKVLGLQA 


6110 


20011 


A 


6163 


2 


423 


KINNTKSQCFENIIjKIEKTIiSNIjTKKE \ 
REDTNY*YWE*NMDITVESTDIKRIIRN 
TT\ETLHKFDNL 


6111 


20012 


A 


6164 


434 


5 


ICHIFFS ISKLLNILRPVSKLFFS INFI 
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FGPELYYIi* IHPNP* *GKCLS FLFIiQNI 
CIGPFFNSFMQI*SYPLRISLFIPVSPI 
JjUj \ F vy X F I VS IN I tiYLS I * LRIiFFS F 
FLSLFliFPFYYYTIiSFRVHVHIVQVSYI 
RIHF 


6112 


20013 


A 


6165 


1238 


1017 


ETGSHSVTQAGVQLHNSSSIiQPAPPRLK 
* So x JjbliijSSWDYRRTPPQPAN \FCIFG 
RDKVSPCCPGWSRTPEFR 


6113 


20014 


A 


6166 


504 


210 


FFFKTRSHDQGSLQP* PLGIiKGSSCIjSR 
TiiFHSWD/ YRV / PPPHPAKKYKNIFIFF 
VETEFHHVAQAGLKVXiDSSNS PVSASQS 
VGITS I SHCAQPILCQF 


6114 


20015 


A 


6167 


258 


65 


YKTKNTTSIFSLPSSF*TQILTF*NTYY 
YYY/CRDE/SLAMLPRLVSNSWPQVILL 

PWPPKVVEIjQV 


6115 


20016 


A 


6168 


1 


156 


LARMVSIS*PHDPPAPASQSAGITGVSH 
CAQLFFFFFWRQPRFFLRAGGQGRDLGK 
VKPLLSGVRPFLS *TLPTEGRKSGPPTH 
FG* VXiRE * FGGSH \ PVSSLW*QSAGITG 
VSHCAQLFFFFFWRQPRFFLRAGGQGR 


6116 


20017 


A 


6169 


3 


285 


HAS AHASGNILS YVGS FISLTAAI \LII 
IMI*EAFASKRNVLIVEEPSINLE*LYG 
CPPP YHTFEEP VYI ICS RRIRRS L IDQDA 
ARKETQQGAAD 


6117 


20018 


A 


6170 


645 


15 


LPFIYCSPCRNFNSLFWQERDSNNPLG/ 
ISLPHSDKITFNP\YYT\IKDALGLTSL 
HSLLKDLHS FPHETS *GPQAYYSL/ QPT 
PLNNPSPHARPEWI FP YCAYHNFSGSGP 
LTNLGGVL\ALIiL\SILHPKQ*FPHPPS 
YPNKQRH*YFGPLRPITFIDFLGRRPSF 
ISNRESGGQPVSYPFTIIGQVASVLYFT 

T*TT TT T T1«T> TPT TOXTVTT tTMH 

T 1 1*1 Jj 1 PT I SL I ENKI IiKWA 


6118 


20019 


A 


6171 . 


409 


25 


YTTPGGKIFFKSNPGSKLFPPPPF/PPF 
FSPFSP*KFFFSPKPLNFFGGFFPFFPP 
PKKRFFSKNPPLVFFS PPFKEKVFPFPP 
JrljNFiUrrRVr IiKAPPPFFFFFFFFFKIKj 
EVKWLS EALDCKS KDRG 


fit 1Q 


ZUUZU 




ol 11 


15 


^o 


unr t lfr , DTllTT7*ntr\rr t nu nniiic rrn rnnr 
V ULiJbJVUK X M J. r * U ItV J_iUUri FtsWia 1 VAK£> 

♦LTATSRSSCLSPPSSWDYRRVPPCLAS 
F/SF*YIALNN*A*AIILPQPSKV3jGL* 
AVAHAYS PSTLEG 


6120 


20021 


A 


6173 


325 


470 


IFTFFFLLAFI*FFCRNR/SLALliPRLV 
IaNSWLQAVLPPWPPKVLGLQT 




l\}\jJLL 


7V 
Jn. 


Ol /4 


Z 




*PT I?ri'NTCT?T'Df~ , lf r PTf , lCG Gr"TOUVT/TVMr7V "A 

lJjCtiJlNo rXJrUJVlXyoooL.Xr'wl V lyKAA 

CITLLSM*PKKLWYVYTKSCTQMFIVAI 
FI IAKNWKQPRCLSVSB * INKLWYI * TM 

E *ypvlkrnelsshektw/mem* iqlli 

ERSQS * KAWCLHKKLHTDVYSSYIHNC 
Vivi_i n^tt. x ru T io r o jYtttTiM £\. 


6122 


20023 


A 


6175 


3 


408 


RYLTLAIRDMQIITPVSYHFVPSGDYGQ 
KDNNKCW*RG*KMGSLMHC*W*CKMVQP 

T U7tn\7VTiIf*\l?T V /fit TVTjTJTk TT T?T /"»Tvrvr»Qim 

XiWKAAWyFXtit/ UVJUiRAXLiFIAaX YPREM 

KTYVHAKTCMHIFTLALFLYVHLNALtlS 
VMDWMLGHPIEEISRDSY3CVCLL 


6123 


20024 


A 


6176 


2 


237 


WVWWL I P VVLAAWEAEAKGLLELGS S RL 
* \WTMPLHYSLGNRARLCLKKKKKXKFK 
VFPPQKK3CKXK30CKKKKKGGAL 


6124 


20025 


A 


6177 


387 


154 


AWQWRDLSSLQAPPPGFE*FSCLSL\W 
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DYRHKLLTSGDLPASASQSAGITGVSHC 
AQPLSLFVFFLTNQRFVAALS 


6125 


20026 


A 


6178 


1 


239 


FFFFEMECLTVSQAGVQWYNIiHSLQPLP 
PGFKQFSC\LSLPSSWD*RVPTSRPAKF 
/CVIF*DGVSHCQPGWSAWQPPLH 


6126 


20027 


A 


6179 


325 


50 


RKFPLTWRRVS FFSLLRSSTDWMPTPHR 
HIGGI ICVTQSTDSNIGLIQKHPHRHTQ 
KNV* PNV\WHPMAKSS*HPKLTITVYPL 
STWHTYTSL 


6127 


20028 


A 


6180 


157 


338 


SQHFGRPRQEACSSSEVRDQPGQQREFP 
HLYKN* KEITQVWWHMPPEAQDY/LEPG 
RSRLQ 


6128 


20029 


A 


6181 


3 


273 


HEGQSKLETFWKGFTI *DVIKNIDDL/ W 
RGVKIPILTGVWEKLIPTLINNFEVFKT 
S LEE I TE YWE IGRET iKT iKVE P KDVTEL 
LQSHEKT 


6129 


20030 


A 


6182 


2 


359 


YYPAIPLLGIHPK* KNS I YQMHTCTHWF 
TAVLFT IAKIWNQL * CSSMGEW I KKM / W 
MHTHTHTHIHTMEYYVPIK/NBIISFAA 
TWMELGVT ILTE I IQAQKKTKGI FSHLY 
VGAKIFDYM 


6130 


20031 


A 


6183 


3 


211 


NPGKLSVS INQDCFPWTPCPFQMGRCPT 
LLF\SFYGGQAI PLITPSVGTQIFRLKM 
/ PEVTHHFHSFP * EPLTAAALNQPSLPL 
SIiLVDF/RDRGLFCHLGWSAVD*SHLTA 
ALNS \LVTQLS CPSFLSS WD YRRMLPCL 
GNF* IFRLKMLKSLTIFILFHESH 


6131 


20032 


A 


6184 


707 


384 


LQIFFYNRVKCVKNKSTQNQFYIFSRIY 
MHANRHGKYEI /WI I KKMPMREG*AVQL 
RMKWKUKKNTHI KLGVVVHGYNIiSTLG 
GQGGRIT*AQEFKTSLGNIARLHLYT 


6132 


20033 


A 


6185 


720 


363 


FFFWGRSL*TILF* /QTGSHSVAQSGVQ 
WHHHSLLQP*TSGLKQSSHLSFQSSWDH 

RCAPPRVANFFLD fcrdrvcvaqaglkl 
IiASSNPPTVASQSAGI TGGSHRTWPFWD 
ILGIMYWGC 


6133 


20034 


A 


6186 


439 


66 


KPKTLGSKKVNSNPRVPPGK/ PWGVGPL 
GPGP*MGGSKP/S *NPSPPPPLFKTRFG 
FFPPG*RQGHYSGFLQPPPPGGKTFFPP 
SPPKKWGHKKKIPFQKKKKKERKKRNI I 
GSWTTSETVGNPAPW 


6134 


20035 


A 


6187 


83 


287 


NKMFKI FRFRF/CGFLGQSHS VTQGOVQ 
WHDHSSLQP*TPGLKRSSLPTCWDNRHK 
PPHLVLFCFYYRL 


6135 


20036 


A 


6188 


290 


609 


RRMDDPPFFKYFFERRVLVCHQGCS*V\ 
K*SSHFSFPCSWDHRCV/PMPHPAN\FN 
FFHTDKSLTMLPRLVSNSWAQVI LWPLP 
P*VLGFAGMSPCPQPPSQFCIEAV 


6136 


20037 


A 


6189 


2 


371 


RYIFEKVFAKYSELTNDS LS ILSGKSGI 
LWKKRI»SLFS*Q* S YKWFSLGAI YCGMH 
*KRFLGTSIIi/FTE*VETGFHHVDQGGL 
ELLTSCETLAS AY *SAG ITG ISHCAESQ 
STLKKYI PGSTI 


6137 


20038 


A 


6190 


531 


205 


RWRFALVAQAGVQW/HDLGSLQPLPP* F 
K*FICLRIjQRSWDYRCAPLRPA\NFFVF 
LVETGFHHFGOT\NLQLLXSGDPPASAS 
QSAGITGVSH*ARPWFIIFTALIiYK 


6138 


20039 


A 


6191 


52 


375 


NRCETSFLFFFFEKKFWFCPPGGGEWRN 
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LK*MEP\LPPGLKEFSCLTLRRGGN*GP 
FPPLQENFGVFRKKGVPHCNPG* FQTPD 
PREIARLTSPRGGDGGGRPPPRVK 


6139 


20040 


A 


6192 


3 


158 


FFFLLLNCGLKKETG/WLGWAHACNPS 


6140 


20041 


A 


6193 


2 


54 


FFFBTESHSVAQAGVQWCDLGSLQPPPL 
GLKEFSCLSLLSSWDYRHTPPHPANLFV 
F/SFLVETGFRHVGQAGLELLTSGGPPA 

RPGWSAW 


6141 


20042 


A 


6194 


1 


323 


FILPS S WDCMCAPP /RPDWVLYFLVETG 
FHRVGRAGLQLLTASDLPASAS*GAGIA 
DGVS LTQCSMLPMLECRGVI LARYNLHL 


6142 


20043 


A 


6195 


619 


243 


S PETGS CSGAQAG VQ WHDLGS LQP * LGL 
K*SSYLSLLSNWDYRCA/PPRI J ANFLFF 
VQAGS CHLAQAGLlELWS S CHLPALVSQN 
AGITGVSHHARIiAVAWKRTPLYGLPSRI 

oiMij\_ v j_ij_irvr j_ir r iiix 


6143 


20044 


A 


6196 


385 


19 


GGHFGAQEISPPGVQLFFRLHFLSNWGF 
KRPPWPPG * FF / VFL* KRGFPP FGRWP 
PSLFP/SDLPPPAPPKGWVSGLRPCPRT 

SPCWPPWSRTPDLQ 


6144 


20045 


A 


6197 


498 


437 


ENLQS S INLVDKAMAWFBR I DSNFDRS F 
TTVKMPSNLTS I INACCGE I FHERKS * S 

KEP*TSR/QDRLLLAEG*D 


6145 


20046 


A 


6198 


79 


394 


SVKLIALFFFFFGENFCFAPQVEIiKGPN 
LG*LNPPPPGLKNFSGLT/LPKKWELGG 

LGLNQFACIiTLPKGGNLRPJSGP 


6146 


20047 


A 


6199 


2 


313 


SASHSAGI IGBSHCAWPIMPVYSLVLNF 
FFFLEKKPPFVTQLEGQGGNLG/SMEPS 
PSGIiKGIFCLGPPKKWEIjRAPPPCPGYF 
WFF * KKGGLP I RPGRFWKS VP 


6147 


20048 


A 


6200 


62 


732 


AFQQARKGTQ\GRLSSPAAGALP\PVLA 
PSGPPLPA*\GSSS\PASVAPYPPPQAT 
G/SPPTP PGELRRGGWSKGGLDHPRNTF 

Q/LPPSS IARCVP * TPSHVPPHPLAFPP 
SLPHPGPASRAPP/TGSSDLSASAVSAP 
GREGHP * GTRMVPQGAGRQKPGHRGAS * 
GSSCSKRTLSPLSPRQSGFQSRCCFHPP 
RQNHSRC 


6148 


20049 


A 


6201 


424 


7 


KNMKRSPISCVTRDPQIKITMR*HYTGI . 
KMANV/ RLTGNTKC* *GYGAKGPLIQC* 
WECKIV* PLRKTAWOFSOS /LKTV* /PY 
DPAIALPGI QLC* KCTS \KKNCM* I FIT 
ALFIMAKKKQKPRYPS IGEWKYKLLHL 


6149 


20050 


A 


6202 


371 


144 


RWALPRRTNF \ * F FVETRS LHVAQGGVK 
HPFELSFSLIEYEYIYLDFF 


6150 


20051 


A 


6203 


409 


110 


LRRS S * LCHLGWS AMVQS WPT * FKP * LL 
GLSDPLTSASRVDYRGMPPCPRNFI IFD 
M* FI * CVCVC/RTWGLI VLPRLVSNFWP 
EVLLHPGPSKVLELQV 


6151 


20052 


A 


6204 


15 


221 


KASRKMQ I ETTLRYYLSPMRLAKMQML I 



922 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SJEQ 10 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspoo 

Ulug W 

last amino 

dUU 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=PbenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

T=Threonine, V^Valine, 

X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NTSCW/ RKSQBTRTLIC*W * EGKMVQPIj 
S KG IWPYTjIGAEYP 


6152 


20053 


A 


6205 


23 


371 


KVACTCMFITVLFTIVKL*NQSRCPSTD 
DWIKKM/WLCVCIYSSVCVCIiYVCIDR* 
IERHDIYVYIMEYYIiALK* /DEIMFSAA 

LIMVST 


6153 


20054 


A 


6206 


113 


417 


SAWLLVIYIJSfDLKTYVHTKTCTQMFTA 
ALFIIAKI*KQLRCPS\IiIWIN*C*KEM 
SYQAMKRSYARNLWSSKKLWRNIiKCLLTi 


6154 

♦ 


20055 


A 


6207 


510 


167 


GLGQAI S PSLPKSQG IGAYGFGFFFFFK 
P* IjLVCCPG\ SIiNIjVAfiSOS S CLSLPKH 
WDHECSPPCLAFFFFLL*RQGIiAMLPRI 

QAQ 


6155 


20056 


A 


6208 


418 


40 


SYTHTHTHTHTHTHTHIP* * LTGS/RLQ 
GICRKGSRKHPRGIjSQIiLP 


6156 


20057 


A 


6209 


1 


258 


SGFLLS * VPS VRDPTGNRTV* LTWQPLP 
ADr.FJ?\r r PPT.sn 


6157 


20058 


A 


6210 


229 


310 


D FSEAQKSRQCGGG I SNVNHPGWLKPV 
IQHF*DGVLLCCPGWSAW*SQLP*LPR 
IiK*SSQLSLPR/SWDYRHVLQ\HPANCF 
VFFVEl^LAMLPRLVSNSWALAILLPQP 
PKVLDDWLBHHTWMiyiTYATTTLP 


6158 


20059 


TV 

A 


6211 


496 


1 


PGAGPTLNKRPFSDNPLQPGRRGGCFPF 
VSQRGSQKDRVPPKWFFFFFS F*DRVSIi 
WPPGPSARKPSWLPVASTSR\V*QSSCP 
PLWFPGPPTMVSYFFFF\KRQDLPMLPR 
T •V*NFWPOVTTiTi550P P KVIiPT^TiO AP A 


6159 


20060 


A 


6212 


3 


340 


VTAGIIIIFMI*EAFASNRKVLIV*EPS 
I *SLQV/ * WRGLLLLGSPPPYHTLEEPG 
YVTSKRKRKESNPPKLASSQPHGLHDFF ' 
lrwvuan ppttp t t«rr & p PRortwowp rpp 

l\JvJ\XvCx.vjV3fir E Ivln, X. 1 1 it oltvCt v *JVj\jyiVjnixCi r c 
P 


6160 


20061 


A 


6213 


60 


493 


KSQQNFFCFETESRSVTQAGIQWHHFSS 
LQPPPPGF1WISYL/SI*K5NFSGYP*DF 
FCNGPPCS*GPIRFTYYICVIPSSWDYR 
PI.PPHP ANFCVF \ I KTVSPGWSOTPDLR 


6161 


20062 


A 


6214 


414 


1 


LCNGRPGLLVSPLPLGEEIiIPCIiPR*KL 
GPPRVL* \KGGPFL*SSFAPFF*EMESH 
SVT\RLECSRAILVHCNLCLPGPSDPPP 
SAS * VAGR LAPJVQKVEAAVSTIDHDAALQ 
PEQHSKTPSQQTNKNNQTKLTQKYPF 


6162 


20063 


A 


6215 


2 


403 


SLIiPLVQLPI^CFFFFIiKRGFNFGAP 
DKREWGEFG\*LKPWLPG*NNFSGQTI i R 
RSGD* RKTPHGGVNLVF* EKTGFSNIiGR 
GGLKPPPSGNTPAWPPKGVGITGGTPPP 


6163 


20064 


A 


6216 


2 


388 


Ki^QTLWKTDWEFLTKLNILYPYSPAFA 
DLGIYPNELKT/ C/CYTNTCI * KS IAPL 
FIVARTWKQPGCPLRIX5WIjNKLGCIQAL 
EYYSP*KRNEL*NHN1^YKKLKLIFLLR 
QEDRFSLGVQEQPKHNGEM 


6164 


20065 


A 


6217 


363 


302 


NKEFKTTLANMGEPPFL*KRGFS \MLAR 
WLNSLFHDSPPPASQKAGIPGVSPPPR 
PLFFFFFLLETVSHYYAQAGQLISNYDL 
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Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
IXJIutamic Acid, ^Phenylalanine, 
<X5lycine, HNHistidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagtne, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
\V~Tryptophan, Y«Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KlaPS CLSLLDTWDYRH* PP YPVFKLLNS " 
TYKAC 


6165 


20066 


A 


6218 


576 


6 


RAVLRGLLVLIiLTKHTFPLFPPRQSLAL ~ 

LPRVGVQW\HDL\GSL\QPL\PPG\FKQ 

FS\CLSIPSSWDYRYPPPRPANF\*FSV 

ETGFHHVAQAVLELLTSSDLPALASQSA 

GITDLGQSETIHGGSGLDBCPPDHKVDKG 

PTEERACLLSYSAGRKYKEYHILPEQGS 

EPPHHGDILSIS FQAASHIAQTPPAHT 


6166 


20067 


A 


6219 


44 


496 


GFRVDPRVRPRVRLNS *PRDPPTSGSAG 
ITGV/SSHHARP*NIFT*KLGISFLVFW 
GFLFLFPVFCFFFFWKGGFLIiVAQVEGK 
/ WPKFGL IEPSPSGVKEFFFFNP PEGWD 
\YGLAPPPGVIFWVLIKNGVSFFCRAWV 
QTSGLSWINRLNLP 


6167 


20068 


A 


6220 


1 


421 


QRGRLTAI S RETS FTLVAQAGVQWHDLG 
SLQHLPPGFQGFPCLSLLST\TGNCSVT 
QAGVQWCHLGSL*PIiPPGLKPSSHLSPS 
SSWDYRL*KPHQGSD*VCVT\GAASPEP 
NTMTAWNKPPDIRMJCETLLLPPSSSEKM 
L 


6168 


20069 


A 


6221 


3 


36 


AYCIVKPSP * PLTGALSALLMTSGLAM* 
FDFHS ITLLILGLIiANTLT I YQ * WRDVT 
RESTYQGHHTPPVQKGLRYGIMLFITSE 
AFFLAGFF*AFYHSRLGPTPQLGGDWPP 
TGI TPLNPL * VPLLN\ TNE * hA * GVS I T 
*AHDP 


6169 


20070 


A 


6222 


895 


13 


ANSWANN+ FELATQ I AFSGRI PLTDSHL 
GLLKAHVEAPI \AGLI VLAAVLLKTGGL 
MLYTASHSFSNPLTK\HIAYPFPVLSLI 
EA* FITSSHLPSDKQDLKS \ LMAYSS \ I 
SPHSPS*LTPILIQTP\W\SFNGAGQFS 
I IAHGTYHAYITYSA*ANFKITERTSQC 
AC I NPAAQGLSNS * LQLIGLFE* LPRQA 
\SLTMGLTPPLINPTGENCSVASNQRFS 
/WTQISLLLPYRDSTIIiGSQALYSLYIF 
TTTQWGSLTHHINNIKPSFTRENTIiMFI 
HLSPIIiLLSIiNPDI ITGFSS 


6170 


20071 


A 


6223 


68 


405 


HSYALVYNPVGIRVKVHLSKCILYTWHI 
SNSSQEVCEMPNIiNI»KRVPTTIiAPRF*N 
IS*GISQLGAVAHTOJPSTLGG*GRW/I 
QQI*KTSIiANMAKPHLLLKKKKNFPNFPK 
RG 


6171 


20072 


A 


6224 


591 


283 


GTSTQAGVQWCNCGSIiQPPPPGFKRFSC 
LNLPCS WDYRRVP PHLANF\ * FLVEMGF 
HHVN/QAGLELLTSSDPPALASQSAGIT 
GMSHCARPRVAF*GEKNHRSL 


6172 


20073 


A 


6225 


302 


173 


GFFSFFIiKC*DR/SLTMLLRLVLNSWAQ 
V I LPLWP PKVLGLQA 


6173 


20074 


A 


6226 


417 


81 


TQTRGPPPPPFDGGPPSGPERTVPPPRG 
GVPQTGSRFFLFRQNPGPALFPKN/EGG 
F*KKFGPPIiPPF*KTPFFFTNRGNPCPF 
FVGENPQAP PVFGG FNP FPRKKKKRND F 
LS 


6174 


20075 


A 


6227 


41 


474 


TDATETFHFTSKH\HFGF*AAA*YWHFV 
DW*LFLYVS TY* *GSCVPHPA 


6175 


20076 


A 


6228 


503 


28 


AAHAPPRVRKVTGH*AFFTQRCTNSRH* 
1 1 KFPKDKDKARILKGAREK/RTTYNGA 
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Predict- 
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nucle- 
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ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G*=Gtycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














PIWLAVNFSVETLQ/ A/RREWHDMFQMP 
KEKTFYLRILYPMKIS FKHEREIKTFPD 
KQKPRDFISSRPVLQEMLKSWPGARVHT 
GNPTLLGG 


6176 


20077 


A 


6229 


418 


175 


VSPIiSPIoKILFFPKGLKFWREGGPKCPP 
PKKRVLPKK/PPAMF* IPPPKRKKSKLP 
PPRKIWPPQGIFKSPPPQFYFFFFFF 


6177 


20078 


A 


6230 


45 


308 


ACVRLKH*TDN*QPNIYNQP/HKSLLP* 














FKKKPGGGPISWGGEKIIFFFFGGGKKN 
TLGIF 


6178 


20079 


A 


6231 


440 


209 


I QHFRGPSRRGLLG P EARAS PGQHRBS R 
SFLKMQP\NKKFS*VWWCTPAIPASPKV 
EARBSLEPGRRRLHLTEALQPG 


6179 


20080 


A 


6232 


419 


2 


SSPPPPGGKFF*KKPGKKKFSPPKKKRV 
FSPP\PPKKFFFSPKA*NFWGGGGQKFP 
PPKKGVFSKKPKGVFKTPPKKKKKFFFP 
PGENWGPPRVF*KGPPPFFFFFFFFFFF 
FLSFNVNI LFKPVQAPdiT 


6180 


20081 


A 


6233 


448 


112 


QSARI TGVSHRTRPMLRTSTPLSATDRN 
AGQKI SKDIADLNNNEQDHRD I YRNSTQ 
*(X2NTHSFQAAIRNKIDPILGHKKMLTH 
/CKQSDQTWNKLEI SNKYAKPQKIKHTI 
K 


6181 


20082 


A 


6234 


2 


351 


MDQGII/SDFESYYLRNTFCQAIVARDG 
DSS DGS GQRKLKTFWKGFS LLD AI KHTE 
DSW/EEIKISTGVWKLIPILVDDFEGFK 
TSVEE/VADVMDLSRELEL*VEPEDGTE 
LLQSHSQT 


6182 


20083 


A 


6235 


3 


252 


FHTYNSLLVCHSNLCYQNSLLFFI IH* / 
CI /HFHS * AKVS IWKHTYTHTHTHTHTH 
MHTHFFQATDRKI CVNVYMFS VAYIHKG 
W 


6183 


20084 


A 


6236 


3 


294 


IITF*LPQLSGYIEKSTPYECGFDPISP 
ARVPFSIKFFLYAITFLLLDLE IA\LLL 
PLP*AIiQTTNLPLIVMASLIi*IIILALS 
LAYEGLQKGLD*AE 


6184 


20085 


A 


6237 


381 


121 


PVPASF* FFFFYRDR \ VSLCYPG WSQTP 
GLKQSSCLGLLSHWDYKYDPPHPAFISF 
LLSSSLLWVL*RPVVLIiCATVPAFLDIR 
EQ 


6185 


20086 


A 


6238 


395 


62 


PFFPIPKFFPWGPNFFF/PLGPPPFLGF 
FPGSFAPPSHFSPPD*NFPRPPKNFPPG 
GDLFPQKKGDWFFPKKRGVFFFLGGFSK 
FSGGPPFSU3PQKKKRFKIKGPKKFCF 


6186 


20087 


A 


6239 


1066 


761 


SFFLRWSFILVTQAGVQWCDLSSLQLPP 
PGFK*FSCLSLPSSWDHRRPPPRLANFC 
IF/M*RWGFSMLARPVSNS*P/CSDPPT 
PASQSAGITGLSHRAWRLVFL 


6187 


20088 


A 


6240 


390 


27 


GELPTPIYIAPPASPKGGVSGGSPPSRV 
KFFPLEVPPPPPPGC* PGPFFYIWPQGG 
ARVP*QQIRFFSLRW\PPRPTRPKNPHL 
QVGPLRI CLSPETGGGVIKVFRGSLLKK 
KKRAAASRSN 


6188 


20089 


A 


6241 


366 


1 


KI KNFDRP I /TSKGA* SW /RNLPTKKS 
PEPNGFPGVFCPTFKKKLTPVF/SKLFP 
KF*KEGILPNSFF*SSIALIPNPFYPSK 
KKKKTKKNYRAISUWIDEKIlTtKY*Q'T 
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GMSIycine, H a Histidine, I-Isoteucine, 
K«Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

0 = Glutamine_ R=A.i*flrinine S=Si»rifK» 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X»Unknown, *=»Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














E FRST/LKVLYT ITK 


6189 


20090 


A 


6242 


23 


439 


IASGRPFFFFFFFFFCLPPPPGGNPSFS 
FLKKKKKGGGGOENPPNPPGKAPNPQGV 
G VPP FP PP PVFOKNPP P P FFP * KNPPP S 
PKGPPGGPPFFFPPGGGGGPPPT/PYFL 
KKKGG WGGPPFF* KTNPPPKGEITPLVG 


6190 


20091 


A 


6243 


20 


261 


WAQEI LKKFS CLSLLSSWNYRCAP PHYA 
N\ FVFFVETGS YHVAOSGLELLDSRDPM 
TGESHHAQPHFC* ILYTGESQRIL 


6191 


20092 


A 


6244 


376 


76 


LLPRIiECSGVIKPFSHLSIiPSSRDYRCV 
/ PTMPNF* KFFVELKFNSL* P * GLTMT .P 
RLVSNSWCQVIPLPWPPKVljQGCATTPG 
LTDF I TTFKGYRHGDTY 


6192 


20093 


A 


6245 


222 


233 


KSFYFALIFEKNFCYVKKSQVGLVFFSK 
VKVLVRPFFPMVI KKNQP / FNPL/RFFF 
FFFODRVLLCHPGWSAVV*SKTIjS 


6193 


20094 


A 


6246 


2 


358 


EI ILTFQSYY* ENTFNEAITAIDSDSSD 
GSGQSKWKSFWKRFTIIiGAIKNICDL*D 
*V/KISTLSGVWKKVIPSLMDDFEGF\K 
TS VEEVT ADWE TAR * r.ET.RVEPFIWT P 
LLQSHDKT 


6194 


20095 


A 


6247 


3 


322 


PDHSLLQTPTPGFRRSSHLGFSSSWDYR 
IiEPPHSVCVCVCV\FCVYIYFFFFLEKT 
GFGFVPQAGLNLGVKRI FKA* PPGGVE I 

PGT»THT PfiPWCtfSP T R^fTO p 


6195 


20096 


A 


6248 


1425 


1280 


SI1DFQGLAI1PGWSAW* SCLTEASTSR\ 

L*RWGLTMLPRLGSNS*TQAILQPWPPK 
VLAL*ADEEMKAKGHISTKWGGQTLNPQ 
PDOGATSVYSLPIRKFQ 


6196 


20097 


A 


6249 


189 


336 


hapfpvsvislsc^krrrVwlamvahts 
npstlgg * wgw i a* gee fetsq 


6197 




A 




St 




FLPS FMGFWPVT/5HGR A<>T\7TP QRDT JtJ T. 
PFLLPAEPFLFFFFSLFRKRESIFGA/P 
GGLEGGQFNFLEPSPPG IKE I FWPNPPR 
GWELKI WPPPP INFCDF * EKTGFTMVAR 

wriiixJAV^rv^ a. t Alii \i Ul UIVlwL Xrl VXliV 

AGLIOj*TGGLEGGQFNFLEPSPPGIKEI 
FWPNPPRGWELKIWPPPPINFCDF 


6198 


20099 


A 


6251 


1 


198 


LTGRL/SKCEWSWRFDVQLKDLKRWQN " 

HL1VLSQQT^CIVLTSAGIMDHKEMR*KH 

TGGKSLGFFF 


6199 


20100 


A 


6252 


11 


214 


SLRLKSSSHLSLPSSWDLRCELPHLAN* 
KKKKKL/ CKN/RSLTGLPKLVSNSWV*T 
ILPPRPPRALGLRA 


6200 


20101 


A 


6253 


402 


3 


KKGFVLSPQVEDKDAISGSlFNPGPPGEK 
IPPPQPPEEVGPPGGPPQTRGNLGI /LG 
N*GGGGGF* PPGAGKSPAPGPQKG\PFG 
KDKREKPYGPPPKLFFFFLRQSLCFAQA 
GVQWCDLGPLQPPPPRFKQFCL 


6201 


20102 


A 


6254 


362 


78 


VGVN/MTFFPQHFLGLSGI PRRYSDYPD 
A YTT*NWSSIGSFISL , TAVLIMIFII* 
EAFASKREVISVSYASTNLE*LHGCPPP 
YHTFEEPTYVKVK 


6202 


20103 


A 


6255 


357 


2 


Dl^CHSHGEATQKHSEKCKASCTHMFIT 
AUFLIAKHWNQPTC/PHTMEYNSVIK/N 
EQTIF1TTTWMDLKCIMRHEQRQS *KNS 
VLVCFHAADKDI PETGQFTKERGLMDLQ 
FHVAGEASH 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
i^Hjriuxamine, **— Arginine, o — serine, 
T=Threonine, V=ValiDe, 
W^Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


6203 


20104 


A 


6256 


2 


305 


QWCNHS S LHSNPGERETVSSLKIiPKAWN 
PRHMLLCPTTSFILRWGFIYVAQAGFDT 

DIACFHRESTMFSAENFN 


6204 


20105 


A 


6257 


335 


3 


GPFQSPPrcVTPISCFTPPKKGGNRGPP 

OP D r* V Pttf T W VT^flCZ^ P * V P 1? \TT .PPnPf? 

VPPPRPLKKGGLKGGAWAKIFFFFFLRQ 
SHS FTQAGMQWCNLGSLQP PPPGFK* F 


6205 


20106 


A 


6258 


219 


2 


rlLoDvM/lJXAyiujxor rui l xrujJLivuv 
TLVS/WPGWAYTFIPSTIiGGQGRWVT* 
AQKFETSIiENTVKPPSC 


6206 


20107 


A 


6259 


306 


117 


SFBLLGSSDPLISALSSWDYRCAPSRLA 

xri7*T?'c r \rnn?Mr , CQVT\7rv /<?UDn*TT.T.QW 
Bit? * r r VV- V CiPlijoo l ivy/oHry'' liujon 

PPKVLRLQT 


6207 


20108 


A 


6260 


572 


269 


ETDILSVAQARVGGLHLGSLPAPPPGFT 

EMGF\TVLARIVSIS*PRDPPPSGLP*G 
ALGLPGVSPLRPALCLLLK 


6208 


20109 


A 


6261 


357 


116 


FAQAGLEI^GSSSPLASASQSVGITGMS 
LSSWNYRHEPPHPANLCIFSFNTISY 


6209 


20110 


A 


6262 


3 


337 


HERHERHERHEQSAEITGVSHHAGGGIG 
LLNFFLFLFFEAGSCTVT*T/GV*WHDH 
GPQ+PQIPRFK*SSHLSLHSSWDYRHVP 


6210 


20111 


A 


6263 


515 


150 


FLRWSFTLVAQAGIQWCGVSSLQPLPPG 
FKQFFCLSLPSSWDYRCPPPCPTNF\*L 
LVEMGFHHVGQAGLELLTSGDLPASASQ 

FLLLRVLNGL 


6211 


20112 


A 


6264 


1 


296 


GTSFLPS FLP/LLS FLKFLNFFQI VSHC 
RPGWSAVA*SQLTAPSI\FGLKPSSQLS 

PT COWnVOPDTM5'17\ OT DMT PPT.VQMQ* 

PQAILPTQPPKFLGLQA 


6212 


20113 


A 


6266 


298 


182 


LHFFEVSKIIKLIRAEAKWCL*VLFTIA 

l\-t.W ft l»n^ ™ XV-lYl IjO JLUJDrliiVruTlW X Jinx V 

SYYSAIGNKEI/PVIVTIWIYMENIMLS 
PTN* *OKDKCMIAI l L*GI YIO 


6213 


20114 


A 


6267 


402 


234 


PPPPLFKNPPPPPFKNPPPKKKKI I FP/ 
SPPVNLAPPKEF*KGPPPFFFFFFFFFF 
FF 


6214 


20115 


A 


6268 


125 


17 


KGQAQWIiTPVI PALREA/ EEAGG* LEPR 
SSRPAWAI 


6215 * 


20116 


A 


6269 


1109 


454 


GDSAGAEGTMENFTAIiFGAQADPPP PPT 
ALGFGPGKPPPPPPPPAGGGPGTAPPPT 

2i2VT2iPP/3anifQr3af3fy3PPYTJVTRPriPf3<3T 

ELTGSTl^ITHxTOiEQAxmFCGKKVKE 
KLSimiPDLxKSMIDLPGSHDNSSlxRSLI 
EKPPILSSSFNPITGTML/VRLPPPHWP 
VAGAVSSDAYSASQEEE*AQAQTEPYPG 
SC!PPRNTT * FRSOEEEKEKRRGS 


6216 


20117 


A 


6270 


111 


86 


VVGITGVSHCAWSRNGTLTHCVWECEKV 
/QPLRKTV* QFLMLKI \PYQMI * PLLGI 
YLRKNKAYIHMKTLK*MFSLFLFEIAK/ 
TWKQPKYPSASBWIN*LDSFSLYWLGWS 
RTPDLK 


6217 • 


20118 


A 


6271 


2 


222 


RSCESNSQTSACYKSFYLYPLKS/GGFK 
I LERM * P I YFL I P YVVTI FNQRYYFYTW 
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E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
V=Olutaraine, RpArginine, S=Serine, 
1 — lhreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possibIe 
nucleotide deletion, \=possib!e 
nucleotide insertion 














NLKKTRGIYTVLYKMYKSF 


6218 


20119 


A 


6272 


787 


470 


LPFFFS FFETGSHCWRWERESLGVAQVG 
VQ*CDLRSLQPPPPRLKQSSHLSLPSSW 
DYRCAPP* LDKFFFC I FCRDGFHHVAQA 
GLKLLSSG\NLPAS VS PSAGTTL 


6219 


20120 


A 


6273 


2 


322 


FCMAIAAIDSSSSHGSRESKLKTLWKRF 
S I LDAI KNTCDS WEEV/ KVST * TEVWKK 
LIPTLTDDFQRFKTSVEEVTA\ELLWE 
ITRB IELEVELDDVTELLQPHDQT 


6220 


20121 


A 


6274 


175 


406 


KNSLGGPNLTGEGKKITFSIj*NFFFSLE 
ALIFVGRLSSFVPPPKIRFFSKNSRNLF 
/CPVGEFFKTRPPVNFFFSLP*RKSYFF 
SFPG* IWPP*GIFLKAPPFFFFFFFLRE 
RREVSMLPRLMSNSWVQVTPPPWPPKVR 
VLLGIVDVQ | 


6221 


20122 


A 


6275 


2 


181 


PRVRGRVGHRKGKKSF VHCWWQWKLL* P 
LWKTVWRVLKTLK\IELPYYP/ATKLL* 
SYPLE 


6222 


20123 


A 


6276 


397 


47 


PPPSKNFFFPPQKDFFLGGLPKKSPPPK 
K\FFFKKTPKKNFPPPSFFKKPPPPAFI 
KTPLKKKKKFFPPPQKFGPPPKFFKKPP 
PLFFFFFFFFITTAP * KITKS I P YFYFF 
TKTFYL 


6223 


20124 


A 


6277 


2 


264 


ETESHTVAQSGCSGAI IAHRSLNLQGST 
NPPTTAS * VAGTTDACDHAWLMF* FL\ V 
EMGSHYVAQAELELLSSSDPP\PQSPKI 
LGLQA 


6224 


20125 


A 


6278 


3 


204 


I RCYRKI FYERKHQWLQQISLLS YFKKIi 
PQP\ PPL*ATIH,ISQQSLISRQDPPPV 
KR**LTEG*DDC 






A 


6279 


57 


306 


INKANRDKL/GTIDFLiKb'L'riKVRKMQN 
TKTESKKRN*ESILHVE*EDTGHVRAAA 
VCRDMTYLGEKLTDEEVDK\IIREAKYW 




20127 


A 


6280 


419 


18 


SSSS PKACMFVYEWVCTQKCVWI YE YMC 
VFVYV*LVLMCCMNVCACM\C*SACV*M 
CVTCECVCM*GLSVTCEGVCM 








6281 


459 


235 


s liRNGV S CCHPG WS A VVQS * LTAASNS * 
AQVILPSSWDYRP VTSHPAKTHWFYMQ / 
CGNKKMCVTHSIVTFSLLQ 


6228 


20129 


A 


6282 


505 


470 


GVP*GQEFEPSLSHMGRPCFFIiKCKINP 
AWWQKLF* PRRQKFRLTQI/LALAPQPG 
QQKVTTFFS / NKKKKKN PNNC * IALR 


6229 


20130 


A 


6283 


206 


374 


GGGGGKKKWWGGEKKRGGEKKIFFF*SP 
PRFFFSPPYKKKKFFFPPPFFFPPPNFF 
FYPPPPPFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFLNHFF\FF*LYVIIV 

T LWIIPnTT kjfTJTl 7TJXTT7 


6230 


20131 


A 


6284 


431 


269 


YIKGGPPPEIETFLIFFYYIYLIIFIFF 
FFFFFFFFFFFFFFF* FVCLKYL I FNHI 
QRF*TYISIPHFMLLM\YTLHLFTFCIC 
SLFTI I WFYIFV* L* YCCTVKALCNSL 
LVSELDKYIKNNYNCKQ j 


6231 


20132 


A 


6285 






YLFFFFF*VEIiVFFLFCMFFFFFFF , TS , F , l7 
FFFFFFFFFFFFFFFFFFEFKNKVSFNS 
CFSLSLFKLYFLKYWFIQLQGNLHFNNI 
I 


6232 


20133 


A 


6286 


386 


1 


ANLHIKVNFWNFTLLNFSFLPWFGFH 
FLW*RVYLSHRGFYVFFQNFLRIFFFF 
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SF*VG\FFFI»FPLIjFFFPFFFFFFFFFF 














IEEPRTRGRTRGRTRG 


6233 


20134 


A 


6287 


190 


134 


LEFGRNRFQFSRMILRGSGLQWRNYADM 
VKIAKT* YLROTFHKVIAI IGGDSCDGS 
GQNKLKTFWKGFS \I*DAINGIHNSQEEI 
KMATLTGVWKK* IPVLKDDFEGFRTS VE 
E/V/TADMVKIAKT\LELEVEP\EDVT* 
LLERILF 


6234 


20135 


A 


6288 


394 


2 


RRQASLSCGGLHPVRSSRLIiCLPKQAWA 
MAGAPPPASLPPCSIiISDCCASNQ*DSV 
GVGPS KPGVGYHLLVHRFLS PLEKRS IR 
VGVARFSRCKLSPLSLTRKGN*LP\TPC 
ASRVRQCLTLLQliAHGALHP 


6235 


20136 


A 


6289 


233 


58 


DRVSLCRPRWHWSRTRLTVASTSR/IiQ 
AIIiPPQPPE*LGPQASANRSPRSPKLTG 
LRG 


6236 


20137 


A 


6290 


385 


48 


TLQYLCTPTSTV* KKKKPENIKYW*QCG 
ANRTLFQ * WQVCKVAQTLWKP WHYTVL 
NLNTOIVYDPGMPLLGTNNNAYMCCI C\ 
YQKTFTRMFIAALFE ITPK* KMHKYLTV 
VG 


6237 


20138 


A 


6291 


273 


36 


RNGRWTPCGGR* IHPS*GTDPVYLFVPG 
TLSSSQASNSHPIiLKAPGSRP\PWPTDL 
GYKDPRGP PSNPACRRRPS LFHSD 


6238 


20139 


A 


6292 


394 


5 


AKIYFS INPLVFSKQ*FLKNVFPNILHL 
QFFSNINSLMLIKV*ALLQDFPLVHNVY 
NKICDTSKDTTTLFMYT^FVFKMKTFFS 
IiKAYI /WSGTVTHTYNPSTLGGRGMWIT 
*GQEFKISLANMVIASLY 


6239 


20140 


A 


6293 


536 


190 


RRS FAIiAVQAGVQWRDLGS LQ P P P PRFK 
RFSCLSLPSSWNYRHAPPRPANFVFLVE 
TGFLHVGQAGLBLLTSGDPPVLASQK\A 
GMTGVSHRVQPD *GHGGSRLSSQPFWEA 
KTGGSPEVRSSRPA 


6240 


20141 


A 


6294 


2 


238 


HEADLGTTQVLTHPSTTAMYFVHYCQPP 
* ILYGTINT*PPVVHKNPIHIKTP/SPC 
IiQASTAINPQLSHINC\NPKDTPH 


6241 


20142 


A 


6295 


2 


447 


FFFLMGVSLFCPGL / WSAMGAI LGSLAT 
SCLLGVQVILLCLSLPSSGKLQVCAYPH 
LG*FLYF*AEDGGFSPCWPG*VSNS*PQ 
VIHLPWAPKALGITSMSHCCPACFHCTF 
FRHFRQNPHLNLFFFLEGIiGLSPPPPSQ 
PLEGGKGPQS[ 


6242 


20143 


A 


6296 


2 


133 


AEERERERERERERE \RERERERERERE 
R*VERERGGVPLRTYR 


6243 


20144 


A 


6297 


3 


244 


HDYSSFLAPPALGYMRMPLHPAYFILFY 
FIS/FYLFFSQDEVLLC*LGWSQTPBLN 
QASCLGLQKWWDYROSPQCLAECDS 


6244 


20145 


A 


6298 


1 


361 


GTRRSAMLARLVLNSGPQ/ CYLPALASQ 
RAGITGVSH*GCPFFSLLKKRWRSHCVA 
QAGVQWFFRG*TPGLSCFSCSLPSN*DY 
KCAPLCPP*DTVF*VRAVSSEVPLTITY 
RGEGAGVWA 


6245 


20146 


A 


6299 


3 


348 


HERHKLEELIKNLFFFFFFGKKIFFFFP 
GGKNGGPF*FIGAFSFRPQGIFLPYPP* 
KW\DWGPWPPCRGDFWFF*KKTPFSIIiA 
RWLNS P PWGILS PYPPKGVRFPGGTPR 
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E=GIutamic Acid, FHPhenylalanine, 
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nucleotide deletion, \=possible 
nucleotide insertion 














PGPV " ' 


6246 


20147 


A 


6300 


1 


436 


VFLVGIGFRYVGQAGLELLTSGDLLASA 
SQSPGITGMSHCTRPRKHH* IANSSSVL 
GDFRIDYPQISRKAQWITLGYNPWAESH 
LLMGAGLRVLPAPRLMI SQVDLLPATHP 
SSGRRG 


6247 


20148 


A 


6301 


370 


70 


PKG*STLPASASRSAGITAVIL*SHCTS 

PPM* LRFKHPTHSSTSTPF/CFFFFGKK 

ILI FVPQPGGKGPNLS * LKPLLSWLRKL 

PCTjTIi*PGC3'nYPf5VPPPP\ ciVPr'PWTTrw 
rWJiu AOOt/ x tr+ZtE C trtrc \0 X r X; X XvJV. 

GV* PCGPGGNS * PAS ASRSAGI TAVIL 


6248 


20149 


A 




i 
i 




mxtxc cue x xur KyKVy v ^ironol V \in/ o 

L*SWL1JUISN/<^SK*FSHLSLLCHHA 
LLVDFNFL * RRRLI ROPRLVSNTSWAOA T 
LSSQPPKALBLKT 


6249 


20150 


A 


6303 


194 


351 


GTFICIIOCPVQ/WLDTVAx^CTPSTIiGG 
RDGRVT * AQEFOTS1VGNI VRPHVYKK 


6250 


20151 


A 


6304 


354 


211 


YRHMP/ S *LADFFI FCRDSLFMLPWLVIi 
NTi* AO AT T.T .POPP TTUT /JT/^P* 


6251 


20152 


A 


6305 


1 


346 


GTSS LQL ISLHS S PTAS YPPKHLHTRLT 
ELl^VSLMAIiVLSVSPATSS /PILQPHP 
x xjtrv^x nxrxr inoououiiVJro^yiiiini x X 
HTHTRMCAHGSGHATLPP * HPLHPRHSA 
YLLW 


6252 


20153 


A 


6306 


395 


177 


WLNTLWY IHTVDYFLAMKRSDLLIHS P W 
MNLQGIMLSEKRQ\SQR*HW*FYLYNN 
VEMKKSQKWRTVLWLPE 


6253 


20154 


A 


6307 


2 


718 


GVTjPYWPGWSRTPDIjV/HPPTSASQIAG 
ITGKSHRALP \ KFAFLTS S QVMLTLLAH 
GTTAWFSKLS * F * EPPMTCAVFSWTG 

OH TW^ RT? TOTfT . * WCVB*P C r*XFWr , C M T 
av^iiuo r KXul v*u I lovr laLrXLaMl 

EKLTTGIS*QKEHELPFSQSSDI FKLAY 

LIFFW/CVVEMESHSVSQAGVQWHNLSS 

LQPLPPRFK*FSCLSLPSGGDYRCPPPH 

MANFCI FSPJDGVIiPC* PGW\ S *TPrjTjRM 

DPPGLRPFQSAGDLTGMEA 


6254 


20155 


A 


6308 


377 


67 


PPPPKIFFFSQTLFFFWGGCPQIAPPPK 
KFFF*KNPQGFFFPPPKKKKIFFFPPPK 
FLPPQKFF*KPP/LSFFFFFFFFFFFFF 
FFFFFFFPTI * I FYSMSFLP 


6255 


20156 


A 


6309 


1 


362 


QTYAKMHVTI I FFGVK\LTFLPQRFIjGL 
SGMPRRYSDYPVAYTT*NIIiSCVGSFIS 
LTALILI IFMIREAFASIRQELIVEEPS 
INLB*LLGCPPPYHTF*EPGYLKPKRKR 
KESSPPKLV 


6256 


20157 


A 


6310 


2 


411 


VLPTLWEAGARGS LBARSSRLQ* AMIT \ 
HCTPPWAT 


6257 


20158 


A 


6311 


59 


334 


THACGNMLS YAGS F I S LTAVLLI TFM I * 
EA\FASKRKVLIVKEPSINLE*LYGCPP 
PYHTFEEPVYIKSRQKRKESNPPKLVSS 
QPHGLHDL 


6258 


20159 


A 


6312 


3 


22 


KPSP + PLTGALLDLLMTSGLSM+ FNFHS 
ITLLILCLLTNTLS I YQ*WRDVTRESTY 
QOTHTPPVQRGLRYGIILFITSEVF\FF 
AGYF\WPFTHSSIiTPTPQI*GHWPPTX3I 
TPLNPL*VT>LLOTFVI J LAI^VSIT*AHD 
P 



930 



WO 01/64835 



PCTAJS01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
VSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sentience 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino add sequence (A=Alantne 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


6259 


20160 


A 


6313 


11 


182 


INKMW*I*VRG*YSVIKK\DEILSFMAT 
WMBVEDILLSKIREAQKVTHPVFSFISG 
S 


6260 


20161 


A 


6314 


3 


421 


IQASTMAGAPPQASLLPCSLILDCCASN 
S /RGSVGVGPSEPCTOYNIiLVCLLLRPS 
EKCSIRVGVT*FSRCRPSP/TRKGHSPT 
PCTSRMR 


ozoi 






OK 

Oj 13 


i 

L 


1 1 1 


DKGMNVFMPANTASVLWPMDQGVILNFK 
SFYLRN\TFS KYIS AIDS DSSVGSGQSK 
FKTF * KGFTNLDAIHDS WEEIKIST*TG 
VGKKLI \PTFRDDFKGFSTSVAEVSADV 
VEIGREIj*LEVKPGDRTBLL*SVLWPMD 
QGVILNFKSFYLRNNIF 


6262 


20163 


A 


6316 


440 


312 


TSSFSLSFF/CQDG/SLTMLFRLVSNS* 
PQAILPSWPPKVLGLQIi 


6263 


20164 


A 


6317 


5 


404 


PRYMIVKLLKI/ERNKIIiEAAREK*HFT 
SKGKAIWI ITOLSSETMEVRKNLHNFQV 
LKEK/NKTWEVRHjYSAKISFQBOCGKIK 
IFSDEGKLR*SFTSRPTLRE* *RKFSK* 
K*NDRRRNLKTFGRKKEYPKELKP 






A 


DJlO 




182 


MFIVA/VFSKAKEWRQPKCLSTDEWV/Q 
YM* YIHTMKYYSAIKSNELVIHVAA* IL 
FENTVLSERSHIWHGSHI 


6265 


20166 


A 


6319 


1 


378 


APSDFFLFQSLNKPLNCTHFSSVHNVKK 
TSLT*LNSC2NPQFIiRDEPNGWYHCL/QK 
YIiELDGAYVKK 


6266 


20167 


A 


6320 


286 


314 


NRCCFSLPTQVKCFNLYLGICCVCDWGY 
YRWIiNF*NPI F* FI LRQFHSVAQAGVQW 
RDLGSLQPLTPRPANFYILVEMGF/TPL 
LARLVLNS * PSLIoKYKN 


6267 


20168 


A 


6321 


23 


397 


QKTLNMEKEMNIYIHKM*ITPNRSNIKR 
SSLRHITVKFSKVKDKQRI /MKLAIEKQ 
PITYKETTIRLSEDFSAETL/NARR*WD 

SLGGPILTGGGKEIF 














6268 


20169 


A 


6322 


411 


2 


TEPPPPGGNF\L*TKPGKKNFLPPKNLG 
FFSPFSPIiKFFFSPK\P*ILGGGGAPFC 
PPQKKGFFPKI PRGVFFSP P/ YKKKNLI 
FPPPVILGPPKVFFKGPPPFFFFFFFFG 
QSGQVKLKSPKCK 


6269 


20170 


A 


6323 


395 


280 


MIARLVSNS * P \CDPPGLASQSAGITGM 
SHHVPQVNHF 


6270 


20171 


A 


6324 


3 


359 


QHYLVFRETQIKAAVRCHYI PTGMAKI I 
KTGSTKC* Q* CGAAGTLIHCWEECR\ltK 
QPLWKTAW* FX I * MLCD+TVPFLQIYPT 
EMKMCV CRMTC\ RVFLSVLF I IASSWKQ 
PKYHKLVN 


6271 


20172 


A 


6325 


3 


174 


FFFETGPNFVNQAGGQGHDLG*LQPPPP 
\GSRI FPA* TSRVAGITGLNHHPHPLGS 
IF 


6272 


20173 


A 


6326 


265 


13 


TASPIRI I ITSQDVLHS*AVPSLGLKTD 
AIPVRLN\QTTFSDTRPGV*YGQCSEIC 
GANHSFMPIVLELIPLKIFEIGPVFTL 


6273 


20174 


A 


6327 


32 


259 


CYYEHLCTSLCVNWYFHFF*VYT*EWQW 
PKYLSADEWINKM*YIWTVEYYLTIKIS 
EVL/SATTWLGLBNIMLSKRS 


6274 


20175 


A 


6328 


1 


228 


PGU1HCSHLSWDHRHALQCPANFG\YFV 
ETRFTVLPGLVSNDPPALASQSAEITGV 
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SHCAQP*VCFHYCLTFFSLQ 


6275 


20176 


A 


6329 


346 


1 


LPKKKKKKISRGLFPFQLPKKKKDNEKS 
LTFS SETMPGRRQWS KIFKVLRGKKQPQ 
QPRI ISLAKLFLKSEDFL/ etqklrbfv 
ASRPALQAMLKEVIi*REGK*YRSLFKKG 
RALK 


6276 


20177 


A 


6330 


349 


77 


HTTPKKNKHN/RKKI KLDVLRYNKVDDN 
SKMSGLH*QCPSDECDAGVFMASHKQ/F 
DRHYCGKYGLTYCFNKPEDKAQHCGSLE 
PRSLRPAWAT 


6277 


20178 


A 


6331 


319 


46 


RPRRPETK* ALNK* KDlJ PMCS WKQRHN 
TVKMLILSKLINRFNAITIKISTE\FHR 
NKLILAI IWKCKGPRIAKTT* KGIKLEE 
LLYLISRPP 


6278 


20179 


A 


6332 


496 


309 


drataf*rqsftlsl/ragmqwhdlssl 

QPPPPGFK* FSCPSLQSSWDYRCPSPRT 
G*F\CIF\VEMGFHHGGQAGIiELIiTSGD 
LPASASQS AGITGVSHCTWPN I LNWK 


6279 


20180 


A 


6333 


3 


173 


HHHSRL/IFFCSVLGTSSHYVA*AGLEL 
LGSSNPPDLASQNPGITYGSHYAWPDI P 
VF 


6280 


20181 


A 


6334 


339 


153 


FSPPPPFFKGFFFPPPPPPF/SF*PLRG 
GGPPFFFGPPGVRPPPSPWSWCSPPPV 
VLASFFFS 


6281 


20182 


A 


6335 


32 


331 


QLLNHSVI VFLLS FDHS YFLRNIiSLFLA 
FKLIGVKNS*YIILSFRYLLHP*HLLNF 
FFF/CKRQGLPVLTRLVSNSWAQGICQP 
WPPKVRGLRAWSPGLSL 


6282 


20183 


A 


6336 


226 


177 


DGVSLIiLPRIj\GGAISAHCNHDLPGSGD 
SPVSAS * VTQLQYYFCICLGLFTFrrRL 
SAYLGTGKLTTALPNTPGLLDS 


6283 


20184 


A 


6337 


302 


59 


KMLLNTIMC2NRE I h/RKRKS PRVHQTS S 
VS YFKKLPQPPQHSAASTLI SQQPSSS K 
QDPPAAK>L* FVEGRIiIAFPSNNVF 


6284 


20185 


A 


6338 


197 


28 


AEKVRFSSAVTWHLGSHSL* * SKKNL/W 
LGMVDHNCNPSTLGGRGGR I A* AQAFET 
G 


6285 


20186 


A 


6339 


46 


317 


EVPSS ISSQISLFSGDFSS VPLEELSNL 
FFMFCPEHI IVTCERVTSNYS /YYKWKP 
EFSYP* IF/CFKMSLTMLPGLSLNSWAQ 
VIIPPQPPK 


6286 


,20187 


A 


6340 


4 


276 


QTTLRGTWQITKNLSSETMNASRCEKK 
NL*SAWGWGMWENNYQPILYSAKISFKN 
GVBIKTFTN/ KQKIKN/FITS ITLV*BI 
LKEVLQAEG 


6287 


20188 


A 


6341 


485 


2 


GVWEFLETESRSVAQAGVQWCDLGSLQA 
PP\PGFTPFSRLSLLSSWDCGSPPPHPA 
NFFFVFFVETGFRHVSQDGLDLMVSIS* 
PREPPASASQSAGITGVSHRARPKHSKI 
L IAGSRRS IQMFPVKFFQKLAWENFQN 
TKDFLNPYSEILLHTSQIHSSC 


6288 


20189 


A 


6342 


2 


339 


EDLRVKLKKEGYSNISYIVVNHQDPMLN 
SNGSVTWALLQAS * YLCILQAS * ASSQ 
SWTS/VTLAAVSFQRISNQEHI PVYQQE 
ENQTDVWTLLNGSKDDFLIYDRCX5CLVY 
HL 


6289 


20190 


A 


6343 


181 


12 


RWGLPSGWDYRSMPLHPVNF* 11/ CMRQ 
DFTVLPRLILNSQPPVILLRWPPKVLGL 
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K=Lysine, L=Le urine, M=Methionine, 
N=Asparagine, P=Proline, 
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nucleotide insertion 














QA 


6290 


20191 


A 


6344 


3 


295 


yWHLSCF*FIAITNKAAMNIRLLWTCAS 
IFPR*RSRKEMAGS*GS/SIiFNFVRNCL 
TI FSHDCTTL/HS /HQQWGGV / PVLPFP 
HQHLL»*SIFLVFDILIGI 


6291 


20192 


A 


6345 


334 


8 


KPSNPGGEGP P * KKGPKVFS P WGKKENF 
SPKKKKKI*CPQKPKGPPWGAPAFKKKK 
NF*NPRGSPQL*KKRGIIGPPTIiIPPSS 
QRPPPKKKKKK/ GPPKGALLAFGGIKFF 
FFFLEKNFLFFPKGKTLLAPFFKGGLPL 
PG 


6292 


20193 


A 


6346 


68 


256 


WVA\PVTHTOTPNTLGGRRGRIT*AQEF 
KTSIiGNIVGPHLYKNLKKKKKKKGEKKP 
GGGFSPL 


6293 


20194 


A 


6347 


385 


60 


KKKKKNRTPYDAAI/PSLGTSSKEFKSG 
SSRDICTPMFIA/AITKRWK*LKCPSMV 
NI/VKKMWFIHMMQY*SAKKKEILS*MH 
LKDVMLSERMLSQSQKKNFCMIUjIWGI 


6294 


20195 


A 


6348 


234 


61 


CHR I EH I KMVNL/ WLDVWHTCNPSTLG 
GQDTWIA*AQEFKMSLSNTAKPCLYKKY 
KN 


6295 


20196 


A 


6349 


2 


358 


FLS LG IHFNYCLVTTES KLS KLRLTYF F 
FFFLETEFPFCSPA* RPW\PNLG*LKFP 
PPGSRETPALTLPRTWNNRRAPPHPANF 
GFLRKNGV* FFGPGGFKTLDLGGPTRLW 
FQKGGNN 


6296 


20197 


A 


6350 


346 


27 


KMV*SLWKAVWQFLIKLNIH*PYDSAVP 
LIG/IYPT*MTT*IYTKP*TQMLAAVSF 
IMAKNTRNESNIFNWPIDKLWYIHIMEY 
**AFKRN*LLINTKT*MNLKC*LF 


6297 


20198 


A 


6351 


36 


158 


KNSVPSQE*LYYL \RQENRLNSGGIGCR 
EPIWCHYTPAWAT 


6298 


20199 


A 


6352 


2 


146 


RSSHLSLPSSWDYRRAPP/H/LANCCIF 
CRDR/SFTMLPSLVLNS*IQTILL 


6299 


20200 


A 


6353 


74 


303 


EPSICGQI YLSFS IYKIQK* / WPGAVAQ 
ACKTSTIiEG*GGW IT* AQFKTSLGNMMK 
P CVFS KKKKKKKPHPGGGGRKK 


6300 


20201 


-A 


6354 


3 


210 


NNKKIWTLKNLKFFRDRVLLCFPDWSTA 
VHSRLTAAS \TLGLKQTSCLS LVS S * DH 
RCMPPCLALiKRKKT 


6301 


20202 


A 


6355 


2 


627 


GRVGGFLVPMRDLAEASRGQQGQVKLGE 
MKPSVSCVOGGV* CISKFKLKN*L\WPG 
TVAHTCSPSTLGGQGGWIA*AQEFETSL 
DNMVK\PCLY 


6302 


20203 


A 


6356 


584 


166 


RQGHNVAQAGVQWRDLGSLQPPPPGFK* 
FFHLSLPSS WD YTRVPPHIANF\ * FLVE 
TGFHHVGQAGLELPTIiSDLPAWASQSAG 
ITGVT/HQARLQLL*KELINPSTLGPTW 
ELLEAWGRCPSWMGDDLALNCFHHFSV 
S 


6303 


20204 


A 


6357 


448 


202 


FTTLVSECI*DIHDSWKDV*ISTIiTGAW 
KTWLI PTLVNGFEG\FKT*VEEVTADVV 
E IARESELEVEPEDVTKLLKSHDKS 


6304 


20205 


A 


6358 


3 


237 


DRSSL* P* PPGLK* S YYLSLPSGWDHKR 
A* PHPANFCLFTF / V* QRGLDLLA* HDP 
PASASQGN* ITGMSHRAGPPLTF 


6305 


20206 


A 


6359 


49 


242 


SSYLSL*P*PPGLR*SSYLSLPSSWDHK 
HA* PHPANFCLFTF / V* QRGLDLLASHD 
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Ducicouuc insertion 














PPASASQSTS ITGMSHRWPPLTF 


6306 


20207 


A 


6360 


508 


223 


FVTQAGAR* LDLS SLQLPPPGFKQFS CL 
SLPSGWYYRHVP/STPIVF/CIFSRDGF 
HHVGQAGLELLTT * STCLGLRS AG I TGL 
SHRAQQERGFEP 


6307 


20208 


A 


6361 


2 


595 


AIWG*GTHFLIPWVQKPIIFDHRSQPH 
NVPVITGSKDLQNVNI IPCILFGPVTSQ 
IjPK I FTRIGED YDER VLPS I TTB I LKS V 
VARFDAGBLI TQRELVSRQVSDDLMERA 

atfglilddvsi*thltfrkefteaveak 
qvaqqdaera/rnslatagdglmelckl 

LPQ 


OJUo 




7V 
J\. 


oioz 


1 


259 


T OCDVT?T T WW* npinr o T tmr TT dTB 

Lu£o r I r JLtLi i ¥ I o» * JriiKXjKRJjXIjr XF 
ETECCPYA\QLECSGT I SAHCHLCLQGS 
GNSPAPAYRVVGITDSFRITHCXLTCIP 
SE 






A 




60 


221 


N 1 1 cLLfeM VAH 1 CNPt»IlAjARGGRIT* V* 
EFKTSLDNIVK\PCTPPKKSLKIKLN 


6310 


20211 


A 


6364 


1 


393 


RPRRPQTGVQWQNLGSLQPPHPGFKRFS 
CLDLPSSGDYRHTPPCPANFSGFS /HHV 
GLKLFDSSDLPWPPKFVGIARHBSHRKC 
T*RSRRGRRXfliJPIiER 


6311 


20212 


A 


6365 


19 


85 


CINSKIMQSSRITTIjCKMVQPL*RALWQ 
CLIKLNI PHPYDSAIPLLHFYQCK*RHM 
NPHN\TFTSMFITALFITAKN*KQSKCL 
SIGQWMKK* *CIHTVQ*NSGI * *HE* IN 
SKIMQSSRITTLCKMVQPL 


6312 


20213 


A 


6366 


2 


298 


PPRIiAKVQKFLLFFYLVF*DRVTLCHPG 
WTAVV*SQLTVTLAHRIfIiRLPSGWDYRH 
VPQRLADFVFL/CR/DRGFSMLVGLVSN 
SRPQEV*ATSPSQSLLS 


6313 


20214 


A 


6367 


67 


636 


QDNKKGIRYLFVYFRQSFALVAQAGVQW 
HUliGbliU PJ_iS P * FKQFS diSIjPS S *DYR 
HAPPRPANFL/QFLVETEFHHVGQAGLE 
LLTSGD* PASTSQGAGITGVSHCTGPRI 
itr J-iivl r on LiiSAPvjtoc WKRaqpi iKSKkTiQ 
FRAFPOTAPCAHAYLLGCVSHliEMGCRN 
IDIPELSFFLCPLQCLIYRGKSP 


6314 


20215 


A 


6368 


3 


251 


RSRHCTPAWAPLS TQDHF YVFYFFLHFIi 
IGSLFIFACFVSS YFIBFS YCL\ YFFVD 


6315 


20216 


A 


6369 


333 


193 


FKKLVLMAKAKYCKANFFFQGVPVASKN 
/TGDLWEEPGWSQTPDLR*SALLGLPKS 
WDYRHEPPPRIANFCI /FLVETGFQHIG 
QAGLALLT/ S/GDPPASASGALE*tjGGG 


6316 


20217 


A 


6370 


36 


369 


RPTRPEMRFHPVGQAG/PRS PGI*K*SAC 
LSIiSKCWDYRHEPP^PG* FSFFTYIYI 
YFFFLFFS FSFLEEMGFHHWKAGLELL 
C*NDPPASASYSDRI IGVSHQARPFFFF 
r t? 


6317 


20218 


A 


6371 


? 


375 


DIQMIDRNIKN/CSTSQSIRKLQIKRIM 

* YHLTSVRMAITKKTTHKC * RCGKK\LH 
C*WQCKIiVQPL*KTIQRFVQiCrENWINI 

* PSNSKN/WKSAC*RDICTAMCITT*FI 
RAEHGGS \ PCNPSTLG 


6318 


20219 


A 


6372 


75 


372 


NGNSQKKKKKKKKKGGPPLKKPSGGPNL 
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P PGGAKKS FPIiKGGQKKPPGG FLEKNP P 
LGGAPL\GPPPPPKKRPRGKTKP*GG*R 
GKNPAKP WLGKKI LKP 


6319 


20220 


A 


6373 


605 


350 


ISTGICLGMFIAAIiFTIAKL/WKAT*PS 
FNR*KKKMCYIYASYSAIKK\DAILSFA 
ITOMNLENIMI1SEISQTHKDKCCMISI1I 
CAI 


6320 


20221 


A 


6374 


167 


378 


KKSQGGTNI KP PAI TKYLS QMGG * KKLL 
TGFLKKKSLLGGGAGVTILLT\NRTFST 
KINDPDGGGDHILSQ 


6321 


20222 


A 


6375 


994 


747 


VSLALS PRLEQJGVI SAHCNLRFPGSNN 
S PASAS * VAGAHHHTQLTFVFLVETGFH 
HVGQAGLEliL/NIiVIRPPWPPKVLGLQA 


6322 


20223 


A 


6376 


1956 


1592 


HEFEMESYSVTQAGVQ*HDLGLLQPLPP 
RFKQLSCIiSLP/RYWDYCIAPPCIiANFX 
♦YLVKMGFRHVGQAGLKLLASSDAPASA 
FQSAGITGMSHRARPKSAFKIQAI IGEG 
GVSELHRKINI 


6323 


20224 


A 


6377 


94 


402 


KTQELQNKI SHCL I RRVDRKM/ WLCTVA 
HTYYPSNLGGQDGRIA*GLEFETSLSNI 
TRPRLYRKKKKKKNVFQAWPLWSNFLRR 
LRREIRAEKSF 


6324 


20225 


A 


6378 


463 


146 


GKUGPPKGSLKPPPPFFFFL*R/P/HPP 
PRGETPFFFKKPKKSPRGGGASPPFPPL 
GRVRPENFPPPRGQGSP*PKRGPTPAIiP 
PGGQKETLFPKKKKRKKKKKYMRIiI 


6325 


20226 


A 


6379 


3 


207 


LEYQYAQTMCIISSRIQLVAEKASVPFV 
DKTVLS \TMGQL* EAKAEDGFFVPG* YG 
LAVS PLKFQSTRP 


6326 


20227 


A 


6380 


328 


70 


rprrrlglprswdyrheplhpdqllve i 
mshyvaqaglkll\ itgvshharph/yr 
f*ughc* slvrrpwtisysflclgfpmw 

EVSD 


6327 


20228 


A 


6381 


13 


77 


VIFLLQPPK*SSCFSLPNRQDYRRVPPY 
PANF*FS/CRHSLTMLPRIlLF*TGLQAC 
ATIPS 


6328 


20229 


A 


6382 


342 


77 


KGQGFYPHLFFFFFFF*KDSPSLPRVEC 
CGEISAPCNL*LFGSNDFP/ASHAIVGG 
KLPKQRRGAFFS * GGGFFFFF *M* KHSV 
LNSSRN 


6329 


20230 


A 


6383 


1 


208 


RGRVDKI FYLFFETDSQSVT*AGV* WLN 
PGSLQPLPPGFK* FS*PQVI\TCIiDLPK 
CWYYRREPLCPVY 


6330 


20231 


A 


6384 


277 


50 


LVLKIiGRCKTSSF I *NFKGPQIAKTILK 
KNKAGSLT \ LPNFKTNYKATVT KTV* YW 
HKH*YRHGFISGLSILFH 


6331 


20232 


A 


6385 


2 


354 


AAASTNFRDR/IL/LCHPGWSAVAQS*L 
TAVSTLN*AQGIG\SSHbSLLNSIiDYRY 
APPLLAVLLCCPGWPQNSWPQVILLSQP 
PKALGYRCEPPCPAPLLLNFFFFFKNRL 
WFCCPGWSA 


6332 


20233 


A 


6386 


2 


188 


FFKYICY /CINMYMCFYIFINIYSLIYL 
*IHIYKYIFAYIFINTYL*IYICLYIYK 
YI FIYYR* * I YICLYIYKYIFI YYR 


6333 


20234 


A 


6387 


2 


241 


WATKCR * G * RRTGTLTGG * WDCKASGQL 
♦KTPQQFliISITY/PYGPAISLLEIDPS 
ELKTYVHTKTRT*LFIAASFIITKN 


6334 


20235 


A 


6388 


142 


429 


YCVCI Y 1 Y I FLYLFI FKCTFFT YSSJjHLi 
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nucleotide insertion 














LIFSFPFFIiERGVCFSPRGGGGGKYLGL 
wtcp r ffci w xJvKJN v i X i«\Uyi toAlityN 
EILS FAATWME LEAIMLS E I RQAQNDKH 
YMISLICGIKR 


6335 


20236 


A 


6389 


468 


250 


VHRRTPPCPIIFFFF/CKQ*VLPMLPKL 
VSNSWPQATLLPWPSLTVEFQALTPGWK 
NFKKMFLMVMSWS I KKR 






2v 
j\ 




616 


1084 


liCXiCQYAGX S VQCPD Cv FQFFHNVDSTY 
EDYLRKLKTS LSFFWRRS FALVAQAGVQ 
WCDLGS LQALPPG FK* FS CLGLLS S WD Y 
RHPPWHPANFVLLVEMGFIHVGQAGLEL 
LTSGDPPASASQSAWITGMSHRTQQE\Y 
FLNLKFSTSPGELLCLK 


Oj j / 




7\ 




A 1 C 

415 


200 


SPPPRFFFFCP PKKNFFFPP PRSKNFFF 
*RGPPFFFFFKKTPFF/YFFFKPPFPPR 
/ RFKNPPKIFFFFNRGPKNPGGGGGKKK 
PPGPGGFSPLWGPPPPFKKPPPPPKKNP 
PLFGKK3CKKKKKKKRAAPR*NPRARGVF 
J? r IrlrlAyr WiUrlj 


6338 


20239 


A 


6392 


3 


117 


KLKPSSH \DY* VGETYGKDITSRGKDfCP 
IAVCKTFSPLS 


6339 


20240 


A 


6393 


3 


204 


DRVSLCRPGWSAVA*S*LTATSNS* \VK 
QSYHLS I P *GS WDYSCVP \ PPLG* LVIL 
LRQP PKVLGLQA 


6340 


20241 


A 


6394 


501 


187 


AGVKWYEHGLWQPPPPGLKRSSHLSLPS 
S*DHRHEYPCPANF*KIFF\VETRSHYV 
AQTSLEFLDSSNPPTSASQNAGI \ *GMS 

Uf**TV/"IPMfVPC'0»T T TV T /""VKXT?T T f 

JdUAy Jf MyTFSIj V KIGTWFLiI F j 


6341 


20242 


A 


6395 


3 


393 


HACGAIMNSKPHAVTQYHTPI i LM*SVLI 
TT\VLLLLFVTILNDGITILLTDRNRNT 
I ljrx PoGGGDPIIjYQHIiF*FSGH/VGG* I 
L/LLPGFGTISHIVTYYSGKKEPFGYIG 
MV* AM I S I GLLG 1 1 V* AHHI 






j± 




1 A1 


IOC 

li5 


r Uof 1 irtyJL vi/vjl Wy 1 XKoLiNV YJjTRIjTT 

S FSLMPS * FLF I FFCRLX3/SLTILPRLV 
SNSWPQAILSPQPHKMLGLQA 


6343 




A 


6397 


395 


187 


SSYLSLLGIWSYRHVPPHPANFLFFCRD 
ijVVlr ±r r (J \ KMCjAWIj PRL VLiNS WAQ v I 
IiLL* PSKVLGLQA 


6344 


20245 


A 


6398 


35 


334 


QNETLS KTKXIKNKKKKKKKKRGGPLKK 
PLKGPKLTRACKKKPFSLKG/WRKKNNL 

KKF*REKGEKQPQNPGC 


6345 


20246 


A 


6399 


464 


146 


HIQIREFQRTPTRMSPKTSTLSHFKIKK 
QLVMNKRSFIKQWIS *QKPCQMARRD*H 

EIKPFLGKYKLREFITA*FIFKG 


6346 


20247 


A 


6400 


723 


632 


FFN*DKI LLCHPGWSAVARSQFTAASTF 
RVQ\QSACLGLPSSWDHRHIPSCLATF* 
♦KRGFAPLPRLLSNFWAQAILPPWPPDV 
\ Jin na 


6347 


20248 


A 


6401 


522 


46 


FFPPFLSFFFFSKIPKQSLGLVFYF*NK 
VWVFCPGWRAIR* FPFP A/ AP PLGS KNS 
/SCLNLPSTWGYRISGMVPPKAH* IPPB 
I FFFFFF / CRQSIiALS PRL/ WQWQVQA 
ILIiPQPPE*liRLQVPGRW 


6348 


20249 


A 


6402 


103 


261 


NIPFTCLIKKKKKKKKKKKKKKKKKKKK 
KKK/IK* KKKKNVWGGVVKKKLGGD 
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W=Tryptophan, Y=Tyrosine, 

nucleotide deletion, \=possible 
nucleotide insertion 


6349 


20250 


A 


6403 


146 


384 


GPLPSLSGHKELSLFFLFETESHPVAQA 
\DIiGS LKPLP PGFK* FS C\S LPRS WD YR 
RVPPCSVNFCVSPY* PG* SOTPDLR 


6350 


20251 


A 


6404 


3 


395 


ETESHSVAQAGVQWRNPGPLQPPPPKPK 
RFPYLSLPS STOHRHMPPCPANSAFSVE 
TRT. /l .H ACO ART JCYtil iTPGDP PT*TSOSA 
RTTGASHRARPSIiKYFLISFVISSLLHG 
LFRSVLGSMYLRISQVFFCC 


6351 


20252 


A 


6405 


30 


205 


CRLTRPAEFRTRKTHT\REKPYECKQCG 
KFFSCKSNLIVHEKTHKVETMGIQ*VMW 
LFL 


6352 


20253 


A 


6406 


3 


AAA 

444 


7\"pvTTY2V*T<2TrTT'UTST VT /ITFT.AFirRT V 
AK£.l lul "XoMlvrjjXl/ UC<UMJC<^EiXJ\. 

KAI PFTI VPNK* DN * ELNLTKQVKD PHK 
EDYKILMKETEQDTK/WKDIPSSWIRI I 

N T VTTVAUT ..QTfA T VT? P<3 A T P T Y TR VTRITT 

ECHWKKKI W I GKVAHACNPTTLENQGRR 
SLEARSLRLS 


6353 


20254 


A 


6407 


480 


307 


QHDSPSAKKPPQKPDKS/HPSSRS/HPF 
PP PP PC * TPPPE F * YPPPKGEWGARP P P 
QG 


6354 


20255 


A 


6408 


923 


833 


KAMKHFLIKTRAFFRHVTAHLIDQSIVK 
TFT»T»YUT^FI AT FAIiTiWWYGKE PML /WP 
GAAAHTCNPSTLGD *GGRTA* AQEFKTS 
LGS *NKTKP* CIjSAV 


6355 


20256 


A 


6409 


382 


152 


ECSGTLUfflCNIXLLCSSDFP/SLFHNV 
SLSSNSSS*VAEIIGIiC3IHVWLIFVSAK 
LFLGRRVRNLFS PT PKNMFS K 


6356 


20257 


A 


6410 


553 


173 


CLL I IKNTLPAKKKKKKKKKKKKKKKKK 
AR/WPP*K*RAP*NPVKTPGGSF 


6357 


20258 


A 


6411 


38 


562 


FLRQGL/DSAAQAGVQWCNHSSQRAP P F 

CZT /2P Q CU\ PACT .AfiWQ A VA * CP T JX ATQT 

FRVQVI LLSQPPE * LGLQSCS ITQVGVQ 
QVJRDLGSLQPPLPGFKRLS CLS LPSN* D 
YRRLPPRPDSS ITFSRKFFQSPQARNLP 
PTPKT.ICPOOP VP AP S PSELffLS WSSGNTC 
CHFLWFLCP 


OJ->2> 




J\ 


o4lZ 


At A 

414 


144 


FFRRTKQHSAAIfliECA ISAHCNIiHFPGS 
SKS PASAS * VAGTTS VCYHAQL I FVFLA 
DTGFYHVGQTCPDLMFCPPRJPPKVLGLP 
RLPLICF 








04U 


o 
j 




AEAGGLLEPRSSRLQ/CSQP*SHPCTPA 
WAT 


CODU 






A/11 A 






TQGPRLFPHPFGAPGGPVPLGWE VRPPP 
APPGKPPFPPKPQ\NSPPWGGQPSIPPP 
LEG*ARKSP*PPKGLVPVTPKGPPPPHP 
GQENPTPFPKKKKKKVN 


6361 


20262 


A 


6415 


499 


0 


HFTSKHHFGFKAA\A*YWHFVDW*LFL 
YVSIY**GSH 


6362 


20263 


A 


6416 


187 


236 


MCHINGIMQFFFFFFFENEVFFCPPARI 
GN*R/WPNLG*LKPPFPGLTLFSCLTPP 
RSWNKRPPPPARVIFLFFNKKGVFPLLP 
GRV* IPHTPEPLATTNLLSVTIDLliIIiN 
VSYKWDHAVFFFFFF 


6363 


20264 


A 


6417 


158 


412 


SSDPSRRSCLFSF/CFFFLEREFFFAPR 
VEWRGPNFCLLTP P PPGKKHFFAPS LRG 
GGDYGHLPHGPVT FGTLRKAGFS PCGPG 
GS*PPPPGKKHFFAPSLRGGGDYGHLPH 
GPVI FGTLRKAGFS PCGPGGS 
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W=Tryptopban, Y=Tyrosine, 
X=U nknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


6364 


20265 

i 


A 


6418 


457 


11 


HTGQSPYRCSECQK/AFTQKSAITNHQK 
PQQKGRKSMQ*LRI * ES FFD * SHQLRHN 
RSSMNTMIVMGPFGTLVGEKPYRCNEYF 
KPSFLKISLVYRRDLTRRIIGQAHWPMP 
VISALWEAEAGRSLEVGSSRPAWPT 


6365 


20266 


A 


6419 


103 


376 


SALSLILGITFFF* IFLETGSHFIAQAG 
VQW/PIHTCDHSILQPQTPGLKPSSCLS 
LLSSWDYRHVPPVPRTSACFECTILRSG 
SVAHTVLWKAKAGGSIiEVRSLKLACATQ 
GAPCLTKKKCDPQD 


6366 


20267 


A 


6420 


463 


6 


FPPNPHKKQNQQTP \ PPKKAPQQQKYTA 
DP/QDPPGNDPKAQAYKKS PAATYPAAS 
SSPSSPSS/RPPSTAAHVLRVPL*GSPP 
GLVGNPRPPPPGGAQHPQPPDTWQKRPA 
HHGLQSLSASPPGIS*EPQLAPRNSPHW 
TRDL 


6367 


20268 


A 


6421 


1 


479 


RGFAVLFSAVEFPSSSKSHLDLETSPKA 
YKHGAPCSAPGSLQASLPKGRNQRNPFS 
NFFVLF*DKISLCCQGWSAVIQS*LTVA 
STSPGSDDPPTTPPPPHAPDTWDHRHVP 
PHPAKFYI FYRDR/ SFTMLPRLVSNSWA 
QTVCLSSLFMLLPSLLWSLSP 


6368 


20269 


A 


6422 


11 


242 


FEIIFSAVEFGWSDHGSLQTQPPG\SSD 
PPTLDS*VAGTTGLHHHAQLMF*LFVEI 
GSPGLKWSSVSASESEPLYPA 


6369 


20270 


A 


6423 


488 


335 


SCQLRPWNES IGQRSGIRWLTPVI PALR 
EA/EEAGG *LEPRSSRPAWAI 


6370 


20271 


A 


6424 


454 


229 


RDPPASASQSAGITGVSHHARL/VSLFL 
I VM*AEGCYLSG I WRTRLKMNPPQ* RKE 
RQIQVKSTRSDI FFFYFLFL 


6371 


20272 


A 


6425 


1575 


1309 


FESESHSVT\RTECSGTIS \AHCSRHRL 
GSSDS PASASRVAG ITGVHHHAQLI FLP 
LVELEFHHVGQDGLELLT\*VIHPPWPP 
KVLGLQA 


6372 


20273 


A 


6426 


276 


437 


CFHPPEPWDQQWGPPPLGNF* IFWKKRG 
PPFVPQIILNPWPQKNFFAWPPKTLG 


6373 


20274 


A 


6427 


445 


240 


RPRRADHL/RSGV*DQPGQHGETLRI*RK 
IQKISWVWWHAGTLWPFWERDGSHRLSA 
RTQMLGSWACSPGF 


6374 


20275 


A 


6428 


114 


450 


LLLLLLFES ESCS VAQA/ PSAVL / W ILL 
GSL*PLLPGFKQFLSLSLLSIWEYRGLP 
PYPANFFKFLVEMGFHHVCQAGLELLTS 
GDIaLASASQUSG I TDVSHRAQPVT I FLK 
GLS 


6375 


20276 


A 


6429 


380 


224 


LRHCLFTQAGIQRQEHGSL*P * SPGLKQ 
SSHLSLPSSWNY\GVKLPCPNKFQH 


6376 


20277 


A 


6430 


509 


228 


phnkgrvs ppppqnffs ppsrnlgggwl 
pkipppkkrvfskkpprgkkk/ppp*kk 

KKNFSPPPFFGPPPKIIKKPPPPFFFFF 
FFFFFFFFFFFF 


6377 


20278 


A 


6431 


3 


117 


HS WESI *KRGACFNIFIAAQFTIADIWN 
* FMCLSGDE * IKKLGL I YT/MDCYWGVK 
KKETMPLATTWIELKVNILSEIGQA*KD 
KYHVF*MNR 1 


6378 


20279 


A 


6432 


417 


271 


LSKKTKFRNSPQPPLPCSFL*NDRK\WP 
RVWHTCNPSTLGGRGRWIT 


6379 


20280 


A 


6433 


385 


145 


TPGIKQSSCLHLPKCWDFSCEPPHLANL 
NF/CKR*YIPIVPNSKV*TFSQEVFSYT 
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Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Add, F=Pheny!alanine,' 
G=Glycine, ENHistidine, I=Isoteucine» 
KNLysine, L=Leucine, M=Methtonine, 
N=Asparagine, P=Proline, 
V = vviuiamine 9 Jtv — /vrginine, o — oenne, 
l— inreonine, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucjeociue insercion 














WPHSSSFTGKEVI ILFPKECPGMFK j 


6380 


20281 


A 


6434 


65 


383 


iifpnllkkkkkboockkkkkkkkggpikk 
kp * gdki ktrqkkknffpkrggkknqpg 
kfkkknnfwggkkmekpp\kkkktckek 
kki * kkkgekkpqiapekkkgs 


6381 


20282 


A 


6435 


3 


454 


kxyKlPVRx XJ.K/* IIjFKHXVIKIjoIS-o J.Xi 

k/ekkkkkkflraptekgqippkkip/ I 

KLTVNF*KKS*RREKW6\PIFNILKKKK 
F* PKI FYPAKLTFIREGKKKFFPKKQGL 
GKFVTTRAPIaKKILTGVLNLKMKKGYLF 
r wiN 1 Li iv I LtBUMxl Jr 


6382 


20283 


A 


6436 


424 


410 


NFFFPPQGGGGGGFFF*DKPPPGNPSSL 
FFFFLGGGKKRGPPPPPKG/QTPPPGGL 

FF*KPPPPPGFFF*KSPPGGGGPPSVKK 
KKEKKHKKKKKKKKK*R 


6383 


20284 


A 


,6437 


3 


214 


E IDCCAAVWKQPKCPSADE * RKKMWYLY 
TMEYCSATNKNBIQSF/VNNWKEVEIIM 


6384 


20285 


A 


6438 


83 


420 


QS FP * I S CRKYS / WLGAAAHACDANTLG 

nonnD 7V AliT/TI/'Ylf T7t/ T T 1 CT J'tlNrr .T7'Df2'DW T T*2M"lD 

uKBWKAAWvjyjvr ivl oJJuCiuJSlrvxir Jvl itoJtr 
RTSHGCELRRS S YTYALTHGFSSTPETA 
RPAPLLPPPPPPTQREDYKEEDLYDDLL 
NK 


6385 


20286 


A 


6439 


163 


289 


KMYCYYYY / C*DRVLLMLPRLVSNLWAQ 
EILPPWPPKVLRLQA 


6386 


20287 


A 


6440 


459 


431 


DKR1RXKX TPi-iAx K JKi<Wl KTT V / 

shymvvvkettnnk/ cgifihswweckm 

VQLLWKTVSQVFKRLDIiP YNLATLLI F I 
CQIiQIKTLSSPKYIjHMKVHSIIIKIRKK 
YIKKNIiKCPSTDEWI I KI *C* S IHTMEN 
ERA*K 


6387 


20288 


A 


6441 


478 


36 


V r vj<JKV 1 Xj V bJvLtir JjsJ&xr Lu\ V Jl r iuCLAjJxy 
LEKSL/WNFHDLELNISLHKNQPNRCVH 
QWRIG*RKRS*CSE* *R/TMQFINSNLT 
t .tkt* t d cnrraiwnT t t t anil / tthtvrv 

YSAI KKNE IMCFPATWVELEAI ILGEIT 
QKQSQIWRSPP 


6388 


20289 


A 


6442 


464 


206 


il V ynouXlUir I OO i V r Ll vji-t / Xl\^I\J.NEir-VL/ 

VYRNTRMFTAALFL IVQNWK*HNYLQTG 
EKINKLPYIHTMEYFSVIKSNQHS IT 


6389 


20290 


A 


6443 


414 


216 


CYIYTCPLTQQSRSWVAT*ASSKEEWIN 

klwhihtmd*y/ssnkknkixjiwttwtn 

LKS IMLTE I SLT 


6390 


20291 


A 


6444 


7 


411 


S FYCLHS FAFSRVS YCWNIQ YIAFLDWL 

iJOKOliOixtlVr On \WXl/ V-JM^V rij/ ICiX V 

WQFLTKIiNirifiTiHDSALTLJiGIY^K/DM 
KSYVHTKMCH^IFXAALFIxTUCTWKNL* 

LOO VVJUfi JCuM IVLlrl X UyiriEiIOJO V i-lXVTx. yLM 

ELSSHEKTWRELLRERSQSKKAIYCIFQ 
QYDTLEKAKLWRQ 


6391 


20292 


A 


6445 


257 


419 


KSLPFFF/ CW* ETGSHSVTQVRVQWOJH 
IALQPQTPGLKQPS CLSLQGSWDYKFHF 
TQACPY*RP 


6392 


20293 


A 


6446 


20 


417 


HQYCFASFFFFNNFFKTESHSVAEAGAQ 
WHDJjGSLKPIiPIiRFK* FSCLSLLSSWDY 
RCMPPHLDNF\* FLVETGFHHVGQAGLE 
LLTSDDPPALAS*SAGITGVSHRARPVE 
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peptide 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Clutamic Acid, (^Phenylalanine, 
OGIyrine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=Glutamine R=Arainine S=Sprinp 
T=Threonine. V=Valine. 
W==Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LFLLRRQGLRLLQNIGIGRQ 


6393 


20294 


A 


6447 


371 


3 


VIGCQFL/RKTKRYLPYDLP I VLLCI YP 
R/BKKNLCRH*TCTSMFMTALFI ITQH* 
KYPKCHS IGBWLNIIiWHIHTMEYYAEMK 
RNKLL I HEKTYMDIiKDTIjFS GGOR VYI.K" 
SSILSIKCIRRQE 


6394 


20295 


A 


6448 


46 


332 


LGEHLRQGP 1 1 LKVCQVILTHQQYFESQ 
WS IMALSNRKYNANCR YNLKCPNS Y I KK 
EML/WPGAAAHACNLSTLGGQGRHIA*A 
QEFKTIAKPCLY 


6395 


20296 


A 


6449 


2 


381 


IiCSIjVFTH I CS I FLFFFFFFKDRALLP / 
H/PCWRAGP+SYIjTAASNSGXVKQSSNIj 

PRLV*N\ ILLPQPPKVRGL*AAAHAGGQ 
GHDHTSLRPQTPGLKQSSNLSRGKPLCR 
ANFGS FW FLVLE KRGS HHVAQ AS LKQS S 
CIiSLPKLGDYRHEPSHRANMFNFRSLPG 
LRKADCLYPY 




jtvzy / 






J. 




LQNAVWQ FLKNCKI VPHY * A/ 1 PHLG I H 
PNALKTATQRNICAPMFIAALCZATAKRQ 

V-iv/ ti^N'i OXUXVi X XflXr]J\XlJdA JVtV\«Vv 

LLARVWINWSPVHCWWECEMVQLLQNAV 
WQFLKNCKIVPHY 


6397 


20298 


A 


6451 


314 


72 


MLPLTNGSCRTKVSKDIEGLSNSINPLP 

T»Tf2TVT?*T , "DWDl?*rZifZT r DI?T?T?I?l?T.X7' /DMPT 
liiuxini xr ~ f KUtux r r r r r r Liu / irfMu X 

FTKIDHILGHKTRLNILKMI*ILKYIP 


6398 


20299 


A 


6452 


54 


350 


LRSFCSALFCFETVLLCFILGWSAVA* PR 

AJ-t rtuoowrr t"0/ uiir x roan UrlreLiv ir ir rl 

QVFLEAFFFFFFF/CRKGVFFFIiPGGV* 
NSGVKGTPRPNPPKAIiN 


6399 


20300 


A 


6453 


368 


172 


WLTSVI \QRFGRPRREGHLRPGA*DQPG 
LR*STHLRLPKYWDYRRESQCPARPVIL 
fCEANMFHT/3 


6400 


20301 


A 


6454 


1 


365 


NILKUmKLLNNL\VKEEVAKEIKK*LV 
LNKNEimYKDLWNEAK^ 
17y^SA*RI^SN0CSK*ASTIiRlJHiaCKSN 
LNPRLI*QQKSITLKREKSMKPKAGSLI 
RS IKLI IRLT 


6401 


20302 


A 


6455 


364 


1 


PETRSCSVWPRLGGVQRQDHGSLPPLSP 
RLKQFSHLSXaPSS*NHRGMPPCPANLLF 
KFYLFIYli/FCLFVEMGSHHVSOAGLKF 
LSSRDPSALVSQSVATTGMSRRAWQDI I 
LQQKQAATAA 


6402 


20303 


A 


6456 


91 


300 


GHARHFGRPRWGDCLGPEVRDQPGQHGB 
TLTLQK\KNKN* PAWWHTPAVPATQEAG 
ITGVSHRARPCPPN 


6403 


20304 


A 


6457 


2 


195 


RFSCIiSLPSSWDYRHEPPRLAIFFFF/C 
FWNFKKRGGFPIiWARWV^NS * PRGTflTiT . 
WPPKGWGLRA 


6404 


20305 


A 


6458 


522 


202 


THNWRPSLQIPTAALFLLAKKWKQPKCP 

HSA\KWMNFENMLNEEACWERPHIV*FH 
IiYEMSRVGKTGETKSRLWV 


6405 


20306 


A 


6459 


377 


161 


EMNIKCPSTDK*VKKRY/LYIHTMEHCS 
ALKGNEVLI IHAMTWMNLANIMLSEKSQ 
SQNTTYSMITWKCIiE 


6406 


20307 


A 


6460 


92 


484 


Ll^ACFLRTVLIDYKMWAEKGMVGMCVL 
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X^Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














MRNMTVTII FRDRVLL* HPGWHAWRSQ 
LTSA/VKRSSCLKPPKC*DYRCEILCQV 
E IAFTHRSLFNS FLKRSMRVILLNYWKD 
PSLGFCSYDSTGMALLRNFS 


6407 


20308 


A 


6461 


483 


192 


MGLMLILRANFQSTKKKMICLLKKSLTF 
LNCIYLSIWPCTFNWIFVCVFAVYKLH 
VWAE \ PVAHACNPSTVGGQGERTA* AQE 
FETSLGNI IRPCLY 


6408 


20309 


A 


6462 


451 


98 


KQVWQLL* KQVWQLLBS LNMKLL YDLAV 
LNLGAYPREBKTHVHTKTCTQMFTAAPF 
I \ IKK* KKNFS IDMKYBERDKMWTIHVT 
QHYLI IK*S ID\ YAPTWMNLENIMLSED 
SQTQKTTHCIISFI 


6409 


20310 


A 


6463 


395 


67 


FLRFWCTHHWSS/DILYPVCSLLSLNRS 
HSFPRVHKSPLYHSYALCPHSLAPTYE* 
AHKMF/RLVNFLFLVETGFRHAAQASPE 
LPSASNPPFSASQSAGTTDASHRAQTK 


6410 


20311 


A 


6464 


485 


220 


FFFETWSHSVTQARVQWCNLGSIAKPLP 
P\GSSDPPTSAS*VAGIAGMSHAPTHEI 
PS IQARQQHLQGElOCIELIiSIiPTSASQV 
LRLLA 


6411 


20312 


A 


6465 


I 


371 


IS I PPKKYLRINIjTTYVQGLYEQNYKTD 
KRNQGRSSKWRDGPCSRIGKLNIRMSVL 

snliysfnvilvtvpasyfvsit/rlil 

KFT*KCSRPRIVPLFHTiTiT.PIiLVPFVCS 
FILCRPGAS l vk 


6412 


20313 


A 


6466 


489 


175 


YKIIEISRCIK/GLL*LWRTTHLFSHPH 
WRRGMAYEKNADFRLGTVAHAFNPSTLG 
G* GR /WNQGFKTSLANWKPCLY 


6413 


20314 


A 


6467 


2 


367 


FGTETLGQAGIKGDPGBKSHWGEGLHQL 
REALKILAERVLILETMIGLYGS*LTEL 
PAAAGSAGAPGrQTGHPPGSHHLA\E PEL 
GSGAGPAGTGTPSLLRGKRGGHATNYRI 
VAPRSRDERG 


6414 


20315 


A 


6468 


472 


260 


TTLGGAGNSGYRHIQPHVANYLFICRDR 
/SLILLPRIiILNS*L*AILLSRPPKVLV 
LQT 


6415 


20316 


A 


6469 


88 


435 


RGVLLHMLSCLPLCKTCFCSSFTFHHH* 
TSFSFSFFFFLLGNKVSFYCQAEGQGPN 
U5*IiKPLPPGLKQFSCFT/LPEKWE*QT 
EPPGPVKRHFGLSYFPLTRG 


6416 


20317 


A 


6470 


34 


106 


EFAIPFLTIYPIELNTHVHTKICI*MST 
AALSTR*KSPTCPSM\INKMWYIHFMEY 
YSAISKGRNHANVHQWIHKM*YIHTVEY 
YSAIKSNEALTYATTW/MNLENIMLSER 
SQS*KICI 


6417 


20318 


A 


6471 


390 


92 


DFLIDAINHNIiKVFFYS*FISMC/HPYN 
*SLSFSNIQKSLIKTMINVSLSFKY*YL 
IYFS*KKTF*N*RQGLTMLPRl4LSNSWA 
QAILLPWPPKVLEIiQV 


6418 


20319 


A 


6472 


414 


167 


HIH I KTCA* I FSAA* FI IAKK* KPKCPT 
ADEWINKMWYVHTIEYYSAIKK\NKLLI 
PWMNLENIMLSTGNQTQKATYCMIPFI 


6419 


20320 


A 


6473 


3 


244 


TTIIiMTTYDALFPELYTTYFVTKTLIiL 
TSLFIi*\lRTAYPRLRYDQLIHIiL*KNF 
LPLTIiAIiLI*HVSIPITISSIPPQT ■ 


6420 


20321 


A 


6474 


12 


224 


IASGRPFFCYDVCV/VKRLCRHSIVIIM 
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X=Unknown, *=Stop codon, /==possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GGALKKKPGGBPLFPH 


6421 


20322 


A 


6475 


417 




FQS FTBNKMLS SS I TCSSB I FGERKSQIj 
M*RPSLL/SY*EKLPLLASTTVISQRPS 
AL* VQPL/PTAKNL * LTAGS KGKYYFLN 
SKVFSN 


6422 


20323 


A 


6476 


1 


131 


RTRPPTRTRTRGRWLTPVI PAIxREA/ BE 
AGG*LEPRSSRPAWAI 


6423 


20324 


A 


6477 


830 


513 


WNSVKESLPADTWFLPFLGPLIDIFLLL 
IFALCLVNIiIjVKPVSSRIjO* PHVKMMTjA 
QGFQPIPSSDPEDKDILPI»SLLEQVS\G 
ILPLQC*AGSMPITSAGSSYRR 


6424 


20325 


A 


6478 


399 


196 


SKQSSHLSLLSR*KyRCAPPWWAFF/SF 
FLINIFCRDE/SLTML»PRLLLNS*PQVI 
LLPWPPKVFGLOA 


6425 


20326 


A 


6479 


1 


411 


NTGIMPAInTWSILIjLIIXJEVLVTVKTCY 
LRNTFCX5FSDGSGHSV\QTFWIGSTILD 

aiksl/r/dsredgktstligirkk*tp 
I U^DLVGFKTSQEEGTVDGlxNLA* elq 
VEGPRDETELL* PQDQTCMDEKFLLMDE 


6426 


20327 


A 


6480 


381 


162 


AGFTHHNFFFL F * DMVLLCHPGYSAVVQ 
SWLTVASNSW\VK*SSHLSFPKH*DYLQ 


6427 


20328 


A 


6481 


379 


85 


SILFFLFHPINFFFFFF*AWIFIIPPPH 
\RAFKVVRWGFFLADSVWFIKRLPQYY 
FFFFFPGPFFFPFP , PPPPPPP1?P1?T?1?I7P 

X * X X A * -L X X X X JL X X^ X X X X 1 X X^ X^ X X^ X7 X7 f 














GG 


6428 


20329 


A 


6482 


355 


117 


PBCVKIFPPPPLPRKWDPRGPP*SPGTNF 
S I F FLVE KDFPHVAO AGLKT. TjD55 55 «5TP<5 

VAS\QSV* itgmsphawlqvllsl 


6429 


20330 


A 


6483 


402 


97 


TAFHLLD I YHKS * KIHFTNQPLYKCPLF 
IIAKN*K*IxKCPl^MKQIVA/WCIHTI 
E YYSAP^NVLLCTTRLWMNLKS IMLCK 
RSQSQDVTYYVI PFICHS 


6430 


20331 


A 


6484 | 


714 


365 


NGFALVAQAGV/QWCDLGSLQRLPPrFK 
QVSCVT1,LNS*DYRSMPPHPANFCIFSR 
DGV * S YWSBWS LTPDLGLPKCCDYMRE P 
PCPANAKS FTVEETHVTOQAYETMLLSD 
HDESG 


6431 


20332 


A 


UtOJ 


4fi8 




xii\.ioi^xu * wit t i*ix a/vu-il* x, xj v / ftAnny 

PQCLSTDEWISQI*YICIMNINSDIKRN 
EALKHAVI CiMmKlWPFIS PTElxGKKK 
EKKK 


6432 


20333 


A 


6486 


243 


H 


GMSQYQEYSKSNFFLSNRVLLCHPGWSA 
VAH/SPiTFALl^FK*SSC^LPKriWDY 














RRVPLLI*AQNATR*NSL*SCPA 


6433 


20334 


A 


6487 


403 


161 


DMVSPRWPVWSQNS \SLK*SAYLSIiPRC 
WD YRRE PS CQASR1TTS IGRGNRKNQ WNT 
CMYLLFI LI PLK3CYMCNGRVMEYFKT 


6434 


20335 


A 


6488 


292 


24 


DWRGRMAI PLKKGS KC*QGCRNTGLLRY 
*W /MECEVAOPLOEMV* WFLORLNIITLS 
YDPDF/PLLGIYPKE*KAGT*TDICMPM 
LVATLFSG 


6435 


20336 


A 


6489 | 


3 


203 


SSWDCRCI PPCLANFLICCRDR/SLTML 
LRLVLNS * TQAVLLPWPPKGMPGLLFVF 
YGGIVLSYVNIS 


6436 


20337 


A 


6490 


413 


30 


RGP/ YQPGFLGFSPPNGLGFG* KVFLPL 
PPPPRKGPGKKGSPRARGFPGQAPPKKE 
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NQRGGGFWSRGRLAKKKGFWFG *GPAKG 
KKFFFFLGAGVWGLKVLI P I KKKKGFP P 
PPLFFFFFFLQTIHPIRG 


6437 


20338 


A 


6491 


101 


335 


SSSFLGFPKYWDYRREPL/WPS/LWQFF 
RLNIHIjLYNPS I P II1AVYPREMKTCDI2I 
KSCKRMFTVAVF\* ITQTWKQPNVF 


6438 


20339 


A 


6492 


532 


127 


STVFRARQLPCLHEPGRQSSP\PVPTSI 
LYT\PLPFPSSRYQQNGPCLSSPKLSPS 
PELGTQSFPVHKSFSPCLPSPKPLTVPP 
ALPSTGPSQSL*QKHSLASUjHTHSHTP 
papscnpeswgiidataaistcfyg 


6439 


20340 


A 


6493 


399 


1 


IiCPAPRGGVYRGR/ESSIiSWDGLHSVQA 
SWPLCLPTPASAMVDAPPPA\SLPSRSS 
ILDCCIS SEQGSVGMGPAEPGAG*NIiI»V 
CCLLRPLEKCS IWAGVSRFSWYSLSQLP 
liARKGKS PNPLS FPNEKMPRPVS 


6440 


20341 


A 


6494 


319 


75 


KKIWGPLKKKFGLQTRRPKKWFFKRPPP 
FFFFFFFFFFF/HVAI fllfcrdrg\la 
MLPRLI LNSWP * RILQFWLPKVPGLQA 


6441 


20342 




6495 




ion 


LFFFFFFFFFFFFFFFFFFF*GFFFFIF 
YSFRDGVLLCHPGWSAAMPS 


6442 


20343 


A 


6496 


1 


299 


LPRLBYSGTITASNSWAPAILPPQLIK/ 
YNFFCCC * GRVLTVLPRLVLNSRPQ I VL 
PPPPPKSAEITGVWHHVWPKKI ISNEKT 
THCSCSALLAKQQISS 


6443 


20344 


A 


6497 


51 


266 


PVNVILPSSWNLRHSPPCPARFFVLFCL 
LFFGKN /RGLAMLLRLVSNSWP* VILPP 
QPPE*IiGLQVQATMIjSS 


6444 


20345 


A 


6498 


366 


78 


LIiAHCHS SNNKSLL I LDNFNI YHYGPSN 
TVTS*F\*MSSIiMILLSSP/C*GTSWSY 
LFYALKKI FCKD/RGLAVLPSLV*NSWP 
QWLLSWPPKELGLQV 


6445 


20346 


A 


6499 


37 


396 


FPRPTRPKSSSPAKATQQNTISFKKKKK 
S KQKKGNTVI I IR VKNQRKRKQ KN * KL K 
Q*N*EQ*KINKISKPIiAKIjI/KKKRVKT 
* IANIRNERSSITT*STDIKRLIWKYY* 
QL 


6446 


20347 


A 


6500 


706 


394 


VPLLLPRLAVQWHNLGSLQPLPPGFKRF 
SCLSLLSS *DHRHTPPCPA/NSFVFLVB 
TRLHHAGQVCLKLLTSG/DTPSSASQSA 
GIKGMSHRTQPIEILFYSVFF 


6447 


20348 


A 


6501 


403 


216 


ILNVLSFK/NMLY*HYTTVLRTVRPLSP 
R*ATVTCTYTSGWPEAAEDPQKMTFPHC 
DLFLPHPN 


6448 


20349 


A 


6502 


377 


116 


LLPY*BN* FLMYLPPICCLPFDFGNSGF 
CHQKKNFFFFFKTKSLSYA\RLECGGAI 
PAHCKLCLLGSSDSMKNFLNEAEFPNLA 
FNGL 


6449 


20350 


A 


6503 


3 


401 


CVQHLRASWSSRWAWAWRAPHSEQPAGP 
AAPGNEGLS IRASGCGGCTGYPSS AS PP 
ALCSIS\PRP*LPSRRAGLATCSPPCLS 
LPPPPWAPVRPEPPRRVPPPAPWRPVPS 
TTQGLRSASARHRTGRQLHLQ 


6450 


20351 


A 


6504 


406 


55 


WPKGFTPPRFPRKEGSP\PPFPKVFILE 
TRQPGPQTKPPFFPPIFKLTRPGGRPP* 
FPPPQRVRAGKPP*PQSFRVPGAIFPPP 
PPPLKGHPPRPGFQKKKKKERKKRPTRK 
RGGT 
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residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleutine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamuie, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=*Tyrosine, 
X-Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


6451 


20352 


A 


6505 


3 


281 


FFFCDRVLLCHPGWSAMVRVRSRLTATS 
/SFLGSSDSHACLSSSWDYRHLPPLLAN 
FCIFLKRQGFTMLPRPVSNP * AQAMRLP 
QPPKVLGLQA 


6452 


20353 


A 


6506 


1 


247 


LGSLQPPPPG\SRHSPASASQGAGIADM 
SHRTRQS IVFLRTNNKPY* IENMMITIS 
CTTAYKyEILRHSFNKICARSLYEKL 


6453 


20354 


A 


6507 


34 


47 


FDPNLTPRHIRI ILWAVNKERMLTASRK 
KKQII*RSSNIiSGDRLLNEEASWEP*CK 
KSMQETIQATEGWNDI FKVLKEKKNL\H 
SKILCPAKLSFKYEGEVKAFPHKPKWRE 
FTVTKPILQKC*RENQRTFDLTQI 


6454 


20355 


A 


6508 


403 


106 


HYVPPPSPPS/YFFFSPPGKRGFFPNPF 
ILVS PGFFPSPLF*KPPPEFFFFAPLKK 
I FFS PPRELKFFFFKGPPPFFFFFF 


6455 


20356 


A 


6509 


389 


230 


LSFFLFFRDKVSLCNPGWSAV/ VTFELK 
RSSCPGLPK/ SQSHCTKPQPGF*KYIYF 
SFFIK\KR*SLTTLPRLVSSSWDQVIHS 
ALWHPKVLGL*AVAHAYNPSTLGCQSAE 
WIT 


6456 


20357 


A 


6510 


101 


419 


ISFFFFFFFFLKKFFFFPPGWRGGGEF* 
LPQTPPFWVKKIFFPPPPKVGGFKNSPP 
PPFFFFFFKEKGFSPVCPGFFKTNPKK/ 
SPPPPPPRGVGLGGWP PPPGWDP 


6457 


20358 


A 


6511 


41 


376 


EWWCETRGGAGQGRAHGAAGGATGRVIiA 
I QARKRRPKREKHPKKVSCS VAQAGVQW 
RDLVLLQPLPPGLRVSLCHPS/WECX3GV 
VSAH*NFCLPV / I KDGVLPCCAGWS*TP 
GL 


6458 


20359 


A 


6512 


2 


332 


WEKVKIATLTGFWK* LI/ SGS SRTLMGD 
FERFKTSVEKVIADWEIARELELEVEP 
ENVTBLLQSHYKTPGTLAFSRLSPDVSL 
YSQLPETRSHEWOAAS KIQPRKASTSQ 


6459 


20360 


A 


6513 


3 


294 


FFFSEMESLSVTQVGVWWHNLGSLQPPP 
PEFKQFSCLSLPSSWDYRHAPPHIA/NC 
FCIFNRDGVLPC*LGWVSNSHLGFTC»IS 
HCTQ PIiLAIiS FLHG 


6460 


20361 


A 


6514 


1 


526 


FRK3RKPR * GGVM * PVWSRGEPGSVGAE 
AG/RS *SAPRRLLHHPAAGLATGLS ASG 
RRSAR WKMERASGLS PGGGLGATSRQMS 
PGTQLANPPDHGDKDCLGRI S PGSGKQ I 
QAAGQLPGPPTSLAPAQGRLRSLTPWGL 
QTPEHSEPEGIGHLQAATEAVLPHSTQN 
LITKRNLM 


6461 


20362 


A 


6515 


747 


451 


QAGVQW*DLGSLQPPPPGFK*FSCLSLP 
SGWDYRYRPPNPANF\* FLVEAGFHHFG 
QAGFELLTSGGPPASASQSAGI TGMSHH 
TPLGVHALVGENIQ 


6462 


20363 


A 


6516 


444 


3 


PGGGGPFFP P PWGGKPGKI S * /PPGGGG 
SNKPKLPPCLPPGGPK/GNPPQKXKKKK 
GFGPGWAHACNPI I PALWEAEAARGLP 
PDAGRYGPGGRREAVRREAGPQECQDPE 
GQV*GDPRGCGAAHAPARGHPRKA*RGA 
EGSSPGLPTK 


6463 


20364 


A 


6517 


257 


575 


ASSVTWAGVRWCHLGSIiQLLPLGFK* FS 
CLSFPSGWDY/ GHAPPGPAKFCI ysgdg 
VSLC* PGWSQTPHLR 


6464 


20365 


A 


6518 


337 


389 


SQHFV* ISCLHDMPALASQSAGITGVSL 



944 



WO 01/64835 



PCT/USO 1/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino $trirl 

residue of 

peptide 

sequence 


Predict- 
ed end 
nuclei 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

npnttrifi 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RARLIFIFR\MDSCYVPQAGLE/ PPELK 
RSSHCGLPKPWDYRHEPQAQPVCRFIRA 
TKCAYVSTQALSPAPVQPGMTPPAKRGL 
PCQPPGSPCPSLCHHQLAGTPPYE 


6465 


20366 


A 


6519 


61 


323 


FMWDRVARKGS IEGASLELPIiGGSVGGH 
GS CISMWGALGHS S \THV* HTHTHTHT 
HTHTQTNTHTPSSDSKKSS I LDKBEAEG 
CMTP 


6466 


20367 


A 


6520 


399 


101 


KIQIKATMKY/HNIPIRMGFFFKLSSQG 
/C**RC3yc/lYTLKHCQ*KCETVWPLWK 
SVW /RVFTjKIiNIHPIjHS PE I PLLGTYPR 
/DMKT*TCM*MFTATLFITTYS 


6467 


20368 


A 


6521 


1 


276 


RTRGFEGLRFFLGFFGTVLIiCCPGWSAV 
E\QSWLIAASASCLW\VK* SFCLSLLRG 
WDYRCI PPCLANF* *R/CGSWAQVIIiPP 
QSPKALGLQT 


6468 


20369 


A 


6522 


280 


277 


GDDPSCSMGGGHCIiASRSE*GSVGVGPS 
KPGAG\YNLLVCYLLRPLEKRS I RVGVS 
QFSRCHQSWLPFDRKGNS PTPCTYWVR 


6469 


20370 


A 


6523 


2 


327 


SLFSS *D/YRHIPPHPGNFRIFCR/SAL 
TMLFRLVSYSWPQVILLPQPPSVGTTST 
SHCPQPCIMFTiKADLNPVRCQFRCALAP 
EPYIQDPFLRDDVAFPCLGASAPWHLA 


6470 


20371 


2v 
Jt\ 








PGPQSETLTKKKKGGGRLKGSNFTPAGM 
BGNIFF/CFFLSKSHSVAQARVQW/P*W 
LTAGSLQAPPPRFKIiFSSIiL*VAIYASP 
TLLSS *DYRCTPPRLANFCI FPRDGVS P 
CRSCKAAVFNLLDTRDWFHGRQS FHRLV 
AGDGFRM 


6471 


20372 


A 


6525 


376 


180 


KCKKYPGGVAHACSPS*DYRHVPP/RPG 
YFLHF*QKHFGQAGLDLPTSGDPPASAS 
Q3AGITGVSHRTQPQLNYF 


6472 


20373 


A 


6526 


415 


86 


LYLPYDIAIPFLGMYPRGKKKCAHTKIC 
TRIFTWLFIIARK*KRSKNPSTREW/I 
KMWNIYTVESHSVIKRKELLIHASMIIiV 
H* /WMIFEfJIILSDRSQMQKITYCMVPF 
T 


6473 


20374 


A 


6527 


391 


166 


FFFFFRDWFSLSHPGWNAE \S *" LTV I SN 
SWSQARLIFKFLKLRQGLNLLTRLVSNS 
RLQG I I*S S WPAKVLGLQA 


6474 


20375 


A 


6528 


544 


176 


TWRYLLTP VGMAI IKKAK\NT* C* *GGR 
E/IGTL/l*QCR*ECKLVLPLWRTVWRFL 
KKLAIELPYAPPFLFSLEYGLFSLGSTN 
CAQLRSVYRKAVQNHQGVSGECAGFQDS 
SHSAPVAI PTARQ 


6475 


20376 


A 


6529 


392 


139 


TYVKMHCK/ WLGMVAHAWNPSTLGGRGG 
QIT* GQASTKLSHS WGLSRESRFWIiECY 
SRSCSFCSCRLPADCTSLTDSKQ 


6476 


20377 


A 


6530 


362 


87 


NGPPFFFFFGEIDKLILILLWKCK/RPK 
IAKTVFKKKNKVGGLI *HYFKAYYKGTV 
IKLVQFWHKVRKTDQ*SRRESPETELHI 
YVLVNARDSA ' 


6477 


20378 


A 


6531 


1 


346 


APTRPRLKiOXSKLKTTNLQLPYDPAFS 
FLSVYPREMKT*R\FAKICV*MFMASLF 
MIVTTQ/NVPRCPLMGEWLNKWR\ * FIT 
* YHTGMKNT\ VMHKTTWMNS QR IMLNE K 
SQSLTY 


6478 


20379 


A 


6532 


3 


294 


FFFFRDRVFTjYHPGWS AW* S * LTAALS 
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PWIiKRHSHLSLPGS *GHRMH/ PPHLGNF 
FV\FCLFL*RQCIAMLPRQVLRSWPQVI 
FLPWPPKAVLGLQV 


6479 


20380 


A 


6533 


2 


280 


PRYLVRNPTPTLGSCVTVDK* LSLWAS V 
LYIYIYTHTHTDIYRKI YAF/IFVCVYI 
YI FI * KHI YYKEQAHAIMDAEKFQDLPS 
ASWRPGKLVE 


6480 


20381 


A 


6534 


96 


183 


DQLNVFHRKLKRYDTYT FTVALLARHGG 
SC/LSLGSS*D*RYTPPLQTN+K*FFL* 
ISLPMUDWLVSNSWPQAILLPLPPKSSG 
LEA 


6481 


20382 


A 


6535 


2 


241 


GFKRIDSNFESS VGKVPTN/TI FCERKS 
LSMHQI /SLLSYFKKLPQLPQCSAATTL 
\ I S * QP YTLKQDPPPTRLQPAEDSDGY 


6482 


20383 


A 


6536 


485 


97 


PTTRP ILLLGFYTK*LKS \NKSDSCPPM 
FIATLF/ ILKCPSTDKWLKKMWYKHTTD 
YYSALKKGNHA/YM*QQMNLEGIMLSEV 
SQSHEGQLLHSTYVRYLK 


6483 


20384 


A 


6537 


411 


57 


LFPSHIKFVLSWQSCFAKTSGPQTPSFF 
FF*LFFF/RDRVSLCHLSWSAV/VSWLT 
VASTSQAQACLSLPSSWDYRCPPPFPAN 
FSRRGFTILPRLVSNSRAQE ICPPWPS K 
VLGLQRRGI 


6484 


20385 


A 


6538 


438 


201 


IYPKEMKT*VHTKTCI*MLIATLFTLAR 
KWKQTKCSSTDEWINKMWHIQ\TMEYYL 
AIKRNEAQLBHGEIIMLSERGQSQ 


6485 


20386 


A 


6539 


466 


131 


S FPKKGG YRAP P P P PP P FFFFF FFFVEA 
VLSLCCPGGSQTLGLKRSAHLVLPKC* D 
HRHKPPCLASN/SFFNRHRMMTNNCLMC 
IKSWCLCI INPFYHLDSSLLWMLSLI YR 


6486 


20387 


A 


6540 


419 


50 


PLPFFKPGPRIF/CLGAPKKKI *VPT/Ii 
REIKLFPIiKGPPLFFFLNKGPLCFPGRK 
GRGVLFFLATPKSWGQKIFPPSKEGASF 
PFFFPFFSW*RWDI,TMLPRLVSNSWPHV 
ILPSQPPRVLGLQV 


6487 


20388 


A 


6541 


1 


341 


S 1 1 KTAREKCLVT Y * GTS I RLTGDF * KY 
I /SAKILQVRRQ*NDIFKVIK/EKNCQP 
RIQYPAKLSFINERKIKSFPNKQKLREF 
ITARPDLQEMIKG ILYLETKE * YLPS * K 
HMKV 


6488 


20389 


A 


6542 


409 


290 


NCKQPTCPSTNEW/ I KL* YICITEYYSA 
IKRNKLQLHTTT 


6489 


20390 


A 


6543 


32 


397 


AMIVPLQASIrGNRSRKTLSLKKRKKKKG 
EKETLPGGST* KEKVRKKQNNLLV* KTH 
IKPG\AGAHPGNPNIWGGGGRQITRSGN 
KNHPG * HGKTP FLQ KXKKRGALLKGTNL 
TARGRQSKTF 


6490 


20391 


A 


6544 


402 


0 


LSWTQ *CKPIAPATGEGGTGGS LEPRES 
RLWCAM IAPVNA/NCTPAWVT 


6491 


20392 


A 


6545 


1 


365 


RKVTSLPSTSRCIjS ILPHFSDGPRPVKP 
LQ* * XiRCS ICS ASL PTHLHTARAAGLLP 
\ LASSTAVNQN* SRELGACVKWEHSGI K 
YGLF* RD/RGLTMLPRLVLNSWPQAILP 
GQPPKLLGLQA 


6492 


20393 


A 


6546 


1 


331 


I STFKS YLRNI FS KAIAAVDSNSTEGSG 
QG*LKTFWRGYTIIDAIKNIYDLWEEGK 
TSTLPGVWK\LDDFEG\FQTSVEEVTAH 
WQIAKELELEMDPEDVTKWLQSHDTN 
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nucleotide deletion, V=possible 
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6493 


20394 


A 


6547 


1 


417 


TRPKINKTKNLLFERINKI KKP * VKLTK 
NKNKNKTPHITKIRNESEAFTIEKLKS I 

lre\ym™cvynllyanrkprqmdkfle 
r ykqlkltqeetetltd\ yi tskeikfv 

I*NFSTK*S*GPNGFNSEFGRVLWFTPV 


6494 


20395 


A 


6548 


410 


288 


AHFLFFCINK\SLTMLPRLV*NMWP*VI 
FPPQLPKTLGLQV 


6495 


20396 


A 


6549 


1 


216 


IDRGFTMLPQLVSNSL\LKRSSHLSLPK 
CWDCRCEPPCPACFHFLSECNTCIKNQIj 
HVTLFFPNT*GLTSEG 


6496 


20397 


A 


6550 


479 


146 


VPGFKQFSCLSLPNS * KYQRAPPCPANG 
FAFSVETGFPHVGQAGF* FPTSGDPPAS 
AS QS AG I TGLSLHAWP \ENL * KLRLHAT 
GLRLHDHVPLKVPNYLKVCSFKFEIADV 


045// 


zt/Jyo 


A 






AO A 

*to*f 


SMRWNSMNVTIMPANTTS I POPTDOEVT 
SVFNS/YYYYYRYCKATAAIDSNIiSGGS 
RQSKLKTFWKGLT I IYAIKNICDSWEEV 
KIST*TI*KKVIPTFMGDFGGVKNSIK\ 
E ITADVAE I AR * L * IiEVEPRVGWLTPVI 
PALRKAEVG 


6498 


20399 


A 


6552 


304 


74 


HTGKAAKKKEC/WRGTVAHAYNPNTLGG 
*NGWIT*GQ*FETSLINMVKPRFYSCIC 
I SGYTHKVIHRI 1 I CKGKKLE 


6499 


20400 


A 


6553 


432 


62 


FWAGNSRMQQWCTDT/VSAHCSLDLLGS 
SNPPAS AS * VARTTGARHHARLMLITMN 
EKPLGARYCARIFFENKLGIYPNESVPT 
KTNANWSKPRSCFSHDPDPGHWIYXMS 
SGLIVNKIGKVK 


6500 


20401 


A 


6554 


2 


174 


YCNLCI*PGPSDLPTSAS*VAGTTCMCHY 
TWLIFFCI \LVETGFYHVAQAAKLFI YS 
I 


6501 


20402 


A 


6555 


3 


268 


DAWADNIKCR*GYGAI*TLIHCWWECK\ 
W*ATWWFLI ILNMLLPYNPVTMV\PRTC 
IQIFMSTLFIOTKTWKQPRCHLVGEWIN 
CGTCR 


6502 


20403 


A 


6556 


3 


402 


DAWTDNIQCR*RYGAI*TLIHCWRQC\K 
W*ATWWFLIIIjNMIiiPYNPI IMDHK\TC 
IiQIBWSTLFIDTKTWQPPKCHLVGEW/I 
KLWSMPAKE YHSWLARTEL/ TVP *NT*R 
RLSCT*LWEKSQYEKATYCWPTV 


6503 


20404 


A 


6557 


396 


2 


KKQNAFQSPSFLPRL+TPKITILISFHN 
IYIYTHTYIYTHVCIFTYIHTHIYIFF/ 
RPRFLLCLPGWRAWQSQLTAASTSQAQ 
GIPG*FFFK/SIFCRDW/SITMLPELVS 
NSWAQAIPPPWPPKVLWDYRCGS 




904AS 


A 


6558 


393 


175 


FKSQGLVLLPRLE FS GMLXAHCS LKLPG 
FSDSLNSA\SQ*LGL*GCTATPGPYIiIjL 
IYAGELGLNKRSISSK 


6505 


20406 


A 


6559 


401 


188 


SPPEPFILGGPRVFPPPPFLKPPPRI*F 
WGPKKKKNF/CPPPGPKNFFF*KGPPLF 
FFFFFFFFFFFFHRSS 


6506 


20407 


A 


6560 


37 


343 


YEPAWIjLDIYPKELKT*AHRKTCTRIL 
IASPFKTVKMWQPPD/CP*ADE/WDK*T 
ACLQTVB*CSARKRSELSTM/IEMWRKL 
KCVALSERGQSEQAACYVIPTL 


6507 


20408 


A 


6561 


792 


141 


LFPPPP/YLSTSPNAGA*TTSPLPPQFL 
YHSPRRQQSGTKPPFFSKK/PEPPSP/S 
PNKFYPGPWWGVSPFKFPPPPQKS*GPK 
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Q = Olutamine } R—Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














ksplppeap/pappltqdcppfvsnpgg" 
kkktpp/sakkkkdptps/psppprh* i 
ggppgfsppplfktpppkliiigppkkkk 
ippppggknfpflkgpppfpffffpfff 

FSLLFPVGVWSRWLRSEAADLCGECYS S 


6508 


20409 


A 


6562 


1 


356 


FFFNFI FIYLF /NFLRWSLALS PRLNG V 
QWSDLGSLQAPRPRLMP FS CLSLPSSRD 
HRRPPPRLANFLYF* *RWGFTMLARMVS 
IS *PQ/CDPPASASQ\STGITGICHGTR 
LIPRNPMIA 


6509 


20410 


A 


6563 


416 


84 


LSLPSSW/DYRRPPPCLANFLYF* *RRG 
FTMVS IS * PHDRPTSASQSAGITGVSHH 
TQPI ISFLKSFFFTLSNRFNRVCIDSEL 
LLQCKDFYLQKFNLHIAFQEYTHCHGVH 
Y 


6510 


20411 


A 


6564 


2 


51 


FWREMQIKTVLRYFSLKLVKIBCK*DST 
FFWKGC/GETGNLIHKANW/R/SI*QYL 
KKLYVHFLFVPVI PPLEIYLKNKKILST 
K*LFAVFIVTEKC* KKKRCG*T 


6511 


20412 


A 


6565 


2 


<\ i 


VRW /SCAGL /I\HTAV* T PGB T TT<?RKT , A 

»*^»»/ «awuJiJ/ lilll/iv XrVjKll X oui\Ln 

QQVDETHRN/IiQRLQPALINRKGPILLH 
NNTCLYTAKPMLQKLNELGCEVLPHLSY 
SPDLSPTNYFFNHLDNVLQGKHSHTQQN 
AENAF*EFVKF*SMDFYATGTNKIjT^ /h 
WQKCIACNGSYFD 


6512 


20413 


A 


6566 


3 


248 


GD I CT \ SMLVP ALFTVP K/ TWNQPVS I S 
K*TDKMWYIYPVEYYIAIKKTKFC/SFS 
ATWMILEDIMLSKPGTER* ILHVLYLCG 
R 


6513 


20414 


A 


6567 


3 


258 


WRDHSSLHS*PFGLKWFSHI 1 SPLSNWDY 
RDVPACPANFEKVL*K*GLTMTjPRT,\ T.T, 
SSFPEW* ILSSRLPASSFQSVGITHMNH 
CS 


6514 


20415 


A 


6568 


392 


166 


GGGGPPRGGGLEPPGPPPFFLKNKK* /R 
PPGGPGPLFPPPGGGREEKFFYTjGGKNF 
H*TRVGPPPLSGGKKKKIRG 


6515 


20416 


A 


6569 


234 


411 


PGAVWHAYNPNTIiGGQGHR I S * gqe fkt 
S/LGNIVRPHLY 


6516 


20417 


A 


6570 


1 


175 


PLQHV+NKFQTMSDQI IGKIDDMNTLI D 

leknvkdlXlitqteveelegenkipil 

PG 


6517 


20418 


A 


6571 


392 


140 


LPSSWDYRHVPPCPANFVIitiVETRFLHV 
SQAGLELPTSASQSAGITGVSHQARP/E 
* *NS IRKGATTIQRDLGPVLQVR YLVRC 


6518 


20419 


A 


6572 


2 


228 


RTCTQMSVATLFLIAKNWKQPRCPSIGE ' 
WINKL * YI LTME YYLVI KK\NDFSSHKK 
TRGNPKCILLSKRSQSEKA 


6519 


20420 


A 


6573 


412 


100 


ALHSHPLGGPGQGVPKSRI PNPPCPQGG 
TPFFPKNPKLTRPGFARLPFPPLKKIjKP 
EKPPNPG/MPGVPLN* IPPPVSHPGVQP 
NFFSKKKKARGFCLVLNPSSS 


6520 


20421 


A 


6574 


406 


257 


NKRB KKTRKMLDS I VIRKRH I KS QMRC Y 
SAPIQMAKV\SNRQSPKFWVRCRQTISL 
*HC* *ECKIVQSMW*TLQKFLPAIVFIiC 
IYHTEKLRFTQQP /CT* IFLAALLMIV/ 
RTV/KRLVCPSMGEWIiSKLW*VNLSFSV 


6521 


20422 


A 


6575 


370 


68 


EQGSVGMRPSEPGPGYNLLVCR*LRPLE 
KRTIWAGVS* FSRCSLSRLPLARKGKSP 
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Amino acid sequence (A-AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glu tamtc Acid, F=Phenylalanine, 
G=Glyctne, H-Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, MNYIethtonine, 
N=Asparagine, P^Proline, 
0 = dutamine. R=A rm n in** S=Serine 
T=Threonine, V=VaBne, 
W~Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NP \CTS * VRFSNGLS 






j± 


03 tO 


i 




IDLIDVLRPSLTVT*AWQWCNtiGSL* P 
* P PG \ SGDP SASASRVAGT I GQHHHRWL 
IFVIFVEIWFRHILVYFHTAIKNYLRLG 
NL 


6523 


20424 


A 


6577 


418 


218 


KGKTRLGRWPPCNPSTLGXGQGGWIT* 
GQAFRTSLANIVFPLM 


6524 


20425 


A 


6578 


I 


401 


EKAGCSGSYL*P*HrcRPRQEDHLTIiGI 
♦DQPGQCSENLSLQKNF/TNI SWA* WPG 
TVAHACNPSTLGSOGRRIT 


6525 


20426 


A 


6579 


3 


356 


HWDYRHEPPFPV*MHFFIWLPGCHSLLA 
SPNLTGCFFS I S ITESLLSLF I YLETRV 
*DLQCSYFWERRG\SLCCPCWSRAPGLK 
RSCCLRLPSSWDYSRLTPPHLANYCILG 


6526 


20427 


A 


6580 


411 


158 


IGSVAQARVQWRDI*SSSDPSAN*APPGL 
KQSSCVPASIiPSRWDYXNKCTPPDIiANF 
YIFFVETGFPyVARLADCFYPPPL*KMT 
GSY* VFTMCQI PH*AFYKF* FI * FRVGE 
DKNSLPAWQHRETLSLQKKYKN 


6527 


20428 


■a 

A 


£.COt 
Ojol 


too 
jyy 




GG AKIjKVPK*GGGPGSP PI*P PPIiOKPRP 
KGPLWPGG*GPPRPPGEAPSPLKNKN* / 
PLPG/ AGPPPVP PNWGGG FYPRG * GFG * 
TiPTiGP* TTARGKDRDPVFKKKKKN*NVN 
KKKAKL*KS*QFFFFFLKTGSRCFPRAK 
VQGPNGSYPKPQPRG 


6528 


20429 


A 


6582 


392 


214 


GSSDPPNSTSRVAGTTOEVLPCSPGWS * 
NSW/VSK*SACLGLPKCWDYTREPQHLA 
RSNF 


6529 


20430 


A 


6583 


390 


257 


PKRPPPPAFFC*PK\SLGMLPRMISNSW 
AQAI I *RWPPKVLEKQA 


6530 


20431 


A 


6584 


65 


251 


HYFNSFNFFLMGSSNHPTKNPG* FFFFL 
TFYKNK/ SLTIIiPRLVLNSWAQGILLPW 
PPKGLGL*AQGILIiPWPPKGLGL 


6531 


20432 


A 


6585 


2 


143 


RHMPPCI*SSF/SYFIa*RRGFAVIiPRLFS 
KSWVQVILMPWPPKVLGLQA 


6532 


20433 


A 


65S6 


142 


334 


GRWItSQLLILNFTTRSVLKQMSW/MACV 
SETWPA/STLGGRGGRIT*GLKEFKTSL 
QNMARPYLYK 


6533 


20434 


A 


6587 


485 


377 


RTRLSCRFFILSLIFRPGNFYCFTLKF/ 
H**VFLLSFVFYC*GHP/FEFIFKVFFI 
IFFHFKVFICVFKYLIiFFFILIASTFYF 
FVNI FSFKKFVSVMFVLLIBMFYYGRFT 
IPLR*F 


6534 


20435 


A 


6588 


241 


251 


NSLFTKKKKKKGGAIoLKEP I LRAGGGEG 
IIFFKGPQKSIPGPGFKKAGREKPGPSP 
*NWFF*KGPPFFFFFFCKE/RSFTMLPR 
PASNSWAQAICLPQPPKWGL 


6535 


20436 


A 


6589 


429 


54 


YFLFLPSLPTFFPSFSSFLPSVLSFFLP 
SI/SSFLPSLLFYILSSPPLSSPPSSLF 
LSFFLSRVSFCHPG* SAVAQSRLTATS I 
S WAOAILP /CR /NRDFTMLSRIALNS * P 
QVTRPPRPPKVLGLQA 


6536 


20437 


A 


6590 


411 


137 


KCIGTITTHCSLHIiLGPSDPPTSASRW 
GLQAHHHIQLAC\F*LID*LID*LNVEM 
ESRHVAQAGLEUjGS SDPPTLASQNAEI 
TEMSHCIF 


6537 


20438 


A 


6591 


412 


196 


QPQPPGAKKTSGGGFPIR/WGHQPPPPG 
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OCysteine, D=Aspartic Acid, 
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G=Glycine, H=Histidine» I=Isoleucine, 
K»Lysine, L=Leudne, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutaraine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *«Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














AKKYNFFFFFFW*RQALAMLPRVVPNCQ 
T*AIILPWPPKVL*LQA 


6538 


20439 


A 


6592 


28 


400 


IMPNCFPKWLYITFFFFFF*RQSIi/DSV 
AQAGVQ *HSLGSTATSAFLQFK* FSCPP 
AFLB * LDYRHVPTHPTNFCVFNKD/MGF 
TMLARLVSNS * PQ/CNPPASASQSAGIT 
GMSHRAQPHLFVLKMEKPPPNFLFILLM 
NQSIHIYIiLNFYRF 


6539 


20440 


A 


6593 


414 


23 


QTHLFWFPPGFSPPPFFKNRPRIFFLGA 
LKKNFFFPPPRHKNLFFLKAPP FFFFFK 
RSFFPPGWCAIGLFSIiQPPPPRFK*FFC 
PSLPKKWGYRPLPPRPGKF/CFFFFFVI 
LVENGFHHVGQDGLDLLIS 


6540 


20441 


A 


6594 


91 


274 


ILHWS * PLGNYHL/ WDGWAHARDLSTL 
GGRGGRIAGGQDFKTSLGSKQGS ISARK 
KEKKKK 


6541 


20442 


A 


6595 


391 


251 


HLKRCSTSSVIREVQIKTTTTCHFSE* K 
R/ SEINMCW*ECGQTEIiSALIiIRC*\WD 
SEMVQPLWERKEYLKQISTESSWAPAI P 

lldi*p*ercaslslrkgwgyrpeppel 
asfllgsyi 


6542 


20443 


A 


6596 


465 


268 


I LLCHLGCSAVAQD \LTLCSLKR5SHLT 
SPSSWDSRRGPSHLTKFF/C* *RGIiNTL 
PSLV*NFWAQAL 


6543 


20444 


A 


6597 


3 


229 


CHGSPQPCPPGLK*SSHIiNLPTPK*L\D 
yrgmp PRS ANFLFL * RWGRCVAQAGLEL 

pdssslpasascprqpvlyf 


6544 


20445 


A 


6598 


122 


418 


I ivnelchimcfffffffkqnllgtqag 
gqggknng*lkpkpprlkgspwi*tlpgg 
wnyggfpp/hps* ffffflgkkgpyfcp 
paggkffnlkgppffa 


6545 


20446 


A 


6599 


1238 


930 


fetdscsfa*agvqwrnlsslqtppp\g 
snnscasas/riagitgthhharvifal 
lvetgfhhigqgwsrtpgpqairppqpp 
kvlgitgvtsraraiskhfv 


6546 


20447 


A 


6600 


398 


78 


PSSPFYLFIFRQSFPFFARVEEKGGHPG 
FPNPPPSGLKKFFYAPAPPGSKAPGQGP 
PA/RFYLFIFRQSFPF/ CCPGGGKRGPS 
GFPQPPPFGVKKIFLCPSPPRK*GPRAG 
PPRPRKrFFFFFSSLFFFFFVRWGFTOL 
PRLVSNSGAQAICPPQPPKVLGLQV 


6547 


20448 


A 


6601 


3 


473 


FPKGKKKGPPPPAGG/QLKNPSKKPNKK 
PGAPPRGPRGPKKGGGGLGSPPQSPQGG 
EKWTLKGGPNLPPPRGGLGGARPPPKIG 
GGPLFPKGGGPlCGGTPPPPLFFFFLRRS 
PALSPRLECSGAI*AQTSS/SPPARVGC 
PSPPAPTRPTGLTVSPGSN 


6548 


20449 


A 


6602 


398 


156 


NGRKHTDVKPEELPSWVPTRDLTPTGTA 
GAF/Q/RGYYGYYNKYVTVKKGSIAGVS 
TVLAA*VLFSYCLSYREIKHEQLLRAH 


6549 


20450 


A 


6603 


380 


77 


PFPKKSPPL*NFPPFF*KKKVPPPPPGK 
LPLSFLFFFFFF/RDRVLLCRPGWSAMV 
HGHGLS PVNFCI STRNRAL/ SMLPRPIS 
NTWTQAIRPS * PPKVLGLQM 


6550 


20451 


A 


6604 


352 


204 


KRGPKI *NFFFG*RQSLHMLPRWPNSW 
AQMILPPWPP\KVQGLQA 


6551 


20452 


A 


6605 


631 


357 


FFF*DRVLLYHPGWSAAAR\SLLTTTSA 
SWV\KQFSCLSLPSS*DYRCAPPCPASF 
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X^Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CIFSRDRGFTMLARMVSNS* PQVIRPPG 
PPKVLGLQA 


6552 


20453 


A 


6606 


3 


397 


VLHKLTQRCSAVPIiRAPTAFLEIGTQTP 
KRVWKCR * P /RETQNNIiLDKEQDFKTYQ 
KATVTKAVWCQHEDTIEQWDGIESTEKK 
FHTQRAGGRDHSHSDSVRKKRKESLFFF 
LKRGLILFPNLBGTRGGISTP 


6553 


20454 


A 


6607 


323 


35 


YLKTKIGKPGGI*AAIjQFLGPRPS!CNTQL 
GNRFMCTHTHTHTHTHVHPHTSQYLYPH 
RSISFCSYLDSI/SP*VLTESSNSNPTP 
LKCC*HFPSPCL 


6554 


20455 


A 


6608 


439 


252 


LVPPLFF PRSFLLFS PPKKRGGPKPV* F 
GPPRAFYPPPF/V*RGPPPFFFGAPKKK 
IFFSPPPG*KCFFF*SPPPG 


6555 


20456 


A 


6609 


3243 


2602 


FFFGRDGVLLCCPGWSQTPRLKQSSSIjS 
Ij\PKC*DDWH/RATSSGLQVFLSICCSI 
I S IFFMPS ICDKSTRQFSN / PQRAYILV 

g*r*viylgvvlaflfvflrqhl/ntva 
qagvqwcnlsll*ppparfkrfsclilp 
rswdhrrlpprpanffvflvetgf\til 
arlvsns * prdppasaslsagimgvnhr 
twpilvflhpsvffkvlqlfyl 


6556 


20457 


A 


6610 


81 


395 


TWAALKIjYSTCIFSSSETEPDVSQSQIT 
DTTACIMEPGLFW/YRSI IWTL* FFHFF 
WRDGGLTLLTRLVWNSWAQVISLPGPPK 
VLGLQASATAPNSFILIREGIS 


6557 


20458 


A 


6611 


1 


126 


GRLRQENHLNPGGRGCSEPRLHHCSPAW 
AIERDSVSKK/NKNGLKRCFHLRKKTLK 
GEKRIW* PGGRGCSEPRLHHCSPAWAI 
ERDSVSKKKKMD 


6558 


20459 


A 


6612 


389 


67 


YLFMYLF* /DRVLIiCHPGWSAVAQS*LT 
AASNSWPQVILLSNRFKSWTVLLbSSKI 
PWQGSTPCPRCYSWHPKCLQLQGLLGL 
DCSSPKYPRGPSLAAFRSLPKWHV 


6559 


20460 


A 


6613 


t 


384 


RCFKVKDLKMRPFWNIQVDPKSNDKCPC 
KRLKRRGHASLPPWEKRRRPGKDGGRGW 
NSAVTSQGT/VEATRSWKSQGSVIiP*SL 
*RERSPV\ELAHF*QRIiAMLPRLVLNSW 
PQDISPPWPPKALGLQEP 


6560 


20461 


A 


6614 


385 


174 


CLRLSALKKPKLNS WY* K* LYIMEY* SA 
LKT\NKG I LSFAI TWMNLDD I LLNE ISQ 
TQKDKYHMNSIiICEI 


6561 


20462 


A 


6615 


1 


370 


MNPRAHLSFIiLRRKERRXLWLKALGHIM 
/ FTV*GLTGIALANSSLYIVLHDTYYGV 
SHFHYVLSIGAGEAI IGGFIH* FPLFSG 
YTKKKKKNIRGGGTKNFLLCPALVQREK 
VNPFFALTNLWG 


6562 


20463 


A 


6616 


3 


342 


YRHEPPHSAI FAQLYKL* K* IiIYALKTG 
KVYVYKLYIiNKAIKSQ\SYNRASLSLT* 
* PRVKRKRS * PGTGAHTCNPSTLGSQDG 
h IA* VQE FETNLGN I AKPCLHQKKKKGA 
PV 


6563 


20464 


A 


6617 


1179 


915 


FFFETESCliVTQAGVQWRDIiGSLQSPPS 
G\SSDFQLI FVLLVHGQAGLELLI S * SF 
ALASQTARITGVSHHAQLVWSDFIKSIN 
LFCCL 


6564 


20465 


A 


6618 


3 


390 


HASVKMELP YDAAI PLPS I YPKELKARF 
♦DLCIPM/FITALFTIAKRWRQLNCPSV 
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(^Glycine, H=Histidine, I=Isoleucine, 
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VI RS WGREGVKE S CLMGME FHSAR * KS S 
GDTFHNTENCY* TVHLKI VSGKF* * L* C 
VXjJM x Vv»jf c xxuNJSX* LiUClia 


6565 


20466 


A 


6619 


392 


49 


QFLFERLFSSYSGYVSGVKMLGSTEVQL 
FPRKCEMKSCSVMPRLECSGGISAHCNL 
CLPGSRDCPASAS * VAGLI FY / 1 FLVEM 
ninruTVi /dt?t \/t tvtg * povt p oxrnD Dxnrr 

LxrrltlJLXj/ K-Ci-Li VJ-ilM J^F^VlKirii^irir.K.VJb 

VLHV 


6566 . 


20467 


A 


6620 


193 


282 


TPGLK*SSCLSLQNSWDCIWDYRRLPIiY 
PVNCFCC*FVCFVC/IFVEPGSCIAQYC 
LELLASSNSPMSASQSTGI IGMSHGACL 

TT T» 

XJjI 


6567 


20468 


A 


6621 


383 


8 


UJDTVSTFFF*DRISL\CPSWSAVARSQ 
LKAASTSWVQAILPPQPKTNLLGSSDSS 
TSA*DQS PGFK* SFCFRLPGSWDYGCVP 
PFSANF* IFCRDKLSL/WAQAILSPWPP 
K.V Iaj liyALiWllAE eajIj 


6568 


20469 


A 


6622 


395 


42 


HVSSTS FQGDELHIiRIVALPLP \PLHNL 
PLPLSPRSL*C/HSRGWTVLPPSRLTAT 

WSAVA*SW\SLQPPPPSRLPWPPKVRRL 
QPLPSRHPV 


6569 


20470 


A 


6623 


3 


327 


PSPPSRWNPKDVPPNQENLGIFFFFL*T 
RDLATLPRLVPNS * TQVI / WPPKVLGIiQ 

A 


6570 


20471 


A 


6624 


395 


47 


KNRALFFFPP PRKRGFFPKPL I WGTPRF 
FFFFi*.K*NPFFYr r FGAPKKKFr FS/SF 
PGGKIGSFKRPPPFFFFFKQNFFFVPPG 
GGPRGDFGSLQHPPPRVKGFSCLAP 1 1 F 
FSSF 




20472 


A 


6625 


100 


267 


uSIiSKKANiSKr WliCal \VAHAV\DPS TJU 
GA*GRWIT*VQGFETSLGKIVRPCLYFC 


6572 


20473 


A 


6626 


409 


126 


PSSSFSSSSSSSSPSSSSSSP/PFFFFF 
FFFS FW * KWGV S LCGPGWS QTLGLKQS S 

/~rr or TjTr^tmvtannrTST t_rr 71 ttt xot tt ninn 
V-Jj£j JUrKCWDxRNKFr iHI lAFIiASIiIIiFTF 

QEWDITD 


6573 


20474 


A 


6627 


506 


323 


REAPGTIilPCWQKYDMIKAQLVWQFLRK 

DNTVT.PVKTPATPT ♦VPfPP'E'TWTVYVP ff* 

FITKTCTRMFIAALFIIPQNWKQPT*PF 
CPFTI ISHSCSPRDNWF 


6574 


20475 


A 


6628 


330 


117 


DVCPRKI YTLVLQQLCI \RMFTAILFS I 

nli* r VwAxTl^XOJi UEiTO JLAI\lJ H 0XZtA loin. 

LVMRRNKPLLYSMPQM 


6575 


20476 


A 


6629 


396 


139 


FFSPPEKKGVFPSP*NLGTPGFFPPPPF 

kkrpp*fifgapkkkk/hispppaekff 

PTSTfPPPPT.Pl?T3 , T71?VT7l?T7T PPl?T?l?Q2il?V7. 
re JMUrr* fj-iE r ccsc rr3f Lr r r r roAr iUj 

KV 


6576 


20477 


A 


6630 


79 


405 


VSGRMTALESYLLASSA\RNKCSSNCKI 

AHKFETSQGHI IRPCLYKKKKKGRPFKR 
TQFSPARWQGNGFFWGTLKNSRAGF 


6577 


20478 


A 


6631 


2 


417 


SFIFPCKFLNQFISFYKNVCHDFDCNCV 
ESKDQLG / WPDAVAHTCNHNTLGSQGGW 
IA*AQELKTNliDNMAKPCLY 


6578 


20479 


A 


6632 


74 


417 


KTVFRYNGWAL*ASENRISLADLLLPSF 
HVLLCLPKAG\PNFPLPSS IKKFFDLPD 
LIRSHKTHISEGVLFPFP/RRKEPRTER 
PRRI*TDRSCWVSSLSLIVLDHAFFVQS 
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0=r2hitflminp R=Arainine. S=Serine. 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QFYM 


6579 


20480 


A 


6633 


740 


776 


CSFFLKKQTRSRAAVQAGVQWCHYSSLQ 
PSPPDLKQPPASASQTV/KD*GAYFWCC 
FFLFLKRWGL/NSVVQAGVALLPDLPVV 
RRT.VTPIiSLWAVEPALWAAAOL*LL / S P 
CPLASQSPRHQTGQSPHQKCFPDILGLH 
LALRSSFSPGLERPTLRLPPGLASFHT* 
ATWLSWPLPYK/PPKRVNMYLVKADRPE 
PPT, *GTI?GR KS VMOMTHAKTLOETL YSG 
PEAQAAASVTDEF 


6580 


20481 


A 


6634 


2 


296 


FFFLRQSFALWQAGVQWHDLSSLQPPP 
PRFK*FSCLSLPSGWD*RHAPPRPAIFV 
FLVEMGF \TMLATMV\* IS * PGDLPTS A 
CQSAGITGLSYRARP 


6581 


20482 


A 


6635 


407 


212 


LGGRDS YSQKKP FFF * RNKTRPGGGG P P 
GIPPPliKG*AKKKGLPRGPKFPVT* /NK 
PPPPPPGGNNQNPFPKKKKRKSTR* I * K 
IRISRFILYFFFFFFLERGFGCSPRGGG 
GGAFLSYGKLWPPG 


6582 


20483 


A 


6636 


20 


IZ5 


PCKKT.PO PliOPS AT \TL I S OOPLTLRKD 
LLPGKRI.*LABGSDDSYQFLVA!CYFKIK 
GLSFGRNKLANI 


6583 


20484 


A 


6637 


866 


647 


FFLDRVL/LCHLGWSAVMRSWLTAALTS 
GLKQS S CLSL PKLRDYRHE PQHQANNI S 
* SSWSDS NMQPRLKTTL 


6584 


20485 


A 


6638 


398 


26 


KGAPPPNIFFFFLFNIFFFFFFFFFFFF 
FFP INFLQCLLYYRALHFNKVKVTNNF/ 
PLNFCAYYHPL*NM* FS YIS FQNLYKFF 
LI /FGVFNLSGI YFNVL* IANF* *G* LI 


6585 


20486 


A 


6639 


1 


341 


NWDYRHALPCSANFFILVETGFCQLPRL 
VLNSCPQVSHHA\KFLFIFFFLFFLEEG 
GFVGPPRLEYREGSLQP+LPKLKGSTCL 
TTiPKG*N*RGBPLPPARSSFFFISKFFI 
ET 


6586 


20487 


A 


6640 


391 


118 


NANFEKSFSMGKIL*NSLAWYKKILCD/ 
RKNRSMQKTPLD* * YYFKKLPQPL* PSA 
NTTLIGRPALTSRQGLPPAKTL* FAEGF 
DPMILSLF 


6587 


20488 


A 


6641 


352 


285 


KKKKKGGGGGAYSGIPTF/ CFFFLNGVF 
LGPAGVQWQNQRSLQPPPPGVKLFFFLS 
LPSH*GPRHGPPRPGMFIYKNF/CFF/V 
RDGVLIiHCPGWSQTPGLK 


6588 


20489 


A 


6642 


433 


64 


" TGNFFLDSLFYFMPGPLGGGGILKGFFS 
FFFFLETKFFFFPPAGGQWRYFSSL*PP 
PPKFKQFFFLSM/WPRPGKFFFFFPGIL 
VEAGFHRVAQAGLKLLSSGNLPTSASQS 
AR I TAKS HRTVF 


6589 


20490 


A 


6643 


424 


11 


PPSIFLAPPPSRGGGPPPPPKKK/QSPP 
KKKKKGGGGGPKNPPPKKKGPPKKTTGF 
KKNPPQKKKKKNS PPKKKGGPPANK* KP 
PPRPQPFYFLTFYYKI FIYI FLLKFLLF 
YILFFFFLKSSKQGRPLAI 


6590 


20491 


A 


6644 


411 


2 


PPLRIWLASPPSKIF*PQPLHI/YFSPF 
PP*KFFFSLKPLIFLGGFFPKFPPPKKS 
FFPKNPQGVF*NPP*KKKIYFSSPP*NF 
APPKIFLKTPPPFFFFFFFFFVEMGSRY 
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W=Tryptopban, Y=Tyrosine, 
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nucleotide insertion 














IAQARTRERTRGRTRGRTRG 


6591 


20492 


A 


6645 


34 


381 


AKLGTRGAAKQBKE I KVTRI *WKEIELS 
QFTE YTQKNL /NVTDKXiL* LVYMFSKIT 
GYKVNLKNINCIFHKHTNQQQKILKMPF 
I IA* K/TIK*L* INLTKDTQDLYAEI YQ 
TLKAEL 






TV 




1 / 


a a a 
«t4o 


PTQASAMADAPFPARLPPNSLISYCCTS 
wrdc; <? vflvnP2\T?pnan vnt .v*ttt .t t p pt .r 

KHSI *VAVSRFSHYSLSRLPLARKG\NP 
QTPCASWVRQCPTLLRLALHGLHPLSNQ 
SQGD 


6593 


20494 


A 


6647 


462 


336 


SLPXPGSNDSPASAS*VA\GITGPCHHA 


6594 


20495 


A 


6648 


447 


62 


KDDIWIAITHMNRWSISLAKRKM* IKIT 
MKYPYTSIKMA/IKQTNKN*KKSSSIKC 
W*ECRTMRTLIHCW* * HITG* PLWKTVW 
KFIIKLNLYLPYDPA\TCTQMFTA/ALF 
I IIKY*KQSKYPSSGYCVPA 


6595 


20496 


A 


6649 


3 


1492 


LAYAPJVFIiDDTNTKKYPDNKVKLLKEVF 

MKANYREMAKELSEIMHEQI/CPPGGED 
ERLTBFPSHLQRHQWDSNKHNGSRDDQL 
VFGRVTTSHGPCVGTCFVICKLRMIjSKA 
TJRWnDRAn /PRTPfnimRRRPr+P'rTTiRT.W 

R/RFYRSWKKVHI ff 


6596 


20497 


A 


6650 


495 


170 


gaaefpasqsagiagvshharleeffsl 

DQnPVMPnTTrUT.lTVT /WVTTTT * Q TPT7P 

FSFF\WPGLGSLQPIiLPGFK\LSLLSRW 
DYRCPAPHLISL 


6597 


20498 


A 


6651 


9 


222 


QTTRPERRNSGDPTHLSIi*SSWDYGHVC 
HHAWLIF*K/CFSGGLALLPRLV*NSWA 
QAVLPPWPLKVLGLQV 


6598 


20499 


A 


6652 


241 


833 


KKTPLAMWNPWPGPPQIPHPLTKLVQMA 
G / QENGCLSGHDLE S I RPAGLHNSARGE 

VT/TT.ig^QT/^Kfc'l .WTi^RFT.nT.QPTTTTPP 
V JJVJXJOOOXJVaJVEiJjV J7 iJ^C)C<i-uJlJOi!iXXxXzra 

AQEVBMASGHFAFLHVPVPDGRAPYCKA 
SLSASSSLSPTPPEDTAISSLRPPSAPE 
MLTQHGAQEQVEDHPG/Q*QPSPHSQSR 
PSPQKDPQALVIASLRSRTERKQASHGG 
TRPG 


6599 


20500 


A 


6653 


1 


398 


FFFDIGSHSLTQAAVQ*HDHGHGS LHPR 
LPGFK*SSHLiSFPBS*DYSRHTPPRPAN 
F /EVETE FLHVPQAGLMRCNLLPAT /AT 
SKSARITSVSHHVQPEVSIKW*NNISNL 
IKCFLLFYKYKCI FINTKFVFTV 


6600 


20501 


A 


6654 


489 


282 


PAS/WDY/ RHMPPLLANACAFLC*RDFS 
MLPGLVSNWWARVICLPWPPRVLRIQAR 
ATXPG 


6601 


20502 


A 


6655 


953 


908 


SQSCSVS \RLECSGAI SADCNLCLLGSS 
YSSTSAS * VAGTTGARHHTQLI FVFLVE 
MGFHHVGQPGLELLT\ * VIHLSRS PKVL 
GI**APSSARGQNSVSQ 


6602 


20503 


A 


6656 


412 


44 


RIS ERRKSLGAVHTKDKKYSS SRVRTSN 
PNCARSGASS KQS FLFGKKRAPHPGI KS 
F\KDALLWDGITGVHHHA*QI FVRSRE 
R* IRPVIPALWEAEAGGSRAQKIETI LA 
NTVKPRP SLLRA 
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N=Asparagine, P*=Proline, 
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W=Tryptoptaan, Y^Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


6603 


20504 


A 


6657 


3 


241 


FFIFLEARSHSVIQVGMQWCDHSSL*TQ 
TPGLKBS \PASSFHRAGITGVSHHTYLF 
L*GH* SHSQGPHPHDLITSKRPCS 


6604 


20505 


A 


6658 


467 


42 


FFFFFFKQGFGLFPQLEGKEGIPGPCNP 
CPPGANDSPPSTPLGNGGPGGPPPNPGN 
FGIFGI*GGEGGF*TPGSGKPSGPGSPK 
\IPFCWVIRERPHGPPPRPFFFFLRQSL 
CFAQAGVQWCDLGPLQPPPPRFKQFCLR 
NS 


6605 


20506 


A 


6659 


2 


286 


KNLKINWT * WLMP WSAT/ WGGS LKPGR 
LRLQ*AVIAPLHSSIjGDRARPYLSVSQY 
LYFSLSLSLSLSLTHTHTHTHTHTHKIE 
LLGCVAEIIHFF 


6606 


20507 


A 


6660 


3 


249 


LSPRP*CGDTISAHCmHLLGSSSSPAS 
VSQVAG ITGACHHAWLI FVFS VEMGFHH 
AGQDCLHLL/NLVrHPPQPPKVLGIiQA 


6607 


20508 


A 


6661 


3 


1507 


PKAPVPFIiDSNQENDLLWPEKFPERTTV 
TELPQTSHVSFSEPDIPSSKSTELPVDW 
SIKTRIiLFTSSQPFTWADHLKAQEEAQG 
LVQHCRATKVTLPKSIQDPKLSSELRCT 
FQQSLIYWLHPALSWLPLFPRIGADRKM 
AGKTSPWSNDATLQHVLMSDWSVSFTSL 
YNIiLKTKLCPYFYVCTYQ FTVLFRAAGL 
AGSDLITALISPTTRGLREAMRNEGIBF 
SLPLIKESGHKKETASGTSLGYGEYVTK 
ITL/SSSTDLWTHEIDLHRNSL*NRDSN 
CSNF/ LREQAI SDEDEEES FSWLEEMGV 
QDKIKKPDILS I KLRKEKHEVQMDHRPE 
S WLVKG INTFTLLNFLINS KSLVATSG 
PQAGLP PTLLS PVAFRGATMQMLKARSV 
NVKTQALSGYRDQFSLEITGPIMPHSLH 
SLTMLLKSSQSGS FSAVLYPHEPTAVFN 
I CLQMDKVLDMEVVHKELTNCX3LHPNTL 
EQLSQI PLLGKSSIiRNWLRDYIYNWRS 


6608 


20509 


A 


6662 


326 


1 


IMIFLFFQTGSRFVA\RLECSGTITTHC 
SLNLPGSSNP\PTSAS*WTHCRTRVIL 
NFSVDPGSRHWKGGLELLDAILKPWPP 
iCVLRTRGSTPEFRSGGSGVRIRGFN 


6609 


20510 


A 


6663 


414 


105 


QGKVGFKVGI1NPLSRLLHFFKQKLFVI1S 
*KGQVKKLSLKIPFPPGGFFSPPI *NPP 
P*NKPQRFKNPFMGFT/IRKKILFLAFK 
STKSGG*NPPSK/HKKI*FSF/ CFWETE 
SRSVTQAGVQWHYLSSLQPLPPRFK*FS 
FCKPHKRVFKPLGLVLWGG I SNRRRKKT 
SGGKRNFKA 


6610 


20511 


A 


6664 


1 


407 


GCVPWPAAREAEAG* IA*TRET/RLRW 
AGIVPLHSSVGNKSETLSQKKKGAPSKK 
SLQGVALFLSQLGKLPPFLQRKKRPQRG 
GGRGQLAPGYNPPAGKTLVGPGPHLSFC 
SKKAWKELRPWKKPRGFWFLIKST 


6611 


20512 


A 


6665 


1 


280 


FFFF*HRVSLCCPGWSTDIiHIiIiPP*IiAR 
FKRSSHIiSLPSSWDYRHTPT/HPGFFVC 
LFVFCRDRVS FVLPRLVSNFWAQAI LLC 
GDRKVMGLQA 


6612 


20513 


A 


6666 


408 


89 


WLPNSLILPN*NYFVSYLSDRWSLCRP 
GSTAVAQ *ELTA\A* TTXjIiKRSAS FSHS 
HSCSLPSSWDYRCTSSHLANFFFKRWGF 
TILPRLILKTSWPQVILLPQSQRH 
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E=Glutamic Acid, ^Phenylalanine, 
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N=Asparagine, P=Proline, 
Q^=Glutaroine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X»Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


6613 


20514 


A 


6667 


484 


208 


ITWS AAGLIHYS FLNPGE PMTTE KYAQ 
EIDDIH*NLQHLQPALVDRXGPIFLHDH 
A* PHAAQPK\LQKLNKLGCRVLPHLPYS 
PDLLPLT 


6614 


20515 


A 


6668 


495 


227 


PAFVKVPPEKDQRQNSTHAGRNHLPME I 
IiLL* KKKLIBTESHSVT\RLECSGVITA 
HCNLKLLGSSDPPTSASDYRLKPWHLAP 
SSFSMS 1 


6615 


20516 


A 


6669 


2 


65 


DGVLLTLSLLARLVL\NSWPQVINLPWP 
PKVLG*QAGLGTPGLK 


6616 


20517 


A 


6670 




794. 


ILIj*P*PPGIiK* /FLPSSWDWRRASTiCT, 
ANFLFFVEMGYPRVARAGLKFIjGSDPSA 
SQSAGVTGVSRPGRFLIRTVL 


6617 


20518 


A 


6671 


406 


162 


RS YAQAGTQWLDFSSLQP* PPGLIKQSS 

HFSFSQSASH/ rhpdnyci pcrnrvyvg 

QASLSIiLGSCDLPASASQSAGITGSCL 


6618 




A 


6677 






VVS ATT iHTSVVVO! A TATPCVfiMnviTT JCZV 
CGCECVW*GCVCTS VN * \HTSTGWSEK 
V*LCYCGCV*LCTA£CMAFVSRQCVC3)P 
VCCLPEKMRLS P I S VA* VCAPMKVS PCP 
PQLSELHIiSLNIYINKCVSLSQNAWLF 
L 


6619 


20520 


A 


6673 


378 


180 


NEVIRSYKVSLFSPRIEKWINK/ I/HWY 
TYPME YDSAJKRNEVLIHATT * R SCLENT 
TLNERIFDIIPFI 


6620 


20521 


A 


6674 


378 


94 


PVKNRRTKELHIKDVCTPMLIVIIiFMIV 
K/S* KQPKCP* IDEWMNKMWYTHMVEYC 
SVSKKEGSPFICTNEDITLNEISHKKTN 
IYNSTYIRYLB 


6621 


20522 


A 


6675 


I 


380 


VCSL * VIjCRHMS CFQNCSL* FSFSFWFF 
WRQCLPLSPRLECGGPMSAHHNLRLPGS 
SDSCASASLVAGIRGACHHARVIFVFW 
EMG\ FHCVGOAGLELLTX * VICPPWPPIC 
VLGLQVLATVPGQPHIF 


6622 


20523 


A 


6676 


424 


135 


SHRWGVKCFKYNIFFFRD\RV*LCCTGW 
S * TPELK* SSCLGLPKCQDYRHETLRVS 
Y* F*NS I * QIGVLAMVSKLYLNS * AQAI 
LLPWPPRVLGIiQI 


6623 


20524 


A 


6677 


408 


91 


HTHFVWPPVLI P* * PMR*VLLLTPNTNF 
FLRDRVS PCHPGWS VEAQSQITAASN\ Y 
GLKQFSHPSVPKCWDYRRELPCMVSLII 
F I YVHLHMYVHARKE KPEYPL 


6624 


20525 


A 


6678 


1231 


217 


PPYFNNPIiTFPQKRFFKGDKTLKKWL* I 
REIILQNRGIG*KTDSIR*MGKTEAPLK 
EKFLLGTALQPSFLPYMLGHFKEE*GBE 
FPGTHSWP FPS FPVWGLGLSWHRGIHG 
PRVSLQTWGPGQEAMECECGNHGTEDTG 
\GAVLQPG*VT*GFHTPTSEPGLVAPPC 
TCSGLKAEA\GFLKLPGKQG\SEGALPP 
* SPLPSACFPEIiGIiRRLGSS PGLM THPfi 

WGLFTSPLGLKILGPRHPAGPALPQIAQ 
RPSI»KFR*RRLGWGPELLPAQTCGAQSL 
GPSTAPLLGQGPWGLG * HAS R WGTLGLA 
VAPGNEVPSLSPR* PRGAGPLHRHRVPV 
RSWG 


6625 


20526 


A 


6679 


1 


178 


RGWD YRRPLP / RPG * CFVFLVEMGFHRV 
GQDGLYLLTIi*SSQSAGITGVSHRARLQ 
PPIF 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GJutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I-lsoteucine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^Glutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 


6626 


20527 


A 


6680 


3 


135 


KSBHLEREPI IEQLSTMFF/ TAKYGWHL 
RGH*HRCQKKPEPPEDR 


6627 


20528 


A 


6681 


425 


107 


KGPPTPS/PPPPSPVFPTEKILVPPPFF 
PPPPFKKPPPQNI FWGPKK/ IKNFSPPR 
GEKNFFF* TPPPKYFLGAKKKKNFSPPR 
GEKNFFF 


6628 


20529 


A 


6682 


424 


284 


CFLQPGP\PGS KGAS PPS FQDNWGIiRQA 
LPPP/RKSFFLFLFLFFPIVQTGFHHVA 
RAGLELLTSSDLPASASQSAEITGVSH* 
A*PFP 


6629 


20530 


A 


6683 


1290 


449 


PLSAPPPGDPSP*KSSPRAPNLPAP*PP 
NLSPWPPKCPRPRR*PASS/PP*LPVPP 
KSPSPPV\PWATEASRGHQPQIT*ELG 
SLVLRMTEKGTWG*GIPDS IMPLPRRRS 
EPRTCRE * GDTRWRWRRRRRSTGAVSTF 
SPGFACGGGSGA/ PAHPGEQKLGASARP 
DE/PVCPAAWVUiPSARWASPAAAPPP 
LPSQPVH*RFKP\SPPY*PP\ALPPLSP 
FEPSAPPHTGSVPY*QGVPP/SEGCPST 
SKPLISVRPPVTINGCLPQSQPPKTHTV 
SRLYEEN 


6630 


20531 


A 


6684 


425 


40 


E WAS LKN * HP P / GANQAVEAVEAPTHCW 
*GCKS IEPLWKTVWQCLDKLNI CLLYDS 
TPE * MSEHVYQAACKKS FTAVLFMI P PN 
*KQPSCPSTGWLGQAWWLTLVIPAPWEA 
KVGGS LE PRSLRP AWATG 


6631 


20532 


A 


6685 


2 


107 


SWLTAASTSQ\VKRSSRLSLPSSGDHRC 
MPANFFFLKDDVSLLPRLISNSWTQAIL 
LSLPTKGLEL*AHCSLNLPGSSDPPASA 
YQ WGTTGACQLI FFF 


6632 


20533 


A 


6686 


12 


224 


TPPGSTHASGAPPCLANIKFNY\L*RIiP 
RLFWNSWPQVILLPWPSKVLrGLQVWATV 
PSPFLI FDRDIDRE I 


6633 


20534 


A 


6687 


404 


2 


TAARGI I YGGP I KKLLPRKPADVK/SGS 
FKTAAPFFFFLGTRTLTEFLFI I IQN* K 
QPKRPST/DRINKLWYISTIEYYSAMKR 
NKLLIHTTAWMNYKGIMVSKRGQSQKSI 
FSRTKTGFGRNHISGCFLYPLPLRIi 


6634 


20535 


A 


6688 


410 


163 


PRGNTPPGFSPPEKENQKVFNPPPGNIV 
*KKGGPPPQGAQKKGPPY*GPPLSFLPP 
KKKKKKGPPPPPPPKKNPPKGNS/SFSP 
KKKKKFFPPPPKKKGF*GGPPLKTLFFW 
GGGKKFFFFFWGKKEFPLGGFFFGGGGG 
GGPFFFFFFG 


6635 


20536 


A 


6689 


205 


405 


KSPPSPGGPPLIGKLEIGGAPKPKNWPP 
PLGPKGENPPLALGGGGGAKNHPPNPQF 
I LLFLGGKNP * KS P PS PGGPP / ANRKIG 
NRRGPKTQKLAPPLGAKGGKPPPCPRGR 
GGGQKPPPQSPVYSPFFGGKKPL 


6636 


20537 


A 


6690 


3 


199 


HGSLQH* PPGFKSSSSLSLPSSWDHGHV 
QPRPAI F/CFFLILEKRGVSPGFPGGPE 
TPGLKGLDLL 


6637 


20538 


A 


6691 


8 


350 


SVCIiGLP/RCWDYRYEPPHLA*LAFLCQ 
IIFHHMDWI YHVS ILWDGYLGCFYFLA 
I INNV\G* AGLELLTSGDPPTSASQSAG 
ITGVSHCTHPYCYTFLTREITTPKEVKI 
GLCW 


6638 


20539 


A 


6692 


373 


197 


LPSSWDYRFMP*HMLFFK\FLFLLFCRD 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine; 
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E/SLTMLSRLVSNSWAQATLPPQSPKVL 
GLQV 


6639 


20540 


A 


6693 


369 


100 


PMLARMVSIS*PRDPPASASQSAGITGV 
SHRARPIYFFK*R*SHAGYSAVPVHRHD 
PTTDKHNSFLFCFLQTGSCFVVQVGMQW 
HDHISL* PRTPGFKQYSHPSLPRH*DYK 
PTPP/HPS * FWQ VGMQWHDH I S L 


6640 


20541 


A 


6694 


12 


182 


FLSSWNYRCVPPHLAN*KKNFFFYTDR/ 
S LAVLPGLVSNS* TQAVLPLQPPKVLGL 
QA 


6641 


20542 


A 


6695 


402 


248 


SSWDYRCAPPCPANFYKIL*TGS/LLC* 
PRFVSNSWDQVILPPWSPKVLRGV 


6642 


20543 


A 


6696 


301 


309 


VQEF * DRILLS PRCHPGWRAVA* SRLTA 
ISASRLKARPSHLSLLSSWDYR*VPPGP 
ANFLHPFCR/DRGFAMLPRLVSNSWTQV 
ICLLRPPSVGITGVGHRARSILLD 


6643 


20544 


A 


6697 


3 


387 


PCLANFF/ VFFVETGTHYAA* CGLRLLG 
SSALPV*TS*SAGI IGMSHCTC/LQITL 
LKTESHS \ VAQAGVQWHDLGS LQ PLP PG 
FM* FFCLS IQSSWDYRRTSPRPANFCI F 
SRDEVLPCWPGWS *TPDLR 


6644 


20545 


A 


6698 


1 


287 


LWFLYQGNAGITGISHRAQPNSFVFFKD 
RVLLCCPGWSAVA\HSSLTVALTSQAQV 
IFLPQPPG*LGNFCIFSRD/RGFAMMPR 
LISNSWAQVIRPP 


6645 


20546 


A 


6699 


412 


3 


FEMESCSVARLECSGTIIAHCNLHLLSS 
SSSPVSASRVAGTTGACCHTWLFFFFFF 
V/FLVETGFYHVAQMGLNNWAKAILPQP 
P * VLGLQARITTSS PNSSFTREHTSGSG 
VGYPAKFHS IVLLLL YLLFHLLCGVF 


6646 


20547 


A 


6700 


393 


88 


QKLYPYLTPYAKVN*QWIKNLSVTAKNI " 
* FLGENTGVNLHS FRF I KGFLDM/TPKS 
ISTKRK*KIKWSSSKL\KNFYALKNNIK 
KMKRQPTKYEKIHLTLNNWV 


6647 


20548 


A 


6701 


494 


239 


KAGSHS \ VSRLECSGM I SAHCTFCLLS * 
NNPLTS AS LAAGTTGVHHHAWL I FVFSV 
QMGFCHVTQAWVL I FDCGFES PINT KYN 
LL 


6648 


20549 


A 


6702 


2 


379 


RPTWVTWQJJPASTKKKKNFFFFKTKSPT 
VPQAGGQGGNLG*RKFLPPGLKRFSGLK 
RPEAPKGQSCSLMGGGLCPLLPP/WSGT 
PPQR 


"6649 


20550 


A 


6703 


220 


47 


TGFSGSQIN*EK/ 1 WLGTVAHTCDPSTL 
GGAGGQ VT * GQEFKT S RHE PLHLTVLCS 
LF 


6650 


20551 


A 


6704 


80 


414 


ICYYFI VFAI YLNQFFSNCYRS FTFFFC 
PFETESRSAAQAGGYNPTLTLLHPPPLG 
IKQFSCLTLPSS \WARS * KPPCPTWGNS 
VFVGIYKNKLGGGGAIRRVQTCNQTYSG 


6651 


20552 


A ■ 


6705 


69 


416 


NVEKRNNIHNIKKTPVYFKGFLLYCCKA 
E VFYVLDNPFI FYFLFYFFETHCVAQAG 
VQWHNLRSLQPSHP\SSSDYRCAPPHLA 
NFCRDGVS PCWPS W/S *APGLKLIHPPW 
SPKVLE 


6652 


20553 


A 


6706 


373 


75 


HAQLTFKFYYFRNTFCKAVASTKSNSSD 
RSE*SKLKTFWKGLTILDAIRNISDSL\ 
EEVNMSTFRGV*KKLTPI IMDDFEGFDF 
TRESNHRGARHGGSRL 
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6653 


20554 


A 


6707 


331 


379 


K*KINKN*KGLTILDAIKNICDSWEEVN 
IST/LTGVWKKL/ITLIDGSEGFKISVE 
EGNAGWEIARELEIEV/EPEYVTKLLQ 
SHKKT 


6654 


20555 


A 


6708 


578 


152 


LQQHNPGSLQPLSP\GFKRFSCLSLPSS 
WDFRHAPPCSANFFVFLVERGYHHVG*A 
GLKLLAS SDS PTS ASQSAG I TGVSHRAQ 
PLIGFKVNSVKKSVSVPTRRILKGNMSM 
YEQL 


6655 


20556 


A 


6709 


392 


146 


FFETESLSPGLECSGTVSAHCNLGSSDS 
PALPS /RLLiGS SDS PAPAWLYSFKIi * I 
Y/ FHKTCNYVLKINILLHVAI pysfsws 
C 


6656 


20557 


A 


6710 


9 


393 


KSKPCN*DKSQSYRL*LFAQLFTF/SNF 
FYFLETF+ESHCVIQAGIHSGTIMAHHT 
IiKLQGSSNPPASAPPSSWDHRCGPPPLA 
TFLKIFIi*RQWAHYVSQARDNGIiIMFPR 
LVFELLGUCRFLLPQPS 


6657 


20558 


A 


6711 


34 


387 


QDFGFLSFFLLFFFEKGLGGPTCRAGPE 
SNLME \GGPPGLKQFSGLTLPRTGNNGR 
ASRSRAKFWN*KGKGGPPGPPVELETFT 
LRGPPHLSSPKGKNNGGGPKPDFFKKVG 
KKAPNP 


6658 


20559 


A 


6712 


1 


188 


IQAWWYTPWPATWEA*AGGIiR\EPRRS 
RLQ*AVIVVIiHSCLGDRVRPCLNNKKGS 
FPPQPLW 


4659 


20560 


A 


6713 


377 


98 


SLGLKQSSHLNLPSSWDYRPVPPCPRNW 
FVPVPFFF\RB*GLAMLPRLVSNS*AQV 
ILPPQPSKVLGLQARTTAPGFFSHLLKT 
IjVPNNVNIiNI 


6660 


20561 


A 


6714 


353 


229 


SSCLGLPKSWDYRHEPPHLAYFYY**RQ 
ALV/MLPRL*Y* FLDVNM*LPLPFHVAP 
SHLLNSVGMV 


6661 


20562 


A 


6715 


371 


64 


ENHGDCTPAWATRAKLRFKKKT/ CYQMF 
ITTLFTIAKRK*LKCPSSDK* INKMWYI 
HMMEY/YAIKRNWTLVKLNQNPYAL*KV 
LLRE*KQNP*SVRNIFKTHI 


6662 


20563 


A 


6716 


462 


232 


HFNLITI KI IPFF/ C*DGGLTMLPRLVP 
NS WAHAILLPQPPK* LGLQARACKL 


6663 


20564 


A 


6717 


425 


48 


GFGGPPFKKARFGFLiHPF*TPPVLRPRP 
PPVFFGGFKAS PPKKKTPK/ PPGGGGPP 
PYFHPFFLSPFFPIFFEKEPPPFFFWRG 
LGPPPPPFKNPFFFFFWVEGGKFLPQKK 
KKKFFQS I STRSVI HK 


6664 


20565 


A 


6718 


3 


413 


GTFLTKTSLOGTLLY*VISYMCV*FIS* 
\SYLVYIIYFKDICV/LLYMYIFYIYTC 
VHISVCMYMYVCMYVCMYKA* PANLKTF 
CTQKHLAWGWVQWLISVISALWEAEARE 
S PEVETNLAQLGPTIjSLPRKKIQCKKKK 


6665 


20566 


A 


6719 


408 


71 


SSWDYRC/ LPPGPANFCIFSRDGVSPYW 
PGWSRTPD*RSLPGGDVMPALVIRHVPE 
LLKKQHSSVRSSRPAWPIW 


6666 


20567 


A 


6720 


375 


57 


SVXRBMQIKTTMKYPFIPTGLIVIKKTE 
NN/RGWQRCREIKACIY*WWGCKMTQLL 
QKTIW*FLKKLNIEL/PYDPAIPPL(3^* 
LRELKTCLPRLGVVAHACNPSTLRG 


6667 


20568 


A 


6721 


391 


164 


RISETCT *MFPAALFIV/DQNNPDKWTN 
LMWS YTHTMDDYSAI KRNEVLLHSSTWM 
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Amino acid sequence (A=Alanine 
C=Cysteine, B=Aspartic Acid, 
Er=GIutamic Acid, ^Phenylalanine, 
G=Giycine, EHHistidine, I=Iso leucine, 
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nucleotide insertion 














\ENLENLMLSKRGQSPKTPFG 


6668 


20569 


A 


6722 


381 


278 


SSLQPQTPGLKQSSCLSLSNS\PPRFID 
♦SISLYRQGLTILLRLLSNSWSQAIVLP 
GPPKALGL*ASKKFLYSNI 


6669 


20570 


A 


6723 


2 


363 


QRSLEHRPCKKC/ CGLW* VS * KVQYLV 
YKESSSFKNLLRRIHCNFLFSGIVPHF* 
KLLRTPRSFCLCGLHI P INFTI F* I KI E 
KFKKC * PGTMAH P YNPS TLGGLGG W IT* 
GQDGLDLLTS 


"6670 


20571 


A 


6724 


352 


58 


HGGAIGARFFKVFNLI*FWLVCKDE/SF 
SLIiARAVINSWPRVILPPWPPK\WGFQ 
AKTPLPRLKGFLESLPSLKKKKNLAYAR 
TITAQ IGLLVCF I DQQ 


6671 


20572 


A 


6725 


383 


54 


KSSPFFFFEQGLLCCPGWSAWSPLQPK 
CPRLKQFSPLS/IiPSN*EYSCTPTTFSL 
HVCVSIHKYI*YIDQYIFKFF/CR/DRT 
LAMLPRLVSNSWSQAILPP*PPKVLGFQ 
V 


"6672 


20573 


A 


6726 


3 


343 


LEV*ARAT*RQE * KINGT* I IKYKKVKQ Y 
SQNDMILYIQNPKDSAKKLPELVNKFSS 
FKVNRQNS WFL*NSNKKS /EKEIKKKL 
/PFTIAS *RIKYLGINLTKEVKNLGRAQ 
WVTA 


6673 


20574 


A 


6727 


3 


615 


HTSQGNL*IQYHTPWHPPSSERVERMNQ 
TLKSHLTKLVLKTRIiSWTKCIiP IALLKV 
RTVPQ KEVGLS PCEMLYRLP YSHFTVD I 
PTFETKSQFVKSYVLGLSSTFSSLKAK/ 
ELFSTDATLGSSPGTSAFSLGDDVLIRS 
WEEGKLKPAWBGPYLVLLTTKTAVQT/D 
NKKMQTHHTPVKKASPSSKSCAI VPRP I 
PTKLKIKKKKA 


"6674 


20575 


A 


6728 


2 


228 


SGAI / SVHCNLHFPG/SSSDSSASAS * V 
AGTTGVHTMA* L I FVFLVETGFHP ISQA 
GLELL/NHMICPPWPPKVLGLQL 


6675 


20576 


A 


6729 


98 


359 


KVEYTVLLF*GFKGKFPTLPLYLSFSSF 
LKNHLISNFFliNRVLLCYPDWSWG*FT 
AHOILNI^CLSLPSTW\DSRCVPTGAWL 
RFI 


6676 


20577 


A 


6730 


10 


428 


RSTRTDYAALREVLPDPRRCRPRTGAWS 
NGRRDRPGAGHSGPKAEAETRETLEKGL 
LFCCPGWSAVA* PRPTATFAS RAQSLTV 
SPRLECGGWSAH*NF\SFHFQRWGLAV 
LRTLVLNSWSLALLLLWPRKVLGLYALA 


6677 


20578 


A 


6731 


425 


158 


SQTHDIiR*STCLGPKCWDHRCAPPRPAL 
FLF/FNKCPMSIRKIENAQ*LPGI*KKN 
FCRDDHLPLLSRLILNSWAQGIL*CRPL 
KVLGL 


~6678 


20579 


A 


6732 


431 


22 


RDRVLLCWPE*SAW*S*LTTVQNSWAQ 
GILSSI*DYSCVP\LPVNF*TFFKSVYY 
YFI I FIFE I YFCNSKALGL/LGVSHRAR 
PPRLVU)JSWPPQVILPSQPPE*LGLEAL 
ATAPAIPRNSNLVWHQYSFKSIPDDL 


"6679 


20580 


A 


6733 


44 


.414 


AIYDLPFLLIDLYPMELRASIRAHTCT* 
RFVATIiFTVTKRIiNQPTCPS TDE *QNKM 
WSL/ YIMNYYPAI KRNEILIHENIMLCK 
RSQSQKAHTE*FHICEISRISKSAETED 
ELWATRWGKGRE 


6680 


20581 


A 


6734 


375 


49 


QRLT YHCRFS YNTASNKMRLS * TQMHQK 
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SACSMCPGRL*GVHAMRPKVLMRLSRSH 
/VARPIAATCAQ*VCDRIK/R/SLLIEE 
QKI WKVLKAQTQS QTAKFKNETF FRNK 
N 


66S1 


20582 


A 


6735 


381 


88 


SLFPRLETPGLRP SFFFFLFSLSH \ PGW 
SVW*SRLTTALNS*ARFSLLSWCVPLH 
QAHFF\ FFCRDR / SLTMLPRLVLKSWAQ 
AIPPPEPLKRSGLQV 


6682 


20583 


A 


6736 


384 ■ 


293 


AHLSLLSGWDCRCMPPCLDNCVCLC/RV 
FVCR*GQAMLPRLV5NSWTQVTLIiPWHP 
KVLGL*AQDQEPLWGRSGPGIiPQAASG 


6683 


20584 


A 


6737 


24 


300 


NILSAFLCLS* P*H*YGLVIL*NVPQFR 
FLPLFFPMQMYTHIPFKIRIRPGWSHA 
YNPSTLVGQGGR\SHEFKTSLDNIARPR 
LYQVQNKIR 


6684 


20585 


A 


6738 


2 


353 


IHCW*KCKITYTLEKTWQFPIQLNIVL 
PYIPA/IPLLGIHTREMKTHIYTK/T/C 
MQMFGWLFI IAKIVFQLVWR* IKL* S I 
HKVDYHSE/ IRNKQLYEAT * LNLKCTVL 
SE*S*TQKTT 


6685 


20586 


A 


6739 


2 


336 


IHATYTNP*HRH\VHTPTHVHT/HVYTH 
NGTHTHTARAWTPAPPTSLILPSPTESP 
PALQIPNLBFGLSSSPHPAGSSQGEPEE 
GGCQRRAKNRVGHRAEKWETEIiCAGDMN 
L 


6686 


20587 


A 


6740 


3 


385 


KKQATLLWI IHLATTFSVE I *QSKIEVH 
DIFKVLKEKKK*TTFYPKITNPGKIAFI 
HEGKI KLY/ 1 VK* QLRDFINSGLVLKAM 
LKRVL* S KRKGH* QTIRNLRVQNSLVTA 
NTQPNTEYCNTVIWGN 


6687 


20588 


A 


6741 


406 


155 


TGSCSVTQA* /CWSAVIVHCSLDLPGSS 
NPPASASWIAGTTGMCYHAQLPSFNLCL 
QFYCMGCVFSRFYLRFTFGLQSYTRKQK 


6688 


20589 


A 


6742 


1 


215 


LPYSPDLSPTD/Y/HFSKHLDNFLKGKR 
FHNQQDAENAF*KFIESQSIDFYAVRIN 
KIiISHWQKRVDCNGSYFD 


6689 


20590 


A 


6743 


376 


109 


YHMEVWNIiYSLKGSSYLRLPSSWDYWCP 
PP *QANF*K\ FFCRDKDLAMLPRWLNS 
»LQAIJ.T,r,RPPKAEPLQLLGQFQFRRES 
GWN 


6690 


20591 


A 


6744 


382 


148 


NIPQFMFDILFQ*LITLSCPASF*SKFP 
IjSSLYFIQ*GR/ WLGTVAHACNPSTLGG 
RGQR\ * A*DQELETSLAHKARPHL 


6691 


20592 


A 


6745 


403 


209 


HIVINRI IRKYCA* HYANKFNNLDKLDT 
FLB\NTTCKNLTQEETENLNSPVSVKE I 
EQGKLCSDRN 


6692 


20593 


A 


6746 


366 


40 


QGSVhS I LNVKI VFWFKTGCVHFLFP LN 
FLSLFQNYVCSHTS/ YPLSL I PHESLWS 
FPPSFNHSS*Y*HFLKTN*MPDAMAHAC 
NPSTLGS*GRWIT*DEEFBTSLANI 


6693 


20594 


A 


6747 


392 


57 


APKKKGGGVYPL YPKKSGRKKKPEG / P P 
PKPKKKLKTPGRKNDAKDIKPGGAKFFF 
SPKKKRGAPPGVPPRLF*KPPQATI*GG 
PKKKKKPPQEGGKKGAPPKAPPLFFFFL 
YF 


6694 


20595 


A 


6748 


376 


140 


LEDNI /GYSNQNAGAGRVS* INTPKIKV 
S\QSKIAKWDLIKPKRLCAAKVSMNRVN 
RQPPEWDRIFALASAD\RGFLNKILG 
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N=Asparagine, P^Proline, 
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nucleotide deletion, possible 
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6695 


20596 


A 


6749 


540 


172 


FFLRQSL/DSVAQAGVQWRHLGSLQAPP 
HRFMPFSCXTLPSS*NCRCLPPCPANVL 
YF*QRWGFTVLARMVS IS * PRDPPASAS 
QSAGITGMSHCAQWNTLSLHAFSCDKV 
PQNDLL/WPSKSFCGTLSHEKACKDNVF 
TTWAQWLMPVI PALWEAEAGGSRGQE IE 
TILANTVKPHLC 


6696 


20597 


A 


6750 


1 


353 


WFLERESTRSENSVNIVBMTT*DLEYYV 
SKAVAGFEKIDSWIKRSFTVGKMLSTSL 
H/CYREI FHERKSQSMCQTSLLSYFKKL 
PQPPQPSASATLIVNIKAKISAKRIKLT 
ECSDDC 


6697 


20598 


A 


6751 


404 


231 


SPS/PPPPYFGGPPGFFPPPVF*TPPPE 
YFBX3APKKKKNFPPPGGKNFFFFKGPPP 


6698 


20599 


A 


6752 


77 


371 


I TCP VLYLTS RNT KTKKYYLDCCCFW P * 
Vn^NRDQTYIiHLFFF/CFIKKNFFFQPKR 
GARGGFCINETPPLGGKGNPPPPPFGRG 
EKRAPPKTLCFFFFF 


6699 


20600 


A 


6753 


354 


169 


ETESHSVTHAGVQWHNLGSLQPPPPGFK 
RFSCLSLPKK/WFSVKLN/ pnt*sfcyh 
ILKALII 


6700 


20601 


A 


6754 


1 


442 


CICRTAFELTGRATRSEKEERKKERMYD 
* KKERKKEKEGILLKKDGQEEEEGRKEE 
RKEGRRQTQDE FGGQNRLS TVKQGS * PK 
KRK*GIQ\QTGAPTQERDKETSQNKNEE 
KARHDS *QPQA*RTQSRLQQEGRKLWKR 
GLVSRRK 


6701 


20602 


A 


6755 


366 


262 


FF*RWGLTMPRKLVLNSWAQVILI*PPPL 
E*LG*QA 


6702 


20603 


A 


6756 


3 


223 


GhKQSS CFCFLSNEDCKHVPP \HRLFFL 
FVCLFCFVLFFW*RYGGLAVLPRL.VLKS 
WP*VIPQPWPPQMLRLQA 


6703 


20604 


A 


6757 


418 


244 


FSLLSSWDHRYMPPRMATF/ SFVLFCLR 
EGCLTRFLRPV* NS WTQAI FPQS PKLLG 
LQV 


6704 


20605 


A 


6758 


46 


256 


KLKNKTEARHS S S ACL* / LPGS WDHR YL 
PPRLANF*TFWRDRVSLLPKGVSNLWAP 
TILLPWPPQVLGLQV 


6705 | 


20606 


A 


6759 


391 


55 


FIFFFFPHSSHGVSPCCPGWSRTP\GSS 
GSPASASQHAGITGRNHHTRLQDIY*LE 
/ TGQTVTLQFS VLADTTLTKRLS WS ILC 
CYNKIPETG 


6706 


20607 


A 


6760 


377 


2 


FGEPPGFSPPPVLKTRPRNLFLGPHKKN 
INFPPP/WELNIiVPLKGPPFFFFFLR*G 
LAMLPRLVSTSWAQVILLPQHPKLLELQ 
VHAMVLSSRTSHSPRSRSLSPRFSQRLL 
PVSCNPAQPSPMA 


6707 


20608 


A 


6761 


2 


259 


LTIA*KNFIFLGL*QGICQIPKDYQTSL 
R* IKDNLNK/WDTPCSWIGGLNIVEMS V 
LSKLMYRFRAIQMKTTSGFPFFFFWKKG 
IFF 


6708 


20609 


A 


6762 


1 


169 


FFETE FRS FA* AG V* WHDLGS LQPQPSG 
\SSDSPASAALFFFFFFLKKKIFFFPPG 


6709 


20610 


A 


6763 


1 


496 


MATCTPKHINTQICVSFLLSLKLSEFEQ 
MS I FKKGENELALGTGNIRP YDSCPGNR 
P/AP FATS PAPRHPSGPSWRP PTSGGVP 
ARAECPASPNPLPPGFCGQSPSRGRKEE 
*LRGESTSTLAIF* *APNRYHSF*LPRQ 
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P *PHGDCPARKKKKKPNDPRLQPTVT 


6710 


20611 


A 


6764 


1 


229 


VLUjCFFFFVL* DRVLVGHPGWSSRLYIi 

ktktqkkqnnifvcrvgg\lttlprlvs 
nswpqmvls pqplnalgiiqv 


6711 


20612 


A 


6765 


580 


273 


WRQS LTHS VAQPGVQWHDLG S LQP P P PG 
FKRFSCLSLPSSWDYRRAPPRPTNF\*Y 
LVQTEFHHVGQAGLELPTSSDLPTSVSQ 
SAGITSVSHHAQLDMIFLR 


6712 


20613 


A 


6766 


220 


239 


AQEFKAAASTTALQTETLCliKKKKKKKG 
GVSLGGVSSNFRTGGGASVTSPSIF/CF 
FFFF * DRVSLSGVQWCSLQP * TPGLKRS 
SSLSLPSSWEYRHVPLHPA 


6713 


20614 


A 


6767 


89 


387 


FLSPSGSQKKHLQIQLRSTFSFFFFFKR 
EPRFGRVGGQCPNLG*WKPPPPG*RQFS 
GLTLT/SNWERWATPPPPTNFWIFKKKG 
NSPWWKRGAKIFGPWDP 


6714 


20615 


A 


6768 


397 


223 


IREVGWINWQHKDPCD/ KWNCS VY* I>* 
* W I HKPTHDKT\HRNTHTHTHTHTGMHS 
HTQK 


6715 


20616 


A 


6769 


3 


367 


SWKVCSKRPIPSPSI*FF*KHPRVLLRAD 
GT* * VSLATL*HMQLS*AGSVKEAEPRS 
NAPPHKATFLFIiFFCTD/ SSLTMLPRLV 
LNTWPQAILPPCPPKVLGLPARANSSSL 
SVFKXiTSFCC 


6716 


20617 


A 


6770 


490 


260 


FFETESCS W*AGVQW \LBLLTSSDPPA 
SASQSAWITGMSHCARRCGTHYHLCKRF 
LVSFVTALLSQLLELGLAHSRC 


6717 


20618 


A 


6771 


418 


130 


PSPRDLNLFPLNGP PLFFFFF * DRVLLC 
CMWSAVASN\FDLK*STRLGLPKCLDY 
RHEPLCPAMWDTIjSSL*EIFGLFCHRPS 
LPAPRTGPGSQ 


6718 


20619 


A 


6772 


311 


314 


FAVGELFT.T.T.HHVGHAGLPFCHDCKFPE 
ASPAMLPY/QPVEL * at*tgi EVLTPGC 
RFQAVFGLKVGFHQGPTPVYLGI 


6719 


20620 


A 


6773 


465 


283 


CPSLPSSWDYRHPPPRPANF/ * YF* YHV 
GQPGLELPTSSAPPASASQSAGITGMSH 
RAWPK 


6720 


20621 


A 


6774 


410 


91 


RDHSKLQPQTFRLKQSSCLSLPSN*DYR 
H^PCIiAGWLAGWIiACQTDRQTDRQTDR 
QTDRQTDR*TDR* IDSRDR/ SLAILPRM 
FSDSWPQVNLLPQ\WPPKVRRLWA 


6721 


20622 


A 


6775 


40? 


253 


L I FAFF \ CRDGGLAI LLRLVSNS * PQAI 
IiPPWPPKVLGLQARAPGVRPDLS 


6722 


20623 


A 


6776 


3 


278 


FFFFSLHLWAPLAFFFFGAISAHYNLPI 
PG\SSDSPASASQVAGTIGACHHVQLF\ 
VFLVEMG FHHVGQAGLELLT\ * VTHLPQ 
PPKALELQI 


6723 


20624 


A 


6777 


79 


442 


RDEDRGLERRCVSACSTPQLGNASTT*G 
CPGVSDPSLHILF/CFFKEN/SCFAPQA 
EGGGHNLG*I*EFGPPGIjKQFSCLTPQRI 
WNNGGAPPPPLYFFFLKKRGGSPLWPGW 
A*TS*PRS 


6724 


20625 


A 


6778 


77 


417 


HRGLHLSFFLF/SFLFFFLKRAAbLWPR 
EQGRGWIWVNCTLSLRGQGISKAIjPS*E 
AGTKGGRPHPTLFLVF* KKTS FSQVDQG 
GIDLWARGNPPPGNLKWFELQKETLCPG 
HIF 


6725 


20626 


A 


6779 


62 


235 


IQELRIREGKCLAYEHTARNGKSQDLHA 
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GLSCLOXCCCCC\CNCWP*STRKFFTK 
PB 


6726 


20627 


A 


6780 


908 


566 


SKMEFRSCCPGWSDLGSPQPL\PLGFKQ 
L\S CLSLP\RAWYYRHAPTHPANF\ * Fh 
VBTGFFHVSQAGLELPTSGDLPTSASQS 
AGIIGVRHHAHPECVFQKQLSEANLRRI 
WNQED 


6727 


20628 


A 


6781 


11 


483 


TTALL KAVRH / G * HLSLQRLLLSF \ VCL 
CPAPRGGAYRGRQAS * SCGGLHPVRASR 
LLCLPNQAWAMAGAPPPTSLPPCSLISD 
CCASNQRDSVGVGPSEPHAGYNLPVCRF 
LSPLEKRSIRVGVTRFSRCRIiSPLSLTR 
KGNSLTPCASRVSSASPCS 


6728 


20629 


A 


6782 


225 


589 


SRRSEPGNFRSCLGVGVTCCMCLSLSLP 
SRSFLRPLFLSLSVCVFVCLGTRVPCAP 
EGGFLARRPFFWSASPRVSAWVLWPVGN 
RFPGGSGLGVCEGLGNVGI\WSEPQGFS 
FPPHPEQPLC * AGSRRALPNQGQR\LPG 
AHRLPAGGCPQSFKKVWTPVALCPHRE 
M * PQLDAGTE KEAGKGMGQACLSLKG W P 
SWPSHPFDTPPRMPVVVAWPPRILPGLW 
PLL* PPAGRNARRKKPKTDRRGRSAGSQ 
PQACADGVRVSLPKSGALVMSRTGLPVC 
PWAALSPVARSRGDQNPATSGAAWGWVL 
PAVCVSMLSLLAIiSFALSFSIiCPSVCS 
CALGHVCFVRRRVGFLHVGLS SGQPLPA 
SLPGCCGRLAIVFPAVPAWGSVKAWATW 
ASASEPQGFS FPPHPEQPLC 


6729 


20630 


A 


6783 


458 


170 


PCI VTASASQSAG ITGVSHRTRLLVNFS 
KVGSVTISLG*EVGLRHWP/NI/GRLDK 
TGPGEKQHFILHAPQSAPSVYHCLGNTQ 
EliPPLSTAMISQPKR 


6730 


20631 


A 


6784 


1 


192 


ffffg* ilplspragvqwhdlgslqpph 
pgfkqfsclslp\rg*dyracatrdgi»i 

FCNFRLGD 


6731 


20632 


A 


6785 


3 


246 


SLCVCQYYCRCACV*VPKQVSG*VTMGI 
/CATGFVLIYLCG*QCLCG/C/C*EDPF 
YCEQLCVHVCSCVLGRGWSVCVCISRLIi 


6732 


20633 


A 


6786 


408 


128 


GPVPYKARG*SPFWPPGEKPSFPKNPKT 
PPGGGGAVLFPPPWRGRPENSPPPGGKR 
C/QPN* I FFLPPHPGHKTQGPPPKKKKK 
KISAEFLSW 


6733 


20634 


A 


6787 


I 


418 


LTILGVMKNICDSWKEVKISAFTGAWKK 
LIPTLTD/EY+GFKASVEEGTAHVLETA 
RELEVEAEAVTEL/LQSHDQT 


6734 


20635 


A 


6788 


3 


237 


FFFLT* VLTIV/PG*SAEVQS *LTAAST 
S*\VK*SSHLSLPSSWDYRRMPPHPANF 
FFFFGKKSLILHPGRGGPSLPTP 


6735 


20636 


A 


6789 


282 


3 


PHPDNF \ RFL V* TGCIiHVGQNGLALLS P 
GYPPTLASPSAGITGVSHCTRPKRYF*M 
VLSNKSLNRFGLCTVLPTCLQAPRRTGP 
PPATVLGRGT 


6736 


20637 


A 


6790 


62 


328 


FFRDRILFCHPRWSAVIQ \SWLTEASKS 
W\IK*SSRDGLPKCWDCRHEPLCLDNTT 
Y*CRQNTLPITLIGCFRVSLPYAKYTGK 
FWGAEG 


6737 


20638 


A 


6791 


67 


353 


TNKHTYFLELNKREKPQLTVFFFF*REL 
RGLIIAtfGNFRLRG*SNSPA* PSKVAGI 
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TGPPHQAGLILVFIiEKTGFNYVG/RRGP 
PPPPPKELALQA 


6738 


20639 


A 


6792 


744 


382 


MTFKKTAHIYFYLYLFIYLRQSL/DSVA 
QAGVQWHNLSSLQAPPPGPMPPSCLSLL 
SSWDYRRPQPRPA/NFFYF* +RRGFTVL 
ARMVSISRPCDPPASASQSAGITGMSPR 
AWPKLHIFKL 


6739 


20640 


A 


6793 


372 


129 


RVLPCHSGWTAWQS*LIAVSTSQA\KR 
SPHLSIRHVPPHPA/NFKQFFCRDGLIM 
LPRSVSNFWAQAILPPQPPKVLGLQA 


6740 


20641 


A 


6794 


395 


25 


QPGLFCFTPRKKPDS FSKVFAVLSF/FP 
PSL* FS/PLSL* VQEMVSICCPGWSAV\ 
TPGLK* SSCXiSLPKGRGYRCAPPHSANL 
KFFL*R*VLFMLPRLGLH\SWVQVVLCQ 
WPPKGPSLRACTSKI 


6741 


20642 


A 


6795 


412 


176 


KKS* /CLATPRSENWVL*NGPPLFFFFL 
*RRDIiAMLPRLVSNSWAQVMIiPPWPPK\ 
VLGLQAGATMDHSCVQYSHSFAPE 


6742 


20643 


A 


6796 


444 


95 


NFVRNCQSFLQSGCTFTjHSYQQFLL/PC 
PCQH* IMSVFQILTTVIGVYMHLV* ICS 
ALMTFDLFMCHVLI CHLHIFFGEVSVQH 
FSLLLFLFESGS I YWQAGLHLLFSSNP 
PASAS 


6743 


20644 


A 


6797 


378 


732 


FFFWLNLGSSQPPP\PAFTR\FSHLGLL 
NSRDYRHLPPRLANF\*FLVETGFRHVG 
QAGLKLLTSGDLPAIiASQTTGITGVTAS 

psldegvfkalpppvgttlclp PWTTGG 

PFLGPAL 


6744 


20645 


A 


6798 


444 


1 


plcfppggggflsshqgfpppppkrkgs 
vgakkknippgpfpffffkknfff*gpy 
pf/sepkkarwgppppgpgfspprslvs 
plgbkkkkkkipppetqpps fga* wgkk 
rapppgpraggrnsfleggrgqgrtggs 
vrefragr 


6745 


20646 


A 


6799 


462 


83 


IEPLRPALCLS*SNPASI*KPHFLLFF/ 
CLPFLSFFPS/SFPFSFPFLFPFPFPSF 
FLIPFLSYSLCCSGWSTVA* *GLTIAS\ 
TYGLKQSSHL/SLPASWFPCLTN* KKI F 
FRDGGLTMLIRLAYLFLNF 


6746 


20647 


A 


6800 


433 


185 


PPPGGRLLVEKTRGEGQLKPGQPGRQAG 
LGMRGNPGKAQPRPASGAPEMQLPARPP 
SGGHLYGCGAKNNEGGANPLLCGGTGSG 
SVAAAGEVSKSAPDSGLMGNSMLVKKE/ 
AGGGGEPQANQETEN*EG*PMSPESGAL 
LLTSPAAATLPLPVPPQRRGFAPPSLFL 
APQP 


6747 


20648 


A 


6801 


395 


60 


IEFLLFLPLSVDGRLWSHQLLPVMNEAT 
/MSIP*HAFAWTC/SSFLRGGCLGA*ST 
IiLYGGYMFSYLRNKQTIFQSSCCIVNGE 
QPRMRNRLFS*FIQHLVFSVFIiIRATLE 
GI 


6748 


20649 


A 


6802 


423 


98 


RGVPY* PGGLQPPPV/REFFFFFIGRVS 
VLQAGAAAPTCRAQSILPGSWDYRHLHA 
NMSS * FL YFV+RQVFSMLPRLVSNSWDQ 
AIHPSQPHPALFFTCWSVKPLATL 


6749 


20650 


A 


6803 


396 


194 


SLMKLAHV*VLHTDGGFMHPDSAISCHD 
IFDSLHLTGEGSAKICRALYELIMQLME 
EHP\GEKQTTIA 
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6750 


20651 


A 


6804 


373 


192 


PALNPWAQANLPHQTPALSSNFFYF/CV 
EMGLTVLPRLV* NS WPEAI L I LLPWHPK 
VLGLQA 


6751 


20652 


A 


6805 


370 


242 


KKGTSKFLFFPSYSFFSKFWYQKKVKIA" 

FFFF*DRV5LarPEWSAVW*S*LSAALT 

/CPGLQGSWGYRHVLPCLANFS\FFC*D 

GVSMLSRLENWPFCLF*PVFQPGQHGNS 

ISTKK 


6752 


20653 


A 


6806 


464 


182 


DGPHRFEPPISS *FLPPWPPQKLEIKG* 
APLAKQI FFFFFFL* RDXVLLCC *GWGT 
VYSVYSWVSSRLTAAS\TAVLKQSSHLG 
LLSSWDYKHHH 


6753 


20654 


A 


6807 


3 


228 


YTCAGFFRRQSLCHLGWGAVARSQLTVA 
LTL/VPPGUt*SSNLGLPKCWDYRCEPP 
HLACVNFNTVKDVCFSCILS 


6754 


20655 


A 


6808 


506 


173 


NLHLPGLKQFSCLSPPPRSRDYRRAPPC 
TANF\ + FhVEMGS RHVAQAGLBLLNSGD 
SPASASQSAGTTGVSHHGQISLFKQILT 
STRISEVIFLTLLLRILHDLPCLKLFSL 


6755 


20656 


A 


6809 


676 


397 


VSSCPRDQPGHHGETLSVLKIQKLARHG 
GGCL*SQLLRSLR*QTRLNSGVGDHSEP 
RLSHCAP / AWATEQVS VSRKCKKRQLHQ 
KERIPEWVRS 


6756 


20657 


A 


6810 


367 


31 


QVFQELYSQDLFIYIiETLSLCHPG*SAV 
ARS*WTAASDSW\VKQSSRLSLPSSWD 
YR* VPPYSAHFDVNSSFLTYCTS PGLTQ 
KIEDSWLGTVAHACNLSTLGGHSGQIT 


6757 


20658 


A 


6811 


469 


251 


LLSSWK/YQVPPPHLVFIKIL/CRIGRE 
TASCCI * PRLWNSWPQVILRPQPPKML 
GIiLSATMPGPISVFCFCFCF 


6758 


20659 


A 


6812 


16 


462 


EIISYCGFNbHFSNDYKKKERKRERERK 
KRKKENIGE/HI *DIGVGSDILDLIPKA 
WTTKLNIDKWYHIKL/QKLLSS*RVAKT 
NKLKSFCTAEEI ITRVKRQPMEWEKTFA 
SQTSDKGLISNTYKELKQLNSTETNKQK 
NNFDLLSADKG 


6759 


20660 


A 


6813 


492 


243 


SSWDTGCTPSRPD\I FVFLVETGFPMLA 
RMVLIS * PRDPPAS ASQSTIiVDTS RYKV 
YQNI HQQVNMF * NYGWCFFD I FTASVGD 


6760 


20661 


A 


6814 


32 


308 


EYTLPHPYLITGPPPVPKPETYSHS * YF 
LSPNTQCIQ\PAPRTLNTASSTCIiSRRV 
TARPVPIYQDPSLEPLQLPPNSLTLPTP 
IiRHSPCW 


6761 


20662 


A 


6815 


97 


274 


GRFLELKVKRGS LGLGMWHAYNLS TLG 
G*GGRIP\QGFKASLGNIAKSCLYKNKK 
GAA 


6762 


20663 


A 


6816 


74 


376 


DFFIVYSVFTHLVNIKDVKDVLYKSLKY 
FFAIYRMGENIYKIiYILGGINIQNISME 
YYTA/IKKD* ILLFAATWM/RI»EDI ILS 
EISHEQKS\KRCMFSLICGS 


6763 


20664 


A 


6817 


1 


1127 


MRVTAPRTVLLLLSGALALTETWAGSHS 
MR YFYTAMS RPGRGE PRFIAVGYVDDTQ 
FVRFDSDAASPRMAPRAPWIEQEGPEYW 
DRETRNMK\ ASAQTYRENLR I ALRYYNQ 
SEAGSHTLQRMYGCDVGPDGRLLRGYHQ 
DAYDGKDYIALNEDLSSWTAADTAAQIT 
QRKWEAAREAEQLRAYLEGLCVEWLRRY 
LENGKETLQRA\DPPKTHVTHHPISDHE 
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ATLRCWALGFYPAEITLT \ WQRDGEDQT 
QDTELVETRPAGDRTPQKWAAVVVPSGE 
EQRYTCHVQHEGLPKPLTPERWEAILPS 
PTSPIVGHCAWAGAVLASWWSSGAWAT 
VDV*EGRSSGGKGG\SYSQGWRPSRFVP 
QGFWMCLSTALKKP 


6764 


20665 


A 


6818 


34 


382 


STAGAIHVCMCMCVHVSTCVCAKMCVRV 
KF*IWDIETGAAQLSCCVLVAQTEIIYL 
QERNSSLSERECVCAK\CVCMCMCVHVS 
TCVCVCVCV*V*KRHNWPGMVAHVCNSS 
TLGG 


6765 


20666 


A 


6819 


1061 


305 


FFLRWSL/DSVAQAGV1QWRDLGSLQAPP 
RGFTPFSCLSLPSSWDYRRPLPRPANFF 
YF* *RRGFTMIARMVSIS *PRDLPASAS 
QSAGITGVSHRARRRNTILY+HLNFVIK 
SHKPPDIVFRDL*GPRI.HTjENPVL*GAK 
SHSSCPWLNLSPPPPHAGSHCLSSSSPR 
MGARQDLLFNASSHTGLFPFLQIiNQTAS 
RPTSQVPKSLCFPIiPWVQPFALPWVRQD 
EGRASVNSPQDPPRVPSCAGFTHSCNLS 


6766 


20667 


A 


6820 


610 


248 


ERRSHPVA*AWKWCHLGSLQPLPP\GS 
SDPP\T+AS*VAGTTGVRHHARLIFVFF 
IETVSHHVAQPTGF*TPGAQAIRLPQPP 
ENARITDVKPLCLAHFCTSKHFIMVPAP 
CISHQKVFH 


6767 


20668 


A 


6821 


3636 


3332 


TCLGSHATPHCGHSFSSCKILLF* FFVF 
EM*SHPVAQAGVQCRDLRSLPPPPL* FK 
RFSCLSLPSSWDSRRPPPRLANF\CIFS 
RDGILSCWPGWSRTPDLR 


6768 


20669 


A 


6822 


563 


146 


APGLSFNCYIPPAPAFMPGELTLFEVD/ 

NRSIFPTEY/TIHILISSBDHiHS*TIP 

LLGLKTDAIPRCLNQTTLTTGRPGLYYG 

QCSEICES/N/HSFIPIVLELILLKYFK 

T*STSTLKYHCKAIQHLPFKLKIDGVYT ' 

SLQ 


6769 


20670 


A 


6823 


399 


201 


KAEPPPAKSWSKGGYVIiPRS / IRLDHVK 
WALE PDD IAVLNFMKERHPQS KS * TI LQ 
KWIQPFSPIVQK 


6770 


20671 


A 


6824 


323 


323 


I.PTVIVI ILSFYNFFFFSMRLIRTLSFI 
CMYVCICIYVCIYVCIYVCVYIIYVCIY 
/lYINISF*NDRISFFFMIiE*YVCVYIH 
EWIKK1 


6771 


20672 


A 


6825 


396 


122 


LPPGGHRPAVPGGREMGPSRGNPGNPGR 
F*KKGPPSPGGGPAPGGPPSPEG*ARKM 
G*PRGPSLP*TQIPPPGSPP/WGPPGVT 
PETLVGFKKRGHPPRGGGPPPGVPLPRR 
AKPEKWVNPGGQAFHEPKSPPRGPPRGA 
KPETLSPKJOCKKV 


6772 


20673 


A 


6826 


414 


64 


RGVEGGAACIFKGLSVAGKPG/YLSAMR 
SCFRRMLLFKS *G/QVIATKHMLPTVIiR 
MVGDPYANVRFNVAKSLQKIGPILDNST 
WQSEVKP ILEKLTQAQDVDVKCFAQEAL 
TVI*SLA 


6773 


20674 


A 


6827 


2 


519 


KTKTCLVEE IS* ITGRFLERINKIDKPL 
VNL*RKK\VQVLNIRNEKVDVTTDRT*K 
\ 1 1 KEHYGQLYANKFN/ IDEMDKFLKRD 
KLAKLTQEEIENLNI PVSI * KFLI *NI P 
KMKTP/GPDGISREL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
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sequence 


M 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=*Alanine 
OCysteine, D»Aspartic Acid, 
E=Glutaraic Acid, F=Phenylalanine, 
G=Glycine, H=Htstidine, I=Iso!eucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine 9 R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \~possibIe 
nucleotide insertion 


6774 


20675 


A 


6828 


413 


114 


GG KRGPGP I PHP P ARG PQ * G PGHPAVQV 
PEPPGLSFPHFRGGFPSSGGCQRGEGRK 
ERRPSWTGPSWMAQEGRTPAARLEGAVL 
PG/APGPEAPGTGTTMSAPTWPEPP /G / 
GRQ VQD VP AF P EGS ARQAR * HE E * LHWK 
BILL 


6775 


20676 


A 


6829 


479 


261 


PCFS FLSNEDCRHAPP / RPV * F /LVERG 
FRHVARAGLEFLT*SGLPASASQGAGIA 
GVS HCAWP LQVG IE AS LQY 


6776 


20677 


A 


6830 


492 


313 


HPIGWPVNWGRASAVGEFRDWKLFYPEC 
E I RTMGGREHGQS PGAQRTFFQLLLSFX 
VESK 


6777 


20678 


A 


6831 


378 


1 


SNGICLQASSASPNKIVPSTSLKVCSN\ 
VRVDGSRNPSVAFPPLTPITLFAVAPSIS 
SGMGI ETI P I QG YRVDE KTKKCYI PF\V 
*ANRHSPSGVYNINVHALVSGPPIiSDDS 
GVNKPQMKHQHCSA 


6778 


20679 


A 


6832 


374 


76 


QLVEKWTONVKSMTDRVLTMRCELRARIi 
*ALRTPGTWNLINDQIGMF\KFSGLS PK 
GVECLVNEKHIYLMPSGRINVSGLTAKT 
LGYVAASIQEAVTI IQ 


6779 


20680 


A 


6833 


369 


56 


LKYLGTSVENRYADQAKWLSPVISALW* 
FE / CGWIARS EDRDHPG * QSE I PS/ PAN 
FLFLVEMGFYHVDOAGLDLMTSGNPPAP 
NTGMSHOTRPGRTDVIAIItCSDP 


6780 


20681 


A 


6834 


457 


142 


DCLALS SRLDLQWCDLGSLQAP PPGFKQ 
FLCLSLPSSWDYRCLPPRPANF\*FLVE 
TGFHHVGQAGLKLIiTSSDLPASASQSAG 
TTYJTiSHRAWPDSS FEGSSS <5 


6781 


20682 


A 


6835 


407 


35 


TAPAIRGVHLRGRGWTVYGAV*AVPPAS 
IPAYPGWYQYGFYGADLYG \DMQHSDM 
RGLMVSQHSPMPHT\LQQS\SVTVMAGC 
TOPSHTMPLPLPLVY\ elalwrvyarva 
TSYLPPAEVMLAPCK 


6782 


20683 


A 


6836 


1 


428 


GKTIiNPPSLGKS FGLIGLPPENDWPPDV 
SVPRAAFPLRGPRSVQSWREMEQSVAQ 
LLL*MLTFTPHKRISAFRAL\QHSYLHK 
DEGDPE 


6783 


20684 


A 


6837 


3 


612 


FKGSGVFNRKKKLEEHPLWQDVALGTAL 
CPQLP SLPGWGVI EWPRRAACLPCNPQL 
CPHCCGPQEAQYPL*AGGQ*GQHQEKPG 
A\GPHPPSGPQVPSAPGEAMTAMEGRKI, 
SPAVAQNPPQRKNTIiPRGPLPRGAPTTT 
P*SLRPR*TPVGAGLAAPIRSHHPGCSG 
QEALALC PDRPHKDRGVRRGAGGPRGRR 
VQLGGIPP 


6784 


20685 


A 


6838 


3 


412 


YRWGFTMLTRLVLSS* PR/ CDPPASASQ 
SVGITGVNHRVRLFFFFFGTGPHS\MAQ 
AEGQGHNLG*LQPQPPGAKGTLPRTGNF 
K33GPHTPPNF/ CI FFGKNGVS PYCPGWD 
FGAKAILPPGPPKELELEGSPPAPGKVF 


6785 


20686 


A 


6839 


419 


26 


EETS RLWSIGQCPLLIQTAQVLS FTSGH 
GRHREHRKPTDSPLPSACPAP/RLTPPP 
SMCPASHSLPSSQCMPGPTLTPPTACAR 
PDTHSPPPSVCPAPLP/CRPVCPPPRSL 
PTPA*TKPRTQLPSSMYCGRSS 


6786 


20687 


A 


6840 


365 


66 


LLERQIGRSVYGPSSKSSIFDVPSLPHR 
GIKLVFCLCHTSSWDTGVHYAPPA*SWH 
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SEQI0 
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nucleotide 
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NO: of 
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sequence 


M 
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09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D==Aspartic Acid, 
EXvlutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TWHPRPRQVGQ/ AGFELLTS CHPSPLT 
SQTARIAGVSHRARPKFM 


6787 


20688 


A 


6841 


445 


188 


GFQ*FSCFGLLGGWDTRHVPPRIiADFCI 
F/M*RWGFAMSARI,SSNS*SAPSASQSA 
GITGMSHCARPATLNSPICGTSLPLGSF 
FF 


6788 


20689 


A 


6842 


3 


412 


PQQS PQLTTVTAPRPRALLMGTTMANLQ 
TRVSTSHLQGPRLTSALTYQMAQVAVAT 
PPWSVLAVWLSARVTVLP/LN/VNGI 
SWIRQLQKAAGRQPPSPHQTRNNSLE* 
VASNLSKFLILEVAEIiFCHVLTFFSYQ 


6789 


20690 


A 


6843 


113 


410 


NHKCNVNFYNGPRSLEKRLITLPILHYF 
LNFILCKSSYLTLLNLKLQTOLFCK/YB 
M/ESHFVTQAGVQWHDLGSLHPKQSSYL 
SLLSS *DYRGMPPHLAN 


6790 


20691 


A 


6844 


2 


357 


FCPLARILQSGRFAVLPWNSGIiVRGRRQ 
EGPHGAFTREHT / SLRAPQPRSENERR/ 
RIACALRAAHQASGWRLTPRAPRSPSGC 
HCPPAE PTSR* EAWGARRKGHGCCPAPH 
QPAPQDIS 


o/yi 






Dot j 




97Q 


HFFFPC30)/ KSLAMLTRLILNS WA* VIL 
LPQPSKVLGIiQV 


6792 


20693 


A 


6846 


3 


275 


VNIjYDKAAAGFERMDSNF \etsstvgkm 
LSSSIARYREIFYERKSPSMQQTLFLSH 

FRKLP*PF* platttli y*qpstsrqdp 

LPAKRAR 


0/7 J 






OB**-/ 


i 




FFFFBTQFHSLPRLECSGAISAHCNFCL 
LGSSSSPASASQWGITGMHHHAWI IF/ 
VLFLVETGFHHVGHAGFELLTTCGPPTL 
ASIMCILVYLLVFQHLVSLKNLKGONAS 
FYFSPFGQHLSGFVKCSIWCCGSLSGLB 
GHFLPPHSPPLH 


6794 


20695 


A 


6848 


13 


148 


GSAWRHTPAG\ELLKKLRHRYGLNPEGG 
GCTEP I WCYRTPPWTTD 


6795 


20696 


A 


6850 


204 


30 


T I FSRQVIoRI QNVLSE KLRVS TVYANNG 
S \ VLQGTLWASVYHGKI LIGTVFHKTL Y 
CEL 


6796 


20697 


A 


6851 


2 


429 


EVWASGITGKAIMAMPIVIQPKNPSSYP 
CRRQFPLQLEAKEGFQLLTEKFQKHGLIi 

I PYN/TP ilpvkksne kyrlvqdlri IN 

EAWPIHPMIPNSYVIIiPQPPDAQWFPV 
LDLKDSFFCISP/VDPSSQFLFAFE 


6797 


20698 


A 


6852 


3 


160 


sgwdhrrhtqlifmysflffcrdr/slp 

MLPKLVLNSWAQVILLLQPPKVLGL 


6798 


20699 


A 


6853 


2 


243 


ACMILAAATIILRW\PLRYI ILIWGIN 
KFTKKLRNPYSIDNNELLDFLSRVPSDV 
QKVQYAELKLCSSHS PLRKKRSAL 


6799 


90700 


A 


6854 


24 


458 


SRAARRAAGTTRRSSHTGCRPRPADPGA 
RCLPRGSPHPRMDPPESP/PSETSPGPP 
PMGPPPPSSKAPRSPPVGSGPASGVEPT 
SFPVESBAVMEDVLRPLEQALEDCRGHT 
RKQVCDD I S RRLALLQEQWAGGKLS I PC 
KEENGS | 


6800 


20701 


A 


6855 


3 


183 


CCDPPVSASQRAGITSVTY/Q/WPSLGS 
LQPPSPEFKRFSCLSLLSSWDHRHPPLR 
PACKVFC 


6801 


20702 


A 


6856 


2 


242 


GNHPYARKLA\AQRRASTVSSVTQVEVD 
ENAYRCGSGMQMAKDSKSLKTHQTQPGI 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny (alanine, 
G=Glycine, H-Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Axginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y»Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














RRARDKAL\LVANGKRLWCIEGSVC 


6802 


20703 


A 


6857 


54 


401 


EGERLQKFMHVLKTHTHTH/ AHTHBRAH 
THTHTRTHTPRSGMHRSNNRNLAKNLFK 
IFFFLSSSPPSLFFIillFPFLSLSLSSL 
SLSLYKHTPLCILSLRTEHLHRMGALLG 
GHILE 


6803 


20704 


A 


6858 


1374 


579 


CENPSRSTRPGQGPLLLPHQLPFH/RPA 
PSQSSPPEQPQSMEMRSVLRKAGSPRKA 
RRARLNPLVLLLDAALTGELEWQQAVK 
EMNDPSQPNEEGITALHNAI CGANYS I V 
DFLITAGANVNSPDSHC5WTPT,HrAA<?rw 
DTVICMALVQHGAAIFATTLSDGATAFE 
KCDPYREGYADCATYLADVEQSMGLMNS 
GAVYALWDYSAEFGDELSFREGESVTVL 
RRDGPEETDWWWAALHGQEGYVPRNYFG 
LFPRVKPQRSKV 


6804 


20705 


A 


6859 


36 


269 


lALGSMDVLTIIiMFNEHSKNEIKRTIPL 
KIATR/ IKYLGINLT\RGKKDLDTENYT 
TlilEEIEDDTNKWKDIPCSWTGV 


6805 


20706 


A 


6860 


47 


888 


TLRARALQARPRTGSSCTAATWTS/SGA 
SQHSLRALSWRRLYLSRAKLKASSRTSA 
LI^GFAMVAMVEVQLESDHEYPPGLIiVA 
FSACTTVLVAVHLFAIMVSTCLLPHIEA 
VSNIHNLN3 VHQS PHQRLHRYVELAWGF 

STAT ^TFTiFTAKVVTiVT^MVTrFVP Trta PT . 

DTPTPMVPTSRVPGTLAPYATS LS PASN 
LPRSSASAAPSQAEPACPPRQACGGGGA 
HGPGWQAAMASTAI MVPVGLVFVAFALH 
FYRSLVAHKTDRYKQELEELNRLQGELQ 
'AV 


6806 


20707 


A 


6861 


3 


391 


NKISFFCRDWGLPMLPRIjVIjNSWAQVIL 
/IiISTCQPPKVLGIjQA 


6807 


20708 


A 


6862 


322 


175 


GCKCVPPPRASFKIFGRV/RGLDIIiPKL 
VSNSWPQAILPPWPLESLVLQA 


6808 


20709 


A 


6863 


700 


294 


YVLRQGLTLLPRL/ECSSAI iahcslkl 
LGLRNPPTSASRVAGTIGACHHTRVIFI 
VFVVIESCLLAQAGPKLLGSSDPPALAS 
QSAX3IAGISHHAWPPYYFYWVLSSTIPS 
VLSIjHSHLIIiATILRGGDVTSQTDK 


6809 


20710 


A 


6864 


371 


77 


RSMFAN\NLVYDTSDSDDYHLLKVLEEG 
IQTLMGRLEVGSRRTGQILKQTYSKFDT 
NSI^HVALLKNYGLLYCFRKDMDKVETF 
LRMVQCRSVBGSCGF 


6810 


20711 


A 


6865 


273 


34 


0YLPT YLKLFRLLFF / CLI I ETESHYYA 
QAILELLSSSNSPTSASRSAGITGISHH 
TQPNFSNNTLLKPYLRLSYALLFA 


6811 


20712 


A 


6866 


1 


190 


GAKGMHHYTWLVRYF/ CCVEMRSHYVTW 
VGLE/PPALKHSSCLSLPKCRDYRHEPP 
HIiALFCFSF 


6812 


20713 


A 


6867 


506 


293 


EKPSNGHKKPYTWISIAIIiFI IAKTWKQ 
S \ PRCPSAGEWINC/W YTQTMEYYSMLK 
RNELSNCEKT 


6813 


20714 


A 


6868 


2 


116 


LFPFCRDR/ SLTMLPRLISNSRAQGILL 
PWPPKLLGLQT 


6814 


20715 


A 


6869 


10 


141 


GSAWLFFI FCR/DRGLALLPQLVSNPWL 
QAILLLWPPRVLQLQA 


6815 


20716 


A 


6870 


2 


603 


APTFINFPAKGKPKRGDTYHLQVRGFSA 
EQ IARWIADRTDVN I RVIRP PNYAGPLM 
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Predict- 
ed end 
nucle- 
otide 
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last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine ! 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K^Lysine, L=Leucine, MNVfethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LGLLLAVIGGLVYL.RRSNMEFLFNKTGW 
AFAALCF\VIAMTSGQMWDHIRG\PPYG 
PKDPHKGHVNYIHGSSQAQFVAETHIVL 
LFNGGVTIX3MVLLCEAATSDMDIGKRKI 
MCVAGIGI^VVIiFFSWMIiSIFRSKYHGYP 
YSFLMS 


6816 


20717 


A 


6871 


375 


1 


GFPGFPVPKFGIPAPVAPQNPFPPPSF/ 
PQGPPPSPGGPNPNGKNPFFPKTPFFLK 
NSPQKKKIFLKKFKRGPP\GPNQKPPFIC 
DFKKKRGLKPPKKGGPLKKKKKKKKRSN 
YSLKKIYSPFGFHFF 


6817 


20718 


A 


6872 


255 


23 


GRVDPTSTSQNAGITGVSHR/ GVQWRDL 
GSL/ QPLtSVGFKRFSChShSSSRDYRLV 
/PFCI FSRDRILPFWPGWSQTPHLK 


6818 


20719 


A 


6873 


5 


157 


YLLFTSYTHAHTHTHTHTHTQFLLLWTS 
GLS\SCCYIFSLSGPNTKCNMIF 


6819 


20720 


A 


6874 


351 


132 


SPRRCSAYGVA/ 1 YLFLYFLINIiLSLYS 
IFYGFAPNSFLCEVQEPSLG\SGWBPI*S 
GNHFS S I KMKIAHREVNM 


6820 


20721 


A 


6875 


99 


337 


AGSFFQKKGKKKISRVNRQPIQLEKIFM 
NCASDKGLVPRIYKELESAR\KYQSIPS 
KVG 


6821 


20722 


A 


6876 


69 


306 


YVSHHLSFFF/ CFFGGGEKALFFCPRTG 
IKWPQFGLLEPS PSGLKQFFGLNPPETL 
EYRVFFPPPGKFFFFFFFFFFFFF 


6822 


20723 


A 


6877 


335 


169 


CmDLGLDQPPPPG\SSDSCASASQVAGI 
TGMHHS FSNF I YLFCY FB KVFLTFTLD 


6823 


20724 


A 


6878 


2 


237 


ELGYKVLPHPP YS PDLS PTDYHFLKHLN 
NFL \ FHNQQCAENAFQVF I ESQSTDLYA 
TGINKLI SHWQKCVDSNCSYLN 


6824 


20725 


A 


6879 


2 


199 


RGRVGRQFQKE I EELKKKLEEGEE / VAP 
SSSSSSSDPLISEPDISGSEEDDDEEGE 
VGEDGEKKKKK 


6825 


20726 


A 


6880 


1 


206 


FFFFETBSCYVAQAGLELLGLGDPPASA 
SQWGTTGARHHAQLI F/VFVFLVEMGF 
HHVSQDGFDLLTS 


6826 


20727 


A 


6881 


1 


123 


WYI HTME YYSALKRRKFLS FATTWMNLE 
\EIMLSKISRRRG 


6827 


20728 


A 


6882 


15 


105 


SQLL / WRLRQKNRLNRGGGGCS FPLWCH 
CTL 


6828 


20729 


A 


6883 


223 


1 


AASTFLFPNLKNSLRG\SLRTFSSVTNV 
RKTALTWLNSQDIQFFFSRRSLTLVARL 
ECWRSWPAASRRLGCL 


6829 


20730 


A 


6884 


193 


3 


QGEQDFLPPCLPNFFNF \ FFCRDTVSML 
PRLISNSWPQAVILPEPHKVLELQARAT 
TPSHVFIF 


6830 


20731 


A 


6885 


2 


355 


RLLTSSDLPASASQSAGVTGMSHRARPR 
NF/CIIiPNLSPCNTHSPLSLPQPLAPTI 
LLSLWICL/SLDIS/WDYRHMPPRLAKV 
FFFGDGVSQ\YRPGWSAWPFAASTSGV 
EAILPPQPPK 


6831 


20732 


A 


6886 


1 


151 


HSVAQAGVQWWDHGS LRPGTPGLKQ/ FL 
PPLKQSSCLSLLSSWDHRCATLY 


6832 


20733 


A 


6887 


314 


203 


FLFFCGDKVIiLCPXRLVSNSWTQVILQP 
WPPKVLGLQE 


6833 


20734 


A 


6888 


334 


86 


QNRSTFKRGM/MGWAWWLMFVIPATWEA 
EVAAKIAPLHYSLGDRARLSKGGMKCTS 
IFCSKSAKDCQTLPVKKKISLPTLPSI 
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Ammo acid sequence (A~A!anine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F»Phenylatanine, 
OGIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, JVNMethionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=*Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


6834 


20735 


A 


6889 


345 


152 


QHARIMSVFI VETGFHHIGQSGLELMT / 
S/GNMHASASQSARITGVSPHARPSLPY 
YKCS KKQLCNE 


6835 


20736 


A 


6890 


2 


169 


ARECSGAITAHCSLDLMGSSDTFTSASQ 
VAGTVGVTIMPRYI FK\FFVEMESHCIA 


6836 


20737 


A 


6891 


361 


184 


NQSSCFSLLSSWEHRLMPPHLANF/SFF 
DRDE/SLTVLSRLVSNSWTQAILPPRIF 
FENS 


6837 


20738 


A 


6892 


3 


330 


HESAAITGVSHCTQTFFFF / CETGPRFV 
PQVKGQGHDLS SGEFGPPRLRNWPGLTL 
QNAGNTGTL PPHRTNFGD FRRG S FS P CG 
PGGFGTRDLGGFPHLTGKRDTGLDPP 


6838 


20739 


A 


6893 


2 


348 


ARAFFFWNFFCK/NRGLSMIiPRLALNSG 
AQGILPPWPPKKLGLRALPTWPKTGSSG 
LEGHSSLYPHHSL I FPHHCSKFCFIKQW 
RLFILRITENQGLTPLNAAKPAEPVMNE 
ARTQA 


6839 


20740 


A 


6894 


340 


240 


C3iD/RSLRMLIjTLVIaNSWIQAILLSWPP 
KVLGLQA 


6840 


20741 


A 


6895 


155 


1 


HLCFWAG \ PVAHS CHPGTLGGRGGR I TR 
GQE FETS LLGPGTQDVLAKCSRA 


6841 


20742 


A 


6896 


454 


294 


PSSWDYRHA/ PPMHNWPDFFCRDE/SL 
PML PRLVWNFWAQL I LLPWT PKALQLQA 


6842 


20743 


A 


6897 


49 


333 


I Y I CLS FYQS 1 I YL YDKQHD VLETVSHS 
VSLDLPGLRFSHLS LWDHRL ILPWLiASS 
NNFF\CRNRGLLVLPRLVSNSWVQAILP 
PWLPIVLGLPA 


6843 


20744 


A 


6898 


434 


324 


FFFFCRD/RSFAVLPRLVLHSWTQAIRP 
PWPLKVLGLQA 


6844 


20745 


A 


6899 


127 


365 


LPERRRYVS ISGYRYHLPQSLLFFYAQH 
ATFNNELW\LGVIAHACNPSTLGGRDGR 
MALGQEFEIGLDS I VRLHLCNKIK 


6845 


20746 


A 


6900 


208 


1 


FFFFFWILVETRFHHVA/RAGLELLSAG 
NPPTSPSQSARITGIWFFVWFSSNLSQF 
C YAHS F FTCWYSRA 


6846 


20747 


A 


6901 


1 


162 


GTSGTRLFTIGKRWQQPKCPPMAEIjINK 
MW\YTMEHYSVLKRKEIFCTWLTQIiL 


6847 


20748 


A 


6902 


120 


313 


DKVL/WRLRQENPLNPGGGGCSEPRSHH 
CTPAWATRAKLYLKKKKNFKGTLSNWYS 
FRRVANTNIjQMQSGSNYMQI 


6848 


20749 


A | 


6903 


340 


234 


I FFCRER\S ILPRLVSTTWPQAILLPRP" 
PKVLGLQV 


6849 


20750 


A 


6904 


1 


327 


GTSGTSRI LPWGPGC IRTPELKQSAHLS 
LPKCWDCWNYRHEPPVYSFAMLI IKLFF 
ELHLSVILWFLLIiANPQTQTESKTFSKT 
/ PRLVLNS HPQAVXi PWS LKALAS QA 


6850 


20751 


A 


6905 


532 


359 


rdg/dlvwwprmvsmpgfkqsshlghpk" 
cwdyrceplcqdfsysiitwfppsrel 

RR 


6851 


20752 


A 


6906 


270 


3 


GGTTVYPRKTKT/WPGAVAHACNPNILG 
GRGGWITRSGLPVSTKNTKISRAWCCAA 
WLATPEAEQENRLKPGDGDRDHHGQHG 
ETPSLSC 




20753 


A 


6907 


1 


312 


GTSWKKMLPTLMDDFRG\KTLVEELIPD 
WE I AREVELE \ MEHEDVTELLQ PHDKT 
WTDEELLLTDEQRWFLEVGSVPVEDAV 
NIVEMKTKDLEYYISLLIKY 


6853 


20754 


A 


6908 


331 


61 


FRHVGQAG/ LQKLLTSSDLPAPASQSAG 
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T TGMS HHT RPJCL ILR KfCSMRAKLDSROT 
RLQPPWEQGLLBEQHEVYNVYMAHVSK 
SSSRKLS 


6854 


20755 


A 


6909 


419 I 


199 


SAVQDGV\SWCDLGSL\QPPLSRLKQFP 
SPCWSDWSQTPGLK 


6855 


20756 


A 


6910 


341 


69 


EAEVGGLIiKARSLRPAWAIQQDP IS I LK 

GTTGVSHRAQ\L I F AFLVERRFHHAGQG 
GLDLLTL 


6856 


20757 


A 


6911 


174 


356 


KTYIMPIiRSSLRPGMVAHACNPSTLGGQ 

GGWITRSG/ esrps pvlvnrltrcqant 

v rvtr^JLtx 


6857 


20758 


A 


6912 


391 


3 


SPPPPPFPPSFPPFPPQNFFFPPGASSF 
FGPPPP / YFPPP PKKS PPPNPPPPFPFP 
PPLGKQFFS PPPPFFFPPPGFPFPPPPF 

LMICMLFFCKYILKFCRYS 


6858 


20759 


A 


6913 j 


43 


331 


IRDTYS FSLS APALAPMLGTKG VLLAYA 

SGCVCVCAC/CVCVCVRVCVCVCPHWPA 
SIiLTKDLQCLPFTV 


6859 


20760 


A 


6914 


250 


11 


TPNLWGLFLVFCFEM/ EVHYIAQAGLE/ 
PPGLKPSSSLSLLSSWDYRHVMPHAAQI 
CISVFSLAQKGVPPSNWILITHYS IKS 


6860 


20761 


7V 

4-i. 


0915 


532 




SAPNSSMSKWFYFSETESHSVAQAOVQ 

TYIiPPHLANFLFLVEMGFLHVGQAGLKL 
PTSGDLPALGSQSVG I TGVSHHTR\ PRF 
LFKIGPLFLFSKSMSQDEARI 








oyio 


z 


Hi 


WHTPVVTiATHEARAGGS LE PRRAKXOLA 
MIVA/NCTPAWVT 


OOOZ 




TV 


£01 T 

oyl / 


1 
1 




GTROSPRIiflTJ^SSCDHRRVSPHPADF / 
SFFYFYFFCR/DRGFTMLPRQVLIWAQV 
MLLSQPPKLLGLQVVGVSHGARPPHCIjG 
YSWVYNKP 


6863 


20764 


A 


6918 


2 


367 


RCAPHCPDNS SFFK/ RVKTGLTI FPRQT 
SNS WTPQ VI LLP WP P /KVLGLQA 


6864 


20765 


A 


6919 


3 


354 


HEVNIVETTAKDLEYS INLVDKTGFEKI 
DSS FKRRSTVS KMPSDS ITCYRE I FCER 
KSQSMLQTSLLSYFKKLPQPPQPSA/TT 
TVISQQPSTLRQGPPPAKRLRCAKGLND 
FQHFWL 


6865 


20766 


A 


6920 


366 


41 


STSGEDAVH I VEMTTKGLEP / YTAVPGF 
HPSSERRSWGNMLSNSVTCYRGIFGER 
KSQC/RQTSLLPYFN/KPQPPQPSATIT 
LI\SQQPSTSRRDPLPPKRLQLTESSND 
H 


6866 


20767 


A 


6921 


3 


368 


QLLMGLHLEPATLDLLQLKT\TIQGGSH 
SHHTTAIP\ PS\ FTRPTJTLMPTHT.SPTT. 
LI^iaiPDI ITGFFLLKKKNKPKNHKNRG 
PSYKRTPEGPLLLLWAQKTLPPKGGLY 
KIAPLRLFGEDS 


6867 


20768 


A 


6922 


3 


91 


FSVLPRLV\QNSWPQVICLPWHPKVLGL 
QV 


6868 


20769 


A 


6923 


334 


91 


ERCGKCAHIS EI FLSSLDYI YI FVF Y\ F 
FFLFFFFFFFFFFFFFFFFFFFFFFFFS 
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QFKFNTKS KE KMFNKTMKKVQVTF 


6869 


20770 


A 


6924 


375 


37 


TKAGVQWHDHGSLQPQPPRLKPSSCFSL 
PSS WD YRHAPP /RP/ALFLFLFGLVWFG 
LVFLVEIGSCYIAQAGLKLLGSSDPPTL 
VSQSARI IGPTHHAWPTLTTS IQCTIRG 
PG 


6870 


20771 


A 


6925 


647 


1054 


CFWKAHI QAGETNN IQEGSERFHFCPAD 
EVYVHSGPAVBDQPQRRS FAL/SAQAGV 
QWCNLSLIjQPLPPRFKQFSCL\SIiPSSW 
D\HRHVPACIiAKFC I FSRDE VCHVGQAR 
LELLTSGDPPASASQSAGITGMNHHV 


6871 


20772 


A 


6926 


3 


204 


DAWBTTGFGMI YDS LDYAKKNEPKHR\L 
ARHGLYEKKKTSRKQRKERKNRMKKVRG 
TAKANVGAGKKK 


6872 


20773 


A 


6927 


1 


112 


PTRPRTRGVASVLYFTTIIiILIPT\ISL 
IENKILKWA 


6873 


20774 


A 


6928 


1811 


478 


DRARS PRSERA\RRSSSRIiRQRDPSLGR 

RRLRREIRPGLPBSEPRPPRPPAAI/EAD 

QPPPRRLSESRGGGGMSE\AGE\ATTTT 

TTTLPQAPTEAAAAAPQDPAPKSPVGSG 

APQAAAPAPAAHVAGNPGGDAAPAATGT 

AAAASLATAAGS \EDAEKKVLATKVLGT 

VKWFNVRNGYGFINRNDTKEDVFVHQTA 

IKKNNPRKYLRSVGDGETVEFDWEGEK • 

GAEAANVTGPDGVPVEGSRYAADRRRYR 

RGYYGRRRGPPRNYAGEEEEEGSGSSBG 

FDPPATDRQFSGARNQLRRPQYRPQYRQ 

RRFPPYHVGQTFDRRSRVLPHPNRIQAG 

EIGEMKDGVPEGAQLQGPVHRNPTYRPR 

YRSRGPPRPRPAPAVGEAEDKENQQATS 

GPNQPSVRRGYRRPYNYRRRPRPPNAPS 

QDGKEAKAGEAPTBNPAPPTQQSSAE 


6874 


20775 


A 


6929 


324 


203 


LIFVFLVETGFHHVGQDGLDLL/NLVIR 
PPRPPKVLKLENP 


6875 


20776 


A 


6930 


3 


364 


HEAS LTQTRTTMTHCSRTTGCS TASGRT 
WTRSRHSCAWCSAALWRAAVASRCPSS / 
IPVTPPQCLSWP/WKVPLQCPPAIiS 


6876 


20777 


A 


6931 


2 


165 


GRVGFCSVAQAGLQWHDLSSLKPLPP\G 
SSDCPASASRIVGITSIAFHFLNNVF 


6877 


20778 


A 


6932 


1 


370 


TELRS YS PSNGAI SSQCNLCFLGSGYS / 
RSSTSQVAGITGA/R/HHTQLIFVFLVE 
TE FHHV\ AKAGLELLTSGDP PAS AS QS V 
GIAGVSHCAWPHVYILKEIMLGNLVRCL 
MSICHHHTESICIiFL 


6878 


20779 


A 


6933 


1 


441 


rrantpnnpeappn/qkkkkkkkkkkkg 

KNPPGGPPPFFKTPPPFKNGGGSPPGKK 
KGGEKKRGGKIFFPPPGGKKPPPQKKKG 
GGGGKKI FFFLFGPGGGGNFFKKKKGPP 
PFPPRKKKKIFPTGGGPKKKKKKKAPPK 
KPRGFYFP 


6879 


20780 


A 


6934 


518 


340 


PKPKNFPS / PSPPKIPPPKKKVFSKKPP 
GGFNNPPHKRKKYNFPPPEKLGPPKEFL 
KRPPL 




20781 


A 


6935 


396 


299 


WEVC/GVCVCVCVCVCVCIHSLCKSTL 
HRTGV 


6881 


20782 


A 


6936 


486 


'272 


PTRAPTR PAPPHLADFL I FCSDR /S LIM 
LPRLVLNSQPQMILPLWHPKWGLQA 


6832 


20783 


A 


6937 


1 


264 


YSNLRDRARS CLSLPSSWDYTCSWD YKC 
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ASLRLANF/ SNFCRERGLAMLLRLVSNS 
WAQVILLPQPSKVLRLCLAVSVFFGEKQ 
GQCS 


6883 


20784 


A 


6938 


2 


407 


TTGTCRSVWLTFVFLWTGFHHVGQAGL 
ELIi/RPPQPPKVLGLQA 


6884 


20785 


A 


6939 


422 


65 


LKGPPPSFFFFFFFFFFFFFFFFSP\LP 
YILYFDCASNACLFFKNNTWRYIKRIFF 
SPTPFI PFFHBVENVTSLS YMVFQYCFL 
FF 


6885 


20786 


A 


6940 


462 


209 


LSBFWDYRREPPLPAKFAIFMLORDRGL 
DTLPSIiVSS/SWHQAIIiAPQPPIMIiGIiQ 
VHS AYKGFCL PVLTFE EVI GNRPLGLLP 
P 


6886 


20787 


A 


6941 


491 


299 


CAEYSPESGPTHASAHAS / SNVQMVYSR 
ISCKEELLLGRTSPSKNYNMMTVSG 


6887 


20788 


A 


6942 


429 


256 


FSHLSLiPSSWDYKHLSSCPANFCIFVQT 
GFHHVGQA\GDP PASASQS VGVTGMSHH 

s 


6888 


20789 


A 


6943 


932 


661 


FKTGS YSV\TRLEGSGEI SAHCNLRFLS 
SSDFFHLSSQVARITGSSQYARLIFVFF 
VEMGFAMLPRVGLKFLGFKPLHPP\RTP 
RVLGLQE 


6889 


20790 


A 


6944 


381 


55 


PASLPPCSLISDCCASNQRDSVGVGPSE 
PGVGYSLWRRFLSRSEKRNIRVGVTRF 
SRCV/LSPLSLTQKGNSLTPCASQVRQC 
LALLRLAHGACTHWPAPTVWHSLVR 


6890 


20791 


A 


6945 


816 


513 


FFIFIYFFEMESLLPRLECSRMISAHCK 
VCLPRSSSY/PASASQVAKITGA/TRHS 
RLIFCVCVLVERRFHHLSQAGLELL/NL 
VTHPSRPPKVLRIiQGVTRTA 


6891 


20792 


A 


6946 


383 


234 


LIPSLMDNSEGLKTSVEEVTADWKIVR 
ELELKVKPE/NVTDLLQS\YDKT 


6892 


20793 


A 


6947 


423 


60 


LNPPPPPF\YKPPPKKKKFFSPPPLKFG 
PPTKFFKRPPPFFFFFFFFFFFFWHGTF 
PLNSLVGPPKVEGWSWAQLPRAVGPPL 
PNPKWGLNKPPAGGLNKP IHFPSLNGSG 
EATPPLIPWC 


6893 


20794 


A 


6948 


1 


327 


MRLLAAAfiTiTiTiLLALYTARVDGSKCKC\ 
SRKEPKIRYSDVKKL\EMKPK\YPHCEE 
KMVT ITTKSVSRYRGQEHCLHPKLQSTK 
RFIKWYNAWNEKRRVYEEAQGLRRRIG 


6894 


20795 


A 


6949 


952 


796 


FFLHFLVEMG\FIiLVGQAGLELPTSGDP 
PAIASQSAGITGMSHRARPSVSS 


6895 


20796 


A 


6950 


364 


6 


ITTHKNYFLSRTATPPKBGLPQRKKIFL 
FFTPI FIFFVGF I FWSGGGRGFLVFFFV 
FFFFFFFFFFFFFFFFFFFFFFFF\RVK 
TFIGFKVLFFKINVFFFLIRAVPLDYMN 
HALTIFQW 


6896 


20797 


A 


6951 


418 


266 


ISFFFXXXXXFFFXXXFFLLFFFPFFFF 
FFFFFFFFFFFFFFFFFFFFFFN 


6897 


20798 


A 


6952 


459 


7 


PSYLYKPGPSQPNRRRHSVP/HIRSNKD 
PPPPEVMKKWGPPTPQKKKYFPPRKKKL 
GGGGGPIYPPKKKVFFKKKPGGFKKPPK 
KKKKIFFSPPEKRGPPPSILKNPPHPFI 














MT.T.DMGA 


6898 


20799 


A 


6953 


325 


143 


LPSSWDYRRVPPHSAHFKFFCRN/RGLA 
MLPRLAKPRFPSISIFlJJHPQVSEIIiGA 
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TRLAR 


6899 


20800 


A 


6954 


585 


306 


KTESHSAIRSQCS\GEISAHCNLRLPGS 
SNSPASASRVAGI KGTHHQVQLIFVPLV 
EMGFHRAGQDGLRSPALVI CPPWP PKVL 
GLEAWPALHC 


6900 


20801 


A 


6955 


2 


155 


FLVETGFHHAGQAGIiELLISGDPPTLAK 
S\DSQDVRITGLSHRAWPFLDIY 


6901 


20802 


A 


6956 


3 


346 


DAWPDAWGK \RNWPPHQANF VTFVE/ TG 
FT/HVGQAGLELLGLSDVPTSSSLDAGI 
TGMSHCSRPRPEISSRDVHTIPGALQET 
WHTPDLASS IRSLRCRLDCPHNHCPLDK 
DGVTL 


6902 


20803 


A 


6957 


1 


224 


QCGR IRSRKDARKGNVQI IPREGRRRTS 
PRYLFPVTGQETDGWG\MGLPAWLLKTR 
STSPPRSPRLHRRRGATSW 


6903 


20804 


A 


6958 


102 


344 


GLIiKPKSLKLQCTI ITPVN \NHCTAAWA 
T 


6904 


20805 


A 


6959 


3 


296 


KEEEEGEGEGGGGGG\EEEEEEEEEEEE 
EEEEEEEEEEHAEFTGHVESITQCKLLL 
CHI^LRVEAGITHLLCFTVRAVTRSQRL 
LTKRRNGSVGHTSE 


6905 


20806 


A 


6960 


3 


172 


RLGLPKCWDYKREPP/RPGLELLTSSDL 
PAL VS QS DG I KDVGHDTE PQD FYTLGQD 
P 


6906 


20807 


A 


6961 


2 


224 


LALLPRLARSGTI IAHCSL\NRSLRLGL 
SDPPASASESTGTTGMSHCSQPAKLLST 
CIRNSVAETKYYLMFKCK 


6907 


20808 


A 


6962 


2 


330 


KGTI»RRQVTYKGTLIR\LVADTLAETFH 
ARKEWDDIFKVLKKHCQPRILYLAKPSF 
KNEGESFSQRKCITTRVARS KMAKGVLH 
LEVKDISTIJKQMKIRSSLVEQIHKW 


6908 


"20809 


A 


6963 


1242 


929 


ETGSCSWQAWQWCGHGYLKPQPSHAH 
/DDPPTASS IAGTTEAHYHAQLI FKKFF 
NSLCCPGWSQVSWRILGSSDPPASASQN 
AGITGMSHGAWPQLSIYRKGN 


6909 


20810 


A 


6964 


1 


378 


KFFLQIiFLHS FI YEHFLFFS FLFFFFFL 
RPS FAFVAQAGVQWRDLGS PQPLPPRFK 
QFSCLSLLSRWDYRHAPPQSANF\EFLV 
ETGFLHVGOAGLELPTSGDPPTSASQSA 
GITGVSHRTRLAFY 


6910 


20811 


A 


6965 


393 


43 


VS PFPLKNFYFSFTPKTFVGGGGPSGPP 
PQKRFFFHTPNSPFFPFPPF/YKSGPRP 
GFFFTPPPEKGKCTFP\PPLRLGPPPVFF 
TRPPPPFFFFFFFERQRTCSFLHFLWR 
PLFRFYL 


6911 


20812 


A 


6966 


225 


2 


TRVECHGVFFGSCNPLLPGLKEFSAPAP 
RGSGNSGPFPPPR/RNSFFFFFSWLLB 
MGFHHGQADLEILTSSDPP 


6912 


20813 


A 


6967 


8 


341 


FVSVPPI LHYLFKI FFFFLRRSLCLPPR 
LECRGTI S AHGSLGLPGSSDSPS /STKN 
TKLPLRGGTCIjGPQLLGLLRHEKTWTLG 
GGSGSYPKSGKRAMALQGGVQNKTCVPH 


6913 


20814 


A 


6968 


29 


362 


DYTCKHPHSSELKKKKKKKKKKKKKKKK 
KKKKGGPPKKKP /HGGPQFP PGRQKKI F 
PFKGGFKKPPRGFFEKNPFFGGGNLGPP 
PPPKNKPPGEKKNFLGGRGQKNS FLSAW 


6914 


20815 


A 


6969 


391 


17 


VFFFFFLKKFFPP PDKKLFFPPFPLKIF 
F/SPPNPFFFLGGFSQISPPPKKGFFPK 
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IPPGV/YFSPPFKKKNFFFPPPLNFGPP 
RVFFKGPPPFFFFFFFFFFFFFFFFLGL 
ROJAGDCNGYGDVSYK 


6915 


20816 


A 


6970 


374 


268 


IMLPRLVLNSWPQVILLPWP\PKVFRLQ 
VRAPVPG 


6916 


20817 


A 


6971 


614 


315 


FFFGDAIiS PGLECGGAVIiARLQA/ CLLG 
SHRSPASASRVAGTCKRPPPRPSGFFVF 
LVEAGLHRVGQDGLNLL /NLVI RPSRP P 
IVLGLQGMSHHARLFYFL 


6917 


20818 


A 


6972 


861 


528 


FETESRSVTQAWQWCNLSSLQP/LPPG 
FKRFFCbSliPSSVmYRRTPPCPANFCIF 
SRDGVLPWWPGWSRTSDLVIRLPRPPQC 
WDYKREPLRLAKSRHPWNHLPEQEVEHF 
K 


6918 


20819 


A 


6973 


274 


119 


PPN\FVFI»VETGFLQGGVKLSTPGDFPP 
PASRRAG I TGVSHP ACPGKEY I FKN 


6919 


20820 


.A 


6974 


370 


194 


KITNFAKIiFGYIIiPYSHYFWCLEKNRNR 
/SLTLLPRLVSNSWAQAFLLLWPPKVIiR 
IiQA 


6920 


20821 


A 


6975 


2 


239 


ARGPCSS I / SDCCASGEQGSVGVGPAEP 
G/MGYNLLACCLIiRPLEKCS IRAGVSRF 
SWYSLSRLPLT\RKGNPPTPCISQVR 


6921 


20822 


A 


6976 


361 


270 


PQWLVPVIPA/LIiETEVGGSLEPRSSRP 
AWAR 


6922 


20823 


A 


6977 


363 


48 


GPPKKMCPYPN \ PRP YECNLFGKRVFAD 
VIKFKI LRW\I ILDYTQVDPKSNDECPY 
KKQKRRGHRQIKRKSQADQGGRDWSGAT 
ARNAKMITSVGEDVRKRKPLAYC 


6923 


20824 


A 


6978 


9 


163 


AGYNFLVCC\LPRLLEKCSIRMGVSRFS 
RYHLSWLPFARKGNS PTSCTSWVR 


6924 


20825 


A 


6979 


408 


212 


I ETGFCHVAQDG/ LELLASDDP PAS ASQ 
S AE I TD VSHRAQPHS AFIiP WFS SKI FAY 
GSVWLVFPHS 


6925 


20826 


A 


6980 


395 


254 


LIFVFLVETRFHRVGQAGLELLTSSDLP 
SLASG\ITGVSHHSRPSSAF 


6926 


20827 


A 


6981 


408 


2 


KVSFFFPKGPRGKKKNGAGGFSPFFPPF 
LGGGTKNFFYPPIFSQGGGKK/QKPPPP 
EKKKNP PNPAFWGP P L VLKNWGGRPGGP 
LDPKVSNFMGAPPFLFGAKKKTFFSPPP 
PKKKKPQKKKGGGRSRSRTSRTRG 


6927 


20828 


A 


6982 


2 


302^ 


RKHLP PH \ PVI FVFLEQATLRHVGQAGL 
ELI/TSSDHRASASQS AGITGVSHCS LPA 
TSFSTPPFSVAKMLRGTQKMKVPLIGPL 
PQ P IRL VAGHYF I Y I T 


6928 


20829 


A 


6983 


2 


375 


RGRVGGPVGGPVGRTFIFFQAGSDSARI 
LVLFPYCTLPHPLPGLFFFFFKTGIKSW 
VUjNPSPRFVTFLEKKIFLPQPPFF/HI 
LS PPHPGGAGGGNPTARKIKGLS YRDPK 
EGHKRGWAHTPTP 


6929 


20830 


A 


6984 


1 


146 


PRPANFC/DFLVETRFRHAGQAGLKLST 
SSDLPTLVSQSAGTTGMSHRA 


6930 


20831 


A 


6985 


2 


95 


LTMLPRLECNDTI PAHCSLNS / SGSSHS 
LTSTSKVAGTTGVHHHFWLMF/PLFI IQ 


6931 


20832 


A 


6986 


340 


169 


VCSSGLSSPLLEQHKTMLIFYASGDICT 
/ANGKSGFNQPQPFLKTFCCTHRIMSCT 
YL 


6932 


20833 


A 


6987 


346 


218 


PCLTNF/ S I FCRDE / SLTML/ PRLVLNS 
WAQAILLPRPPRLIiRLQA 
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SEQH> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D-Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, t=Isoleudne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q =r G!utamine 9 R—Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=*Tryptophan, Y=Tyrosine, 
X = TJn known *=Stnn cndnn /=Dn*sihle 

nucleotide deletion, \=possible 
nucleotide insertion 


6933 


20834 


A 


6988 


345 


141 


NRPKCPSMIEWIKKMWHIDTMEYYAAIK 
KDKFMSF\ETITLGEVTQBWKTKHRLLS 
LIRGS 


6934 


20835 


A 


6989 


154 


236 


ELISFSRSVERCISSFIKVLFVETDLFT 
LSLS P FFSVLFFFC / RRWGFTMLPTLVL 
NSWPQVILLPWLPKVLRLPT 


6935 


20836 


A 


6990 


2 


156 


FLVE/TGFVHVGQVGLELLTSGDLPASA 
SQSVGIIGVSHRVRPCFLFVCICLV 


6936 


20837 


A 


6991 


310 


96 


KNPFFFLVKKSP/LNVAQGGLKLIiGSND 
PPASVSQSGGITGVSPRARPLNFYKEAT 
DPNTSVLKGLNI KKKKN 


6937 


20838 


A 


6992 


15 


279 


NLLCFPSISPLIGIFFFFERGFHHVGQA 
GLKLLTSGD /LRASPSQRARITAMTRHP 
HPLVCGLLNS THCDWRGR I PHWGFDLRF 
REGPS 


6938 


20839 


A 


U77J 


197 


^9 


KLADTPOSRFVLVCV / LIETKYSIjCCPG 
GLQLLASSNPPTSVSQSATITDVNHHAE 
SQVFLNIiWPKSSKPTNTGL 


6939 




A 




34*3 


1 10 


SSSBDH1GIVPPHSAMFCFF / CLROSLTM 
LPRLVLNS WAQVI LP PSP PKPFLDALKK 
KSSKWMSSLLIT 


6940 


20X41 


A 


U77J 




1 
l 


FFFBTES CSVTQAGVOWCDIjGS LOPLPP 
GFKQFSCLS/LPNSWDYRHALPCPANFC 
IFSRDGVSPGWSGWSRIPDLG 


6941 


90842 


A 


6996 

W77W 


^46 


191 


RDICTPMFVTALLIAKIRSPHKCPSVDT 
WIRKMWF\IYTMEYYKAIIKNKILGMSH 
HASLP VFNHMCKLKYLI S 


6942 


20843 


A 


6997 


2 


198 


SNNTVVAFRE PGI TGMCHHAQIjI VLLYM 
QFHHVDQDGLDVR/NLVICPPWPPKKLG 
IQALNQPSPDI 


6943 


20844 


A 


6998 


183 


31 


MRHLKSSQIQQGTVGHACNPSTLGDHGE 
RITQGQEFKT\GNIVRPCLYKD 


6944 


20845 


A 


6999 


j 


471 


L7VLS CSGT ILAHCNIjHLLGPS L P PTS AP 
RAAGTTGVHHHAWL I FVFFGKDGVAQPG 
LKLSFK/VTRPPQLPKVLGLQA 


6945 


20846 


A 


7000 


483 


239 


GNNNI LFLFLFFIETGSHFVT \RLECQW 
VQSSAHSQPSAPRAQVLIFVFLLQTGFH 
YVG\QAGLQLLTSSDPPHSALLKVRD 


6946 


20847 


A 


7001 


132 


353 


YAKIiGTRG FARGP WPGG FLTVAVWMS F 
AQAGINPFVCIFSNRELRRCFSTTL/PL 
LQKIQVTKGTLLCYMREHL 


6947 


20848 


A 


7002 


2 


399 


EDAQBVELQEGKVHSESDKAITPHSQEE 
FQKQERESAE\SELTES 


6948 


20849 


A 


7003 


2 


135 


IFQFLVEMGFHHVGQAGLELL/NLMIHP 
PRPSKVLGLQAYWHSTRP 


6949 


20850 


A 


7004 


2 


114 


I FQFLVEMGFHHVGQAGLELL/ NLMIHP 
PRPSKVLGLQA 


6950 




A 


700*5 


4^ 


i 
i 


ALFPAWL WLPS S S KGGLKLSHGTTLTLF 
LLPSSTLKDLWPQEQWL/GTCNSQHFGR 
PRKADRLAPGVQNR PGQRGETP SRKKGG 

NWTWWCVPWLAFQETEVGGPLEPEVAP 
AWVTVRPCLKKK 


6951 


20852 


A 


7006 


371 


172 


QVILCLSLPGSWDHRHTPPHPA/NFLVE 
TRFQHVDLELLASNDPPASASQSVGITG 
VSHCARPCYDL 


6952 


20853 


A 


7007 


486 


252 


PQTVFFFFFFFFFFFQALFSTFYILHVF 
FFFFPPFFPKFQFFPFP/LSLIKQGKFF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQIB 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

smino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alantne 
C=Cysteine, D==Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFCYLCNSLFIWTQYPFVNE 


6953 


20854 


A 


7008 


1 


113 


KNFCR/DRSLDTLPRIilSNSWPQVTLLP 
QPPKVLGLQA 


6954 


20855 


A 


7009 


105 


383 


SLHRPAHPGLLKFGKKAIHSFIHLCRDK 
GDLAILPRLVSNSWPQVILLPWST /NVL 
RLQA 


W7JJ 




A 


7010 

/Ml V 


609 


348 


ESHSVTQAGVQWYDLDSLQPLPPGS \SD 
SPASASHVAGITGMRHHTQL\ I FVLLVE 
TRSHHVSQTKFVLFDTTKFWIFYSGKR 
TRIRL 


6956 


20857 


A 


7011 


382 


233 


DLE I FRGKTFLHHI FPFFFFFFC/ RKRG 
LAMLPKLVLNSWPQVTLPKCWD 






A 




359 


199 


DLLPSIiPPFLPSFL\FFLFKTESHSS PR 
LECSGAIPAHCSLGDRARHRUCINK 


6958 


20859 


A 


7014 


3 


274 


CFFFFFETGSHSVTQAGVQWRHLDSLQP 
PPPGCKRFSCLS /LPSSWDYRHV/PSSP 
LLVRNSSICLLSCFEMTPELFDGVLAIY 
CCVTNDLKA 


6959 


20860 


A 


7015 


372 


54 


LPS AWEAEMRGPLE PKGLRSQRAE IAP \ 
CT PAWVTEQAYVSNKTNTERP FQKI EHV 
EGLSCTTQKFSSND 


6960 


20861 


A 


7016 


548 


158 


KFDT/GATLFDGRPAVFDEA/DFPSVAV 
YLTGAE YTGEELDSDTWQAELH IEVFLP 
AQVPDSELDAWMESRIYPVMSDIPALSD 
LITSMVASGYDYRRDDDAGLWSSADLTY 
VITYEM 


6961 


20862 


A 


7018 


396 


34 


QKNE I KPTMQLHLTPARMAI I KKILKNG 
GWHGCGE /METLLHCWQECKLVQPLWKT 
MWRFLKEPKVELPFDPAIPVLGIQQSQH 
WESSNPSTGDPAI P PliG IQQSQPRRKVT 
KKRYLHTRIi 


6962 


20863 


A 


7020 


2 


241 


FPGPTAARRRQKEDRYEEYDKVLSDKLK 
EABTRAEFAERSLTRTjEKS iddlqdbl \ 
yahklkyqaiseeldhalndmts i 


6963 


20864 


A 


7021 


347 


3 


NDIiKSLTHSVAQARVQWQ\NISSLQRPP 
PQFKRFSCFSLPSSWDYSLIjNCKFYHIiH 
FADEEIETLTSQRAQLECKSNMYKDIHN 
TVRSYKLKYKVPQKLERSRNRTKSSTRE 
VRS 


6964 


20865 


A 


7022 


2 


160 


LREDDRRGRGFHHFGQAGIiELLTSSD/L 
PTSASQSAGITGVTHRARPDWSFCQF 


6965 


20866 


A 


7023 


3 


248 


DRVSSVAQSSLELLSSSNPHLGFPKCWD 
YRPNPPLPFSSRSLLFLPSFLIFCRDRV 
SMLPKLASSFW\VRPPWPSKVLGLQA 


6966 


20867 


A 


7024 


483 


143 


PLNPLKFFFFPKAFKFWGGVGPICSPPK 
RRVLSQKSQVGFYFPPQKEKGYTFPSPG 
KFGPPKEILKRAPPFFFFFFFFLRRDK/ 
SLTMLPGLVTjNPWPQAVLPLQPPKALGL 
QA 


6967 


20868 


A 


7025 


414 


81 


PHPLKFFFFPETFYFWWGVFS PFSPPGR 
KVPPQKSPGGFLLAPPRGKSFPFPCPFK 
IGPPRGIFKRAPPYFFFFFFFNRRDK/S 
LTMLPGLVLNPWPQAVLPLQPPKALGDQ 
A 


6968 


20869 


A 


7026 


16 


218 


KFAPRCRNSARENTQNQPRKHSDTPFLK 
X/HKINNNNKLAGRGTHLWSQLLGRLRW 
EDHLSPRQSRLQ 


6969 


20870 


A 


7027 


28 


272 


EFVGVGPSEPGAGYNIiL\CAFI*SPSEKC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 

• 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phen>Ialantne, 
G=Glycine, H=Histidine, l=Iso leucine, 
K-Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \-possib1e 
nucleotide insertion 














SIRVGVTRFSRCHLSPLTLTKKGNSLTP 
CASRVRQCXiTLLRLAHGALHPLSSTP 


6970 


20871 


A 


7028 


277 


480 


I/3IFHMLSDSFLLLLLFCLFVFKIFFGI 
FCRD\RSCYEVWAMLPRLVSNS WAQVI L 
PPWPLKVIiGLQA 


6971 


20872 


A 


7029 


24 


153 


SVWWNS P/RSSGE FRKTGVHDADFEAH I 
IDMLAKASKI E VYRGN 


6972 


20873 


A 


7030 


2 


470 


IETPRPIiWWHSAYRGROAS LS CGGLHPV 
R/AS WLLCLPNQAWAMAGAPP PASLLPC 
SLIS0CCASNQRDSVGVGPSEPGVGYNL 
LVRRFLSQSEKRNIRVGVTRFSRCV/LIi 
PLSLTRKGNSLTPWASQWQCLJUiLRFA 
HGARTHWPAPTVWHSLVR 


6973 


20874 


A 


7031 


487 


107 


SPPAPGVAPAP PP PTRVFS KKKKKFFFF 
FLKKI FFFKPGGDPGGIFFFPGGPPPGP 
KKFSPPLFPKKGGYRVFPPPPGKISIFF 
FFFFFFFFFF/CRDR/ SLAMLPRLVLNF 
WAQVIIiLPWS PKVLGIiQA 


6974 


20875 


A 


7032 


576 


36 


GYTSQMGRPGRGAPHFPDRAAGQRRSSL 
PR WGG WAEALIiTSOMGWR PGRGAPH T pn 
DGQPGRGAPHLPDEEQLGRGAPHLPDEE 
RPGRGAPHFPDGAAGQRCASHPTRGGRA 
EVLPTSQTGQPGRGAPHLPDDGRPGRGT 
PHLPDG\GPGRGAPHIPDGAARQRHLSP 
PRRGNCTTRIVDQ 


6975 


20876 


A 


7033 


2 


129 


YGPTHASGAMLRS CAARLRTIiGAIjrii P P 
VGRRLP\ EASRDPS 


6976 


20877 


A 


7034 


1 


287 


RLSLALVARDGVQWCDLGSPQPPPTGFK 
RFSCLSLPSSWDYRHVSPRPANFVFLVD 
TGFIiHFGQAGLELPTSGDPPALASQSAG 
I\TGVSHRAWPK 


6977 


20878 


A 


7035 


3 


315 


HAS AHASALFYF I FFETRSRS VAOAEVO 
WRKLGSLQPPRFKQFSCLSHPYRHAPPH 
FANLVFLVETGFA\HVGQAGLELPTSSN 
LASQSGGITGVSQRARPGLNF 


6978 


20879 


A 


7036 


430 


304 


VI FYLI FFCKDGV / LTMLPRLVLNS WLH 
AILPPQPPKVLGLQA 


6979 


20880 


A 


7037 


3 


256 


FFFFNRQKDLFGNFQTOQKWHS PS PRPT 
/APHQAPKGNTSPPSLPIHLTLASTCTY 
P I PQAMLENWS VP S PAQAAGEAALWHP R 
R 


6980 


20881 


A 


7038 


13 


481 


FARLHEFGTSRVI YLLARLVSNS WPQVI 
CLPWPPKL/LGLQA 


6981 


20882 


A 


7039 


455 


255 


SCLSLPSSWDYRRAPRHPANFR\FLVET 
GFRHVALAGLKYPPALASQNAGITGVSH 
HARPEQFILTN 


6982 


20883 


A 


7040 


490 


172 


KCWQG/ C/GETGTLVHC/ WIGRNCTMMR 
PQWQQAWQVFKKLHREIiPSDPAMPIiIiGV 
QPGGLKIDTQT/RYMYMALLTVKRWKQP 

ECP PVDK/ ldklw y ihtteyrvti kkk 


6983 


20884 


A 


7041 


438 


37 


IFPNLPPKLHYFPQFSPPEKLPKPLLIK 
MAKRQSQGKVFETPKEK/SRAGPPPKFT 
TGFSAE KNGGPKPGGY I S QRVQKKKNNC 
HSRILHSAKLSFKNRSEIKTFLNKQKLK 
/DFTASRPALQKILQEAEFHRTD 


6984 


20885 


A 


7042 


292 


463 


IEAISFLPFFFGDSLTGAQWRDLGSLQP 
LPPGLKRFSCLSLPSSELP/CSGRCQS 


6985 


20886 


A 


7043 


59 


257 


LASLVSQSWMKKMWYIYTMKYYAATK\G 
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SEQ IB 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etfa 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

Amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Ala nine 
OCysteine, D=Aspartic Add, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Giutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NEIMS SAGTWMKLE AI ILS KVTQEQKTK 
HHMLSLRSGS 


6986 


20887 


A 


7044 


279 


14 


RTADTEIGRGSSLCWRYGSLQIQVSKSH 
NFFFETGSHSATQYGVQWHNLGS PQPQT 
YGLQQSSHCGLPKCWDYRK /LP PSMECS 
GTISAHLSLKLMGSSNPLTVASQSAGIT 
GNSSSRLHDINYGLQ 


6987 


20888 


A 


7045 


578 


253 


AAASTFSCFFVFFETEPNSVA\RLECSG 
ATSAHHKLCLPGS SNSPVSAPQVAGI TD 
AHNHIQLIFVLVETGFHHVGQAGLELQ/ 
NLVISLPWPPKVLGIiQAWSHHAQPFV 


6988 


20889 


A 


7046 


3 


229 


FFKTGSLSVSQAGVQW/PNLSSIiQLPPP 
RFKQFSCLSFPSSWDYRHPPPHPANFCS 
FSRDRVPPCWSGWSQTPDLR 


6989 


20890 


A 


7047 


367 


151 


LSLLSS WHYGHVPTS PANFFYF / CVDTG 
PRCVSNSWTQAVCLPQLPRVLGLQA 


6990 


20891 


A 


7048 


370 


141 


CQPPHPANFCI FVE I VFCHVGQTGFKLL 
TSNDIiPAWASQSAGIWD\TGVSHHAWPS 
RFNSBVQFDYVSNKNVDFSLS 


oyyi 












VTAPGLEAAFKBRVWPGAVARTCGPSTL 
GGRGRRI \RGQE IKTILANTLKPCLY 


6992 


20893 


A 


7050 


2 


325 


RFSCLSLPSSWDYRHEPPRLAIFFFFFF 
GI IRKGGFTH\ VGRGGFKPWTSGNMPAL 
PSQGVGFPGIiFSPPRPVRGFFYPIjPKWG 
GLFLQNGVKFKKGGSGDLFKMEKK 


6993 


20894 


A 


7051 


14 


208 


AHEILSEIENPLEPGWDHRCVSSCPAHF 
FVF /CYRDGGLPMLPKRVLNSWAQAIIiP 
WAPKVLGLQV 


6994 


20895 


A 


7052 


103 


368 


GNRVHTYMLTALFMTAKKWNKPKC/PVS 
DEQIRQIWSVHAMEYYSAIKRNEALICV 
TTWVNLENIMLSESSQKTMGCMIiRRSIY 
MKGPE 


Wyj 






fKJDJ 


A91 


9 
JL 


LFLP PEVARGFPKGRGRAS PS PGGFFFG 
PRGGPIFPPP\SFFPTRPWSKGEVFPSP 
TPGKGFRGFFFPPPPKGKGGVFPLFFKK 
YLGGBGAPQNKGGPVFLLWPPHSGPPGP 
PKNFFFFFFFFETESCYVYPGCSAWQS 
Q 




OAOO/7 






480 
**oy 


40 


I PEAPGGPGFKREDS PGANKSGPKALYP 
P KTFHPP WEGRFRS FQKGAP P PPVFFP P 
FFPLPFFFIiRQSHSIPQAGVQWRDLGSL 
QPPPPPGFKQFSCVSLPCPP\RQMGFRH 
VGQAGIiELLTSGHP PDSASQSTGITGVS 
HCAEFHRTD 


6997 


20898 


A 


7055 


3 


261 


TCSGS VTQAE VQAE VMAHCSLNLLGSS D 
LSALASRAAGAAGTCHHAWLI FVFFIEM 
GFHYVPRL / GLELWARVIHPPQPPRVLG 
LQV 


6998 


20899 


A 


7056 


1198 


1040 


FIiWKIALLFYFKKLPQSLQPLVATTLSS 
QQPSTSRQDPLPAKRL\RLAEGSDDR 


6QQQ 
\>yyy 


900 Aft 


A 


7057 


971 


j j 


LFFFFKMESRSVARLECSGAILAQCNLA 
RLFLVE\QGFHHVGQDGUGLLIHPPWPP 
EVLGLQAQATPPSPKITYEAATVM 


7000 


20901 


A 


7058 


2 


194 


CYRRQPPHIiAMNFYFLLNFFCLFFC\KA 
LSLLARIiLLNSQLQASLPPQPPKVLGLQ 
ACGTTPCP 


7001 


20902 


A 


7059 


3 


272 


NMWKLNNIIJ^QimrGEIKMKIFKNCK 
E\NGNTTYQNLRDAAEAFLKGKFIAVNT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginniog 

nucleotide 

location 

correspond 

ing to first 

ditiSnn opiH 

^UIIUU Ut-IU 

residue of 

pcii IIUC 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
uing to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lsoIeucine, 
K^Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
v^oiuiamine, k— Arginine, o— oenne, 
T=Threontne, V=Valine, 
w — irypiopnan, 1 s3 tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=posslble 
nucleotide insertion 














VTTf KTIYP ffTr , T.PT ^RVCJVTJPPWTrK'TfR V*PT. 
x x jvt\jj x fv.rv i. y . 1 1 » r* x o inr r w aj2>x\jz» x lu 

KTMIKV 


7002 


20903 


A 


7060 


397 


129 


TOGPHKKNFPLPARGRKLGSFKTA/LPL 
FFFFFSFSLSLFFFSGSCS VS \RLKCSG 
TITAYCSLEFLGSSNPPASFSQAAGTTF 
x r rxjr iu 


7003 


20904 


A 


7061 


3 


351 


SCFRLLCtiPKEAWAMAGAPPPASLPPCS 
LISDCCASNQCDS / VGVG PS EPGAGYNL 
VVRRFLSGSEKRNIRVGVTRFSRCRPSP 

IjoxjxKIUsJNoxjI rLoolj vKt^Ci-iAl il iKXiAil 

GARTH 


7004 


20905 


A 


7062 


60 


324 


DDFVSWQTGVHWCGRGSLKPQPPRLSS 
TSRLSSPSSWDYGRVPF\NIFCRDRVSL 
nyDf*WDPnpo - nT Ofio orocDinnnrT in»mr 

IrvaWJrKX xriJiaKv70o^-FolrJN.V V iSXitl J.KV 

TAPGQ 


7005 


20906 


A 


7063 


481 


160 


RPGAPG/ PDDFARMGPRPKGGAFTLVHP 

MPSGAAFPGLIiSEP\PHRGPRSFAARSP 

TLRYPGRAPSKGSGSDPARPPATGPGPN 
r.Torvr.i/T mot *n?xrt/'r* 


7006 


20907 


A 


7064 


65 


481 


CLCP APRGGAYRGRQ AS IiS CGGLHP VRA 
SRLLCLPKQAWAMAGAPPPASLPPCSLI 

OTW"»2V C! \T/"1 DT>OT Tfl 7/5 DODIV^T 7/5 VOT TfTTTj 

oULwuM UKUo VO V Vi JroxSFv* vvi xoLiVVK 

RFLSRSEKRNIRVGVTRFSRCV/LSPIoS 
LTRKGNCLTP CASQVRQCLAliLRIiAHGA 


7007 


20908 


A 


7065 


392 


183 


GGLKP/IPGNPGNSLFSQKKKXQPGAGG 
FGPAPPPGGQKKKSLF 


7nno 

i\J\JO 




2V 
/i 


/UOO 


jjo 


Ovf O 

248 


F\KFSCHSLSSSWDYRCAPPRLAN\FVL 

T . AFMflT?P'FHWT anifiT .T?T.PTQXmT .PTC a. 

SQNAG I TGVSHHT 


7009 


20910 


A 


7067 


2 


127 


VTGQAWHACS PGTLGARAG / WIVWAQQ 

C C% X O XJV7 X<*1 V Xvtr *■ - 1 » I. XI 


7010 


20911 


A 


7068 


379 


217 


KKMFCKNE/SFTIFPRLVLNSWAQTILP 
PWPPKVxXSLQVQTTVPGSYSPLSIiYKE 


7011 


20912 


A 


7069 


386 


176 


KRDKVDKWIjKKMWCIHIMEYYIiAIKKET 
LPFATWVNLQ\TFILSEISQEQDKYHMI 


7012 


20913 


A 


7070 . 


30 


408 

• 


NSSEaXSSVGVGPTEPGTGGNLLVCWLLR 
LWEKCSVWAGVSRFFQ/CTDYDGFPWVG 
KGNP PTPCASOVR R WP AT J .R P TT . Wf!T .HP 

CASHEKNQVPQLEMQKSSVFCXDLTGSC 
RVELFP FGHLGSKNP 


7013 


20914 


A 


7071 


310 


414 


HVPWRSFAPAHCNLGSLQPPPPGFKRFS 
CLSLPSSWDYRCPPPRLAN\FCSGSLQS 
WLTATSASWVQAI LLSQPPE 


7014 


7091 S 




707? 




74'* 


T FT PPR V / PflT .T T T , <5 R T tVWNTywrinVT T.T. 

PEPPKVLGLQV 


7015 


20916 


A 


7073 


1 


276 


ELWDNFKEPNIYLKSLKERJCECVCERGR 

IVUrUNiX XXj£tXJXl v l/Vl\XM r oIN wrJA.x X vif\£±\£u 

AQQTPN\QHTCRKLALRHITIKLLKTND 
KKKILKAV 


7016 


20917 


A 


7074 


362 


133 


GGGGGGGRYSPQKKGTLPLTPPSNWGPK 
GAPP PRGNFFFFFFFS \RDGCLAMLPRL 
VLDSWAQVMLLPQPLKVLKL 


7017 


20918 


A 


7075 


3 


177 


KWSPHHSLPNN\WLYRHMPPCPANVEFF 
CSLAVLPRMVSNSWPGAILSPWPTOMLG 
LQT 
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SEQID 
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nucleotide 
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sequence 


M 
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Predicted 

beginning 

nucleotide 

location 

correspond 

frig to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Iso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7018 


20919 


A 


7076 


1 


169 


PQLPSSRNYRHVPPCWANFCIFS/RRRC 
FTMLAiU,FT^SWPQVTRPHWPPKVLGLQ 
A 


7019 


20920 


A 


7077 


402 


231 


PSCLSLRSSWDYRCMPPHLTNYYYYYYY 
YFVEMGSHSVTRPQDILPPWP\ PKVLGL 
QV 


7020 


20921 


A 


7078 


422 


316 


FCKDK/SLPMLPKLILNSWVQTTHLPQP 
PKVLGLQA 


7021 




A 


7079 


JO 


791 


DPI CTKS KKLKKTQ\TDKEDE I TTKKAK 
KAKSKI KIEEDGEEGKWWEEEVS VKK 
KKDKKKPWLGMVAHTCNPNTIjGG 


7022 




A 


7080 


363 


123 


GVFFRDHARAFSLHTGVCAVHISRCF/H 
VLTHFKE FIjFHWFLTIHTHTHTHTHTHT 
HTPSFPELGKAVPMHRIVIVPTNYQH 


7023 


20924 


A 


7081 


3 


181 


YFLL\ FFWDRVLPCHPGWS KGWSQTPGIi 
GQS AHLStiPKHWVCRREPPHPALFLLFK 
NTIS 


7024 


20925 


A 


7082 


453 


142 


FFXXFXFXXXXXFFFFFFFFFXPPFFFF 
FFFLKNSFPYFFLFFFFYPLFLPLFFFF 
FPPNFFFFFLFS *NFFFFFFPPPPLFFF 


7025 


20926 


A 


7083 


268 


92 


KNNLPSSWG \YKCSPPHKGNRFFFYIKD 
GGGLIMLLRLISNSWAQVILLSWPSKVL 
ERQA 


7026 


20927 


A 


7084 


413 


o 


SCSVAQDGVQWDDLGSLQRLPPKNKPFF 
CLSSTPAA\FPSDWDYR 


7027 


20928 


A 


7085 


400 


125 


KTRSFSWRGGVQIj/CI/GSLKPRTPGV 
KSSSPSASPVGRNTGMCPPPOQIFFFLV 
ETGSHYAAQASLKQSSCLGFLILFLSQP 
LGEKEGPTICT 


7028 


20929 


A 


7086 


377 


38 


NPGPGF YFWGPKKKI KPFPPPGVKLVS F 
KRAPPFFF/CFFFETGSPFVAQTSLELL 
DSHNPPTSASQSAGITDMSHHARPMHSI 
CMIQLWKSYHRFHFISQGSRDCSSSRBE 
T 


7029 


20930 


A 


7087 


\ 


151 


S FCRD /KRLIMLPRIiVLNSQAQVMLLPQ 
PPKVLSHHTWPI IKFSFMHFNLF 


7030 


20931 


A 


7088 


405 


111 


PRLVLN \ S BLRRSAHLSLPKCWDYGPE P 
PCPAQMHVLKSGTAAHKVHDFLASWSHL 
PTHSPFPVLCSKTQISVAFFIQNLRAGQ 
RSGQRHFGKQCGSIH 


7031 


20932 


A 


7089 


1 


309 


KQ INKS S LR ISGKP PNCECVLHLGKS VN 
KSLLE PHKLASDP /HLCDFLESS \EQVK 
SVKELNCHV\TSLYTLGALESDMAEYF\ 
. NKYTKPGMLAHAYNSSILGG 


7032 


20933 


A 


7090 


414 


240 


FSLF/CKNLKKGLTMLPKLVSNFWPQAI 
LLPQPPKVTiGIi 


7033 


20934 


A 


7091 


460 


328 


RHPPPYLGK/FFCRDR/SLTMLPRLVSN 
SWPQVILSPWPPKVLVL 


7034 


20935 


A 


7092 


388 


199 


KQFSCLSLPSRWDY/SALPPRPAHFCIF 
NRDGFSPYWSGWSLGLPKCSDYRRRIFL 
RAKAHVRE 


7035 


20936 


A 


7093 


18 


271 


MHSFATYIiLVNLHLLVLCRLCPLSMYSG 
EKSYILLTBCRPMLWF/SSGPFPSWSFSS 
ITLRIFLTCLTLKLSCCSCCCCCYCCCC 
C 


7036 


20937 


A 


7094 


415 


291 


GDTJ3S PQAPPPG \ S RHS PAS AS RI DGTT 
GARHIMNFKNLAIK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino add sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M= Methionine, 
N=Asparagine, P=Prollne, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=»Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7037 


20938 


A 


7095 


21 


428 


DKGLTVTQARVQW/ HDPSSRQPPSPGLK 
RFFRLSLPSNWDHRCAPPRVANYYFPYR 
WGSYHVAQAGLELLASSNTPTSASHSAG 
ITALFLRDTLVRFFMGSFCFPQSPGAAF 
TAQIPQRGRTEELFIFLFLNDYLFS 


7038 


20939 


A 


7096 


320 


134 


LPL \ PKTTFYLFLNRDRVLLCCPGWSR \ 
ILSSKQSTLLGLPKCWDHRHEPPQQAVD 
HTLDFS 


7039 


20940 


A 


7097 


402 


221 


DYRHVPPFPANF/LVEMGFWHVSQAGLE 
LLGSSDPPNTSQSAGITETSHQAQAYTF 
IFIIY 


7040 


20941 


A 


7098 


427 


259 


PPPRFKRFSGLSLLRRWDYRPPPSCPGN 
FFFFF\CIFVQMGFHLVGQAKMIiKLHD 


7041 


20942 


A 


7099 


422 


275 


VQWYNLDSLQPASSGFKRFSHLRLLSTW 
DYRH\PRPANFCIFSNTKDTFI 


7042 


20943 


A 


7100 


3 


404 


HAS GATWRNP VSTRVWWHI P I VPATQEG 
EMGGSLEPRK/LKAAVSHCTPVWVT 


7043 


20944 


A 


7101 


539 


336 


FETESRSFTEAGVQWRDLGSLQPPPPGF 
KE/DSPALASRVTGTTGAHHHAGLIFVF 
LVRAGVSPHWPGWS 


7044 


20945 


A 


7102 


408 


287 


WLILFFI FCRD/ RGLTMLPTLVSNSWAQ 
AILLPQPPKVLGL 


7045 


20946 


A 


7103 


1074 


705 


S FLRWS FTLVAQAGVRWRGLGSLQPLPP 
WTFKQFSCLSLLSSWDXyGRPPPRLANYF 
F\VILVBTGFHRYWPRMVSNSQPQ/CDL 
PAVGRPNSAGI \ TGVSHCAWPKMFLNNC 
TCTHSPYLQGIVSLG 


7046 


20947 


A 


7104 


227 


55 


FSQHKINIQFYI SLS PTHI TMS FFLCRD 
/RGLAMLHRL/VLNSWPHVILPPWPPKV 
LRL 


7047 


20948 


A 


7105 


479 


314 


PRLLLIFFFIERGL\LCSPGWSQTFGLQ 
ESSCFGLPKSWGPRKEPQPCLALKWTLQ 


7048 


20949 


A 


7106 


37 


143 


NSFF/CKRRGLTMLSRLVSNSWPQVILT 
PQPPKLLEL 


7049 


20950 


A 


7107 


441 


106 


GAPSPASLPPCSLISEKXASNQRDSVGV 
GPSBPGVGYSLWRRFTjSRSEKRNIRVG 
VTRFSRCV/LSPLSLTRKGNSLTPCASQ 
VRQCIALLRTJVHGAH1HWPAPTVWHSLV 
R 


7050 


20951 


A 


7108 


413 


207 


CCIQAGPAFFGGPPPPRWGWGGVPPPGY 
/MPS PP VFRVGLLVFRGPPPNWFLPMGL 
QPDGPGLGDPKKFAK 


7051 


20952 


A 


7109 


92 


409 


PHLPCGAAGQRCPPPGRGGGCVRAGPQL 
PPGRGGWPGGGLPPPPSRTGWLPGGGAP 
HFSDGAVARQRVS S LLRRGGRAETLLTS 
QTGSRPGRGTPHISDDGRPGRD 


7052 


20953 


A 


7110 


408 


13 


PPWGPPGGGPPEARGSNPPGPPWGTPLS 
PKKPKNYSGQGGGPLNPPPPEG/SGPGI 
PFSPEGKGSLNPGGQKGKIGAPSPPPGG 
PKKKPPSQKKKKGKEKKEKVIKGESSQI 
LLFKVSKILSVNFLHLSSLI 


7053 


20954 


A 


7111 


290 


175 


LKKNCR/ DRSHTMLPRLVLNSWPQTILP 
SQPPKVLGLQI 


7054 


20955 


A 


7112 


1136 


552 


I LFKEMGCEAGQRCSCPVI P ITLGRPRR 
ADHLRCGVRDQPGPTWLKPRLYSNTKIS 
WAWWRVPA\ IPA\ IRRPEATRFT FNPRR 
WRLPLNLRSHPLHSHPGRQSKIPIKKER 
KKGQDRKGKGCGQNWGQKQKRRKEGKKE 
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sequence 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
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Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














MGMGRIGGKNKKEGRKERKEGRKEGNKW 
VWAELGAFPEQGKGLRRGPSEEAVGVSP 


7055 


20956 


A 


7113 


570 


199 


LCLLKQAWLMAGAPPPASLPPCGLISDC 
CASNQQDSVGVGPSEPGWYSLWRRFL 
SRSDKRNIRVGVTRFSRCV/LSPIiSLTR 
KENSLTPCASQVRQCLALLRLVHGAHTH 
WPAPTVWHSLMR 




9fiQ<7 

/•My j 1 




71 Id 




984 


RTRG IGWTANRRMKATQPGLTFFHLIHK 
QBGGNVGSQKGQRD\VTNSAFHWRLHDQ 
TANCLS 


7057 


20958 


A 


7115 


474 


224 


DSNFEKS STLGKMLSNS IKCYRE I FCKR 
KSQLMQQTSMlA FKKLPQSPQPSATTTIi 
ISQLPPTSRQNPPPAK\RLQLAEGSDGC 


7058 


20959 


A 


7116 


377 " 


111 


FXXNFXSFLFFSLSLTRKNWVAHITFFF 
FFFLPTPPFFYFIFLLIFCFFFFIiFYIF 
FFFFFFSFFFFFFFFFFFFFFFFFFFFF 
FFFFL 


7059 


20960 


A 


7117 


417 


148 


VPPCLANF \NF FVETGS PYVAQAGLELL 
GSSHPPTLASQSSQITRVSHCSLLYFLM 
HSNTSISFFSFWALVSFLEMPSSCQDGI 
TVLAR 


7060 


20961 


A 


7118 


2 


196 


RVPPPCPA\NFVFFVQTAFCHVAQAGPK 
IiLASSNPPASASQSARIAVLSYCPLPSL 
RLRSHLDFF 


7061 


20962 


A 


7119 


2 


417 


EAGGS LE PRNLRIiHCA/ LVAPMNS RCIP 
AWAT 


7062 


20963 


A 


7120 


801 


450 


GPKFLGPPGVRIN/RRPGGGGLPRVYKK 
HNYPEDRPIVLPAFFSGGGR/GPGPHDK 
HPLPRKKQKNSPGRGGEPPFFPPPGFPP 
GGERKNPLTPQRKRVHWGQGLAPPFPPS 
LSAKPKLF 








7191 


*i 
j 


1QQ 


DAWG FALWQAGVQCCDLS SLQPSP/RL 
ANFVFIiAETGFLHVGOAGLKLLTSOILP 
ALTSQRLRLQ 








719? 


*rli / 


164 

Ivrr 


NVAGLTGAYLDS F FGRDRV / LTKLLRL V 
LNSWAQAILPPWPPKVMGLQACTIEVSS 
LVAQIASKRLSGVQESDEAAWIRDC 


7065 


20966 


A 


7123 


33 


192 


RWDFTMLPRLVLGSKRSAHLGLPKCWDY 
KHEP /PCCPGWS WAQRDLPTLASQSAGI 
TSMSHLHQAIPLKFLEQFIRFSILVF 


7066 


20967 


A 


7124 


3 


399 


S I PVD WPS RAPT PRGRESAAHQGN/YPV 
GQRN\RMAGLSPRSFCWREVSFSR/SHS 
C/ICWVQLGKSAVLPQQSRGPGAHEGSR 
RRGGSSPPCPPLQTQLVFSPQELSMSAP 
IHRLSVTLQGDCISTGGAPFRCRLA 


7067 


20968 


A 


7125 


1901 


1574 


FFEMESCSVAQAGVQWHDRGSLQPLPPG 
FKQFSCLSLPSSWDYRHLPPHRANFFVF 
LVETGLHHVGQ\SGLELLTLGD/LLPAS 
ASQSAGITGVSHHAWAKNPCFLTSIDT 


/wo 




A 


7126 


408 


234 


LPSSWDYRSPPPQANFYIFSRD/MGFTM 
LARLLTSSDPPALASQSARIAGVSHHAQ 
PVYF 


7069 


20970 


A 


7127 


88 


111 


KLSTEYKKQRPEDSFFLFL/VBIGFRYV 
AQAGLEIiLS S S DP PTLAS QSAG ITGVSH 
HIQPPFKK 


7070 


20971 


A 


7128 


3 


140 


QRWWCVPATRKAEAGESLETESSRLQGA 
MITP\CSPAWATEQDLAS 
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ing to first 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
KHLysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginfne, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Uoknown, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


7071 


20972 


A 


7129 


1 


212 


PLLFVSPPLLSVSPPLLPFP/EPAASPP 
PSTHSLLPSVQSESLGQFLHPPDPPPLA 
PPAPLKHQDLHISHL 


7072 


20973 


A 


7130 


361 


3 


RNNISQDIAVIGNDSSNRSGQSKLKIFW 
KGFTIIiHAIKDFCDS/WBEVKISRIiPGV 
WKKLIPTLMDDF/EGFKTSVEEVTADMV 
ETETELERLELEVKPEYGTGIiLQSRDKT 
LTDEKLLLMD 


7073 


20974 


A 


7131 


421 


201 


NFCILVD/TGFHHVGQVSLELLTPSNPP 
ASASQSSGITGLSDHSQRKSELVLSFNF 
FALFH I LLCVMFLAIDRND 


7074 


20975 


A 


7132 


221 


529 


LIKFTIHSFYFIYLFIFFSGDGCFALSP 
RLESSGVTSAHCNIRLPGSSDSPDSASS 
VAGII/CMRHHTQLILY/CLVET/BFHH 
VGQGGVDLLTSGDP PASASQS V 


7075 


20976 


A 


7133 


1 


227 


KGIHFSSANDVKKKTVLTWLNSQDPQLF 
RDGLDGWYHRLQKCL\ELDEVYVEK 


7076 


20977 


A 


7134 


1242 


300 


DFVLPDGRVI P \KGIVCI*INI IG\THYN 
PN\LWPTPLBVYDPFRFNP\ENIKERSP 
LAFIPFSAGPRNCIGQAFAMAEMKWLA 
IiTLLHFRILPTHTEPRRKPELILRAEGG 
LWLRVEPLAGTSRGNPRASSRPSLSDPW 
VLRRPLPPPI FLFUULLTLPLIiFSVMAR 
EPPSPLRVEAPGPP©1RTPPAIEATPKG 
TLQPAGGRLRFLSGCVPILHQVAGHRYG 
KDKMGILQHPDGTVLKQLQPPPRGPREL 
EFYWMVYAADCTDGVLLBLRQYLPKYYG 
IWPPPAAPNDLYLKLENVTHKFNKPCIM 
DVKIGQKKL 


7077 


20978 


A 


7135 


I 


259 


GTS CI LASMLTRHRLRFS F/ CLFETEFC 
SVAQSGVQWYDHSSLQPQPPGLKLFSPV 
NFPSSWDYRHMPPCLASASILKLLFTID 
LGFYS 


7078 


20979 


A 


7136 


1132 


900 


CSGVI SAHCKPRLPGSRHS PASASRVAG 
TTGTRHHARLNFCI FFLVEMGFHRVS QD 
GLHLL/NLWCPPRPPKVLELHA 


7079 


20980 


A 


7137 


1 


501 


FFLRWSLTVTRLECSDVISAHCNLHIjPG 
SSDSPASAFREAGTT/ GHDAQLIFVFLV 
ETG/FHHWKDGVDIjLTS 


7080 


20981 


A 


7138 


2 


229 


WHSHGLLWSQPPL \ YLSLSSS WD YSHVS 
PCPAGLCLFLVETGSPYVAQAGLSLRGS 
. ASQSTGIIGMSQPAGFWKQL 


/Uoi 


20982 


A 


7139 


255 


58 


IFGKGFPPFFCQNQGFGVPP PPRAFFFF 
FF/CPAKZYTMIiPRLWSNSWAQAIHPP 
WPLKVLGLQA 


7082 


20983 


A 


7140 


422 


274 


TMLPRLVSNS \GLKQSSRLSLPKWWDYR 
YKPPCLVYTFFINRPPEVPKE 


AJ83 


20984 


A 


7141 


417 


60 


VWRFLEELETELPFDPAIPLLGIHLKEH 
KLLYHKDTC\WNQPKWPSMVDWIKEMWY 
INTMEYYTAIKKINAIFRDPLTSPWDPF 
SWRKFCLPLEIWPASYFLPGIKTRSLDE 
PLHIKGER 


7084 


20985 


A 


7142 


391 


70 


SPPPHPPRKGDYRGGPP/RPGKKREI FF 
V er t Jj V U I\xb CHVAQAGLKLLAS SN /LP 
TSASQSAGITSVRHYIQPTLLKI IPQFL 
DILLFLKI FHS FLKSFNFNLGS FY 


7085 


20986 


A 


7143 


405 f 


304 


FCRDR/SLLMLPRLDLNSWAQGILLPWP 
PKVLGL 
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Predict- 
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nucle- 
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peptide 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G1utamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
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N~Asparagine, P^= Pro line, 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7086 


20987 


A 


7144 


3 


159 


HASEKIWYMYTMBHYA\SQKKMKIMSFV 
ATWMRLEVNILSKLMKNBMMSVNGR 


7087 


20988 


A 


7145 


343 


115 


CEFYFLSCIYVFIYFCR/DKGLSLLLRH 
VSNSWPQVILLPWPPKVSHQAQPEFYFR 
NKQTKIDLSFEETAADLAYYS 


/Ufio 


onnon 


a 


/ 14o 




HQ 


RCW KGCG LW I S LRTT /WREWQLVQPFWK 
TVWRCLRKLKVEPPYEPATPLPGIHLKK 
TKAVAQRDMCTPLFIVAFQHHLLKRLFF 
SRCVFLAPLLKIA 


ivjoy 


OAQOA 

zuvyu 




/I4/ 




IV J 


TIEWGEVTS /SSYSSTMANNEGIiFSIiVA 
RKLSRPL 


/uvu 


TfiOQ 1 




/ t4o 




97 


FANI FSHSAETLQTRREWGP I FNIL / EK 
NFQPRZSHPAKLSFISEGEIKSFPDKQM 
LRNFITTRPALQEFLKKEFMSFPGTWMK 
LETIIIS 


7091 


20992 


A 


7149 


1 


161 


RTRG FF YLDGVRS LAML P / RL VLNS W PQ 
AILLPWXjPKVLGSQTLAMV7AAYTVFV 


7092 


20993 


A 


7150 


1 


219 


DYRCTPPHPAHF/ S IFIFCRQR/ SLTMF 
LSLVLNSWTQAILLPWPPKLLGLQACTA 
MPIPGVELLWVSGCFVKES 


7093 


20994 


A 


7151 


177 


34 


ELVSFIoSFSLSLSFFPSFLPSFLPFLSF 
LPSFNPS F\ LPS FLPSFL 


7094 


20995 


A 


7152 


398 


40 


MNDRVLIiFHPGVfNAVAQSWFTGASN\FG 
LKQSSHLSLLCSWNYRHTPQCPASLLAY 
VCMYVCMYVCMYVCRD /RGLTMLPWLVS 
NSRPQAILPPQPSKIMGPLLGAVAHAYN 
TSPKGSQRR 










415 


17Q 


RNSCTOMRNEALPNSGYFPKLKYYYDF / 
CQRQDLAILPRLVYNSWVQAILLPWPPK 
VLGLQHEPPCLAVIQFLKGNLKIV 


7096 


20997 


A 


7154 


767 


537 


QALLCLPGWS \AVAQSWLTAASNSSDPP 
ACLPSS WDYRHVPPRQAEWS FRVS VSS P 
VKRDPGS PWRRMRPCSE 


7097 


20998 


A 


7155 


421 


301 


HAQLFFCRHW/SFTMLPWLVSNSWAQAI 
LLPWPPKVLGLQV 


7098 


20999 


A 


7156 


410 


269 


AGGPQTGS FFFF / CYERGVSLLPRLVSD 
TWSHVILQPQHPKVLGLQA 






j± 


71^7 


-a 


917 


MQIKA/ TMLYHFTPTNMTVLGFKKNSRG 
WQARGE IGTLVHCWWGCIMMQPIiWETVW 
HFLKRLNKLGVVVHACSPNYSGG 


7 inn 

/ luU 


X IWI 




71 


*tJJ 


211 


LFLVEMG \ FAVLASLVLTLB \ PRDSPTS 
ASQRARMTGMSHHTQ PLFCLFVS KFHLA 
KKLSLIWQNLRRAGI CRLS 




91009 


A 


71 

/ 1J7 




220 


SSSASQSGGITGVSHHARPSERQ\ SCSV 
AQAWQWCNYSSLITPTPGLKRS SRLSL 
LSSWDY\GALPHPSVAS 


7107 


91001 




7160 

/ 1UU 


1 

1 


17? 


FRKVLGRTGSNGLCTQVRVE FMDDTSRS 
IMRNVK\SPVREGDVLTLLELELETLRL 
R 


7103 


21004 


A 


7161 


1 


180 


DAAPVLKATARPTRPDRWK/ IDIDSLDI 
HGETP S LQKNTNTFRAWWC I P WPATRE 
ADMF 


7104 


21005 


A 


7162 


3 


108 


KKKKTGRFKLV\MQYNRRFVNVVPTFGK 
KKGPNANS 


7105 


21006 


A 


7163 


409 


110 


GQHGLTQSLPLSAPPRGAVPRWAPRPD/ 
SGLSLS PLLPTAFATTGASSANRFVS IG 
PRDGNFLNI PQQSQS WFL 


7106 


21007 


A 


7164 


35 


371 


YFHHLSLI^KTLLASMMVPLTVVVRIiIiE 
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W=Tryptophao, Y»Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LLFFFLGETEPAFVAQPGR \QGPNLGSL 
RPHFPSVKHFPGLTPPRAGTSGLDPPSP 
AIFGFFKKKRGSPCGPGRAWTPGPRGGP 
P 


7107 


21008 


A 


7165 


3 


382 


YTGPAWWSMIiWLATEEAEVGGSLEPRS 
SKLQ\QPPLRHCTPAWAT 


7108 


21009 


A 


7166 


1 


305 


TAVTVDPYHSVYIEVNARPGACMNGGGL 
LNGEQNSLKRKQLVQTLNKRHGEELAMV 
DSLTLGPKLEEA/ASPPAFARVRLRARQ 
ACRRKGSRLCWEHLVSKV 


7109 


21010 


A 


7167 


383 


55 


RDLSMVRMKSMLAIALCFTALMGMFNS I 
FYGRWAKLPFPPLSYIQGLSHRNMMGD 
DTTDCSFIFLYIFCTMSIR\QNIQKIVG 
LAPSRAATKQAGGFLGPPPPCGKFS 


7110 


21011 


A 


7168 


465 


166 


SPQGAQIKGVGFQKLRPWGP\PGFPRGK 
PFFWFKKKLPPQGGGPIiCSRFSGGLSQK 

N\SGPWGEI fpwgqngppps ppggketf 

FSKKKKKKKAVDYLNS 


7111 


21012 


A 


7169 


2 


323 


IQGFKTLSEEVSADEVEIVRELGLQAET 
EDVTELLHFNSTACCREI FHERKRQQMQ 
QTSLVSHF\RKLLQS PQSSAIATINQSQ 
PSTLRQDPMPATRLHLTGGLDNH 


7112 


21013 


A 


7170 


3 


509 


FGTIPLVLCADLNSLPDSGWEYLSTGG 
VETNHKDFKELRYNESLTNFSCHGKNGT 
TNGRITHGFKLQSAYESGLMPYTNYTFD 
FKGIIDYIFYSKPQLNTLGILGPLDHHW 
LVENNISGCPHPLI PSDHFSLFAQLEL/ 
YTAFPAPSQRHPPSWQEWKHLQRTALI 
HL 


7113 


21014 


A 


7171 


1121 


320 


SSCFQTSSHACFVTEFVPGGDPMMQIHE 
DVFPE PQARFYVACVVLGLQFLHEKKI I 
YRDLKLDNMjLDAQGFLKIADFGLCKEG 
IGFGDRTSTFOGTPEFLAPEVLTQEAYT 
RAVDWWGLGVLLYEMLVGECPFPGDTEE 
EVFDCIRLHGRPPTPAFLSVQG\VKFIQ 
KLLQKCPEKPLGAGDQHAEEIKVQPFFR 
PTNWQALLARTIQPPFVPTLCGPADLRY 
FEGEFHRAA\ PALTPPAPHSIiLTARQQA 
AFRDFD FVS ERFLE P 


7114 


21015 


A 


7172 


355 


3 


GPLLSMLGMNILFLTLFLLLPLSMLEGG 
VCGNF/ITNCCLBIDDNRKVIEDITAKI 
QKLAHVPVQIWKGWSPDTLFGGWFSSLG 
RFKTLVGlVIiVTLGVCLTLPCLLPLLVK 
NIQSANV 


7115 


21016 


A 


7173 


184 


2 


EKKVEDDDDDDDFEMESRSVI PAGVQWR 
NL\LHPLPSGFKPPSCLSLPRSWEHRCA 
PLHQCI 


7116 


21017 


A 


7174 


1190 


407 


SLKRRPSARGSRPMSMYETGSGQKPYLP 
MGEASRPEESRMRLQPFPAHIGRSALVT 
SSSSLPS FPSTLSWSRDESARRASRIiEK 
QNSTPESDYDNTPNDMEPDGMGSSRKGR 
QRSMVWPGDGLVPDTAEPHVAPSPTLPS 
TEDVIRKTEQITKNIQELLRAAQENKHD 
S YI PCSERIHVAVTEMAALFPKKPKSDM 
VRTSLRLLTSSAYRIiQSECKKTLPGDPG 
S PTDVQL VTQQ\VI P VCVRPFAKAAKQL 
VTITTKENNN 


7117 


21018 


A 


7175 


3 


136 


AIHLASLKVFYRYRVLSL/ LKRLVSNSW 
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PQ VI LLPWPPKVLGLQV 


7118 


21019 


A 


7176 


3 


316 


GIVKFFFFFFLTKCRTVTQAGVEWCSLG 
SLQLPPPRFKQFTCIiSPPVSWDYRHLPI 
CIANFCI F\ VETGFHHVGQMGLELLTSG 
YPPTSAPPDAKITGVTHRTQP 


71 10 
lily 






7177 
/ 1 / / 


S7Q 


917 


LLPRIiEYSGMVIAHCGLQLLGSSQAILG 
ACYCAW/HNFKAFFVETKSHYVTQAGLK 
LLASSNPPALASQSVGITGVSHCAQATL 
PFKTVDKGPSFNLFSFTPPLL 


7120 


21021 


A 


7178 


3 


150 


YTLCKSKKDIKSIYHIMT/HLKTHTHTH 
THTHTHTHKPSTTFSYTETFIM 


/ izl 






71 *7Q 

/l Jy 




Oy 


A f*R d A HVO AI/T YVEDGTAOGPTW 

riV *|T.VT-^*- r l V \£ V VJ1-/VJ»3X\ I * A X V i-AJ A. iU->VJ r 1 LI 

LSEjDHLCLPALAFIIFSTCTDSGERRICS 
HHGVTAWSLGWREKLTRGDRPLLTSPGP 

ALPIG\ csnegkci copdwtgkdcs ihn 

PIiPTSPPTGETERYKGPSGTNIIIDSIA 
GAVLVAAIVLDETGWGYK 


7122 


21023 


A 


7180 


452 


279 


CIDSIYQEIYHLSISILSVYLSIYRSIY 
LSTYIjPI FW\ SLYLGPRFLRTRDPOLLS 
LT 


7123 


21024 


A 


7181 


501 


183 


HVRCIAFICALIGFPSGYFSIMPWT/V 
DLVGIEHLPNAYGI I ICVNGISALLGPP 
FAGWI YD ITQKYDFS FYICGLL YMIG I L 
FLLIQPCIRI IBQSRRKYMDGAHV 


7124 


21025 


A 


7182 


446 


245 


liRTOSLALNQVSSASRYWDTGVATKLAT 
F/SLFWKDRNLDILPSMVFNSWPQVIFP 
SWPPKVLGLQM 


7125 


21026 


A 


7183 


420 


73 


DSSDDSGQSKLKP/ FWKGFTILDAIKNI 
CDSLEDFKI SLTGAWKNLI P PLMDDYEG 
FKTRVEEVTADWE IARELELEVEHGLG 
AV7AHAYNPNTLRLRSLKPRSLRPGQHSK 
TLSL 


7126 


21027 


A 


7184 


567 


174 


FFI4RDRVPSLTQAGVQWQDHSSLQPRPP 
RLKQSSH\ PALLSS WD\ YRHRPLCLAKF 
LI FliQRRGIiAI / FARGWSNS WSQ/CNL 
PASVSPSAGITGESHYSQPFLLSDSQTK 
VYKTHRSTISSTITEMGCERKD 


7127 


21028 


A 


7185 


345 


59 


RNGSSCXWQVIQSLTUC.PRLECSGVITA 
QCSXjGI,VGSSNPPTSASC\VESHCFAiQD 
GLELMGSSNPPTIAGITO^SHCAQPLYD 
LFLYEYGLSVHY 


7128 


51029 


A 


71 86 

/ ft ou 


I 


209 


IiSWLBIGSHSVTQAGVQWVHSSSIiQP* 
MLGLKPS SRFSLPKCWDYRHEPWCLPGL 
LRLKFD PSVGGVA 


7129 


21030 


A 


7187 


3 


225 


EKTPVSDRGAKGCTESLVNRRPCFSALE 
VDETYVP*EFNABTFTFHADICTIiSDKV 
RHFLKPIALVIIiNHHIF 


7130 


21031 


A 


7188 


1 


377 


ES * RNHNGDEDDSHVRAQGACIEAKHDK 
YMADMDELFSQAD17NRKKRDI PDYLRGK 
ISFELMRQPCITPSGITYDRKDIEEHLH 
RVGHFDPVTR5 PLTHEQLI PNLAMKEVI 
DAFISENGCVEDY 


7131 


21032 


A 


7189 


1 


359 


SRATDHVQRQGHG*GIHNYSDWENTDWF 
RETKNQSVPLSCCIETVSNGNGSLAHPY 
DL YAEGCEALVGKKLIiEFMMHVI WAELA 
FAAIQLLSMbSACIVMCRKSTDPAYELL 
ITVGTYA 


7132 


21033 


A 


7190 


496 


118 


QNQKFP FE PGTS * LGLYPKKYKLI YHKD 
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PKTCMFTAALLTIEKPWNHHKHLSMADW 
IKKVWYGTVGHACNQSTFLFWFFFBMES 
HAVAQGGVQWLNLCSLQ APAWATERDS S 
QKRPKIDPHKYIVN 


7133 


21034 


A 


7192 


1337 


342 


ISEATRTFRBS IWQFQSSPREDDRNRQL 
SASGSRQSHPGPRSTPGPGQGHRSSHQG 
CAPVSS AARPGGPS FRLSEGALSRSQFS 
VLVYI P * VAAGQFVKACLGASSPPSRL* 
GGLGTGHRTPGKEARSWPETSAPPAWHL 
LRSLCLPSRGEKLAQARSQKHLGWKWLS 
KLCP KTKYS WSGPDDS PS VLRD PRAGTT 
RWAPSGNPEEEWLQLKAKYLQNISEVL 
SVLHLLHIKALQLHSTNPFFVNARHLLV 
GQISSALTLNIfLSQriiFLIiLRFPFLSQV 
ERHSPLQKTGTTETNLVLFQDTVPMNFP 
SSI LLPS KYSCPALDKEASHFYQ 


7134 


21035 


A 


7193 


469 


130 


SASWDYRRPPPRPSNFLY**RRGFTILA 
RMVSVS*PRDPPASASQSAGCAYSIYLQ 
ERFARLHPACLAPRYMASDPLSSLHHSV 
PPGLHXiEHEHWLWHCLLPGO I CRPGKT.W 

£ J* ^-J +J m A all i41UJlAM4JfiilV mill \J\£ JL V^AVXT VJ XVaJ t¥ 

Y 


7135 


21036 


A 


7194 




Its 


TVWOFLKQLNIELSCDPVILLLGICPRE 
LKTYVHTKTRT * AFIVALCTI VPKQKQL 
KYPS VDEWVI KMWYVHTMDYFS VT KKNK 
LAKHRGSHL 


7136 


21037 


A 


7195 


257 


412 


TATSNKEPLKKVRCLIWRKKDCWARGFT 
ACNPSTIjGC*GRWIMRSRDRDHPG 


7137 


21038 


A 


7196 


1 


124 


PLI SGWHPCNPSTLGGRGWRIA* VQE F 
ETSLGELAEPCLY 


7138 


21039 


A 


7197 


2 


120 


YG 1 1 L F ITSEVFVFAGFF * AFYHSSLGK 
KISKKESVIIY 


7139 


21040 


A 


7198 


234 


416 


GVHAEVLISPGPQDRLRALWIPIIFGI 
LFAILLVLVFI KKVAKKPTNKVGHP * EP 
GTEF 


7140 


21041 


A 


7199 


372 


3 


VHYP VPQTGS P CCVPHPRGLS T IHPGLH 
SRYTGLGMVAHACNCSTLGGQSGKIA*A 
LEME I S LGNMVRPCL YERTNKQS KPRHM 
GITFGLTFSLTLTDKYSLI ISGFYLPQV 
LISLHPLGPY 


7141 


21042 


A 


7200 


396 


21 


GLKESHKCCAI PACLQRDPLVRVLGKAG 
F*RKSLSISTBLQSLFDSPDFSKITGKP 
I KLTQVEHRAG FEWNEDGAGTTPS PGLQ 
PAHLTFPLDYHLNQPFIFVLRDSDTGAL 
LFIGKILDPRGP 


7142 


21043 


A 


7201 


2 


148 


REPPHHSLANF*KFFVDKGFTLLPRLVS 
SSWPQWLPPRPPRELGLQA 


7143 


21044 


A 


7202 


1 


291 


AEGFRMHALVSQAACGTSGYKLSVGYMV 
QGPLTHLMLGIiLISLIFLSRGSGRTWAF 
SHS * YKTS DLLPCRNRREVI E FLHYRNL 
HSHICLSVTKTFL 


7144 


21045 


A 


7203 


262 


406 


ILARMTSVVNSIKHLILFLK*SSHLSIP 
S S WNYRCVLPHQANFLFLFL* R * S FTLM 
PRLVETVLLIQNSPTKIDSSPDDFSGKF 
YQTLNI VFENIRRGNTSNLLHKNSRIL I 
PNLTKTLHD 


7145 


21046 


A 


7204 


2 


247 


OTVLERSGQGTAALPRSTSLDRDWAITF 
EQILASLLTEPALVKYFDKPEayiMARIT 
SAKNIRMMGKPLT*ASDYEISAMTG 
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K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *==Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


7146 


21047 


A 


7205 


2 


168 


FFFLVETGFTVLARMVSVS *PRGLPASA 
SQSAGIAGVSHRAQPALRFLMLENRMC 


7147 


21048 


A 


7206 


1 


237 


GKCAHTHTHTHTRDTHRHAHAHTHACTH 
THTYVQGCSLSQWSQDCVRMWACHI FGV 
PRE * ASRYTSSLVYKPGCTGVCG 


7148 


21049 


A 


7207 


2 


239 


KVDSEHKLERSSGGVLGRFKGKGSGGLN 
LGNFFASRKGYSRTGFDRLNTEGSDQDK 
EDDGSESE* EYS * PMPALE PCSS 


7149 


21050 


A 


7208 


1 


405 


GSRDNMNVI L I CFPNAPKVS AESVKNDA 
QLDKYLECTAQE 1 1 +NHWEDVPDLFHVM 
RTLPSDNMPSLPPGGELASMRTVTEAVY 
YRLNPYINDDTDSTSTDDMWFNCSSSHG 
VYLDLHRRVQVNFDETFSMHPEIi 


7150 


21051 


A 


7209 


2 


761 


QDFGTRFATVLLFYGYFKCTRTIiKGTCL 
YNAI L YKVCS PRNDQPDVC YNPS FFLLP 
IKTGELU3FLVYASREKRS iaigd*kdd 
KWPPERIIQYYGPAT*AQDSSWGYRIPI 
YMLN* I IRLQAVLEI ITOKTGRALTVLA 
LQEIQMRNAIYKSRLALDYLLAAEGGIC 
GKFNMTNCXLHIDDQGQIVEDI IKITKL 
AHVPMQIPDIVWKNIVKIPVLFCSVLIT 
GTCSPQSGTPCWLSRSRPSHVDPFRWS 
P 


7151 


21052 


A 


7210 


147 


390 


KTGLSDQMPRAIQLSYSYI*SPLLY**SY 
WGGE KKSGPGAGLKNPGGGKNT FY 














7152 


21053 


A 


7211 


433 


214 


WGRGNALTIHPPWVPFQESETTPSPr J *G 
ITLLDNPFSPIPKQPPPPH*SITVPYLL 
SSPQPGFPHPFQAPGLA 


7153 


21054 


A 


7212 


532 


253 


KWINVFHKAAIMSDFSDRSRHSTLKAS 
E*GFTOIDAIKNICDL*KEAKISILTGI 
* KKS I PTLMPDF * GFKTPML * VTANVGF 
QRFSKKFYW 


7154 


21055 


A 


7213 


456. 


24 


DAQAFSWYSQGYFNSSQGFSISLPGDTS 
PVKIKRMGGPLFFS *LLKSTCWACSKTS 
RVPGKVLKTGLPRPR+PVYKDPLGGFYP 
RF 


7155 


21056 


A 


7214 


2 


445 


ILRDLCQRVPTWSGFPSWAMELLVEKAI 
SSASSPQSPGDALRTVFECISSGI ILKG 
SPGLLDPCEKDPFDTLATMTDQHREDIT 
SSAQFALRLLAFRQIHKVLGMDPLPQMI 
QRSN IHNNRKRRRDS DGVDGFEAEGK* D 
*KDYDTF 


7156 


21057 


A 


7215 


203 


3 


VRPGVYAHACNPSTIiGG*GGRIA*SQEF 
ETNIX3NTVRPCRNSGEVLNRKTVSIRRA 
LRVFSPIiHTA 


7157 


21058 


A 


7216 


1 


285 


GKCAHTHTHTliTHTHTHTHMHTRTHAHT 
HTHMCRGAV*VRGARTVCTCGSAPFLES 
PGKEPKDTLRPLFVSQAARAFLGEKAFC 
SVYRQREPPPI 


7158 


21059 


A 


7217 


1 


386 


GDSNCEWSVLYVI IATFGIGVALGI LSW 
T\^CMLKRQNGKPKMKSKYKNIiDATDQE 
SL*LKPASRAGIKQKGLLLSSSLMHSES 
KLDSDDAI FTWPDREKGKTjLHGQNGS VL 
NGQTPLKATSPREEIL 


7159 


21060 


A 


7218 


70 


379 


NILYFNIFFFFK*FLKFLWVFFFFFFF* 
VFFFYNFFFFVFFLCVFFFFFVKLKI FF 
WGPFFLFFRFFFCLGFFGTFQSWRWFFA 



991 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etfa 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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LFPWGLMFLFPLLFFLVFF* LVFF 


7160 


21061 


A 


7219 


1 


208 


PRLAAS VS PGDMLGMQ VLGLGTVAS ACN 
CSTLGGRGMGI T *G * EFETSLVKKARTC 
FYKRLKRQKSANL 


7161 


21062 


A 


7220 


455 


96 


HYLYGSQGLSRF Y FVTQGRVQWHDHGS I* 
O PPT PGVKOSS S Ii *3 T.P KCWD Y KCP P PPT . 
ARPCPFFFFL*TSSVDTGLLLNQQPIL* 
IMTFPYPLLRKSLS FHHQI YE PTS ICSH 
LLSFCYT 


7162 


21061 


A 


7991 






QEFKTSLGHLG 


7163 


21064 


A 


7222 


307 


3 


NRRQAGPFPAKLGRPSPKGGFPNFFKLF 
FFKSS F*KNPKGOGWGFPPLI PGFOGPO 
VGGSLGAPGLKPPWGTPQNPFSKKKKKK 
RSNCTDTGTTIAHRRQM 


7164 


21065 


A 


7223 


331 


141 


SETITANVSPNL*DANDVPIQCEISPLI 
S YAGEGLERYVADKEFHAPVI IDENGVH 
EWKNGI 


7165 


21066 


A 


7224 


157 


314 


WLPGFRDGGEDGVCHVAQAGPKLRT*GD 

P Q T TCA55HP ARTTCST.TUPTViPl? * VT? 


7166 


21067 


A 


7225 


2 


392 


RDVRSLQPLPPSLK+ FSCLSLLSSWDYR 

LTSGDPPASASQSAGVTGVSHRTRPPSF 
VLVNPIPPI,LPPVPSPDGPNISSSTTSW 
RGYPEVLRWGLCSFWGCG 


7167 


— 1UUO 


A 




Of 




PQRPKVLGLHV 


7168 


21069 


A 


7227 


3 


307 


DRAAPRGAGRAGAVGPHRTGDWGLAGTP 
AGRGHARH IGRG P KQGRAAGQRGRLHPG 
GLLGRGGNSEP * RKQRTPGSMRRAAPLL 
STGQVWEGPASVGDRVP 


71.69 


21070 


A 


7228 


52 


1176 


GSQQPLLHISEAQGDTLIKKAPAEAAWP 
PLI AFSSGGRLPCSHES * SAAGYKRLDD 
TAAGGPPS PGGS PRS PATAPGLHP VWGG 
QLRPRGGRGLPATGSGEAFQALTQLSRG 
RACTGPPAPGSEDAAPSGCSAARWSP*M 
AGSSSCAIPDSAASCICPQGPPWRW*SR 
APARNRCSASSGRSRC*QLRRRRYI PGI» 
LHPVLHEALPDSWGCETQLGPPLASEAA 
P PRSRRREEG*RGSGRC* * KPS LPPS W«5 
LCRNQAEAGFVSI LVWLSGLAGAPSRPA 
AAAPQIAPNQHRSPPPRLAPVPAASGSP 
WPWERDLGVLETLNVGNFAQDPNPSACS 
GPQSTPDTYYPVQKPKTGLRLHFHSRPN 
T I VKS VGNMCL 


7170 


21071 


A 


7229 


3 


112 


ASNTjGSRGCSEPRPCHCTPAWV*E*DSI 
SKTNKQKN 


7171 


21072 


A 


7230 


103 


299 


AMSQIRNAI PFTIAT* RL* YLEIQLTKE 
AKDIYNENYKIPLKEIRDVTNKWKNIPC 
S*IGAQLCK 


7172 




A 


7911 


o 
z 


117 


SLTLSLS I CFCL * RWGLS VLPRL I SN<5W 
PQAIFPPWPPKVLELQV 


7173 


21074 


A 


7232 


3 


273 


LTHI VDGPNFLTTMYQCLMCVC * QETjGY 
QPVTTERCHNVCNDCL0R5 FKAQVFSCH 
ACRHDLGQNYIMIPNEIMQTLLDLAFPG 
YSLGR 


7174. 


21075 


A 


7233 


338 


136 


rprwedclspgvqnvcgqhses pflqei 
+ kisqtwwckpwps yqevegelle i rg 
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K=Lysine, L=Leucine, M=Methionine, 
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STCFGLPKCWD 


717^ 


21076 


A 


7234 


437 


261 


HLGFPECWDYRHEPQHPT* FIiKHFM*RR 
GLTTFPRLVWNSWPQAIFPRWSPKVLEL 
QT 


7176 


21077 


A 


7235 


41 


264 ! 


VTVMEIKGKINLVI4DIIKDIYVEGFSTY 
YF* RQGLTMLPRLDLDSWAQAILPPQPP 
E*MGLQECNHHAQLSRYF 


7177 


21078 


A 


7236 


1 


154 


GTRDFSVKTL*ARREWHDIFTGMKBKNF 
YPKKVYALKI PFKNEAETKKVSV 


71 7R 
/ 1 /o 


71079 


A 


7237 




179 


SCLSLSSS*DCR*PPPCPANFVKSFFRN 
G^TMLPRLVSNFLPQANLLAQPLSGFI 
ILL 


7170 


A 1 vOU 


A 


77/2 c 


140 


140 


KIKWVWWPVPVTPVTPEA* AQKLP* PGK 
LKFPLT*IPPLPSRLGHQSQTSFPKKKK 
KFYILSSDSV 


7180 


21081 


A 


7239 


2 


175 


QFESKPSPAWATEQDPVSKXKKKKKKKK 
KKKMKKKNSSKFKMIPFTKRQY*EGNWK 
N 


7181 


21082 


A 


7240 


336 


192 


QSETPSQSNQKFI F*GQGI ALLPRLVLN 
SWAQAILPPWPPKMLGLQA 


7182 


21083 


A 


7241 


1 


186 


DRPGQHGEMLSLLIFCFFCRDGGLTMLP 
RLVLKS * AQVILPSRPPKALGLQDTVHC 
SKFDAV 


7183 


21084 


A 


7242 


2 


111 


VKTLNKLG I DGTWLRI IRAI F * KP IANI 
ILNMCSFL 


7184 


21085 


A 


7243 


360 


217 


LP SRGDYKRKP PCIjVNFYFL * R* GLTLL 
PRLISNFWAQMILLPQSPKC 


7185 


21086 


A 


7244 


35 


276 


VQQQRVKQGRKYYS IVLKNKKKLCNYLR 
VGE*MG*RNIGWVQ*VAHTCNPSTLGGR 
GGRI I * PWEFETSLCNIG*TCISG 


7186 


21087 


A 


7245 


1 


356 


IQHTFSLL1FGIKYSYYVLKIVSWLAAV 
AHACNLSTIJGGRGGWIT*GHELETNLFF 
AVKPWVYDKNS F P PTLNLRFAKSGHLWE 
PRGKNHGPPETPPLSRVS PPQRFFPGGT 
RPLKKE 


7187 


21088 


A 


7246 


3 


293 


CKCGKAFHNFYSFQNHEISHTGEMLYDI 
RNVGK* SIFF*HHCQHKTTHRAEKLCEC 
KICREAFSHFGNLKVHEAGRSGSRL* SQ 
HLGWANCLTSGI 


7188 


21089 


A 


7247 


6 


387 


AS I PCLTYPSWE FL * F I * QAGHFS FS KK 
KLFICAFFSNRDEVSLMLPRLGLNSWAQ 
WLLPWPPKVLGLQVRATTPAKLGIFWG 
FMSVSLGQGVCLCRSLLCPLPLGPGQVH 
SGCSQSFVEKMNKEL 


7189 


21090 


A 


7248 


373 


146 


HLGKKVLI FWPQYLALLNPPNGGVSRVY 
PRCRPLMVFFFFFFFC*GRGLAMLTRLV 
SNSWSQVI FQLQLPKVLGL 


7190 


21091 


A 


7249 


2 


184 


AILRGI *KKFISTLMDDFEGIKTSLEEV 
TADWEIAREVELEVEVEPKDVTELLQS 
HDTT 


7191 


21092 


A 


7250 


795 


544 


EMKSCSCCPGWVQWRDLGSLQPPPPRFK 
RS PAS ASLVAG ITAACHHARL I FVFL I D 
TGFHYVG*AGLKTPEA*ATAPGPIFLYF 


7192 


21093 


A 


7251 


34 


242 


KOTSLKKKFFWCQKLRNVSLVEEFAFWP 
GAVAHACNPSTLGDRGGRVA*GQEFQTS 
LSNIGRPRLYRKI 


7193 


21094 


A 


7252 


216 


57 


MIQVLKYVLTSYLKKKKCAFRLGPVAHT 
CNPGILGAQGRRIS*AQEFETSLS 
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E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
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W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7194 


21095 


A 


7253 


126 


366 


INLTECQQYLDKAIFFPFLKKGLFAP * A 
EWQGGSFASLQPKPPRLKKSSHLTLLST 
WDYGGALPCRANFGIFFFWKKGG 


7195 


21096 


A 


7254 


2 


213 


RPTWATWQNPASTKKKKNS FFFETEFRT 
VAQAEVQGGNI/3 * LQSLPPGLKRFS CLK 
MPEAPKGQSCSLIG 


7196 


21097 


A 


7255 


150 


263 


EGVS Uj PRLACNGAILAHCNLRLPTS S D 
LCAFS*TTTRYYKYSGYCFSHPNLMIjIC 
SRFTLLSSHSSLGHLTLFLLIVCFEKES 
HSCPGWRAMVLSWLTATFASRRQATFAP 
SLBLSR 


7197 


21098 


A 


7256 


386 


93 


PPGGLPPQS IPSPRAQLKPKS PRGKKRG 
GPKPGGSQPGFWGFFFGPGPPFKRGGGIi 
NWVLGAKN* KGVKKNQKKKKGKMPQETK 
RGNRS I DMKFGGS 


7198 


21099 


A 


7257 


153 


369 


KENSRLQGYGPLFSHS* FTNSCPGQVQW 
LTPVIIALWEAEVGGSKDQPGQHGKTPR 
LLKIQKFARCGGGRL 


7199 


21100 


A 


7258 


180 


3 


CSTLI * IRKVWLGAVAHAYNPNTLRGRG 
GRI A*GQVFKTSLGNNVKTCLFLPS PHN 
QQ 


7200 


21101 


A 


7259 


22 


265 


QLRAI PDLKLDPLMEGKBILSRTLLGSG 
TVAPI CDPSTLRGQGGWIA* VQEFKTSL 
GNMAKPHLYRKKKLLGQVTKLEYKW 


7201 


21102 


A 


7260 


326 


2 


KSAMQICIVDPLNAFCLBICCHANLHCR 
PSNAFCFLKVDNVRAFVSPS IPQVSLIL 
KNFSWPGWAHTCNSNTVRGQGERIA*A 
QEFETILGNTGRPCLYFLKNNRKS 


7202 


21103 


A 


7261 


1 


68 


VNLCFILWLGPGWAHACNPST*GGRGG 
RXA* P * ELETCLSNTA* PGVYAHACNPS 
T 


7203 


21104 


A 


7262 


282 


176 


GLKKTFFFLVRERVLLCYPGWSAW*SQ 
LTAN* TPGLKLFSCLSLPCSWNHRHVAL 
HVAENI C * CHMPWPATWEAEAGEQLEP 
RSSVCSEL 


7204 


21105 


A 


7263 


588 


351 


NNTVCLLES IS SVNLCLPHLQHSPVKLE 
IFLFFFSSDNFPSTHHITVRPLSPS*AI 
IFPVTCTYTSRWPEATKDPQKK 


7205 


21106 


A 


7264 


327 


2 


KIKERINNFIFICRDRVSICCPGWNAW 
TLNSWAQAIRFCLSLMSS *NYRCAPTHQ 
QVTTIFFCRDQIEIHMVPKYHLGNTLNY 
FFVLFCFVLFLRHP FSAHRNFRLP 


7206 


21107 


A 


7265 


121 


270 


AQEAGFELLTSGDLPTSASQSAEITGMS 
HRALPSCY* FYVTTiHGLTVACA 


7207 


21108 


A 


7266 


3 


227 


SSQVSLPNNWDYRREPPCLRFVFIiFFVE 
TRSCYVAShELLGSGDVTVLAS *SARVI 
DMGHCDWPNLSEILLHVR 


7208 


21109 


A 


7267 


300 


1 


KMLLNTIMCNREILCKRKSPRVQQrrSFV 
SYFKKLPQPPQPSATTTLISQQPSTSKQ 
DPPA7UC*L*FAEGRLIAFPSNNVLYQLR 
YIMCFLPHTGIVHLVN 


7209 


21110 


A 


7268 


29 


344 


ASLLALKKKSGLGAVAHAHNASTSGGRG 
GRI T* AQEFATSLGNMARTCLYKKKKKK 
KKKGKNFTGVGGPQRGAPI FGPWGGGKS 
PAPGKPGGPNPFPGKGAPTLF 


7210 


21111 


A 


7269 


3 


209 


LHPLSPGPPAPVAAGIPGPQASR*GPCR 
PDPSPDSASVDASIiPGPALSLWSWGSNR 
GEYGTTPASQIH 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Htstidine, Wsoleucine, 
K-Lysine, L=Leucine, M=Methionine, | 
N=Asparagine, P=Proline, 

(~V=f~^ li ifa m int» Rr=Aroininp S=Spt*iiiP 

Vt V> 1 if LaI II IllCf XV /VI gllllllC, w VJwl iiic, 

T=Threonine, V=VaIine, 
W=Trvntonhfln Y=Tvrosi n e. 
X-Un known, *= s Stop codon, /^possible 
nucleotide deletion, ^possible 

rinrlpnHrip incpftiftn 

UULICUIIUC llloCI Ulfll 


7211 


21112 


A 


7270 


86 


206 


CQPGWAHAYNP S TWGGRGRQ I T * GQE F 
ETSLGNTARPPL 


7212 


21113 


A 


7271 


45 


274 


GLGTVAQACDNPSTLGG*GGWIA*GQEF 
KTINNDKSNKWOYIiAT TEOTNNRMLICK 
NKKELTDICNNTDEFQKHRE 


7213 


21114 


A 


7272 


1 


122 


VLVETGYLRVGQAGIjKIiLTPGDPPAIAS 
OS AGITGI SHRA* S YMHF* IiVKTGYLRV 
GQAGLKLLTPGDP PALASQSAG I TGISH 
RA 


7214 


21115 


A 


7273 


3 


242 


LLKQLDYRPPHPAKFFFFLE FLVKKGFH 
LVVKGGFKTQTSGGLPG*ASKGVGIPGV 
SHCPGLIGVFNLTLKVAVSLMDL 




jLW lo 


±\. 




i. 


1 AO 


TjAIiVAOAGGOWHDIjGSIjOP LS PS FPOPF 
YLSLPSS *DYRHEPPHSAWFSFFSNYR 


7216 


Of f IT 

211 17 


a 
A 


777c 

TZ75 




to 


* PSKFQVIFSSLGTGHFSNACFSCVMWT 
XVLLIINFWVTGIDGRTSGSSSVKCGNH 
S I YPMVLL 


7217 


21118 


A 


7276 


6 


87 


RRGLTLLPRLGSNS WPQAI LLP WPPKVL 
KL*AQTPGLRQSSCtiGLPKC 




211 ly 


J\ 


fZ.ll 


1 
1 




FOVKMLCSGWMWTiTPVTOALGAVAHTGN 
PSTVGGRGGQ II * S * * FYTS LDSGNPPA 
SASQSAGITLA 


7219 


21120 


A 


711% 


272 


51 


I FFPVN* CGGHD VQHHFNP S *NNLSSWS 
SWFMPX TLPLWEA* AGK*LEPRSLRPAW 
ATKLDSNSKKKIIEIHA 


7220 


21121 


A 


7279 


1 


131 


SRDYRHGPLHPTDFVFFVEMRFHHVAQA 
GIiKLIjGLSDLPALTS * SRDYRHGPLHPT 
DFVFFVEMRFHHVAQAGLKLLGLSDLPA 

XJ X o 


7221 


21122 


A 


7280 


411 


36 


FLECVPNFFPPKKKFFFKKNTGQVFFPV 
TFL*KGPALFFFIFPFYGKXFFFLSSAK 
DFPPRVFF* KGPLFFFFFFFFFFFFFFF 

UK tr XT XV V XT XT iwf XU CCiVlil>Ait X V b X J» 

FFFFFFFFFFFFFFFFCFKHFIYKKVHF 
*-SSVHFQPIIFF 


7222 


21123 


A 


7282 


395 


181 


IIKHESIHHDSVISVTFNMQSLRLFFFIF 
FILSL*LFRFIFFFSFFLVFFFFFFCFF 
FFFFFFFFFFFFFFFFF 


7223 


21124 


A 


7283 


392 


2 


IKFFGYNV*FNYILKNGLF*QLLI*LIN 
LFILRQGLI FWQVC2JVLVFFFFFFIFF 
FFLFFFFFFFFFFLLFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFLTQNMLFIVN 
FCKDTFKQQNIHLRTRGR 


7224 


21125 


A 


7284 


113 


291 


YVCVCVCVCVCVCVME Y* SAI KRDKIVA 
FTVTWMELETI ILSEVSQEWBTTHH>3LS 
LIN 




JLl IZO 




79RS 
/jSOJ 




AA 
*rt 


LIKVSPNFSPPKKKVLPKKPPCTIFPPS 
PF* KTPPEPIFIKPPHKKKNFNSPPPQK 
♦GPPQGFFKSPPPFFFFFFFFFFFFFFF 
LGFSSKLLIR 


7226 


21127 


A 


7286 


3 


215 


VILIIFMI*I^AWKR1CVLIVEEPSINL 
E * LYDGGYYBGLVLTSGLGPY 


7977 




A 


7287 


367 


119 


LKKAPAPPFLNPPFVJGKNFILSPPEKMG 
PPRGFFKRGPPFFFFFFFFFFFFFFFLG 
FFFFFFFFFFLQYWNMYFH* ILI INQ 


7228 


21129 


A 


7288 


73 


259 


INKYLQN* KKXKKKKKXKKKKICKx^ 
FKKTPRGAQI S PGVICKKMFPPKGGFKKK 
QAPGPF 



995 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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Predict- 
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nucle- 
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correspon 
ding to 
last amino 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D= As par tic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, MHVIeth ionine, 
N=Asparagine, P=Proline, 
0=Glutamine. R=Areinine. S=Serine 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion V^nosstble 
nucleotide insertion 


7229 


21130 


A 


7289 


337 


53 


XiRSFAPFHLPINNLL»TDIHMVI plppss 
LCSHYLFNEVTVTTLFKTTTHIiQLGTMA 
HTCNPNTLGG*GGRSA*AQESETSPGNM 
AKPRLYLLNF 


7230 


21131 


A 


7290 


1 


96 


KHHFGFEAAA* YWHFVDW* LFLYVS I Y 
**GS 


7231 


21132 


A 


7291 


X.OO 


2 


SRVBFKGPFMAHCSVTIPGSS YVPT<5 A <3 
*VAGPHGARHHAC*FLF*FLV*MLLEMG 
SRHWQAGL*RVSSSDPPARACQSGGIP 
GVKPHAQSVSV 


7232 


21133 


A 


7292 


121 


233 


YFPKKKKKKKKKKKKKKKKKKKKRKKK* 
KGGGRSRKSYLG 


7233 


21134 


A 


7293 


3 


346 


NPLNPL VNLTVS PKRNSSLDTRKKPCRE 
RHREKETLRKKRNQHTEARGRQRENHPE 
TE * ETGQRDRDKQTKKQKDTD I QAYT YT 
DREKRDKKRHROI R* AEAER T RTyTKRHP 
DI 


7234 


21135 


A 


7294 


46 


346 


ETGLKKWGDLSNGTVRAVLNLAQADFCV 
S I PS YH I KHCWTKTTOrflGFfiinCK T T .WT 
AKKSEPGVVTHACNPNTLGG*GGRVT*G 
QWFKTGLGNIVRPCST 


7235 


21136 


A 


7295 


254 


12 


RKDAENNGSRAE* IKRVKDLVQSKLKFF 
OELEKEGILPNPFYEAS ITLI PK3PDKDT 
SKEDYRPISLMNIDAKILNKILAN 


7236 


21137 


A 


7296 


258 


1 


C IGQCVIjVAVGRELLCVSRSWCXSLLHVD 
EVGGEALGRIjLWYP WTORF5? E^FfiDT! 
TRMVMGNPKVKAHGK*VLGAFSDGLAL 
LD 


7237 


21138 


A 


7297 


191 


263 


VCSKCEPF I YIMDYYAAIKRNKIMSFVA 
RWVQLEWSELTQEQKTENNMFS * WFG 
FLLLC 


7238 


21139 


A 


7298 


2 


164 


S RVAG I TGAHHHAOIi I FVFL VETC5FHHV 
GQAAFKLLTSNHHARPQ*LHFFYFFK 


7239 


21140 


A 


7299 


307 


g 


VMIITFWLPHLSGYIEKSTPYECGFDP T 
SADRVPFS I KFFLVAITFLLCDLEKALL 
LPLF * ALQTFNLPLI VMSSLLLI I ILAIi 
SMAYE * LLKGLDCSE 


7240 


21141 


A 


7300 


205 


317 


KSSPACTPS TIX3G * GGWFA* VQE FETSI* 
DNMVKPQLY 


7241 


21142 


A 


7301 


140 


22 


FNFLFl»YP*NYFFFFFFFFFFFFFFFFL 
FFFAMNINVFY 


7242 


21143 


A 


7302 


382 


151 


S IHRSEFRPRRSHS VSRPESRGI ILVYC 
SliYFIiGSGDSAPWS *VAGTTGVCPWAS 
LIFGFFFFFFNRDGVLLSCPG 


7243 


21144 


A 


7303 


318 


17 


LVPTKRPQVENYYPVHTYIWYCTKGKRK 
WAGIGGSCLSSQHFGRPSWQDPLSLGV* 
DQPGQHRETPSLQK* KNWWQBPI VPATQ 
EAEAGEWCEPGRQSLQ 


7244 


21145 


A 


7304 


310 


132 


QWCDLGATT PANFC I FFVRNGVS PCWSG 
WS * PPDLKLSARLHFQS VFFNP 1 1 SIGL 
FFL 


7245 


21146 


A 


7305 


J JO 


16R 

IKJO 


REPPLPAANFVFFVEMRSHHVRQAGPE P 
PSSSDLPASASQSAGITGVHCHAWP* *T 
YK 


7246 


21147 


A 


7306 


276 


161 


I INAAI KKKE FMS FAGTWMKLEAI ILSK 
LTQELKVK* KLRKFNTDTILLLLL I FKI 
F 


7247 


21148 


A 


7307 [ 


310 


61 


ITLVYCNFFYFINFNPLKLFDSFVI ILS 
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Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X~Unknown, *=Stop codon,/=possib!e 
nucleotide deletion, \=possibie 
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UCQIYSCCAIVYILIL*APVQPKFLCFY 
FLYFKNIFIYKNIFYKIKYV*NMYFYF 


724$ 


21149 


A 


7308 


257 


1 


CGRVDLH*AVFFFFKRWSLTVLPSLDLD 
SWAQVILLP*PPE*LGQQSCTTPPSSLS 
TFLYAFLHIPVNKMS CFVGKDRGMFFS Y 
H 


7249 


21150 


A 


7309 


2 


181 


RDGFCHTGHTGL*LIjTSGDPPASASQSA 
GITGVSHHARPCLHSLTAFFKNWMVWPG 
QQE 


7250 


21151 


A 


7310 


25 


340 


FCVFNTMDVKNI* IFLIFVPDVHLG*QK 
NLLLDTI*KKKSIRIQE**QYKSISKRN 
YNTVSDKSWLGTVAHACNPSTIjGGQGGR 
TA*TQEFQTSLDNTVRPCVYY 


7251 


21152 


A 


7311 


2 


193 


FVDFSMHILDMLSLYLSIVICYITLFIiS 
IIDNI YVNGPHKI I IPYFYCIKYIFYI Y 
IWFCFWFFETVLLCCFSVFTFNDYCSYL 
HYFLLLGVFGLFHFF+ FLYL YMVLFLVF 


7252 


21153 


A 


7312 


1 


245 


RTPGLKQSSHFSLLSSWDYRHVPLHPAN 
AFFFFKGP*SEPLTTiASSLHWYFPGGHF 
LQGPAKIDDPNFSPILGKLKKKSPF 


7253 


21154 


A 


7313 


167 


2 


GKPCLLWMCRDSQKCGEKGTGLKREAL 
RMGK*GQARWLTP VI PAL WELEAGGS P 


7254 


21155 


A 


7314 


334 


152 


CHVAQAGVQWYNHSSLQP*PPGLR*FSH 
LSLPSGWNHSWTPPHPANFFIMGSCHLP 
LINK 


7255 


21156 


A 


7315 


2 


164 


PHVGQDGIiELLTSGDPPASAFQSAEITG 
VSHHARPTFYLS * QVGLTVTS FRNPQ 


7256 


21157 


A 


7316 


155 


9 


QRIKH* FHWLGAVILTCNPSTLGGQGGW 
ITGGQKFETSLAYMATPRLY 


7257 


21158 


A 


7317 


1 


293 


GIPSTWDYWCTS PQPHF*FFFFFF** KR 
DLAMLPGGVIjNSW IPAILQPKGPKVLGL 
QAWTILYKGYYGKEMWHKE I CKGTHDVL 
SKLLTTKRFKRLL 


7258 


21159 


A 


7318 


262 


73 


SNSLVYL I KPDS VY I FLL FVETVS SHVA 
QVGLELVASSSTP I * ASQS VE ITGVSHH 
ARPKKVT 


7259 


21160 


A 


7319 


169 


265 


GGRMALS PGRGGCSEP*GCHCTPAWETE 
QNPA 


7260 


21161 


A 


7320 


169 


260 


KPQK1 PVACDQQFTDKRNCSHGDYECNM 
VDFFFFFFRERVSLCHPGWSAW* SGDH 
TTALQPG 


7261 


21162 


A 


7321 


580 


400 


FFFFERES RSVARQ PGVQWRDLS SLKS P 
PPGFKRFPCLSLPSSWDYRRAQPCPANV 
FEFW*RWGFTTLARIVIiLS*PCDPPTSA 
SQSAGITGVSHCAQP* FFFFERES RSVA 
RQPGVQWRDLSSLKSPPPGFKRFPCLSL 
PSSWDYRRAQPCPANVFEFW 


7262 


21163 


A 


7322 


130 


26 


SPGPLFGIiIEPLPSRFKGFFWIiNLP*TW 
EYRLALTAPIKFFKFLIKTGFPH*GIKL 
RG 


7263 


21164 


A 


7323 


17 


303 


KSSGTIKGRAQS PFLQLVAI EQCQDCFH 
FRSQF* YUCKQLQPGMMMHAYNPS YS KG 
*SRRIT*AQEFKSRLGNITKLCCLKINN 
SLIFKITENIE 


7264 


21165 


A 


7324 


194 


8 


DWVEVLKKSTYRPDTVAHACNPSTLGG* 
GGSIT*GQKFEIRLGNWGPCLKKNLTN 
QLGVGG 


7265 


21166 


A 


7325 


413 


72 


GVP I FLKTLTGKT IS SERES KVPLEKGK 
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ANFRDKRGIPPHHKRVIFAEKKL*KGRS 
PFDDGFQKKLTQPRVSAFKGHPPLQPLY 
AFIKS FREGTHHQDCPFWHLTLLSARRQ 
Q 


7266 


21167 


A 


7326 


394 


177 


NVNHTHTHTHTHTHTHTHGWCKSDTGYS 
SDS * CI TPRCIHDPTWPSS SQTVNLAMG 
LTLAKEVLANMTQTEV 


7267 


21168 


A * 


7327 


232 


22 


SRPSGSKKGAPALQQSSCLSFLSSWDYR 
HKPPYLTNFCIFIi*SQSFPMLPRIiLSNS 
QPHSNLPKWELQV 


7268 


21169 


A 


7328 


232 


3 


YHRSSGSLISLIQIRLRSKI*APSIPII 
KEWLSSVAHVYNPNTI/3GQGGRLT*GQE 
FETSLGNIARPCLHENCLKN 


7269 


21170 


A 


7329 


364 


3 


PKARRATVRRTRVSKAPSNRQCLGVNTH 
VliGHRRPVKGNAKTRWSASESQAHTCNI 
TTLGGQGGLIT*DREFESSLATMQNPVS 
VSGWLQKMFSKQKLYSKFFPNVAYKAFR 
IWLLCARA 


7270 


21171 


A 


7330 


177 


2 


DPHIiHKKFKARCX3GMTIi*SQLLRRIiR*K 
DQLS LG VGGCFELL WCY CTPAWVTERD P 
SC 


7271 


21172 


A 


7331 


413 


40 


RHMYLMFYTYIYVLDIYTYTVFFMYIYI 
P*CVCKYISCLHIMYRFISMYPCVYVYV 
CMCVCNVIHVIVENAGSCLALSALIEME 
GMARDSKMWTFKPDEKMRLSYKKQLEKL 
NFMILCPTWKEF 


7272 


21173 


A 


7332 


147 


49 


KNYIYFCRRGLPMLPRDGLNSRAQEILL 
PQPPM*IiGLNSRAQEIIiLPQPPM 


7273 


21174 


A 


7333 


148 


361 


SWLGTVAHAYNPSTLGGRGRQIT*VQEF 
BTSQSNIARPLLYSVLAYAVQPPLPMVA 
MRKLWLIPTSRITAL 


7274 


21175 


A 


7334 


305 


2 


CPGGFVTRVHR* YVRLSQPDLRVFTSGA 
MPASAAQSAG ITGMSHRARPVCS I IIFL 
NIFKVTCI FVELTI PNQNKQNTFLSSTV 
ASCRILASRPTPIVWL 


7275 


21176 


A 


7335 


1415 


1164 


FFLRQSLIiYCPGWSWAQSQLIASSITQ 
A*AILPPCSWDYSSQHTWLKFFFC*DGF 
SMLPRLVSGSWAQMVLLPQLPKAQAKL 


7276 


21177 


A 


7336 


1 


206 


GTRADHLMSGFQDQPGGHTETLSLIiKIQ 
RSQAWWHVPVMLPTR* IDARESLERLMT 
IALLKVRNRYIF 


7277 


21178 


A 


7337 


65 


302 


KSSLFYFLKIETGFHYVAQTGL+LLSSD 
DPPASVFQSAGITSMSHHTQPKELLKST 
RNTIiEE PNREWAKAREWAHRRNT 


7278 


21179 


A 


7338 


208 


38 


NPVSAKNEEYPRGTFDLGTQAEVQWHDL 
GSLQPTPPGLK*FS*LSPRV 


7279 


21180 


A 


7339 


2 


198 


ARADHPRSGVRNHPGQHGETPVSTKNTK 
N * AWWR VPLVPE CWD YI LFYMFTNCMDN 
KLPKWLTLC 


7280 


21181 


A 


7340 


2 


227 


ARGE IMS FAATWMQLEANILSKLTQE * K 
AKYCMFS L LSGAKHWVLMNI KVGTGDTR 
DYLGCSTQAVLKNIiGTRAE 


7281 


21182 


A 


7341 


3 


300 


HEGRNYTLTGRDSCTLAASAEKACGACP 
LWGKCDAESSRCV CREASDCEEEG I S IC 
VDVNCKEQTMPEC * AVAL*CTVHLI YVT 
INNALCIiPQLTLFL 


7282 


21183 


A 


7342 


210 


351 


RGIjGT VAHACNLNT FGG* GRWI I * AHEF 
ETSLRNITKPSLRENPDML 
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C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
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7283 


21184 


A 


7343 


306 


164 


I YMLYF I Y I TL I Y IHTHTHTHL YIiDKYI 
YL*CVHCFL**AYMI CHQR 


7284 


21185 


A 


7344 


442 | 


8 


S QMS KLKTWFRAVAHAWNPS TLGGQGRR 
IT * VRE FETSLGNTARPNLYWN 


7285 


21186 


A 


7345 


3 


248 


HELSLSLLLSPFFFKNBKGSR+VAHAGL 
KLLTSSDPPT*ASQSARITGISHCARPL 
LSHFlAYLALPCHQNRIiCTGHLPST 


7286 


21187 


A | 


7346 


2 


246 


HELYQS ILLSPLFFKNDIGSR* IAHAGL 
QLLT* SDPPT*ASQSARITGISHCARPL 
LSHC1AY1ALTCHQNRLCTGHLPST 


7287 


21188 


A 


7347 


1 


130 


GTRLLKRLTQETRLNPGGRGCREPSLHY 
CTPTWARQSQTVAHN* KRLTQETRLNPG 
GRGCREPSLHYCTPTWARQSQTVAHN 


7288 


21189 


A 


7348 


516 


288 


FFETESRFITQTGVQWCDLGPLQPLPPG 
FKGFSCIiSLLSS *DYRCLPPHPANFCIF 
SRDGVSPC*LGWSQTPDLK 


7289 


21190 


A 


7349 


417 


148 


S TS VVVQAGVQ WHDHGLLLPGP PRLK* S 
SHLSLPHG*DYRHILSCIANF*NIFVEA 
GFRHTAHGLEPLGS SDLPASASQSAG IT 
GMSHLA 


7290 


21191 


A 


7350 


143 


2 


TQKFKTS LS S WDYRRVSAHVANFC IFSK 
DRVSPY*PGWSRTPDPARA 


7291 


21192 


A 


7351 


189 


37 


EAIAVRRI I *NLFFFFFF*RWGLTVLPK 
LVSNSWAQAlLSPQPSKVliRLQA 


7292 


21193 


A 


7352 


131 


2 


KSRIiGTVAHACNPSTIjGGRHGWIT*SRE 
FKTSRANMVKPRPRA 


7293 


21194 


A 


7353 


1 


106 


GTRGQCLTMLPRLVLNYWAQMILPPWPP 
*VLGLQV 


7294 


21195 


A 


7354 


939 


653 


KGNFI FhRQS FTLVAQAGVQWRDLGSPQ 
PPPPGSK* FSCLSLRSSWDYRHGPPHPA 
NFAFLVEMGFLHVGQAGLELPT * GDPPT 
SASQSAGITGR 


7295 


21196 


A 


7355 


270 


26 


EARFLGLLRDI AQLCYPGWSAAV * S *I/T 
AASTSWAQVKASTHLSLPNN*D*ARATM 
PG * LKKYVFF* RQGliAI LPRLRLVF 


7296 


21197 


A 


7356 


329 


206 


HHLANF*TFCRDRGLTMIiARLNSWPQAI 
LLPQPPKVLGIiQM 


7297 


21198 


A 


7357 


128 


253 


MRPDTVAHTCNPSTLGGQGGRIT*THEF 
ETSLGDMMKPYLYK 


7298 


21199 


A 


7358 


2 


108 


AREGSGGQGCSEP * WRHCTAAWVTEQDP 
VSKTIKK 


7299 


21200 


A 


7359 


1 


340 


GTRKRNWHIiA*AIxHTHTHTHTHRHTHTH 
THFSFPANS *RDTFVT F*QSLLLE IMTS 
FSPSLGLKCSFSQQPSGLLSS IAGERDS 
RIHGENTVSMCLHNRLKAELRSDVI SLR 
V 


7300 


21201 


A 


7360 


385 


1 


SVGPSKPCAGYNLLVCHLLRPLEKRSIR 
VGVT* FSRCHLSPLSLTRKANSLTPCTS 
WMRRCLILLRLTLGVPHPMSCTHCPTLP 
SEMNLGPQLEMQKS PI FCVAHAGSPRLE 
LFLFGHLGSTPNLPSC 


7301 


21202 


A 


7361 


234 


29 


I^TEPKGSKAIjKFNVTLLI 1 NIAF*CNSG 
CNSNL*S * QQY*RVIQSHHLSQTLWLTP 
VIPALWBAEAGG 


7302 


21203 


A 


7362 


68 


339 


DRDIANKPAWQLCQSRILYLAQLQFIDQ 
GELLPDFRIHYKVT * TCTAWH*RRNRLI 
DQWNRADIPDI^JPCQAWRLTPVILVLWE 
ADVGGS 
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7303 


21204 


A 


7363 


168 


2 


CFYS THAHI FKELRQRPG AVAHAYNLS T 
LGGRGGRLT* GQ KFETS LANMVKPHS C 


7304 


21205 


A 


7364 


399 


62 


QPGQYGKHPVL I KNSKIKPFWGDPPWP 
NAREG*A* KMVEPGKVRVQSAQ IKALEF 
NI/3PKKKVPFKKKNPWLAKGGGLRPEIP 
APGKASKGGSPRSRILRPPGFLPYPAHP 


7305 


21206 


A 


7365 


289 


76 


VCDVLRESRAQLSRLKCSGI I * AHCNVS 
IAGSSNCHASAIiRVPGSTGVWVESERGD 
VDRFEAYGSRGYVCA 


7306 


21207 


A 


7366 


364 


182 


Q WHDHOXFKLLGS SNPPTLAP * VAGTAG 
TCHCAWLI* FLTWKIGFKTSLKI IVLLG 
YFWF 


7307 


21208 


A 


7367 


155 


2 


SLGTKREG^KSATVAHACNPSSLGGRG 
RWIT*GPEFPTSMANMMKAHPRA 


7308 


21209 


A 


7368 


342 


184 


IiSLPSSWDRRYTPPRLANFFKFFFKR* S 
LTMLPRLYRPFYITTHQTLNSAMTF 


7309 


21210 


A 


7369 


2 


211 


AREPKCL*TDEWIKKMQYIDAVEYHIiAI 
KLNE ILTHATTWMNXiENMLIGRS*TQKA 
TCCVI PFMDS IYV 


7310 


21211 


A 


7370 


190 


344 


YKNVTVKRILTWPSTVAHACNPSTLGGQ 
GGRFG*GQEFETSLGKIARPCLY 


7311 


2I2I2 


A 


7371 


24 


254 


KPQGIGAHASSQHYGRPRREDHLSLGV* 
DOPGOHGEIPSLOKKKKKGLPGRGGVCL 
KPWFLKILGEKNVWVLTPKG 


7312 


21213 


A 


7372 


43 


358 


DSNLNYSLFFHGEADLGTTQVLTHPSTT 
AMYFVHYCQSP * ILYGT INT * PPWHKN 
PIHI KTPS PCIiQASTAXNLOLSHTNCNS 
KATPHPIX5 YQQTYP PLTVHST 


7313 


21214 


A 


7373 


2 


288 


IHI I LAFTSSLLG I LVYRSHL IASLLCL 
HGIILSLTIIAPLIPLNTHSLLAPIGPI 
AILVIAA* OAAVGLALLVS ISTTYGIiNY 
VHNLAIiLQCCN 


7314 


21215 


A 


7374 


335 


1 


VQWNDFGSLQNPPPGVSPFFWIiSLPNNIi 
GFKGPPPRPGPFLKF*+KPGFSALTRMV 
SLSRPRDSAPLAPPKAGVSRLNPRARAP 
FFFFFVFCFHFFLLSGEKKSLAHKGGI 


7315 


21216 


A 


7375 


399 


91 


RIFFFPPSLKAPPHRQKPTPFPPTKKPL 
VFSI PPI FPPKFLKKNPFSPWEVFIiFLS 
FFQKDAFFGPQV* KGNFFFFKGPPPLTQ 
NSPQGVPPKKKKKKKKKKN 


7316 


21217 


A 


7376 


377 


131 


YLYIKFYNFII ILFYIFIFNI F* FNMI F 
NLLFYIFYYILINFNILIILI INFFYFI 
IiLIILNIILILYIirYYFIIKJTIIBCT 


7317 


21218 


A 


7377 


2 


156 


FHHW* AGLELLGS SNPPTWAFQSAETT 
GVSHHAQLPHFILRKQELLSLGL 


7318 


21219 


A 


7378 


319 


59 


DTKMNSQDILYSIQSNRT*W*LIKRQ*T 
KQKSAPRNGNEVP * THQHSTATTLISQQ 
PSTSKQDPPAAK* L * FAEGRLLAFPSNN 
VF 


7319 


21220 


A 


7379 


308 


12 


FSPTISNFNYKSNPSPLIYLFQ*PITQL 
I PTN I STNTPLNL * FYLRLI YSTS ISLL 
P I SNNVKI K*QFEHTKPTPFLPTLIALT 
TLLLPISPPILIIL 


7320 


21221 


A 


7380 


11 


169 


RRDLAMVPRLFSSSPGLK*SLHLGFPEH 
WDYSHEPPSWLLFFLFLITEGYFF 


7321 


21222 


A 


7381 


300 


2 


KFFPPPGFFFKGFFSPL*SHQKKTQIFF 
GEFFWGFKKFFLFFFPPGGFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFLFFFF 
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FF*INSLFIQNSFLT 


7322 


21223 


A 


7382 


163 


160 


PGTSNDEIQRRNSRNRISPIAER*GLSL 
LPRLEW4GSNEAHSNLELQGSSNPPASA 
SKSTMITGVSHCA* PRLEWMGSNEAHSN 
LELQGSSNPPASASKSTMITGVSHCA 






j± 


/JOJ 


1 


10 j 


VCVCVCVCVCVCI KWDSYSAIKKNELVA 
FWTWKRLETIILSEVSQE*NTIYHMSS 
LICGS 


/ jZ*I 




A 




2 


196 


KIQMWPGWAHACYPNTI»GG<2DGRTA*G 
QBFQTSLGNIARPCLYQQQQQQQQPRCS 
LTYDGDKS 


/ JO 




& 


/JOJ 


j 


197 


RNI*SSLQPPPPGLK*WSCI*SIiPSSWDYR 
RPPPCPANCFMLCEFFRTNGVFPCSPGV 
YFEPRN 


7326 


21227 


A 


7386 | 


266 


198 


TOLSGFLRSAGLTMIS*PHDPPASASQS 
AGITGVSHRARPLVLFLRDSLYSGTQAG 
VPWRDLS SLQSRLAEQMRFCFP 


7327 


21228 


A 


7387 


164 


311 


SRQQKGLSGVAHPCNPS ILGGRGRWIN+ 
GOBFETSMASMVKPCLSLLKL 


7328 


21229 


A 


7388 


3 


218 


EIMSFAATWMQLEANILSTLSHE* IAKY 
CmSLLSGAKHWVLMNIEVGTGDTRDYIi 
GCTYPSSTEQLGYYS 






A 


7-3 on 




•SO 


SDCGLTP IAQTGVHWHHLSSLQP*LPWL 
G*FSHIiSLPSSWDYKYVPPCPASFLYYF 
LVBTGFCHVAQDGIiELLDSNNPPSSASQ 
SAG I TGMSGCAGLRRF * KLRS WR YLVLL 
AMQVNEQGTSQAGEAVSEADAGQGCCCG 
LCLHRIIiSQQYALP 


7330 


21231 


A 


7390 


169 


322 


APGNFKTN*RKNGFWAGNTLGVSVKNMK 
NGWAWWLTSVI PALWEAEAGGS P 


7331 


21232 


A 


7391 


1 


353 


GTSPHYGSFHGHQKDVTGKGSQSRPRER 
VLGSHARKNSLPVHE VKAYLL * KYLS I L 
RATHILISPDGCWVPIFIAQSYIFLSSC 
GFFLLFFLLLLTLSFCSWHLIISFLALF 
VIPLY 


7332 


21233 


A 


7392 


372 


3 


IQAGACCAECIPSKGIRWNPSIVEGLHS 
DMSWSLCLFLSGAISVLCNFRLSGSSDS 
PMSA*QAARTAVIRHHTRIiIFCIFSRDR 
ETGFHHVGQENLLMIjLLQLFSGIiGFS IS 
ATLIiTLMLLV 


7333 


21234 


A 


7393 


362 


176 


ARMECSGAI *AGLRGSSDSLASAS *EAG 
YTGKCGHAQ*TFVFLVEVRFHHVGQNAI 
NIRTSL 


7334 


21235 


A 


7394 


2 


304 


ARADCCARNE*GSVGMGPSEPGVGYNLL 
VFRFIiRPLEKHS IRVGETRFSRCCLSQL 
OLARTGNSLTPCTSWVRQCLALLRU^IiG 
ALHPIiSCTQRPTSHSE 


7335 


21236 


A 


7395 


256 


3 


MYYAARYWNKAKITTESLRHWAGHGGSH 
PYNPSTLGG*GGQIS*AHKFETSLGNMV 
KPHLS KNKQTOKQTNKQTNI S WSWCTCL 


7336 


21237 


A 


7396 


346 


2 


RDHFYRESHSLITYICLENVKSNLKCGE 
GPLNFTFHLQRKNCNLGHTRS PGGVYHV 
GRTETATGPIKRRNGLGAVAHTSNPSNL 
GGQGGWIT*GQEFKTSLSLPRWLTPVIP 
LV 


7337 


21238 


A 


7397 


2 


242 


RRYSDYPDAYTT*NIIiSSVGSFISLTAV 
ILIIFMI *EA*KKK30aaOCKKKKKGGPF 
KKTLGGPKFNRRMEGKOT*StiKGGR 



1001 



WO 01/64835 



PCT/USO 1/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alan in e 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N-Asparagine, P=Pro!ine, 
Q=Glutamine, R=Arginioe, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, ^possible 
nucleotide insertion 


7338 


21239 


A 


7398 


1 


289 


TKLLGLYYTLLQASNDFKS PFS I FDGI Y 
GSTFFVTTGFHGLHVIIGSTILTI * FIR 
QLIFHFTSKHHFGLEADA*YWHLADARR 
HSESITHQCCSL 


7339 


21240 


A 


7399 


346 


40 


IIIFTNKIYIKINKRLKLFIIKILDLYI 
1*1 FFFRK* FTFKQ* KF* I ININKIYYN 
NIK* * IFIYLFFFFFFFFFFFFLFFFFF 
FFFFFFFFFFFFFFFFFFFLKK 


7340 


21241 


A 


7400 


1 


334 


NSCRRQRIJjDLIiLLPITL*LPQLIGYIE 
KSTRYECGFHP I S PARVPFS I KFFLVAI 
TFLLFDLQIAI»LLPLP*ALQTTNLPLIV 
MSSLLLI I ILALNLAYE* LPKGLD*AE 


7341 


21242 


A 


7401 


2 


302 


HTINLLGERYVLATTY* *STMTLLLTGL 
TVLHTGLYSVYLMTTTQWG*LTPHI inm 
KP* FTR*DTLVFIHLSPILLLSLNPDI I 
TGWLVLLSYBLYLLT 


7342 


21243 


A 


7402 


57 


338 


'^THNVIGYIEASTAYESGCDP IS PARVP 
FSI RFFIiVAITFLLFDIiE IALLLPLP* A 
LQTTNLPLIGMSSLLL I 1 I LALSLAYB * 
LQKGLD*AE 


7343 


21244 


A 


7403 


407 


68 


QEELVDPLTTVREQCEQLENCVKAR*RL 
* LCDERVS SRSHS EEDCSEDLFDFLHAR 
DHCVFLILYINLKYMCELT 


7344 


21245 


A 


7404 


494 


154 


SLFI FYLYLIFLF* * FFFHYFFYFIiLS I 
FSFKLFFFFLLHFFFHMFFFLFFSPPNK 
IFFPKIPPRFFLFPLF* *KFFFFLPFKF 
LSPLGFFF*SPPFFFFFFFFFFFFFFFF 
F 


7345 


21246 


A 


7405 


1 


400 


NPGRFLSTSNS S LYERTRE I RPTSQSAF 

KKGGPLKKKPRGGQK*TGGEKKKFFPKR 
GGKKKPPGKF*KKTFFWGGKKMGKTPQK 
KINPLGKKKIFKGKRGKTPPL 


7346 


21247 


A 


7406 


2 


257 


RASLCHPGWSAW* SWLIAALTSWLKQF 
SHLNSLKCWDYRHEPPHLA*LRFEHRHS 
GSSICTLNR*ILLSFPILSIRYKNGLII* 
Y 


7347 


21248 


A 


7407 


259 


47 


YVFFFFLFLLFFIRLQSVSPFFFFFFFF 
FFFFFFFFFFFFFFFVK*SFIEMFSFW 
L YIiLRQVS I S IAYTLFG 


7348 


21249 


A 


7408 | 


3 


248 


VTFL*CDVEIAVLIPLP*ALPTTNIiALI 
DMSSLLLI I ILALRLAYE * LRGGLD* AE 
EHQATHSAYERAPPL 


7349 


21250 

t 


A 


7409 


2 


350 


RSGMPRRYSDYPDAYTT*NILGSVGAFI 
CIiTAGIliIIFMI*EAFASKRKVLIVEQP 
SINLE*LYGCPPPYHTFEEPVYIKSRHI 
RKESNPPKLVSSQPHGLHDFFKKKLGTI 
TEKN 


7350 


21251 


A 


7410 


2 ' 


232 


LDC3TYAKIHFTIVFIGVDLTLLPQHFLG 
LSGMPRRYSDYPDAYTT*NILSSVGSFI 
ALTAAILI IFMI * EAFAKKF 


7351 


21252 


A 


7411 


2 


321 


TSIiLAVLLIIMTIi*LPQLNRYIEKSTPY 
ECGFDP I CPGRVP FSI KFFLVAI TFLLF 
DLETAIiLLPLP*SLQTTNLPLIAMSSLL 
LI I ILDLNLAYE * LQKGLD*AE 


7352 


21253 


A 


7412 


2 


194 


SIECFLVPITLLLCDLEIALLLPLP*AL 
QTSNLPLIVMSSLLLIIILALNLAYE*L 
QKGLD*AD 



1002 



WO 01/64835 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/5154 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, EMAspartic Acid, 
E=Glutamic Acid, F=Phenylalaniue, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leudne, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T~Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possibSe 
nucleotide insertion 


7353 


21254 


A 


7413 


1 


197 


lt\ tx Jf FVTK1LLLTSLFL*GRTAYPRLR 
YDQLIHLL*KNFLPLTLALLI * HVS IP I 
TISSIPPQT 


7354 


21255 


A 


7414 


1 


185 


STPLP * ALQTTNLPL IVMSSLLLI 1 1 LA 
LSLAYE * LQKGLD*AEKKKKGAAVLKDP 
SGGPS 


7355 


21256 


A 


7415 


347 


228 


LE*LYGCPPPYHTFEEPVYIKSRLKRKE 
SNPPJCLVSSQPHGIiHDFFKHIKNYRKNK 
RKEGLASVKENDRGP 


7356 


21257 


A 


7416 


1 


296 


HAYDHASAQLNGYIEQSTPYECGFDPIS 
RARVPFCIKVFLVAITFIiLFDLEIALLTi 
PLP*AI£TANLPLIGMSSLLLI IILALS 
LAYE*LHKGLD*AE 


7357 


21258 


A 


7417 


3 


202 


TCS I * HVLVRMT * L I YDLE IALL* P VP * 
ALQSTNLPIjISMTSIjIiLII I LALSMAYE 
*LQTGLD*AE 


7358 


21259 


A 


7418 


1 


321 


SNTLLALVLITITS *LPQLSGCVGECTP 
YECXJICRILNARALCCMQIFLKAITFLL 
FDLE IALLLPLP * ALQTTNLPL IDMSSL 
LLI I ILALSLAYE * LQKGLD* AE 


7359 


21260 


A 


7419 


1 


212 


RTOGII FRPLSQSLY*LLAANLVILT* I 
GGQPVS YPFTI IGQVAYVLYFTTILILI 
PTICLI ENKILKWA 


7360 


21261 


A 


7420 


2 


257 


GRVGGRVGYSIAFFLQAITVLVLDLESA 
LLLALP*SLQTTNLSLIVMSSLLLIIIL 
ALSIiAYE * LPLGLD* AEYWGEMMTTTLD 
A 


7361 


21262 


A 


7421 


367 


269 


HCDLKLLGSSDPPASAS * VAWITGTCHH 
TKLS 


7362 


21263 


A 


7422 


2 


116 


ALQTTNLPLIVMSSLLLII ILALRLAYE 
* LQKGLD* AE 


7363 


21264 


A 


7423 


2 


336 


KPECGREGILPCCSSSAWPEGSFRPFQM 
NLGSFLSFFFLFFFFFEKGLFLCPPV*S 
PYGKSNLLE PSASRFKQFFCLGPLNNWD 
YRLFPPFPGNFGVFKKNRVFPFWPSLV 


7364 


21265 


A 


7424 


3 


154 


LFL* IRTAYPRFRYDQLIHLL* ENFLPL 
TVALLI*HVSIPITISSIPPQT 


7365 


21266 


A 


7425 


3 


154 


LFL* IRTAYPRFRYDQLIHLL* ENFLPL 
TLALLI*HVSIPITISSIPPQT 


7366 


21267 


A 


7426 


1 


314 


INTLLTLLLIIITL*LPQLNGYIN*STP 
YECGFDPISPARVPFSIKFFLIAITFLL 
FDLEIALLLPLP * ALQATNLPL I DMPSL 
LLI I ILALSLAYE * LHQGLY 


7367 


21268 


A 


7427 


1 


331 


ILIINTLLALLLIIITF*LPQLNGYIEK 
STPYECGFDPISPARVPFSIKFFLVAIT 
FLLFDLE IALLLPLP * ALQTTNLPL I VM 
TSLLLI I ILALELDYE* LQPGLD* AD 


7368 


21269 


A 


7428 


2 


373 


SDRMHI IHALVITILLGLYFTLLQASEY 
FESAFTISDGIYGSTFFVATGFHGLHW 
IGSTFLTICFIRQLIFHFTSKHHFGFEA 
AA* YWHI VDVAGLFLHVS I YRGGS YNHQ 
IIPSVAQESTPA 


7369 


21270 


A 


7429 


378 


/**> 
63 


FPPPQKGPPPKIPPGFFFSPPFWEKFFF 
PPPPLILPPPRFFFNPPPPFFFFFFFFF 
FFFFFFFFFFSFFFWDFLGSGC 


7370 


21271 


A 


7430 


1 


109 


QTTNLPLIVMSSLLLIIILALSLAYE*L 
QKGLD*AE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

aminn <*riH 

alUIUU <XVJU 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylafanine, 
G=Glycine, H-Histidine, I=lso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

0=fwlllt£krmn<> Q~A rainina C-CarSna 

u kiiu iijc, Ar gin i iic, o~-ocnne, 
^Threonine, V=Valine, 
W=Tryptophan, Y=>Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
uiiucviiue o etc uon, \ — possioie 
nucleotide insertion 


7371 


21272 


A 


7431 


358 


290 


FLLVIPNFLFHFLIYIIISIVPNVSFFH ' 

TTVTTiSTPRPPTTPTT T T CC T7V7CTT T?T?Z? 

FITSFPFFYFFFFFFFFFFFFFFFFFFF 
FFFFSFFFFFFF+ FFFTN 


7372 


21273 


A 


7432 


1 


95 


PTRPPTRPHNS3jQIj*TPGLKQSYPRLPS 

o W JJ * jxtlin f oil lr AW r KJvrr *KWH1jxI_PGW 

S *TPGLKQSYPRLPSSWD 


7371 


9 1974 


A 






o/U 


FSPSLRPSFLPSFLPSFLPS*LPSFLPS 
FLLFLSYS FCPLS FLCLRCI PHSAFFWG 

IMjrJCVVf V VtfUiiN^V X V X X I KlJ-lOr iAXjTi\Jj_i 


7374 


21275 


A 


7434 


163 


1 


SVYVKRIGIiGRVAYTCNPSTVGGQDG* I 
T*GQEFETILAKRREPLFLACFLFLI 


737^ 






7 A 


1 A Q 

148 


JoZ 


JN UJJn o V Hi H\r± KIM. JvOJjtoX VAHx iI^STxj 

GGQGGHLA* AQEFMTSLGHMT* S SLYHK 
LTE YGWRDTAS LLMC 


7376 






7/tlA 


01 


Til 


UU^PRLDMSWSQAILPLKPPNALGLQ 
A 


7177 






/**J / 


AA 

44 


z42 


C*T TOT. T7T PPPUD Vt? O T* CVYLJT"\T PPT rtnm T> 

RFKRFSCLSLPSTWDYPJjAPQCIjANF* I 
F**KSGFTFG 


717R 


919*70 






HQ 


inn 


X V- V b JjJj I r r JL r X J. x 1 r 1 Lib xi*lvJ?jjr XT1* 
NYISFLFFYF*H*FYF*KNFIFVFPFGY 
FPRP 


7379 


21280 


A 


7439 


1 


221 


LKx^rttTOPTTISLVITRiWKQPKCPlWQ 
y I VH* £>AI KKNEIiL IHAKTWMNLRGI ML 
SEICQPQKILYLEIPFI 


7380 


21281 


A 


7440 


35 


345 


FSFCCVCVCVCVSLPKHAI IEMLNQTHP 
YVPKGVAYLPERBP F I VPMDPELTA* YE 
DYRTHESAQEPSEPHQLSRRSLPH* PGH 
2\ Tonne t> T>rn owr* in> dtou 


7381 


21282 


A 


7441 


3 


384 


VIHL*AGAVI IGSRSKSTNALAHFLRQG 

L it X Jr V A Jj V JLAi X 1 izT 1 rlN JjJLiXy P ViVbAERIi 

TDNITJ^HIjl^MHlilVSATL^ 

TL IMFTVL I LLT I LQ LAVALMHAYVF AL 

LVCLYLHDDAYDHGG 


7382 


21283 


A 


7442 


15 


277 


GLAILPRLVPNSWPKVILQWPPKMLG* 
QA 


7383 


21284 


A 


7443 


21 


694 


NGGlxNAHLAS ASE FDHSGVQL I EREEE I 

CIFYEKINIQEKMrOjNGEIEIHLLEEKI 

f\j?TJfCWfV T ai?if m?nT n/rnvT t t>i\ vt> c?t r\ 
v r iaru^uvxAJirUfcjKy x v„v xyr^jjjriVRa<oJjJJ 

ADLAVLQ IQFSQCTDRIKDLEKQFVKPD 

GENRARFLPGKDLTEKEMIQKLDKLELQ 

UUOCEEKLLEKDFIYEQVSRLTDRLCSK 

MMALCA* LSMKQAPT IELQKE VQGERRP 


7384 


21285 


A 


7444 


141 


428 


DRS^BAGDGRIAGDFFFF*TGTNFVAQ 
AKVQWGIHSSLHP* Pl>GLr^SSYl^LLG 

CVTTWtJr.POH'IXr.PPP'T.'KTffDnCT T?H Di^dTCV - ' 
o XriArtJUJXJrXXHXxj? r IT XJi\jNKooxjr ^*roWrvJ 

TGFRGTTFWGAK 


^7385 


21286 


A 


7445 


35 


343 


GYS I IGC PKFLDEKI FDSGHEKERKINS 

LLF*KKjvXKKKK3CKKKj^ 

TSGGPKI KRGGGIKNFP I KGGVKKTSGE 

FFGKKPFFGGEKFWEILP 


7386 


21287 


A 


7446 


403 


193 


EYIKQGAE*NNQPKLHAHRKRLTWKETP 
RC IHEG LAP AI MNS DCLVFDTS I AQLFA 
ENGNLGINVTISMC 



1004 



WO 01/64835 



PCT/US01/04927 



SEQIB 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alantne 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!euciae, 
KNLysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7387 


21288 | 


A 


7447 


411 


110 


LTDSLINTVYR* KMSY IS * TNSA* VYLY 
IFCIYIYIYIYIYRRIYSIYRC*YIYYG 
PAHIVLVYSSSYILVYYESHKYGYIMGL 
NILSYIFIYSYFSACS 


7388 


21289 


A 


7448 


1 


380 


PTRPQKLKKNCNGVSNGDLCFFGGGQFL 
IWGFFFFFCILIEGGLNSKNYELKSLIK 
TYGAKKAEIEHTEN* KNRETLLERAAQL 
AIKVTNPNAMLRTVEHDYAMYMHNFWLN 
KLNNCHKVAI S CKQ 


7389 


21290 


A 


7449 


200 


437 


VQSNNNKNFKRRRL* PGVVAHTCNPSTL 
GGQARRIS*AQE1^TCLNNLMRPCNYNK 
FINVLDDVHGVTSPVSQYYKKVC 


7390 


21291 


A 


7450 


405 


3 


PPRFFGFFFPFYPLKFFFFPKGFNFFGG 
FFPIFFPPKKKFFFKNSPGGFFLPPFLG 
KIFFFLPPFKFGPPRGFF*RAPLFFFFF 
FFFFFFFFFFFFFFFFFFFFFLQRINFT 
ARPGTPLSRAVPLRGALPPSA 


7391 


21292 


A 


7451 


2 


348 


PRVRPRVRYKLQTKRCDLTLPS*GHSHV 
SLRNTTFDALKS*DTOSIiIjSPKIjECHGV 
NTPNCNLRLPSLSDYPASLSREAWIAST 
RLRAWE I FVFLAQMRQD YCMNPGGGGCN 
ELK 






A 


7452 




168 


LVHDGHAGLKLLTSGDRPA* AYQSAGI T 
GVRPLGPPTYYYFRTASFMTFICEFATV 


7393 


21294 


A 


7453 


184 


260 


SSRGGSLASRSSR*STLLGLPKCWDYRC 
MPPSLANFFFFNF* I *NFCFW*RRSLGM 
LPRLWNS *MQAILPLR I RLQGIHFAGT 
QTFSL 


7394 


21295 


A 


7454 


405 


64 


FFFFFNFF*KKKI*DVFAKIFFKIFFFF 
SGLKIFWGGGFQNAPPQKKFFF * KIKS F 
FFLNFFF*KNIFFFVAGWFFLLNFFFR 
GAPLFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFF 


7395 


21296 


A 


7455 


368 


2 


SQSFGRPRWENRLGPEQHIKISRYRPAQ 
VHACHPSTLGGRGRWIT*GQEFKGSTHL 
GLPKYWDYMPBPLLLVYIQCFHFRTFCN 
KAAIHCFKPEQLLYNKTSYFKESYKVKA 
ALVACKPTRP 


7396 


21297 


A 


7456 


2 


183 


TERDS VS KNKY I YIHTHTHTHTY I CVHG 
VTYVYI*SLLLIYNIAYILIYHVFYILL 
IFKA 


7397 


21298 


A 


7457 


281 


127 


YNFCIFM *RCGFNNVGQSGLKVLLRGYS 
PASASQIAGITGVSQCAQPVAYS 


7398 


21299 


A 


7458 


2 


194 


TSVRKYF * YPFTITDRIYGSTWWATGL 
HGLHVI IGSTFLT ICLIRQLI FHFTSYH 
HFGSEAAG* YWHFVEPARLFLYVCIY* * 
GSTWVVATGLHGLHVTIGSTFLTICLIR 
QLIFHFTSYHHFGSEAAG 


7399 


21300 


A 


7459 


22 


319 


GGS VTGAYHR I LDHDRNQI FLA* GGS VT 
GAYHRILDHDRNQIFLALLISMLLGLYF 
TIiLQASKYFECPFTICDGIYGSTFFVAT 
GFHGLHVTIGSTFLTICWIRQLILHFTS 
RHHFGLEAAA* YWHFVDW* LFLYVSI Y 
* *GSVTGAYHRILDHDRNQ I FLALLISM 
LLGLYFTIiLQASKYFECPFTICDGIYGS 
TFFVATGFHGLHVIIGSTFLTICWIRQL 
I LHFTS RHHFGLEAAA 


7400 


21301 


A 


7460 


1 


222 


TILLGLYYTLLQA*EYFKATFTIYDGIY 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino add 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, {^Phenylalanine, 
G=dycine, B=Histidine, I=Iso leucine, 
K=Lysine, JL=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Senne, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

ft a * l ■ ■ a* % »m » 

nucleotide deletion, \=possib!e 
nucleotide insertion 














GSTFFAATGFHGLHDIIGSAFLTICFIR 
QLIFHFTSEHHFGFEAAA*YWHFVDAT+ 
LCMYVSIY* *GSTFFAATGFHGLHDIIG 
SAFLTICFIRQLI FHFTSEHHFGFEAAA 


7401 


21302 


A 


7461 


3 


246 


LNYSLFFHGEANLGTTQVLTHPSTTAMY 
FVHYCQPP*II i YGTINT*PPWHKNPIK 


7402 


21303 


A 


7462 


2 


155 


SRSRAALLLPLP*ALQTTNLPLIVMSSL 
LLIIILAIiSIiAYE*IjQKGLD*AE 


7403 


21304 


A 


7463 


1 


136 


PTRPAPSHLLYCKNIKKLFPSGTANEAL 
I1ADIFLRCKKKKFKVK* *VLKASEE*N 
GCICHLHGMH*NSAVCTVFWIiKQIMNSD* 
**KVNTQ*KKKKKKKKBCKK3CKKKKKKKK 
KKKKKKKKKKKKKKGGENNK* KNTIKKLF 
PSGTANEALI IADIFLRCKKKKFKVK 


7404 


21305 


A 


7464 


166 


192 


N IN * MW YMHTVE CYS DFKRKE I L IHATT 
WMNLEDIMVSEISQSQKDKYCMILHTEY 
KLYIHM 


7405 


21306 


A 


7465 


416 


55 


LFFLRLGYFFLPNIRVFFFLSTLFFFFF 
VWGFFFFGRFFFFFPPPQEVCFFFIF*G 
IFFFFLFK*KNMFFFFFFLWGPPPFFFF 
SGAPLFFFFFFFFFFFFFFFFFFFFFLR 
KANTKKMF 


7406 


21307 


A 


7466 


295 


84 


IWGNVTLLQLVSCLFCFLFLH*GFFCCC 
CJCLFLFLFVCLFVCWFLNLKPS I KCLLP 
FSLSLGCAEIPSSF 


7407 


21308 


A 


7467 


142 


258 


PGTVAHTCNPSTAGGQGGRIP*GHEFKT 
NLANMVKLHLY | 


7408 


21309 


A 


7468 


3 


192 


PSEYFESPFTISDGIYGSTFFVATGFHG 
LHVI IGSTFLTICFIRQLIFHFTSKHHF 
GFEAAA*YWHFVDW*LFLYVSIY* *GS 
TFFVATGFHGLHVI IGSTFLTICFIRQL 
I FHFTS KHHFGFEAAA 


7409 


21310 


A 


7469 


2 


no 


GRVGKHHFGFEAAA* YWHF VD W* LFLY 
VSIY**GS 


7410 


21311 


A 


7470 


380 


8 


TPQKRKKLKKAGEGVFPPPPKKTTPPP P 
PIFIYFIFFFFFIYFFFFFFFFFQYMSF 
FYSHIMIQRHFLNF*ESFYSCISIIiQ*L 
FKFFSFLGGLIKL 


7411 


21312 


A 


7471 


1 


107 


PTRPHHFGFEAAA* YWHFVDW* LFLYV 
SIY**GS 


7412 


21313 


A 


7472 


2 


360 


VNTLLALLLIMITF*LPQLNGYIEKSTP 
YECGLDP IS PARVP FSIKLFLYAITFLL 
LDLE I ALLLPLP * ALQTTNLPLRAMSSL 

T T7T7T TT 7V T J*1T RW*T^DPT T\+ ■» TPT.TT Tir»T 

TQPTNNN 


7/117 


21314 


A 


7473 


423 


167 


VK.VHK.L-r LiPtj VAPHAt NPSTLGGRGGRI 
A*AQBFKTSLGNIVRPPSDTCNP IVLGL 
UA 


7414 


21315 


A 


7474 


1 


73 


IKPE*YFLFAYTILRSVPNKLGGV 


7415 


21316 


A 


7475 


131 


3 


ISWVGAEWLTRVI PAL* EAESGVSRGQE 
IETILANTVKPDAW 


741/; 




TV 


7/1 7£ 


*> 
2 


103 


Dt3\7TD CTTITTCC T7^U'TT , Drv\rKTVr , + T TDVT U 

rKVKoJLVjroolAnl lxCLiVriliVy^xXKxJjri 
ANGAK 


7417 


21318 


A 


7477 


2 


325 


NTLT.AT»TiLIIITF*LPQLIGYIEKSTPY 
ECGFDPISPARVPFS IKFFLVAITFLLF 
DLEIALLLPLP*ALQTTNLPLIVMSSLL 
LI 1 1 LALSLAYE * LQKGLTERDT 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

rag to first 

ominn <4<*irl 

uiiiiiiu <fuiu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

■ caiuuc u* 

peptide 
sequence 


Amino acid sequence <A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G!utamic Acid, ^Phenylalanine, 
&=Glycine, H=Histidine, Msoleucine, 
KNLysine, L=Leudne,M=Methionine, 
N=Asparagine, P=Proline, 
0=l?lufarain£- R— A rainine. S=Serine« 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=TTnknnwn *=Stnn rod on. /=DOSsible 
nucleotide deletion, V=possible 
nucleotide insertion 


7418 


21319 


A 


7478 


423 


170 


FFS FFFFFFFFFFFFFEIDKNVYLIM* C 
TNPQEKRNLLFCFLVMVSPVCITPHTYI 
CVCVCVI YIYI YI Y I YIYI YPFI FSCCD 


7419 


21320 


A 


7479 


433 


50 


RVFFFFFFTPBKKTFFPQE I YLFFS PFP 
L*NFFFSPKPLFFLGGFAPNFPPPKKVF 
FLKFPPGFFFSPPFKKKFFFFFPRFFFA 
PPRFFF*GPPSFFFFFFFFFFFFFFFFL 
TVKFTRGFRGFGGOI 

X> V XW A IVViT \oWV ^ 


7420 


21321 


■A. 


7480 


AM 

4U1 


117 
DJ 1 


F FFF FGKFFF POE FTVFFP P FP FKI FFF 

X L X A. X wlVb X X * \flXJ X JbwXXXXXXX ft- V-L. ******* 

SLRIiLFFWGGFAQFFPPQK*VFFSKFPR 
WFFFPPLFRKKFFFFFPG* FLAPQGFFL 
KGPPPFFFFFFFFFFFFFFFRP I* VGGR 
VGNNPNPNTNYLPN 


7421 


21322 


A 


7481 


l 


84 


KNFLPLTIAIjLI*HVSIPITISSIPPQT 


7422 


21323 


A 


7482 


562 


412 


LFSFHVSGTMAHTCNPSTLTG * GRIT * G 
liEFKTSLGNTVRSHRYRKKKIA 


7423 


21324 


A 


7483 


12 


Oil 

J.DO 


DT FHFT*? khyfgfeaaa* ywhfvdwrl 
FLYASIY**GSLL 


7424 


21325 


A 


1AOA 

7484 


IOC 


DO 


GKTFPTPKKFNFFYPFYLS I FFFFPKKLK 
FF*ElFPIFPPPKFIiFLPPNFPVLFFFF 
FLFWEFFFSFLFWFFFFPFFFFFFFFFF 
FFFFFFFF*KTTEFYFVNSNFQHTK 


7425 


21326 


A 


7485 


424 


6 


fffffsegsgxiredpigagalydygr* 

LVVVTG*TIjFVGVYIVIEIARGNRLCD 


7426 


21327 


A 


7486 


16 


129 


FXXXRXXLIXKBPSINLE*LYGCPPPYH 
TFEBPVYIKS 


7427 


21328 


A 


7487 


183 


387 


IASGLDI FLLLLIiETGSHSVTQAGVQWH 
NHDSLOPRIjTGIiK* SSRRS PRGSWDYRV 
LLCHPGWSAEA 


7428 


n 1 ion 
2 J 329 




1AOQ 

7488 


3d I 


ioy 


EKKIFFFSPRGYGPPPFFFLKAPPHIFF 

Ct f\ f\ 1 X X X l7iTft\W*VJ*T XT JT 1 i i» JJlw*t> *> IX ^ A- 

FFFFFFFFFFFFFFRGGGGDHKNFFFFF 
WGFLKNFPLFFFFPRV* I* I FFFFFFFF 
FFFFFFFFLGGGGGTIKIFFFFFGGF 


7429 


21330 


A 


7489 


47 


212 


THASALFL*IRTAYPRFRYDQLIHLL*K 
NFLPLTLALIiI*HVSIPITISSIPPQT 


7430 


21331 


A 


7490 


98 


273 


TSGSCCSAHILAFKP*AGMATFITKSL* 
IF 


7431 


21 332 




7491 


70 


Wj 


T.YGCP P P YHTFEEP VY I * EAFASKRKVL 
IVEEPSINIiE*LYGCPPPYHTFEEPVYI 
KSRQKRKESNPPKLVS SQPHGLHDFFKK 
KKKKKKKKKKKGGGPLKKPRGGPNLTGG 
GKKNFFFFRGGEKKPPGGFWKKTLFDGG 
GKLGTPPPRRF 
















O 1111 






00 




RS SLP ARPTRAV I L 1 1 FMI * EAFASKRL 
VLIVEEPSINIjE*LYGCPPPYHTFEEPV 
YIKSYV 


7433 


21334 


A 


7493 


2 


204 


KliPETVKKXKKKKFYS FRS I VMLLLYLF 
S * SLSFFLLYSLLFFLFLLS FYI YSFLC 
FPLCKTATTEH 


7434 


21335 


A 


7494 


2 


104 


TTLLLSRT*NKLT* LTPLIPSTLLSLGG 
LPPLTG 


7435 


21336 


A 


7495 


291 


160 


PRHF* IFVEMGSHYVAQAGLKLYASSNL 
LAWASQSAGVTGVSHGS LLITSCL 


7436 


21337 


A 


7496 


422 


180 


IiGAPQGVF*KGPPFSPSSSSSYSSSSSS 
PS * QNI FNKI FLFVCFVIiETES VTQVGV 
QWRDLGSIiQPttPPGFKRFFTITTH 


7437 


21338 


A 


7497 


244 


296 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lyslne, L=Leucine, M=Methionine, 
N=Asparagine, P= Proline, 
Q=G I u tannine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=TrvotODhan, Y^Tvrosine. 
X=Unknown, *=Stop codon,/=posslble 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFFFFFFFFFG 
GNF* FYK*LLGNRDGVCLPPSSYL 


7438 


21339 


A 


7498 


1 


410 


PTRPDQLIHLL* KNFLPLTLALLI *HVS 
IPITISSIPPYT 


7439 


21340 


A 


7499 


51 


478 


PQKYTL I MKI C I I P PGKG FR P VEMLAGG 

IFSSLLYKYNMIYCLMLCYYIGYIFIFY 
DMYYRO YS VLYYKLYNYVMIj r * I Y I T.YT , 
FMCI * KYI YFIFM* YKYI FI YLYLY* YK 
YIFYIYLCVYKNIYTLYLCNINIYLYTY 
IYININIY 


7440 


21341 


A 


7500 


3 


110 


FTSKHHFGFEAAA* YWHFVDEV* LFLYV 
SIY**GS 


7441 


21342 


A 


7501 


168 


327 


AFCYKATOCKTI * Y* YKHRYTEQWSRTK 
NPDINPYICX3QMIIFLQLKSLHKI 


7442 


21343 


A 


7502 


400 


159 


NNFSPLRNIiQGFFGPLKFPMGFQAGPPG 
*GFSFFLRSNFFLDVFRGKY*TFGF*DF 
FYCPS PNEVLGE P PLMGGNTTPGFP 


7443 


21344 


A 


7503 


401 


257 


TSQKKKKKKKICTPKKKKKNFJSK^ 

GTRGEPFKTGGGKEKKKFFKKKKKKGKG 

PP*PQLVTCRGGREKKKKNLWKKKGAF*K 

GTF 


7444 


21345 


A 


7504 


44 


344 


LLASIANLALPPT INLLGELS YLVTTFS 
* SNITLLLTGLNILVTALYSLYI FTTTQ 
WGSLTHHINNIKPSFTRENTLMFIHLSP 
ILLLSLNPDI ITGFS S 


7445 


21346 


A 


7505 


286 


252 


I *FAFHHVGQAGLEFLTSDLPASASQSA 
RITSVSQHAWPVHNIFYSLL 


7446 


21347 


A 


7506 


3 


213 


RELPGLKRFFLLSLLSSWD* WRMPSHPA 
TFCIFCRDEVLPCCPAGLLILLVWNLKN 
CCCYHHHRYYYCY 


7447 


21348 


A 


7507 


400 


20 


S LPQKKVSLLKLKSAVQLVMF * DRFSLC 
HPGWRA* GQSGVTTASSLRAR* SSCLTW 
EYRPSPLPLADFLFW*RRGLPI* PRWS 
NS *GQ VGLIiHQ PS KMLKFSSLS PCARP L 
FFFFNRSTNLRVQS 


7448 


21349 


A 


7508 


349 


227 


SDFLTSANPPASASKSFGITGVSHHTRP 
* PRFLKVNLI YS EMEVYN* AS FRHVD * G 
GLELLTIi* FTHLGLPKLWDYRREPPHPA 
LTKI FKGKPYIQ 


7449 


21350 


A 


7509 


371 


21 


FLVETGFHHVG*AGIiEPLASSNLPASAS 
QTAGITGMSHCAQPNPGSLLNKTMLAVA 
TEQWVGYMWPSVFGHITARPEGKRLSDC 
I*HSDTNVCVVCVCSGLIFVCFETGCRFA 
LQPG 


7450 


21351 


A 


7510 


330 


14 


DKVCSVTQARVQWHDHGSLLFLSSGLRQ 
SSHLSLLSSWDHRHMPPOIANFFYF* *R 
LRSHYVAQAGLKLLASSNPPTLTCQSAG 
ITGVSHNTWPKALFSVTDLKH 


7451 


21352 


A 


7511 


417 


33 


APPCGGGPPKRIFGPSRKRGKGPPPPKR 
FF*KNKPPKGGGKKSPPPPKNFFPKKKN 
LGPKKKKSGFWGRFAPPRAPGGGVFNFP 
FKAGGEKFFSRGEINPPRGAGKKKGFFL 
GKKNTKKKTPKKKDL 


7452 


21353 


A 


7512 


81 


296 


CLREGAS HS AAQ AGLHWHDHS S LHP +N P 
GLRGSSRLNLLS I WDYRCVPPSLANFES 
CLLKI ILGS PFPKNL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-AIanine 
OCysteine, D=Aspartic Acid, 
E=dutamic Acid, ^Phenylalanine, 
G=Gr/cine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Lencine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginme, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptiphan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, Y=possible 
nucleotide insertion 


7453 


21354 


A 


7513 


412 


279 


KKYTQGLGWAHAYNPSTIjGSQGRRI p * 
AQFETSLGNTGRPCLH 


7454 


21355 


A 


. 7514 


74 


126 


KI * DWVPKGAPI>QGPVF*NSTFRPRVRN 
RGPPWPRPGPTVWKG*NKKN* PAPRGSN 
PPFFFPGNPGPFKLRGWPGSS*GSFQNC 
KKAQAG*APT*KPGPPPPQNKV 


7455 


21356 


A 


7515 


332 


20 


GERQVGSYFFMNTERLFGMTKKF* KWIV 
SMVAQHSSS I SGVWVFRGQEIiAFPLS Pt) 
WQVDYESYTWRKLDPGSEETQTLVREYF 
SWEGAFQHVGKAFNQGKI FK 


7456 


21357 


A 


7516 


1 


167 


RTRGQRFTVLARIVLIS *HCDLPASASQ 
SAGITGVSHCTRPGLS FLYTPTAKHST 


7457 


21358 


A 


7517 


2 


223 


GRMEEMGSHYVA*AGLEFLGASDLPA*A 
PQRAKILLLLGGHKPGLTILPRLSSNSW 
PHVKWPRWPLKALGIiRG 


7458 


21359 


A 


7518 


151 


476 


SLGKFAVSFSFSFliK*RRALAVLPRLCS 
NSWP * AVLLPWLPRVGIAGMS YHTQPDY 
HFQARAVAB I LGLVSTQHSS VTGLPVEN 
SFRAGHGGPRLQSQHFGRLRWADC 


7459 


21360 


A 


7519 


3 


134 


CSFRLGAVAHI CNVTTLGGRGGRIA* AQ 
EFETCLDNIARPCLY 


7460 


21361 


A 


7520 


14 


219 


APSIHGFGFLYTYIHTHTHTHVYMYI * V 
YIHTH I ES I KCQPIiTQLQSNKKANR IVF 
RTQFMLTCYFMR 


7461 


21362 


A 


7521 


134 


334 


RKHSC* IFFNFCFYIiSLETRSHYVASTG 
LDLLAS SNPPALASQRARITGMSHHAPP 
VLTRFKKSLF 


7462 


21363 


A 


7522 


406 


145 


FLKTGFSPFGPFVF*TPPPGFPPPPPPP 
IFGVPGGGPFAPPLFFFFPQTKHFPPFL 
NPPSPPPQRGFFRGFFIiFPPAFFLKTPP 
FPF 


7463 


21364 


A 


7523 


3 


382 


HPQGEGGRLTS LE *NTSLRLKKKKKKKK 
KRGGPFKKKPWGAQI*PGKKKKNFFLKR 
GAKKNHLGNFGKKPYFWGGKKIiAKPPKK 
N*TFKGKKKFLRGKGGKKTPKPWLLKIF 
FSGFYIiKKIFPPGP 


7464 


21365 


A 


7524 


319 


106 


GTRS CS VT* AGVQWHDHS SLQP * TPWLK 
RSSWLRNRWDYKHEPPCLNNFYFFCRDG 
GLLCHPGWSQSPELK 


7465 


21366 


A 


7525 


433 


228 


MQPYNWEVNSSSQLSLSSARQRVTHVLT 
GNFF I K* ERWDLPVLPRL VS KS WPQVT F 
PPWPSKVLKLQV 


7466 


21367 


A 


7526 


26 


351 


CRSLVFSLPKPGLRFVS I LHIQVFSHFD 
FFFLTGNTQK*RK* TPFI FPA* IjFKTPA 
PKPGMVAHTCSPSYSGGLDGRIA*AQBF 
RTSLGNKTNPSLEGGRKKKKPSGS 


7467 


21368 


A 


7527 


438 


62 


QLSKIHRPPAGGRFFKSGGGFKPFCPVG 
LPPFGKKANQPTPKNPPKKQICPSGGGLS 
PMGKPP*FPCSMPSDSPGSVSFSRGGVA 
PAPKDYCPIV*AALIPSKKKKKSKPG*V 
KKGKPNLYSIYNK 


7468 


21369 


A 


7528 


3 


132 


FYYL*RRGFPVLPRLVLNSWLQVILLPW 
PPKVLN*AAHHPHG 


7469 


21370 


A 


7529 


2 


628 


FFFLPGVVNRVLLMGRGLGEANSREGRG 
PSQPRGMHGQLEVRGGRSQGDGWHGTLS 
PPWGRI YRGAP PTFAAPQAPKP FRQLLP 
M*LPGRPSSCLLPETPALSPLPSAEWGG 
FKLLTGTSETPGPSGSPSVGSANAQAAG 
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SEQ ID 
NO: of 
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SEQID 
NO: of 
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sequence 


M 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutatnic Acid, F=Phenylalanine, 
OGIycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine> P=Proline, 
Q=Glata mine, R=Arginine, S=Serine, 
Threonine, V^Valine, 
W=Tryptophan, Y«=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














ATTIVARGHTPCTRGPLDTHAHSSLRWH 
AHTRP PPRPLAHEHGTHTHAHPHTPGAW 
VGEGPFQGWGRA 


7470 


21371 


A 


7530 


63 


205 


GRPTRPRTRGVKIL*QFEHTKPTPGLPT 
LIALTTLLLPISPFILIIL 


7471 


21372 


A 


7531 


390 


56 


KTFFFFNIiIAFTEAKSVLLFLIjLFFETG 
SRSSAQPGVQWHHQSSLQPQPL* LKGS S 
CLSLLSN*DYRCAPPCQ*SPCLSLLKCW 
DYKREPPHPAKESHMQQSYLIYSLYIA 


7472 


21373 


A 


7532 


543 


335 


LAAYGGTCLQS * LFSRLRWEDHLSPGVQ 
GCNE LQLQHCS PAWATE * DPVFKGEKKE 
RVLVKVFS 


7473 


21374 


A 


7533 


3 


145 


WCDHGSLEPRSPGLKQSSCLSLPSNWDH 
SSEPLHTTSMWNF+ IFKFLNLK 


7474 


21375 


A 


7534 


386 


106 


SWVTDEITMBI *ECVLNDNSDTPYQNLW 
DTEKVVLRAKFTAIiNACWKKSERAQIRH 
TCVHTHTHTHTHTHTGSKCIiLRKYLIiLT 
IQQQNIRIN 


7475 


21376 


A 


7535 


398 


2 


SLQPLPPRVKCFSSPSPPRRWDYRGAPP 
RGGKFFFFGKKEFPPVGQGGF * FLPPGV 
LAPPPPQKGRISGGSPRPRPFFFFFKIR 
ISYPILAKQLKLCFLKKKKKSIQAIKLY 
HHQKNPITKSSVFWESQPRA 


7476 


21377 


A 


7536 


2 


344 


HSKCVPAEWTMAYYIRYLS ISSLLHIL 
ENIVHF?FHFLY*GPNNLFFFYFAGTRIi 
IQRSWK*PMII**VHVQIKREGQIFNK* 
IFCMIGRGG* ITCX3QEFETSLVNMVKPC 
LY 


7477 


21378 


A 


7537 


116 


224 


LHTHTHTHTHTHTHTHTQ FNKLP *IiFIP 
CGHIPRRY 


7478 


21379 


A 


7538 


1 


383 


WHERTHSRIIILFQGLETIiIiPLIAF* *L 
IiASIANLVLPPTINLIiGELFVLVTTFS * 
SNITLLLTGLNILVTALYSLYI FTTTQ* 
GSLTHHINNIKPLFTRENTLMFIHLSPI 
LLLSLNPDIITGFSS 


7479 


21380 


A 


7539 


31 


332 


DNYLSSYSSSGSWMRELILASQGIRWEP 
IVDRTPSHHRTHTHTHTHTHTHTHTHTV 
L FRLG P WTHAREPRGHI CGN * IRNQ S P * 
KNPRTRGKNLRTPHGW 




21381 


A 


7540 


199 


15 


NNVQIK+QFEHTKPTPFLPTLIALTTLlj 
LPISPFILIIL 


7481 


21382 


A 


7541 


337 


157 


AEIVPLNSSLGNRVRLH*MESYSVAQAG 
VQWHDLGSVQPLPPGFKRFSCFSLLSSW 
DSRLYFRCHD 


7482 


21383 


A 


7542 


28 


313 


RARIGSDQCLSAGTQWENKSQISGDFM 
WNDFRS *SVTRVGVRWRNHGSL* PWIPG 
LR*SSGFGLPNCWDYRDRNEVWGGKKEV 
TCPTVNLQKSQ 


7483 


21384 


A 


7543 


493 


180 


GGFRGS WLVPRDGGQGVGFCSLPPPPSR 
VKQFFCLRFPRRWGGREGSPRPSYFFFP 
*EKPRFFFLGRWLNFLGQGVGPPWPSQ 
RVGFPGLTPGPRASFFFNLI 


/484 


21385 


A 


7544 


403 


426 


FF*KFFITFHTLFFFFFFFFFFFFFFFF 
r err v r r J-ifilririroVjrfc J?£>r riQKIrTi 
KFPGQPEKFALPFGGGFFF*GKFPQFFF 
CFLVFFFFFFLFLENGVSLYCPGRSRTP 
GLKLSSRLLASLRAGITGVSH 


7485 


21386 


A 


7545 ! 


3 


335 


RHYSDYPDGYST*NILSSEDSLI SLTAA 
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E=Ghitamic Acid, ^Phenylalanine, 
G=Glycine, HMHistidine, Msoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, ^Proline, j 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
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LLMVFMI *EAFAYRRTGLIA* EPSMNLE 
* LYGCPPPYHTFEEPGYI KSRLKRKESN 
PPKLVSSQPHGLHDFFKKKKNRGGPF 


7486 


21387 


A | 


7546 


424 


140 


RRVKFVGGGAPQGPPPKRGVLPKIPREK 
VFPPPPG*NGPPGPAFKTPPKKKKNISS 
PPPGNWAPPGGLLKGPPPFFFFFFFFFF 
FFFFFLNPLK 


7487 


21388 


A 


7547 


2 


348 


FCHVAQSGLKIJiASSDPPTLASQSARI T 
GMSHHGGLK*GEFLGPSSNKGALLLWEA 
TWLQS FMKSASIiFVKPRPTFVYFVFAFV 
GYRD I YFTASSLIKVIQEGLKYTMKCAP 
LTL 


7488 


21389 


A 


7548 


2 


90 


RKQAYTHTNTHTHTHTHTH*KKTGIHAH 
KHTHTHTHTHTLKAEKQIEGGK 


7489 


21390 


A 


7549 


256 


405 


CELYSGKEMELVFGLFIFTGVTRLETFY 
KKECSSFFHSQSRGHKLQPSILNEWWHI 
RNPLGLTHFLLL* KTGVABYKNSLNWH 
HPFFLS YAVSFLLQGWSFAMLS KLGSNS 
* VQAR CLPQS PKVLGLKES PQERTVNVS 
SIRGKKWSWYLDYLFSCX5LQGLKLFIRK 
SVHHSS I PRAEGINCNHQY 


7490 


21391 


A 


7550 


2 


362 


ILIMNTTJALI.T,IIITF*LPQIiNGYVEK 
STPYECGFDPISPARVPLCIKFFLAAIT 
FLLYELE IALLLPLP * ALQTTNLPLIGM 
SSLLLI I ILAI*SIiAYE*LQKGLD *AELK 
EEQKTLQC 


7491 


21392 


A 


7551 


1 


317 


TTFDNSALLFFWDGGBIE FHS VSQAGVQ 
WHNLCSL*PQPPGFKQKLILPQIVKENV 
SKISSQLLFSRVNINISPSEQCITPPSL 
QLARIGIFYAAKIHLTKGLRG 


7492 


21393 


A 


7552 


479 


183 


YLFHVHTTASHSNGACTGPTVGD I TVS S 
TL* SMTEQVTMPLAS AVSDGTVPSVRTA 
SRGSEQAAESMVSSEHIEIIiEHAGELVI 
AS PEGQLEVQTVTV 


7493 


21394 


A 


7553 


454 


105 


KPPWGNKIRPPLFPGRKVPPPRF*NPVR 
GSPPKK*FVFPVGRVLGTGVTPLSPFLK 
TTPLLLWDPPLSQPPGGVKPENSIiYPGK 
KRFR*PKFPPCPPAWGTKRKPPIiQKKKK 
RQVL 


7494 


21395 


A 


7554 


439 


1 


LLKRCVRKDSPPPQNKIFFFILKKFVFF 
FAPY* VRKFYFLTAHFGKRPPQ I YIFGP 
PPVFLIFCCFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFPFFNKQVFI 
ERLtLCARHCFRP P P EE KTKNKI EAANPN 
SEGEBR 


7495 


21396 


A 


7555 


323 


82 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFR* * AYL 
VRSPLLFIFVFSYF*LI*HVYHRYSVYI 
ELDVLLTGSKIILI 


7496 


21397 


A 


7556 


194 


412 


KMYRTSLGHLVEAKKK2CKKKKKK3CKKKK 
KKKKKKKKKKKKKKDSRGGVV* KKFGGG 
HNTRGVKI I FFFSLGG 


7497 


21398 


A 


7557 


414 


219 


KGFKKLFPPPVF* FFLCPPPI FLKGFP * 
GRKPPPPKNPQGGPPFFFFQTPPPLFFF 
FFFFFFFF 


7498 


21399 


A 


7558 


377 


25 


THVGGVFWGKFFFSRRVFFFFYHLIQIL 
SPPPQKRPPPKGTPEG*IFPLFKEINFF 
FF*DFFFAPPPFFFFFFLFFFFFFFFFF 
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FFFFFFFFFFFFFFFFFFFFFLFCVKTGV 
YCVPPIAVILYR 


7499 


21400 


A 


7559 


386 


2 


FFFFSFGEIFPPPGISGFFPPFPL*NFF 
FPLRPliFFLGGGAPFFPPPKKGFFPKIP 
RGVFFPPPLREKFFFSPPP* IWAPPGFF 
LKGPPIiFFFFFFFFFFFFFFYRKARTKP 
ICLWGDVSPSKHFQCI 


7500 


21401 


A 


7560 


1 


396 


KICHRILRPIKKVK*EKKKKKKKKKKKK 
KKKKKKXKKKJCKKKKKKKKKKKRGGGAS 
LKIPSGGPQLPRGGEKNPPPFLGGEKKP 
PGVFLKKHLFLGGGHFGPPPPKKNSPWG 
EKKNFWGGGGKKKQNIPGGKKI PSHKKK 


7501 


21402 


A 


7561 


1 


390 


NYVFKEFSDKLGAVAYTCNPSTIjGG*GL 
WIT+GQEFETSLGNMVKPYLYQKYQKKK 
KKKKKKKKKGGGPFKKTLGGAQFWGGGE 
KKPFFFLGGVKKPPRGFLGKNPFLGGGK 
MGAP P P PKI KALGE KKNF 


7502 


21403 


A 


7562 


406 


182 


FVFFFFFXXXFXXXXFXFXXXF*TXXFF 
FFXXXXFFFFFFFFVCFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFF 


7503 


21404 


A 


7563 


2 


113 


FHFTS KHHFGFEAAA+ YWHFVD W * LFL 
YVSIY**GS 


7504 


21405 


A 


7564 


313 


97 


S PTYPCYIRNS AQS FSLIAKS FCSHPHT 
YHHDPHLBKSLTYCPQS PS* PI IPPVTC 
TYINRWPEATEEPQNK 


7505 


21406 


A 


7565 


3 


144 


DAWVFVATGFHGLHVIIGSTFIiTICFIR 
QLI FHFTSKHHFGFEAAA* YWHFVDGV* 
LFLYVSI Y* *GSTFLTICFIRQLIFHFT 
SKHHFGFEAAA 


7506 


21407 


A 


7566 


1 


130 


FIRQLIFHFTSKHHFGFEAAA* YWHFVD 
W*IiFLYVSIY**GS 


7507 


21408 


A 


7567 


434 


1 


PDLNKVI FFPHRKKKEPVKCHI VGKNLR 
NEMYS IQOTNSLPRRER*RKTRTCI I FP 
HKIKTSEHSFDIKEKH*NYVTLCKVLRK 
LSTNQRQS * I FLS TNKNDTLGRARWLTP 
AI PALWEAEAGGS * GQEIETI LANKVKP 
RGRV 


7508 


21409 


A 


7568 


441 


86 


NFS FREKGGEFVPPPPLKI FFFP PPPNF 
FGGGGPPSPPPQKKVFFPKPPRGFFFPP 
PKKKKNFFPPPQKIGPPPSFF*TPPPPF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFL 
MFVCLY 


7509 


21410 


A 


7569 


1 


155 


PTRPDLRSLRFEAKSPNSRRTIiHKPGVT 
I *EAFASKRKVLIVEEPSINLE*LYDYG 
FFYYKI KEPVYFYSNQKKE 


7510 


21411 


A 


7570 


434 


1 


SPTEAGGKNFFKKNPGRKTPPKKKKNIF 
FSPLTPKKFFFPPRG* ILGGGGGPNCPP 
PKKGVFPKNPQGVFNTPPKKKKKKFSPP 
GKNGGPPGVFLKGPPPFFFFFFFFFFFF 
FFLRQSHTLLPRLECSGA 


7511 


21412 


A ! 


7571. 


■493 


196 


SSLDCS * SLQGS I * FLTLMMLAVDFVDV 
LY* V* RVLYVLVQVAITKCHI VSGLNNR 
HSFLTAVKARKS KI KVPADLVPGEVSLP 
SLFS 


7512 ' 


21413 


A 


7572 


1 


239' 


■KGLTILDAIKNICDS*KGVKILIFSEVW 
KKL VPTLTNDFER FNTSVEEVTAHWET 
ARELELEWPQDVTELPQSHDQT 


7513 


21414 


A 


7573 


391 


32 


FFPQPSPPGVKGFFCLNLPNNWGFRAGP 
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nucleotide deletion, V=possible 
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PPPGYFVFLVNKGVFPVGPPGFELPSPS 
♦VAPPASQKGGVSGVSPCPRVWVPFFKK 
KKKLRGVEQRCTMHLLVYS KNVPVKPQK 
KSSIGSL 


7514 


21415 


A 


7574 




383 


PRVRSTAFSS IAHITRDVNYG* IIRYLH 
ANGAKKKK3CKKKKKKKKKRGGALKKKPG 
GGQKKGGGEKKNFFFRGGEKKKPGGIFE 
KKTFFGGGENGAHP PQKKKTPGGKKKI L 
RGKGGKKPLYPWGEK 


7515 




A 


7575 


407 




KI FFKNSPLWKI SHPPTYRGFFP PFPP * 
NFFFLPGPYFFLGGCSHLGPPPKKGFFQ 
KIPPGLFLSPPFKKKIYLFPPRIFLAPP 
GVFLKAPPPFFFFFFFFFFFFFFFFF 


7516 


21417 


A 


7576 


1 


119 


LIFHFTSKHHFGFEAAA*YWHFVNW*L 
FLYVSIY**GS 


7517 


7141ft 


A 


7577 


910 




• NDISANTAIKKFWKLPKCPTTEGWLNT* 
LYIHLVEYYATLKVCGDLYVR 


7518 


21419 


A 


7578 


3 


287 


HASAQLNGYIEKSTPYERGFDPISPARV 
PFFIKFLIiVAITFLIiLELEIALLIjPLP* 
ALQTTNLPLIVMPSLJaLSIIIALiRLAYE 
*LQKGLE*AE 


7519 


21420 


A 


7579 


10 


125 


ALQTTNLPLVVMSSLXiLIIILT*ALrQTT 
NLPLWMSSLLLI I ILTL I LAYEGLQKG 
LN*AE 


7590 


71471 

Alt* 1 


A 


75X0 


3 


299 


DAWVRLLDFDF*VELPTEARIRIIITSQ 
DVLHS * ADPTLGLKTDAI PGRLNQTTFT 
ATRPGVYYGQCSE I CGANHS FMP I ALEL 
VPLKIFEIGPVFTL 


7521 


21422 


A 


7581 


3 


116 


AFAS KRTGLGVEEPSLTLE * IjGGGPPP * 
HTLEEPVNT 


7522 


21423 


A 


7582 


398 


2 


RVFPAPKNSLQIFPPLFFFWGSPWKFFF 
TPFPHFVFWFGNFKFNPFFAFLFRLPFP 
GEKIPLVFFFFFTD*NFVENFFFFLSLL 
FFFGFFFFENM* S * LF*SPANFNV*VTH 
LFIFMVFFHPDFWSFGPILW 


7523 


21424 




7583 


288 


9 


RGSKNLEKNSFPYKVNNHGNGS PLFPHL 
FFSPRGVGPFFFFFETRSRSVIQAGVQW 
HDLSSPPGLK*SSHLSIQSSWBY*RAPP 
HPANFVYF 


7524 




A 


75R4 

# JOT 


\ 


124 


RDGGFTMLPRPILLTGLKQSSCISLPK* 
WDYRCEPLHLASC 


7525 


21426 


A 


7585 


339 


76 


THPGHNG ISVSPKKKKI PRGGGP PPLFP 
PLPRVGGGDSL* PRKGGFN*TKPAPFPS 
SWGKKKELPFSKKKKKKKRKKGS I STPF 
SRIB 


7526 


21427 


A 


7586 


205 


87 


SISLSLSLSHTHTHTHTHTHTLSHPRCV 
PQPIGLPISKWAKVKELIT*R 


7527 


21428 


A 


7587 


353 


256 


PHPS VTTLVTSQQYQDPPPAKRL* LAEG 
SDDC 




9149Q 


A 


7588 




195 


QICRHRWHINFLMSMGSIMITPPSFLF 
LNFL*SYVAQAGLKKuSQSDPPASASQS 
AEIRGMSHNAWPHF*YFVELVHFIYINK 
FVGIELFILTS 


7529 


21430 


A 


7589 


76 


205 


RKKPEPEKTCFDNITGNTYPCLSTYEHD 
SD*YYVNRT*LCAER 


7530 


21431 


A 


7590 


218 


20 


LSLYLASHLWLKPPHPLEWAPSKSSLRL 
GTVAS PCNPSTLGGRGGWVT * GQESETS 
LANMAKPCFY 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7531 


21432 


A 


7591 


142 


12 


QIDYYYYIiFI*RKGLTLMI*RLVSNSWLH 
IILLPWPPKVLGLQM 


7532 


21433 


A 


7592 


2 


224 


QIIEALLITILLGKYFTLLQA*EYSKAP 
FTISDGIYGSTFFVTTGFHGLHGI IGST 
FLTICYNPQLIFHFTSEHKFGIiQAAP*Y 
WLFVDVARLFVPAS IY* *GSLFYPPTSI* 
RVLQGSLHHFRRDLRLNIFCNHRIjPRTP 
RHYWLNVPHYLLQPPTNIS LYLRT 


7533 


21434 


A 


7593 


1 


323 


HAYGNTLVA*LWTMTF* LPLVDGYIKK 
FTPYEGGFEPKSAARVLFS IKVFLVAIT 
FLlLSDLQS ALLLPLP * AVQTTNLPLIVM 
AALLLDI I LALSLAYA* LLKGLE 


7534 


21435 


A 


7594 


1 


104 


RACVCVCGVCVCVLCLLKMCVCVCWCV 
CVFCAC*KMCVCVCWCVCVFC!AC 


7535 


21436 


A 


7595 


383 


43 


VFISBTLCBHVAWSQTTIESLGLGTVAH 
TLNASTLRG*GRMIA*GQ*SKTRRGNMD 
SISSHSQTIAQSPHLLGRHQPKWCMCVQ 
AAEAVLKGQVETDAS FQIQKI P INSLIN 
L 


7536 


21437 


A 


7596 


2 


331 


WPPTGITPLNPLEGPLIiNTGGLLA*GVS 
MT*AHHSLGENNRNQVIQALVITILLGL 
YFTLLRASEYFESPFTISDGVDRSTFFV 
TTGFHGLHVIIGSTLLSIWFIRQLICNF 
TSKHHFGFKAVA*YWLNFTKYLVHPPTN 
M 


7537 


21438 


A 


7597 


384 


23 


LFKTKQYRFLLPPFPLKIFFFPliKA*IF 
LGGFSHYFPPPKKGFPPKSPPWF* IPPP 
YRKKHFCSHTPKNLAPRWFF* KRPPHFF 
IFFF* *RWGLAMLPRLFSNS * AQAVLLL 
QSPEVLGL 


7538 


21439 


A 


7598 


401 


189 


RVLPCFAGWS*TPGLKCPTCIiGL* KCWD 
YRREPPCLIRFYLFVYFLRQGLAMLPSL 
VLNSWP*VTLLP*SP 


7539 


21440 


A 


7599 


81 


368 


KCVI YPFLS ITLGKAKYDFFFFLEKELA 
FFPQGEMQGGNLG*LKPPPLR*RDFSCL 
AIiPRGGNGRGAP PS PTNFG FLGGNGVYP 
SGPGGFETPDLK 


7540 


21441 


A 


7600 


264 


25 


AVEHPQLRLFCSHY*NNIVEERLGVLAR 
T(3JPCPLEDKAAWIT*RHEFETSLANMV 
IFCLDIGSKSFLSMRFSRCLLWKL | 


7541 


21442 


A 


7601 


1 


196 


WEPSLVGETNVNS FNQKYINWPGAVAHT 
YNAGTLGGQGGWIT*GQELEITIiSKKVK 
PSTYFTNTR 


7542 


21443 


A 


7602 


315 


82 


TCTQVFIALFITAKKFKLICPLTDNWIS 
RVWYVHTMKHISAI KRKAVIHATMWVKL 
ENIILSEIKQSQKATGCMISFI 


7543 


21444 


A 


7603 


334 


118 


PKIKTEGGPPPKKNAGGMYK*KKFILVL 
AAPPHPPFGVFSVFS FYSFWFFRFFFFF 
FFFFFFFFFFFFFFLR 


7544 


21445 


A 


7604 


3 


288 


DAYTT*NILSSVGSFISLTAVILIIFMI 
♦EASSSSSSPPGGLRPRAFPVRPPGAGL 
VF* VSVPPDLPQCSLAGLQVLRFMELAR 
LAPHIRWIIQ 


7545 


21446 


A 


7605 


354 


33 


EGRARYTRVPQRPRFHLLGIFFAPQLGK 
KKKFSFSKKKKKKNQLWLGAVSHANNLS 
TKGG*GRGSRFT*A*EFRTSQKLGTKGN 
MVKPI STNTFKNERGGAELSQL 


7546 


21447 


A 


1606 


401 


161 


YLVSHLLQIILPPALFWIFFP*DRVLL 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














CGPGWSAWWSYCSLCUjGSNSSPCQSL 
PSSWDYRHLLWRFLMPRGILCVYV 


7547 


21448 


A 


7607 


354 


50 


SGAQDPAGQHGQTPFSQKVPKKVYPAWW 
CMWPATQEA*IiKKSIiEPWRRRFLIjRBA 
FFVYLGVWBSNGEPPPLGGGGSVFLEAL 
EKSVGFSSLGFGWVFFF 


7548 


21449 


A 


7608 


370 


115 


TIKNICNSWEVKIPRFTGVWKKLIPATL 
MDDLEGFKTLVEEPTEDW* ITGELEVE 
PENVTEFLQSHDQTSTOEGIiIiIiMDBQRR 
W 


7549 


21450 


A 


7609 


120 


358 


TTAVYLGHYCQSP * ILNGTVNT * PPWH 
KDP ILFLTP FPCFHAS SAMNLQI»S HI S C 
NSKATPHPLGYQQTYPPLTVHMT 


7550 


21451 


A 


7610 


3 


164 


" HE* AS I P *NTHTHTHTHTHTHTHTHFYK 
EENLSHLWNISCIFLGAHKGKKKMN 


7551 


21452 


A 


7611 


85 


324 


FRFFAFFFF* KKISFCPQAGGQGGILPS 
LDPPPPGLKKFSGLTPPRSWN*GPCPPP 
GLIGVFLKKRGFPLVGREGPNIiWT 


7552 


21453 


A 


7612 


268 


317 


FLENCFNPGGGGCSQPKLGPCPPAWGAK 
*ASLSKKKKNKNKN*LSYKAPSYGYKGK 


7553 


21454 


A 


7613 


101 


339 


AVPLTMVKIHALWKRVWRFHNKSKL*LP 
CDPAISLVSMYPKEMKSVCLKDVCSPRIi 
ITGPLTIAGMWNPPNRSSMDYSG 


7554 


21455 


A 


7614 


366 


208 


LELLTSGDLPASASQSAGITGISHRARP 
GTkFF* AVNGGGNQ VFLFLRVIiNGL 


7555 


21456 


A 


7615 


30 


328 


NYCLDFHGETELGTTHFLTHPSTRAMVF 
EPYCQAL* IL*GVINT*PTVLHRNRSHI 
KAACXCIiQARTVVNIjQISHINWKY*TTP 
HPL*YQQTYPALEYI 


7556 


21457 


A 


7616 


39 


159 


TPGLRQFFHLGLPRCWDYRHEPPRPCPN 
MYLI*ISF*CV 


7557 


21458 


A 


7617 


290 


91 


KMSFLIFFRDMASLCCPGWT*TPGLKQS 
SCFHLPRSWDHRCTSLHLAVKTFLYYFL 
KMFYYQNVWH 


7558 


21459 


A 


7618 


37 


246 


TQQEIjVY I YK. YIOTHTHXHTHTHTHTHS 
HTHYWIYTWKHTLKGPKLF*EKTKMGST 
PIISWKERLSLYRS 


7559 


21460 


A 


7619 


23 


190 


I PPQPCPFLLSLFCT* IDMSCNMCACVC 
VOTCTWCVICDMVLLPFCFCKLECSK 


7560 


21461 


A 


7620 


367 


2 


FFSPKALIFLGGGGPIFPPPKKSFFSKI 
PPGGFFPLPGFLKPPPPPLF*NPPIjKKK 
WFFFPPPENWAPPRVFFKGPPPFFFFFF 
FFFFFFFWSSRFLI FPQFFFI*NNKIGKK 
KIFRHLGAG 


7561 


21462 


A 


7621 


6 


25 


SVILQS III FIATTNLLGLLPHSLTPTT 
QLSINLAMAIPL*AGAEIIGFRSKIKNA 
LAHFLPQGTPTPLIPIIiVIIETISLLIQ 
PIALAERLTANTTAGHLLMHIjIGSTIXiA 
ISTI *LLYYNP 


7562 


21463 


A 


7622 


383 


67 


AFLNPGKKPFPPGFKRFFCPPPPKKWGK 
NRGPPPRGKI FFFFFFVKTKFFHVGPGG 
F* ILTSGDPPGQQGGLGGFFFFFQIGGT 
GDLNIiSHKSPMPFTINTHNKG 


7563 


21464 


A 


7623 


372 


35 


TGGFWGVFFFFFFKQGFPFKSTGGNUGP 
KNPCPKGPRVSPPPPPF*NRGPQGGPPP 
PGKKFPFFFFFFFIETGFPHVALAGLEL 
PGSGNLPTSVSQSAGIPGISTWPNGVFQ 


7564 


21465 


A 


7624 


287 


6 


LSGVEWVDHMINICLFFKKLPPCV*KWL 
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Amino acid sequence (A=AJanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, (^Phenylalanine, 
G=GIycine, H-Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M^Methionine, 
N-Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=^erine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon, /==possible 
nucleotide deletion, \=possible 
nucleotide insertion 














YQPVAVAHACNPSTLGGRDRWIPGGQEF 
KTTCLSLPKRWAHRREPLYLAFCPFYNG 
IGCLLIIGF 


7565 


21466 


A 


7625 


271 


277 


NRGKGKTGSLGILEKNPFFGGGPFWANP 
PP * GIiKFWGGVGPKRPPPKKRVFFQNSQ 
GTSFPLPPVLKSGPGLVLKKAPQKEKNI 
TFQPPGKFGPPRGSLKRAPPFFFFFFFS 
LRKSSVIEK 


7566 


21467 


A 


7626 


2 


269 


LLGELSVLVC * FPPKHTTFLLTGLS I LF 
TGLYSLYILTTTQWGSLTHHI INLKPLF 
TPKNTLMFIHLYPILLLSLNPDIIKKKK 
KGGAF 


7567 


21468 


A 


7627 


397 


164 


FPPPKKVFFQKNPKKKFSPPRVF*TPPP 
PPFFLPPPKKKKFFFSPPP*IWPPPGIF 
*KPPPLFFFFFFFFFFFFFFK 


7568 


21469 


A 


7628 


3 


324 


DGMNVSTLYGATGSHGIiHDIMGSALLTI 
WYIRQLILHFTSKHHFGFEAAA*YWHFV 
DVD * LFLCGS I Y * *GSPGGPKFSRGGKG 
IAVPHWGDD 


"7569 


21470 


A 


7629 


2 


246 


SGLGTTLS * KN I S VLLTGLKI LVTGLYS " 
VY I FTATQWGS LTHHINKI KP * LTRENT 
LMFIHLCPILLLSLNPDISYPGFTS 


7570 


21471 


A 


7630 


112 


290 


LISGKGQWTQMRLPVLVTLFTDSFSK* K 
NPSTIiGEQCGRIT*GQELETSLGNIVRP 
HLY 


7571 


21472 


A 


7631 


248 


2 


KVPASGKRPEPHWYP YHAVPGCGR I PAT 
AIWTQKWYQGFPPALSPRLECSGAISAH 
CYFGFPGPSDS PAS PS * LAGTTGTRP 


7572 


21473 


A 


7632 


383 


106 


GWGQCFKPAI PALERPNPQECFRPGVLN 
QP * PQNKSPFFFFFFKKKKKYI YICMVV 
HAYNPSYSGG*GGRIT*AQKFKTSLNNT 
VRPHLYKK 


7573 


21474 


A 


7633 


435 


133 


YACLGLPKCWDSRHEPPGLANFCIFSKN 
RGFPISDRWINS WPRVFG* PKPPKMGG 
FQTLVQGASLGFLLIFLKTGFPFFSQHE 
EQGWDTS S FQHRFFRGK 


7574 


21475 


A 


7634 


1 


209 


NAYRIVILCQKLFPLLSGKMNFKKTRCW 
IjGMVAHACS PSTLGG*GGRI A* AQE fgt 
SLGSILRPCHCKK 


7575 


21476 


A 


7635 


117 


495 


STSFIDKVQLRHSFIjFFSLFFETKSCSL 
TQTGVQWHDFGSL * PPPPRFK* FSHLGP 
LSSWDYRHAPPRPASFFVFLVEMRFHHV 
GQAGLELLTSGDPPASASQSAGI TGMSH 
HALAHLYFSSGKS 


7576 


21477 


A 


7636 


1 


241 


LPGSSDSPTSASQVAGITGMCHHAQLFF 
FLAF*LIFNF* FFFGNSFFFVPQAGGHW 
GIGG*LNPPPPGLKRFSCLTLWGG 


7577 


21478 


A 


7637 


3 


93 


EAEAGESLEPGDRGCSKLRS CHYTPAWQ - 
QSVTLVSKKETKNTKTSEVP FGVLNLLY 
KVSINILIFKEIFYQAPLILV*DNPDSS 
TLSSALQPGQQSKTVSNKTKQKTVS* PG 
NRV 


7578 


21479 


A 


7638 


46 


228 


GI PHIQPSKS VFNNVHGPGTVAHTRNPN 
TLEY*GGSIT*GHEFETSLGNIVRLPPP 
SLQSI 


7579 


21480 


A 


7639 


1212 


1023 


KNHSNEWIKKM*YICTMEY*AIRRNEIM 
AFAAT+MKLETIILSEVTQEWKTKYCMF 
SLVSGS 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
£=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Iin known, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


7580 


21481 


A 


7640 


479 


318 


AEFLNLGDGGCSEPR*HHCTPAWATRAK 
FHLKRKKK 


7581 


21482 


A 


7641 


470 


172 


RNIRPRENQYPVFMVAFPLIAKNW*QPK 
CPSSEEAMNKLRCIHTTAVLHSSEEEQP 
TDTHNNSDLRS IAPS WRSQTRAAAHRMI 
PWHSGKGKTPSTENA 


7582 


21483 


A ! 


7642 


2 • 


201 


AGAPPPAWLPPCRLISDC*ASNQRDSVG 
VGPS EP YATVROTiELLLCQMHHDMCTY 
RFSIRIVLNL 


7583 


21484 


A 


7643 


521 


385 


GGSPLETH* YTSQGGGI P YTDLTGHHPS 
QGRIQEAPKLTHL 


7584 


21485 


A 


7644 


331 


3 


DLVPKKGGGGKKTTPRI KKFPPKGEDFF 
FFTPHYFVVFVVVFFWGGAFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFT*DYYIS 
F*EERFNKFYRCQSTLVQIGHTSNL 


7585 


21486 


A 


7645 


88 


367 


HCS FIiLIiGKKKKKKKKKKKKKKKKKKKK 
KKKKTKKNEYKNLLGGAPP * KPPSGALY 
KYRRENFIPFSFERVSKYPPRDLCKKNL 
FWGGLNIPS 


7586 


21487 


A 


7646 


378 


37 


FFFPXFFFFFS PPPXFFFLFXFFXXPFF 
XXSFFSLPPLPLXXPPPPPPLFFFFFFF 














FFFFFFFFFFFFFFFFFFFFFFFFFFFF 














VL 


7587 


21488 


A 


7647 


34 


293 


TGIPGVISAIFMI*EAFVSKRKVLIGKK 
PS INLE * LYGCPPP YHTFEE PVYIKSTQ 
KRKESNPPKLGSSQPHGLHDFFKEKKKI 
IK 


7588 


21489 


A 


7648 


22 


106 


KNFLPLTZjTLL+KNFLPLTLTLLI*YGS 
IPITISSIPPQT 


7589 


21490 


A 


7649 


1 


231 


FDRTISKDISK* MVLVSKKKKKKQKKKK 














GEEKI IFFFLGGGKKNTWGFF 


7590 


21491 


A 


7650 


79 


286 


WSRCLVSLRNPCI IKTCS*AKKKKKKK 
KKKKKKKKKKKWGGAFLKIPWGGPIFPG 
GGKFFFFFFGGGY 














7591 


21492 


A 


7651 


57 ■ 


375 


SGKSEFHRVPQWPGTGADACNPSTLGGR 
GRWIT*GQEFBTSPQGDPISTKXKKKKR 
GALLKDSLGGPNIiPGPGNItKVFSFRGGI 
LKPTWEFWEGTFILGGEKIGPN 


7592 


21493 


A 


7652 


3 


357 


IiAFLLAIiSKITHASIPIPVSSPSKSPRS 
KXaTKKKKKKKKKKKKRGGPLKKTIiGGPK 
INGGKKKKIFFFKGGEKKTPGGILEKKL 
FLGGGKMGPNPPKKIKPLGEKKNF* GEK 
GEKKP 


7593 


21494 


A 


7653 


113 


378 


MGAFNLG P FLiLMGGGNPSGPNGHWVTiGA 
GSYLGPGKTPPKKSQKGGPPPDL*GRGG 
QCSPGTKPGGGGEKTGEjAPPFTKSPLGF 
LQKK 


7594 


21495 


A 


7654 


341 


147 


NFFFSLKPFIFFGGFCPIFPPPKKSFFS 
KIPPWFFSPPF*EKIFFFPPPIiNFAPP 
RVFFKGPP 


7595 


21496 


A 


7655 


395 


3 


FFFFFFFFFFFFRGKSE*FVLFPVSPAP 
SLRG 


7596 


21497 


A 


7656 


2 


162 


ESKGYTHTHTHTHIYTHIYTHTYTPREH 
* PKKADVAMLILNKLFFKPRCVALL 


7597 


21498 


A 


7657 


333 


28 


IiEFFFNFFFSYFLFFS*RVWGPERPPPP 
LKKAV* KFFIiFD I FLFFFFF I FGLGFFF 
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OCysteine, D=>Aspara'c Acid, 
E=Clutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lyslne, L=Leucine, MNMethionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine f R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFLEHIRLLMFVTFSSHR 


7598 


"21499 


A 


7658 


1 


137 


HSSIX3DRVRPCLKKKKKJCKKKKKKKKKK 
KKKKKKKKKKKKGGGLF*EKKKKKKKKK 
KKKKKKKKKKKGGGLF 


"7599 


21500 


A 


7659 


236 


32 


1 1 FLI KFLYF* FI F I * F* FFLTFFYPKR 
LKILFFFFFFFFFFFFFFFPFF*KNPIK 
NI IYFFI KYLFYYLKFFI YIKTFIjFIiKI 


7600 


21501 


A 


7660 


301 


65 


KAPPKIFWFFFFVFFFFFVSFYIFFFFF 
FLFFFFFFFFFFFFFFFFFFFFFFFWKS 
LAAT* FLRGFETF* LAYILKTT ! 


7601 


21502 


A 


7661 


70 


273 


KHSPPHKPASD *OTELTIKKKKKKKKKK 

KKKXKKKKKKKKKKKKKGGGALIK^ 

GPIHRGGEKFFFSFLRRD 


7602 


21503 


A 


7662 


1 


353 


IIjIINTLIiAIiLLIIITFGLPQLSGCIjQK 
STPYBGGSDPIS PARGRFSLKGPLQAMT 
FIiti*ELEIALL* PIjP*ALQTTNIiPLIDM 
GCLLLNI I LCLS LAYE * LP * GLDCS RYH 
SQRXQ 


7603 


21504 


A 


"7663 


1 


131 


FIRQLIFHFTSKHHFGFEAAA+YWHFVD 
VA*LFLYVSIY**GS 


7604 


21505 


A 


7664 


2 


192 


PLSQSLY*IiLAADLLILT* IGGQPVSYP 
FTIIGQVASVLYFTTILILIPTISIiIEN 
KILKWAY 


7605 


21506 


A 


7665 


356 


2 


TFFFLSS PGGSCGS PTPARPKKNPPPWG 
GGLPLFSRRGALFPKNFFWGGYPFFFLF 
♦KKPPFFPPPGPQSPVTSPKDWPPLRM 
PPPRPHVRPLGLPKKSFSSPRWEKQVKK 
TKKRAA 


"7606 


21507 


A 


7666 


312 


2 


GFFFFFFFFFFFL*DRVLLLLPRLECNG 
VISAH 


7607 


21508 


A 


7667 


1 


233 


FWLSSNAEFDSANSCECLEVQRMIQDQD 
S FPTYH YFDMYVCI YV* *RSLAVS PGLV 
SNSWPQAILQPQPPQSLGLQE 


7608 


21509 


A 


7668 


174 


295 


IFFLPFCGEHSLAVLPRLVLNSWTQAIL 
PPLPP*ILGLQA 


7609 


21510 


A 


7669 


183 


330 


NKFKLYWSPVAMAHACNPRTHRGGWIT* 
CEEFETTLANMVNMVKPCLY 


7610 


21511 


A 


7670 


3 


319 


TSNlTbliAIiLLI I ITF*I*PQIiNGYI EKST 
PYESGFDPIS PARVPFYIKFFLVAITFL 
LFDLEIALLLPLP*AVQTTNLPLIVMSS 
LLLI I ILPLALAYE * LQKGLD 


7611 


21512 


A 


7671 


1 


313 


ARGERBRERERERERERERERESGGGGP 
TQTDCKGGRNT*RGREIYRESE*DDRPP 
FLPTYRVNLQRPVGLRRLKGAGDKTFCL 
ILTLARDYVWPDYRMKRADHM 


7612 


21513 


A 


7672 


3 


91 


TRRERERDR*REREREREREIFREKNSQ 
S 


7613 


21514 


A 


7673 


1 


115 


DELANLFI I * KAGFPVLPRLVLNS WLQV 
ILLPWPPRLN 


7614 


21515 


A 


7674 


3 


232 


TRRERERERERERERERERERERERERE 
RYRE*DRERERERERDCVWGGGVISLSR 
AVALSGAPAVGRQTRERISR 




21516 


A 


7675 


1 


234 


ARGERERERERERERERERERERERERE 
RE IERERERLTQRERERERGRGVWHAPL 
AIERG*KPQGGFWWKEERVKKEPLGKLL 


7616 


21517 


A 


7676 


I 


262 


ARGERERERERERERERERERERERERE 
RERERERERERERERARAPQNL* ERWAH 
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nucleotide insertion 














TPPAPGVSLSWREHTRGGPPWGKKEGGW 
GRLGHPP PSILRAKGG 


7617 


21518 


A 


7677 


214 


2 


QQAS VAQAGVKWPDLCSIiQSMP PRFK* F 
SCLSLLSSWDYGPPPPCAGLHIWRRNVT 
LALQGLPTTSLALV 


7618 


21519 


A 


7678 


3 


341 


HERHELEELIKTLFFFFFFRKKNHLCKP 
GGKKRGPIWFLKKFFSPPQKKPPPPPPF 
LREIGAMAPQPGGGLLI *NKTPFRPRGQ 
GGPKPPTPGKPNPPPPPKGNFPL*GPNP 
QPPGSPTPPPPQKVTFPFEGHQA 


7619 


21520 


A 


7679 


1 


357 


GTRINTLLALLLI I ITF * IiPQLTGYI KK 
STPYECGFDPISPARVPFSIKFFLVAIT 
FLI^DLETALLLPLP*ALQ'rrNLPLIDM 
SSLLLI IILTLSLAYE*IiQKGLD*TDFY 
SVGGEAN 


7620 


21521 


A 


7680 


365 


1 


PDASQ * HCTNGQSNRLLNLLI KALLTTA 
KIWIQPKCPPVDARIKKMQ*IPRMEHPS 
AIKKE ILPS VATWMSLEDTVPSRISQAQ 
KDKYHTVS CMWNLRMLNABPEGRPVVPR 
GVQGWGAARA 


7621 


21522 


A 


7681 , 


387 


3 


FFFKKEFCPRKKAKK*NWGPGNLGPRG* 
KNFPPPPPQKRGKKGGPPQGGEILGFL* 
KKGFPPMGRGGSKLWPLGFSPLNPPKRG 
GKKGEPPPPPQRGKPGGPPPKGLFFFFF 
FFLRQSLTLSPRLLV 


7622 


21523 


A 


7682 


107 


290 


ELNKRWGLGAVAHACNPST* * ELNKRWG 
LGAVAHACNPSTLGG*GRWIT*ALEFKI 
NLDNKARSHLKI KKQHQQQKNSGLDA 


7623 


21524 


A 


7683 


1 


348 


GTRERKPTWLWYHRERGMERDCSCVPGS 
SGISESRVWVQVGMEVYPAI IYLCLEPK 
YLVISEFSPAFRIWFLGLSWAHTCNPS 
MLGGCKMRIT*SQEFKTNLANIVRPHLF 
FFLF 


7624 


21525 


A 


7684 


364 


77 


GGPS I FPRIiAIiLGGGGPGGPPLPRGNFS 
FFNFFVKKGVFFPPGVFPPLPPGVIiPPP 
PPFWAPG* PRPPNFFLGAPGFFPFFFLG 
GGFFPVAPGGV 


7625 


21526 


A 


7685 


3 


24 


HEET I IQVKREPVE* KKIFANPTFNKGL 
TSEIYKKQLNRKKPNNPI*KQSSK 


7626 


21527 


A 


7686 


1 


125 


GTRPGMPGTYSNYPDTYTA* KI IS S IGS 
FISKTAFQPGPWB 


7627 


21528 


A 


7687 


459 


298 


LSLPSSWNHMCAIjPHPASFFL* RQGLAL 
LPRLVSKSWAQVMLLPWPSKVLGLQV 


7628 


21529 


A 


7688 


374 


248 


FPHHNVHEVIVRPLSPRYPIISHVTCTY 
RSRWP*ASEGSQKK 


7629 


21530 


A 


7689 


284 


163 


GTVAHTCNIjSTLRGRGGWIT*GQBIDTS 
PAWPPKVLGLQA 


7630 


21531 


A 


7690 


83 


255 


KVD YVS I KS E FF * DRVS L YH PS* GTVAR 
S*LTTTPNPGIKQSTCPTVPSSWDYRHV 
LPLPTRTI FSVPTQSLATIL I * ITLNLY 
INLGKIHFL 


7631 


21532 


A 


7691 


3 


193 


HERLDPADFSFNFFSVDVGLAMFPRLVL 
NS*HQVILLLWPLTVLDYRHELLRPAWG 
FYYLNFT 


7632 


21533 


A 


7692 


1 


364 


NFKSFFP*LFYLYHLLFFGFGLFLFFLI 
KLGSSL FINFS KKS VL * FTNWI YFS VLY 
FTDFCFIFIISFFGFSLIYFIIAVFPLK 
FIGFILSFI * ITVFYGQAQWLTPVIPXL 
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OCysteine, D=Aspartic Acid, 
E=Glotamic Acid, F=Pneoylalantne, 
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nucleotide deletion, \=possible 
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WVAEVRGSL 


7633 


21534 


A 


7693 


370 


151 


GKKYSVASPGKLWPPQGFFKTAPPFFFF 
FFLK*R03FTMLPRLVLNPWPQGIIiPPq 
PPKVLGLQVLRHHIWLR 


7634 


21535 


A 


7694 


3 


373 


WPGYTLNQAYAKIHFT 1 1 FIGANLTFFP 
OHFLGLSGMPRRYSDYPDAYTT*NILSS 
AGS FI S ITAVI LI I FMI * EAFASKRKVI* 
IVEEPSQSAGITGVSHCARAEYLFIDRR 
DGLSLCWPGWT 


7635 


21536 


A 


7695 


406 


394 


KFF * KKFLFP PTPTFLLKACLFFFLPPI 
KGFLFKYIAPGIKTPPPKKEKFFSSLKV 
LFSPPYFFFIPPPPIFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFRFEG 
ECWRL* W*FYF YFFKESPHCI P 


7636 


21537 


A 


7696 


2 


334 


ILINNTFLALIiLLI ITFWVPQLKSYI KK 
STP YE QGFDP I S PARVPFS I KFFLAAIT 
FLLFDLEIALLLPLP*ALQTTNLPLIVM 
ASLLIiIIILALSLAYE*LQKGIiD*AB 


7637 


21538 


A 


7697 


2 


95 


LNIiTlYIILTTTAFLLLNLNSSTTTLLL 
SRT*MKLT*LTI YI ILTTTAFLLLNLNS 
STTTLLLSRT 


7638 


21539 


A 


7698 


423 


326 


RQCLTMLPRLVSNS *SQVILLPWPLKVL 
GSQA 


7639 


21540 


A 


7699 


400 


32 


FHKEYESYFFSPNQPQFFFFFPHR*NFY 
VGVLXKQPPKKKFFFLLFTPERFFFPFL 
*KKNIFFFP*YFFFPLVIFFL*PPPLFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFSSNSFPFY 


7640 


21541 


A 


7700 


125 


407 


FINFSSTFWKPTTCNMQKHTPIARTKD 
LCITI FFFFLE KNFLFVPQVGGQGG I LG 
* LKI LLRG* KQFSCFTLPGRWTNGGI PP 
PPKNFLKKFF 


7641 


21542 


A 


7701 


50 


225 


PWNMVKMSCWLGKVAHVCNPSTIiGGRGR 
RIA* AQEFETS * VTQ*DPVSVIKKRKKN 
FK 


7642 


21543 


A 


7702 ; 


2 


146 


EIALLLPLP*ALQTTNLPLIVMSSLLLI 
IILALSLAYE*LQKGIjD*AB 


7643 


21544 


A 


7703 


1 


93 


ILIIOTIJ^LLLIMTMGLPQPKGYIKK 
STP * ECGFDP I FPARVPFS I ki fli AIT 
FLLFDLETALLLPLP *ALQTTNLPLIVM 
ASLLLI t ILSLSLAYE*LQKGLD* ALLL 
LIMTMGLPQPKGYIKKSTP 


7644 


21545 


A 


7704 


2 


112 


GRVGKHHFGFEAAA* YWHFVDVG* LFLY 
VSIY**GS 


7645 


21546 


A 


7705 


1 


325 


TAGQFLPKLSILLSYNPAITFLGIYPKI 
LKTYVYMKTCTWMFIAAliFIVVQTWKQQ 
SKLWYIQTIKYYSVLK*NELSSYENTSK 
KLRCI LLRERSQ YKKPP YFLI PTM 


7646 


21547 


A 


7706 


2 


169 


SRSRAGTIiAI*TINLPCTLMIFTILIVIi 
T I LE lAGAIiIQAYVFTLWS L YLHDST 


7647 


21548 


A 


7707 


263 


346 


MLINVPLGLLFVGVI LSKES PS VDQGGV 
QWINI,FSMQPPPTGFK*FSCI.KA*ASDN 
LSPHBQYRLALSFLKLTL 




21549 


A 


7708 


343 


163 


PKEF* *RQGFFHVGQAGFKLLRSGNPPP 
S ASQNG KITGVNPLAWQTNNS I P PMAP 


7649 


21550 


A 


7709 


459 


144 


FWPGASSHAFDPTTLGGRGGRXA*AHEF 
KTSLGNIVRPPSDTCNPKVLGLLA 


7650 


21551 


A 


7710 


2 


83 


NFLPLTLALLI*YVSIPITISSIPPQT 
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7651 ; 


21552 


A 


7711 


19 


181 


KYVHTKACT*MFISSLFIITKNWKQPIF 
SSIGE*WKT*YIHTMEYLLFRNKRK 


7652 


21553 


A 


7712 


392 


49 


SDIFNSDIFSHY*CSVF*SYLAYNRYLS 
FL*LFLEFIYSYP*IDHAFHCFLHPVSS 
FWLDFFLFIFSFFLFSYFFSYFYF*FYF 
*FIFNFIFNLFFILFYF* FSFFSKCLKF 
PC 


7653 


21554 


A 


7713 


334 


208 


FF F * E WVLTTLPRLVLNPQAP AI LLP W P 
PKVLGFTGVSHIKI 


7654 


21555 


A 


7714 


2 


408 


WNGMEWNGLKWNRLERYGMQWNGLEWNR 
TE * TRMVWNAMEWTRIBLNGLEWNGMEC 
NGMHLNGLE WNGMERNGTDPEWNGTD PE 
WNGTDPEWNGTEWNQVKWNEVDSNGMEW 
NGIDSNEMGWTRMEWNGMESTRV 


7655 


21556 


A 


7715 


362 


223 


VWWFMPIIPTLWEAEVGG*LEPRSSKPA 
WATWQDAI FKKKFKHQHP 


7656 


21557 


A 


7716 


2 


335 


LI VI INTLLALLLI IITF* LPQLNGYIE 
KSTPYECGFDPISPARVPFSIKIFLVAI 
TFLLFDLEIALLLPLP*ALQTTNLPIiIA 
MSSLLLIIILALSLAYE*LQKGLD*AE 


7657 


21558 


A 


7717 


2 


142 


SITL*LPQLNGYIEKTTPYECGFEPICP 
ARVPLS I KFFLEAI TFLLFDLE I ALLLP 
LP *ALQTDDLPLIVMTSLLLI I ILTLSL 
AYE * LQKGSD * AYMPRPRP FVHKXLLRS 
YYLLII 


7658 


21559 


A 


7718 


6 


92 


FFEAAA*YWHFVNW*LFLYVSIY* *GS 


7659 


21560 


A 


7719 


417 


164 


PLEPLISGRGLPQIAPPPKKGSPPKSPR 
WFFFPPP*KKKNYFPPPPKILPPPGFF+ 
NPPPPFFFFFFFFFFFIFFYFFKTALPL 


7660 


21561 


A 


7720 


25 


401 


THNS*DPNWD*IPHYA*P*TSTVKKKKK 
KKKKKKKKKKKKKKGGGLIKKFRGGPKY 
TGGGKI I FFFFMGGKKKPLGDFLKKNFF 
LGGGNLGKHPPKKLSLQKKKKMFKGRGG 
KKTPLCRRGKKFS 


7661 


21562 


A 


7721 


388 


2 


APPFFFYFFFFIFFFYFFLGVGFSFK*N 
SKVFWISNFLKKILRVFICSLEKGINPF 
LEKCFYMFFFFVAF I IRLNFSLQAKFFL 
LNFALLPFPEIFFFFPFFFFFPQIEDFW 
HPYIEQVYQLSLFSLS 


7662 


21563 


A 


7722 


389 


71 


FFLPHQKQVFFPPPPFKIFFFFSRVFFF 
WGGWAQKPPPPKKFFFLKTPPGFFLPPP 
♦KKKFFFFSPRFFLPPPGFFFKPPPPNF 
FFFFFFFFFFFFFFFFFFFFLN 


7663 


21564 


A 


7723 


77 


220 


TPRRGWAYWLMPAYPAILEA*AGGLLEP 
KSLRS TWPTWRNS ISPKRK 


7664 


21565 


A 


7724 


16 


258 


ISS FVFKRRS FAMFPRLVLNSWPRWLP 
PRPPE *LGLQACTTTFSFNFFDGIFLRP 
LKEDVTARDQREILYVFALFYYGGA 


7665 


21566 


A 


7725 


353 


111 


IFGAGPRFVFKAGGQGANSRSLKPFPPG 
NKWSFPFSPSKKWEGRG*PPRPPFFFFF 
FFIREGVSLCCQGWSQIPGLKQSS 


7666 


21567 


A 


7726 


2 


185 


MSKGHTRLSSAWTGKPPLSVEDDFEKLI 
WEISGGKLEAEIDLDPGKDEDDLLLELS 
*MIDT 


7667 


21568 


A 


7727 


346 


143 


SQAFLYLESFLLLLLLY*FLFFEMESHS 
VTQDRVQWRHLSS LQPLPPRFQQPSRLT 
LPSSWEAEVGG 


7668 


21569 


A 


7728 


411 


221 


NLRPPGSS YSPASAS * IAGI PGARHHTQ 
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LIFAFLAEILKNLAKLYLI PRKPNI YYL 
PNCYINI 


7669 


21570 


A 


7729 


2 


303 


AAASTOMLLFPLGRHLVVEWLGHMVGIiI 
PTFKKLPNQAGRRGSHL* FQHFGRQRWE 
DHLRSRVQDHPEQHSKTPSSIiQKNKMSI 
ESNIiLWIHKLFSTGRM 


7670 


21571 


A 


7730 


67 


65 


SICEYCFVSSVFVLKRLFSVMQAFSLEG 
CVSCQS * AFLQHYLFKKLYFIiY* WGLTM 
LSRLVLNTWPQVIHLPWPPKVLGL^AFC 
GNEA 


7671 


21572 


A 


7731 


354 


238 


QSILDDVAMVLD*QABVFIVKMWRLLIY 
VHEAKKIGLVK 


7672 


21573 


A 


7732 


33 


186 


RGAIAVLSRLILTPGLKESSCLDLPKGW 
DYRWE P PRPGCFF * LMVLVLS F 


7673 


21574 


A 


7733 


1 


181 


VNAGADCS S IGGVP FLQHKKCHGKDYBP 
RGITTLERS YVEETTEHLVSKSK* PLRA 
QINL 


7674 


21575 


A 


7734 


336 


80 


HKLKEPPGVFPVFPFKNLEFGPGPPKFF 
FKKAFS PILSFFFGKI FKI PRFRGENFA 
P*NFGKNS*KPRFSPPPPKKKGFFFFFF 
F 


7675 


21576 


A 


7735 


2 


163 


TPVSTGTPVPTLT*VPSPIIFPVSEKWA 
GCLHLCLNFTCTEIiRLLTSLLTIRS 


7676 


21577 


A 


7736 


87 


258 


KAPSVCLFSAIJ^LRMSARTSVCTVRP 
LSPS * AVIS PVTCTYTSRWPEATEDSQK 


7677 


21578 


A 


7737 


1 


103 


LDSKGIYMTLQPSKLEPKLEANVEIREH 
MLENSSRF * RDLYDSAAFKARTKARSKC 
RDKRAHVGEFF 


7678 


21579 


A 


7738 


193 


300 


GGLPPPPFFFFFFLKKRGFPWVTRGGPN 
PPPLEMGGPTPPKGWNYGGGPPP * FHPF 
GGVGPP I S KGGGLGP PRVTQGKPLFFKK 
KKKKNGGGGKPPYSPFLGGVKQKKGVNP 
KGGGSKKPKFPPPPPPGGKKKKPFLKKK 
KKKK 


7679 


21580 


A 


7739 , 


150 


17 


GRVAQVWWLMHAIPAL*K7^EAGRSIjEPR 
SWTPDWGTWHLPI STK 


7680 


21581 


A 


7740 


1 


358 


SPPRPPPPPPPPPKEKFKKLENPPPPPK 
NPQKQNPPPKTKKKSPTTTPKKKPKIGT 
PPQTPKKINCS*KKKNNPPENKKGLNYP 
NKPNHPHAPPKEPKQREKPLVPTKKKKG 
PSPKKKK 


7681 


21582 


A 


7741 


235 


147 


F* FFLGDEISLCCPGWSLTPGLEQASCP 
GliPRRWNYRCEPPHSVKTIF 


7682 


21583 


A 


7742 


131 


12 


AASTYGQ*FKICGAILRLMPIVIEFIPL 
KILQIGPVFTL 


7683 


21584 


A 


7743 


1 


184 


NQYPWCFCNHMSMGKES KTLNRSGMVAH 
TCNPSTLGGQGGRIT*A*GSKSSLGDKV 
RPHLY 


7684 


21585 


A 


7744 


347 


202 


SLFVETGYHYVD* AYLELLASGS P PASA 
SQSFGISSVSLCTQPMFQFE 


7685 


21586 


A 


7745 


138 


263 


KKRKTYIHTKTCM*MFIAALFIRVRKWK 
QS KCHSADEWINKF* YI KICLS IHQLND 
IWIASTF 


7686 


21587 


A 


7746 


358 


169 


AAAVAERTQNTEKTEDLVLGLWI KKVTY 
WPGWAHVCNPITLGGRGGWIT*GQEIE 
TSLPTRR 


7687 


21588 


A 


7747 


78 


346 


AIQMLRNQVKIKLPKSPVEIIPFKIVPQ 
FQVQKI KSQNNFLLLFQESQ I FFATSTG 
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SVRPLSPS*AIISPVTCTYTSRWPEVTE 
KSQKK 


7688 


21589 


A 


7748 


222 


98 


KKKARHGLGAVAHACNPSTLGGQGGRIT 
*VQEANTVRTCLY 


7689 


21590 


A 


7749 


314 


52 


LNNVDLQFFLGPRKKKRNLNDSSLFVSA 
EEVRLMYCFNKLAILHYF* FSNKFIVFV 
* KFGHLLDENMG S KFDN I GMNAMANKDN 
ASK 


7690 


21591 


A 


7750 


371 


216 


YHTHTHTHTHTHTQIPSI1PI1NI1EGPVGY 
IjLS CGS IiHLKNLW * KCMRKERKK 


7691 


21592 


A 


7751 


3 


238 


FLIQTGFGHVSQAGWELLR*GNLPALAF 
QSAGI TGMSYLCVAESL YLLPPFLKILC 
SSLLTLSPYDTWKVALLCSYTV 


7692 


21593 


A 


7752 


2 


247 


NSDIFDIYIIQYIFLIYLIYIIQYIYLI 
*VFLI IFFKFTLK*EFIFISLSISFRIiA 
FIVYRDVILLFLFFFS *NFVGDS IS 


7693 


21594 


A 


7753 


359 


2 


TLKIRPI*IiYLPLFFFFFF*KKKIFFFPP 
GGGKGV* SGFYLMGRGTFSLFFKRFFI*F 
YPPKEGGLRGSPPPPSFFFFFFKKRGFP 
FLARGV* KKGPQGTPLPWLP KSLGLRG 


7694 


21595 


A 


7754 


3 


174 


CSVIQAGGQWCGHSSLQP*IPGINQSSY 
LSFPSSWDYRHATMPS*FLFFIEIKMLQ 


7695 


21596 


A 


7755 


292 


219 


IWPLSHVHSNPLTHSPSQVDQGLFTTLL 
PCLSGYMYVQFPLAEMPFISST1I1T1KNAC 
LSLCEGSEAQKPLLWEGLSYLPPLNQLP 
LCL+LCVCISGGVCRCLMCILGVCVOIH 
VCVSVCPCTHTLAQETKGC 


7696 


21597 


A 


7756 


2 


195 


VALMADGAIDTESNDYGAFMPIiGIERGJj 
DRIWEMPELWLRPNEFDCMTD* PCIQHA 
PSVSCGLA 


7697 


21598 


A 


7757 


4 


280 


DHIVDMITPSFTRRTIAVF*DLNLYIVI 
RGHITSILKPNKWLKLWYIYTIBYYSAL 
KGNB ILIHATI WINLENMQDE INQTQKD 
IYCTISLI 


7698 


21599 


A 


7758 


352 


107 


FLTRASRGHNGETIVTSWPGAVAHVOTP 
SSLGGRRRWIT*GQEFKTSIiANMVITFF 
ISKNSNRFLSQFPISLRPTHYKVLS 


7699 


21600 


A 


7759 


92 


4 


RPSPRRGGWTA*GQEFKTSLGNMVKPCL 
Y 


7700 


21601' 


A 


7760 


92 


4 


RPRRRRGG WTA* GQE FKTS LGNMVKPCL 
Y 


7701 


21602 


A 


7761 


328 


124 


PLFSFLFFPFFFFFFFFFSRRDGVLPCC 
PGWS * TPGLKLSSSLCLPKCWNYKHEPL 
HTAHHNF 


7702 


21603 


A 


7762 


1 


167 


VDSTDKRPGAVAHNYNPSTVGSQGGQIT 
*GKFNPSLANTVKLFKKKKKRKRGALF 


7703 


21604 


A 


7763 


309 


226 


RPRRRLTLSPRL* CSGAISAHCKLRLPG 


7704 


21605 


A 


7764 


349 


7 


QVFCFLFFVCFSSIFMTFNE*QGLKVTS 
GISSNVY*FLLIWILGKITYQDIGRGFS 
EYGS P EQHNIiG I VRNASSRLGWAHTCN 
PITLGG*GGWIT*GEGFETSLANMGKPL 
LY 


7705 


21606 


A 


7765 


43 


339 


FFFFLFFFFFFIIFLLLSFFF+DHFFVS 
LLFISNFVFLFPI FIFTVFQAHLFICLL 
FFPICMFLNLFFPNFNYYFYP FPLFFYL 
LFFFFG* FFNHLFYL 


7706 


21607 


A 


7766 


352 


102 


HTSCLIIKSSIiSKMNISCVSSSISFFF* 
FILIE IGSR* WQAGLQLLGS TDPPASA 
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SQSARITGLSHHAQLSSTSLWENVEW 


7707 


21608 


A 


7767 


346 


90 


TPPSSFFSFPPFPFFSFIPFRFFFFSNR 
DRVLVCYLGWS * TPELK* SSQ FGLPKCW 
DFRCGPPCIiTSTAFYISITVNFSPSNCY 
S 


7708 


21609 


A 


7768 


278 


87 


AASTGVGPSEPSAGYYLLVCRLLRSLEK 
CSIRVGVT* FSRCHLSPLSIiTRKGNSLT 
PCASRVR 


7709 


21610 


A 


7769 


5 


272 


AVYHP INESCVLNIGKDSCLLY * LFBKK 
KKKKKKKKNKKKKXKKNKHYE * FNTFPG 
VARKINLFLCIQKVDAGGGTLWGGGPPI 
FFFFL 


7710 


21611 


A 


7770 


198 


33 


QRPQSRQCRAWLGMVAHAYNPGTLGG* D 
RRTT*GQEFKTSLGNIVRPYLYEKMF 


7711 


21612 


A 


7771 


97 


410 


A*WLMPVIPIIWEVKWRLLEPGSLRPS " 
WATWRKP 


7712 


21613 


A 


7772 


5 


415 


ILCVYLHFVGTHPFVHL*YMHFLPLTIiA 
LLI*YVSIPITISSIPPYT 


7713 


21614 


A 


7773 


1 


342 


WRVTSGHSG*AAAYAT*YILSAEGSFF 
PLTBELLI*FMKREAFT*KR*VLIIHEP 
SXNLD * LYGCPTP YHTFPDPVYLNSKRR 
RRDSHPPKLASRLPHVLHDFFK1CKTCGGP 
RF 


7714 


21615 


A 


7774 


398 


122 


SPPPPFPSSPSFFLPPPSSSFFLFFLFF 
FFFFFFFFFFFFFFFFFFFFCRD*VSLC 
CPA* SOTPGLK+ YS YHGLP KPWMVWHR P 
PCLAYFL 


7715 


21616 


A 


7775 


1 


336 


FNFLLI I I IEMESRS VSAHCKLSLPGSS 
NSSASASRVAGSRGAHHHTWtiF F RFPT? P 
F*KKKGAPPGGQGGFLTLKKKNPPPRAP 
QRGG INGGTPRPQNGGGKFF P WGEKKKF 


7716 


21617 


A 


7776 


1456 


1232 


FAIESHCVTQAGVQWCNLASLQPSPTEF 
K*FSCLGLPSSWDYRCVPPHPANFYIFS 
RDRVSPCWPGWSKTPDLK 


7717 


21618 


A 


7777 


400 


2 


NTEFPYGPPISLLGM*P*EMETYIPRKP 
CTQMFLEVtiFTIAKK*KKPKRS*LING* 
I LVYPIiME YYS ATKRFDVM I HT KNVDAI 
GNIILSERR0«*KTHIVYSIHRASCTAE 
VRIKVSYKRAATWIKSILIA 


7718 


21619 


A 


7778 


360 


218 


PYLANF*NLL*RWHLNMLPRPDSNSWPQ 
VII*SIiWPPRVLG*QRGGRVEQRGARGNW 
MSKCPEGGDCLCWVAGA 


7719 


21620 


A 


7779 


1 


372 


FEVRSCSVAHAGVQWHDHRGFEFLGSSD 
DPSVSAS CISR ITGASHCTQG* L * LLTK 
VL*VSAFCLSVNTLRLTSLLKSDTFLGS 
RSKLNSLGESLQTYISPSRHFIQLLCFR 
SLFPRFLVKHKK 


7720 


21621 


A 


7780 


372 


219 


ATSLLS YFKKLPQPPQSSATTTI ISQ*S 
STSRQDLPPAKRLELTEGSDDP 


7721 


21622 


A 


7781 


358 


145 


I KNIHDS WEDVKISTLTRV* KEL I P VLT 
DNSEGLKTSM*EVTAEWEIARELELEV 
EPKDVLVQFHDTINR 


7722 


21623 


A 


7782 


317 


146 


GRVDCKCPFYRFQMLLVRLGMVAHVCNP 
STLGGRGGWI P*GQEFKASVSRMPRPHL 
Y 


7723 


21624 


A 


7783 


13 


186 


DRVSITQTSWAHPPTSAS * VAWTTGMHH 
HTWLNFVFFVEMGFHHVGQVGIRLPLLS 
WK 
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nucleotide insertion 


7724 


21625 


A 


7784 


339 


221 


KWDLAMLARLVSNS WLQGS S YLSL * NCW 
DYGSEPLCPAL 


7725 


21626 


A 


7785 


355 


3 


PGNSGRVDLFFFFFFLDRVSLCCPGCSA 
VTNIxEVHHSAIiQPQTPGLKPSSHLSLLS 
SWDYRRVPPHPS *GLF* *GK* YS ITQLR 
FLNLHLBSGLRSLLPKPVASTVRIRTQS 
GLGLK 


7726 


21627 


A 


7786 


1 


201 


QQFSHVFRLLFQFSFSCQEPPSSHIQPG 
VCTRNTKVSQAW * CVPVI PANREAETGE 
SLEPGRRSTRP 


7727 


21628 


A 


7787 


270 


1 


RPRRHLVPCVLMLG PKCPGL PHTLTTLP 
YTLTTPSPTRSLGPPECK*SFHl4SFPSS 
WDYRRMPQRLANFCRGRILLC YPDWSS S 
PGLKQS 


7728 


21629 


A 


7788 


229 


2 


DGWVRINWAQPGTVAHAYN I SNLGGRGR 
RIT*GQEFKTSLTNMEKPCLYSDHEVRS 
LRPARPTWRNPISTKNTKM 


7729 


21630 


A 


7789 


118 


263 


CSKSYYWPDAWHTCNPSTLGGQDGWIA 
♦AQBFKTSLGNIARPPISVF 


7730 


21631 


A 


7790 


246 


359 


TFIFSETRSHS VTQAGV* WHDHGSLQP* 
PPWAQVILSC 


7731 


21632 


A 


7791 


70 


102 


AA*SRIiTATSLSRVQAIPLPQPPEQLGG 
QGGRIS 


7732 


21633 


A 


7792 


7 


312 


FLDFQLRLHSNSYCEDKGVFDSEIYEIC 
I VLI CKK* RKFLNQKKKKKKKEKPGLI * 
KKKKNSQKKWFDFPWLPPNPETKNSLFL 
PKEFLWLKTKPPFPLTLT 


7733 


21634 


A 


7793 


1 


341 


IKPE*YLLFAYTILRSVPNKLGGVLALL 
IiSILILAIIPIIiHISKRRSIIFRPLSRS 
IiY*LLAADLLILT* IGGQPARYPFTIIG 
QVASVLYFTTILILIPTISLSENKVLLW 
A 


7734 


21635 


A 


7794 


1 


143 


YGSTFFVATGFHGLHVI IGSTFLTICFI 
RQLI FHFTSKHHFGFEAAA* YWHFVDW 
*LFLYVS I Y* *GSTFFVATGFHGLHVI I 
GSTFLTI CFIRQLI FHFTS KHHFGFEAA 
A 


7735 


21636 


A 


7795 


2 


257 


KWAIIEEFTKNNSLIIPTIIATITLLNL 
YFYLRLIYSTSITLLPISNNVKIK*QFE 
HTKPTPFLPTLIALTTLLLPISPFILII 
h 


7736 


21637 


A 


7796 


22 


326 


RDASDCSFQNLPVPLWVEK*MVFIiLTKK 
KKKKKKKKKKKKKKKKKGGGP FKKNSWG 
AKIIPGKKKIIFFFIiEGQKKKLWGFFKK 
KPLFWGGJOCRPNPPKKN 


7737 


21638 


A 


7797 | 


506 


281 


RGAN*NRSGQGKRHEERERERERERERE 
RERERERBRERES PRPPKRQRERETEIQ 
TLSLTVSLAPPPTCVF 


7738 


21639 


A 


7798 


435 


336 


MRSNIHFFSHTHTHTHTHTHTF* ILKQH 
TFSK 


7739 


21640 


A 


7799 


411 


106 j 


RNPPPLFFSPPLQRGGFPPIYWGPPRFF 
PPPPL+KTPPKLKIGAPPKKKKPPPPPG 
EKMVSF*PPPPFFFFFFFLRRPFPliFSP 
GGGPWGHFRGPP PP PPGV 


7740 


21641 


A 


7800 


2 


296 


FVPSTANWCFFLWSVF*TGGSNYFFVHR 
S YSQAGVDLI FIRLANT ISNLS F I * * RG 
GLAMLPRLVLETRAQTILHSWPPKYLGL 
QVWATVPSFQFLKN 
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peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
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Predicted 
beginning 
nucleotide 
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Predict- 
ed end 
nucle- 
otide 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pfaenylalanine, 
G=Glycine, H=Histidine,I=Iso leucine, 








26 


correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


K=Lysine, L=Leucine, M=Methionine, 
r^Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X«Unknown, *==Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 


7741 


21642 


A 


7801 


344 


105 


PGFFFFPPPGKRGFFPKPFFLGTPGFFP 
PPLF*OTAP*FFFWAPKKKIFSPPPPGK 
KIFLLKGPPLFFFFFFFFFFFFF 


7742 


21643 


A 


7802 


68 


216 


VHTIAKIWKQPQCPSMDD* IKNMRYLYT 
VE * YS VLKERE ILLYWGHYAM 


7743 


21644 


A 


7803 


407 


3 


GFIH* FPLFSGYTLDQTYAKIHFTIIFI 
GVNLAFFPQHFLGLSGMPRRYSDYPEAY 
PT*NILSSVGSFISLTAVILIIFMI*EA 
FASKRKVLIVBEPS INLEWLYGCPPPYH 
TFEEAVYIKSRQKRKESNPPMY 


7744 


21645 


A 


7804 


2 


156 


THRRIIILSQGIj*TLIjPIjIAF* *llasl 
ANKKKKKKKRGGPFKKFPWGAKI 


7745 


21646 


A 


7805 


2 


309 


THSRIIILSQGLQTIiLPLIAF**LLASL 
ANKKKKKKKGGAPLKKIPGGAKIKRGME 
IKNFSQKGGEKKTQRGIFGKKPYLGGGQ 
NWEKLPKKFKGLKGKKNF 


7746 


21647 


A 


7806 


110 


362 


TLSDLERLLMKAVSHFIjMI DIiLEKC *CV 
LKNQSSKKKKKKKKKKKKKKKKKKKKKK 
KKKKKGGPLLKNSWGGPNFPGGEKIFFF 
FFRGGF 


7747 


21648 


A 


7807 


295 


108 


KQRG FVFF FFEME S RS WQARWQ WCNL 
SSL*PPPLGFKQFS CLRIVYRJCDSLSFN 
KPRLLP 


7748 


21649 


A 


7808 


2 


142 


GSTFFVATGFHGLHVI IGSTFLTICFIR 
QLIFHFTSKHHFGFEAAA* YWHFVDW* 
LFLYVSIY**GSTFFVATGFHGLHVIIG 
STFLT I CF IRQLI FHFTSKHHFGFEAAA 


7749 


21650 


A 


7809 


390 


2 


PASLLHCX3*ISDCCASNE*GSVGLGPSE 
PGAGYNLLVCHLLRPIjEKCSIRVGVTRF 
SRCCLS PLPLDRKGNSLTPCTSQVRQCL 
ALLQLTLGALHP VS CTHCPT I SGEMNPV 
SQLEMQKS PIFCVTHAG 


7750 


21651 


A 


7810 


1 


157 


FLHFGQAALELLTSGDPPASAS * SAGI T 
GVSHRAQLVCTFITIYVFLKNSSY 


775! 


21652 


A 


7811 


368 


44 


QNFPPPKKRLPPQPPLFFFLVPSFKBEG 
FFFFLRGFFFFSSAQRALL * F I YFIFFF 
FFFSFFFFFLYFSISPDSKGDTHHDLLL 
GVSWWTRSLPLWIARHNMHKWGW 


7752 


21653 


A 


7812 


400 


80 


PQNSFSPPGIGGFFPPFPL*NFFFSPKA 
FFFLGGFSPFFPPPKKSFFFKI PPGFFF 
SPPLKKKIFFFPPPFFIAPPRFFFKAPP 
PFFFFFFFFFFFFFFFKYFRHI 


7753 


21654 


A 


7813 


119 


230 


SEEFETSLRCIVIPSL*KKKKKKKKKKK 
KKKKKKKKKKKKK 


7754 


21655 


A 


7814 


3 


412 


HEQEHL * LLLP * * P LAI I * VI S TLAETN 
RTPFDLAEGETELDSGFNIEYAAGPFAIt 
LFIGE YTD I I IRNTLTTTIFLGTTYDAL 
S PEL YTT YFVTKTLLLTSLFL * MRTGYP 
RFRYDQL IHLL * KNFLPLSLALLI 


7755 


21656 


A 


7815 


1 


183 


LRLERLSEFSTRRERERERERERERERD 
TRIDIYIVSQKRKKYIV*IRRNIFVYHA ■ 
EFSKR 


7756 


21657 


A 


7816 


484 


97 


QPRTPDLK* FACLGLPKCWDYRHKPPCL 
ASDGNHS * SSPGLLVLSSASVSPSGHMS 
PSQQTS PWVSEESLLLGRVPGFLYFPPV 
DAPGPGAWLAHWQAVLKLLGSSDPPVL 
P S QS AG F AGLSNCAW P W 


7757 


21658 


A 


7817 


2 


219 


ADRLRNS * EA*EREREREREREREREPY 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cyst«ne, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PPSHIYTDRNRH*MRHEERTLFLWGSGE 
VRDAAPWVSHIPNTAL 


7758 


21659 


A 


7818 


190 


292 


NSHL I FPWTKVFFMS * CT YNJjVLPGSE K 
KPYSHA 


7759 


21660 


A 


7819 


498 


139 


RRAATPAPAAAEPPPPLQRP* PPSEPPD 
TQRDARGRRRGEDPGNS PFHPNPDRQPS 
RALCSTPRMHRLWIGPAFFIiMTSLSVSG 
AVI PRNGGPGGVSSGPCLLQLLCGQAGS 
STIRKIPS 


7760 


21661 


A 


7820 


489 


330 


ELGFLHVPQAGEELLS* SNPPASASLPT 
SWDYRHDHRTGHHPGIYDSKMCIF 


7761 


21662 


A 


7821 


490 


260 


FFLXXNXFXFFFFXXFXXPXFXXXFXFF 
FFXXXFFFXFFFFFFXFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFLY 


7762 


21663 


A 


7822 


493 


482 


VGQAGLKRLTSGDLPASASQSAVITGVS 
HRARP IMYFRYVQ* AKGSHV* RWY 


7763 


21664 


A 


7823 


456 


121 


ASFFPIQYKGLGGATPPQEGGCG*GIPI 
KWGYKRRPGGPHGGSKRPPTQ*KKSYFI 
NVLVLF YLRDKGLAI PTLVSNS WAQVT L 
PPWPPKVLGLQTRVTVPGQNIIiKENMFF 


7764 


21665 


A 


7824 


294 


457 


LCIGFVILISYFNIMENWYCRPCTVAHT 
CNPSTLGG*GRQITRSRDRDHPGQHG 


7765 


21666 


A 


7825 


26 


235 


SVWWNSQTGNKVKFTISW* IDKQMRCIH 
IWBYYSAIKRNEVIiINATTTWINIjNTIK 
IjSKRSQTLKSNTL 


7766 


21667 


A 


7826 


30 


303 


S YVS WEFPGSSGPTLPS P WHQTPRTYL 
NSVASAINLTQCPH * PEKKKKKKKKKKK 
KXKKJCKKKKKKKKKGGAFKKKGP*KTPK 
KTPGGF 


7767 


21668 


A 


7827 


464 


21 


REPPHPAPADILIIoHFWPLKL*ELISW 
LNTRCWICDDTPRKLTCQPMPRLLLQT 
EPL* VI,FSNLGSFGSIiSKS *MGLVNFFC 
KGTDSRSFSLFVAVFCLFL*RWGLAMIiH 
RLVLNCWPQVI LPP PPPKLLGLQVEATT 
HEFHGTE 


7768 


21669 


A 


7828 


372 


205 


FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFIKKFFI 


7769 


21670 


A 


7829 


463 


313 


DI YPERRNSKIMS FAATWMKLEAI ILK* 
LKNRKPNIVCTHIi 


7770 


21671 


A 


7830 


449 


266 


WQMRFLHVGQAGL * LLPSGDPRASQVQ 
SAWITGVSHHAQRS IFFKKRNNDCDRPD 
NTECT 


7771 


21672 


A 


7831 


480 


148 


IEIiRLSHGAAEFHRPREWYGIFKVLKER 
DFYPRIIYPAKIILRHEGEMKTFPDKQT 
LRHF INTRP VLQEMLKKVHQS K I KGC * * 
ATNNHLKVKKKKNSLGW 


7772 


21673 


A 


7832 


23 


131 


QRGNSKGYHLKMIQQEEIRKLEEEKKQL 
EGEIIHFYKMKAASEALQTQLSTDTKKD 
KHGKKQ*FL 


7773 


21674 


A 


7833 


16 


339 


NTOTLGSLMAFCRDGIAMLPRLFLNTGL 
KRSSCPDLPNCWDYRHGPPYLAS FVLLK 
*TI^IIiLLPYYHKMHTCVFMVLCTOTCG 
GVCPGMELLCLKVYGYLSLKHTFH 


7774 


21675 


A 


7834 


355 


70 


KKTPRGFSG VFLGP rYKKG PIjFFFFF F F 
FFFFFFFFFFFFFFFLFLQFFILCQQFL 
S*STEKTVHSKYILSFPFHCLIFCPFIL 
LCSTFCNGAHY 


7775 


21676 


A 


7835 


494 


282 


PENGMIQGGGACSEPR* *HCTPAWAKER 
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peptide 

sequence 
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nucle- 
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correspon 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
EXJIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Glutaraine, JR=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














DSVSNAKTIQNKKNLGN*HSISTAKAKN 
1 1 IS * YKSELRKKM 


7776 


21677 


A 


7836 


305 


185 


LVIETCLTORPLSPS*AIISPVTCTYTF 
RWPEVTEESQKK 


7777 


21678 


A 


7837 


179 


191 


USCRKYLFYLNGGLKCLYI KCVCVCVCI 
YIGVYIGMCVYI C I YI YIiMAS * NQVTS F 
LSFYVFLLNYR* I PVKVITVLFNLRMTG 
LP 


7778 


21679 


A 


7838 


466 


343 


LGRLRQENHLPPEGRGCSEPRSRPCTPA 
WETEEDAVSKINK* INK 


7779 


21680 


A 


7839 


451 


89 


LRDTR* SGVI CLPKQAWAMVGAPPPASL 
PPCSLISDCCASNQRDSVGVGPSQPGGG 
YNLLVRRFLSLSEKRS IRVGVNRFSRCH 
LSPLSLTRKGNSLTPCASRVRQCLALLR 
LAHGARTH 


7780 


21681 


A 


7840 


442 


164 


AHTNQFSQCIKKSVTPDLMEEMYK3CAHA 
AI*ENPVYEKRPKKEVKKNRWNCPKMSL 
AQKKHQVA* KKASS LRAQEQAAES 


7781 


21682 


A 


7841 


494 


359 


ICMLPRLVLNS *PQVILLPWPPKVLGLQ 
A 


7782 


21683 


A 


7842 


577 


224 


IFFFFNKIFLFRPGLNLMGDIWVPSTPP 
LOG* TNFPPQPSQEVGIiQKAPPPPGLIL 
VFIiSRQGFPQLGRVGFHFPPPGDPPPRP 
FKKIWSHPGGGKSPPKKKKKKEIAEVLG 
AFRLR 1 


7783 


21684 


A 


7843 


404 


56 


HTLSTNVCSSYLNLDFFSLRGRGLIMLP 
RliALNSWAQE ILLPQPPE * LQLLRRLRQ 
ENLLS PGVQGQPGQHNETPTS 


7784 


21685 


A 


7844 


1 


323 


INTLIiALLLIIITF*LPQGIGYINNSTP 
YEGGFDPISTARVPFCIKFFLEAITFLL 
FDLE IAIiLLPLP * ALQTTNLPLIAMAS Ij 
LLI 1 1 LALSLAYE * LQKGLD * AE 


7785 


21686 


A 


7845 


419 


159 


FLFFFFXFXXYVIjSCGIY* YWCNLFSS * 
QYFFFYVMAFRIFLFTFIiFLHFYYNMF* 
CTYFYLSCIi*FEYICSHIIHYFIYFLHL 
ENF 


7786 


21687 


A 


7846 


475 


124 


FFFFFQPPQKBGGPPPPPKNFFFFFTRI 
FFFGVFFFFFPPQKNFFFF * KTKGVFFF 
PFF* * KI FFFFPGVFFFFPVFFFLSPPP 
SSSFFFFFFFFFFFFFFFFFFFFFFFFF 
FFLKRGQAS 


7787 


21688 


A 


7847 


6 


189 


LIDTVSPSVALAGVQWCNHSSPQP*PPS 
LKRSSRIALPRFWDYRCAPLCKAHSFNS 
NHEK 


7788 


21689 


A 


7848 


20 


249 


KINFIWELTCSNTVHTFYVYGFDKCIL 
PTTQLFFFIj*ETGFHSVARLECSGVIMA 
HCSLDILGSSWRPANFLNFL 


7789 


21690 


A 


7849 


101 


398 


LFQK3CKKKKKKKKKKKGGPLKKKPGGGK 
NKGGEKKKIFFLKGGEKKNPRGNFKKKT 
FFGGGKKGEKPPKKKKSPEEKKKF*RGK 
GGKKSLICWVEKNLG 


7790 


21691 


A 


7850 


87 


226 


GSLSPIMLVLPFLIHLL*KNFLPLTLAL 
LI*YVSIPITISSIPPQT 


7791 


21692 


A 


7851 


2 


440 


GAIIRRLLH* FPLFSGYTIiHQTYAKRHF 
TIIFIGVNLTFLPQHFLGLSGMPRRYSD 
YPDAYTT*NILSSVGSFISLTAAVLTTL 
MS * EAFASKRKVLLVEEPS I DLERLYGC 
PPPYHTFEEPVYI ISRPRRKESNPAKLV 
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E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
Kf Lysine, L=Leucine, ^Methionine, 
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SSQPHG 


7792 


21693 


A 


7852 


1 


80 


SSLLLI I ILALSLAYE *LQKGLD*AE 


7793 


21694 


A 


7853 


5 


135 


ATFLYFS * K* SLSMLPRLVLNS WTQAIL 
PPWTLKVGLCFFDPL 


7794 


21695 


A 


7854 


38 


419 


FVMMPLHSSLGEGVRPYLKKKKKKPPWE 
GGS PPKNFKNQI FA* NQKPPFFFFI PPN 
KKWGSPPGGGASPL* SHPSGGPGGPNNK 
VKISKPPCPPGGTPVFTKTPTFT*PWGP 
IPEKBGRKTVCPRTH 


7795 


21696 


A 


7855 


398 


61 


FFSPRPRGGGFPPPPPKNFFFSPPTIFF 
GGGVWNFFPPPKMVFFLKISPWFFFPL 
IRKKIFNLSPGSFWSPQGFFF*GPPPKF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFF 


7796 


21697 


A 


7856 


1 


121 


LDASGRLPPSPKPIKIKNYNRP*EPK*T 
KICSIiHSLPPQS 


7797 


21698 


A 


7857 


1 


332 


TPPIARPAENSARGAEGRRR* I IALQVC 
NKKWP PTKLHC I KKIOCK3CKKKKKKKKKK 
iCKKKKKKKKKKLWGGGGGKNFFKKC^KK 
KLGGGFKKRGRKKKK 














7798 


21699 


A 


7858 


3 


191 


SPRHLTRAAGIRHEGQTGLKLLTSGNLP 
ASAS * SAGITGMSHRIGPRMLDLIiTLVS 
WKQHGL 


7799 


21700 


A 


7859 


503 


62 


FFFFLNCCLSGLRHLIKLFEPQFIjHL*N 
GGMVLSIILSIKSPLRSPSLSYSWL**P 
REKIKWWGGQVRWFMP inprtlgggggq 
IA*SQEFKSSIiDNMVRPYLSEPP 


7800 


21701 


A 


7860 


355 _j 


86 


yikg*qmvgnrgasqagmtcygmprlil 


7801 


21702 


A 


7861 


336 


225 


PQLAGP VKTVRKVYKKEKARVI PEEKKN 
FKAFASFRKGRANGRVFGIRAKKAKEAA 
KQDV*KKPPIPIAGGS PTREPPSQAHLF 
PQNPPRGNPP 


7802 


21703 


A 


7862 


465 


170 


GGPPPKTHPFGGERGGKYHRGKKKP PGH 
QRVNPPSPKKKKKNWGRGKNPFTPLIGG 
GQAKKP P QPGRGE CP* KKVP PNNSVLGK 
KKNFFPKKKKKKKK 


7803 


21704 


A 


7863 


495 


275 


RS FFFFFFFFFFFFFFFFLVARK* YYFL 
ATKKKKKKKKKKKKKKKK 


7804 


21705 


A 


7864 


493 


70 


PLTAS ISIiPVNWRYIAPHRWKIK* KSV 
Y* KYLA* CLAQSKCPKSVCSHLSS P* LH 
FVPPPNPAEAMWTARQC* CIRGLGGKGT 
LCVAHTYNPSCSGG*GRRIV*AQE FKAS 
LGNSEPLILKKERKKESQAWWLIPIIPA 
F 


7805 


21706 


A 


7865 


67 


385 


VARTTGMQHHAWLIFFIFIIFVETGSHY 
VAQTSLEFLASSDPFTSVPQSTWITGVS 
YSWS*PGIAHTCNPSALGDRGERIT*GQ 
EFKRSLGNIVRPCLYKNNKNKKN 


7806 


21707 


A 


7866 


454 


0 


LSFFFFPLGDRVTLCCPG*SAVARSQLT 
VALNSWAQAILLPQLHKHVPPCLANF *N 
FF*RQRYTMLHRLVLNSWAQVILQPWPT 
CPTAAQA 


7807 


21708 


A 


7867 


1619 


698 


PATSSSSSSSSSSSAAAAVAAAATAASL 
PPCRPPLRPPQPLAPHEVATLAPWRG*K 
PSFTSSGKRWLTSEPQTSGSCASCL*SM 
RASSPSSG^SKKKPPLPAEAAASVAACA 
VCWVRAPPYVAATTAYTMAVMGWMAFP 
V7EVIWTRWRMMSQAIRPLQTWTNSVTAP 
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S 


7808 


21709 


A 


7868 


52 


187 


LDFN*RRCLTKLPRLVLNSWAQVTLPPW 
PPKLLGLQV 


7809 


21710 


A 


7869 


33 


373 


AGVQWRDLGLLQPPPPGFKRFSCLSLPS 
GWDYRHAPPHLANFWLVEMGFHHVGQA 

GLKLPT* gdpptlayqsagitgmshrar 

PGIS VFHLKLHSFDPFKI I FRPGAETMP 
S 


7810 


21711 


A 


7870 


100 


119 


VFVEIVFPEGLLFSPCSFHLCGFRIHLK 
AEF*FFVETGSMLPRLVSNSWAQVILPP 
WPPWL*LQA 


7811 


21712 


A 


7871 


471 


446 


LLMLPRLVLNSWAQVILPPQPPKVU5L* 
AYI 


7812 


21713 


A 


7872 


449 


33 


FSFFFFFFLGKKDLFFSPRGGGGGANSA 
FFPPPNFGKKGSAPPPPPLKSV*NPAAP 
TGVFFFFCGGGA 


7813 


21714 


A 


7873 


334 


41 


PCPSLFFFFFLRQSLTLWPRLECSGVIS 
AHCNLCLPSSWDYK*HCPYLATGAPGLT J 
SQSELSYVTL*LQESRPLCIPVDFMTFL 
PSLCFQS PNSTTL 


7814 


21715 


A 


7874 


342 


716 


STGI FIiFVCF* DG VSTLAQAGVQWPDLG 
SLQPSPPR1^*FSCLSLASSWDCRHVPP 
LLANYLYF* *REGFTVLAMMVSFS* PCD 
PPTSASQSAGITGVSHCAHPGMFFU5WP 
LKSCPSESGAAV 


7815 


21716 


A 


7875 


454 


215 


PPPARCH* PP* * TLVHAVE FKSHHYSL I 
S STQGHKHCGRPQG PLPRKTRDLCSLVY 
LLTFPSLLSYDPAKSLSARNTQB 


7816 


21717 


A 


7876 


492 


176 


PCGGKGAGGNWGWPPWHLRASASSRWAW 
ALQPHTRS SQPAG PTG PRQ * GAQHLGQQ 
LLCS I SHQALAAP PQGRAQDLQPAMPE P 
SPRPPLPWAAQ 


7817 


21718 


A 


7877 


281 


439 


TWSIDL*HMAYFLFFSFLFFAFLSFSLL 
SFSLLFFAFLFL+VPSFSLFQRQSL 


7818 


21719 


A 


7878 


507 


338 


LLRRLK* EGHLSPGGRGCSKPRLHHCI P 
HWATE* EPFS KT INK* N*S*MKMKMKIN 


7819 


21720 


A 


7879 


55 


298 


P PCLAN FLKFFL * RQS LPMLS RLFSNS W 
PQAVPLPRPPKRNQLYLYFGCCSQLCNP 
QHLCSKQHTFKILFDIEKHFRLREH 


7820 


21721 


A 


7880 


145 


292 


YFGGKQIYFIFCRDRISLC*PVLLTSLV 
SNSWPEAILPSQPPKVLGLQA 


7821 


21722 


A 


7881 


3 


203 


LSLLPRLECSGTFTAH*SFKLLGSRDSP 
TSAS * IAKTTC34RHHAQLRFFFFVPGGV 
CVCVCGCVYV 


7822 


21723 


A 


7882 


324 


211 


AWWLMPVIPTL*EAKMGGSLEARSLRPV 
WAHNETPFLF 


7823 


21724 


A 


7883 


2 


124 


QHFGRLRQVDHLRLGVHYQRGQHGETPS 
LLK* INVYI YMT 


7824 


21725 


A 


7884 


367 


141 


KCWDYRREPPRLAQGLEFLFNFCFEIVT 
EPPCPALAYF*NFL*RWSLATLPRLVLN 
SWPQAILPRWPPKALGLQE 


7825 


21726 


A 


7885 


471 


215 


RGMCVSHI FLCVGARYNISSARESLLKE 
FTECTERKHTHTRAHTHARARTHTHTHT 
HTHTHTRV * NSRSQPYCRVHACS PAYLG 
G 


7826 


21727 


A 


7886 


462 


204 


RHSS*LGLPKCWDYRHEPPHLACISSF* 
RKSDISLSDELGTKCHKWWPQWKSRAG 
SVGRDPCSGHFILAGFGQLGA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
EXJIutaraic Acid, F=Pbenylalanine, 
Glycine, HhHistidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknovrn, *=Stop codon,/=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 


7R97 


21728 


A 


7887 


409 


152 


PVFLALERPNQQDFLRPGVLDQP*QNNK 
ISFFFFFFKKKKKYIYICMVVHAYNPSY 
SGG*GGRIT*AQKFKTSLNWTVRPHLYK 
K 


7R9R 




A 


7888 


1 


152 


GVHVCCIK^CICTCIjCWGVHECI^CV 
SK*VHMCHREGTIFGNISWEGD 


7829 


21730 


A 


7889 


101 


399 


EI INI VFFF YFFLPLLFFFFWGGRFKKQ 
LNPEFLTNSWKKNHKTIGGKNPMGG*KK 
AITNPQKKGRKRSFSF*NKAQPTQGKNG 
QNSQGSKAVGALTAH 


7830 


21731 


A 


7890 


478 


142 


KFSKKKKRGVLNFYKKKAFFSKPPKPVF 
KKKKLLKNPFFKAKKQFFFKTPTRPLFR 
EKFFPGPPKKKKKRPPGLFKNN*KGPPW 
PREKKRVKKKKKKVKSHYYNNHMPISVH 


/OJi 




A 


7R91 

/071 


497 


320 


PPWKFEFEFWFS P * LHKYYPP YVCVCVC 
VCVCVCVFDFRFFKPAPSPQSHFGINLS 
FLL 


7832 


21733 


A 


7892 


217 


23 


I PP FKKF F* EMGSHAVT * ARWQWHNHA 
SLQPETCGLKGFSHLTLPNSWDCRSVPS 
SSARRQSH 


7833 


21734 


A 


7893 


652 


470 


SSSSSSSSSSSSSSSSSNSSSSSQFERK 
TRSAPQVEGKGPNFCSLEAPPPGLSLFS 
CLNF *RSWDPGPP P * PHATPKTRAGGDL 
LCATHHRARHCTPGLRFPLFLAL 


7834 


21735 


A 


7894 


25 


178 


GILFFWRDGGI1VII1HRLVSNFPVSSDSP 
ISASQSWDYRCDPPCPACPLIFW*LLKH 
SF*FFWRDGGLVIIiHRLVSNFPVSSDSP 
ISASQSWDYRCDPPCPACPLIFW 


7835 


21736 


A 


7895 


104 


451 


ILSFMTTWMEAEDIMLSEISQTQTDRYC 
MILLVESKNVNIiRPGMMAHAYNPSTIiGG 
*GERIS*GQEFETSIiVNIARHYFYKRNF 
KIKSKFKKIVRKSQKKNVRRWLLEARNK 
GRYG 


/ojO 


917V7 


A 


7896 


510 


335 


SCIVSLHRSWDLWHVPPFLANF* IFRKG 
GVLPCCPGWQIYTFQLQIPFIiSITPVAF 
SG 


7837 


21738 


A 


7897 


37 


260 


AGNSQNQGGGACSE PRWRHCTPA* ATER 
ESVS KWKQTKFLCS VPYFES THFHCTNE 
ATGAWGVEKVALGHAASW 


/Ojo 


91739 


A 


7898 


35 


466 


THTCGGIRKKLTNKRENQK* SQQKPSAP 
KEPEVKNKQKKTKNKPKKKKKTPQIQPT 
KKPFRNPNLRVFTFFVTTQNKTPTRGRK 
TFQKVKLFPPNPCLGNKPQTLLPRGGLQ 
KKKKKIPFFYSHPPPLAQKKKATPWQNG 
LGFT 


7839 


21740 


A 


7899 


494 


149 


PARI*I»SIjHSYEVKSVLPKEEG1*LNFFVQ 
SVTAHPTSCIGI»EEI*LLD 


7840 


21741 


A 


7900 


494 


114 


gaeggplskkkkknllpdyhtgpmslsl 
ylwpflsnghtlti^cahththththtn 
h*i£rvglpvlkpgqyqangnrwsp*yn 
ike*prktrvmpvtpqlrndgpmltfwc 
ssrnsfamhvlgkl 


7841 


21742 


A 


7901 


19 


454 


VWCNSSVS IHTHTHTHTHTHTHTHTRSW 
VGWHMPIVI1STQEAEAGRLLGSGSLRIQ 
RAMIVPLHSSLGNAARPCV*HTGSCCVS 

qagvqwhnhsslypqtpgpkqssslsll 
sr*dyrhvpphpascvcvrvcvcvcvcv 
cvygdrgitph 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

pd 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

itiiJinu <iv-iu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

A inn fn 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glu tarn ic Acid, F=Phenylalanine, 
G=Glycine, B^Histidine, IHkoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
v=\jriuiamine, iv— Arginine, 2>=oerine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, \=possib!e 
•nucleotide insertion 


7842 


21743 


A 


7902 


3 


393 


DAWDPLNPLVNLFVSPKRNSS LDTRKKP 
CRBS KKFNTHSRPKSSHQLRKRSSSTPT 
T*KI PNI *LNSSTPSRAKKKRKKKPKKK 

LGGAKFFGGRKEKIFFF 


7843 


21744 


A 


7903 


424 


139 


LFFFFFFFF*G*GFVIFCYVGQVGL*LL 
1 olsLIJUrll bAoy £> AaXKlj VbHHAJr FJjSXjX 

VVFISQCVGILNHTVHLQYIQCSFVHHF 
LINQEKIDIIK 


7844 


2174*5 






i 


lol 


oy w Jj X Jj unna lir X u I * 1 oGQ PGSY P FT 
IIGQGASGLYLTTILILIPTICLIESKI 
LKWA 


7845 








•> 
j 




nAiAHAbAlrr Vb i LrHGJbriDIMGSAFli 
TMCF I RQLIFHFTS KHHFGFQAAA* YWH 
FVDW* LFLYVS I Y* *GAVLKEPWGGQS 


7846 


21747 


A 


7906 


1 


84 


PTRPLTMLPRLVSTPGLKQSSQIiSLPKC 
*DYRCEPACTAYSF*LTMLPRLVSTPGL 
£> QiiS IiFKC 


7847 


21748 


A 


7907 


3 


392 


DPDNYTLANPLITIiPHIKPB*YFLFAYT 
iLRSVPNKLGGRLALLLSILlIiAIIPIL 
HIS KQQS 1 1 FRPIiSQSLY* LLAADLLFL 
T*IGGGPRSYPFILIGQVAFVLYFTTIL 
IIiIPTISLIGDRIXiKWA 


/©■to 




a 
-rt 


/yuo 


*> 

L 


231 


WJVE KfarTLjSHC * WE CKLVQPLWKTVWGF 
LKBLKIDLPFDPAIPLLGIYPKKNKFTL 
GPLHSVGRSSRKLPGYSPRV 


7849 


21750 


A 


7909 


529 


297 


KIFIGAPVFCPPPHFFFLIPPF*GVEKI 
FFSFFVLIjPFGFFFLFLGLFFFFFKLFF 
FFFFFFFKXPFFFSKKVGVGK 


7850 


21751 


A 


7910 


410 


61 


FNNQAMQWFPPFPLKIFFFP *RV* FLE 
GGWSNLSPPPK*GPSPKFPTRVL*GPP* 
KyrJS * a Xxjc * ct WAr HKr r *RAAPFFFF 
C*DRVLLPRLVSNS *AQVIHPPWPPKVL 
GLQA 


7851 


21752 


A 


7911 


512 


314 


ANFC* IiFIETGFPHIGQAGLELLTSNNP 
fc J /\ioAoy IALjI rGVNURPGQDFSSYHSEN 
PLVYGSIiYIA 


7852 


21753 


A 


7912 


1 


264 


SSTTTLl^RT*im,T*LTPLIPSTLI.S 
IJRGLPPLTGFLPKWAKKKKKKKKKKKKK 

KGGP 


7853 


21754 . 


A 


7913 


416 


129 


FFFFFFFAPPPKIKTPPFFFKREVYIY* 
ALSVFQSVO®iYIHFRIISSQYPC*LG 
FFFFFFFFFFFFFLKGSLDLLSAYCVPG 


7854 


21755 


A 


7914 


397 


34 


IFSNLFFPPQKKMFFPPPPLKYFFFSFT 
ALFFFLVFFPFFPPPKKIFFFNNPPKFF 
FYPS *KKKFFFFIPVYFLAPPEFFF*AP 
rrrrrr rrprrrrrrrcfrrrfj? 1* r E r J? 
FSVSKQPLFCLKI 


7855 


21756 


A 


7915 


396 


86 


IFLSPRK*GFFSPPPP*KFFFSPKPLIF 
FGGFFPKFPPPKKNFFFQNPPGVFFYPP 
FKKKKFFFPPPLNFGPPRVFF*RPPPFF 
FFFFFFFFFFFFFFYLWVC 


7856 


21757 


A 


7916 


351 


112 


SFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFPI*KYVFIYISLSLIPFCKFLK 
GRECDI HFCDIQHS YSKYLGHFKCSTND 
RQMNK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ1D 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glyrine, H=Histidine, I=Isoleucine, 
K=Lyslne, L=Leucine, M-Methionine, 
N-Asparagine, Ps=Proline, 
Q^Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X— Unknown, *=Stop cod on, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7857 


21758 


A 


7917 


445 - 


2 


FFFFFFFFFFFFFFLSYF*LIHYAEGIG 
ASPCYIMLGYNFSS FPCGTIS IAPGFNF 
YRLYFISHASADAWVDPRFLDRYRRGIN 


tOJO 




A 


7918 

/710 


2 


407 


PPPPQVYFNIWRTIFFGBGIiLHIFPPQK 
RFCFYKTYTNYIKTLFIKKKNIFSLAKI 
KMVPPRSIY*TPPPLFFFFFFFFIiFFFF 
FFFFFFFFFFFFFFFFFL 


7859 


21760 


A 


7919 


107 


11 


KMWYIYTLECYSALKKKKE1LLFD1TLI 
NIjKDIMLSKISQHRKHTHT * FKT 


7860 


21761 


A 


7920 


1 


70 


FFFF* AAESCSRIKCSGTI IAQCNPELP 
GLSDPPTSAT* VAGTKGVCHHTQLRFI F 
I V*LSGRI LliQDQVQWHNHSS V 


7861 


21762 


A 


7921 


123 


260 


GIQARKEPGAVAHAYKLSTLGG*GGWIT 
*GQELETSLVNMVKPCL 


/OOZ 




A 


IzJLL 






LHLLMRVLKKKKKMGGPLYRS PRGAKVN 
PALQRLDSIiLIGSRILSNLGIFGKAPHS 
WGG*AFFAATGLHGLHVT IGSTFLTIGF 
IRQLI FHFTSKHHFGFKADD* YWHFVDV 
A*LFLYDS I Y* *GSRI LSNLGI FGKAPH 
SWGGTKWDNPPPDMKSS 


7863 


21764 


A 


7923 


2 


407 


GRVGMI * EAFAS KRKVL IVEE PS INLE * 
LYGCPPHPLHTSNKKPPHTKKKQKKKKK 

NTGGKKKNFFFFWGGKKKNLGDFLKKNI* 
FWGGGKIGPTPPQKKNPLGEKKF 




own* 

Zl /Oj 






1 




PTRPPTRPTSSRSRAPPFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFVSAALFFGKKKNFFFFF* FFRG 
*GFYFFP PLYFFICGGGGVFFFFFFAGW 
NIFFIPPKKNGIFKTLRGKGG 


7865 


21766 


A 


7925 


2 


335 


NLTFFPQHFLGLSGMPRRYSDYPDAYTT 
*NILSSVGS FISLTAGILI IFMI *EAFA 
SKRKVLIVEEPS INLE *LYGCPPPYHTF 


7866 


21767 


A 


7926 


1 


328 


RTRGRTRGESNPSIiRIQNSPCHIjSHPIIi 
KKKKKKKKKKKKKKKKKKKGGPLKKNPG 
GAKI* PGPKKKNLFFKGGLKKTPIiGNFE 
KKPFFGGGKKRKKPPQKKKTLKEKKKF 


7867 


21768 


A 


7927 


27 


226 


I*SR*KM*NKSHIYSFEKKKKKKKKKKKK 
KKKKKKKKKKKKKKQKKENRPKNGDSEN 
GHPKSKIFWL 


7868 


21769 


A 


7928 


398 


2 


GGGGGPGGRGRP* *NPHKKEGRDPPHPK 
KKFFPRRRKNRGGGGREKS PPQKKKAPQ 
KKPGRGFKRAPQKKKKTPPPPRKKGPPP 
KNFKKGAPPFFFFFFFFFFFFFFFFFFF 
FFLGILARTTTTTNDMKNHR 


7869 


21770 


A 


7929 


3 


156 


HASAHASAPVFV* S VLMTAVLLLLSLPV 
IiAAGVTILIiADRILNTTFFDPA 


7870 


21771 


A 


7930 


405 


249 


ASAHASAPLFV*SVLITAVLLVLSLPVL 
AAGITILLTDRNLNTTFFDPA 


7871 


21772 


A 


7931 


921 . 


699 


DHPGQHSBIIiSLQKMKTMSQCGGAHL*S 
QVLRRLRREDCLSPGSQS CSEL* SHHCT 
LACETE * DLVFQKKKKF 


7872 


21773 


A 


7932 


423 


259 


HDLGSLQPLPPGFR*SSYLCLPSSWDYR 
CEPLRPAQRRGILMAATSRILTRSKGW 


7873 


21774 


A 


7933 


395 


35 


PRVKRFSCFWLPSKWGPRPRAPWPGYFC 
FFC*NGGSPFLPGGSWTLAPNYSPPPAP 
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NO: of 
peptide 
sequence 
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Predicted 

beginning 

nucleotide 

location 

correspond 

big to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
OCysteine, D=Aspartic Acid, 
E>=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamrae, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y«Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \-possible 
nucleotide insertion 














QNGLVQALRPPGPPTFPFL I FPTKAPWS 
WPFFGGKKKKKLSPKGHSERHLHHKYIR 
LLSQHGGM 


7874 


21775 


A 


7934 


1 


118 


NF*IFCRDWGLALSPRQLSWSWPQVII,P 
PQPPKVLGLQA 


7875 


21776 


A 


7935 


322 


383 


KKKTGLI KISTLCFVKNTFKKTKWQAPE 
RKKKVPIFLSYKRPVFKIYKKLFPFCRK 
KIKKKIPPPGFTSFSCLSLPSSWDYRHP 
PPCPVNFFCIYNKNGVSPG*PG 


7876 


21777 


A 


7936 


3 


131 


GFHRVGQVGL * LLTLGDPPASASQSAGI 
AGLSHRSQPLTWSL 


7877 


21778 


A 


7937 


334 


146 


HPQGLNI *NKMVSSSGAKHSSRLSLPKW 
WDCRQEPPCPVIMFLFCRGINHALYSPSR 
KTJiHFL 


7878 


21779 


A 


7938 


2 


371 


SEPGAG YNLLVGRF *S PSEKCS IGVGVT 
RFSRCCLSPLSLTGKGNSLTPCASRVRQ 
CLALLWLVQGVLHPLSCTHCLALPSEMN 
PVPQMEMQKSPIFCIADAGSCRPELFLF 
GHLGSSPQSKL 


7879 


21780 


A 


7939 


2 


95 


RLNLGGGGCCEP * S CHCTPAWVTKRNS I 
SKK 


7880 


21781 


A 


7940 


1 


191 


PTRPSQTPGLK* FSHLSLPKSWDYRCEP 
PCLAVTLS I PQNSSMPSNIFAAS YSS * F 
GLYSSRW 


7881 


21782 


A 


7941 


127 


383 


SKDCVRIVLIJIAQAKAJSSYRTVNWCKPG 
AVAHTCNHSTLGSRGGRIA*GHEFKTSIi 
GNIVILPLYKKNKKKKKRGARLKEPNLT 
P 


7882 


21783 


A 


7942 


235 


3 


KNILPGYFCFFIiREKPPPPGLESLFSKK 
KKKKKIiARHGVAPLLGRMR* EDRLRPGV 
QGC3JELWWCHCTPAWATEQNP 


7883 


21784 


A 


7943 


3 


225 


EHGSh YPPTPGLKQSS YLSLLSR+ DYR* 
AAPCPANFFI FNFL* REDLTMIiLRLVSN 
SWTQAVLLPPKVLGLLV 


7884 


21785 


A 


7944 


2 


232 


TLLPDNLSQTIYPNKKKKKKKKKKKKKK 
KRGG PFKKXKF * TRGGGKKN FFFRAPKI 
FFGGRVLKKGGGKKPGEPTN 


7885 


21786 


A 


7945 


394 


41 


WCRWLETWGAGGSGAVPPPPFFPPHSSS 
WDLALQGSGASS P FS PGAVAHACNPSTL 
GGQGGW IT* VQE FETS LANMMKPHLFS K 
KGPVTAVAHPAKAHRAMWCTPTHMHTQH 
VLHVL 


7886 


21787 


A 


7946 


405 


100 


FFFSPRFSSSLFLFLSSFFFFFFFFFFF 
FFFFFSTGLFVYWSPTH*SLPVPRIFFF 
S YLWQTEVETMLGTE 


7887 


21788 


A 


7947 


391 


212 


KEMQIMPAMR*HLTPVRMVITKKSKAKC 
WQGCEE KGTLAHS WWKCKGTAETGKLAG 
F 


7888 


21789 


A 


7948 


106 


301 


LQLPITQLVWHSIYFMKKIRYRLGWAH 
T YNPSTLGGQGGR IA* GQ VFKTS LSNTA 
RQKKKKKKG 


7889 


21790 


A 


7949 


342 


208 


GLGAGTHTYNPSTIiDN*GGWITSGQEFD 
TSLANMVKHHWQTGQF 


"7890 




a 

n 


/yjo 


391 


21 


Kg^fr'FMLiPKljlSNS * PKGSAHI1SI1PKCW 
EYRSEPPSLATWYVTLSRHFCSTLLLGQ 
GRSKFHMKWESRLBGLLAKVSSLRSSYC 
SSLRILGLEILLSPLLFFLETGSGSVSQ 
AGVQPLCPGLK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acia 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~AJantne 
C=Cystetne, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoteucine > 
K=Lysine, L=Leucine,M=Methionine, 1 
N=Asparagine, P=Proline, 

1^ — WlUtdllllllc, IV gin i uc, a - oci mc, 

T=Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, 

j\. — V ii kjiU wiij — oiup cuuon, / — pusoiuic 

nucleotide deletion, V=possibIe 
nucleotide insertion 


7891 


21792 


A 


7951 


146 


■ice 


LGWAHACNPSPL*G<^RRIT*AMEF*T 
SLDNIARPCLYQK 


7892 


21793 


A 


7952 


289 


327 


KGI PPLPPRLKDKGPFFFFFFIjKRESIiF 

i_ur c w^w\jwI\.Oa xr uc r r c c jjiwij v o uur 

RLVSNSWP * VTLLPQPPKVLRIiQA 


7893 


21794 


A 


7953 


861 


586 


DRVRSVAQARLQWHDLGSLQPPPPGF*Q 
FSCLSLLSSWDYRRMHTWPFVF*WRWGF 
TCVAQAGLDLLSSDNPPALASQSARIIG 

DOLL V JblilV 


7894 


21795 


A 


7954 


402 


52 


PRKVS IS * PHDLPASASQNAGITGVSHR 
TRPSFNFSYIETAITETRKSLQPSPGMN 
♦SLPSGMVGFKPHSFIHSANFCEALAVQ 

yi JjuftVrutUUr JJO ir JrXvSJO c IX. V ^ tr JXIX v \— Xj 

KTALK 


7895 


21796 


A 


7955 


259 


3 


YNTKVLS FP I LL* TFLKHLWLGAVARAF 
RLYKNLKLFLESGLFU«AWRGAKRVRTR 


7896 


21797 


A 


7956 


171 


386 


LPIRLFSLVFTTIEKSRPGMVAHTCNLS 
TTiKG * SGW IT * AOE FETSLGNMAKLSLP 
KIQBSASLITRSQNK 


7897 


O 1 TOO 

21798 


J\ 


7957 


5y 1 




POPIjP PGIjKOFS CTiSLPKC * D YRRDS AR 
PAKKNPTFNK 


7898 


21799 


A 


7958 


169 


324 


GTHRVEMCSQWLG WAHACDPS PLGGQV 
RKIT*AQE FGTS LGNIGRP CLYIK 


7899 


21800 


A 


7959 


290 


397 


DCSSCCSPLKGR*RG*AQWLTPVIPALW 
EAEAGGS 


7900 


21801 


A 


7960 


496 


298 


j_iox\i vjoIjI vftyo v v iwur ivjjl x n v riuoij 
TTFQPDVYRL 


7901 


21802 


A 


7961 


404 


174 


PGPQKGSFF*KKIOCKKKEIDHILSHKAN 
UJKC*RIKII*TMLSDQNEIKLBIIIKR 
OT .T . KPTT j FT .FMKfTYT X RKLS 

\J* f J. XJX XJJ-J 1 liVlV. X X JLXVXVUW 


7902 


21803 


A 


7962 


25 


99 


PGLFLSS*HQVILLPWPPKVLGLQL j 


7903 


21804 


A 


7963 


436 


106 


GGGKS PPPGNWAPPGNPGE PLFF* KKKK 

GFYLTOTPPPPPPWGKECKNFFFKKKKKK 
KKKKGRIAICYIILFITSINFKKAFI 


7904 


21805 


A 


7964 


212 


395 


ERERKERDRDGEREREREEREEGRKKHG 
GR*GGREGEPJ1RPRGAEDPDRAGRATDK 
RTGAL 




ZlSUO 


A 
r\ 








FSDCYKRSSCASWTWYTRLAHHQPPARP 
GAAPSPRKRWVDFAVNRNFPSRRHPCAT 
PRPAPGATGRPSRVPKSTAFTQMLTQPR 
DPHIjPLP I PPLSQP PPSP* PAPLVMR 


7906 


21807 


A 


7966 


397 


298 


RGFTMLPRLVLNSRTQVT *LPWPPTVIiG 
SLSIF 


7907 


21808 


A 


7967 


92 


268 


IKMSLCPNFFFFFKKQNVGPRAGAYPCN 
PTILGGQGPJaT*GQDLKTTLVNMGKPH 
FY 


7908 


21809 


A 


7968 


257 


107 


RTHTHTHTHTHTHTHTHTHTH^ 
TKNIANNCS * LESQRNNGKCTYAKF 


7909 


21810 


A 


7969 


413 


12 


PPPPPLLTPWPCFFPQIL*PVGHSAPPL 
FK*LKIPRPFFFPPPPKGIGAPPPKIFF 
PPGFSLPPFVKLPPGEKILFPPPKKKKY 
PPPPPQS*SF*PPPPPPFFFFFFSTSAK 
KSKKNFLFIFSPFFNPKYYNT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Phenylalanine, 
G=Glycine, fiNHistidine, I=lso leu cine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
0=Glutamine. R= Any i nine S=55prin#» 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X==Un known* *=Ston cod on /=nos«jihle 
nucleotide deletion, V=possible 
nucleotide insertion 


7910 


21811 


A 


7970 


399 


212 


TGVQWSNHSSLQSQTPGLKRIiFLLILLS 
IWDYGCMPP YLS FKN* FCTKGGSCYVAE 
ADVNIL 


7911 


21812 


A 


7971 


584 


424 


RPRRENRLNPGGGGCSEPRSRYCTPAWV 
TE*NS VSNRNKTNQNPI FFSCKQFD 


7912 


21813 


A 


7972 


105 


i 
■ 


PSMPOLS HL * NG I LGRVRWLT PVI P AI.W 
EAETGRS 


7913 


21814 


A 


7973 


363 


AO 


SSSLGGHPQLWFHTLKLRPVTVSYA*NK " 
LCDKIEKKLFFTIALHNKFLRKKLTRDV 
KILFNNK* IKKEGPNEF*NILCIiWIERI 
NIKKVPYPSKVMYRSNTLPIEVAVFKSQ 


7914 


21815 


A 


7974 


404 


250 


FFFFW*R*GFTMLPRLVLNSWTQAIHPP 
ALASQSAGITGMSHHAQPLS II F 


7915 


21816 


A 


7975 


2 


60 


FSCLGLPFCWDYRHAPPHLANFCRDGVL 
NY* PQVIHLPQLPKVLGL* ACTTTPG 


7916 


21817 


A 


7976 


325 


81 


KTIHSLLFGQFFLLQPLLPSPIPHAAPP 
TPHLIFLIjLLFCF* FFK* RQGKQGLTTL 
PKLISNSYSQVTLLPQPPKVLGFQA 


7917 


21818 


A 


7977 


406 


282 


QWRDLGSLQPPTPWFK* FSSIjSLPNSWE 
TKAGRS REPGGGD 


7918 


21819 


A 


7978 


382 


221 


DCIISASYLQKNFFFFLGDKVIjIiCHPGW 
S AVA* S * LTVTSKRVHFYSSE ISIiY 


7919 


21820 


A 


7979 


423 


29 


flw* rrgftslprlvsnfwaqgiclpwl 
pkvlglqa 


7920 


21821 


A 


7980 


3 


—JO 


STjAFFVETGSHFVA*AGLELLSSSN<1 PA 
IASQSAEITGVSHHTQPE*GYSHM*PQY 
PYQLRFNSDTI ILSNVLFVPNF 


7921 


21822 


A 


7981 


324 


128 


WLSVS P YTFELS LLVGYE FVIHDVI * TL 
IFWWWSLFCFLWFFLVOIVFWFFC 
CCFLSVFCI 


7922 


21823 


A 


7982 


I 


ou 


*AQAICLSWPPKVLGL 


7923 


21824 


A 


7983 


1 


137 


RPANFCI FFVETEFCHVAQAGLELiLGSG 
DPPTSASQQAGTTGMSH* D* PANFCI FF 
VETE FCHVAO AGT.ELT/3SGnP PT<> A «;on 
AGTTGMSH 


7924 


21825 


A 


7984 


2 


277 


PRVRS VQRVI YQYV* NLQ IHVPFDSTNL 
GIYLTDILPHVWNDNMYKVFAAVSFVIA 
KY*NPCLTRRQCNKLWPIHAMEYYVTIK 
KNEDGRA 


7925 


21826 


A 


7985 


211 


42 


GPQKWPRGPGENLGGFFFFFFFCRDDGL 
TMLPRLALNS*VHAILPPQPRKVPGLWV 


7926 


21827 


A 


7986 


409 


183 


LLRLK* SSHLSLPKWWDYRCE PPCPAI F 
FKRKQILKESFPGENTCCHLFNSLEVLK 
NKRRTCYNFPNTLAVLKNM 


7927 


21828 


A 


7987 


2 


194 


RFFETQSRUjQPPPPQIjK*SSHLiSLQSS 
WDYRHVPPCLANFFLFWYFLQGQGFAFL 
LKLKFLFK 


7928 


21829 


A 


7988 


483 


209 


SSPSLGNFCIPSSGGV5PCWPGWFQTPD 
LVIWPPGPPSLVFFFFFFRDEGLSMLLR 
LTLNF * AQVILLPQLPE* LGLQADRQVP 
LYPAEVF 


7929 


21830 


A 


7989 


164 


29 


KFWFWLGTVAHACWPSILGSQGRRIA*A 
QELEPSLGNEGRPCLYK 


7930 


21831 


A 


7990 


3 


474 


PTRAPLRAPFETIVQEEEFSKHHFGFEA 
AA* YWHFVDW+ LFLYVS I Y* *GS PPPP 
PKILIKKNGAETKKFFF 


7931 


21832 


A 


7991 


469 


323 


VETGLHHVAQDGLELPTSGDSPASASQN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D= As par tic Acid, 
E=G!utamic Acid, ^Phenylalanine, 
OGIycine, HNHistidine, 1-Isoleudne, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine t R=Arginine, S=Serine, 
T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IGI TGVSHHA* PDFTFLEQF 


7932 


21833 


A 


7992 


469 


13 


KGDTVSSSDSS PSCGGLFPVGAS *LLCL 
FTQALAMVGARRPFSMLPCSLISDCCAS 
NERGS VGMGPS KPGVGYS LLVCRWLRPL 
EKRTIKVGVTRFSRCRLSQIiPIiARKGNS 
LTPCVFWVRHCFGSHSVGCTHCPAPAVR 
QAPVK 


7933 


21834 


A 


7993 


15 


439 


IASGRPFSIKFFLVAITFLLFDLEIALL 
I*PLP*AIK3TTNI,PLlVMSSLLLIIIIiAL 
SLAYE*IjQKGLD*TEPNKKKKNKKKRQE 

GGKNMFFFFWGGEKKTPGGFLGENLFLG 
GGK 














7934 


21835 


A 


7994 


392 


186 


YFYI INNFFFFKKI FI * * 1 1 FFFLKKFL 
*YSPKKVFFLFF* IFFFFFPSSSLFLFF 
SSSS PFFFFFFFFFFFFFFFFFLRG 


7935 


21836 


A 


7995 


11 


386 


TWEVEVAVS*DHAIAIJQPGQQE*NAISK 
KKMTGRMFIEVSF 1 1 ASSWKQLKCS * TG 
E*MKSLWYTHI IEYYLTIKENKVLEHEV 
SRLTLIDTGRERS KTKEYTLYNP IYTNL 
RIAKLSGCGGSHL 


7936 


21837 


A 


79% 


3 


186 


DSFCF*RQSCS ITQAGVQWGDRGSL* PQ 
S PGLKRS SCFS LP KHWDHRDE PLHPAGV 
AVLN 


7937 


21838 


A 


7997 


316 


649 


RDHS FFFEMESRS VAQAGVQWHDIjGSLQ 
ALSSGFMPFSCLSLPSSWDYRHPPPCPA 
NFLYFLVEMGFQHVGQNGLDLLTS *STR 
LGLPKWLGITGVSHWHPAQDGGFHHVG 


7938 


21839 


A 


7998 


468 


336 


RRGFTMLARMVS IS * PSDMPASASQSAG 
TTGVNHHIRIHRSLL 


7939 


21840 


A 


7999 


464 


283 


TLCGG I RPTNI MKGNLLYS KS I GLiNVSL 
LPKNPRRNIQNNV* PNTWALQPSHVDI 


7940 


21841 


A 


8000 


46 


487 


RQGRLSLQKFLLP FVQLCPAHRGGVYRG 
RQASLSCGGLHPVRVSQPLCLPT*VSAM 
AAAPHPASLLPCSLISDCCASSEQGSVG 
VGPS KPGAGYNLLVCHLLRIiLEKRS IRV 
GVSQFSRCHLSWLPIiARKGNSRTPCTFR 
VRRCLAL 


7941 


21842 


A 


8001 


295 


1 


TQPWGTKRKLPIiKKKKKNSGDREWRWLC 
NSVTILNATTMYTSEWLRPGTVTHTCNP 
KTSGG*GGEIA*TQEFKASLGNIA*SWL 
YKKILKIS PLWWQT 


7942 


21843 


A 


8002 


144 


317 


ELFYLKEMV*RPGPVAYACFPSTIiGDRG 
GRIT* AQE FRTDLGN IARPCVYLRKKKK 
RG 


7943 


21844 


A 


8003 


69 


291 


HCVWTMI FIVIKYFYLNPQNNSMRYYYS 
HFMDKETDVQSD*MTC*RLGAYAHACNP 
STLGGRIA*GQBFKTSIi 


7944 


21845 


A 


8004 


3 


176 


IVAHSSIiTPGLKQFSCFSLPSS * VYRNM 
LPRLAWY* FFGTNRVSLSCQGWS*TLAS 
LLGSSSS PASACQVAKSTGTCYHAWLII 
DFLVQTGSRCLAKAGHELCSCL 


7945 


21846 


A 


8005 


419 


178 


NPPPKKLFLSSSSSSPLPHLNVIGTPGFF 
PPPPF*NPPPEFNFGAPBCKKKKLSPPRA 
EKLVPFKGPPPFFFFFFF 


7946 


21847 


A 


8006 


429 


225 


QPLPPRFK* FFCLSLPRR*GHRHRPPHP 
FNFFFF* YFLRWS FVLVAQAGVQCMDLS 
IYLNBFWLLDF 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etfa 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acta 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
(Xjlycine, H-Histidine, I=Iso leucine, 
K=Lysine, b=Leucine, M^Methionine, 
N=Asparagine, P= Proline, 
Kl— ijiuiamine, K— Arginine, o— oenne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7947 


21848 


A 


8007 


756 


224 


PPRHQPQHVPPPLLHFSTPSPAPPKSHA 

RGATPPFPPPPFSFLAAVGRAP I FPREP 
LLPSSSSPPPPP*IGEPPGFSPPPLLKT 
RPPK*N*GPPKKKNFFPPPRPKNWFL*K 
GPPPFFFFPFFFLRQSFTLVAQAGVHKT 
Klin bl\J\J\oW 


7948 


2 J 849 


A 


8008 


170 


559 


SSHLSPQSSWDYRSGMVAHTCNPSMLGG 
QDGRLT*AQKFQTSLGNIVRCHFYFKKY 
FKNKIFFySPFINTNPKKKKKKGGAVLK 
NQSLRPRAGKVLVFLWGHLIQFRGPRFK 
NGGTGKPGGLPNLIiAPR 


7949 


21850 


A 


8009 


1 


662 


RTRRTRS WWYMP VLP LSHYGEAGESLG P 
* KWMVD * AP IAPLHSTLGDKMTLPD I KK 
KNCjBTVANTS PIjSTTAKPAKLFIjKAKKD 
QLMRDLFLPKTPKKPQMSTGLOARSKRW 
LKI I WRRHGIWPLKNIGPTEDVQASAHG 
GVEBNMTS DIB I PEAKHDHRPTEDVQVS 
AHGGVEENITSDIEISEAKHDHHLVEDI, 

o COT G^TfT PnTTMTTPOnMgPTPT O 


7950 


21851 


A 


8010 


377 


2 


NFFFFFPPFWGFFFFUGPGFFKVFF* IF 
frt ifjSKNb'k'rc^b&Lfavyijb FiLK.* r XFFr 
PFSIRFCFFPKKFFFFLILFFFF*FPFF 
FFFVYVFDFFFS F ItiYI FFFVYFSLPFT 

VTJTQT.TVHPliDa 


7951 


21852 


A 


8011 


1 


363 


PTRPLF* *RGCFTMLPRLILNSWPQVIC 
PPRPPKVLGLQT 


7952 


21853 


A 


8012 


278 


39 


FXXXFFLSPPXFLI FFFFFFFFFFNFFF 
FFFFPPFFFFFFFFFFFFFFFFFFFFFF 
FH*DQ*FSNFDFTPFTFTLISNII 


7953 


21854 


A 


8013 


79 


412 


MNECIIiGKGGKDGGFFEQCENLGLC*FL 
KVP *DSD I WLGAVAHAYNPSTLEGQGER 

o 1 "AvjEfr JS.1 KiAjlMrl 1 Kir^ljxy KMJLrlivKJv. 

TKKKPHPGLGWGAWGPTYLGGVGTKIT 






*\ 




1*7*7 


in 

ly 


rVjWJLrtNr r uiuoKov iyAlUOy WL-Nrlool-i 

*PQTPVLKQSSHI»SIiPSSWEIEAAVSQV 
RITALQPGRQSKTLPGNSGPVLRADARR 
SI 


7955 


21856 


A 


8015 


1 


130 


ARIVSIS*PCDLPASASQSAGITGISHH 


7956 


21857 


A 


8016 


4 


417 


RLMPPHLANFLF F * RHGFTYVAQVGLKL 
LGSNDPSASTSQSTGITDVSHCTQPHLL 

TACRALLPCPLPSACFPSLSQVPSVFTC 
CVPATLT I TTiL P LLRA FEHVTL C PL F 




4.10JO 


74 


owl / 


19.9. 

JOO 


135 


VLI *GEI YNVCIS I YLCIYMCVCI YISV 
CVCVC I YVH * TP YL T HK!C!L I 


7958 


21859 


A 


8018 


154 


382 


GIKRPGAVAHNCNPSTLGG*GKWIG*AQ 
EFKTSPGNMVKPQ I YLKKKKKKGGAVLK 


7959 


21860 


A 


8019 


396 


237 


FFFFFFW*G*GFTMLPRLVSNSWAQVIH 
LPWPPKALGLQVRTMVPGLRLANF 


7960 


21861 


A 


8020 


412 


168 


GNLLGS PLPGNV CKNKPFILF * KKK*SL 
AVLPKLVSNFWP*MILPPWPPKI*GVQA 
RAPPPSQILKKKKSSHFYNSHVNSL 


7961 


21862 


A 


8021 


1 


124 


TRFLHVSQAGL* LLTSGDLPASASQSAD 
YIFIISPAKIIFL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


ML 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

aminn Qf»iH 
H1LIIUU 4UU 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

rp^IHup of 

IC91UUC Ul 

peptide 

SClllllilllpb 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIme, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valiue, 
W^Tryptophan, Y=Tyrosine, 
X^Un known. *=Stoo codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


7962 


21863 


A 


8022 


3 


200 


HLSP YFYFFKDRVALCHPGWSAWQ* P I 
VTS IFWSSC* KLTS WRRWRPSWLTR*N 
PVSTKNTKN 


7963 


21864 


A 


8023 


3 


132 


FLFFFQF*FMRLGLAMLPMLGMNSWA*V 
ILLPQPPEWLGLQA 


7964 


21865 


A 


8024 


3 


187 


VGQAGLEPPTSGDIiPASASQS AE ITRMS 
HRAQTK*DFIGSFSEEFS 


7965 


21866 


A 


8025 


405 


231 


SD KW I KKMRY I HTTEYY * ALKRKE I LTH 
ATTWLNIjENILLSEISQSPKHRYYVIPIj 
T 


7966 


21867 


A 


8026 


175 


382 


GKRIFPPYPPQEGGKTGPPQTPGLFFFF 
LKKK* PGGLGGARFS PLLGG VGRKNS FT 
LKGGGAINLKGPPSLPPGGKRWAPPQKK 
KKKGQAWWLMPVT PALWEAEAGRS 


7967 


21868 


A 


8027 


83 


374 


GDQ VWWLMHACN PS TLGGQCGRI TWDRE 
FETSLANIFFPISTKKKKKKKKKKKKKK 
KKKKKKKKKKKKKKKKKKASSGGGVS*K 
KNFS PRAGKEI FF 


7968 


21869 


A 


8028 


382 


148 


SPHCSLDLVGLKQSSHLSLPST*DHRCV 
P PRLAI FSFAFLFFWEATTTTTKATMLP 
RLVLNS*AQAILLPQPLKVLRL 


7969 


21870 


A 


8029 


88 


208 


SGS V*ENNPLSWAWWLQS I IPATQEAE V 
GRLLE PRSSRS A 


7970 


21871 


A 


8030 


1 


283 


NKSR I KKAI KMTEKMKEKI EMMKLMNQL 
PNGEHHAMIQMMMKIEMKGRMKIJCE * DK 
ESDKDDRKDEREDRDDETDBPTPERRTS 
RDDSDDDEDRDEGKNEDKRKDDSKDDDE 
ADBDTDQDDYDPMDAHKAADEDDDEDED 
EVAEQMTKKKKKKKKKKKKKK* * KDDRK 
DEREDRDDETDEPTPERRTSRDDSDDDE 
DRDEGKNEDKRKDDSKDDDEADEDTDQD 
DYDPMDAHKAADEDDDEDEDEVftE 


7971 


21872 


A 


8031 


2 


139 


LQTLLPLIAF* +LLASLANLAIiPPTINL 
LGELSAQFAVIGMYILY 


7972 


21873 


A 


8032 


80 


324 


YLHFYFFVR*KIiMGLDTVAHTYNPSTLG 
DPERRIA* A* EFKTSIiGNMVKPCrYKKK 
KKKKKKKKKKKKIOOaaaCKKK^ 




xio/4 




OVJJ 


700 


158 


GNQPGASAHTCNPSTLVGQGGRII *AQE 
FKASLGNMVKHYLYQKHKN 


7974 


21875 


A 


8034 


416 


60 


AQWLTS I IPPL* EAEVGGFFEPRSLS PA 
WATE 


7975 


21876 


A 


8035 


398 


271 


FYFL*DEVSLLPRLVSNSWTPAILPPWP 
PKVPGLRHEPPRPA 


7976 


21877 


A 


8036 


385 


151 


FLYFL*RLGLTLLPQLVSNISSSDPPT* 
ASESAGITGVSNHAHLRICILTRIiPGNS 
SH*NVRRAGLGDTQSSALIPPG 


7977 


21878 


A 


8037 


3 


406 


PASAS * VAGITGTHHHAQLI FVFLVETG 
FLFFFFWAVFLFFKTKNGPPGGQLFFFA 
APPALGGGDFGGQGATFSPKGLGVLGET 
RGGAPAPKKLGAKKEPSHLLGGGAQNLP 
KPRGQKGFGFYFYFLARDFFLG 


7978 


21879 


A 


8038 


428 


20 


" LANQ*WKPPSNPWETKISPKKKKGPPGQ 
GPPPLTPPLWEAPKARKLGSPWLPR*NP 
LFPQKPKKFFGFGGGPPYSPLPQKLSPK 
NGVTPEVGPSHKPKFPPSPPPLGQK*TP 
FPQKKKPKRLKRGMFLHGSTFQSL 


7979 


21880 


A 


8039 


3 


194 


QNKISQVWWYTPVILATWEAEGGESVBP 
GRQRLW*AKTAPLHYSQDEKYLKASVDS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od* 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

nminA arid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sentience 


Amino acid sequence (A^AIanine 
C=Cysteine y D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H»Histidine, I=Isoleucine, 
K=*Lysine, L=Leucine, {^Methionine, 
N=Asparagine, P^Proline, 

T=Thrpnnin*» V=Valine 

1 lUICUUIIICf T TdllllV) 

W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stop codon, /=possib!e 

nucleotide deletion, \=possibIe 














YCLISCH 1 


7980 


ilOO 1 


A 




41 ^ 

«tl«> 


yo 


HE* IT U MT C JL. XjLJJ-% JL M'i JLV UOWBilaVa 1 i> X 

LTGV* KKLI PMLMDNFEAPKTSMEKVTA 
DWB IGELELE VEPAD VTELLQS CAKLE 
GMRSCFLWMSKESGFLRWIYSW 


7981 


2)882 


A 


OVrrl 




*>S4 


PTS PTTO K <? PT /TPP KT? * FJVT5 WTTP P PD & Q W 

FKIN YLRNI YRSL ILKN 


7982 


21883 


A 


8042 


409 


102 


GFFPFGPFFF*PPPPGVFPPPPPPNFWF 
*GFGPPPPPLF*FFFRGGPKKIWFFPFF 

T*WPPT.T?1?T?P'PDT?WDTnf*T*PDRTr2VT? QT.TfTT? 

uwtrirjjr r r r err iniirJ\j\.JL ^Fi\jRAKoiJJ\.jLr 
FFLGPKKIPNPFFFFFFF 


7983 


21884 


A 


8043 


414 


153 


GRGWATNFYFFLI ILK*FFFIFKKFFFF 
HLFFFFFFFFFFFFFFFFFFFFFFYKIF 
LFTLVLTCPQTSCREAEPFDHKVCSVLK 
nil 


7984 


21885 


A 


8044 


401 


292 


NYLPPQ* KKKYYPQGPFRKP IKKSPPPC 
FFFFFFFFPSFPPFIjNIPPPQKGKFPPK 
KYFFNPPIiFPPIiFFLKPPPLFFF*GPKK 
KNLIFHPPSKKICPFKRGPP*FFFFFFF 
p 


7985 


21886 


A 


8045 


370 


14 


TPPPTKTP FFFLYP PPWREKREGFF* KA 

AKLSLNLKGSIYLSFCFPPLPPSLPLSL 
FERWGLTILPKLVLNSWAQMILLPWLPK 
VLGLKV 




JLIOO / 


A 


QCl/tC 
0\r*0 




ZOO 


DT«T?I717*T? , PPCPQTVrkT7P\7AtJT PWTTT ir 

LISVGVLTCSISDL 


7987 


21888 


A 


8047 


379 


98 


GPAHNSPPIjGGESETPLGYPGKPRFF*K 

PRRGFQLTQNGPPPPPPGGKKKPPFQKK 
KKKREPAIP 


7988 


21889 


A 


8048 


369 


IT? 


RNTRGGPTFFFFFFFFK*NLARLSRLEC 
HGTISAHCNFRLLGSSDSLPLTSNDIGQ 
INKSLFA 


7989 


21890 


A 


8049 


1 


128 


ALGI,VAHAYNPSTLGSQRGKIA*GREFG 
TSLGHRAPPPTiYKTT 


7990 


21891 


A 


8050 


166 


163 


PVHCX»EQTTQDKCIiR* STHLGLPKC*DY 
RREPPCOAS KLVPTG I LADS KH<5 JCPfYTM 
DSCFSNQLYKQR 


7991 


21892 


A 


8051 


402 




YFKS VCOAO WIiNT YWPSTIiGG * DGW T TP 
GQEFRIiPWPSGITGASfiHAQPHFF 


7992 


21893 


A 


8052 


404 


268 


QQLWRLRQRDCLNLGCGGCSEPRSCYCT 
PA*ATEPDPVSNKQINK 


7993 


21894 


A 


8053 


i 
i 


154 


GFLRVGOAGLELLTT.RDLPTS A <?H Q Af3 T 
AGVI YRAWPGEAYLFSG * WLRFT 


7994 


21895 


A 




H\J\J 


1 50 


PICKKKNPPPPPRWPr) <?T .K"R APT.PVPT7T . 

EESCFVTQAGLECSGAILAHCNLCLVGS 
SDS PLSL * RWVGNSHF I FTS LT.S VS PRS 
N 


7995 


21896 


A 


8055 


1 
i 


195 


TAVILI I FMI *EAFAS KP KVT.T VEEPc? T 
NLE*LYGCPPPTH 


7996 


21897 


A 


8056 


400 


260 


HMS PHTjAFFFSFLFL* RIiDMLPRLVSNS 
WPQAFLQLLLPQVLGLQV 


7997 


21898 


A 


8057 


396 


263 


FLQPGTVAHACWPSTFRGGLIT*AQEFK 
TSLGNKVRPCLYKKQY 


7998 


21899 


A 


8058 


402 


211 


FSCWWCTKPKAAG*FPAPWGPPFLNNPP 
GAGTNFPGPPRSGGFGSPPPPWPGPRRP 
CPNGIFFC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleutine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutainine, R=Arginine, S=Serine, 
T=Threonine, V=Vatine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=*Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


7999 


21900 


A 


8059 


1 


279 


VLLYHPGWSAWQSWIiMQ*TPSWADIWR 
IARGFKRSSHFSLI*SSWDHRHVTPCIiAN 
FFLIL* IWGLAMLSRLLPLRLRKPVLLD 
*APSPCGLI 


8000 


21901 


A 


8060 


408 


315 


SLTMLPRIVFNS * AQAILLPWPPKVLGL 
QV 


8001 


21902 


A 


8061 


198 


2 


RIQHVESNCHLEVEGGWAQWLTPSTLGG 
RGRMIS * AQBFETRLGNTVRPCFYKNLK 
SQPGMWRI 


8002 


21903 


A 


8062 


3 


294 


GLS PTAHPITAFTSQPLPSGIiS VCRLFL 
WAMCEQCPHLPIRPCCPRI IGPECALYT 
TSCLLHQLKKREGF*KKKKKKKKKKKKK 
KKKI F * WGGGKK 


8003 


21904 


A 


8063 


197 


184 


IHDFLFFT^GRVSLCCPGWSAAAQSRLT 
AL* PGELKETFYL* IRRSWDHKHAPPLP 
APF*LFWVTLIITPSTHEVSSAVCSLIiY 
VKSEQKCQ 


8004 


21905 


A 


8064 


3 


373 


TSGSLAKRPADCIiSAQPTTPSSLCGCS P 
VRVCITVPRCTHARAHTHTHTHTHTHTH 
SNHG*APGALYIQELQRGKVSHVPEKEK 
SDPETA*TKGG*HVTNPHGRQRSGEQEE 
HRICVGRCCSG 


8005 


21906 


A 


8065 


532 


106 


ENKPIHNLSINKYIGKILRDKALLYCPG 
WRAVMQS*LPVASNSWAPAILLCPGFLS 
TWDYKRVPPQLVKIFSKKKKNKPTNILR 
PKCHTRNQDIYKRGKVGFLCSYCPPALL 
LLLSLLIitiTtTiLLFSFSTRDSASGAGCSS 
LP 


8006 


21907 


A 


S066 


411 


130 


PHAQLIFLRR*SLTVFPRWSNSWPQAI 
LPPWPPKVLGPQARAMVPSPKLVLESIIj 
RIPTSVIFFPIiNLCSNIKLAEIKEGCSA 
AFLFNQRFAN 


8007 


21908 


A 


8067 


3 


32 


DAWADAWGSLTDK WVEKI WYIHAMQYYS 
SFEKREIL*YVRTCMNLEDTMVSEIRQS 
QRDDYCMI PHI *GRTRGVH 


8008 


21909 


A 


8068 


410 - 


116 


STFNIQTWPGTVAHTYNPNTLEGQAGRI 
T*GQELETSLGNITRPNFLLFCQACPGQ 
GLSTVRMOGSLHYFVLPLRPLLKCn^DNR 
KSPFYHPVILPCYE 


8009 


21910 


A 


8069 


158 


289 


DLCSSLRNTVRPQLYKK* KISQVRKKKK 


8010 


21911 


A 


8070 


388 


1 


APTPPCFLFKPFVLGGPPFGKAFFQKFP 
GGGFFPSISGPKPPPPRNPSLISKGKRV 
PSPQSPIMVFPGAF*RPPPFPIiMVG*KI 
KIFLPPPPPFKTPPAIiSPFFFFEAEFRS 
CWPGWNAVMRSRLSATS 


8011 


21912 


A 


8071 


15 


336 


KLDKRYDRTGSPGTTHASGYLIEPLVAE 
ASYELILSLAFFFFERRGLALLPRLEGS 
GVIVAHCTLKIiLGSGGPPASAS*VAGTA 
GMNHHEGHEPGLFE KQNSGFI F 


8012 


21913 


A 


8072 


396 


274 


PHLHTWESSL* RQGLAMLSRLVSNSWPQ 
VILSPWPPKWVL 


8013 


21914 


A 


8073 


189 


54 


NKPLGPGMWQACNPTPLGGQGQWIP*A 
PEFKISLGNLAKPCFY 


8014 


21915 


A 


8074 


3 


257 


HAFVLLFETRSCLSLRIiESSGAMIAYCR 
LELLGSGDPLTSASQVAGTKGM*DDAWL 
RQVSNS WPQVI LRLSHPHMFLFQ FSLBT 


8015 


21916 


A 


8075 


1 


119 


TLiPATWEAGAGGLLE P KS LRV * CVFTAT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQJD 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
EXJIutaraic Acid, ^Phenylalanine, 
G=Glycine, HMHistidine, I=Isoleudne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=(jiutamine, R=Arginine, S=Serine, 
1=1 breomne, V=Yalme, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














VNSSRTPAWAA 


8016 


21917 


A 


8076 


232 


266 


LPYCSGQWCLGIiPIIQGVQWHDHGSIiQP 
RLPGLNQS S CLSLPKCHD YGREP * YLA* 
YQAR YHGSLARQNVFNIRPGI MVHARS P 
STIjGG 


8017 


21918 


A 


8077 


390 


143 


LGGFLVFCPPAKKGFFPNPINLGTPRFF 
PPPPF* KPAPEINFGGPKKKKIPSPPPG 
VKFDPLKRPPPFFFFFFFFFFIRLLV 


8018 


21919 


A 


8078 


424 


144 


LKNYIFFTFL*NRDRVSLCCPRLVIiNFW 
AQE I LLPWPPKVLGLQA 


8019 


21920 


A 


8079 


182 


45 


VHKAGMVAHAYNPSTLRGRGGWIT *GQE 
FETS LAS LDNMVKPRLC 


8020 


21921 


A 


8080 


405 


163 


YFLEEMGFHHADQAGLBLLT IGDPPGGM 
SHCTWLNLNVNLIQKHPHRNIQNNV* PH 
ISGPVKIiTHEGNHHNWHMSTQKHSP 


8021 


21922 


A 


8081 


39 


240 


QS KTVS KKKRKI I FCRYRVS * SPGFKQS 
SHLGLPKGWNYRREPPHPASTFSQSFPV 
KDHFCSGVSS 


8022 


21923 


A 


8082 


2 


263 


DCCVSS E *GSVGVGLSE PGAGYNLIiVFR 
LLRPLEKRSIRVGGSRFSRYYLSWLPLA 
RKGNS PTPCTSQVRQCPTLLLGIiHPLCD 
KPQ 


8023 


21924 


A 


8083 


1 


248 


S VGVGPSEPGA* YNLLVCCLLRPLEQHS 
IRGGVSQFSKYRQSWLPLARKGNSPTP* 
ASCGMRGPAIiLV/LTLLGLHPLSNKPQ 


8024 


21925 


A 


8084 


3 


255 


HLSLPSSWDYKRMPPHLANFFFFFLGKK 
GVLP*GQTGLEL *N*KNPPA* PPKGAGV 
KGVTHRPKP*NGGKGRFEKDTPGGPFL 


8025 


21926 


A 


8085 


409 


145 


LRG P YKKNFS FQAPGRKI GS F KKAP P FF 
FFFFF* GQSLCCPVGLSTGVI IAHCSLE 
IiLTS S D PAS SAS RVAGTTGACHCS * VLQ 
TFTF 


8026 


21927 


A 


8086 


199 


26 


GFPPPPNKTEIF*RPPDFIiNFFFLVEMN 
IiTSLSRLVSNPWAQAVLLPWPPKALGPQ 
A 


8027 


21928 


A 


8087 


222 


3 


SVGITGVSHHVLWYFTLKGFLGRGAHSC 
NNS ALGGRGGW IT* AQE FETS LGSVARH 
HLYKKYKNYPVAWRL 


8028 


21929 


A 


8088 


415 


197 


TILCFYHQNNSVFSSNPFLSL* IFIF*D 
RVS FCHPGWSAWQS WLTVFP I LCATVK 
YFTCQQYI IMI FALNS 


SU29 


21930 


A 


8089 


297 


71 


F XFPENKM KPE WAHACNTLRGRGGRI t 
*GQBFEKSI»SNIARPHL YKTKTQNILS K 
x IQ YIiG r UIj XTjQ XDXjAN I Y 


8030 


21931 


A 


8090 


185 


415 


PFGLKQSPHLTLLSS*DYKHTPPCRVNF 
FFFLGGFGFFPKAGFNLGN*RNFLASPS 
KRAGMAGINNIAGPIFRFFKK 


8031 


21932 


A 


8091 


2 


125 


RKQAS KKERKE * KK* RKKERKKERKKER 
KKERKE KERKEEP 


8032 


21933 


A 


8092 


8 


412 


KGPQGTTLTCIFDIiSYSSSLSHWSQSLC 
VAFPIiPECI FPHKPLCPACHFFFFX3SWG 
LALLPKREYRGK^GVNGNLEIiLG*NNPP 

FPYIAQTGF*LRPSRNPSTLEV 


8033 


21934 


A 


8093 


249 


143 


NI FAATWMRLET 1 1 LSEVTQG* KTKHQM 
FSLICGS 


8034 


21935 


A 


8094 


3 


280 


P ERWDYRHE P PGP AS FT * LLLFVFNL * R 
KGLYVGQRGRQLLAS SSLPAPASLSAG I 
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sequence 


SEQ ID 
NO: of 
peptide 
sequence 
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od 


SEQ ID 
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26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, FHPhenylalanine, 
G=dycine f H=HisrJdine, I=Isoleucine, 
K=Lysine, Lr=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W^Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, A=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IRMSHCTPAWVTVRPCY* INK* INGKDT 
IRQNLKKM 


8035 


21936 


A 


8095 


404 


16 


QMGFCLFFLPSSKEVFFPPIFFFYRESF 
FLFHFFLDPRQY * LYFI IKKNI FLPS PF 
IFFLFIRVSP YI FFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFWGSMEL 
LSKAHQCISSHGSG 


8036 


21937 


A 


8096 


358 


200 


KSYFIFSLIIFSHYFFLLFLKHLLATYY 
CFLIDCTLFFS FYCPFCCLFVLLSGRFP 
RSCL*TFH*RLYFSFPIFNVQKLF*LSD 
ICFCCIYLFIYLFIYFETVFCSCCLGWS 
MMA* FNGMFKDRI 


8037 


21938 


A 


8097 


2 


132 


KEEKKERKE*KKRKKERKKEKREKKRKR 
EGGRKEGKEGKNSST 


8038 


21939 


A 


8098 


48 


331 


PGTWHACIjGGEDIIiKWQTKGRGRRAGSH 
RTSWIEPLETVFLELEAGGRLESPGKNY 
SRLGAVAHICIASTLGGRGRQIT*GQEF 
DTSLADFCST 


8039 


21940 


A 


8099 


7 


373 


NGRLRRGFAMLARLVLNF * PRDPPT* AS 
QSAGITGMSHHARPTIESFIHYCWECKM 
E*PLCKLILYFHLFIFKRQKLGVHWCHH 
SLLQPPTPGLQRSFRLTPPSGWDYRRGP 
PCPANI*LII 


8040 


21941 


A 


8100 


116 


432 


YLRNLF YQRQAFS PAILNLMCS AL*BSH 
TVAQAGVQWHNLGSLQPLPPRFKRFSCL 
SLPSSNIFAPLSYFFRNSSFTOISPSEI 
GPQITDPLFIKKKI KVSVMVE 


8041 


21942 


A 


8101 


361 


138 


FNLLGRKKKNFGGLKEGESRRPPNPGPQ 
FFFFFIiFFFFFSG*RWGLTMLPRIiVLNS 
WAQVIIiLPWPPKSLGLEV 


8042 


21943 


A 


8102 


1 


199 


PTRPFVFLMEMEFYHIGQTALELLTSGN 
LPASASQSAGITGVSDR*LA*YLFFFHB 
LFCYFALLLI 


8043 


21944 


A 


8103 


412 


46 


KGGSPPPPPPPKIGRKGPPNPGDPFFFF 
SPFFFRGF*GSEFPPK*KVLLVGFLKPK 
PPFPLGWGNPPPPPVGGFSPPQSLQFFF 
FKKVFGWVFFFPLFFFFFFFSPFFLFFS 
ETESHHIAQA 


8044 


21945 


A 


8104 


192 


2 


EETELSFFIIKYYLPRNQKVCKRL*RRL 
GPVAHACTPSPLGGQREW ITRSGVQDQP 
GQDGETP 


8045 


21946 


A 


8105 


373 


183 


GGPF* FQYUjILFDNFFFFSLSFPFSQD 


8046 


21947 


A 


8106 


373 


281 


SFKTAPPFFFFFFW*RLSLTMLPRMVSN 
SWAQWLLPQPPKVLRL*ATQRKKHFRQ 
EQI 


8047 


21948 


A 


8107 


3 


114 


FLIFVHIGPNYVAQAGloELLSSDDPPAL 
ASQCAGITGMNHHARPYYLL*TLNIPLP 
YDPEIFLLGIYPR*MKTYVHTKACT*FF 
ITPLFVIAKTWKQPK*PSVLGLQA 


8048 


21949 


A 


8108 


416 


243 


PGVFPPPPF*NPPPEI IFWAPKKKKYFP 
PPPPLNFVFFKGPPPFFFFFFFFFFFFF 
L 


8049 


21950 


A 


8109 


2 


211 


SVTQAGVQWHVSLIiQPRLPGLEQSSCLG 
L* * CWDCGHEPWHLALVTTS KHTESCS L 
LCI PTATFLGLVT 


8050 


21951 


A 


8110 


172 


28 


HWCFITSFFGFYFL*RQGLTKS PRLVP 



1043 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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amino acid 

vac 1/1 ii a nf 

peptide 
sequence 


Predict- 
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nucle- 
otide 
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ding to 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbeny)alanine, 
G=GIycine, H=Histidine, Wsoteucine, 
K=Lysine, L=Leudne, M=Methionine, 
N=Asparagine, P=Proline, 
V=v»iutamine, K— Arginine, 5=denne, 
T^Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \~possible 
nucleotide insertion 














W b W V * V 1 Jji-iPW F FK. V LfoJUQA 


8051 


21952 


A 


8111 


2 


225 


QKSAKNFYHLSIYLSIYLSIYPSIYHLS 
IYLS II YLSSTYE * VI ILEQKWNS IKKG 
CIi*CTIQTNCL*QLKE*VIRLIPKL*YK 

YITT D*P<T»TOT7TJT MML"IT rtrtTPravr ■wrrr nisi 

* * RKLVCFA* FWYNS NKLL I T I KRIGH 


8052 


21953 


A 


8112 


392 


221 


PFPKRES PPDKVNPPPPDI FGGGKTKKK 
TLANN* FPPKEKEFFVFFFKKEDFF*NI 

tlPMOP'I'PQtrT* T\ Trot? i-xmt rrrtrvT/fAn n n rr 

VGNFKTPSKI *AVCF * NP VFKKKKGAP K 
KKSSFKNF 


8053 


21954 


A 


8113 


369 


112 


IFFSTIYYFXHPFFFPPPLFFTPPPLFF 
FFLKKKKFFFFFFFXXXFFFFFXXXXFF 

SF 








oil4 


418 


248 


KYYTjPTPG Y* NL I LLNGPP F FFFFFF * D 
RVSPCCPRLP*TPRLTQSSCLSLPRSWD 
HRCMPLRLVTF * FTFKLSHQSLRTLIiTY 
TLAGTTLG 


oKJjD 


Zlyjo 


7V 

Jx 


Ol If 

81 15 


229 


78 


S FKGAAP Ffc"FFF * * RQNLTMS PKL VSNS 
RPQAVLPPWPPKVLGLHRGGWIT 


olDO 


1 1 OCT 


7\ 
J\ 


8116 


356 


66 


PIiFPFFY IKNFKKKFFFFFQYFFFFPPF 
PKHYSHKDIFFFFI IKFFPSSSFSFPPF 
FALSSFFFFFFFFFFFFFFFFHA* FPVF 
FLYKI PMLYPQL 


8057 


21958 


A 


8117 


212 


81 


BRLSHCSLNIiPDSSNPPASAS * VAGTSG 
THQIi vmwhags prf 


8058 


21959 


A 


8118 


367 


2 


GVFFFFFSPAKKGVFFHPFFFLGPRFFP 
PPPFFFPPIjWFYFWGP * KKY* FPHPRGY 
Er rrF*GGPPHFFFFFFFFFFFFFFFFF 
FFFYAEGIGVSPCYIMLGYNFSSFPOST 
ISPRVRPRV 


8059 


21960 


A 


8119 


38 


215 


FFKSAFHHTRLIFNFFFFLEKTCSHYVA 
QAIbvajjiAS* SSPT i S AFE IGG I TGLFLS 
FKN 


8060 


21961 


A 


8120 


266 


2 


PQIFLFIFETRSHS ITQVGVQWCDHSSL 
QPQNPGLKQSSLSS *DYRHTLTAPG* FC 
r Jj w KtA^liATac FJYUj V iiSo W PQAVLKIHPC 
CGMC 


8061 


21962 


A 


8121 


3 


332 


DAWADAWGHVPPCPANFVFLV* TGFLHI 
GQAGLELPTTjSDPPASASQSVFIYFLET 

NFTSAGLQRFIFFMGPPKLISRAGV 


8062 


21963 


A 


8122 


378 


66 


LFIPLPIREEVFPCIPG*NWAPPGVLYN 
GRPFFFFLLETVFLHAGQAGLELLGSSD 

T.PVQIiQOQVirSTTYSMCirDl/OD'trr DT"C\7M"T 
Lie VOnoVo Vul XVsr'XotlKvyrJxJjiCJ.r 

S FPFEI FCE IMLLVPLAQP 


8063 


21964 


A 


8123 


2 


316 


ISMLLATiTiTiRIITFRLPQUiNGYVEEFTP 
YECX3FNPISPGRAPLFIEHFLVDITFLL 

T7T^T7V4T7ST T T TIT TJ + 7V T /^TVVTT TIT TT7MT»C»T 

r JJ V y JJUj JULi F LaF * ALiy £ AaNJU PI* I VMPSIj 
LLIIILALRLDYERLQNGSD 


8064 


21965 


A 


8124 


331 


14 


ITSILLII ITF*LPQLNGYI * KSTPYEC 

/-i PHOTO TtrpOT T7"7V T'['0> Y T>T"\T 

GFUPISPARVPr fa IKFFLVAITFLIjFDjj 
EIAIiLLPLP*ALQTTNLPLIGMSSIaLLI 
I ILALSIiAYE*LQKGLD*AE 


8065 


21966 


A 


8125 


2 


228 


I I ITF*LPQLNGYIEKSTPYECGFDP IS 
PARVPFS I KFFLVAITFLLFDLE IALLL 
PLP*ALQTTNLPIiIVMSSL 


8066 


21967 


A 


8126 


451 


200 


KA* WNRFAI LNVTKYTCESQEEVKI PTF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Ala nine 
OCysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=Pbenyfa!anine, 
0=Glycine, H^Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methroniae, 
N=Asparagine, P=Proline, 
Q=Glu tamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=OYyptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














AGVWKKLI PAHTDDFVHFKTSVQEVATD 
VAKIASLLELEVESEDVTE * LQSQDKT 


8067 


21968 


A 


8127 


2 


396 


LIVIINTLLAT.T.T»IIITF*LPQLSGYIE 
KSTPYEOGFDPISPARVPFSIKFFLVAI 
TFIiLSDLEIALIiLPLP*ALQTTNLPLIG 
MSSLLLIIIIiALSLAYE*LQKGLD*TES 
GWEKLVKKE KADKLKGGRP 


806S 


21969 


A 


8128 


1 


350 


GTRGLQTLLPLIAF* *LLASLANLALPP 
TINLLGBLS VLVTTFS *SNITLLLTGLN 
ILVTALYSLYIFTTTQ*GSLTHHINNIK 
PSFTRENTLMFIHLSPILLLSIiNPDIIT 
GFSS 


8069 


21970 


A 


8129 


375 


1 


IjTCFS EHRP PPRANFFFSRKTRAP fflp 
GGFQ I PGPNFFAP P I FPNGG P I PGLDPQ 
PGPLNFF* KSFFPDPKFFFLFFPPPI *N 
SGVFLPRNPVFPKTPNFLFFFS PFFPGD 
GVLLCHPGRLPRA 


8070 


21971 


A 


8130 


32 


211 


GIRHGFAMLPRLVLNSWPQAILLPWPPK 
LLG*QAGVELLAS SNPPALASQTAGMTG 
VSTRPPVIiLPTLGTHLCLDACLGSSL 


8071 


21972 


A 


8131 


396 


5 


KYRDGPPPPGKFFFFFF*DGFSLCCPGW 
SQTSCPK*SSCLPEG*DYRHEPPHMTLS 
SSY*CSQMQKCINKNCU5SCWAHACNL 
SPIiGGRRITWGKKFBTSLDNIVKNRLKK 
KKKKI YQGGGDHP C I FTN 


8072 


21973 


A 


8132 


347 


34 


KAGPVI PRGPQEFFFFFFPQDGILLCFP 
GWRAVAKS * VTATSNSWAQAITGMLPPC 
PANF*KIS*RQSISMFPRLVPNSWPQVI 
LPPQPPK*LGLHVCLVPSLA 


8073 


21974 


A 


8133 


102 


270 


DYRRTPPHLANFFYFL* RQILTVLPMLV 
SNSWVQAI LLLQDFFSVQLKMGFLSVP 


8074 


21975 


A 


8134 


338 


47 


FLSTQKESS PLHTHTHTHTHTHSHLLTH 
HHGRGQERGKRGR * GG * WRTGTHS PHG P 
HCVRLRKSRPSGELGAAQGGCSLIQPKI 
EDPGQSGGPREVP 


8075 


21976 


A 


8135 


68 


467 


S LLNRI P FdiCLNS FDCVFLLGLLFEAG 
FCS VAQAGVWWYEHGSLQP * PRGLK*SF 
HLSLPKFWDYRNEPPHPPFSCFKALNFK 
DYCDSGTVPCPFMKELSMLFIHALTESF 
VLFIVMSKLITTKHLTI ISNV 


8076 


21977 


A 


8136 


500 


355 


FHHVGQAGIQUiTSGDPPA*ASQSAKIT 
GVSHRARPQTSFLNKLPSLR 


8077 


21978 


A 


8137 


58 


222 


IYGYEGRGSVAGSLSSL*SATTDSDLDY 
DYLQNWGPRFKKLADLYGSKDTFDDDS 


8078 


21979 


A 


8138 


345 


196 


KGDQAQWLRPGMVAHI CNLSTLEGQGGR 
IT*AQKFETSLSNTLRPPISFK 


8079 


21980 


A 


8139 


500 


355 


FHHVGQAGIQLLTSGDPPA*ASQSAKIT 
GVSHRARPQTSFLNKLPSLR 


8080 


21981 


.A 


8140 


3 


231 


HAS ADAWGRNEWPCAYAHAHN PTTSGG* 
GGWIA*AQEFEFBTSLCKIVKPHLSKKK 
KKKKNFSPGWGSPPCTPLF 


8081 


21982 


A 


8141 


416 


26 


ARI FPNNSFWPGQGGFPFPPRPLEGPGG 
VIP*CQGVQARPAPPGEPRLFPKPQKIV 
GGGGEPPLFPPPERAGPQKCRPFGEGGF 
N* PKPRPCPPGWGAQGGFVSKKKKKRNE 
DSKI PLIGKYPVG IRNMA 


8082 


21983 


A 


8142 


413 


35 


RGAGGGGSPPGPRYPSQQHLLRARTPTP 
FARSRGS FPGVRG SGWRGGLGG * S RERP 
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SEQID 
NO: of 
nucleotide 
sequence 


smn> 

NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H^Histidine, I^Iso leucine, 
K=Lysine, IHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=01 u tn mine R=Arpinine S=Sprin*» 
T=Threonine, V=Valine, 
W=Tryptophan, V=Tyrosine, 
X=Un known *=*Stoo codon^/=nfi^sihl#» 
nucleotide deletion, \~possible 
nucleotide insertion 














LPSLP PRALPRSGASSGPLDGGAQGPGR 
RERWAFLGG I RADARGCRAVTS PE P YRL 
PGAGAARARR 


8083 


21984 


A 


8143 


166 


368 


ENILWAAMVVLDCHPSTI*GG*GGRIT*G 
QEFKTNLANIERPHLFFFFFFKRDSLFA 
PRAEGGGPILF 


8084 


21985 


A 


8144 


1700 


621 


DLRDHS VLQQGPACLTRCGEPI LSHSEG 
SKIGSERESGEHSRHPEAPPPSVEVGSR 
RS*GQDS*GFGPGNGQDTRGPLSHAAPN 
WTPDPAAAVAAGGTSLPPRGGGGI*RGR 
SHTARSGARVGSANAHARGGSGFSRHSG 
SHGCLRNQSPRLGKAGALCGSAWVSGL* 
GSGLTS PRGRRBRLRMPG VAKGRPR *GN 

LQPRTRAPEELFVPQESSGSTPEKMLVS 
FHGSSLRNEATPRYSQQEEAGNGRWQQS 
LSLERWPPWTSHPLGTPPLMPVAVARCC 
ILPGLWPLLSPPSGSASPVSQGPGCLSL 
WPNAFKKDD F P VROGDTS WPJ3 VS 


8085 


21986 


A 


8145 


390 


38 


NTTTFNIYFFYFFFFSLFIYFFFFFFFF 
FFF FF FFFF I FI LFI YSLLVFGVWVTiRl. 
GLALALLAKLFLVNSLCRRYRG * SLLYY 
AWL* FFIFPLRYYIYCARFQFLSPILYL 
GKWFG 


8086 


21987 


A 


8146 


20 


187 


KLTTSTGRNP<3RPTRPNF*VF*RDGDLI 
MLPRiiVSNSWPLVi^LPWPPirrr^riOM 


8087 


21988 


A 


8147 


417 


64 


VRPRVRPRVRPRIRK3CKKNKKKKKKKKK 
KKKKKWGGG F * DFWGGE KGP P PT/VKNnF 
LLFWGFFLKSFFFFCGGLFFWGGPFFFP 
PPPNFLGVGGKKKTFRLLCGPFSAFFGG 
EKN 


8088 


21989 


A 


8148 


351 


187 


RLGTFYLSNILRFYLSITHGPIPYIGRL 
TSONLPLLLl^LT*i^EKLLPirKSr>TYN' 


8089 


21990 


A 


8149 


465 


60 


PPHFWGNPKKKNSQFFWAKKKKKIFLGP 
* KMVAKKG WFWGRKKGFQRGGKKG * KG 
GQLFYPVLSQVLGGGQNHGPPGKAGFGE 
NFFFFGQGLGWAISRWLRTPIiFLAGGGE 
DPKKNFKPPPKI KTLGAGVKKK 


8090 


21991 


A 


8150 


1 


175 


WDYRHVP P YLANVFFVTFRGDKVSLCCP 
V*GPTTEIiKRSS CLGFP *GWD YRREPPH 
LA 


8091 


21992 


A 


8151 


59 


35 


RWNSRPRRPGLKFLTSGDLPASASGSAV 
ITDVSHHAWPETFFR*RA 


8092 


21993 


A 


8152 


269 


159 


ARVEVSKKYIYIHTHTHTHTHTHIHTHT 
HTYTTKKLPS * LFQVCKME KFGRLWQ 


8093 


21994 


A 


8153 


1 


256 


PGPPPRFFFFFFFFFFFFFFFFF*TLKI 
R FIWRKQKTHNPKDG I LHQD ISQVISLQ 
ITNHFHRRYFCTNFTCIQERDINQSLFL 
F j 


8094 


21995 


A 


8154 


252 


41 


LIIPVLICIVFIVLQFKIFSVISLNKGL 
FRNHFFIF* *RWGLTMLPRLVSNSKAQV 
ILPPWPPKVLGLQV 


8095 


21996 


A 


8155 


21 


443 


HELSLI * FISTLAETNRTPPDLVDGESE 
LVSGFSIEYAAGPFALFFIGEYTWIIII 
NTLTTTI FLGTTYDALSPELYTTYFVTK 
TLLLTCLVL* IRTAYPRFRYDQLIHLL * 
KNFLPLTLALLI * YVS IPITISS I PPQT 


8096 


21997 


A 


8156 


2 


205 


LLHLMKHFNLEIQKAQ*TRIiNLKRSISR 
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SEQ ID 
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sequence 


M 
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od 
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beginning 

nucleotide 
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ing to first 

Amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

Mnguclf vC 


Amino acid sequence (A=Alanine 
C=Cystetne, D=Aspartic Acid, 
EXJlutamic Acid, F=Phenyla!anine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 

O — fZliitnmino rcrininp_ ^=Serinf» 

\^ \jl U Lit IJlIIlCf XV — /*! KllllUC, kj kJWl IUC) 

T=Threonine, V=Valine, 

X=Unknown, *=Stop codon, A=possible 
nucleotide deletion, \~possib1e 

UUUCtlllUt! III9CI i-IUII 














HVI INLSKAKHKG* ILKAVREKMLIMYK 
KPLVKLQVISN 


8097 


21998 


A 


8157 


2 




T.TJTT.MKTTE'Mr.R TOKAn*TRTNI±KRS ISR 

HVI INLS ECAKHKG* ILKAVREKMLIMYK 
KPLVKLQVTSN 


8098 


21999 


A 


8158 


2 


222 


VYTWGTGKARVFPGEF* LPVWI PTRHLK 
PYNEPI5DANESASAETKNPOSSIIHPO 
DEQNGDIRRTDKVTIHQ 


8099 


22000 


A 


8159 


1 


297 


GFHHVGQSGLBLLTSGDPLAS TS *SAG I 
TGMSHHAWPLNCSYGDSFSWLKCPELTV 

INFYIKDSMTKRKYK 


8100 


22001 


A 


8160 


138 


437 


NEYDHFS I IKBRRDFI VCVFFFFFS KKS 
GSWDNGGATTRLG* FFVFFRKKGVS PCG 


8101 


22002 


A 


8161 


330 


148 


QQSKTPFFFFFFKKKKKYIYICMWHAY 
NPS YSGG *GGRI T * AOKFKTS LNNTVRP 
HLYKK 1 


8102 


22003 


A 


8162 


391 


255 


RPAPADF* FFFFYRLGLAVLPRLVWNSW 
PQATLLPQPLRVLGLQI 


8103 


22004 


A 


8163 


131 


3 


HKVSLCYPG*SALMQS *LTVASNSWAQA 

TT.nP*PPW*T.riT.nA 


8104 


22005 


A 


8164 


97 


440 


NKGSLCPPGGRPGALSHPTARSRGSPGP 
TPQKTGNNGPGPPTPVNLBFFEKRGFTP 

Vf20f!f5T .ITT ,PT * P Q P P PQ APOIt AGTTOGS 

RP^GPIQKKFLKTLIXjBTKWMAKLIKTY 
LK 


8105 


22006 


A 


8165 


541 


Zyj 


SHSVIRLLEGSGAIMNHCSLKLPSSSDP 
PASVS *SAGIIGVSQHLANYTFKKIF 


8106 


22007 


A 


8166 


440 


259 


PPQKFFFFFFFVKTGFRHW*AGFRLLD 
SSDPPPLAFQSVEITGMSPSARPMFGVF 

WQT.P 


8107 


220Q8 


A 


8167 


23 


405 


FRICHLLRVHLLVYFIPSFTDNHLLSTS 
AMFLYSNDWIAQNVSGPFLGI IFRTFF 
FFFLKGE FFFAPQVGGQGGNLG* LNPP P 
PGLKGFSGLTPPGGGNSGGG P PNKPNFW 
I PKKGGGSTLWPRI I 


8108 


22009 


A 


8168 


23 


388 


PYFSCALPRHHPPTTNLGPHLGMGAPWG 
OPSVAHHTLLFFFFFLKRDFCFVPRVEK 
LWLEESSLNPPLLGLKSFSCLTLWRTGN 
NGGPPPPPVI FCF * KKGGVYPWGPGGGE 
TPALKETPGL 


o i nn 

8109 


22010 


A 


oioy 






NIHFSLSFSHTHRHMHTHTHTHAHPTYI 
RTP*DNVPQ 


8110 


22011 


A 


8170 


2 


288 


VNLTFFPQHFEiGLSGMPRRYSDYPDAYT 
T*NILSSVGSFISLTAVILIIFMI*EAF 
ASKRKVLIVEEPS INLE* LY IKKKKKKK 
KKKKKKKKKKN 


8111 


22012 


A 


8171 


381 


101 


QKSLQQPSN IATILEEVQVI FLGS PYHL 
QRWEMRLTKWLRGLGAVAHACNPS TLGG 
MGGWIA*AQELETSGDPPPSSNRQLFTG 
EHNNDNKLI 


8112 


22013 


A 


8172 


136 


360 


TKKHFGLLK* *TKKHFGLLKKKKKKKKK 
KXKKKKKKKKKGGGAFKKKKFFPRGGGK 
I F FFFGAP KNKFGGGVLKTGGGKKPGVT 
KK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

Ing to first 

dUIIUU UCIU 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

cling iu 

last amino 
acid 

residue of 

pepnue 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H^Histidine, I=Isoleucine, 
K^Lysine, L=Le urine, M=Methionine, 
N=Asparagine, P=Profine, 
i^uiDiaiuine, jk— /irginine, o— oenne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possib!e 
nucieonue ueieiion, \— possioie 
nucleotide insertion 


81 13 


99014 


A 


OI / J 




•>o 


DSPAS VCRMPAIAGPRRHA* LVLVETGF 
RCVGRAGLQPLTASDPPTS AS PGAAIAD 


8114 


22015 


A 


8174 


254 


370 


GLTVLPGLVSNSWAQVILLSWPPE*LRP 
QAYATGAGFLL 


8115 


22016 


A 


8175 


48 


269 


FYVILCI*RDRVLLCCLGWS *TLGLKLSS 
X Jjo Iujo o W UL/JH V F FHoAb CJj 1 r * VKAC 

FFLLKYLKRLKIYAKIM 


8116 


22017 


A 


8176 


1 


338 


PTRPINTLLALLLriITF*LPQIiNGYIE 
KSTP YECGFDP I S PARVPFS I KFFLVAI 
Tr xjlir ULia XRIiIiLiPIj P * AiiQTTNIiP L LA 
MSSLLLI IILALGLAYE*LQKGID*AEF 


Oil/ 






ft 1T7 
oJ // 




130 


PPPGRLGGENPPGGSKEQKRRPAPQGG* 

PRf*PPQPirPnOJ2iriQT?I?QTT?MT.O'D&T.T r»ir 

O YC T VLHAIiPNKT LF 
\£iv~x v xixxrvLJir xvj. ux 


8118 


22019 


A 


8178 


396 


244 


KIHLLYICMCIYIYVISRHYILR*HTHA 


8119 


22020 


A 


8179 


2 


119 


TRPGNWPG WAHFYD PSTLGA* GE LIT* 
GQEFETSSSLY 


8120 


22021 


A 


8180 


2 


80 


FLPLTLALLI * HVS IPITISSI PPQT 


8121 


22022 


A 


8181 


438 


333 


SPWRLRHYNCLNWGGKGCN* PKSCPCP 


8122 


22023 


A 


8182 


393 


19 


IFFFFFPPPSPAKFFFIFKKKRFPPLWG 
GVF*SPPPGFQKTPPPQKVGFSRVFPPP 
PPKKr FFGGGKYF FFKEPPPPPPPIL* K 
GGAKISPPPPPGKKKNPFFIiKGGKKKKT 
PFIFFFPPKKIKN 


8123 


22024 


A 


8183 


24 


269 


GSQQ VLG FCFRDR VXiLCHPG WSAVM * S * 
LTVASTSWA*RILHCSLLGSWEYRHTPP 
CL*NRNPQLTVKWRVKDHNNKKSHR 






7\ 

A 


01 OA 

olo4 


154 


361 


WHNliDFIjQPQTOGLKRSSHLGLPS I WHY 
RHVP PRLANFLNLS FRQGLAMLPTPSVC 
x1L4j V if 1x5 *nl YWTFGYIiS*SVCSHIiLNF 
L**WHI^PLQPQTHGI,KRSSHLGLPSI 
WHYTUIVPPRLANFIjNI^FRQGIiAMLPTP 


8125 


22026 


A 


8185 


228 


337 


RPGMVAHACIPGILGGRGGWIT*GEKFB 
TSLANMVK 


8126 


22027 


A 


8186 


314 


127 


DIQPKNEGEGEEEQELWLGMTAHAYTPS 
TLGD*GGRIA*VQEFETCLGSLARPPSL 


8127 


22028 


A 


8187 


10 


420 


TQEAEVAVNWDHATALQPGRQSETPSPE 
KKKKKRGPPRWGENFFPTPGGF*KDGGN 
KKTGFPPFKRDGPPPPKNWTPWVFQKRG 
APPE PPPLGGALGREPLGGFFGKGGVET 

P PpBT PITlTMlvrDDRlf VPTJDr , 'CKri?J« , A 1? 
r'lri'rxrJUVXVWJri^ f Ap 


8128 


22029 


A 


8188 


398 


1 

• 


LKKKRPQTPFSPKIQF*KFFPPFPFKKS 
NPSPL*MGSPPPPLFNPKN*KKPPLLKK 
IiPRHTPSCLVNFCI FCRDGI 


8129 


22030 


A 


8189 


30 


153 


AWYVRREDHVSLGG*GCGKPWLCHGTPA 
WVTEENPKGKIK 


8130 


22031 


A 


8190 


206 


45 


HPSQNSLSKN*ERVSVCCPGWSAVAGSQ 
LTAASTSQAQVILLPQPPE*LGLQA 


8131 


22032 


A 


8191 


3 


228 


TOTOjCVCVYICTVCVGCVCVYIDEYIQ 
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SEQID 
NO: of 
nucleotide 
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SEQID 
NO: of 
peptide 
sequence 


M 
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od 
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09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

pepuae 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

UlUg W 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=CIutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleurine, 
K=Lysine, b=Leucine, MNMethionine, 
N=Asparagine, P=Proline, . 
0=^iInt?*iTiinp R=A rpinine. S=Serine. 
T=Threonine, V=Valine, 
W=Trvnto n h a n Y— Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














I YRYIHTYIDIYTHI * IYIHTYIDTHMW 
VCVYIYIYIYRCMGVRET 


8132 


22033 


A 


8192 


41 


416 


ICRVKEIASRPLLETHSLSLVELS+FCL 
TAQ*RNDFLALKHLTAVLYNLILHSSGF 
WSSIiKSHFIFLAL*YYIj* *FI*SN*RRD 
IiAVLPRLVSNSWPQVILPPQPPKVLEIR 
YEPPSLALILFFN 


8133 


22034 


A 


8193 


2 


183 


KRLSIWQKKVEGEGFQGMDVGPVSVAHA 
CVSS S LEGQGGKIA* GQEFET I L SN IVR 
PHLY 


8134 


22035 


A 


8194 


2 


465 


an\/nwrmT/^cjT,n < ?PPPRF , K* F^^Ti^tiPQ 
AVjVynni/iJvdJjywir rruc coou^jj-jro 

S WDYRS AP PCPADF C I FRRDWAGWGVGC 

GFHRVARAGLELIiTSGDPPASAS PGAGI 

AGMSHRTQRGQDNFASWRRGGPQMSPSP 

PSOjFLQNTFSLKPSWVAPYPHASALGG 

GEYGGPPSPQPHIiE 


8135 


22036 


A 


8195 


397 


149 


LlttTDLPIRFLSUiALVFTVFNTKTKV 
KDLI KIjGVVAHTCNIiNTLGG *GGRIA* A 
HEFETSLGWIVRAVXPVCCFSVILSR 


8136 


22037 


A 


8196 


355 


95 


KKIMPTLCLKIPS* YRCEEI IQLSIjQSS 
CDHRHMPPHPTNPF*RQHLSK!jRSLVLN 
SRPOAVLPHGPPKVLGLOAPATAPGILS 
LF 


8137 


22038 


A 


8197 


382 


112 


KTDFGKKGSPSSPPPFFFFKKSFAPSPR 
GEKKAPLKSQRGLCPPKPKDPPPPGSG* 
TQDKGPPPPGQP I FFFFLKKGVCLFSRG 
GSKTFI 


8138 


22039 


A 


8198 


32 


361 


ASRIGFILQBFLPVRVWTTNPSAGADGY 
NFLEKQKGTEKTSCSVAQAGAQWCHRNL 
L* p * TSELKQSS CLSLLSS WD YRMYFVL 
IiGKQVTLLIGKRALSVYPRDWSLC 


8139 


ZZ04U 


TV 
t\ 




1 




INTLLAIiLLIIITF^IaPQLSGYIEKSTP 
YECGFDPISPARVPFSIKFFLAAITFLL 
FDLKNALLLPLP * ALQTTNLPLIGMPSL 
LLII ILALSIiAYE*IiQKGIjY 


8140 


22041 


A 


8200 


3 


109 


FTSKHHFGFEAAA* YWHFVDW*LFLYV 
SIY**GS 


8141 


22042 


A 


8201 


3 


216 


DAWAAVGHHHTRLIF* I FVEIGSLCVSQ 
AGLEIIiDSSDPPSSASQSAGIRGMSHGA 
GQEHLFHRP IGKI K 


8142 


22043 


A 


8202 


1 


267 


RIHTGEKPYKCBECGTGFKGPSTLTTHK 
FFVYCREVAVLLKNCYSHLYPH* IIVNG 
SEADKREMCIiLCIFFHPLAEEQENVENK 
RKYNP 


8143 


22044 


A 


8203 


422 


92 


PLVFALPPKIGFAPTTPSSSPPPPPLFF 
FGPPVFPPPPFFNPPPFFFFSPPPKKKF 
PSPPPRGKIFPF*TPPPPFFFFFFFFFF 
FFFFFFFLFLYYIiFILC 


8144 


22045 


A 


8204 


445 


260 


VCSPPLFIWVAPGFSPPPLFKNPPPHFF 
FGAPKKKWFFPPPPA*NFFFLKGPPLFF 
FFFFF 


8145 


22046 


A 


8205 


103 


248 


GPGWAHACNPSSLGG*GQWIA*AHEFK 
TSLSNAVRPISAKKKKGGPF 


8146 


22047 


A 


8206 


407 


197 


QL I FKFL * EQASHYVAQAGLEL LGSGNP 
PASTSQSVKVTQ^SHGAWLLAPTLRSTS 
LNYNSLSVLLLPRP 


8147 


22048 


A 


8207 


1 


142 


SCS VAQARV+ FRDLGSLQPPPPRFTPFS 
CTPAWETVRLHLRKKKKKN ] 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

iog to first 

stninn arid 

•allllllV auu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutarnic Acid, ^Phenylalanine, 
G=Glycine, H^Histidine, l=Isoleucine, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 

\^"V>l U la III 1 IIC) W%r~J±i glulIlCy O — ocrinc, 

T=OThreonine, V=Valine, 
W=OVyptophan, Y=Tyrosine, 
X=»Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucicuoue inseruon 


8148 


22049 


A 


8208 


2 


370 


PEVFDYPN* FKNL*LHKTCTQ*L*AALF 
VITKTWNQGCPLTGEWIWKLEYIHTMDY 

SRKATYYMI S I IRHPRKGKTI EIVQNSS 
CQGLQERQKGL 


8149 


22050 


A 


8209 


363 


108 


KKPRVEACIiSPGVGRERAKRER*KRERE 
RERERERERENEL*KLSLADLCIKNG*R 
EFFKEKRYVKRYAKILRVSGRKQKYGNV 


O 1 J\J 




a. 






zM 


iilN IAjWjVUs IS FKJjV_FI_ 1 ir AW V IK* Do Vo 

BKGKKKRKGPENSLTATNF*A£*SCNLDY 
LRCSMLKMQRQRKRKERTEVRFWQLHHN 
LTLGFV*LIVYQRE*RRTGWFVAKYMLY 
TFS * PGSFPRLRKCVEHVFGYEPACSPL 

c ir -Li -L nPi \J url fv>\ A.v- 


8151 


22052 


A 


8211 


3 


131 


AHLFIAWFTEYFKPTVITYCSKKKIPFK 
MLLLIDSAPSYPRALMBMYKGMNWFMH 
nwT.T , prrxjDMnn*UTn w rQ* q wt dm* cr* 

UHXjX r iJjWrnJjy w V J.c lo'ol X-UrCN r V- 

TIAAIDSDSSDGSGQSNLKIF*RGFTTL 
DAI KNI CDS * Eli 


8152 


22053 


A 


8212 


86 


286 


WSI WECTVISVQLFHKPKTS FKNTSYWS 
uvn/ii x<njnxjo iiAjDuuryi x oyer £ lo 
LANMVKSCLY 


8153 


22054 


A 


8213 


393 


344 


SFPFSLPSKWGFKNLVPSPGYFLFFLNK 
GFRYFGQGGFKILNSNYPPPIASQKGGI 

QflT CD HDD JiTAn?!?* liJD"MT^ , 'DM *T>TT"\re TS 
oulornfKnljVrr nKTilAjIrn KX Ivor 

NYGLK 


8154 


22055 


A 


8214 


3 


313 


QGLVLWHSHGLLQP* PPGLRRSSCFSLP 

GSHFVAQAGUSIiLGSSDPPLS ISQKSWD 
YKV* AI I PRAKQNYLMLFY 


O I J J 




A 
x\ 


OZID 






RDr , D7.Tf , 'DCnT T.TTTQD'KTSDr'TMlVU'ar'MT C 
KJrv_.fXir caULiij I v oKJCivjt'vj I WHnALJNlio 

TLGGQGGGIA*GQSFETTLGNIVRPHLY 
FYLKK 


OIJO 






oZlO 




01 ft 


S VSAGITGVSQKCRF* YNVFI FQKSSLL 


8157 


22058 


A 


8217 


32 


276 


RYLPI IAALFTIAKI *KQPKSSIixjDKWI 
KKLWC I YTMEYYSALKVILTVVTTWMNL 
VGFIPGDINHIQKAKYCMISLICGI 


8158 


22059 


A 


8218 


171 


2 


KGVFGLGRLFFFLDRVWLCLPEWSTWQ 
SRLSATSAS*IQAILPPQPPEELGPLCP 




zzuou 








15 


X £\JT JL V vjOW O X\0 X HN^XvII IrO 1UX JCOolJ 

MRLTIYHFFFLLVYYNPRVMVTPAYM*L 
LILFYSSSIRDIiLIAKQPCLPNEGFSPH 
AC* TNATLLFP YS I * YIjGLPSFHLIFFV 
CFSFYFL*R*GLTMPPMLVLSSWPQVIL 
PPQPPKVLGLQA 


8160 


22061 


A 


8220 


350 


135 


FFFFSQTGVAQNRVQ*RDHNSLQP*PPG 
I^QSSCLTlaPKCWDYRCKPLCIiASWIYF 
KDKVYTSTPVHTQEY 


8161 


22062 


A 


8221 


185 


331 


PGMVAHAYNPNTLiGG * GGKLT * AQE FET 
SLGNINSVPKKRRKRRKEMM 


8162 


22063 


A 


8222 


73 


232 


YCSFKFITLTCLSWLGTVAHTCNPSTVE 
GGGGRIT*GWEFTTSLGWIVRSCLY 


8163 


22064 


A 


8223 


418 


211 


AMPSLLTYLLTFIFCRKKKIFFYWGEFT 
MLS*LVLNSWTQVILPPWPAKVLVLPRL 
ALFYSLIiVLFPSF 


8164 


22065 


A 


8224 


227 


33 


ESHSTTQAGVQWHDHGSLQPQPPGMSLY 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

iog to first 

a III III I* aviu 

residue of 
oen tide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

seauence 


Amino acid sequence (A~Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenyla!anine, 
G=Glycine, H=Histidine, I=Isoleudne, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Giutamine, R = Arginine, S = Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














CH* CLRFWSWRPAEKPGWCSSLRFMELE 
SWSGASDAF 


o 10 J 






oZZO 




987 


TDS PTSAFRSAGITS ISHHTLPKSTF* I 
FCRAGGIAMLPRIiVLNSWPQVILPSQPP 
KALGSQA 


o lOO 


77AA7 


A 


S77A 
6ZZ0 




915 


NNKWCKSVVAHTCNPTTLGG* GEWIT* GQ 
EFBTSPANMAKIPPIiVMFAFCFEETDNK 
SHTK 


si fn 
olo/ 


79A<ft 


A 


Q007 


975 




GVLMFNFKFCI *QNSPFMLPGWLNSWI 
PAILPPQRPKVLGLQAWTILYKWYYGKE 
MWHKE I CEGTHDVLSKLLTTRRFKRHL 


8168 


22069 


A 


8228 


386 


211 


QLKLPPLKGNKLRPP**KFFFFFFFQRQ 
GLTILPRLVSSSWHQM I FPPQPPEVLGL 
QA 


olOV 




A 




t 
l 


1 17 


VFILFYF* DGLAMIiPRLATNS WAQAILL 
PSAPKVQGLQA 


8170 


22071 


A 


8230 


356 


26 


WHVHGSQQP*PPGLKQSFCFSLPSSWDE 
RCAPPHPINYFQICRDRVYKRWANMVIS 
PNPLVLISQSAGITSKIPCPGSEISTIF 
I FSHQLLLVRGVYP DMLVLQLGKTCI 


£171 




A 




j 


loo 


FPSLRPA* IFFYI Y FFFLKNKVFLCHPG 
WTQLLDPNWTQAAGLKQSSHLSLLSSLN 
FRGSA 


8172 


22073 


A 


8232 


2 


253 


KLDYINCGTOLAQKCLFCLFGSLFFQR* 
GLTLLLRLFMNSWPQAILLP * PLTVLGL 
QATMLGSQVPVFDLGITLCRCRFGKLE 


8173 


22074 


A 


8233 


373 


65 


PEKAGSCLELAFGLQMQHPPWASTLPAA 
LTDFGLATVPNHVSQFLKSHTH'i'HTHTH 
THTHTCTRTLLVLFIj*EALTTTPIi3jRSI 
CKHRESACSLS LTWPLCC 


8174 


22075 


A 


8234 


390 


46 


RVFKPG*NPPLVKAPAPPLFFGGV*GRS 
PKKKNPQGPGGGGPPPGFHPFFLTPFFP 
IFLRREPPPFFFRGGLGPPPPPFKTPFF 
FFFWVKGGKFLPQKKKKKFFQSISTRSV 
IHK 


8175 


22076 


A 


8235 


387 


216 


QKLGVGIiVPPGRVIFFFFFFFCC*GGGL 
TMLPRLVSNSWPLAILLPQPPKVLGLQV 


Ol /o 


22077 


A 






201 


RFFFTIPPQCFPQFILSF*RRGLAVLPR 
LVLNPWPQAILPPQPPELLGLQAHPTTL 
GPVFYSYCVRA 


8177 


22078 


A 


8237 


399 


229 


SVGGGGIjTMLPSWSQTPGLK*SSHLSIjI 
KCWDYRPEPPHLAHFSLLNLQLYCHTKL 
S 


8178 


22079 


A 


8238 


192 


335 


shtlphssnfsyfw*rqgfamlprlvlw 
swaravclpqspkvlglha 






A 




442 


254 


GRDRVLLFCPGWSQTLGLKRSSCLGLPK 
HWDYRHEP PHPAENI KFCVWPTLYFYWP 
AIjVQINLP*QLQGTECVACMC 


8180 


22081 


A 


8240 


381 


180 


VLKLFFCSHRGPTMLPRLVSNSGLK*SS 
HLGLPNCWD*RCEPWHPASLCLLTSTFW 
PPTLICNIHK 


8181 


22082 


A 


8241 


388 


260 


NAIPFYITKCCNPSTLGG*GG* IT*GQE 
FENCLANMVKPCLY 


8182 


22083 


A 


8242 


1 


145 


GFTILVRLVLNS*PIiDPPALASQGAGIT 
GVSHCAQL I YVFQRSLQLLW 


8183 


22084 


A 


8243 


1 


173 


GLTVSQ AGVQWCDHSS L* SQTIiGLKQS S 
SLSLPSSWDYKCKPQHQADRFFLMEFFQ 
M 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

ammo aciQ 

residue of 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

/linn 

aing to 
last amino 
aciu 

residue of 
peptide 

scquemc 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaroic Acid, F=PhenyIalanine, 
G=Glycine, BNHistidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline } 
v^vviuiarnine, k— /vrginine, o— serine, 
T=Threonine, V=Valine, 
^V = Xrypto ph a n , V^Tyrosin e, 
X-Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8184 


22085 


A 


8244 


202 


327 


TLFITFI FF*R*GI*AVLPRLVLNSWPQV 
ILLLWPPRVLGLHV 


8185 


22086 


A 


8245 


42 


289 


KEKRS IILFIFFCRDGVLLWLPRLDLNS 
WPKTIFPPWPPK*IiALQAHTTIPRGGGS 


8186 


22087 


A 


8246 


592 


485 


NIVSQAGVQWHNLGSLQPLPLGLKWFSC 
LSLLSS * DYRHVPPCPANFCIS VETRFC 
HVGQAGLELLASNNP PAPASQS TGI TGV 
£3 1 K FK V I V FIjIj W I IjANGPS * RAETIjV 
NI1ATNQEDADSSI1APGLGSGHCF 


8187 


A/.UOO 










F v ra F J\Uxl£l FJjr iaWrr lUvKrLir r r F 
GGRQGGHFGFLAPPLPGLNKFLVLPFPG 
NGDTGKGPPARGNFFFFFFFGI FSRDFV 

QPP*Pf2WQr.TDnPP 


8188 


22089 


A 


8248 


392 


237 


HYGQAS LAPLTSGDLPASAPQS AEVTGV 
SHRAWPTVLI *SFS * KFNWV 


8189 






£9/1 Q 






r rFFoWFJ\± l c)ljiibFFPPWiiKGr PPPTL1L1 
GGGD*KRPPPPPGNFCFFKKNGVSPLWE 
GWF* TSNLGNPLPS PPKRVGIT 


8190 


22091 


A 


8250 


387 


166 


IETILDAI KNIHDSWEEVKISTLTRVWK 
N * JL F 1 FMDNFfiGFlCrSLEEVTADVMGTA 
RELEVEPEHVIELLHDIS 


8191 


22092 


A 


8251 


405 


266 


LFAI VKR * NHP KCLLI YAR INKMWH I S I 
bWN JL JL 5>li 1 0 MiMKr 0 x Mr Jb JvWHN IjEVIM 
LSEISHSQKVT*YMVPLA*GHLEKLNCL 
PIKQQTPIPSSSSP 


8199 




a 


oZjZ 




Z01 


FvjAN iALN FbTLiGGRSGWI A* AQELETS 
LSNMTKPQPCSTPQCFLVNQLLIiQNLYP 
YFP 


81 <tt 




a^ 








J-» 1 Jj 1 r H 1 KJvKJflb Jb K* I FY F I * S X LSQKD 
LKVGTGPGAGAHTYNPRTWGGWKGLQAG 
KRIT*AQEFETSLDNTVKTCL 


8194 


22095 


A 


8254 


303 


81 


TPCAENFKLRKRFHHI ISIjQRKMNYLIS 
FLYF* *SHSLTILPRLVSNSWPQAILLP 

IjFFIVV FlxLiy VSA1 LiFGJj 


8195 


22096 


A 


8255 


2 


376 


MGYNLVCHLLRPLEKRSSIRVGVT*FSG 
CPPS PLPLARKGYSLTPCASQVRQCXAXj 
LWLTLGGLHPLSCPHCPTSPSVMNPVPQ 

TQSSLDS FFKKVI 


8196 


22097 


A 


8256 


70 


356 


LI IAYNLTYDLSWKMFYVYLRKIYTIiLH 
WVECS I YTCWV* FVYSMVSWNRI DSPGK 
KKTMI YDOL I PNKGAKQTTTr ,P TfW<2 T . RKTlf 

WGQDKISTYKK 


8197 


22098 


A 


8257 


403 


85 


PFCPWFPPVFIPPPLRKSGTPPGPGPEF 

PflPGFSNGONPRPTtKTf PITP<? PnwrrpnP 
t\jtvjjc Oil vjvji.N lr r\. t JLiiVXvO t tVC O JrO V » VjrivO xr 

IjFPPSRGV*AGNFLGPGRWRGPWAKFVP 
LPSCIiGHKARFFFPKKKRRKIK 


8198 


22099 


A 




940 


9 


YPEGDTjGSYRPfiVQfrfMPPCiPPPnTDTM 
PWPGAVCTAYN^TLGGRDGWIA*A*EF 
GTSIAYIVKPHLYTHTHTHTHTHTMY 


8199 


99100 


a. 


89 SQ 


9 I 


ioy 


TS*A*VDFSHLSLLETSWD*RRVTPRIjVN 
LILFPS 


8200 


22101 


A 


8260 


453 


128 


FFFFFFL I ETVSHDVP *AGLELLSSSHP 
LASAFQSAEITGVSHHAQPSSVSWLIHS 
S 


8201 


22102 


A 


8261 


420 


79 


KNPTPFFFFFFIiIBTVSHDVP*AGIiELL 
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SEQTD 
NO: of 
nucleotide 
sequence 


SEQH> 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQH> 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
OCysteine, I>=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine> I^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=C*liitaminiL. R=A rpinin^ S==Sf»rinp 
T=Threonine, V=Valine, 

W—Trvntonhan Y=TvrfKine. 

X=Un known, *=^top codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














SSSHPLASAFOSAE ITGVSHHAOPSS V<5 
WLIHSS 


8202 


22103 


A 


8262 


1 


262 


NSRTERLGSWQSHFCHKEMEAHFWQSG 
VOWC!DIiG^T.RPLPPGFK* F^CP TT.T.tjriW 
DYSQCRPCRTCSLI FMHSRSPSQPSEKK 
WTC 


8203 


22104 


A 


8263 


321 


100 


SATKNYLLSRQS FIGK*NNGLGAVAHAY 
NSSTS RGLGGQ IA *A* EFBTS *GNMMRP 
HCYRKYKNIiARHSTYSPR 


8204 


22105 


A 


8264 


145 




AILPPWPPKVLRLQG 


8205 


22106 


A 


8265 


71 


J J 1 


SIiOYP^FARCDRPTCKGOPFQK'QVT fITlin 
WARTLLSTIAQNNSAEHHIiKTTDW*GAV 
AHACNPS TPQGRGGRIA* VQEI ETSQGN 
TGRPHLYKK 


8206 


22107 


A 


8266 


382 




lvetgfchvgoagIjKIiLTSGdpp Asa <;n 
SAGITGVSHCAWL*T*YS*DFQ 


8207 


22108 


A 


8267 


397 


130 


IGQAGLKLLTSGDLPTSASQTAGITGMS 
HRAWPTFS Li IAR * NAS FOVGPTF <3 PTTVOV 
GTLAKSLKCHQVRSDTLEFILCCRIIYE 
TLGKF 


8208 


22109 


A 


8268 


309 




DTWYRTRRKDHCTTTETRTOSVPPT^PP 
* AITSRVTCTYTSR* PEANEEPQKITFR 
HCDLFLPTLTDQLTL 


8209 


22110 


A 


8269 


2 


224 


DSSGIPGSPTRPPTRPKKQKLPCDPTVS 
LLG I C I YDQRQ* S VFQRD I CHSHMF ITA 
I»FT I VNI WKOT* HPIiMDE 


8210 


22111 


A 


8270 


374 


2 


TSraSARKPPPIHSHTIiKGFFFFIiFMFF 
FF*NKI CKTX3SLT^PRWLNSWAEGIL 
LPRSPKVVGIQELATQRRAS ILLCNPTY 
ACX5IFQHVVVNVKLLMKQCPFCPAAPTK 
TQNFFPRV 


8211 


22112 


A 


8271 


1 


379 


PTSASQVAJ3TTGTHHHIVLFFFFGIFEK 
KGVFPFSPGGF*TPKPKPPPGFGPKRGG 
KKGGGPPPPARGMGSGVFYPPPQKPFFV 
LIGGQ ILFBSGRTIiDFKKRAPPFPPFSLF 
IiPKHTPSFWIKWDP 


8212 


22113 


A 


8272 


80 


401 


AWGMRFCLLLYSCQYLISDXAHFSKAFF 
FWKREPPPAPQVKGQGPNLS * WNPWLQG 
* RGFSGLTPKGGGNYGPPP PPPLI FFFT, 
RKNGFFPGWPGWSKTPDLRKPPC 


8213 


22114 


A 


8273 


405 


95 


PPKRFWGSSKGFPQREKKLGSPTFPKKK 
KDFIYS YLSKSLKKE* INRPSAAAHTCN 
PSTLGGQGGRIT*GQRFKISLSNIGRPH 
LYEKI KNVKKLKI KKPNKL 


8214 


22115 


A 


8274 


167 


52 


DREQRTCAMAHTCNPSTLGG*GGWITRF 
IxANMVKLRLY 


8215 


22116 


A 


8275 


276 


23 


KYVSSRPGTVAHSCKPSPLDGQGEWIA* 
VRKFKTIL(3JMMKPH1aLFYKKLTFKNVK 
C I FKKYCATESRYVTKE KQKDRKKGTKE 


8216 


22117 


A 


8276 


1 


114 


FxWARMVWIS*PRDPPASASQSAGTTG 
VSHHARPGLS 


8217 


22118 


A 


8277 


2 


87 


TMLLRLFLtNS *AQAVI LPWPPAVLGLQG 


8218 


22119 


A 


8278 


357 


347 


SEGLALYQLEAFTVCDLRERGFCRGRSL 
WPLI TWAWRGGVF1VL VQFQEVCCRLALG 
SLSPDPVLLPESGGPTSASQSAGI IGVR 
DRAQPNCVFEIGS YS VTQ AGVQ *HNHGS 
L*P*P 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

q m inn op til 

residue of 
peptide 

t An itatl *«A 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

/linn t'tx 

uing w 
last amino 
acid 

residue oi 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
V^Hjiuianune, k— Argimne, o — serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8219 


22120 


A 


8279 


371 


125 


FFPKKNEPPG* PPPKK*RKPGKFILAPS 
ILKTPPRLGKNKPISPPPGQKKQNFP 


8220 


22121 


A 


8280 


377 


63 


PPVFPPPWGGQGGGFP *ARGLRTPWPTR 
ENPVFF*NPKN* PGGGGPF* IPPPWPG* 
AljiUf ¥ U PPKGG V P LTKN I PP P PRFSHKR 
RVFFPKNLKKKKNPCLIESL 


8221 


22122 


A 


8281 


2 


284 


LPK*VKNFVHAKICT*MFVAALFVITSS 
WKQPRYYSS I EKW INKLWHIS AMEYYSS 

KDTWCLIPVI 


8222 


22123 


A 


8282 


318 


40 


APRLS L WFMWFI KRLDSS PGSRITWT 
NCFENHLCVCVCDRFLLCQAREQWRVLG 
SPQSPSPKFKRFSCLSLPSSWDSRCKTP 
♦PANFCGLC 


8223 


22124 


A 


8283 


30 


343 


CYQKASHATEKSFMR*RVNPCSKLCCCY 
KILPQPPQPSAvITLISQQNKKKKKKKK 
KKKKKKKKKKKKKDGGGALKKKKIFSPR 
GGRIFFFFFWAKKNNLGGGL 


8224 


22125 


A 


8284 


163 


392 


FWLYNFSVIYSISDPCWYKEQLCKSVF 
SFLSS*SLLDEPNPNSPANSQAAQLYQE 
NKRIs xEKRVSAXVEQSWNDS 


8225 


22126 


A 


8285 


406 


63 


KKKLILAKPGWNIjDSLKRAPPFFFFFF 

RHGVPLCCPGWSKTPEPK*SSCLSLWSS 

WDYRREPLCLAYSPVLNITHLSSPNVKL 

L-KJLfSQESxDPIIMMMVI^ 

IM 


8226 


22127 


A 


8286 


1 


322 


TNTTiTi ATiLLFI ITF*LPQLNGYVEKSTP 
I is CGLrDP I S PARVP FS I KFFLGAITFLL 
FDLE IALLLPLP *ALQTTOLPLIGMSAL 
LLIIILAliRLAYE*LLKGLD*AE 


8997 


9919Q 


•rl 


o/o/ 




1 D1 

lol 


rlooJjxlFV? 1 iAjJjyyj? oKlibliPooWGYKHV 
PPCLANF* I FCRGG VSLCCLGS VEGLEC 
QARLPVLVRAELGLD 


8228 


22129 


A 


8288 


3 


246 


SSTLSFPSG*DYRHVPPCPANFFFFWIL 
civlPrUr VP\2(^rNLWA*TSPljNPPPWA 
S KGAKI TGGTPGMGLINLANGRP P 


8229 


22130 


A 


8289 


138 


421 


LLSSVFFFFERDFCFVPQAdGQWGNLG* 

W JN Jcr W C Ko-JUJVN if if VjJ_i I ±r FKo W IM W WjHjPPP 

PLIFCFFLKKKRVSPRGRGRANFLDLGT 
PPPGPPKERE 


8230 


j i 






141 


j J j 


PQS PGLKGSSLLRRWDYGSMPPCPS I FF 
F* IF* KNGASLCGPNNSKTGG 


8231 


22132 


A 


8291 


407 


209 


IGGPPGFS PPP VFKNRPPNLI FGPPKKK 
KNFPPPGGKNWFF* KGPPPFFFFFFFFF 
FFFFFYYTTj 


8232 


22133 


A 


8292 


86 


433 


PPXPNKLIEXGGAF*KRHFFPPPRGRVR 
FFFFGPQKKTPAPGQNPGGGGKPWGSPL 

VLI* KKTGLPLFVGRAGTNPNLGGPPAP 
PPKG 


8233 


22134 


A 


8293 


147 


342 


LVKFEYTVIiPKNRYRIiGWAHAYNSSSL 
GGQGGRIT*A*BFKTSLGNILRSHLYYK 
KRKEKKLN 


8234 


22135 


A 


8294 


6 


152 


CCGHAPPHPANFYSL*RRSLAMLPRLVS 
NSWTQAI LLP* PLKVLALQV 


8235 


22136 


A 


8295 


18 


205 


I PTRPPTRP IPGRLNHFTFTATRPGVYY 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: io 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

Ulllg w 

last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Hsoleuctne, 
K-Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Pro!ine, 

\£ Vjt I U lit ill III c, JV gHMHC, O uci ill 17, 

T=Threonine, V=Valine, 

W=T rvntn n h a n V=Tvrosi nc 

X-Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 

IIULlCtlUUC llljtci UUII 














GQCSEI *GANHSFMPIVLELI PLKI FEI 
GPVFTL 


8236 


22137 


A 


8296 


1 


199 


YLYI YLSILLS IYLSIYLS IYLASYIAT 

* T ,a T * TTFT YVFF WT .VMTFAR T FHT TiPS 

PSWRPRKVMV 


8237 


22)38 


A 


8297 


1 




PTP P PTP T .T L <?P P T K T DT ATjHK" FS LCPEY 
LQ*ADYLF*SLWIRPDAWPHACNIjSIL 
GG*GGRIA* AQEFE ISLGNIVRPCLYCI 
IKNKIKKKKPLQI IAFFLCFKD 


8238 


22139 


A 


8298 


416 


1(\A 


nPT.WVr^OART.KTJ.T ^fiHT.PA* ASONVGT 
or xjfi vv3^JrUJXJJ\4JUJLOOXuiir .r\ Sc i -' v vj j. 

TGMSHHARL 


8239 


22140 


A 


8299 


352 


1 1 
31 


ETLQARREWYDI FKVLKKKNKTKNPFYT 
KIEYLVKLSFKYEKVTKTLLDP*Q*IiRD 
FXNTRHANRNTSNRRKRPIIKHK 


8240 


22141 


A 


8300 


382 


ZOO 


fv W U Z. V L IJ_i *r IV1_1 VOlXtJ jn^nxiin V vjJLLti\.V-.ri 

DYRREPPHLA 


8241 


22142 


A 


8301 


2 


273 


PLTRGPSSASDALSSGGPYHPSECCFTY 
TTYKI PRQRIMD YYETNSQCS KPGIV* V 
VHTHHTGGRGSQQGLLEGSREWWWNGDP 
QRTSQV 


8242 


22143 


A 


8302 


77 


430 


GMGLSMHRI^QYIJUIIHPCVHVHTHTH 

mJ*T , LI , l , LI , l , LI*PIJ , I , l .nui*tJ , l*I?TUXrCT70'V(f2. , T T lS > 

XrllrllrLlrllill JjlJI!l w ttxlsxnJxo VKlwlll 
HAQDPFS IFSPYTPMCAClliTHTHTHTH 
THTHTHSVYVTHKVCEETCLSFFYPESG 
TISHAQEILKRCIiF^KTHLGWCFTSlxL 
SVANIVFFSYKRQVAAARGGIiFFSLSQA 

F 


8243 


22144 


A 


8303 


137 


396 


SSVKTMNYLKPVVHMVADRC*LFQCLPS 
KKKKKKKKKKKKKKGGPLKKKKILTRGG 
OCZj FFFFGAPKKT SPGRV* KKKKKKKKK 
KKKKGGPLKKKKNFNPGGGGNFFFFWGP 
KKNIAGARUCKPGGGKMPRS P 


8244 


22145 


7V 

A 


8304 


329 


AAA 
144 


WOPPFFFPW*DGVSkCCPGWTAVARSR 
LTFAVFFTLNCSWGIHSELQHWFQDYL 
KLKTF 


8245 


22146 


A 


8305 


404 


157 


RTYNPSTLRGRSGQ IT*GQDFDTSDGDR 
VGPHURKRKKNYYSLVECFYRIKEKYPE 
SHKKAMKMLFQLKPDFLHIFQHIATN 


8246 


22147 


A 


8306 


394 


294 


SLCFLTP PRDFNLGAL I KNFYS PTPGWV 
SCVx^KGPPFFFFFFFK3X3VLLSCPH*S 
QNPFVFALEWSGTPELK*SSHPGLPKAW 
DYTYEPPHOAQVELLRY*RSFPVLLLDG 
ICIFHSFFSPRGSLIP 


8247 


22148 


A 


8307 


280 


2 


WDSHVTFPTSVITVMLFSIQYVKQLEN 
IKSWPGll/ARACNPSAIiGGRDG+AQEFD 
SSPGNTAKPOxPQKNKTKQKTKNKVGGT 
LETRSWRL 


8248 


22149 


A 


8308 


136 


330 


MSYQPJCSVNKYKSNNACEEVTTWARHSG 
SCLRLYNPSTLVGSGGKIP*TQEFETSL 
SNIARPYLY 


8249 


22150 


A 


8309 


3 


118 


IFWLVQMGFHHVGQASIiEL * PQVIHPP 
QPPKVLGLPA 


8250 


22151 


A 


8310 


2 


366 


TRTOPRVRVRSTHRNLCIiPGSSHASTSA 
S * VAGTTSACHHARLIFSFSFFFLGFSE 
KTGFNFFAQGGFEPLGEREPPPFGPPGF 
WGGGGGGTFKRPRRWKFQKD IIP PLVSN 
RGDQPEPFF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

ami tin arid 

residue of 
peptide 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

■"ooiff iv* nf 
I Co I U UC UI 

peptide 
sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidioe, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

(^V^rl^l nfom int» P=tA rain In a ^=$ai>!na 

wiuiHiiijiJc, ix - virgin iii c, i3^3erine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
A"uu ivnuwn, =oiop coaon, /— possioie 
nucleotide deletion, V=possible 
nucleotide insertion 


8251 


22152 


A 


8311 


253 


446 


SKTFQARGGVTEMGFRHVGQTGLEILTS 
GDPPASACQAAGITGVSHHARLR I * YFI 
PTPTLMTP 

In. J. C J. 4-il i X V» 


8252 


22153 


A 


8312 


430 


88 


GVGRIQISWSPWLPKPKNSPGSPPQ*VG 
APGGPPPPGPYFFFFFQKKGFPLFSGRV 

*nQn*5ITKf RT?\rPPPPTOtfWfVrQ P do DDirrD 

GFFFFFFFLSRDGASPCCPPRPPKVLGL 
QV 


8253 


22154 


A 


8313 


400 


323 


HRLMPPHPIDFFLFFVETGSCYVAQGAC 
ELLGSGNLPGS AS * NAR I TDVNHHAQ P * 


8254 


22155 


A 


8314 


4 


421 


GGRIARTQEAKVAVSQQCTTAIjQPGRHS 
KTPSQKTKKKKSPGGPPGPPPGRFFFFF 

LWKKPGGKLGGPTKKKTRGGGANPPPLW 
GKGGGPKGAPRAFPFTKKKPPPSGRNQ 


8255 


22156 


A 


8315 


213 


32 


RCPGTVGHGCHPSPLGGQRGRI P *GQBF 

t?TCT fcKTMTn?"OV1?T?T KTC VTrVTlTKrilD'T'WKr 

olo j_i/\i>J rl w rvir I r r j_i in o rvivtv 1 ivWHPlivKri 
SQAW 


8256 


22157 


A 


8316 


422 


153 


AATLYTHKTPHCIYFFLDRNLLCCPGWS 
QTPGLK*AFCLSLPSSWDYRHKPPRPVT 

C DTT7D T VQVTYtTT DT5DVT D4MDMMT VT T)C 

YCHHI 


OZ J / 


ZZ1 Jo 




00 1 7 
OJI / 


i 


254 


LHQLAMKKGDPQLCDFLESHFLNQQVKA 
IKKLGDYIjSNLCKT*APEAGLAEYLFDK 
LTLGGSEEDT 


8258 


22159 


A 


8318 


187 


37 


TRDFFWPGWAHA*NCSTLGGQGRRIA* 
AQKFESNLGNRVSHCLYKKFKN 




ZZlOU 




oiiy 


2 


OCA 

250 


WTCf^ G TAP! TCJT^/^t/ IT D OUT O T T/^TrODDnB 

i\.z tiUo xjUci V WJUCVKRPNIjRI igvpeeee 
KSKSLENI FGGI IEENFPGLARDLDIQI 
REAQRTPEKSIAKRSSSRHIVIRIjSK*E 

FGGI IEENFPGLARDLDIQIREAQRTPE 
KSIAKRSSSRHIVIRLSK 




ZZ lOl 






J /I 




tfxrPQ TT.l?T?l?iri7 r PD GU#xn\r\a/T ttt t y*>o o 
XMiMrroiijr r c rciiton" v/^ilurxjVxjijuoo 

NPPCSASRVAGVTGVHQRYQQDDFKNFR 

HN 


8261 


22162 


A 


8321 


3 


235 


FISVSFKYSKHIPNNMHLKFLVLFIJJFF 
FRDGFWLRSG* S WAI LRCGHS S LQ PLT 
GLKOS SHLS FOGS WD VP RAPTf 


8262 


22163 


A 


8322 


1 


395 


LEVPLLNTSLLLASGVS IT* AHHRLIES 
NRNQI IQALLITILLGLYFTLLQASEYF 
ESPFTISDGIYGSTFFVATGFHRTtHGTT 
GSTFLTICFIRQLIFHFTSKHHFGFEAA 
A*YWHFVDED*LFLYVSIY 


8263 




A 






OJ 


MHOS FT, VAKVKKG TCHG R WFYS GT.MT tr q A 
PVl^VlWlTiLCWPGAVVHACNPSTLGG 
QGRWMA*AQEFETSLDNVGKPHP 


8264 




A 




19 


9AQ 


CI F PTiP P P PORGE KR R P P P P P T .P PW F 

KGEFPIjGGGNFLTLEPPRLAPPKGG^G 

GDPPPPGFFFFLKKKPKPLLKG 


8265 


22166 


A 


8325 


139 


335 


ENILWAATVVIiDCHPSTLGG*GGRIT*G 
QEFKTNLANIERPHLFFFFFLKRNFFFA 
PRVEGRGPV 


8266 


22167 


A 


8326 


1 


246 . 


FRRGLAVAPRVLLCYPGWRAMVQS * LTA 
ASNSWAQAIHLSLPKCWNYKHTPPYPTC 
*NSACWDESSFPLLVSAMEVCGQPHS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/5154 
26 

• 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
oing to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Gfutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, ! 

0=^21 ii to mi no 1>— A rnintnp C=Carinf» 
^ \j» u ui in 1 u c, iv* - ixm giuiiic, 0 ijci uiC) 

T=Threonine, V=Valine, 

W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stop codon, ^possible 

n nr>ljM*itff1 a Ho I Aft An \=rnftCCihlp 
nUCJcUUUc UclCUUIl, v — puioIUic 

nucleotide insertion 


8267 


22168 


A 


oo2/ 


422 


1U/ 


TYYVROORTCKPCJL.FFP FFRFPLPPGGGO 
WGHFWFLQPPPRGV*RFFPLNLPRNRPW 
DFKGLPPPPAYFAFFLKKGVFPPGQGDF 
* ILAPKGPPTSTSQKGWGFKD 


8268 


22169 


A 


8328 


I | 


lol 


LILNSWAQATLLPQIjPE * IX3LQTSS 


8269 


22170 


A 


8329 


422 


103 


PPYSPPFGGPGGGVPKVRESGPPGPPQG 
NPFFFKKPKNNPGGGGGPLIPPSSGG*A 
KGMGLSPRPRLPLS*NWPPPPPPGGQNK 

r.DVDV tCVTV VI HDTV J?Sf JTWt?13'WP/3 


8270 


22171 


A 


8330 


46 


422 


STPFPIiASQSAGITGVSHRARRSSSFQC 

TQIPLFPETLPNLGFGSRHGRGFSSQVL 

PAQSSSFDLDLISVLGY*MI>SHISLGPN 

P J\T J^P T DTWT.C!lfWT7MrJ <ITI .12 <3 T VT . T R 
XsnXjV7ff X tr .trnxjOJM» dj/tV70 x jjao x vuoyi.D 

GIxSSTLSFPHPVS 


8271 


22172 


A 


8331 


429 


112 


FSHLGSS PRQGFTMLtARLWNS * PCDP P 
TSASQSAGITGVSHHAQPHAVFYIRLPD 

NLSVPPPLWPIQSQPKASILLM 


8272 


22173 


A 


8332 


291 


19 


KTPFPPKI*KFPPNFLKFTPGCGGPPPP 

lTT.irirPVCS PTJNPP PKTT T PFFFFP PPP P FF 

FFFFFLRRGSLISIAQDGIQWHYHGSLQ 
POT»PG1aR 

XT S^XI t V7XXCV 


8273 


22174 


A 


8333 


307 


123 


KYFRGGGGGGR* FPPLGGVKQKKRGNPG 
GGGSKKPKLGPSPPTRGKKKNFFSPKKK 


8274 


22175 


A 


8334 


3 


169 


HASADMCHQAQLLLLI J.VETGkCHDGRA 
GFGS*SQEIHPPRPPKVLGLQHFGLIY 


8275 


22176 


A 


8335 


1 


208 


IXVC^OTCVCVCTCVLFC*WRKGLAIiSP 
RGECSGTNMAHCSIxNFSGSSNPSVSPSH 

\tW WITYSYTPDTT^l 
VADl iuVKrllAJ 


8276 


22177 


A 


8336 


1 


352 


ILFYFIFYYFSFCFEMGSRSVSQAGVQC 
PDLSSLQPLPPRFKRFFCLTFPSSRDYR 
CVPPHLAKFCI IRRR*AAIGQAGIELLT 
LSDPPASAS*SARNTGVSHCSGPSFINF 


8277 


22178 


A 


8337 


420 


272 


KAPCQFSGEKWFPKHCWNSWVPFRKKQ 
QQKKNLCPCLVL YTS INS Q * TMGLN I KA 
RTVNT S * KYKEDP * TGKK* INKS CPWTG 
{OaaCFLARTQKTLAIWDTLINWTSPKLK 
MSTKQRHHLKMNR* PKWLIiFFVS 


8278 


22179 


A 


8338 


427 


202 


LIIYFCHQSVQKKGVLPLLQEREGWAEA 
SLKISNISI^HTHTHTHACAHTHTHTHT 
HTSS * NG IKTWIT r KS FA 


8279 


22180 


A 


8339 


2 


164 


TPGKG I S LCRPGWS AVAP SRLTATSAS W 
VQT I LLPWPPE * LGLQES RL LRDHLC 


8280 


22181 


A 


8340 


422 


1 


PPGPPGGKXKRAPPPSFFKGPNPLLKGK 
GPGAKPPPKNGVFFFPPKKVKGPQKNFG 
GTQISQPNKRGA* KFTPFFIr*KPPQKKK 
GPPPGEKIFKGPKF * iCFPSPGFFFFFLK 
QGLTLSPRLERSGANTAHCSLKLLSSSD 


8281 


22182 


A 


8341 


1 


456 


KKOTAPGGKGKIGGTAKKKGRNFIxDAKI 
LGPNSRPGGAFFGG*TAPPGKKRGSPGP 
IFGGGKGGPKKKILSF*KAGGGAKFFPS 
P VGTPFFRGF* KGKIlYlxKKKKFS KPGGG 
NPENLSPTNFFF 


8282 


22183 


A 


8342 


407 


2 


RQFILGAPGVFPPPGF*KPPPKIIFWGP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning ! 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutaroic Acid, ^Phenylalanine, 
G=Glycrae, H=Histidine, I=lsoteudne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKKNFFPPPRGEKFFSFKGAPPLFFFFF 
FFFFFFSPFFFGALFFFFFTILFKTKFG 
VFSCKGIPPKFFKRSRPPNGAGRLPLLG 
VGPLPGVPQGGEAERKISQGS FQ 


8283 


22184 


A 


8343 


441 


1 


PPPYILGGPRVFSPPPFFKTPPQIFFLG 
PQKKKNSPPPPGKKIFFF*RAPPPFFFF 
FFFFFFFPPFFFGALFFFFFTILFKTKF 
VVISWKGIQPKLFKRSRPPNGGGRLPLL 
GGGP PPGVPRGGEAERKFSRGS FQRTGG 
SAPEFRA 


8284 


22185 


A 


8344 


466 


323 


FGVAGTQNPGRNPRVPS FRGGP P PGVPQ 
GRNPPPPWFFPLNPPKHFLTP*VFPFLE 
PPIEWVGPGPP*PSYGFSSPVARK 


8285 


22186 


A 


8345 


657 


347 


LFFWFFLRQS FTLVAQAGVQ* CDLGSLQ 
PPPPGFK*FSRLNLPSSIiDDRCPAPCPA 
NFVLLVETGFHHVGQAGLELLTSGDPPA 
SASQSAGITGVSISAQPAQ 


8286 


22187 


A 


8346 


3 


212 


DCSRGP P PHLANFGLLEKTGSNHVDQAG 
LKPLSLRDPPASAS*SPGITGGSHRAQF 
* FI FKRGMAPKGF 


8287 


22188 


A 


8347 


9 


255 


IPGPTHPSDFFLYEIDLPFKN*KGFPPP 
PRGGGKNPPFPPPPFFFS * PPWKKKITF 
QTPLTGVPGAPKNFNFWGGFKKIGF 


8288 


22189 


A 


8348 


79 


312 


NAAGFEVGISKA* ALETLQI ITREGHSN 
KPRYAVTFESHEMGTVLHLVEQEHTVGY 
ITASCSALKC*IESGGFLNDYI 


8289 


22190 


A 


8349 


1 


369 


FFFFLKRDRVS PLSSRVKYKWHDYSSLQ 
ZjQTPGLQ*SSCLRFIiSSQDYRHrPPCPT 
NF* KCFVEMGS CCVTQGWSQTPGAQAVL 
PPWASQRHWGLQGVRPLCSLLLSNCPHS 
RLVWSWLFSGC 


8290 


22191 


A 


8350 


434 


324 


LPILSPVWWHMPVVPGTREAEA*EQ*SL . 
EPGRRRLQ 


8291, 


22192 


A 


8351 


3 


253 


YMCSKFFFLECSGAISAHCHLBIiPGSSD 
PPPPT*NF*VIGTTGTCHHEWLLFELFG 
KMKPHY IAQVGFKLLERNFLKTKKVFP 


8292 


22193 


A 


8352 


3 


1431 


SPRGGIPSLKQVYSSLTTNSRPSQHLF* 
EEALQRAREEEEKRKEITSHFQSTLTDI 
QGQIEQQSERNMKLCQENTELAEKLKSI 
IDQYELREEHLDKI FKHRELQQKLVDAK 
IiEQAQEMMKEAEERHKREKEYLLNQAAE 
WKLQAKVLKEQETVLQAQLTLYSGRFEE 
FQSTLTKSNEVFATFKQEMDKTTKKMKK 
LEKDTATWKARFENCNKALLDMIEEKAL 
RAKBYECFVMKIGRLENLCRALQEERNE 
LHKKIRDAEISEKDDQSQHNSDEEPESN 
VSVDQE IDAEEVNS VQTAVKNLATAFMI 
IHHPESTPHQSKETQPEIGSSQESADAA 
LKEPEQPPLIPSRDSESPLPPLTPQAEA 
EGGSDAEPPSKASNSPAGLGAETQCEGL 
PVGAQADQASWKPEAEASGQAPQAPTEA 
SLQKMEADVPAPACAAEEHVAAMVPACE 
PSRUPPRAAAEBIjPVGASAGPQPRNVA 


8293 


22194 


A 


8353 


376 


65 


IHCGDHVQEVPKLKVCALHMSSWACSQI 
PKAGDKILTFDQLTIiDTLKGCGT ILLSG 
PHKGQEVYWHFSKALGTQHSHTKPCVHS 
RDQKFKHIRG*WASQGYKN 


8294 


22195 


A 


8354 


459 


85 


' LEEVESLERVLEKKMIESEELKSKQTRF 
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SEQIB 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude* 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 
^Glutamic Acid, FHPhenylalanine, 
G=Glycine, HNHistidine, I=Iso leucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S^Serine, 
T=Tbreonine, V^Valine, 
W=Tryptophan, V^Tyrosine, 
X=Un known, **=Stop codon, A=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LEEIKNQDKLNKSLKEEAMIiQKQSCEEL 
KSDLNTKNELLKQKTIELTRACQKQYEL 
B*ELAFYKIBAKFEPUTYYPSEYAEIDK 
APDESPYIGKSRY 


8295 


22196 


A 


8355 


231 


158 


PFFIFYLYQSNDIARGLERGLEPEKVIG 
ATDS CGDLMFLMKW * ESVAPMTFSGSRP 
LSKPRAISLLVmC 


8296 


22197 


A 


8356 


2 


164 


IHRGVENQMAGEQGWDKDVSVYWVGAVA 
HACNPNTLGGQGRWIT*GQEFETNLP 


8297 


22198 


A 


8357 


23 


291 


CTHTRAQAHAHTHHLFYYFLYIFNFFEE 
TGSH* IVQACLEIiPASSDPPASFSQSAG 
ITGMSHHAWPTSCIIiKVKMALCSINCSA 
ACFLY 


8298 


22199 


A 


8358 


1 


307 


FFFFEIESCSFAQDGVQWCDLGSLQPPP 
PGFKQFSCLSU^SSWDYRHVPPRIANFC 
IFSRDGFHHVGQAGLQLLTSNDPPT*AS 
QSAGITGVSPRAQPCPAF 


8299 


22200 


A 


8359 


75 


266 


FVTCTRNLFLTVLEAGKSNI KARAG* VSG 
EEHGLCLQDGALLPLEGRNAVS SHPRGQ 
NMMRAKC 


8300 


22201 ( 


A 


8360 


1 


295 


NTWWCFVLLFRDRVSLCCSVRSTVASS 
*LIVNMNLSFPSS*AYRHVSSHLDFFFF 
FKIQR I LI LPRL I SNFWPQE I LLWLPQG 
LGLQPRPQVLVGNS 


8301 


2^202 


A 


8361 


376 


293 


MGFHHVSQAGLKLLTSGDLPASASQSAR 
ITGAGHHAQPDF*HSGDSWQAPPEPRDQ 
G 


8302 


22203 


A 


8362 


375 


142 


PQEEFLRFATDVGEYRAVTEIiERPVSES 
WNTQKDLL * QKRGRVSTYCRHTYGVGES 
FSVQRRGEHVGGRGLGPCELEI 


8303 


22204 


A 


8363 


36 


363 


QI*S FNPL I GKKVKLRTVFIGKQ FLFLLG 
GGVREAGAHSGAQATVRWHNLS SHAKRP 
SCLSLPSCDHRHLLPHPANFCIFWTD* I 
PPCRSGWVQTPELKQFTHLGLPKR 


8304 


22205 


A 


8364 


455 


235 


GAQARAPPRGIIFFFFW*RKGFCMLPRP 
VSKFWG*VVPLPWPPKRVGLDSWAPPPR 
DFIFFSFFVKKRVCILW 


8305 


22206 


A 


8365 


432 


175 


STDQVSVAMGPS* PGAGYNLLVCRWLRP 
LKKCSIRMGVS*FSRYCLSWLPLDBCKGN 
SLTPCASRVRRCTVMLRLMLRGLHPLSD 
KP 


8306 


22207 


A 


8366 


383 


83 


RRLIVFI CYFSKK*GLPLLPRI»VSNFWP 
QVILVPWPPKVLGL *ALKKLWLDMVTHA 
YNPSTLGGQGTRIT 


8307 


22208 


A 


8367 


27 


304 


FFGFILTLKKSSN* CTLRFCFLS IGKVR 
IHTLGNILSRRGYGERCStiPHCLWESIC 
LPVQSN* ALHKKVQMHI P FDQAI PSRGI 
LPTDTGHK 


8308 


22209 


A 


8368 


165 


204 


QNIPIAVSFFFFFLKRVSFCPPG*RAGG 
QSLFIGALTFQV*GILIPYPPGNWELRG 
PPPNLTNFF*FLNLRFLIiGPINLHYVHD 
KTSPUjYIiFFFFF 


8309 


22210 


A 


8369 


400 


201 


KERPLPFRXiIGSPFQGTWPRFFSKGEVK 
NSNFSKY*PPPFINPAPRVCFEGIEVIT 
KFFQVRPFFP 


8310 


22211 


A 


8370 


21 


408 


LIPLSTYVCVRVCECVCVCVCVNI+GYA 
CNTLLIACNILQDVPFTS PTKDKI FSGD 
FLTLVGKK 
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SEQU) 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acta 
residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, h=Iso leucine, 
K=Lysine, L=Leu d ne, M=Meth ion i ne, 
N=Asparagine, P=Proline, 
v^oiutamine, k— Arginwe, o— oenne, 
T=Threonine, V=Valine, 
w— irypcopnan, i — tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8311 


22212 


A 


8371 


1 


94 


NTLGHEN*LNSaFGGCSEPRSCHCTPrW 
ATE 






A 


5372 


546 


972 


V ±Hi* U Y GR* PS FCLGIjAIjFKTLiQ VMHFL 
KDSVRNLKSSNLKVGLNFRS PQIQATI S 
. PPLQPKTKTYEBDPKSKPKKEEKNMEVN 
MKEWILRYAEQQNEEEKNENSKSLEEEE 
KJ*v PWcJc iIJIIjAAKLIjDAKEQAAT FKLE 
KK 


8313 


22214 


A 
* 


8373 


502 


178 


QHFWRLRQKDSFALGG* KQPGQKGENP F 

Cf^WTi VTkTTT -* r»rrv> nx nrri'i"i'A zr/-if~*f~*t^ n» n 
r yisJftJUSjN JLJjLj* WUyP V V PT 1 yjKjWJGGES 

L* PGRTKSPPCTSGWGKKQNPVLKKKKN 
PQKTOCIPLCIYHIFLINRKTNTV j 


8314 


22215 


A 


8374 


362 


112 


YRMS VNPQVRAKRVSQRTNTP PNFFF * D 
RVLIiCRPG*STWQSQLTVASISWAKAI 
LIjPPQPRQAKYFLRIj VKCS 5 EKGER IE 


8315 


22216 


A 


8375 


411 


21 


VLGIYPTLVSPSPGLKMTRVKLPSGSKK 
VT S S SNRP VFGWAAGG * FD KP I LKAGR 
ASSKYKAKRNCWPRVRGVAMNPVEHPFG 
GGILQRIGKPSAIRIYAPAGRKVGLIAV 
KRSGRIiRGANTVQEKEN 


8316 


22217 


A 


8376 


450 


108 


AEGGRGPRK* *LRSCRPVDGSAGQSPGP 
LRTESIdSTPGCS PQGWQELEP PPRRPQ 
WWPGPPGHRDRRQHRRPGARSRQQGPES 
SH *QRSGESRPPAVPRPPCPL P WKKGVL 
E 


8317 


22218 


A 


8377 


25 


306 


EQQPPPVAVHQSGPPGAPPEASPSHPGS 
S EKPQP VP PPRAPLvPQPKP * LS PPHHP 
AAGLLAVHALAGGPGTVQRGLPGRAARA 
VSVRVPAARC 


8318 


22219 


A 


8378 


376 


149 


PAHCSLLI PGFRGSSCLSLPCNWYPGRA 
P PR VGNF FFFF * EMES CYVAMLPSLVLT 
SWALAILPVLPPKLPGLQA 


8319 


22220 


A 


8379 


98 


374 


S VPPPVGT YBGDAAPTGGGGLPALGR * G 
AAGGHVQQGHPGPHRGHAAPKPSAQGGP 
GHSRECLE PSGGAAGGEREP PRAGGGQS 
WMRAPSMT 


8320 

- 


22221 


A 


8380 


2 


807 


RPQCCVRAPRAGAVSGVPKLWSVP*SGR 
SAPHPTRS * VRS WSRWLRTQS WQTSRTR 
RRSESKVRVSYQAWTASPPPAASASARP 
A*RTSSRSYSSSSTCCSWLWPSSWSTGP 
bQTr VKWS S TI»S CiiCLTRKw IAMMPQ VL 
PCTPVRPSCSAVSTI TRSFLL *QALASR 
VT*IAPPRGSTTRTPSPXRL*NLP*LSR 

ftDf2V*V C/3CUCGCCCK *TDTnrDTe »nr t 

TSSSLLSRSSCKAQTG*ASCRPVRVPIP 


8321 


22222 


A 


8381 


104 


327 


CIAFPEMLGIKYEVGKCPSGLKSLKILK 
NTHTHTHTHTHTHTHTHTHTHALYIilRE 
GRKK*CVCMRKTOSTCVS 


8322 


22223 


A 


8382 


400 


270 


KPKLEVRASGQVQWLSPIIPALEEAKAG 

r»*T dddct ■DtrownrmtT 
li w UakrKJz Lutslr m V (JN 


8323 


22224 


A 


8383 


178 


588 


ADLLSGAPGPPGGSQDNANSSEHSRACA 
PAAPGP * WLQPWLCP * WTQS PLEARGS * 
NPGVSHSSRIPGIPGAAGTSRPSGALPP 
OPAGHGGSSGELHVARGCACGRPLWSHR 
PGCGQLLRVTAADSHRCASRGAAPM 


8324 


22225 


A 


8384 


423 


78 


QGQELAELTGVTVESPEEPQRLLVS*GG 
TVPCLACSLHRTL* TVGHGSASS FLAQG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acia 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

Ulug IU 

last amino 
acid 

rflr i rl 1 1 ^ ftf 

peptide 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Asparttc Acid, 
E=GIutamic Acid, {^Phenylalanine, 
G==Glycme, ENHistidine, l=Isoleucine, 
K-Lysine, L=Leucine, M-Metfaionine, 
N=Asparagine, P=Prolme, 

0=<^lnf o minp XI— \ roinirip Q— Qr» r i n p 
\^ WlUUlIlllltC, IV r\l gllJlHCj o— OCI lllc, 

T=Threonme, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

^C— Tin If nnwn £=C-f/\rv pnrlo n A^rwrccihlf 

A UlllVUUlVllj "UlUJ/ V.UUVH) /~ JJU9MUIC 

nucleotide deletion, \=possible 
nucleotide insertion 














DMEPLGDRGLQGTEQLH * GGDRAS LQD I 
CPLLPVTjSIiLCPRSLDLHRHRVGVTGFT 
ED 


8325 


22226 


A 


8385 


434 


176 


FADDSRFDFLLLV*CAMIiMLIREQIiLEG 
HFTVNMRLMQDYPITDVCQ I LQKAKELQ 
D^fC* PGGKRPMFGREASRPCAVAPGTHR 
NL 


8326 


22227 


A 


8386 


2 


305 


RLWEDPPGRNTPPGAR* PGPCGEGGHI I 
REEOACGWPRGHSGPWRHPOAGPIiOLGO 
WRGKGAGQGARRS PQGSLESRPPTLSAA 
PPAIHICSVGGVRLHCY 


8327 


22228 


TV 


8387 


331 




FNTP Q W^t/TO ArTVOWRTYYG^ T .OTT? P pfTT ^ 
CiiiEvono v x\^nv3 vv^rsjvu iuujjv/ xx\.ir r uuo 

FNVNWDWVGL*GDPSSPPMUiPSS*ARP 
GGGWGPGGEGHVS KSAAGRIiQGPP 


8328 


22229 


TV 


o*>oo 
oioo 


2 




P * S CTCTVTWAFFIiES clrcahwyvtl 
LTALLGSLPCANPRQTAVSLPQAQIiV 


8329 


22230 


A 


8389 


388 


176 


YWDSQ*C3^PSFAVPVESACICAIGHWFS 
KSWSVIAVOnXSSSHKYVBTHDGRCHR 
EAFDVYRD I CDDDA 


8330 


22231 


A 


8390 


445 


330 


PPRFTPFSCLSLRSSWDYR* PPPRPDNF 


8331 


22232 


A 


8391 


559 


142 


LRTLPVTLGKSLALSVPSFPYVGGNWIT 
NREPSABCGRGSGPNRGPGRAASCPARTH 
SLRHS PAGGVMLTQPRP ILILPETQHPS 
ES AVARTDIS KARRWLHSRS PWPH* DET 
SAGAE RPG I RCY I GHP AT 


8332 


22233 


A 


8392 


104 


400 


YYEIFKKEJViySYYHILHFFFFFSFFFFE 
TKPNS VMGPEGKGPIT^*LKPWLIjG* KH 
FCIiTLPSGGNYRHVIjPPPVNGFFFFFFF 
KKKKVFPFVAPRGV 


8333 


22234 


A 


8393 


408 


169 


DQVWI KDWNIGSIjRPR* KGPQTI ILTTP 
TAVKIEGIPAWIQHSQVKPAAPETWEVR 
PSLDNPCKVTLKKMTS PAPVTLRS 


8334 


22235. 


A 


8394 


1 


269 


YCCPLFSSKALTQENSPYSSFRLVNPPG 
LSTiHPEGRGGKW INERGRELGPSAGPLL 
LFIiHFAEAGRRQPPDWADSEADLiQQVRH 
KLGQP 


8335 


22236 


A 


8395 


2 


204 


KDCKVNKEVERVLREFHQAGKP IGCDAS 
TSLPAPAPWWSWNAALCCPGLS CVWCWQ 
GESRTCVGEHQ 


8336 


22237 


A 


8396 


3 


285 


KVTVKTORKELRiCRLNLEEW I LEQLTRL 
YDCQEEEIPELEIDVDELLDMESDDARA 
ARVKEIxLVDCYKPTEAFISGIxLDKIRGM 
QKLSTPQKK 






TV 




i 


109 


ETGFHHAGQDGLELLTSGDCPS LASQS A 
RITGVSHCTRAQLLTEQEYFKNQNHLPS 
APFYKRAF 










o 
o 


LfW. 


LNP F L Y AFLDP RFROACTS MLC CGO S RS 
SGTSHS IRGEKSAJS YSSGHSQGPGPNMG 
KGGEQMLEKS I P YSQETLVVD 


8339 


22240 


A 


8399 


2 


240 


S CLN P FIj YAF LE PRFRQACTS ML CCGQS 
RSAGTLHSSRGEKSASYS SGHSQGPGPN 
MGKGGEQMHEKS IP YSQETLVVD 


8340 


22241 


A 


8400 


10 


272 


CNWQNAYLT\A/RCAQDCEDYFAERLYRS 
MKGAGTDEETLIRI IVTRAEVDLLGIKP 
KFHQKYQKSLSDMVRSDTC^DFRKLLVA 
LLH 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A»Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L= Leucine, M-Methtonine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Un known, *=Stop codon, /=po$sib!e 
nucleotide deletion, ^possible 
nucleotide insertion 


8341 


22242 


A 


8401 


47 


248 


TRGIFFFFLNTRSHSVAQAKGLGHDHSS 
tiKPQPPRLKQS FNLS LLSTWNYGGLLPH 
SALWKAEAAKS 


8342 


22243 


A 


8403 


497 


303 


TVEAS VS YIaL YVAMVMQLPWGKS PASGA 
ESHRQGCGGSWPGGGMEEGRGAERGGGC 
LLPQALLY 


8343 


22244 


A 


8404 


3 


235 


LTLLAHCRMMAAAHMGGPPMMPMMGPNT 
PGMMPVGPAPGMRPPLGGHMPMI PGYPM 
MRS PARLMMVPSQPRMTRPDR 


8344 


22245 


A 


8405 


3 


246 


LVFSVDVGLLAHTRQRGLKRSSGMSSLL 
GKIGAKKQKMSTLEKSKLDWBSFKEEEG 
IGEBLAIHNRGKEGWIKGAQPLGQ 


8345 


22246 


A 


8406 ] 


1739 


804 


WEPDVSGSAAWLASAAFGMATFSGPAGP 
ILSLNPQEDVEFQKEVAQVRKRITQRKK 
QEQLTPGVVYVRHLPNLLDETQIFSYFS 
QFGTVTRFRLSRSKRTGNSKGYAFVEFB 
SEDVAKIVAETMNNYLFGERLLECHFMP 
PEKVHKELFKDWNIPFKQPSYPSVKRYN 
RNRTLTQKLRMEERFKKKERLLRKKLAK 
KGIDYDFPSLIIiQKTESISKTNRQTSTK 
GQVLRKKKKKVSGTLDTPEKTVDSQGPT 
PVCTPTFLERRKSQVAELNDDDKDDE IV 
FKQPISCVKEEIQETQTPTHSRKKRRRS 
SNQ. 


8346 


22247 


A 


8407 


2 


241 


CLRFPSSPAMGLLRSGTKLIFRRRPKQK 
EAGLSQSHDDLSNATATPSVRMKAGSFS 
RRLIKRFYFKSKPKANGNPSPQL 


8347 


22248 


A 


8408 


285 


186 


KQGLTSLPRLVSNSWAQAFLLLWPPKVL 
RLQA 


8348 


22249 


A 


8409 


1 


246 


RHENREELQVIADLCIKYDTLCISDEEF 
MNGVEYIWKGPCFCLTTFTLYLKGDIFP 
RPPSGCLNLWIVLWPMYPVIHLITSC 


8349 


22250 


A 


8410 


1 


102 


AGS YTLHI IKGDDGTRGVTGRFTFTIiHR 
IQTSEE 


8350 


22251 


A 


8411 


877 


647 


FFFETESHSVTQAGVKWGDLGSLQSLAP 
GPKKFSCLNLPSSRDYRSPPPPLANFCT 
FSKNGVSPCWPGRSSTPDRR 


8351 


22252 


A 


8412 


1 


130 


RMLNDKTLRTDIGGNF PKNGWAAIATHS 
FEFAQFDNFLEEATR | 


8352 


22253 


A 


8413 


97 


401 


CNPENGLTASALGRRMCLATCKAPRTLP 
DSGDTASCRFPAVPRPPHSRRS SGSGHL 
PGRPRCPALPGLBWSNPPGPPTSGYLFP 
TFSTPAAHSHQKTLLGI 


8353 


22254 


A 


8414 


348 


516 


VSAYGFITEGHERFSDHYYDTSWKRIilF 
YINHDFKtiEREVWKRLHDEGI IRLYQRL 


8354 


22255 


A 


8415 


714 


410 


LGVFRSAIjHGSLWLLLRSFPQKSPNPLA 
LLLFLQ CNTAYQCLL IADQHCRTRKYFL 
CLASGIPCVSHVWVHDSCHANQLQNYRN 
YLLPAGYSLEEQRILDW 


8355 


22256 


A 


8416 


2 


102 


GHDPQDRLLAQDSEHNHSDRNWQRNRPW 
PKDSY 


8356 


22257 


A 


8417 


1 


309 


FYFENALSKSNKPIHTIILNPHVHLVGD 
DAACIAYIRLTQYMDGSGMPKTMQSEET 
RVWHRRDGKWQNVHFHRSGSPTVPIKPP 
CIPNGKENFSGGTSLWQNI 


8357 


22258 


A 


8418 


3 


242 


ARALTNAASHVDDMPNALSALNDLHAHK 
LRLDPVNLKLLSHCLLVTLVDHLPDEFT 
PVEHACLDNVLAS VS TVLTFK YR 
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SEQ ID 
NO; of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
OGIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, t=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codoo, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


8358 


22259 


A 


8419 


711 


296 


NVANSDGLIASLWKEYGKADARWVYPDP 
TIVSVEILTVALDGSLALFLIYATVKEK 
YYRHFLQITLCVCELYGCWMTFLPEWLT 
RSPNLNTSNWLYCWIiYLFFFNGVWVLIP 
GLLLWQSWLEIiKKMHQKETSSVKKFQ 


8359 


22260 


A 


8420 


3 


97 


FWSSLCNGLIAAQLIiFYWNAKPPHKQK 
KAQ 


8360 


22261 


A 


8421 


J 


279 


GCG IADLAMS AI FNFQSLLTD I LLL I CT 
CAYI PSLAPNLLDRNKTCLLGI FWKCAT 
IVERKS P YVAVCCILMAFS ILLIQKLVK 
MPQCICHNI 


8361 


22262 


A 


8422 


1 


228 


RHEVF I ELNHIKKCNTVRGVFVLEEFVP 
EIKEWSHKYKTPMAHEICYSVLCLFSY 
VAAVHSSEEDLRTPPRPVSS 


8362 


22263 • 


A 


8423 


3 


182 


S EDTGEEQ WTAE FXNRGEYE I D IAGYR 
FQAKAKLYPVASLFTQKRRKDDMELSDL 
HGK 


8363 


22264 


A 


8424 


1 


70 


VQFVFDAVTDVI I KNNLKDCGLF 


8364 


22265 


A 


8425 


2 


151 


ETTAS SCTPASLESRRCCAPCRMPRTGF 
FGSSPLWRPSGSRSLKPGFQQ 


8365 


22266 


A 


8426 


1 


187 


RGRVGPGGERLVPGVPGAEAQQPAGDGV 
RAGPLQARP PAPVGVSQGRCQAAGAAAG 
PPRPDG 


8366 


22267 


A 


8427 


520 


423 


GKYQLQSQENFEAFMKAIGECWTERQSC 
GSHQ 


8367 


22268 


A 


8428 


1 


627 


GTSGTRGVTGYFTFTLYLETPKPS ISSS 
NLNPREAMETVILTCDPETPDTSYQWWM 
NGQSLPMTHRPQLSETNRTLFLFGVTKY 
TAGPYECE I RNSGS ASRSDPVTLNLLHG 
PDLPRI HPS YTNYRSGDNLYLSCFANSN 
PPAQYSWTINGKFQQSGQNLFI PQITTK 
HSGLYVCSVRNSATGEESSTSLTVKVSA 
STRIGLLPLLNPT 


8368 


22269 


A 


8429 


3 


390 


ILGCNIIiRVEYSLVICVSVPGSKKVILD 
LPLVIGSRSGLSSRTSSMASRTSSEMSW 
VDLNIPDTPEAPPCYMDVTPEDHRLESP 
TTPLLDDMDGSQDSPIFMYAPEFKFMPP 
PTYTEVDPCILNNNVQ 


8369 


22270 


A 


8430 


3 


208 


NGTHVI I LLCLKTCGTWNVANDM I VAS 
NLGTGVPNQTPVSSGDLLIRSNGLLI PG 
TCEITRLYTISE 


8370 


22271 


A 


8431 


3 


563 


LPTSRVDPRVRLDRMKKDQEEEEDQGPP 
CPRLSRELPEWEPEDLQDSLDRWYSTP 
FS YPELPDS CQPYGSCFYSLEEEHVGFS 
LDVDEIBKYQEGEEDOKPPCPRLNEVLM 
EAEEPEVLQDSLDRCYSTTSTYFQLHAS 
FQQYRSAFYS FEEQDVSLAU3VDNRFFT 
LTVTRHHLAFQMGVIFPH 


8371 


22272 


A 


8432 


2 


119 


DGDNILVTTFI YIKS VTELNGDI ITNAS 
WRCSERVAPSA 


8372 


22273 


A 


8433 


192 


1285 


AGVLS I IEETDSEGIiQTKWENQTYDER 
LE INDS EEVAS I YTPTPRHQGLPRS AHL 
PNKAMADNSSDECEEENNKEKKKTSQLT 
PQRGFSENEDDDDDDDDSSETDSDSDDD 
DEEHGAPIiEGAYDPADYEHLPVSAE IKE 
LFQVNQLRCVPTDVLDHKLKPFI PDFI P 
AVGDIDAFLKVPRPDGKPDNLGLLVLDE 
PSTKQSDPTVLSLWLTENSKQHNITQHM 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Oyrosine, 
X=Un known, *=Stop codon, /=possibte 
nucleotide deletion, \=possible 
nucleotide insertion 














KVKSLEDAEKNPKAIDTWIESISELHRS 
KPPATVHYTRPMPD IDTLMQEWS PEFEE 
LLGKVSLPTAB I DCSLAEYI DMI CAILD 
I PVYKSR IQSIiHLLFSL YSEFKNSQHFK 
ALAEGKKAFTPSSNSTSQAGDMETLTFS 


OJ / J 


22274 


A 


8434 


3 


251 


TLQADHFN TKXiS CGDAAQTJL WARTGYLG 
FVRRTELTAATGERHDALYVVGALDETL 
ELRGLRYHPIDIETSVSRIHRSIAEW 




22275 


A 


8435 


1 


225 


QTFNLEGSQIYEDS I VLQSVFKSARQKI 
AIEEESEDESNEEEDEDDEEYHEWKRYD 
RLGENMCLNCIWNGGGYIV 


8375 


22276 


A 


8436 


95 


340 


CGCG1AGLAMS AI FNFQSLLTVI LLLI C 
TCAY IRS LAPS LLDRNKTGLLGIFWKCA 
RIGERKS P YVAVCC IVMAFS I LFIQ 


oJ/6 


22277 


A 


8437 


1 


317 


GPKPLVRTSREPGKDVTTSGYSSVSTAC 
PTSSVDGGLGALPQPTSVLSLDSDSHTQ 
P CHHQARKSCLQWRPPSPPESTVSQQQV 
KRINLCIHSEEEDMNIiGLVRL 


8377 


22278 


A 


8438 


80 


213 


PHLS FNAG ITT I KVN I RNANS LGGGFHC 
WTCDARRRGTLQSYLD 


8378 


22279 


A 


8439 


3 


767 


HEDNIKQLKEMKFTYLINYIQDEINTIF 
NDYI PYVFKLLKENLCUJLHKFNEFIQN 
ELQEASQEIiQQIHQYIMALREEYFDPS I 
VGWTVKYYELEEKIVSLIKNIjLVAIjKDF 
HSEYIVSASNFTSQIiSSQVBQFIiHRNIQ 
EYLS I LTDPDGKGNEKIAELS ATAQE I I 
KSQAI ATKKI I S D YHQQFR YKLQDFSDQ 
LSDYYEKFIAESKRLIDLSIQNYHTFLI 
YITELLKKLQSTTVMOTYMKLAPGELTI 
IL 


8379 


22280 


A 


8440 


103 


354 


NGCECDFLFLFLFYFFETESRSVAQAGV 
QWHCLGS PQPS S PRFKRFSCLS TLGGRG 
GWIMMSGVQDQPGQHGETPFLMQAGLKT 


8380 


22281 


A 


8441 


3 


160 


KLYPLKI VFGMNGR VWVKGKTVQQTLI Ii 
ANVLEACELMTLDQRIIjMFIIiAES 


8381 


22282 


A 


8442 


459 


3 


CGGLHPVRASWLLCLPKQAWAMAGAPPP 
AWLPPCSLISDCCASNQQDSVGVGPSEP 
GAGYNIiLVRRFLSLSEKRS IRVRVTRFS 
R CHLS PLS LTRKGNS IiTPCTSRVRQCLA 
LLRLAHGALHPLSCAHCLALPS EMTPVP 
QMENAEITRLL 




LZJjSq 


7V 
t\ 


o443 


2 


It o 

318 


RQGWMTAAijQATIiLNPPINTKS QAVNDR 
AKui vJjuM vjUXor SSJ\£4UXEKAVQShuf\NG 
VDLLMNYMYKGFES PSDNSNAMLLQWHE 


8383 


22284 


A 


8444 


2 


109 


VTPFPMSCDLQGDCACRNPQAQEHSRKD 
lJXvoxorivj 


8384 


22285 


A 


8445 


:i 


69 


LVLNSWPQVIRRPWPPKWGLQV 


8385 


22286 


A 


8446 


1 


163 


PSEKHNIWVGVTQFSRCRLSPLSLTRKG 
NS LIP CAS RVRQCLTHLRIj VHGACTH 


8386 


22287 


A 


8447 


1 


147 


GQAGLELLMSGDRPTSASQSAG ITGVSH 
RARPSS I S FI LELRGS vrkkf 


8387 


22288 


A 


8448 






czj .TwrnTD icnvnp p khopm t y t cnv nin? 

vji-i x rixy x \£i\-Liv yr xr rvs^y *r P1X x Xv-uliV-Jx xxi 
NEIKSRDPIRCRECGYRIMYKKRTKREiV 
VFDAR 


8388 


22289 


A 


8449 


134 


392 


TCPPPPEPPS PLTCA VLVPQTRRWRGLG 
SLFRRACCVALPLQLLLLLFLLLLFLLP 
IREEDRSCTLANNFARSFTLMLRYNGPP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Pbenylalanine, 
G=Glycine, IMTistidine, l=3so!eudne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagfne, P^Proline, 
Q=Glutamine, R— Arginine, b-oenne, 
1=1 nreonine, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














PT 


8389 


22290 


A 


8450 


37 


248 


AKITPIiJjQPGRQEQDSIb NKK.ltiKv.CLi I 
MLPRLASDSSGFKRSSHFSLPTSWDVRH 
SPPPLASVPFLIiMC 


8390 


22291 


A 


8451 


367 


165 


NFFVFLVBTGFCHAGQARLQLLASCDLP 
VLALQS AG I TG V SHCARP 1 S i JjKIJ 1 XK.JL 
SHFVPLKISGK 


8391 


22292 


A 


8452 


2 


282 


GKRMAGGPEMMQIjRLiLXjNKIjY i 1 lb Mr b 
AWDKHLYPDLIMGSAMIiHVDVDTAIiGG 
LKLNPYFLVDFWXEPLGPSLAHSLRyPG 
GDCoSDXWX 


8392 


22293 


A 


8453 


4 


163 


HHDFHAQSLIiANIERIAILVFAACEAAA 
VLSLLVS ISNTYGIiDYIiHNLKLLQC 


8393 


22294 


A 


8454 


374 


197 


TPls PKyS S HJJjuS KCWD x KHK P PHUAKN 
STFDSNHSSLAAEDKLLGYFGLSQWQPG 
YQP 


8394 


22295 


A 


8455 


343 


161 


MGMGP5 V Pb lXilWiiljvCKLJ^PijKKCbi 
R VGVS QsS KxaLS KXurXi PKKjbJMo trl Fv_A 
FWVR 


8395 


22296 


A 


8456 


1 


193 


MGFLHVGQAGLKLPTSGDPPTSASQSAG 
XTGMSHRVU P x X Vlio lV_y 1 .hrio V wldjJj 
. IILLTTVLf 


8396 


22297 


A 


8457 


342 


263 


WIjRPVVPAliWBAlSA!bt»jun 


8397 


22298 


A 


8458 


101 


303 


rARJv X Jj u y rCLirlr r jjJXljJ\Jruui*inj_ujr in txtx. 

MENFK^LFGPGTVAHTCNPSTLGGRGGW 
ITRSADRDHPG 


8398 


22299 


A 


8459 


3 


140 


YXjJVttXivM X V v*X\. l XjK\^lJjUriJairKjjVlJWOfi 

AQRILLPWIiPKVLGLQV 


8399 


22300 


A 


8460 


327 


248 


FPRliVliNo WP JSAXLtFufYI JjV-JV V lAjLAiH. 


8400 


22301 


A 


8461 


327 


79 


SIPFIIAFFIYLSFFFKETGSQSVTHAG 
VPGX 1 JAyLAjl^ijJjoovjL/ i?FUoA2>liV Vv» 

TTCTYPTPILSVFPNMPSFFILLCMLF 


8401 


22302 


A 


8462 


365 


273 


MGFQHVGQ VGXiRWPQVXtJliP W PIjKVIajIj 
QA 


8402 


22303 


A 


8463 


375 


217 


\jAL7liJ\Jjij 1 biJlJijPAoAo\J^>^ J -L x vjvorlrl 

AQPkHHFFKTkFIYIFFSSPSSVLS 


8403 


22304 


A 


8464 


238 


354 


RGKCFVQAFLYKECSGGQARWL.TPVIPE 
XiVVfiAa VCfKWX 1 


8404 


22305 


A 


8465 


367 


248 


FQMRIHKWIiIDLHSPSEIKQITS ISIEP 
GVEVEVTIADI 


8405 


22306 


A 


8466 


341 


153 


NMILGWAQWVTPVIPAIGEAHWGGIoLYP 

Kooivi'AWAl W ixz-ir 1 nJvAiVIX J. X JL X JL X J. 

YIYERY 


8406 


22307 


A 


8467 


368 


101 


VtSMor KH v VtUAUIjxSLiIi 1 ooLTrJerAoVoyo 

AGSSTSLPTADLLTPGWVGVYGLVDKNP 
VTL LAMQRIi PAVLEKKS VARKNS ENKI V 


8407 


22308 


A 


8468 


362 


248 


HVGQAGLALIAS GDP PTSASQSARIAGV 

onKAKr'iSAr'li 


8408 


22309 


A 


8469 


384 


196 


SARIiGLPKPWDxT^PPCPADDDlLILMV 
GILETFSLIFTATOESRNYCVYFKMRNK 
HIISKTS 


8409 


22310 


A 


8470 


386 


300 


LAMIaPRLTONSWSQVVI^PLPPNMLGLQ 
V 


8410 


22311 


A 


8471 


380 


178 


PKKNKFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFMYIYTF 
ATVYOTLAC^NKFlXK4ERVTRia^ 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acia 

residue of 

pepiiae 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
fling to 
last amino 
acitt 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteioe, D=Aspartic Acid, 
E=dutaraic Acid, F=Pheny (alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leu cine, M=Methio nine, 
N=Asparagine, P=Proline, 
V^vjiutamine, K— Arginine, o = oenne, 
T=Threonine, V=Valine, 
w— 1 tryptophan, 1— 1 yrosine, 
X=Unknown, *«Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


0*rl 1 


XXJ 1 -6 


A 


04 /z 


1 




QS P I FFFLFFFFEKQ I LAQAGG 


8412 


22313 


A 


8473 


3 


162 


GLYHVGQAGIiKPLTSGDPLASASQSVRI 


8413 


22314 


A 


8474 


266 


107 


LCLFLIEMRFPYWQAGVKLLSSSHLPS 
JbAbUoLJKXTG VSHL.FW PEVFKRGI F 


8414 


22315 


A 


8475 


361 


169 


RLPGPSNPPISASLAAdlTGARHHAQLl " 
FVFLVEMGLCQAGKHNFEKYSRVLIYS P 
Ivor IHr J_ 


8415 


22316 


A 


8476 


I 


154 


KKIiIPTLMDHGAGFKSSVKEITTAWEM 
ARE VELEVE PEDGNELLQSRGKT 


8416 


22317 


A 


8477 


3 


193 


SHYQSLISSNHGHKHCGRPQGPLPRKTI 
DLCSLDYQLTFPPLiLTHDPVKSPSVRNT 
QELSLIP 


o*» 1 / 


115 10 


A 


8478 


408 


103 


XXji>KbTL*bS KVEGSGVI TAHCNIiELPGS 
SDPLASaSQVaETTC34HHHVWIjIFLGIF 
JjVKMASHYVAQDDLKLIjGSTDP pvs asq 

svgitgishhawaticyk 


8418 


22319 


•A 


8479 


59 


166 


NVNTCLCLHTHTHTHTHTHTHTHNYFKE 

rnnTtlrtVT 

XPIHQYI 


8419 


22320 


A 


8480 


394 


253 


FT.T.TiT.LESQFCHVAQAGLKLLSSCDLPA 
LASQSAEISGVSHCAQPLA 


o**ZU 


lliZX 




8481 


379 


184 


cfslpkcwdfrgvpispgdfwnfii klg 
vcpfcqggfbllapndvaasgfqnagis 
ginfwawal 




I lb 11 


A 


o4o2 


367 


170 


CGJjS sgijggrsgreserkkerererdre 

ivKKiiGEGERKiCERERESVSSYK^ 
SGPHPYSLI 


8422 


22323 


A 


8483 


324 


136 


qrssclslpsswdyrrmpphlanfcifc 
rdgvlpccpswsqmttsflslnfliclv 


8423 


22324 


A 


8484 


1 


57 


gerrygtciyqgrlwafcc 


8424 


22325 


A 


8485 


35 


339 


gwllrfbwqwkngnnlnvslidgwlsy 
r/tlgyngilnchmyiiitegdsqkkkkkk 
kkkkkkkkkkkidpagnskiynrkifkt 
prgffkwalgpllilff 














8425 


22326 


A 


8486 


330 


66 


rpgvsrfnpwplifflrrvflcrpgwna 

WQJb 1*1/1 KX SAS Q VQA I FLGGGGCS B PR 
SCTCTPAWATTAKLCFKKKNGKNFTDIM 
FSK 




ZZJZ / 




0/4 0-7 

o4o/ 


i 
i 


zuy 


k^oJUijKoWUAUAlrFt'rli^ANr r XiMrNr t_R 
VGVLIiCCPGLASSDPPPLVSQSARITGM 
SHCTWTI FLSFK 


8427 


22328 


A 


8488 


3 


216 


GQS ITVSCTATNSDVGSSNIiVSWYQSSN 

T.^7CtJVr\r\lTD/!!/^7i , DVT.TT'VT?nXTtrOT ooov 

IjVoWiyyiii^jyArlUjiXYBDNKRIjSSBK 
KKKKKKKKKKKKKK 


0*tZO 


ZxJzV 


7k 

rl 


Oyf on 


izo 


62 


sgaisahcklrfpgssnspasggggrse 

DVCTJTJTPT'T. AWTTTD AVT ITT MTMVTirnTin? 
fi\oxlrtr li_i/Wt ± lrCAAl inLtWlVlNiVlJxl VKE 

FLSS 




ZZJjU 


n 
t\ 




L 




oixftvj iuK l vj^- r XA1 K xvjC v v JjKAKocj VD 

ilgivcqlrldrwvcgckqgqqr 


8430 


22331 


A 


8491 


399 


254 


KNNKRNKTINNKITNKLI IKLKNISFLY 
FFFFFFYFFFFFFFFFFFFF 


8431 


22332 


A 


8492 


2 


130 


frrvgqaglelltsgdlpasasqgagit 
gvshraqptflksc 


8432 


22333 


A 


8493 


3 


94 


HLPAEFTPAVHASLDKFIiaSVSTVLTSK 
YR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Pbenylaianine, 
G=Glycine, H=Histidine, Msoleudne, 
K-Lysine, l^Leucine, M=Methionine, 
N=Asparagine, Ps=Pro!ine, 
\£— vjriuiaraine, is.— Argiiiiuc, o — oerine, 

T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucieouae aeieuon, v= possiuie 
nucleotide insertion 


8433 


22334 


A 


8494 


327 


JV 


GITGMSHCTWPLYP xTGIXSSRSFHS IEE 
HCWYQLAPFPSCRNPFVIVPFFVHRFYL 
TF 


8434 


22335 


A 


8495 


97 


219 


PGTVAHACNPGVLGGQGGWMTSGQEFQT 


8435 


22336 


A 


8496 


3 


191 


GIRRGEIIiEVIEFTSNEEMLCRDPKGKY 
GYVPPTAl^PIJ?TEVYDDVDFCDPLENO 
PLPLGR 


8436 


22337 


A 


8497 


144 




c; VPVRi? T T.F^TVTKTOTATVS FTATWVF.li 

Oil. luulUtOi V IXUwimi ViJi i"l n v DU 

EAI ILSELTQEQKTKYHMFSLVSGS 


8437 


22338 


A 


8498 


2 


90 


LTILVLAMWKVGFFKRNRPPLEEDDEEG 
i? 


8438 


22339 


A 


8499 


1 


156 


PSSQDYMHAPPHLANFLNKFFVKWCLAM 
liPRIiMSNWPQTILLSPPKALGLHV 


8439 


22340 


A 


8500 


67 


319 


KLMS FGPTTiLSARCVSTGQI GMNELNLN 
LI ILTLEHKXKYCMCSLISGSLRMTTQG 


8440 


22341 


A 


8501 


3 


299 


IS IVjotlL V LJ\£jtj v y W 21 X OrtXlV^o 111/ JUJLAjO O 

DPPASASRWGITSAHHPIiLWSISSLPP 
PPLIiPPIiRLLCSPC 


8441 


22342 


A 


8502 


56 


142 


NNSAKKKKK3CKKKKKKKKKKKKKKKKGG 

JUU7 


8442 


22343 


A 


OCA'S 

8503 


486 


1A Q 


TfTPtfPV2VOA<TT/RT.T.St QCJMPPAT JVQONGG 
j \TP rl n v r\\gr\T>TL>r7 iff f*iri~^~Pr it rmifwyMUw 

ITGVSHHAQPAEYNFKK 


8443 


22344 


A 


8504 


1 CO 

158 


4U3 


ivXf rr riiii vur vnyuu v yvj nuni uur iuj 
FELKRFSCLSLPSSWDYKRAQPNPNDFF 
KRQMAFDPKIQRISIiGSKKKKNHPIE 


8444 


22345 


A 


8505 


3 


78 


DWHIAYVLLYGPRRVE IMEEES EQ 


8445 


22346 


A 


8506 


164 


291 


I Kr W Jj r V V W rv. V 1A- V v- V AV c Jtrtrnv.J. 

LFFIWLFFFFFFF 


8446 


22347 


A 


8507 


372 


69 


CAPLPPLKFFFPPRGFIFRGGGSPIFPP 
PKKGVLPKNPPKGFFPPPPFLKPAPGGA 

WMVDT) KY2/*JIfT FFQ PPPfJTTWZi PPnTJFT, JfTJ 
i*I>i X irirP UV7A.J. IT jp O irxrIClJlU'l*iIrIrVJVffXjX*Ajr 

APPFFFFFFFFFFLFF 


8447 


22348 


A 


8508 


282 


1 


S FQHVAQAGLKLLGSRDPOTLAAQSAGI 
TGVSHHARPVLYFSSVLFSMEFLSTMSA 
VIIiPCICSFCVSLAVPSSCIIiQQPIiPPT 


8448 


22349 


A 


8509 


400 


45 


TPPVSSLFFLPLKEIIFFCPPFRQSLPP 
PFFFFPPPLFFFFFFFFFFFFFFFFFFF 
FFPFFFFFFFFFFFFFPFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFKYLSLPFTL 
SKEKNK 


8449 


22350 


A 


8510 


241 


419 


TPIXK3WEKKKKKKKXK3CKKK1^^ 

KTCKKNKKKKXKGGGFLKKPLGGANFIGA 

GNN 


8450 


22351 


A 


8511 


317 


28 


QSLSPSPQTVYFIYLIIYAVFFTTLTII 
T . T F^P *5 FPMFT .fTFTiT iP fYFPAPFYAFT • F 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFSQPGAQTTAGRAGV 






A 






99? 


MCCY IMSLLKVICTI IKEFSTiCKKKKKK 
KGGVPL 














8452 


22353 


A 


8513 


1247 


910 


QAQIiGDIGTSCYTKSGMIIjC!RNDYIRLF 
GNSGAAGACGQS I PASELVMRAQGNVYH 
LKGFTCSTCRNRLVPGDRFHYINGSLFC 
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SEQH) 
NO: of 
nucleotide 
sequence 


SEQU> 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQH> 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

ammo acio 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eudne, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=ProIine, 
Q=Glu<amine, R=Arginine, S^Serine, 
i— mreonine, v— valine, j 
W=Tryptophan, Y=Tyrosine, 
A— unknown, — otop codon, /^possible 
uucieuiiue ueieiion, \— pussioie 
nucleotide insertion 














EHDRPTAL.INGHLNSLQSNPLLPDQKVC 


8453 




A 


Stf Id 


oo 




PFKKNPGGAQ I FPGGEKKI FFLKGGL 


8454 






OJ 1 J 


OJ 


z 


f2CJ<"2Waf"\WT TDT^T DT7T rifCBPR^eD^Ao 

^WOvtAUWiJl FVXF\MjWlfiAKAljV3SRGQE 


8455 


22356 


A 


8516 


244 


423 


VFAVFNFLGWVMDMLINIAWI ILQCAC 
ISNHHWHLE YIQFLPANYTS I KLGKKP 
SKSV 






a 


5517 


3 


114 


KINGEDKQKI LDKCNEI INWiiDKNQVCV 
FFFFFFPI 






n 

A 




413 


91 


GGVRGKI SlfRGGGFLKKKKKKLLKKKKK 
WGGGPKEKKKRGGFFFFFLLNFFFWGGV 
KKKKKKKKKKKKKKKKKKKKKKKKKRRK 
GRKBGREGRRKEGEGRQE INMTA 


OtJO 




1\ 
rx 


Of 1 Q 


1 


163 


S RL PGRS RRRS RGRRRRRRRRRRRRRRR 

cpT/T. ttpdcjtv^mt.t^t xrn/VTTBT«ma» 
oKViJlr rTiUUfXLkUU£4± V l±ir»UVDAA 


8459 


22360 


A 


8520 


191 


21 


RFTTHVDAWATVTVFL I ES VFLHVGQAC 


8460 


22361 


A 


8521 


2 


144 


AAATSQAWWCMPWPATWBAEVGASLKP 
GRLQLPLTMIALVHLSYAI 






^ 




OA 


o 
O 


V U* I'UfT BIT T H T5«l MitT T T T^T€»Y>T TT T "T -r 

lUrlFlrijPTXjXARTTTtTiTiPISPIjILIIIi 


8462 


22363 


A 


8523 


305 


146 


VFRIHSCGIRGSVDVKITDGLLVIRRIE 
NVPPGPNNKNKNPYAIFQSSSIESQ 






A 


8524 


176 


2 


LAWPFLFLPKCWDYRHEPLQPASLTTFV 
REWDKIGWAWWLTP VI PALWEAEVGRS Q 
GQ 






A 


8525 


2 


200 


CLIPSS WDYRHMP PRPANFCI FFFFLGD 
HVSQVGLKLLTSKDPLAWTSQRAGITGF 
SHHTKPQGFF 




771£/C 
ZZJOO 


A 




2 


Oil 

217 


SFSLLFPPSFSIjUjPPIiASIiLLPPXiPSF 
SLLFPPSFSLLSPPSFSLLLPPSFSIIJi 
hftfshi1yppspsfi 


8466 


22367 


A 


8527 


2 


186 


IRGLRPLEKRGIRVGVSRFSRYHIiSRLS 
FARKGNSPTPCASRVRRCPSLLHGLHPL 
SDKPQ 


8467 


22368 


A 


8528 


3 


153 


CLGFLSGWDCRC I PPCLANLKKLFVEMG 
S IjFWPQ W I IiPP WPLKVLGLQV 


8468 


22369 


A 


8529 


2 


388 


RTAVHTF ILVLQVCVCRWHLSLAALTPA 
HLVCSSHRIIiSyLSIRGSWDHRCMTTTP 
GLVI b 1 r rVEMKTRYAVHAGHI LLGSS Y 
PASAYQSAEITGVSHWAWPTDSLWQKPH 
MVIWCISCFLNVLIGTM 


8460 


07^7ft 


I\ 


ODJU 




64 


1K-L0 iSsXSKjNE KSjfWVSSFWIFFFFVFFF 
S FFFVFFCFFMFFLKQYLKFI INI 1 1 IL 
TQALYICTTSKICYS FLYFR 


8470 


77171 
ZZJ / 1 


7V 




JL 




orr j~*t DCCUnvDPn'T Tir*T tmltp^t vjilt f>i*\^i 
oulxaiiFoo WLI xKV> 1 JUPLJjARIr CIFNRDG 

VSLCCQGWSQTPGSQVIDEAVYSIYAYY 


8471 


22372 


A 


8532 


2 


201 


FHHVGQTGLELLS S GDLPAS ASQCAG I T 
W±*o JLJUiKWNr KNIiSAVF IAWVI 
EGNYTNKTCI 


8477 


~" 77T73 
ZZJ / j 


A 




no 


200 


i r It Jf i^KWV^ij ITUjPRItVSNS WAQAICI* 
LQPPKVLELWA 


8473 


22374 


A 


8534 


160 


297 


KTETLALVAHACNPSTLGGIGRQVSSAH 
EFKTILGNMTKPILYTKL 


8474 


22375 


A 


8535 


333 


228 


PNRGHYIAIVKSHDFWLLFDDDIVEVSS 
FLISYF 


8475 


22376 


A 


8536. 


3 


325 


PPSSLLPCSWILDCCASNERDSVGVGPS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
carrespon 
uing u> 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D^Aspartic Acid, 
E=Glutamic Acid, F=Pnenylalanine, 
G=GIycine, HNBistidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

^ — WlUiaiulllc, XV — ATgllllUC, O— OCl UlCj 

T=Threonine, V=Valine, 
W=*Tryptophan, Y=Tyrosine, 

Y — ITnlrnnum *=Stnn rnHnn / riR'CSl hip 

A. — UlllVllUnll, Olv|J tUUUU)/ — JJU30IUIC 

nucleotide deletion, \=possible 
nucleotide insertion 














FPGAGYMT.LVCCFLSLLE KRS IRVGVTR 
FSRCRLSPLSLTRKGNSLTPCASQVRRC 


8476 


22377 


A 


8537 


216 


5 


KLGFLVNLVBFQVKFFFLFFFLEI ILIE 
MRSHYVAQAGLEIjIiGSGDLPASASQSAE 
ITGMSHHALPNYSF 


8477 


22378 


A 


8538 


136 


22 


RPRRRLVLNPWPQAILLPWPPKVLGSQV 
WATCGRRPAS 


8478 


22379 


A 


8539 


136 


22 


rCXrxvtvKXl w IiiMr Ff c \lr\X. 1J I-itr *i IT ir £\. v .LJV-) V 

WATCGRREAS 


8479 


22380 


A 


8540 


8 


319 


YIYMSMYMLIYVCIYVSIYICVCIYINS 
lorn x inn iniji ivxvi i*iv» v x jiuvuv a 
TYTtfQ TSFOTHTTPIKHSDVHTVSTTRC 
NLGGRGFCHTWPLPHIiAGW 


8480 


22381 


A 


8541 


331 


156 


SVNGLLSLPACLPVCLPACLPS FLPSFL 
PSFLSFFLLLLLLLQKPVNSLIFSQSLS 
IS 


8481 


22382 


A 


8542 




i 
l 


KIGOARWLAPVI PALWEAEAGGSP 


8482 


22383 


A 


8543 


2 


83 


QAGVQ WRDLGS LQAIiP PGFMPFS YLGG 


8483 


22384 


A 


8544 


293 


2 


KTVWHYLVNAPEVEIQAIYSQETCKSMY 

GNKENGK^TLDS1X5FLVRFQSLMRSNC 
PLYPFNKFPILLK 


8484 


22385 


A 


8545 


314 


too 


GAGIAAVSHRGQPVDFKNNI stqiqgrp 
IICNYKTF 


8485 


22386 


A 


8546 


316 


123 


T .WTfll?! .TJVRR AttT *K T PTWS D P PTW ASO 

l_l V X\. i \3U i-lVi VUIUuJiJuXr Ji nJWttl "fWV 

gagiawshrgqpvdfknsistqiqgrp 

IICNYKTF 


8486 


22387 


A 


8547 


3 


247 


GIMATERIANYTGGIYAEYQDTTY1j>JHV 

VoviUjWVjlolAjl £» X Wx VflilO mj J3t*rW v ISA* 

FHPLKVTPRjXKTSWYSSRLNGSVYV 


8487 


22388 


A 


8548 


289 


25 


QRREKFFKNGPPIFFRGPFGFPQIFVSF 
PFFFPKIGVFFVFFFRFSLVLKGDFFSN 














RPCL 


8488 


22389 


A 


8549 


289 


2 


WITK'T HFTT J^KT,T<T ,f!KICPT jDVTPKAOF I K 

iVtS-Tv. J. VjX-i i i j if\in-iM\r iiuv 1* I\f»yC J- tv 

QKKE KL INWQ F I KI KNFRS PKALLRRGK 
GKPGTGRiCE^NANHISDKGPLCRIQKKNS 
RGRAQWLTLW 


8489 


22390 


A 


8550 


290 


187 


RQENPLNLGGGGCSEPRWCHCTAAWVTE 
QDSINK 


8490 


22391 


A 


Off 1 

8551 


293 


mi 


RDEGRG I S YMLP RLVLNS WPQ AI LLP Q P 
PKVLELQA 


8491 


22392 


A 


853Z 




o/ 


MLPLDFFIjGPS LDFCPPFCE SGSGGI kv 
PES TTPRASAFLPPGAANLHHI LQLVGA 
PRVPPGFHHVGQDGLDLVDLVIRLPRPP 
KVLGLQGG 


8492 


22393 


A 


8553 


57 




KWnKWT P WAGGVAHACNLSTLGGRGGKI 
ACAj^FKTILENIVRPCLSKNKWISFS 


8493 


22394 


A 


8554 


326 


2 


KSKRGYSEPLVFSICSARAPRSHSVSQA 

GDYRHDPPPLAYICISVIGNDVEHLFLC 
LLSTCISSVNRSHVYSTLVPIFGP 


8494 


22395 


A 


8555 


372 


148 


YICFLRSSVRQILFTPDCSDCSMWVPCN 
IRLPGSTHWCASASQGAGTTGARHHARL 
IFFVWFFCIFSIFSRENH 


8495 


22396 


A 


8556 


422 


269 


AADQE R LHT YQLNYYHFCKR WGLTT FPR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
Ing to first 
amino aciu 
r Aci H n a f%f 

IC9IUUC VI 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
umg 10 

Irt6l 21 111 1 II U 

acid 

residue of 
peptide 

C A/111 Anr*A 

Sequence 


Amino acid sequence (A=AIa nine 
OCysteine, D=Aspartic Acid, 
E=Glutatnic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysine, L=Leucine, M-Metbionine, 
N=Asparagine, P^Proline, 
v^oiutamine, tv— Argmine, o = oerine, 
j — inrconiDC) v — valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unkno*vn, *«Stop codon, /^possible 
nucleotide deletion, V=possible 
nucicouoe insertion 














LVSSSWVQAILPTWPHKVLRLQA 


8496 


22397 


A 


OJJ / 


ooy 




AfiVTntfPWT C r PT.APPDPD'I?TrDU , GIJT' OT'ri O 

SWDYRCPLPCPANYCIPLVEKEFRHVGK 

AfiTiRT .T .TQfTTlP P A WVGT .T A TTY2t7QTTP 'h. 

HPD 


8497 


22398 


A 


8558 




^S9 


T/TTiOAfiTTAQfrVGTVTT/'SrSMTiirPPCWliW ' 
WPWPATREAEVGGSLEPGRSKLQ 


8498 


22399 


A 




IAS 


910 


HHGWLNL PKS IHL 


8499 


22dfln 


A 


0 JOU 


1 IT 
1 J / 


z 


VI\.IV^Jjr / IKoiilAvi VKx*i-OiroQAIAoPVT 


8500 


22401 


A 


8561 


324 


193 


RSS6NLPASAAQS ARITGVSHRTRPKC I 
T Q JTErWAWMNUT.irG TG 


8501 


22402 


A 


8562 


326 


121 


Hl^THTHTKNHKKHQQPQHTHTYHTQTD 
IYVLQAASQKSIiILISSLAV 


8502 


22403 


A 


O JUJ 


j 




14 W ri'rHTHTOCU' I'U'I'LTDU'PU'I'U'I'T padm 

£i£ixi 1 n 1 n x nvjri x ri i xiKii 1 Jcl X *i 1 JjGAKH 


8503 


22404 


A 


8564 


2 


153 


ARDRHI P PCLTNFS VFWRDGGLAMLPRL 


8504 


22405 


A 


8565 


2 


177 


ARVGFHYVGQAGLGRLTSGDPPASASQS 
AG 


8505 


22406 


A 


ojOO 




010 


2Vf2T?D WV/'2D7Vf" , T.T?T..TnV»TV"»OCT»0 74 rv^C c/~i 

nur KrlVv3K/iijlJE.JjV 1 UlAslrK 1 SAGQS bG 

ITGMSHCAHPEKSTAFILP 


8506 


22407 


A 


OjO / 


/O 




bXi 1 r UKlAaOJj 1 V r ir KUXSNT W P X AVLiPIj 
WPPKVLGLQV 


8507 


22408 


A 


8568 


1 


142 


GTRGFLHVGQAGLELPTSGDPPALASQG 


8508 


22409 


A 


8569 


250 


95 


SASRAPLPiiPPSHLPLRAAGLSPLCPPR 
LVSS YRPHVILTjPLPPKVLGLQV 


8509 




A 


oj/U 






t_f PUffir l^ni FmrTTTTf Tint tfm Ymr t^tt Ttm *fniT^^»^w^ ^ t 

HKtl 1H1HIHI HTH ITiTHGHTHTHTHTIjV 
LVYSLCEHIWRPIVISRARVCCIYWVP 

T7HOnV71T^OOTr7AT^TnaT MTU 


8510 


22411 


A 


8571 


t 


216 


GTRGIXjVGVSHIjVS knthththththth 
YTSNSYCSSKFYLYNH 


8511 


22412 


A 


O J / £. 




ZOo 


r.TMT.APT.VT WTODrtT7TDT1DUDDVMT , r»T 
1j xi*LLi/itcu v Ltd J: Klry v 1 irUlr Wr tr aJIIjCsXjQ 

A 


8512 


22413 


A 


ft ST* 


19 


1 / j 


PWfiT7HPV^T\7A^lT»T?T.T. , PQGnT DTT jicnifn 
iv. nor nr v Vj^i-ioUlLlJlj 1 o oUijJr X LifVoy t\A 

EITGMSHCARPSFFSFFLSFFFWKHL 


8513 


22414 


A 


8574 


1 


163 


GTRTGFHHVGQADLELLTSGDPPVSASQ 


8514 


22415 


A 


8575 


344 


176 


EFLVETGFRHIGQAGFELLTLGDQSTSA 
SONTE TTGVSTfP APttTt AT T TMf^f^MOAPW 


8515 


22416 


A 


8576 


2 


179 


ARGILCHSGTFKI^SRPSMVAHTCNPST 
QIP 


8516 


22417 


A 


8577 


1 


100 


GTRGFTMLPRWSDSWAQVILLPWPPKV 
TjELQM 


8517 


22418 


A 


RS7R 




1 i*f 


WRRffRTTjQTVT.TTVT GTVT.GTVT CTVT t 

norma x uj i. x ig i ixijoixijoxxiiolxjui 
YLPTITRI 


8518 


22419 


A 


8579 


1 


264 


GTRPCRIELYRWESIAKAQETSGEEIS 
KFx^PNCmNGFYHSRQCETSMDGEAaL 
CWRVYPWNGKRI PGSPEITGDPNCHIYF 
NVQN 


8519 


22420 


A 


8580' 


1 


245 


GTSGTSGILGFYLEREDRLLQIRTDTIT 
I^HGYSTFSLNRCDSRHHMNRMFQMLYQ 
SPPLGYIQLSHEVLAFGISYIYMTE 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Giycine, H^Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N-Asparagine, P^^Voline, 
Q=Glutamine, R=Arginine, S=Serine, 
i— i nreomne, v— vannc, 
W=Tryptophan, Y=Tyrosine, 
X=Unknovrn, *=Stop codon, /^possible 
nucieouue aeieuon, v— pt/ssii/ie 
nucleotide insertion 


8520 


22421 


A 


8581 


1531 


1457 


QARWETIARRLWDVSCDLLGLPID 


8521 


22422 


A 


8582 


191 


2 


TTTPVDPTVPT1X COVTl TP/*'!? VIHi'TWO D 

11 AoiroP\LFAFobrixAlr*-r iKxUxoK 
YFIKVELWEAETGGSRGQEIETILANTV 
KPRLSRA 


8522 


22423 


A 


8584 


2 


212 


ARAu 1 1 r ivci I tSJb Pii ftJJ 1 V-auKrlr XJ\{jur 
TIPKTWTQSKCPSWIDWIKKMYRQGPIL 
SPMLECSGBVSGFK 


8523 


22424 


A 


8585 


2 


192 


ATl/-'HTPriAPCCtnn\r , TTr , 7MJtnJAOT 

AKv3rWi>PAoobHVAvj± l\3AtlriiiAKij ±t V J? 
FLFVRPFFSFLKTGFKLVAHSNFQPLTL 
r AfJUrlr ft. 


8524 


22425 


A 


8586 


277 


66 


PLDFRLGHKRRLPFPKKKKKESKFVCVC 
ILLSWPPKILALQV 


8525 


22426 


A 


8587 


395 


286 


PHLGLPKCWDYRREPPCLAEVVLKKYVW 
VTYFSFRR 


8526 


22427 


A 


8588 


307 


177 


RYPSTINYS I LNRDKIMP KLDRMVYKAR 

irMX V K.V-Jjf VW ffJVV LAsLi 


8527 


22428 


A 


8589 


167 


349 


RNLRELGEFFFFFFWFFFFFFFFVGKFS 

\rOT T \TCKTT TTT.WPrc/V^Tl^T.QPVMPPKTJT? 
V Jp XjJj VkaJN Lir All: r oVa^a J.J7 VjJjoK. v PlV-rrUif 

RCFS 


8528 


22429 


A 


8590 


375 


240 


LLSRFSWFCFVFRQGLTMLPRIiVLNCWA 

K(jlJjiilrWoJrft.v JjoijyH 


8529 


22430 


A 


8591 


347 


215 


ETGFPHTOQAGLKLLASSDPPASASQSA 

nTTY"* MODD ZMTDC a TTP T7 


8530 


22431 


A 


8592 


340 


114 


HRTAHCSLYLSGSSDPATSSSQVTGTTG 

tmnxmjT t ."cttvcpp a mrc t? nnrvifQ twit 
MJyn l nlai-u? ft. X J? LKnK vo iujn *vx rvo ahjy 

FYFLNTQTTPSEYGKASPG 


8531 


22432 


A 


8593 


86 


253 






KKKNLHPRT^GGFEEn^FFPKXK5KKLF 


8532 


22433 


A 


8594 


3 


135 


TGVSHRAWPPVLFFF 


8533 


22434 


A 


8595 


2 


274 


PRVRTISLLGILVYRSHLISSLLCLEGI 
rMIiFIIATLITLNTHSLIiANIVPIAIL 

TrCTN 7\ f*t< TV ?\T //*<T TV T .T tTr T OMTVP T .T"YV"\ 7T-T[\T 

ItNIiLQC 


8534 


22435 


A 


8596 


3 


316 


O J. Ir X o W ftJSij J. Jr UriAJr Jry X y ^loKJaiMl/ 

F^ALSLHLNTHTHTHTHTHTHE IQHLPP 
NLTVRHCILSRDTWLATASK 


8535 


22436 


A 


8597 


385 


291 


KAPPLFFFFFFFFFFFFFFFFFFFFKPV 


8536 


22437 


A 


8598 


275 


369 


GVQVLKLLTSGDLPALASQSAEITGVSH 
PAP 


8537 


22438 


A 


8599 


98 


344 


VHYEEFMCVCVCVCVCVCVCWPRVRGVD 
MNPVEHPFGGGTHQHIGKPCTIRRNASA 
GRKVPJJtfAALWTVRrGCTrKTVTIiRKT 


8538 


22439 


A 


8600 


328 


195 


THVPPSLAKFVFLVKMGFLHIGQAGLDL 

QTQfJDPPATA^WVPKR 

O J- uVjLfr JT/U-irtOIl v 17 f\j\. 


8539 


22440 


A 


8601 


2 


214 


HVDVVMGTFAILSEMWDMLHVDPENLR 
LLDSVLNCELANHFGTDFTPPVQAAYQK 
VVADVANALAHMYH 












181 


GLRHENRLNSGGGGCSEPKLCHCTPAWA 
TE 


8541 


22442 


A 


8603 


340 


57 


KEGHQEMEQAEIX3GHGRKAPGTS PWTLG 
ELPl^ARGHTGLFVSYRGCRTQFRFKKV 
LFGGDRNRGLAMLPRIjVIiNSWPQEILLP 
QPSKVQGLQK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQfD 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 

3 en ii pn re 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

1 coiu uc %t§ 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F^Phenylalan ine, 
G=GIycine, H=HisrJdine, I=Isoleudne, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
>< v»iuMiiiiiiitJj xv— Arginine, Ocrine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
/v cuKnuvTii, — "Oiop cooon, /^possiDie 
nucleotide deletion, \=possible 
nucleotide insertion 


8542 


22443 


A 


8604 


330 


132 


TGFLHVGQSGLKLLTSGDLPTSAPQNAG " 

TTfVlCI'UPTDDPQT PUQVTVT CV^OOOA 
X J.V7V»anl»iKxrf oLrno ixiVxjr xtjv_£>S£jy 

VSLSIItARM 


8543 


22444 


A 


8605 


546 


279 


RPGPTVSPRLECICMIMAHCSLDFPGSG 

\7QDTQ2lGO , \7TM? r T* , rCT , lTLJlJ3VTflT Tt7VeeiJT« 

vorioiu>KVAKi lollinnA.WIjJLrxo&ViL 
TGSHHVAQAGLELLGSSNPP I SASQSAG 
ITAL 


8544 


22445 


A 


8606 


3 


249 


NRFNLGSGGCSBPRSHHCTPSWATBQET 

P T TVT i P V CP P PVQ WrMT A/V PVD T .XTI /T T "D 
x v» j. iw untw o iVTVjououi luunu 


8545 


22446 


A 


8607 


159 


324 


VIQxTiTNIlxHIiHLCEVLVFWFLPYFFLK 
fCTYTY* T ROMP WT.TPVT D AT.tJP IVT? C 


8546 


22447 


A 


8608 


334 


179 


TGFPHVGQSGLKLLTSGDLPTSGSQSAG 
ITGVSHCTWLEVT YPT.NT <? T YP<? 

ju luvonWiini iiti v x x r i_iii io X X ro 


8547 


22448 


A 


8609 


278 


202 


QQPSTxxRQDPPPAIOvxxHLSEGSDGH 


8548 


22449 


A 


8610 


236 


66 


AASTFFLQKIKPJiLIjSGTVAHACNPSTIj 

OVJIWj X IV X OXvV7£v~XL r JV X o J-iVjn £T1 V iCirSs. w I aJv 


8549 


22450 


A 


8611 


29 


177 


GKQVMALHCFIFFHFFFRRSLAMLPRLV 

QWQWWiPTT.DDttTPT WITT T?T.T*A 


8550 


22451 


A 


8612 


1 


215 


HTSRTLFVHlxRLKNFNKYLIEWGIiS^ILP 

PTiVT.M'^W2iO7i'\/T.P0'LirDCTinrT y TTVMeiir' 
kjj vxiTJ on AS^nVlj r* rnroiV VlAsJ. 1 oCloilL. 

TWLYxxHLSTDLLKLC 


8551 


22452 


A 


8613 


3 


91 


LRHGNCIxDPGEGGCSEPRSCHCTPAWVT 
E 


8552 








I 


I Of 


f IxbiiljvatlljViJRbljrHHlNNIKPSFTRE 
NTl^FIHLSPILSTRKQIiAIQHLSRLE 


8553 






oOi J 






J? V- J. Li V Ji 1 torxiv* V AtiAGJbQIjljo 5GNPPAS 
ASQSVG ITGMSHCTSPNLSLLTQSKS FC 
LLIQNHNPYAFINSQLTFS ILFLLS YFI 
PYTjQT.PT.T.P 


8554 


22455 


A 


8616 


282 


182 


GQAWWLTPVI PALWEAEVGGLLEPRSSR 
PAWST 


8555 


ooa 




5J£1 7 
OOI / 




7/V7 


owyi livJvx jJWxfrr vciMKoxlx VAv/AGIiKF 

LGSSNPPTSASQSASITGISHYVWPISP 
g 


8556 


22457 


A 


8618 


705 


483 


DKSFALVARAGVQWHHLGSLQPLPPGFK 
QFS YLS LPSS WDYRHAPPRPANFCILSR 
D(3VS PCW PRMQPT PDT .P 


8557 


22458 


A 


8619 


359 


220 


RQSLTMLPRLVSNS WAQAI LP PWLSRLL 
GIjQAlxAPIPGKS YI^EKTP 


8558 


22459 


A 


8620 


3 


186 


HEVS WVKRKQDE W I EFDDDTVS I VAPED 
IIxRLSAGGIKHIAYVTiLYGPRRVQILEE 
ESEQ 


8559 


22460 


A 


8621 


1 


116 


GTSFCKDGGLTVLAQLGSSFWPQRILPP 
WPPKWELQA 


8560 


22461 


A 






ioo 


MYQ TVRMC5 1? TRNfxWA fix .PT .T . a CP nY"G RT "a 
SF^AGITCMSLRYRTPPQPinJATPfn'.TjT. 
SLQS 


8561 


22462 


A 


8623 


0(10 


77 


YI LGFYFM P VI C YOER KTCTiK TTMWR UTT T 
MSFVGIC^LGAIIX^KLMQEQKTKYHI 
FSQV 


8562 


22463 


A 


8624 


387 


315 


SNLGGGGCSELRLCHCTPAWATE 


8563 


22464 


A 


8625 


2 


135 


xTO>LC5LVY1xLTFPPUjSHDPAKS PSVR 
NTQELSIIvKKKKGRPF 


8564 


22465 


A 


8626 


3 


232 


HASAFEEPVYIKSRQKRKESNPPKLVSS 
QPHGLKKKXKKKJCKKKKKRGRGLKKKKK 
KHNKKFKKGQKKYNFGRGAF 
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SEQ H> 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino add sequence (A=A(anine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Hsoleucine, 
K=Lysine, L=Leurine, ^Methionine, 
N=Asparagine, P=Pro!ine, 
Vjpvviutamine, k— Arguune, a— oerine, 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \— possible 
nucleotide insertion 


8565 


22466 


A 


8627 


97 


373 


QHGPLKKKKKKKKKKKKKGGPLLKKPLG 

KNLFLGGKKNGK^PKKLNPLGEKKNFK 
GKKGKNPP 


8566 


22467 


A 


8628 


2 


188 


PKLVSSQPHGIiHDFFKKKKKKGAAVIiKD 
PSGGPS 


8567 


22468 


A 


8629 


287 


68 


RTGRGEGGGGBAKGRPRSARRRGRGSRG 
SKSQTPSRHLPAHT ITN 


8568 


22469 


A 


8630 


364 


136 


VSLIiLGLEYSGTITAHYSLNFIiLFXVET 

r* ourvrTTV nifT vr Ty2CQMT3'DTCnQfYT 1 17T?T 
(jori x V Ay JAxLi JVLilAao olN ir c X &U&\£ X r £» X. 

TGVSHGAQPKVQFWSKNLDT 


8569 


22470 


A 


8631 


355 


70 


LFPPRFFPFFSPLSPLKFFFSPKGFNFF 
REFFPIFSPPKKRVLSKNSPGGFYKPPL 

FFLFFFFFLVI 


8570 


22471 


A 


8632 


2 


266 


AFTISIiLGILVYRSHLISSLLCIiEGIIL 

O T TfT T ATX JHTTT UTHQT.TAMTVDT JVTT.VT? 
A APR A AVCIT ATiT/lS T SNTYGIjDYGHNUI 
LLQC 


8571 


22472 


A 


8633 


1 


186 


LTHHINNIKPSFTRENTLMFIHIiSPIIiL 
PGGGQN 


8572 


22473 


A 


8634 


2 


80 




8573 


22474 


A 


8635 


713 


333 


BGPPPPRSKKKGTQGGGKDPPS PPEPKT 
POJPQGRKTGPPPPGRPYTGPRPPGSGP 
TRGGGSRSSSSNTNAPGEKIFFSKNPGR 
KIFPPRAILVFFSPFPLKNFFFSIjRLLI 
FLGGCAPFFPPPK 


8574 


22475 


A 


8636 


389 


1 


FXFSFLiFljlXr r c iiNXNrr r X lrbbKsr 
FFFLNFNl^LLKHTPPIILFFFSIiSPIFF 

t.'UPUL'P 17 17 171717 13 17 17 17 17 1717R17 W 17 17 1717 1717 1? 

r if rift rrrrr rccFJf rr rrcrrr r c r c r 
FFFFFFFFFFFFFFFFFFFFFGTKKELY 


8575 


22476 


A 


8637 


2 


192 


VRTLGLRTDAIPGRIiYQTTFTATRPGVY 
vnnWYR T PGANHS FMP TVIiEIiIiALTI FE 
IGPEFTL 


8576 


22477 


A 


8638 


1 


87 


PTRPIiE TAVAL IOAYVFTLLVS VYLHDN 
T 


8577 


22478 


A 


8639 


2 


132 


INLA^TLIIFTILILLTILEIAVALIQA 
YGVTLLVSLYLHDNT 


8578 


22479 


A 


8640 


2 


113 


LFFIAEYTNI II INTLTTTI FLGTTYDA 


8579 


22480 


A 


8641 


375 


22 

* 


FIFISPLAYLFFLPFPFFFIPPLCFFFF 
T PT.FFT.F 1^FIjF^T,T FFTiFSIiIliI FPPS 
LFFFFPPPFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFPTWLPHRVMTAKVGISWSKKI 


8580 


22481 


A 


8642 


180 


34 


WFISDSLWLGMVAHTOJPSTLVGQGGWI 






j± 






133 


NliP LGL FY S LQKKKKKP F LRPGTLGAPK 
GNFQGAPPLFGKFAFFFFFFFFPFFFFF 
FFFYGFFFFLFK 


8582 


22483 


A 


8644 


156 


287 


LTYPIKPPLLCSQKKXKKKKKKKXKKKK 
KKKKKGGAFKKNPWGAQI 


8583 


22484 


A 


8645 




325 


LHHCHTHWFVNKVVC F IRKKKKKGKKKK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=>Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K= Lysine, L= Leu cine, M=Methionine, 
N^Asparagine, P=Proline, 
Q=Glutaraine, R»Arginine, S=Serine, 
T=0*hreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














KKREKJCKKKKKKKKKGGGPLKKPPGGPK 
FSGGGKKKIFPQRGGYKKPPGGFLGKTL 
FLGGEKMGENPPKKIKPLGEKKIF 


8584 


22485 


A 


8646 


370 


3 


PVRASRLLCLPKQAWAMAGAPPPASLPP 
CSLISDCCASNQRDSVGAGPSEPGAGYN 
LVMHCFLSPSEKHSIWVGVTRFSRCCPS 
PLSLTRKGNSLTPCASQVRQCLALLRLV 
QGAGTHRTRG 


8585 


22486 


A 


8647 


341 


88 


KKKKIFSPPIKRGPPSVYFICPPPPFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFRMGCDRWHGELYNKI 
PIWHKLCGDL 


8586 


22487 


A 


8648 


I 


108 


PTRPRTRGVASVLYFTTILILIPTISLI 
ENKILKWA 


8587 


22488 


A 


8649 


3 


96 


E I CGANHS FMP I VLELI PLKI FE IGP VF 
TL 


8588 


22489 


A 


8650 


457 


297 


SPS FFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFLFYFALF 
FIYLIIKDNYMFQC 


8589 


22490 


A 


8651 


84 


352 


YEKTDVKIISILFNTDYMLEYNVLHILG 

FFWGGLILTLIiGIVLySPIiBALKISPSF 
GFLEK 














8590 


22491 


A 


8652 


2 


111 


NNIKPSFTRENTLMFIHLSPIIiLIiSLNP 
DIITGFSS 


8591 


22492 


A 


8653 


1 


305 


RQQQQQQQLRNLRDFLLVYNRMTELCFQ 
RCVPSLHHRALDAEEEACLHSCAGKLIH 
SNHRLMAAYVQLMPALVQRRIADYEAAS 
AVPGVAAEQPGVSPSGS 


8592 


22493 


A 


8654 


129 


268 


LSEVLYIiFPPKKKRLIILCLFFRNPGSS 
NLQKITKEPIIDYFDVQD 


8593 


22494 


A 


8655 


281 


117 


CIITLSFINTLQAYIIWLWFSEVCLYV 
SLSIYLSIYLSIYVFEYRYIKISFHL 


8594 


22495 


A 


8656 


3 


177 


DLHAHKLGVDPGNLNIJ^SHCLLETLAGH 
LPGEFTPAGHAFLDKFLDFVSTVLTYKY 
R 


8595 


22496 


A 


8657 


3 


270 


RRRGRAHCSIiDLIiGSRNPPASASRVAVT 
TGTRHLAWLIWGGFLFFVLRDVLVRFH 
ATDKDIPKTGQFTKERGLMDLQFHMAGK 
ASQS 


.8596 


22497 


A 


8658 


107 


11 


QFEHTKPTPFLPTLIALTTLLLPISPFI 
LIIL 


8597 


22498 


A 


8659 


21 


136 


VPIEHDVPVPNSRAELLKMFG IDRDAIA 
QAVRGLITKA 


8598 


22499 


A 


8660 


1 


395 


LVTALYSLYIFTTTQWGSLTHHISNLEK 
KKXGKGKKKKRGGALFKGIFGGSHFFGV 
WELLFFFLKGGIKKTILGFLGKTLFFGG 
GLLGAPLPRKIKGLREKKNFKGELGVKN 
RVFFFFGNFSSLGVYLKKY 


8599 


22500 


A 


8661 


1 


255 


RTRGRTRGRTRGLTRGKKKKKKIMKKKK 
KKKKKKKKGGAFLKNPWGGP I FWGLPKF 
YFLPNKGS FFNLIGDFLKRPFFWGGAYF 
G 


8600 


22501 


A 


8662 


136 


358 


FHIVKGVS I IKFKCVILKKKKKKKKKKK 
KKKKKKKKKKGKKL KKKKGG KKKKGGGK 
NKI FFWVGFFYKKVWGYF 


8601 


22502 


A 


8663 


198 


368 


LINLVQFTCILGTLLGILFIYLFLRRAI 
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sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

SUIIllV 2sCIU 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

la«t amino- 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine 7 
G=Glycine, HNHistidine, Hsoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y-Tyrosine, 
X=Unknown, *=Stop codon, /possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














TOLARLVLNSWPQAVLLPQPPKVLGLQA 


8602 


22503 


A 


ooo4 


4U/ 


oz 


KFLPHLGFPPFKRFFIKKEPFQWFFFLF 
GFFFFRRSFFLLPRVQCNGAISGPCNLC 
LNVKVRLS FKKKKKKCWDYTFE PLCPAL 
HFS FINKTEDCLFVCFTVYWPHGF ! 


8603 


22504 


A 


S665 


166 


293 


KIXAKHNIARTNPYTFCIMKKKKKKKKK 
KKKKKKKKKKKKKKGGAL 


8604 


22505 


A 


8666 


1 


164 


RAKAKRRNTTVSCRMRHLKIVYRRFRYR 
LYVGIIIGPGGLNETLVFTCKKITVP 


8605 


22506 


A 


8667 


295 


85 


YIYMSHICTLSLSLSIYKRVKCFCFFKT 
iiicuc\r7irmfrwnwPT)T /^TjOPPPPTCWTT^C* 
ISVFFTVLFVMKKN 


8606 


OO CAT 

225U7 




OOOO 


z 


10 1 


INLPSTLIIFTILILLTILEIAVALIQA 
YVFTLLVSLYLHDNT 


8607 


22508 


A 


oooy 


A.'yi 






8608 


01c An 

22509 


7\ 

A 


00 /u 


*k>Z 




SQILGRFSQKQHLSLGGGGCSHPRSCHC 
TPAWARGRLRLKKF IK 


8609 


22510 


A 


8671 


1 


106 


UCLLTSSDIiPASASQVAEITGMSHCARP 
LSRIYMC 


8610 


22511 


A 


8672 


383 


252 


CPANFVFSVEKGFIiHVGQAGLELPTSGD 
PPGCWFCFLQQQLHW 


8611 


22512 


A 


8673 




1 jj 


SSWGMDS IPTSSNMEETQQKSNLELLHI 
SLLLIESRLEPVRFLRSTFTNNLVYDTS 
DSDEYHLLKDLEEGIQMLMGRVEDGSHL 
TGQTLKQTYS KFDTNSHNHDALLKNYGL 
LPCFRKDMDKVETFLRMVQFRSVEGS CG 
F 


8612 


22513 


A 




*> 

z 


3ZO 


SLIjMCVHRCECVCMRACLCAGVCMCIAS 
CLGLPMNVVECYTWRVLVFHQFQDEELH 
DTVDLETI PLBRQPRDVQHPVSTRILYL 
HVYFVAVTLTLIRILQLWTEAFSP 


8613 


22514 


A 


8675 


171 


272 


TQCCLLKAEKKKPKKKPKK3CKKKKKKKK 
KQGPF 


8614 


22515 


A 


8676 


60 


409 


LVWLESHEDVGPLIKDSKQEKXKKKKKK 
KKIGCKKKKKKKRGGAFKKKPGGGQNLRG 
GEKKNFFFLRGEKKNPRGNFGKKTFFWG 
GENWAKP P PKKKS PEGKKKI LKGKGGKK 
YLNFLGE 


8615 


22516 


A 


8677 


106 


286 


LFWNKEGNNWKKKKKKKKKKKKKKKKKK 
KKKKKKKWGGALLKISGGGPIFPGGGKN 
FFFFFGGGtF 


8616 


22517 


A 


8678 


i 


97 


UGQVASVLYFTTILILIPTISLIENKI 
LKWA 


8617 


22518 


A 


8679 


1 


141 


NEHQPGQHGEMPSLLKIQKISWAWWSMP 
VTPATQEAEMEKESQISIM 


8618 


22519 


A 


8680 


499 


140 


GVFFFSRRKRCGVSPPPPPKPVFFPLPG 
FFFWWGGCPSAPPP PGGGFPPNPPRGFF 
YPPPLRGNFFFPPPAGGGPPRGFFWAPP 
PPFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFS 


8619 


22520 


A 


8681 


60 


224 


KQTKYIKE ITKHICTWQGRYHDHEGGFP 
RVKLIHCTPDMLTPVISPDVGNSTTV 


8620 


22521 


A 


8682 


252 


383 


SQXKKIOCKKKKKKKKKKKKKKKXXGGGL 
LKNFWGGPNLTGGGKI 


8621 


22522 


A 


8683 


405 


145 


KFFFSGGCFFFWGPPGKNVPPPKKMFFF 
LIPPPKKKKPPPPKKIFFPPPGVFFPPP 
PPFFFAPPPFFFFFFFFFFFFFFFFFFF 
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sequence 
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ed end 
nude- , 
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location 
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residue of 

peptide 
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Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartk Acid, 
E=Ghitamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucrae, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown> *=Stop codon, /possible 
nucleotide deletion, V=possible 
nucleotide insertion 














FFF 


8622 


22523 


A 


8684 


1 


406 


LYYFLTKSTTTYLA.VPQPFPPTPSTPSR 
AKKKKKKKKKKKKKKKKKKK3CKKKKKKK 
KKKKXKRGGG PKKKTRKGGGGQKKKKGR 
GPBKNRGGGETTPGEKKKKKKKGGGGKK 
KKKKSGGGKKNGGKPGRGGGGGR 


8623 


22524 


A 


8685 


391 


265 


GRPFFFFLDGVSIjLLPRLECSGVIIAHC 
NLCLPGPSDSPSSA 


8624 


22525 


A 


8686 


2 


177 


DTALYSLYIFTTTQWGSLTHHINNIKPS 
FTRENTLMCIHLSPILLLSLNPDI ITGF 
SS 


8625 


22526 


A 


8687 


479 


366 


GLPKCWDYRREPPHPAANFCI FSRDGVS 
PCWPGWSCL 


8626 


22527 


A 


8688 


35 


161 


SNPPSSAFQVAGTTGVRHHDQLTFFFFF 
FLGGTGQILCRGGD 


8627 


22528 


A 


8689 


1 


122 


THHINNIKPSFTRENTLMFIHLSPILLL 
SLNPDIITGFSS 


8628 


22529 


A 


8690 


2 


152 


NIILAFTISLLGILVYRSHLISSLLCLE 
GIILSLFIIATLIYPTPSFSCY 


8629 


22530 


A 


8691 


541 


85 


IFLRQFLSPRLEYSGVIKAHCSIjNIiLGS 
SDPPASAFKVAGTTGIRHHAWLIFVFFF 
AETGFHYVAQGGLELLNLRNLPATAFQS 
AGI IGISHHDLLGIQITTWLLQKLSYLQ 
KKICKDQBTGLLHAFHLHPYPYPMPSSP 
QLLQGPDILTNG 


8630 


22531 


A 


8692 


122 


38 


GRVDHSFMPI VLELIPLKI FBIGPVFTL 


8631 


22532 


A 


8693 


276 


77 


GVS PCEPVFFF I LGRCNP P AWGVP KGRD 
FFVLSFFFWFYILLFFFFFFFFYFFLF 
SFYCFNGSNK 


8632 


22533 


A 


8694 


3 


70 


FMPIVLELIPLKIFEIGPVFTL 


8633 


22534 


A 


8695 


230 


3 


GSGI PSHPGHPSLP YHIiEREWLAKTGIR 
DTSNSVHLLELdiRSQRHGRARWLTPVI 
PALWEAEAGGSRRQEIEPI 


8634 


22535 


A 


8696 


401 


240 


PPPPPPPOGSBPRPPHPSLSPPGSREMF 
VLALSQESQMTFFFFQLRBCKLHLYL 


8635 


22536 


A 


8697 


103 


313 


EKKKKKKKKKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKGGGGPKKKKFEAIFGKTP 
GGV5QGGGPPPLNFFFLNPGPPFF 


8636 


22537 


A 


8698 


354 


110 


KKGTLFEVSES F If TTEGVSAKINRIRS I 
VSVNKVARCIGEIFTDAVQSAFQKEVGG 
VS DS MVHKYE YAGCVD VKAVCS HGL 


8637 


22538 


A 


8699 


2 


164 


FLVETGFXjHDGQAGLKLLISGDPPASAS 
QSAE ITGMS PHACNSSTLGGRGGRIT 


8638 


22539 


A 


8700 


191 


371 


PPDSRWAIQGGHLELTPTTRQDIAQAPR 
RKLALLEGSFQGQAQWLTPVI PALWEAB 
ASGS 


8639 


22540 


A 


8701 


2 


170 


ARAGLYHVGQAGLiKPLTSGDPIjASASQS 
VRITGVSHRTRPKI IFKLI FFLFFFFFF 


8640 


22541 


A 


8702 


31 


229 


MEKYNVHPHSG I LRSHEKEQAALtFT iak 

rwrqpsypsidewinimwsshtveyyta 
mkrnkqlysq 


"8641 


22542 


A 


8703 


3 


126 


LQELRDPTLTFRLLGSPRPVWETRPVD 
DPTAPSNLYIQE 




22543 


A 


8704 


438 

# 


31 


S EPCAG YNLLVCRFLS PLB KHN I RVGVT 
RFSRCHLSPLSLTRKGNSLTPCTSRVRQ 
CLALLWLAHSALHPLSCMHCIiALPSEMN 
PVPQMEMQKSPIFCIAHAGSCTPELFLF 
GHLGSTPLTHLLSLR INVFLRTS E 



1076 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

oimSiia qi* i rl 
alillUU aviU 

residue of 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

/find tf\ 

last amino 
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residue of 
peptide 

CPA II An C0 


Amino acid sequence (A=A!anine 
C-Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HHHistidine, I=IsoIeucine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 

Ptz/^Jiif om Ino Rr= A rcnninp S=SArirtf» 

T=Threonine, V=Valine, 

W=Trvntnnhan Y— TvrOslne- 
X-Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 

ntirlpnlirip incprtinn 

UUUvUUUv IIISCI IIUII 


8643 


22544 


A 


8705 j 


429 


319 


PGAI IAHRSLNLPGSSDPPTSASQVAGN 
TGVSYCARP 


8644 


22545 


A 


8706 


426 


245 


RAATPSPIKLTKKKKKKKKKKKKKKKKK 
KKKKKKKKKKRGAPLKKTLGGPO INRGK 
KKIFFFF 


oWD 






R7(V7 
o/v/ 


i 


IvO 


KHPRGGS FGYCFALAWVAFPLALVSGI I 
YIHLRKRE 








o/uO 


J 




QI EGHT I SALGDGAAWPVQGL I RHFRP E 
LEERMQRFAQQHQARQAAS 


8647 


22548 


A 


8709 


2 


83 


LVLE IAVALIQAYVFTLLVS L YLHDNT 


8048 








i 


SI 


RPLSKTVRFNVLKVTKAAGTKKQFQKF 


8649 


22550 


A 


8711 


156 


4 


RPLSFHPGGKKRLFLKKKKKKLOWAHA 
CNPSTLGGRGGRIMRSGDRDHPG 


8650 


22551 


A 


8712 


A A A 

440 


z35 


TPQ VT?VT .T .\7PT3riV<5MT iPPTiV<?TiTC5r?nP P 
Iroir V Jj Li v HiX\.yJV CtViutr JvU voui uUJJr xr 

ASAFRSAGITGMSHHAQPRTGRS KAYLP 
FIiEE PDLKHI FPF 


8651 


22552 


A 


8713 


92 


302 


RMHPTMGPQDQGHTGGLYAHTPLHPHSH 
THMITIjIHSHIYAHAHSHTIPATCPKGP 
TQPYLCLLSPHIEL 


8652 


22553 


A 


8714 


2 


166 


KKINS FFSQGLTQAKVHGGRPS S LGPQT 
PVLKRSSCLSFPSSWDYRHBPLYPAN 


8653 


22554 






t 
1 




TGVS PCAQHEFFKSN 


Q^4 






O / 1U 


JO J 


105 


RRSLLHSVLNGAQAGVQWRDLGSIjQPPP 
PSSLPWPPKVPRLQPLPGRHPVWEVRSV 
<5 AR P P T VWDVRS PS AWIiPS LE SEERLCT. 
AAIPSGK 


8655 


22556 


A 


8717 


385 


1 


FlH?FT,PPAFT.1^SttYFWFFFFYI FS FFFF 
FFFLGSPPPPIFFFFFFFLNKIFFFFFF 














FFF 


8656 


22557 


A 


8718 


4oy 


zo4 


SEDDDDDPFMNPSSLRRNRR 


8657 


22558 


A 


8719 


3 


269 


KKHVKRVLSHLKASCPEKT.LHFSSWPQL 

HAJTTHTVVAKLNELPWVEMNIiIiSIjGAFQ 
QRIiP 


8658 


22559 


A 


8720 


430 


141 


I*FFFLSFFEDGVLLCHPGWSWAQSQI/T 
ATSTSKKKKKKKATPPGIPKFLIGKSGK 
PPRVLLIGHWAPFKFFFFNPRIAFF 


8659 


22560 


A 


8721 


386 


290 


rwgltmlprlvlnywlqailllwppkvs 

GLQA 


oOOU 




A 


£799 




1S7 
J J / 


HK VF iiVKT V FIjH VCrQAGliELPTSGDIiPT 
S ALWEAEWGGLIiEDRS SGPAWET 


8661 


22562 


A 


8723 


493 


186 


FSPQGGEKRAILGLRNLCPPGVKDFSAR 
PPKEVGNEGRVPKTKLFFVFLKKKGFPL 

IGRGGFKSRPWBS pprppqkvgvqrlnp 
ppgpffffdliwpnrngs 


8662 


22563 


A 


8724 


3 


186 


GGGTPPFPGVWGKGNGGGTPPAKTIiPP F 
PSPLFFFS pcpspgeggsrepllstpla 

CICI 


8663 


22564 


A 


8725 


370 


67 


FAPPKNFYKGAPPIFFIFIiFFILFIYFF 
FFFFIFFKFLVFLLETGFHHVSQEDLDL 
VIHLPQPPKWGLQAWATTPGQFFVFLI 
E TGIiHP VNQDGHNLLTL 


8664 


22565 


A 


8726 


966 


652 


LGSLKPPPTGFKRFSCLSLJPSSWNYRHA 
PPCPAN FVFLEETGFLHVGQVSLELLTS 
GDPPTLASQSAGITGMSHRDWLGQHLIV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
VSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, {^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoteucine, 
K=Lysine, LHUutine, ^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NHCYLFYQGHIKTTRSLLPPP n 


8665 


22566 


A 


8727 


85 


425 


HMHSFAHELHTFLHTHTNSHTNSQNQSP 
TYTSK5HTHSYMRFTDSHDSQMQQTHRH 
SLLHMTQLYTCTHTHTHTRTVIPLHLHS 
VKGWLNTKVALGGRTSHGRESHIAGRLL 
A 


8666 


22567 


A 


8728 


393 


124 


LGSGEPQLFQSPPSGGPPGPKSHWNSWG 
NANPPVPPYIIESAPPPPHFPFGAQSKY 
SGGQAARLCPCPPFFLKKKKKKAPGSGT 
GTFSFQ 


8667 


22568 


A 


8729 


25 


217 


AVQP IRVQWRRSLQSPQEQI ILAPSLAK 
VDMEMTQLTQENADFATRDRYHHSSLVN 
REQLMPHY 


8668 


22569 


A 


8730 


1 


611 


PGIFYSALLSLDTS ILNQLCFIMHRYRK 
NliTAAKKNELVQKTKSEFNFSSKTYQBF 
NHYLTSMVGCLWTSKPFGKGI YIDPB IL 
EKTGVAE YKNSLNVVHHPS FLS YAVSFL 

x&rsssssssssssssssssssssssss 

SSSSSPSPEERTVNVSSIRGKKWSWYLD 
YLFSQGLQGLKLFIRSSVHHSSIPRAEG 
INCNNQY 


8669 


22570 


A 


8731 


120 


378 


VEKTQAS I LGCXCSTPRAALGFRBRTLP 
LRRCPGCGPPEAQMELEKRALGSQAQWL 
MPVFPALWEAEVGESLEARSFETSLGNI 
A 


8670 


22571 


A 


8732 


2 


68 


VYVCVCVCVCVCMCVCRAKVGM 


8671 


22572 


A 


8733 


382 


258 


NYHSSLHLETPGLKQSSCLSLPKSWDYR 
HESPCPALIFNSI* 


8672 


22573 


A 


8734 


479 


326 


PPLYGFFFFFFFWGGGFFFFFFFYFFFF 
FFFFFFFFFFFFFFFFFFFQTTF 


8673 


22574 


A 


8735 


422 


2 


GRRP PGL YFFFFP PGKKNFQGRGGFLF F 
FPPKGFFFFFFPIGFFFFSSEIiGKDWPP 
PKKKVFSQRFPFFFFPPPLFLFLFFFFF 
FFFFFFFFFFFFFFFS FFFFFFFFFFFF 
FFFFFFFNKQVFI ERLLCARHCFRPPAS 


8674 


22575 


A 


8736 


7 


75 


IAVAL I QAYVFTLLVSLYLHDNT 


8675 


22576 


A 


8737 


2 


176 


KFGI1LAETGFLYVGHAGLEVRSSGDI1PA 
SASQRAGITGVSTAPGFNRYFYKQTIYK 
YG 


8676 


22577 


A 


8738 


408 


111 


GSSPPPRAGGENFLKKNAGGKNFPGGEG 
GGGFYPLYPKKFFFS PKGFI FWGGGGGK 
WPPPKKGGFSKKPQKVFFTPPQKKKKFF 
WPPGGNWGPPKNF 


8677 


22578 


A 


8739 


5 


249 


YEGLKKLHMQAS P FQRGHP VNHKKKKKK 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK 
EGALGGKFLAQTS FKGEKQKVFI ICY 


8678 


22579 


A 


8740 


1 


226 


FFLRQSLSVTQAGVQWCDLSSIiQIiliAFQ 
GSSNSPTSASQVAGITGVHHHIQLIFIS 
L VETGFHHVAQAGIiE LNLQ 


8679 


22580 


A 


8741 


4 


96 


DGLIMLPRLVSNSWPQGILPPWPSKMI/3 
LQV 


8680 


22581 


A 


8742 


19 


112 


FGRILLVREKKKKKKKKKKKKKKKKKKK 
GGGPL 


8681 


22582 


A 


8743 


362 


234 


KMDQKCLSEVLQRWFPCCCCXXCCCCCC 
CCFSLPFIFEPSRS 


8682 


22583 


A 


8744 


3 


419 


LTLRRFQLNLTALTKKKKKKKKN1JQCKK 
KKKKKKKKKKKKGGPLFKNSLGGPIFPG 
GGKFIFFFFWGGFLKPPGDFLKKTFFLG 
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S£QQ) 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
nen fide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
&=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
KNLysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S~Serine, 
T=Threonine, V^Valine, 
W—Trvotoohan. Y^Tvrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, V=possibIe 
nucleotide insertion 














GEI FGPPPPKNI/TPLGKKKI FKGVGGKN 
PPLFLRRKKFFSGGFFKKIFSPGLGF 










AAA 


i 
i 


KKP S KE I LYPENSRPFS PL S PLKFFFS P 
KGFNFWGGGGPLCPPPKERFFSKNPPGG 
FFSPPLKEKI FFFPPPVNLGPPRDLFKG 
APPFFFFFFFFFLWE3flFYWLSLSSGSQR 
LPQQVPTVEPSELGGMGAYFVSKRSTYL 
QRGWRPER 


8684 


22585 


A 


8746 


104 


222 


NKTFCIjLKKKKKKKKKKKKKKKKKKKNK 
kkkkknwggal 


8685 


22586 


A 


8747 


403 


201 


MEKYNWQPHSGIIjYSHBKEQAALFPIA 
KRWRQPSCLSIDEWRWIMWSSLTVEYYT 
AMKRNKQLYSQ 


8686 


22587 


A 


8748 


1 


90 


RTRGAVYAALERMGLDGCVEDLRSRLQR 
GP 


8687 


22588 


A 


8749 


444 


85 


SPPPPGLFFFFFFPKKKTSPPPTKKGFF 
SPPPPQKFFFFLKPFFFLGGLGFNFPPP 
KKNFFSKNPPRFFFFPPFKKKNFFFPPP 
FFFAPPKFFFLTPPPLFFFFFFFFFFFS 
FFFFFFFL 


8688 


22589 


A 


8750 


2 


309 


EFFPPYWEFLKINACMFSPEKKKKKKKK 
KKKKKKKKKRGPPLKKTPGGPQI PRGGK 
KKIPPLKGGQKKPPRGFLEKNPLFGGGP 
FGPPPPPKINPPEKKKNF 


8689 


22590 


A 


8751 


3 


76 


TASVSEGGGLQGITMKDSDEEEEG 


8690 


22591 


A 


8752 


433 


145 


SFFFFFFFFFSFFFFGKKSSFFTPb 


8691 


22592 


A 


8753 


2 


124 


GHLI^HLIGSATIiAISTINIiPSTLIIFT 
ILIKKKKKGRPF 


8692 


22593 


A 


8754 


80 


201 


FINANSKiCKKKKKKiCKKKKKKKICKKKKK 
KKKKGGGLLKKTPGGAQF 


8693 


22594 


A 


8755 


415 


1 


IYFPTPEKFGPPKETLKKGAPFFFFFKQ 
KFPFFGPGGKQKGGFGSLQTLPPGKKKI 
SRPNPPRKRGFKDAPPPPGKFVFLKKKG 
VFPGGQGGFKTPTPRDLAPPAPIiNPGAL 
FFFFLRQSLSLSLRLECSGTILAHAS 


8694 


22595 


A 


8756 


3 


136 


FTATRPGVYYGQCS E I CGANHS FMPI VL 
ELIPLKI FEIGPVFTL 


8695 


22596 


A 


8757 


80 


326 


KKFSLGSQGGRAGVFPRPPLGKLQTTAL 
NSGAKGTPPFLFPEPRKRGGPPPAPGWE 
GIFCPHFFPMGQKGEGGLWHTGEGPH 


8696 


22597 


A 


8758 


1 


164 


SLISSTQGHKQCRRPQGPLPRKTRDPCS 
HVYLLTFPPLLSYDPAKSPSLRNTQE 


8697 


22598 


A 


8759 


431 


79. 


FFFFFIFLKKVSTQSPPPQFYFFFFLKT 
FFLFQPLLKEGGSWGGRGDKPPPLAPFK 
KWGVFFFFFSFFCCFVFFFFPLFFFFFL 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFPRCVIiTRL 


8698 


22599 


A 


8760 


60 


194 


ARPTCPATAVTQKKKKKKKKKKKKKKKK 
KKKKKKGGGVLKKKQGGRKY 


8699 


22600 


A 


8761 


383 


38 


CASMTFPKKKK3CKKKKKKKKKKKKKKKK 
KKKK 


8700 


22601 


A 


8762 


399 


164 


HITVKSLLVPMDDPPKKKKKKKKKKKECK 
KKKKGGGFKKNLWGGQKLTGEKKKIFFF 
LKGGKKKPLGIF 


8701 


22602 


A 


8763 


353 


48 


NFFFKVFFFLKDFSHKGAPPKKGGPPKK 
TPPRKKFFFLKIKPLFFIAFFFFWSLFG 
FFLIFSLFFFFFFFFFFFFFFFFFFFFF 
FFIVFTITLMHFKIYLH 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Ala nine 
C=Cysteine, D=Aspartic Acid, 
£=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine > 
N=Asparagine, P=Proline, 
Q=Glutamine, R— Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X= Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


8702 


22603 


A 


8764 


2 


93 


FAMLPGLVSNSWPEAICLPWPPKLLGLQ 
MM 


8703 


22604 


A 


8765 


398 


63 


PPPPVFFFFSGGFFFWGGGGQKSPPRER 
CFFFKMHPGFFYI PPFVGKKVFFVLEGV 
VCPLGFFLSGGAFFFFFFFFFFFFFFFF 
FFFFFFGFFFFFFFFFFFFGFPSVSHL 


8704 


22605 


A 


8766 


3 


83 


ILEIAGALIQAYVFTLLVSLYLHDNT 


8705 


22606 


A 


8767 


450 


306 


QKKKKKRRNKPEGLGTVAHACNPSTLGG 
RGRWITQPGQQEGNFISKY 


8706 


22607 


A 


8768 


1 


111 


WSLALVAQAGAPGFKQFSCLGLPKSWDY 
RHEPPCPGL 


8707 


22608 


A 


8769 


385 


261 


FLAEMGFCRVGQAGFKLLNSGDPPASAS 

qsagitgviapvl 


8708 


22609 


A 


8770 


3 


293 


SLGS KPLGLLSLS PVKWFR I FGKERNKV 
WGKDGGTDRNQSSSAFKHLQRGDSDPKQ 
NKI KACS S KFYLRRCVKRS FLLI KKKKK 
KKKKKKKKKKKKKKK 


8709 


22610 


A 


8771 


2 


281 


CVCVCVCARVYI Y IVYVY I C I YTYMCVY 
MYICIYIHVYIHVYTCVYVYIRIHMYIC 
ICVCIYTCIYTYIYTHTYKHSVYYLHNF 
YINPKLLQS 


8710 


22611 


A 


8772 


295 


161 


THTHTHTHXHTHTHTHTHTHTTCSLP PS 
LAQLLGNCCIKAAISILYIL 


8711 


22612 


A 


8773 


1 


318 


FFFFVRWSFTLVAQARVQWRDLGSLQPL 
PPGFKQFSCLSLLSSWDFRHTPPCIiANF 
VFLVETGFLHVGQAGLELPTSGDLPALA 
SRGAG ITGVSHHPQPPLCFLFL 


8712 


22613 


A 


8774 


3 


70 


FMPIVLELIPLKIFEIGPVFTL 


8713 


22614 


A 


8775 


369 


191 


TYKINKrGWAWWCAPLVPAAWKAEVBGL 
LEPGSLRPASVIQQDPHLKKGTYSRVLT 
THL 


8714 


22615 


A 


8776 


2 


124 


GFRHVGRAGLEFLTSGDPLASAFQSAGI 
TGMNHHVQPIVE 


8715 


22616 


A 


8777 


1 


162 


LKYYTADENGKTSRLI*PQRPSDEOGAGV 
FMASRFDRH YCGKCCLT YCFI KPEDQ 


8716 


22617 


A 


8778 


346 


138 


YNSPPYKEKTIPLQARVNFGPPRDSLKR 
PPFFFFFFKRQSLTMLPRLVLNSWAHGI 
LLPWPPKVQVLQA 


8717 


22618 . 


A 


8779 


2 


117 


LNLDTTAVQVRNYPRI RES YKVS FLS AL 
EEYTKKLNTQ 


8718 


22619 


A 


8780 


2 


97 


WDLTMLPRLVSNSWAQVILPSWPPKVLG 
LLA 


8719 


22620 


A 


8781 


403 


276 


IISTVFQRGSCPIPSRKEVCSERPRRLQ 
KDSLTGFPHTVYKH 


8720 


22621 


A 


8782 


2 


119 


QAGLELLTSSDLPASATQSAGITGMSQR 
ARPHFQCKNVP 


8721 


22622 


A 


8783 


245 


101 


ELQFKMRSGWRRSQP I SWGLTLVPRL I S 
NSWAQVILLTWPPKVLGIQA 


8722 


22623 


A 


8784 


2 


124 


AKLEKKKKKTGRAKRRMQYNRRFVNVVP 
TFGKKKGPNANS 


8723 


22624 


A 


8785 


3 


355 


VGPSEPGAGYNLLVCRLLRPLEKHGIRV 
GVTGFSRCHLSWLPLARKGNSPTPCASW 
VRHHLALIiRLTVCGLHPLS CTHCLTSPS 
EMNPVPQLEMQKS PVFCVAHAGSCRLEL 
FLFGH 


8724 


22625 


A 


8786 


238 


9 


PVKLGAGPPQAPFWLRKWAPLIKPFFKG 
PIGSLQIPLFFFLRQSLTLPPRLECSAT 
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SEQ n> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^Alanine 
C=Cysteine t D=Aspartic Acid, 
E=G!utamic Acid, F=PhenyIalanme, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, p=ProIine, 
Q=Olutamine, K— Argimne, o— oerine, 

T— TltraAitSna \f — Valine 

l— 1 nreonme, v— valine, 

vf — lrypxopnan, i— •lyruoiuc, 

X=Un known, *=Stop codon, A=possible 

nit**! &t\titi& rlaif^ttrin \snnccihiP 
llUucUUUv UclcilUU, V jjua&ivic 

nucleotide insertion 
















8725 


22626 


A 


8787 


336 


10 


CLSLPSSWDHRRLPPHPASFCIS 


S726 


22627 


A 


8788 


1 


147 


IFYLLETGFHYVGQAGLBLLTSGDSPAS 

no AO TVnTTYtnTMTnillT .PFTWVT? 
HjyoAu X xo \i ailiu\ljtrd inv c 


8727 


22628 


A 


8789 


3 


130 


DLBKG I QTLMGRLEDGS RRTGQ I LKLDH 
SSBFS KTRELYP VF 


8728 


22629 


A 


8790 


206 


3 


VPPSGP I KKGDKKKNLYLIFLI KNVPNK 
KJjKxvCjVb trvecrt! r V r xii-ia xvsriui 
VGQAGLEIiLTS 


8729 


22630 


A 


8791 


124 


390 


RAGAQSNLME P P PS GVKAI F FPNFPKKW 
GNGTP PPAP L I FGGLKKKGV S P CGPGGS 

e»T»T>7V OGT5T TV DD V/^^KOWPT^OT DBDD 

EPPAPGtyJroirJj/UrJb'IUiuW^ 
DKPLN 


8730 


22631 


A 


8793 


2 


301 


HEERERERERERERERERERERERERER 
ERBRERDALFAFFPPRGARPPPEIERGV 

m 7/~i 7V C CH TO X? T VDTT . O TPT7T7 C DP TitTDflO 1? 

LRSRCVSHSLHLISRA 


8731 


22632 


A 


8794 


1 


285 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERGAPPP 

YIFFHMRARPPHRYIjCAQRETRPAL 


8732 


22633 


A 


8795 


1 


321 


ARGERERERERERERERERERERERVSR 

DTTyDfTYr DTVM?T3XI'DT.ADOT3r*Cl?rY2GAriTI 
KJL FK<aDiiKL%J « irHFlaAKoKlJor UVjo/iX/Xl 

FGRGVFNKVSVVTDPPTHRVTSSLGGGV 
ERDLLTLSGGGTYAPWKNMCATGEDQ 


8733 


22634 


A 


8796 


2 


140 


ERERERERERERBSLPPPAGCAKGAGWG 
E 


8734 


22635 


A 


8797 


2 


223 


SAKEKiiKJilKi&KislCEiKI!^ 

RNLSRGGGGVPPPLQNVRIHSGGPARGT 

UkiH 1 KKKl oxi X U VvyJjftW 


8735 


22636 


A 


8798 


1 


176 


ARGEREREREREREREREREREREKISF 

T r2r2^3T>n , mi?7A T TCP fSVTfWnfTP P YTTY5P V 
1jOV7VJX\.0 iTUf rxX JjiCVjr V lur VAjrlu^ X X X VJlx V 

RN 


8736 


22637 


A 


8799 


335 


27 


PPTKFFFFFFLVEMGrcCIAQGGLELLS 

COOT DICTA VnQ7ATrTTnVQWWAW<5VT7 


8737 


22638 


A 


8800 


342 


191 


STSIiSLPKCWDYRREPPCPANLSYFFKD 
PFS KCNPRIiRYWGLGLQHRNFG 


8738 


22639 


A 


8801 


104 


,332 


AVPLTMVKIQPLWKRVWRFHNKS KLELP 
STGPLTIAGMWNPPKRSSMD 


8739 


22640 


A 


8802 


563 


258 


FFLNLETRSWYVAQARVQWLFTGTVTVH 
LTLPFLKl^Ll^LGGGGVGYPSTTOLEG 


8740 


22641 


A 


8803 


2 


199 


ARGLLVRRFLS PS E KRS IRVGVTRFSRC 
P P<? PT .S T .TMC^S LTPCASOVROCLAT.L 
RLAHGACTH 


8741 


22642 


A 


8804 


266 


108 


JCT.VVCJTCTAIiPPSFSSSCSSHIGCACFP 
FTFHYDCKFPQASQAMLPEQAVEP 


8742 


22643 


A 


8805 


1 


140 


KWWFKRPGVYYGQCSEICGANHSFMPI 


5743 






oouo 


z 


on 


SSAAAEENTOEKKEEAEKTEDDMEFGLF 
D 


8744 


22645 


A 


8807 


229 


34 


KKKIFFFFPGFFWPPQKFFFKAPPPFFF 
FFFFFFFFFFFFFFLDLFIYLIILSYTK 
LISIRAAAPI 


8745 


22646 


A 


8808 


43 


173 


NADSGHAQWLTPGM PALWEAKAGGSQDQ 
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E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
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X— Un known, *=Stop codon, /—possible 
nucleotide deletion, \=possible 
nucleotide insertion 














EIETILAGAGKPRLY 


8746 


22647 


A 


8809 


259 


134 


IYMI FKNKFFNRDRGLPMIiHRLVLNWAQ 
VlLLSWPPKVLGIi 


8747 


22648 


A 


8810 


391 


193 


VLPKI PGIQFPPPCRFKKRPRPRFKKPP 
PKRKKIS FSNP PKI WPPQGYFKRGPPRL 
FYFFFFWIL 


8748 


22649 


A 


8811 


304 


3 


TTPPTNIFFCFLFFIFIFLWYFIFTFFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
QCFIGHDDQKMMVHTHTQTSSHLLQGGS 
NPQGKGQS PGWVLWCA 


8749 


22650 


A 


8812 


395 


268 


FLVETGFHHVGQAGLELPASSDLPALTS 
QSAG ITGVSHHGWS 


8750 


22651 


A 


8813 


782 


1174 


LSRLFYFCVLFCLYMKTTQLPYFRGLVC 
IfFVLROGLTLSSRIVE CSfiM T AAPPQT.NV 
PQSTDS PTSASRVVRTTGVRHHTQL I FV 
CFVEMWFHYYAQAGIiEPLGSSSPPALSS 
QGAGITDVSHHTPLEbCF 


8751 


22652 


A 


8814 


95 


236 


ATMPGLKNIYFLKNKDKGLTMLPSLVLK 
SWARVILLPWPPKVLGLQT 


8752 


22653 


A 


8815 


381 


58 


GLTPFPPKKPKNYWARGGGPFI PPPREG 
WAGGFFLPRRGRVPLAPGSNNLELGPIiP . 
SPPGYQKKTPFSKKKKKEKKKRKR 


8753 


22654 


A 


8817 


441 


158 


FFFFLRTDGYLTMLPRLVSNSWPQAII*L 
PQPPKMLGL 


8754 


22655 


A 


8818 


415 


342 


RLVLDSWAQEILLPWPPKVLGLQV 


8755 


22656 


A 


8819 


400 


171 


NFFFPPGVKFLGGGGPQFPPPQKRGFFQ 
KTPGG FFKPPPKICICICFCFFP P PP ITTrtP PO 
GIFKKAPPFFFFFFFFFFFF 


8756 


22657 


A 


8820 


2 


268 


INIILAFTISLIjGILVYRSHLMSSLLCL 
EGI ILSLFI IATLITLNTHSLLANI VP I 

VLAR 


8757 


22658 


A 


8821 


2 


258 


LCLPNQAWAMAGSPPPASLLPCSIjISDC 
CASNORDSVGVRP55 R PGVR YQT ,WPP WT . 

SPSEKRSIRVGVTRFSSWVRWLRTVIPA 
T 


8758 


22659 


A 


8822 


250 


456 


GGODKFGLIETFPPGLKPFFLLNLLSGW 
EliGPIiAPPPFKFCFFKGRGFPFLPRFVF 
VANLLLTCCKKRD 


8759 


22660 


A 


8823 


492 


376 


QENCLNPGGRICSELRSCLCTPAWATER 
ACLKKQNKTNQ 


8760 


22661 


A 


8824 


1 


116 


LTPLPPSAPPSVDDNLKTPPBWVCSLPF 
HPQRMIISRN 


8761 


22662 


A 


8825 


2 


191 


FLVETGFHHFGRAGLKLIjNSGGALTSAF 
QRAG I TGVI TGVSHCFEVNR VWTGTO YF 
LCYILKS 


8762 


22663 


A 


8826 


393 


75 


PRFFPPPPPKIFFFPPPPFFFLGGFPPI 
PPPPKNFFFPKPPPGFFFPPPLKKKIFF 
PPPPFFSPPPNFFFPPPPPFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFS I 


8763 


22664 


A 


8827 


81 


380 


KI ITKHNIARTNP ytfcimnylkkkkkk 
KKKKKKKKKKKKKKKKKiOOOCGGGPFLK 
NPGGGQNNPGGKKNFFFFFGGGKKNPPG 
FFKKKPFFGGGKFGAPPPPKN ! 


8764 


22665 


A 


8828 


354 


13 


TKPKTPPIiTKGPRRGFLAPRVPGGGGPI 
TPPPQNFLGEKGTLARKKVFPPGRPAPK 
RP PRKKNGFEMRPLKKNGPGP WKFLGFF 



1082 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

am in n acid 

residue of 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 


Amino acid sequence (A=Alanine 
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nucleotide deletion, \=possible 
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LANPPGPGPKKKKPENQTHKKKCQRRKP 
RF 


8765 


22666 


A 


8829 


! 


215 


KHSTTHSHCPRTIKLKKKKKKKKKKKKK 
lOKKKKKKKKKKKKKGGPPFKICrLGGPPF 
SPGGKKKFFFFLGGL 


8766 


22667 


A 


8830 


1 


127 


ILFFWQRRGLALLHRLVSNSWAQVILPP 
RPPRVLGLQVIYFK 


8767 


22668 


A 


8831 


264 


55 


FFFXFFXFFFFFFFFFLFVFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFFFWCS CS 
YVLIFIQPFIKSFH 


8768 


22669 


A 


8832 


399 


85 


PPQKNFFFPQAPKISGGGGPQIAPPQKK 
GSFQKTPGGFFIPPQKKKKNYFPPPGKM 
GPPQGFFKRPPPLFFFFFFFFFFFFFFH 
NRHLFKVKHLQNS FHLVQLRL 


o/oy 


2.2.0 /u 




OOJJ 




zoo 


NCL YRTKKKKfCKKKKKPPYKKKTLRG pn 
FPPAGPPAPLPLSGGEKKPSRGLIiRRPP 
TLGGAAKGPPPPPKLTPIiRKKKIF 


8770 


22671 


A 


8834 




146 


INIIliAFTISLLiGILVYRSHLISSLLCL 
EGIILSLFIIATXjITPSTLC 


8771 


22672 


A 


8835 


2 


79 


EDPKTSPKPKI IQTRRPGLPPSVSN 


0/ 12 


-TO ATI 




ooOO 


l 




PTRPTI ITP ILLTLFLITQLKILNTNYH 


8T71 
olio 






9917 


1 


292 


AKTFNF YKVEFINVFFNGLCLLI I KNTL 
PKKKKKKKKKKKKKKKKKKK 




ZZO / J 


A 


OQ18 
OojO 


1 
i 


199 


VASNS IHPKAKENTAPHTHTHTHTHTHT 
HTHTHTHLFMAIFK 


8775 


22676 


A 


8839 


2 


106 


HVGQAGliELLTSGDLPTS ASQSVG ITGV 
SHRAWP 


8776 


22b/ / 








994 


APIiOGRQICDCDCTYPSPHTYIHTHTHT 
HTHTHTHTHS 


8777 


22678 


A 


8841 


266 


1 


NSLSVEFLILFPPYNYSPKQCICSHQNC 
LNFWVT KCVNTPHPNFLNLRMPLLAKRG 
AHACDPNTFGGGGRWITRSVDRDHPGQH 
NETP 


8778 


22679 


A 


8842 


343 


52 


KTGFRHVGQPGELLASSDLPASASQSAG 
ITGVSHRVRPGLHILDNSSFLDTSFADI 
FSVLYLKAGIASLLHIYIHFLLPLRDLL 
LVSSLTVSFPTAV 


8779 


22680 


A 


8843 


256 


35 


LFIFSLSLQYIFCVCGCVCVFMSLCFLV 
CAWGHVQLHVCS CVRMCVCENDVCVCGW 
VC^CFCIKNFQKBVYQI 


8780 


22681 


A 


8844 


333 


11 


PPIKKKAPPPKGRAFFFFFLKKKKGGPP 
PPQKKTGGGGPKKRGGVKKPPPGFFPGF 
FGPLFFWGPPFLPPFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFNFISTKNLF 


O/ol 


ZZOoX. 




R845 

Oir+J 


15 


288 


RCGLTVLPMLVSDCWAQVILLAQPPRVL 
GLQAGASASTPITSVNCSQGAIYPEVGT 
TGSTI IAAASSGGEFDSVLQNDICMCFL 
TQQFHF 


8782 


22683 


A 


8846 


1 


397 


ECAHHTQLIFLFLIESSLHHVGQAGLKL 
PISSDPPTSASQSAGTTAVSHHA 


o/<Si 








i 
l 




ARERERERERERERERERERERERERER 
EREREREREREREREREREREKKKLGGG 
GGGPQFPIGGGLTQGGGGG 


8784 


22685 


A 


8848 


2 


233 


HEERERERDRERQRERERBRERDRERER 
ERETFFSS SGGGP KETTLFFETGAPKEG 
YQNMHARLI PS AVPFKKRGRG 


8785 


22686 


A 


8849 


2 


180 


HKEkERERERERERERERERERERERER 
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ERERERERERBRERERDFVFYRWGEGHI 
DRQYCSFLCPGWCHCPM 


8786 


22687 


A 


8850 


2 


79 


PEERERERERBRERERERERERERERER 
ERERE 


8787 


.22688 


A 


8851 


1 


178 


ARGERERERERERERERERERERERERE 
RERERERERERBRERERERARLFARGEK 
KKHTGGGPPLPYYLYI 


8788 


22689 


A 


8852 


3 


76 


TRRERERERERERERERERERERERBS P 
A 


8789 


22690 


A 


8853 


5 


227 


EERERERERERERERERERERERERERE 
RERDFFFFFLEKKYLFSPIGELKGRGGG 
SLTPLLTGKWNPCPPPPH 


8790 


22691 


A 


8854 


1 


189 


ARGERERERERERERERERERERERERE" 

RERERERERERERERERERERERERERE 

RHSRLFPRESVCVARAGGASSPPRG 


8791 


22692 


A 


8855 


2 


271 


HEERERERERERERERERERERERERER 
EREREEPPRGEKYCIRVPPQKILSPACD 
SLLYVTHNLCVSHSTPRGERYVALFNSR 
ETRCF 


8792 


22693 


A 


8856 


2 


309 


HEQQQQQQQLRNLRDFLLVYNRMTELCF 
QRCVPSLHHRALDAEEEACLHSCAGKLI 
HSNHRLMAAYVQLMPALVQRR I ANYEAA 
SAVPRVAAEQPGVSPSGS 


8793 


22694 


A 


8857 


44 


203 


GYSTCVGMHAHTHIHTDTHTHAHTYIHQ 
CLIQIGLCTFATCTSQVNKSSMAEL 


"8794 


22695 


A 


8858 


329 


182 


IIFVCLVERGLHYVGQAGPELKASDDLP 
ALASQSAGTSGVSHCVSSKMS 


8795 


22696 


A 


8859 


1 


175 


GLTLPPRLGCHATITAHYSLNLPGSSNP 
PTSASQVAGTTHSRPCPE ITQTALQRGP 
HD 


8796 


22697 


A 


8860 


3 


350 


HEAIGLRSNIKNGLDHFLPLGTPTPLIP 
ILAMI ETI LLLVQP IAIAVRLTANI TAG 
HLLMRLNGSATIAISTISLPSTLIIFTI 
LILLTILE IAGALIQAYVFTLLVGLYLR 
DNA 


8797 


22698 


A 


8861 


139 


3 


TFKEAD IKGGGPPHLASFCIFGRDGVSP 
CWPGWSRTPDLKRSTRA 


8798 


22699 


A 


8862 


3 


134 


SPTPPPSSKPSSIPRKSSVDLNQVSMLS 
PAALSPASSSQRHES 


8799 


22700 


A 


8863 


346 


219 


RIiVPPCIiDNFLNFCRYSCLTMIiPGLVSN 
S WAQVT FLPQPPKD 


8800 


22701 


A 


8864 


1 


147 


GTRFFFERGIDHVGQAGIQLLTLGDPPT 
PASQRAE ITGVSHHTWPNFI F 


8801 


22702 


A 


8865 


267 


132 


KKKFWPFFFFFLRDGVSLLLRLVSNCWA 
QEILLPWPPIOjIiGIiQV 


8802 


22703 


A 


8866 


343 


215 


LAQGLKTPI PKRVP PHTQKKKFCS ERPT 
RIQTDRSCWVSPLS 


8803 


22704 


A 


8867 


1 


242 


GTSKKIDKLDYIBLKCFCTAKETISRVK 
RPVEWEKIFANSSPEKGLISRINKEAKK 
LNSSAIAAHNCNPSTLGGQVRRIT 


8804 


22705 


A 


8868 


1 


219 


GTRRPTWATWQNPASTKKKKNSFFFETE 
SRTVPQAGGQGGNLGSLQSLPPGLKRFS 
CLSMPGAPKGQSGSLMG 


8805 


22706 


A 


8869 


1 


181 


ATGGILLLIiDWSIiAYESKHLLEGAKSE 
SAEELKKRAQELEGKLNFLTKIHEMLQP 
GQDQ 


8806 


22707 


A 


8870 


1 


255 


AAAPPNAPGGPPGPQPAPSAAAPPPPAH 
ALGGMDAEL I DEEALTSLELELGLHRVR 
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ELPELFLGQSEFDCFSDLGSAPPAGSVS 
C 


8807 


22708 


A 


8871 


383 


233 


ATTFPNPGFLFVRLFVLVQWPDHEPPGF 
KQSSHFSLPSSWDYRHPPPPPG 


8808 


22709 


A 


8872 


211 


294 


AXKKKKKKKKKKKKKKKKKKKKNIIWGG 


8809 


22710 


A 


8873 


398 


100 


QKERPPGFPPFKPGPPQKRGGKGAPPKD 
GEHYFFWVLEKSTGFPPKTKRVYYPSPV 
YPHPPAPQKEGFPGGSPPPRPIASPFGG 
KKKEKKSYIEKENLF 


RftIO 

OO 1\/ 


97711 


A 


8874 




214 


DAHVLGILVYRSHLISSLIiCLEGI ILSL 
FIIATLITLOTHSIiLEKKKKKKKKKKKK 
KGGAFKKNPGGAQI 


8811 


22712 


A 


8875 


2 


254 


PRVRPEGRNRTPQSWFQANPMACMTFSK 
GGGKNFREKEKKNFLLIRGKI KKTGGDF 














RR19 




A 


00/0 


X. 


1 j j 


TINLPSTLIIFTILILLTILE IAVALIQ 
AYVFTLLVSLYLHDNT 


OO 1 J 


99714 


A 


8877 


2 


296 


LARGAEVLG YGS HS RGRVPGALVGQGAG 
RLFTEHPGSS PATLAI YPTPEGTS VAVS 
ISAPPKARSRPYPPSRSCHNLSIAGSSV 
LVPPGPVHRWVGRP 




22715 


A 


8878 


164 


29 


RKIGPARWLTPVI PALWETRAGGS RGQE 
IEIILANTVKPGACEVL 


8815 


22716 


A 


8879 


381 


219 


GEKMTEEEVHMLVAGHEDSNGCINYEGK 
R 


8816 


22717 


A 


8880 


2 


156 


ANGNS FATRLSNI FVIGNGNKP WI SLPR 
GKGIRLTVAEERDKRLATKQSSG 


8817 


22718 


A 


8881 


3 


402 


S GFN I E YAAGP FAL FFIAE YTNI 1 1 INT 
LTTT I FIjGTTYDALS PELYTT Y FKKKKK 
KGGRP FFFFFFLKKKKI PPPKKKNFGKK 
GETLKGRGGAP I FQPKKILSPPKKKKKK 
RGGAFKKSLGGAKFNGGGRN 


8818 


22719 


A 


8882 


1 


406 


RYSTPSEGEVGERYSTPPGETLERYSTP 
PGETLERYSTPPGETLERYSTPPGETLE 
RYSTPPGETLERYSTPPGEALERYSIPT 
GGPNPTGTFKTYPSKKKKGGGRFKEPLG 
G PRFTGVGRVKVFS LWGGVLNAR 


8819 


22720 


A 


8883 


4S4 


307 


KKILNRRVRWLTPVI PTLWBTEAGSSRG 
QEMETILGNTVKPPASASQSAEIKGMRI 
IIP 


3320 


22721 


A 


8884 


1 


131 


GSLTHHIHNIKPSFTRENTLMVIHLSPI 
VLLSLNPDI ITGFSS 


8821 


22722 


A 


8885 


406 


106 


KGGAGPKIAPPKKKAFSPIPPPFFWPPP 
VFLKGPPPPSPENFFPPPVFFFRGPLFF 
FFFFFFFCETVVLPFRPSNLVFTLPHVF 
AWLIPIDQYIjFISHSC 


8822 


22723 


A 


8886 


425 


325 


SSTHTHTHTHTHTHTERSHLLAFEPSAG 
ELWMAM 


8823 


22724 


A 


8887 


394 


278 


VCVCVCMCVYICIYPCWLPCWHMCLELY 
KVFKGIMKGV 


8824 


22725 


A 


8888 


2 


86 


YSRTRAAAL IQAYVFTUjVSLYLHDS AY 


8825 


22726 


A 


8889 


400 


62 


FLYLKSFPPPPERGVFSPLPPQKFFFSP 
KALNFLGGGGPKFPPPKKRPFFKYPPRV 
FLYPPHKKKNYI FPPPDDFGPPRVFFKA 
PPPHFFFFFFFFFPFFFFFFFTRLEWVQ 


8826 


22727 


A 


8890 


3 


100 


SLEPGTSGLKGFSCLTLPSSEDYRHEPP 
CPAN 
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nucleotide deletion, V=possible 
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8827 


22728 


A 


8891 


32 


142 


VRPTRPGQVAS VLYFTTILILI PTI SLI 
ENKILKWA 


8828 


22729 


A 


8892 


3 


382 


TPHNPLSRPSYEPKPASTPSRAPKKKKK 
KNPKKKKKKKKKKKKKKKKRGGALLKKT 
LGGPKLPGGKKKNFFFPRGGKKKPPGDF 
LKKTLFLGGGNIAPPPPKKKNPWGKKKI 
FWGEGGKKNFFFFG 


8829 


22730 


A 


8893 


368 


199 


KKKRIS PPSRKRAPQGDFIiRPTPP IKNL 
YYYLSFFFLILYIFFFFFFFFFFFIFFF 


8830 


22731 


A 


8894 


3 


136 


TXNLPSTLIIFTILIIiQTILEIAGALIQ 
AYVFTLLVSLYLHDNT 


8831 


22732 


A 


8895 


1 


144 


IAT INLPSTLI I FTILILLT ILEIAGAL 
IQAYGFTLLVS LYLHD YP YN 


8832 


22733 


A 


8896 


153 


375 


PQKKKKKKKKKKKKKKKKKKKKKWGGGF 
LKNSGGAPIFGGGGKFFFFFLGGGFKTP 
RGFFGKNPFFWGGKKKKN 


8833 


22734 


A 


8897 


332 


400 


WLTPVI PXLWDAEAGGS FEVRSS 


8834 


22735 


A 


8898 


215 


359 


TELVPNAVLFCPLPCSSGHHLLCAIHAK 
RVTIMPKDIQLARRIRGERA 


8835 


22736 


A 


8899 


414 


73 


PRLLCFHKKAWAMAGALPPALIiPPYSLI 
S DCCASNQRDS VGVGPS EPGVG YNLWR 
RFLSLSEKRSIRVGVTRFSRCCPSPLSL 
TRKGNS LTPCTS RIiRQCLP LLWLAHGAR 
TH 


8836 


22737 


A 


8900 


374 


88 


VSPPSPLKIFFSPKAFNFWGGVGPHFPP 
PQKRVLPQKPPGGFFSPPLKKKKFSFPP 
PVKDGPPKGS FKGPPPFFFFFFFFFFGL 
DFMYTGSSNVW 


8837 . 


22738 


A 


8901 


433 


56 


PEGPPPPGGFYPGKRGFFPTPIiWFGGP 
KRDPPPRGKKNPSWRSGGKQTPRFPRDP 
LLKAFFPGGPPQRLGKAQGPFWFFFGPK 
KGKTQTGPPFFFKKLGDPWGPKKKPPPG 
GAPRLGP PKKKFL 


8838 


22739 


A 


8902 


3 


73 


IAVAL I QAYVFTLLVS LYLHDNT 


8839 


22740 


A 


8903 


3 


149 


PNIiS YI IGKDTWVEHWPBKDECQDEENQ 
KQCQDLGAFTESMWFGCPN 


8840 


22741 


A 


8904 


351 


137 


FPPEFFAAPKKSGILFFSKTWMEVEVLF 
LSQLMIiGQKTQYCMFSLINGRLIIKAPG 
SLKGDHHPPLGLLEG 


8841 


22742 


A 


8905 


142 


5 


KJiFLQKKKKKKGLGTVAHACNPSTLGGQ 
GGWIMRSRDQDHPGQQW 


8842 


22743 


A 


8906 


367 


244 


HVPQARAKFFCIFSRDGVSPYWPGWSRT 
PDLMICLLFIVBL 


8843 


22744 


A 


8907 


146 


248 


SQILGRLRQENCLNLGNGGCSEPRWCHF 
LAWAKE 


8844 


22745 


A 


8908 


2 


197 


AAAS THHARLFLC VCLF AYLLVE ME FHH~ 
VAQAGTELLDS SNPPPLASQNAGI IGMS 
HHTQPQYIC 


8845 


22746 


A 


8909 


1 


127 


KKTRLERAQWLTP VI PAFWNAEAGTLLE ' 
ARSLRRLPWPPNVL 


8846 


22747 


A 


8910 


368 


280 


A 


o<$47 


22748 


A 


8911 


355 


211 


QQQNLHLFWPGTVAHTRNS STLGGQGMR ' 
I TGAYE FETTLGNTARPHL Y 


1*848 


22749 


A 


8912 


325 


190 


KKKKKKGGGGPQKKKKNPPPPPTSKKKK 


"8849™ 


22750 | A 


8913 




411 


VNVPAPRGGAYRGRQASFSSGGLHPVPA ' 
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a m i tin flcid 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

npntirlp 
sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=]soleucine, 
K=Lysine, L=Leucine, M=Methionine 7 
N=Asparagine, P=Proline, 

0=OliitfliTiinp R=A.rtrininp RssRprinp 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 

mirlpfvfiflA HpJpfinn Y=nnc<:ihlp 

nucleotide insertion 














YRPLCLPNQTTNSAMAGAPPLASLPPCS 
LISDCCASNKRDSMGVGP S ETGTGYNLL 
VCHFI1SPLEKCSIRVGVTRPSRCRI1SPL 
SLTKKGNSLTPCTSRVRRCLALLRL 


8850 


22751 


A 


8914 


258 


9 


GAIPCLHCETNPSHISSTQELQTPBPQL 
PGVPLELPPPGS CFKCRKSGHWAKECPQ 
PGIPPKPCPICAGPHWKLDCPTGPRL 


Rft^l 
Oo J i 


997<9 


2^ 






49 


EKI HRKYFCYS PFVFIjVEMG FYFVGOAS 
LKLLASSDLP AS PSQSARVKAMSHCVRP 
VLVI FKNKI LQNTENSTKI YVYPPPNWQ 
ILTFCHFCFK 


8852 


22753 


A 


8917 


3 


106 


GLELPTSGDLPASASQHAGITGISHHAR 
PIMTYS 


8853 


22754 


A 


8918 


320 


171 


AASTTDGSYKCLCLPGYVPSDKPNYCTP 
LNTALNLEKCPFGLPHLSGSS 


QQKA 
OOJH 








1 


ZU1 


VKPSDRYHLMP I PTAAYPHONSTYNVP V 
STRMVMAEEFKQGKRLSLCSDLYRKDLR 
TIVDPWSCAT 


8855 


22756 


A 


8920 


3 


182 


LLRPLEKRGIRVGVSRFSRYHLSRLPFA 
RKGNS PTPCASRVRRCPSIiLHGLHPIiSD 
KPQ 


8856 


22757 


A 


8921 


142 


1 


SCHPGWSAVVRPQLLRRLRYENCLSLGG 
SGCSELGSCHCTPAWTTEG 


8857 


22758 


A 


8922 


1 


117 


LFFNYAWGWSTTLLSRLSLNSWVHVILT 
PWPPKEU3LQA 


OOJO 






007-3 
07ZJ 


1^7 


171 


I/lAFSDGLAHIinNIjKGTFATTiS ET.HCD K" 
LHVDIiENFRLLGNVLVCVLAHHFGKE FT 
PPVQAAYQKWAGVANALAHKYH 


8859 


22760 


A 


8924 


343 


133 


LFFNFNFLETRLREWLI FVFS VETGFP I 
KRAGLKLOAS S DP PARGSOSAEI SGMSH 
YAWPCKSNRAFKCS 


8860 


22761 


A 


8925 


321 


44 


PDSGGSPASASQVAGTTGACHDAWIiMF I 
FNMD TNIKfl P FS AKRDKTiSMDKINK T T F 
SLLINRISSFQIMYFLCSSRIESRKRWL 
KQTGRKKE 


8861 


22762 


A 


8926 


335 


106 


HFVFLIETGFHRVGHAGLELLTSTDPPA 
LAYKS SGITGVS YRSFTKLHLCACVCLC 
IYTYLGLVLSFTPGSWEKKT 


8862 


22763 


A 


8927 


2 


160 


LSRVPPRSANLS IS VFAEKRCHFTMLPR 
PISNSWAQGILLPSQAPKVLGLQA 


8863 


22764 


A 


8928 


3 


140 


RVITEEEKNFKAFASLRMARANARLFGI 
RAKRAKEAAEQDVEKKK 


8864 


22765 


A 


8929 


126 


59 


NSGGGGCNELRSCHCTPAWATE 


8865 


22766 


A 


8930 


3 


104 


KNMACYCRI PAC XAGERR YGTC I YQGRL 
WAFCC 


8866 


22767 


A 


8931 


311 


112 


MENYLSKMCX3EIiEXNITRELKEAAAELE 
SGS lASPIiGSTDESNLNQDLVWKASREY 
VQVLKKNYMI 


8867 


997tfft 




8Q^9 




lid 


GGGSSPWFPPFWRPRRADSLRVGVLNPP 
GPGGGAPFFLKNPLIGWGGGARLWFQL I 
LLRVRPENSFDPGR 


8868 


22769 


A 


8933 


262 


39 


RPRRPHPGNFPFFFNSGSHYVAQAGLEL 
IiDSSDPTLPSQSAGTTGMSYQAWPLDHT 
FENRDCVLIiHCSSPVSNT 


8869 


22770 


A 


8934 


84 


22 


TPAWVWWLTPVI PALWEAKAG 


8870 


22771 


A 


8935 


277 


111 


ERI INHAAGSHGVSG I FMKYDLSS LMVT 
VIEEHMPFWQFFVRLCGIVGGI FSSTG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D==Aspartic Acid, 
E=Glutamtc Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleudne, 
K=Lysine, I ^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=Clu tamine. R— Arpinine S=5?prinA 
T=Threonine, V=Valine, 
W==TrvDtODhan. Y=Tvrosine. 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


8871 


22772 


A 


8936 


2 


106 


DSAI PRRLECLKEDVQRQQEREKELQHR 
YAEFVF 


8872 


22773 


A 


8937 


1 


193 


LMWDFS PSGLDGAFQRGYYWYYNKY INV 
KRGS ISGFTMVLAjGYMLFIYCIiS YKELK 
HERLCKYH 


8873 


22774 


A 


8938 


240 


69 


DSGNQLKRSFSGRARWLTPVIPALWEAE 
VGESLEVRNI IY ILONMSNIGNTHIil T K 


8874 


22775 


A 


8939 


329 


177 


RWNLAMLPRLVLNS WAQAI CPS WPLKML 
GLEAEPSCPARTNFKVTSPRLH 


8875 


22776 


A 


8940 


358 


120 


HPSN FFV KLVBMGFHHVGQPGLELFPAR 
YVPTIiAFRSGGITGISHCAWPKEWALPR 
KAS PGLSGRRGS PHCATCSS KAW 


8876 


22777 


A 


8941 


333 


104 


PHVYSNGHIdiS IIjTEDW^ PAT , WOC VP 
LSI I SMLSS CKE KRRPPDNS FYVRTCNK 
NPKKTKWWYHGKQFEFVITF 


8877 


22778 


A 


8942 


1 


241 


APLVAFTOVNLEDKGGriSKTjVRRTR T MP 
NDRYDEICHHWGGNVTJ3PKSGTRIAKLK 
KAKARELATNWIKCTLLSFLYI KR 


8878 


22779 


A 


8943 


1 


240 


APLVAFTQVNLEDKGGLS KLVEAIRI NY 
bTDRYDEICHHWGGNVLGPKSVTRIAKIxK 
KAiCAKELATNWI KCTLLS FLYI KR 


8879 


22780 


A 


8944 


306 


150 


AASTliCPRIiKCSG A T MAHf*<3 T iNT .T fZ Crm 

PPASASQVGGARGSLEPGRWRLQ 


8880 


22781 


A 


8945. 


284 


2 


GKSRRS PLDLCTSGFTGSTHFTLI ICWV 
PR YVATCPP IGLNFVFI WYTVn a 

DLIOiMVSSNPPVSAjSQSAjSl^MSHHTW 
LENRLLTTKK 


8881 


22782 


A 


8946 


1 


226 


WSFALFAQAGIiEFLTSGDPPASASLSAG 
ITGMSHHAOPAVST.lJc; t PC A55TPPAT Q <! 

YCAKMELKELKFML IVDAA 


8882 


22783 


A 


8947 


363 


96 


NTAPGYMPLLS I P FLVLS KAVFLFLLGM 
FTAALL 1 1 AORWF.O PTCHA CTT) pw tnttmw 

HMHTMEHYSTCCGVHAAVNISIQISEWN 
CWEW 


8883 


22784 


A 


8948 


1 


208 


GRVGO PGLRLLTSGDLPASASOfi AGV AO 

VSHRAQPTPALNCVLTCWVLQHYIKHNH 
GKRSEYFTI FNDM 


8884 


22785 


A 


8949 


69 


298 


SRGHQRVADA7\J^SFPDVICl^T^r J WG<3 
SSIVCILHPJ1KRRVQHI11RTSVSPTSSP 
ESGATKTILLEQERTRSPRS 


8885 


22786 


A 


8951 


2 


201 


LYDLT^mSISLPLPSLVCIHSHTHEHT 
YTHIHTHAKT XTHTHSHWVLSLKQKKMV 
KHRPRLSVWI 


8886 


22787 


A 


8952 


1 


285 


GAS PPASLPPCSL I SDCCASNQRDSVGV 
GPSEPGAGYNLVVRRVLSPSEBCRSIRVG 
VTOFSRCRPSPLSLTRKGNSLTPCASRV 
RQCLTLLWLVH 


8887 


22788 


A 


8953 


341 


256 


I^LGGGGCSEPRWHHRTPAWVTERDFIS 
L 


8888 


22789 


A 


8954 


227 


43 


I LFF IGKHTHTHTHTHTH'I'HTRAKCLFP 
PARFPTSPNSTTNCTFPTEPAANPRGLV 
RAGLG 


8889 


22790 


A | 


8955 


310 


79 


RPRRPGLMERVNVFIFSILRESFYKIRI 
TNQAQCCMPIVPVTQETEAGRSLEPKSL 
RIAWAIQQNSISKKNNNNNFF 


8890 


22791 


A 


8956 


374 


198 


FFFFLPNRHGLAI^PRMVTjHWVQAVLPQ 
PPKVLGLQG 


8891 


22792 


A 


8957 


138 


10 


RPRRPRLCQAQWLMPI IPALWES VAGGL 



1088 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 
NO: of 
nucleotide 
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SEQ IB 
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M 

eth 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
^Glutamic Acid, ^Phenylalanine, 
G=Glycine, BNHistidine, I=Isoieucine, 
K=Lysine, L^Leuciue, M=Methionine, 
N=Asparagine, P^Profine, 
Q=G!utamrae, R^Arginine, S=Serine, 
T=Threonme, V^valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LEARSLRPAWFFFNT 


8892 


22793 


A 


8958 


354 


216 


AGPRVFPGGEGGFRPPPPGGb PPQuPPlv 
GGESGGKPPPPAHWVFF 


8893 


22794 


A 


8959 


364 


223 




8894 


22795 


A 


8960 


259 


11 


FEALKDPSNl^SLFSFrVTVLTVAHVY 
EKLSTLTSLAARRGLAMLPRLASNSRTQ 
T ILLPWPS KVLRLQARATE PGLGLLL 


8895 


22796 


A 


8961 


334 


213 


AASTKNFERMEWGLTTLP^VJjNSWPQV 
ILLPWPPESMRL 


8896 


22797 


A 


8962 


260 


3 


KGGFFPIPFIGIKPGFFFSPALIFPKVF 
I FSQNFFVFLKKI FFFFFVETVS SHWK 
AGLELLASSNPPAFTSQSAGMTGMSHHA 
W 


8897 


22798 


A 


8963 


425 ! 


248 


ADEWIHKMRHIHMIEYYLAIKRNEMLIH 
AtAGMHLENILSEKDHMYDS I YMKCLAW 
ANL 


8898 


22799 


A 


8964 


1 


m 


GKLMELHGEGSSSGKATGDBTGAKVERA 
DGYEPPVQESV 


8899 


22800 


A 


8965 


2 


130 


GRVGPRMAIiNSWAQVIHIiPWPPKVLGLQ 
VAMKRKIMI LSTS Y 


8900 


22801 


A 


8966 


119 


364 


LGWYLIEKIQICCSPCLGADPFQQMISP 
LRTCSLTVLPKLVLNS WPQ VFLP PQTRK 
VLGLQAEVTAPGHI I FMI IMINTFK 


8901 


22802 


A 


8967 


238 


333 


TRCLRLLPRljVLNSWTQALLLPSP PKVL 
GLQV 


8902 


22803 


A 


8968 


297 


169 


NPPPPHIYKFFFIFNIFFFFFFFFFFFF 
FFFFFFFFMSKIIiI F 


8903 


22804 


A 


8969 


2 


212 


RVRPRDRPbVKVRVGKJNKJJGAfcTl"iVSPG 

LLPAHLTFPIJDYHLNQPFIFGIiRDTDTG 
AliLF IGKNMDPRGP 


8904 


22805 


A 


8970 


242 


357 


TI^IIKSFKK3CKKXKKKKKKKKKKKKKK 


8905 


22806 


A 


8971 


191 


265 


ILMPXKKKKKKKKKKKKKKKKKKKR 


8906 


22807 


A 


8972 


398 


97 


MFLYNRlLWFFSPHPPIiRFFFSPRATIF 
LGGGAPHFPPPKKGFFSKKPPGVFLSPP 
QKEKIFS FPPPIjFIjAPPKDFFKS PPPFF 
FFFFFFFFVSTGSSVS 


8907 


22808 


A 


8973 


370 


265 


YNSSIjPoSWD xKHVPPCPDWFCX rbKbW 
SQTPDLR 


8908 


22809 


A 


8974 


3 


119 


G LiKLltTiJ S DP PAiSAS.K/i'ALS ITVi V £i HHTy 

TIFVFSSHKS 


8909 


22810 


A 


8975 


1 


110 


VAQAGFKLVSSANLPTLASQSARVTGVS 


8910 


22811 


A 


8976 


3 


152 


WNQPKYPSTGEWMEKMWYIYTKLVQPMA 
HGPHVAQDGFEGS PTQIHKLS 


8911 


22812 


A 


8977 


375 


193 


FQDLPIiGSTSNISFFKRWSPAMVPRLIS 
NSWPQEILLPQTPKVLGITGMSHCTWPG 
GHISI 


8912 


22813 


A 


8978 


2 


395 


RAALPTQASAMAGAPPPASLPPCSLSSD 
CCASNERESVGVGPSEPGAGYNLLVCYL 
LRLI^ KRSIRVGVTRFSRCHLSPLSLTR 

LSCTHCQTLPSEMNLVPQL 


8913 


22814 


A 


8979 


120 


19 


ARVQWHDHGSTATLNSWPQVILLPWPPK 
VLGLQA 


8914 


22815 


A 


8980 


204 


48 


LNRDKGLTMLARFVRNSWPQAIPLPWPP 
KVLGLQALATAPCQMSFLRAPLS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D»Aspartic Acid, 
E^CIutamic Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=Glu famine R=A winine S=Sprino 
T=Threonine, V=Valine, 

W^TrvotoDhan Y=Tvr«v;inp 

X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


8915 


22816 


A 


8981 


335 


221 


FFFLFLFFCMITSHCSVNVLGASDSPTS 
VSHIAGTIGT 


8916 


22817 


A 


8982 


360 


150 


LSHMPLLFVSLYPDHRAFMOVT PPPT7RT 

ESPSVTRIiECSGAIIiAHCNVCLPGSSDS 
PASAS YAGYSFYL 


8917 


22818 


A 


8983 


346 


116 


GQGRGVETESCSVTQAGVQWCDHHSLLS 
LELLGSSDPSTSASPVTGTTGGHYHTSS 
HFLFS VS SSYFPTPVCLAYR 


8918 


22819 


A 


8984 


350 


263 


LPMLPRLVSNAWPQAI FLLWPPKVLELQ 
A 


8919 


22820 


A 


8985 


3 


159 


EEYNLLVCRLLSPLEKCS 1 GVRVTQFSR " 
CHVSPLSLTRKGNSLTPCASWVR 


8920 


22821 


A 


8986 


2 


253 


KHWGELKNDLS I CGTLHS VDRYLNI KL 
TNISVTDSEKYPHMLSVKNCFIQGSWR 
YMQLPADEVHKQLLQDAARKEPIiQQKQ 


8921 


22822 


A 


8987 


2 


115 


GARI VGHLTHALKOGEYGLAS T CNfififif: 
ASAMLIQKL 


8922 


22823 


A 


8988 


3 


159 


GFYHIGOAGLKLLMSGDLPASASOGAfi t 

TGLSHCAQAGI ffflp ydi ftvf 


8923 


22824 


A 


8989 


1 


123 


shyvaqagpkllgssnppasasrsagit 
gvshhawppslfl 


8924 


22825 


A 


8990 


462 


303 


LARTPSRPTRPPTRPPTRPPTRSPTRPP 
FMI*SVDCLLiLY 


8925 


22826 


A 


8991 


382 


178 


ppfffyffffsfffffffffqkgfkgff 
pqkffffffggkpvknfqkfflrakgkg 
gekkppqnkkvf 


8926 


22827 


A 


8992 


532 




RDAGTiTMT • prt .v^ro*; wanvT.T .t . dtsd dw 

*VLy-rs»j u x vxxjtr cvxj v o FI AW v -LI J-iUirVJ Jr f tV V 

LTLQV 


8927 


22828 


A 


8993 


418 


172 


KKGKKKKKNIFLIFFNYFFFLIFFFIFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFGGAWSLFRGGDSGEVKGTDSMS 
RQCFQGF 


8928 


22829 


A 


8994 


2 


722 


AVRLNI S YP PQNLTMTVFQGDGTASTTL 
RNGSALSVIiEGQSLHIjVCAVDSNPPARL 
SWTWGSLTLSPSQSSNLGVLELPRVHVK 

TGKMRP I S G VMLGAFGGAGATALVFLS F 
CI I FVWRS CR KKS AR P AVfi VfHTTOMR n 
ANAVRGSASQGPLIESPADDSPPHHAPP 
ALATPSPEEGEIQYASIjSFHKARPQYPQ 
EQEAIGYEYSEINIPK 


8929 


22830 


A 


8995 


498 


323 


EPKAELNEGDIAWHIKREEGWFKGTLQ 
RNAKTGLLPGSFVENI 


8930 


22831 


A 


8996 


21 


511 


LI IDGVEAYAIiNASGWNT I VFDPTCRWa 
LFRSFKAVKEKLDTRRGSNSELETAVKD 
LGKAVS YKGMYGDVAI WYSGQ YVENGV 
KKNFLPDNTMVLGNTQARGLRTYGCIQD 
ADAQREGINASARYPKNWVTTGDPAREF 
TMIQSAPLMLLADPDEFVSVQLA 


8931 


22832 


A 


8997 


178 


306 


GAINAINWPPFNNVGMGKTLGFPNLMAL 
KQIPFSPVGEKTKRR 


8932 


22833 


A 


8998 


351 


211 


SIQSLRMQPVPCYISGRHHHYHSHHHIH 
HHHYSSYFIiKYPQEYLRPL 


8933 


22834 


A 


8999 


1 


381 


FRLFKFALMELCS IEGISDQPLFFVFGF 
FFFGKEASFGPQGGREGGKFKLMEPLAK 
GVKGISCPKPPHPCLNPPECPKRGGESSP 
PEGPKKTGAPGPAKKGLFFLTKGKNPWP 
GPGKTRKKAVFPKNF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQrn 

NO: of 
peptide 
sequence 


M 

etfa 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino aciu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

IC9IUUC UA 

npntirfe 

sequence 


Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F-Pheoylalanine, 
G=Glycine, EHHistidine, I^lsoleucine, 
KNLysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 

0=Oliitaminp R=/Vroininf> S=Serinc- 

\^^VIIUulllllliC| AY. <rm~I gill Hl\,y U t_7v>l lllCj 

T=Threontne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

V~T InLnnwn *~Cfnrt rndon /=no«ihlp 

A vUlUIUTill) kJlvJJJ tUUUli^l JJU>S1U1C 

nucleotide deletion. V=oossib!e 
nucleotide insertion 


8934 


22835 


A 


9000 


173 


399 


IKSEWAAKKKKKKKKKKKKXKKKKKKKK 
KKIFFFGGGRFFFFWGGDYFWGGGGVKK 
KGGGKKNPG PQKKNGGGKK 


8935 


22836 


A 


9001 


2 


81 


EFTPAVHASLDKFLiASVSTVLTSKYR 


8936 


22837 


A 


9002 


3 


835 


DAVRANDDLKENIATVERRNNLLQAELE 
ELRAWEQTKRSRKLAEQKLIETSERVQ 
LLHSQNTSLINQKKKMDADI^QLQTEVE 
EAVQECRNAEEKAKKAITDAAMMAEEIiK 
TrpnnTQaTTr.RPMTfTTMMRnTTTrnT.OWTJT.n 

]\J-i\£LJ X Onfl Lt a RniuUxl lay 1 J. XULJ J-f>^XlXvlXL/ 

EABQIALKGGKKQLQKLEARVRBLENEIi 

EEDRKNLLRIjQDLVDKLQLKVKAYKRQA 
EEAEEQANTNLS KFRKVQHEIiDEAEERA 
DIAESQVNKLRAKSRDIGTKGLNEE 




09 Rift 


A 






zou 


RGFRGHKFLEPPGFPGFPPIMGPPSPPF 
GSPPQKNFPLQPKPPLGFKPPP 


8938 


22839 


A 


9004 


2 


108 


LSAYQGTPIiPAKIIiDWQALNYEIRGYVI 
IKPLVWV 


8939 


22840 


A 


9005 


3 


158 


SSTQGHKHCGRPQGPLPRKTRDLCS PVY 
LLTFSPLLSYDPVTSPSPRNTOE 


8940 


22841 


A 


9006 


3 


370 


LLLTGIiNRLTTDLYSLYI FTTTQWGSLT 
HHINNIRPSFTRENTLMFIHLCPILLLS 
IiSPDI ITGI FS YTRHYVNSSTYLETNER 
GGSYKlJRIiC3GPESKGLRIKDDSLiCWGPIj 
HDTSAYYENK 




99SM7 
ZZo**Z 


A 




nil 


7£ 


KLFNVGGGGLDKPPPPKKFFFFKKNGLF 
FFYPLKKKKNFFFPRELLWGPPRFFI KT 
PPPYFFFFFFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFFVMIS 


8942 


22843 


A 


9008 


1 


87 


KMLKKPKFELGKLMBLHGEGSSSGKATG 
D 




ZZIW* 


A 




«*ZU 




RFYFPKPRKRGGGGVFFFFPPKKKFFFL 
NPPPFFFFPPPKKKKNI FFP PRKGGRGS 
KVQTMGPPLFFFFFFFFFFFFFLFFFFF 














LAYQIENSLIiKLDIjQRQFCLPIjPGRV 


8944 


22845 


A 


9010 


2 


74 


LHRLGGPEAGLGEYLFERLTLKHD 


8945 


22846 


A 


9011 


33 


132 


IDPRCRNSARAPLSLAGPHPGMGDASNH 
MGQM 


8946 


22847 


A 


9012 


83 


607 


RRSGTPCGYNVTSQNGTI YS PGFPDE YP 
ILKDCIWLITVPPGHGVYINFTIiLQTEA 
VNDYIAVWDGPDQNSPQLGVFSGNTALE 
TAYS STNQVLLKFHSDFSNGGFFVLNFH 
AFQLKKCQPPPAVPQAEMLTEDDDFE IA 
ITGEGKPTRMPQSRKKSCHI ISVLISAK 
LNRAFL 


Of**/ 


zzo^to 


A 


znju 




164 


PPPPPPFLNPAPGI FFPPPLGG IGARPP 
PPSFFFFFFFFF 


8948 


79X4.9 


A 


9014 


74 


180 


UjSTYVGRLSARPKLKAFIAS peyvnlp 
INGNGKQ 


oyny 




A 




too 


117 


HVGQAGLVLLTSGDPPTSASQSTGITCV 
SHRAQPLRSFSFVLSLLQKRIVLC 


8950 


22851 


A 


9016 


149 


13 


DERGGLQAVAHTCNPSTLGGRGGWITRL 
GDGDHPS CRXLAGDASQ 


8951 


22852 


A 


9017 


20 


107 


FDSTADSDQVNPIQGLASKWDYEKNEWK 
K 


8952 


22853 


A 


9018 


448 


239 


VDHQQVIWNRERISNSQNGIVKEIKGAD 
TFIFGHTPAVKPLKFANQMYIDTGAVFC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

fog to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A~Alanine 
OCysteine, D=Aspartic Acid, 
E^GIutamic Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

0=Glufcamin<*_ R=A raininp ^=Q&rin<i 

T=Threonine. V=Valine 
W^Tryptophan, Y-Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion \=no«ihlp 
nucleotide insertion 














GNLTLIQVQGEGA 


8953 


22854 


A 


9019 


3 


209 


CPPLSPGCTNSARAEPSVHALPDALNNL 
RTHEGSGDGPSSSVDWNRPEDVYPQGIY 
VISAPSIYAREA 


8954 

• 


22855 


A 


9020 


Sftd 

Jv*r 




xrtuiio I o lAK/loANcioWAi v HvaKKKr V 

ASRQASQMPLCWLEEAIVRRWTLPSKA 
RFSFOFJlP^AWGNf^WTflCfipMATnrST v 

EVQEAVMIilEAGLSTYEKECAKRGDDYQ 
EIFAOOVREThfflRRAAGTiffPPAWAAAai? 
ESGLRQSTEEEKSDSRPAAGS 


8955 


22856 


A 


9021 


3 


168 


LPRLEAAARIRHERRRRRRRRPRRRRi?!? 
RERERERERERERERERAPPPTLCET 


"8956 


22857 


A 


9022 


2 


89 


LCSLVYLLTFP P LLS CDP AKS P Q ARNTO 
E 


8957 


22858 


A 


9023 


508 


371 


FFCRDKVLATLPRLIaANAWPQVIFLPQP 
P KVLRLOPCTT PSTjKDTi 


8958 


22859 


A 


9024 


2 


170 


RTHSAGRPYC3CNQCEKAFRHSSSLTVHK 
RTHVGRETIRNGSLPLSMSHPYCGPLAN 


"8959 


22860 


A 


9025 


3 


IKS 

1 O J 


P PAATPOfl PQPTNQP ptrrtta vxr jtwi?'p v 

c tmm. tr^\?tr&t? xvtartr XJ\AJvfVRJMISxMKxI\. 

KTKJCKKKPPPSKKKGPQKTPKKTMGGPL 
KTTT 


8960 


22861 


A 


9026 


104 


319 


LVFLSKYTPPLLVSLWIHGIiSLLSFLPS 

LPSFT.PS LLTYTiIj P <5 RT iP^ RT .T C VTM DT 

PSLFFLVFKLSQFWT 


8961 


22862 


A 


9027 


476 


364 


SLKRLTSGDLPASASQSAVITGVSHRAR 
PIIYFKYVQ 


8962 


22863 


A 


9028 


36 


154 


FLVGQAGLKLLTSGDLPASGSQSARITA 
VSHRTWRTHYF 


8963 


22864 


A 


9029 


2043 


1237 


WIJ?TRYAPAJ^ERIX5ACTQLGPVLPCSQ 
PYWCRQCPEYRRQAAQPPHCPAPEGEP 
GAPQALGDAPSTSVSLTTAVQDYVCPLQ 
GSHALCTCCFQPMPDRRAEREQDPRVAP 

FCELNLGDKCLDGVIiNNNS YES D I LKNY 
LATRGLxTfJFJJMLTESIiMAIiQRGVFLLSD 
YRVTGDTVLCYCCGLRS FREI»T YQ YRQN 
I PASB LP VAVTS RPDCYWGRNCRTQVKA 


8964 | 


22865 


A 


9030 


15 


389 


IVDHYPCGGIPTSGSVTQAGAQWHNLGP 
LQPRPLGLEGSSCLSLICSWGHRHAPPH 
IiARGHFS SEVEOLF^RIiOHGAVRVKHRK 
GTPEDPRC^LRTLISSCLLHLVZUiAIiWF 
SLPFHGPRIHMQ 


8965 


22866 • 


A 


9031 


3 


243 


RHSLAFNRFSCLSLPSNWDCRHPPSCPA 
KFCTFVEMEFHHVGQAGLELLTSGDLPT 
IiASQSAGITGVSHHAWTRCCCCF 


8966 


22867 


A 


9032 


1 


245 


PTRPAG IAI CICMCMKNHRATRVGIIiRT 
THINTVSS YPGPP PYGHDHEMEYCADLP 
PPYSPTPQGPAQRSPPPPYPGNARK 


8967 


22868 


A 


9033 


46 


124 


QELMTHG AKS PDGTRVHNAFL FVTTV 


8968 


22869 


A 


9034 


235 


435 


KKKKKKKKKKKICKKFJUvPFKLKGPFKPP 
IiKPOGGLFEAPP FGG P P P Pfi RRRA^nn a 
PPRGFFKKKGK 


8969 


22870 


A 


9035 


333 


70 


CPIYQIPILKDQVNLSSISSPFSHLIKD 
LWKTLFISYITELIFVCFVEMMFHHVAQ 
AGLELLGSSNPPTSTFQSARITGISHHA 
CLNI 


8970 


22871 


A 


9036 


297 


411 


NKTMS FAATWMQLE VILS KLMQEQKTKY 
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sequence 


M 

etfa 
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nucleotide 

location 

correspond 

ing to first 

21 111 1 1] U <SC1U 

rrciHn^ nf 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Wsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaJine, 
W=Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














CMSSLISGS 


8971 


22872 


j\ 


Grin 


AAA 
401 




NKNYPLIiGGPPFGSPSPENLNGGFPI/TP 
GGGGPINPNSPPPPPPGGRRPPFLKKKK 
KKKKEKETNTY I IiQDTCDRCS AVLP IMQ 
VKTVMICIiIMHRIiF 


8972 


22873 


A 


9038 


466 


292 


NGCLLEPASWNSLAN I FIFCREEGLTML 
PRPQVTLPPLPPKVLRLLPPPLSS 


8973 


22874 


A 


9039 


34 


124 


GIRNIKFSSLSTFIAHKKYYCSSHAAEH 
VK 


8974 


22875 


A 








RRRRKLRRWGLAMLPRLVSNSWHQAILW 
P PKVLRLQ ACATT PGLTRR WLHGQ I WS P 
GAGSV 


8975 


22876 


A 


9041 


469 


223 


PWARVCVCVCVSVCVCACVRVCARASES 
QTCAAVP SLGPAGAGRPALGCCAVCEDE 
VGSCPHARPLPRAAVDKAAGRRGIjCF 


8976 


22877 


A 


9042 


491 


380 


SPGWAHSYNPSTLDGRGSHITRVRKFM 
TCLATEGAV 


8977 


22o78 


A 




1 A A 
144 


30V 


IHSHWGYWLDVSLIAAHLAINPDCFMYV 
YIFSQCTDKLTEKEFIEGTLANKEILRI. 
IQFEPQKVKEKMKNA 


8978 


22879 


A 








RWNSDGKGSLRPAWATWPDPVS IKNTQN 
THTTHPTPHSPHTPHGQQPFLSPTPFPW 
PVSSPSPFPPFSHTCLSCLLPPGA 


8979 


22880 


A 


9045 


461 


287 


PPSLANIYLFIYLFIGRDGVSLFPRLVS 
NCWAQAI LIjPWLPKVXiGIjQGE P PNLALK 
SXi 


OVoU 


ZZool 


A 




*tO J 


< 

w 


NNKNSWAWWRVPVLPATREAGAGESLES 
GRRRLR 


8981 


22882 


A 


9047 


475 


307 


LFLRDGSLTVLPRLVSNSWPQAILLPWL 
LKGLGLQARATLHMACS S C INS TQNI LQ 


8982 


22883 


A 


9048 


483 


338 


LREGERQREQERERERERERBRDRERAH 
TTAFTQLLFQ YVAI I LLPCY 


8983 


22884 


A 


9049 


680 


303 


NDtiLFGSLYVKQFAASGTGRVTLPEPSE 
EQWCLRCFQDLFGDDHNKNGFKMFDSS 
NYHGQDLLFKmTVRAVPVGEKTTYRGW 
LGLDYVAALEGMSSQQCSGAGRTGPPCR 
LPWCYSDPFTPEP 


8984 


22885 


A 


9050 


26 


165 


AFGYHRVGQAGLELLTSGDPFASASQSA 
GITGVSHHARSQYFFVYV 


8985 


22886 


A 


9051 


461 


83 


HKINLPSHIOCKKKiCXKKKKKKKKKKKKK 

K , 




22887 


A 


9052 


475 


209 


MAAPPKKKKKKKKKKKKKKKKKKKKI 


8987 


22888 


A 


9053 


531 


404 


SVTQAAMQWQKHNSLQGSTLEFKQSSLL 
SLPTIWDYKHADLS 


8988 


22889 


A 


9054 


66 


416 


KXKTNKI KMGGGPLP FKISPPFLKKTIjS 
WKKGAPKTPPLEMAPFFFPQGGKKNSPF 
KGGKIFFFFFPKKTFIKGPPGGEHTPPL 
GGGPQ I FS L P PPKGVPQKNPP PLLRGGL 
FFTI 


8989 


22890 


A 


9055 


1 


104 


II1AVNGETI1EGVTHEQAVAILKHQRGTV 
TLTVLS 


8990 


22891 


A 


9056 


3 


106 


LKRWGIAMLTGLVLNSWAQVILPSWPPE 
VLGLQV 


8991 


22892 


A 


9057 


1 


299 


PTRPINIILAFTISIiLGILVYRSHLISS 
LLCLEGIILSLFIIATRIYPIAPPLPPL 
EPEKKKKKKKK3CKKKKKKKJCKHKKOTKK 
RKI KAKRKNliGGGGL 



1093 



WO 01/64835 



PCT/US01/04927 



SEQTD 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, HNHistidine, I=Iso!eucine> 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W»Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 


8992 


22893 


A 


9058 


482 


146 


LTSS KK3CKKKKKKKKKKKKKKKKKKK """ 


8993 


22894 


A 


9059 


2 


290 


YFLFAYTILRSVPNKLGGVLALLLSILK 
KKKRGGPFKRTLGGPKFNRGGQGNI FFL 
MGGGLKPNLGIIXSRNLSLGGGKNWPNLP 
PKI KGFRENKNF 


8994 


22895 


A 


9060 


413 


48 


PQKKGFFQKNPKGFLKPPPKKKKKKFPP 
PGKIWPPQKIFKKPPPPFFFFFFFFFFF 
PKFFFFFFFFFFFFFFS IRNPPSS YTKW 
LRSLDVCHLLP P I TPQVPGPTPVLLFST 
LFDCNHSLL 


"8995 


22896 


A 


9061 


2 


274 


FTISLLGILVYRSHLISSLLCLEGIIIiS 
LFI IATLITLNTHSLLANI VP IAILVFA 
ACEAAVGLALLVXKKKKKGRPFKRILGG 
PQINPG 


8996 


22897 


A 


9062 


42 


255 


QEFRTRRERERERERERERERESSGWR 
AKFRSNFSAGAIAHTIRVMLPPSGIYSN 
EKS INKCGFALLFF 


"8997 


22898 


A 


9063 


20 


413 


YALRNSARAGDPHGQLGRRDGAVGGRGW 
LMPREALHP WWANVQAWAGS PGLFLGAP 
WPQGGVWLPWLCVSMSASSSS INGLPPQ 
KKKKKKKKKKKKKKKKFWGGGPGPQSPF 
WGGLKKFGGRFIiKRGGGEN 


8998 


22899 


A 


9064 


6 


306 


ECEHS TTYSHFAGGGGPVMVGGDADARS 
KALLGVCVGSGTEAYVLVLDPHYWGTPK 
SPSELQAAGWVGWQEVSAAFDPNSFYNL 
CLTSLSSQQQQRTLD 


8999 


22900 


A 


9065 


463 


126 


AHYQL VYTCKV CGTRSS KR I S KLAYHQG 
WIVTCPGCONHHI IADNIiGWFSDLNGK" 
RNIEE ILTARGBQVHRVAGEGALELVLE 
AAGAPTSTAAPEAGEDEGPPSPGBCTEPS 


9000 


22901 


A 


9066 


23 


244 


SCHAGIRHEERERERERERERERERERE 
REREREIYFLALFSQKGRVHGFFKTPRG 
GQKRELLSNRVNTLCTE 


9001 


22902 


A 


9068 


396 


143 


KDIQNFGQAWWLTPVI PALWEAEAGGSR 
GQEIETILVNMVKPHLSVLLAKNSSSHC 
PMCWLAELQWYDLCSFQKI FFGI I LI PG 


9002 


22903 


A 


9069 


828 


99 


VDREPVDREPWCHPDIiEERLQAWPAEL 
PDEFFELTVDDVRRRLAQLKSERKRLEE 
APLVTKAFREAQ I KEKLERYPKVALRVL 
FPDRYVLQGFFRPSETVGDLRDFVRSHL 
GNPELSFYLFITPPKTVLDDHTQTLFQG 
E P LPGRSGCNLGAEE PAGVYLE PGLLEH 
AISPSAADVLVARYMSRAAGSPSPLPAP 
DPAPKSEPAAEEGALVPPEPI PGTAQPV 
KRSLGKVPKWLKLPASKR 


9003 


22904 


A 


9070 


20 


427 


DGTRRERERERERERERERERLYLTEDT 
LHSLSIjEGARSGASVCVQSVYPTGASAP 
HS IINGHKRVLVLCVETEHTFPLAEKDR 
AFLGDTSVFVGAAPVGGANTGASAGGAP 
PSRGEPKEESGfiSHEDMGFGLFV 


9004 " 


22905 


A 


9071 


297 


126 


CS I YDKRI PTFGLGEGVSERQRLTLLLP 
CLPPALTETPTSSDWQSTDATPTLTOSS 


9005 


22906 


A 


9072 


117 


286 


WLRGSEKMFQVQEINEQYLQCILIKGI 
FLVKKKKKKKKKKKKKKKKKKPRPEYIK 


9006 


22907 


A 


9073 


3 


302 


PPMETLRQWTTXVHSQEEEEEHFEVIKD 
EVKVA^ARKHGQPGTPYAIATQLPPRTSA 
AFPAQQQPLQPLAPTTVLSVSADWNALL 
QRPNKYI WGLQYL EM 
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SEQID 
NO: of 
nucleotide 
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SEQID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
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09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Phenylalanine, 
0=Glycine, HHHistidine, I^lsoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagme, P=Pro!ine, 
Q=G lutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /=possibIe 
nucleotide deletion, V=possible 
nucleotide insertion 


9007 


22908 


A 


9074 


437 


79 


NQKREEEEERRKKRRRRRRKRRRRKKKK 
KZLRQKKXKRRKAGSREKKKILRHLPEK 
NKRSKKHYELINALSRLQDTRLIYQKMH 
FYIIiAKNNWKIKLKIQYHLQALENKVPR 
HKWBTYA 


9008 


22909 


A 


9075 


51 


185 


CRADFMFFHDKI IVYLCSLVYLLTFPPL 
LS YDP AKS PSARNTQE 


9009 


22910 


A 


9076 


452 


310 


QFSRCHLSSLSLDRKCaJSLTPCTSWVRR 
CLALLRLTLGALHLLSDNPO 


9010 


22911 


A 


9077 


438 


326 


LLHCWWDCKLVQPLWKSVWRFLRDLELE 
IPFDPTLHF 


9011 


22912 


A 


9078 


37 


169 


GIPGFRHAGQRRLQIjIiITGDLPTSASHS 
AGITGTSDRARPPSTF 


9012 


22913 


A 


9079 


1 


408 


RKGRLSKEEIERMVQEAEKYKAEDEVQR 
ERVSAKKALESYAFNMKSAVEDEGLKGK 
ISEADKKKVLDKCQEVISWIiDANTIiAEK 
DEFEHKRKELEQVCNPI ISGLYQGAGGP 
GPGGFGAQGPKGGSGSGPT IEEVD 


9013 


22914 


A 


9080 


31 


168 


VRRNPRLE HTVT I TAYCSLE LLGS SNLP 
ASASQVARTTGLQLLNLQ 


9014 


22915 


A 


9081 


460 


319 


YFFVEMGSHY IAWAVLELLGS SNPPAS A 
SQSAEFTDAHKPTDCRPLR 


9015 


22916 


A 


9082 


465 


326 


PCVAEFCTHTHTHTHTHTHTFKCVGSHL 
HTNND 




22917 


A 


9083 


33 


269 


PGNSRPrSSVRGIHPPCLANFCFIiNVFV 
EMGSHYWLAGL I LGSSDLSALAVQSGR 
VTGMSHCAWPEIFFIiLENWIII* 


9017 


229 \% 


A 


onsM 


47 
*t / 


300 


DLITSLFLLKQSSLATMNIQTPFKNIYF 
DLIIKTLFLRFFIFCFIYFYFLFFSYGG 
FFSFFFFFLKGSSFItPPRLNGGAKIWVN 


901 X 




A 


90RS 


437 


220 


NIAI KSLSLSLSLS LCVCMCVCVS ACVC 
MCLSLSLFSPSSLSHSLFPFSTPNMDIS 
HLPILSTQENMLLLLQ 


9019 


22920 


A 


9086 


227 


23 


DCPKANILGLTPRIjVSNSWPQVILTPWP 
PKGQSLLCRQGEGDRIWTWHPWWLLHPP 
PQARFPPRTHFH 


9020 


22921 


A 


9087 


206 


326 


PHWAQWPTPVIPALWEAEAGKSQGQEFE 
TSIiANTVKPRLY 


9021 


22922 


A 


9088 


364 


123 


SRWLPLKAPGEDTSLPIiPASGGSRCSL 
AWGSIPPISASIPTWPSVLCPCBIFPLL 
FLLYGDCNIiGPTLIPDDLIiyfRSLP 


9022 


22923 


A 


9089 


373 


205 


MPPRLANFCYFSRDWVSPYWSGWSQTPC 
WDYRREPPFPPCVRTSYSVQYCACYAGL 


9023 


22924 


A 


9090 


283 


2 


REFSFSAWLAFKIFWKLFSPPFWRVRP 
EDWF I PGGCGPNEPNFPPCPPGV GKKGE 
PRFKKKKKKNFI S SQAQWIiMP VI PAFWE 
AKAGGSLEP 


9024 


22925 


A 


9091 


16 


261 


CPEDLLVGQNVKVEDSDMGDPDRRPMCM 
FACFYLLVYRRCLAMLPRLYLNSWAQVI 
LLPQPPASLELLQAHATTHPALCLSF 


9025 


22926 


A 


9092 


17 


109 


RQGLAMLPRLISNSWTQANLLPWPPKEL 
GL 


9026 


22927 


A 


9093 


282 


114 


KKGGKGALLVFLKTGRKPFFFFIiRQGLA 
MIXRLVI^SWAQAILLPWPPKVLGLQA 


9027 


22928 


A 


9094 


205 


337 


NANLHGRMQRLTPCLPEIWEAETGGSQG 
QEIETILANMVKPCLY 


9028 


22929 


A 


9095 


62 


287 


EHSCAKLFCFAFESGPYSVAQAGVQWHD 
HSLLQSQTPGLKQSSCLSFPASAGFYKE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQED 
NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!a nine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
OGIycine, H=Histidine, l=Iso!eucine, 
K=Lysine, L=Le urine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Gtutamine, R=Arginine, S==Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X ra Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














SPS FYKDKSCHHLGKHSGA 


9029 


22930 


A 


9096 


170 


9 


AASTGYCIGRFCTKLKIQSGGQVRWLSP 
WP KREAEVGGLLEARS SRP IWATW 


9030 


22931 


A 


9097 


2 


104 


RRPGGRAEPRARVRGDRDPPVLLRHYPH 
IFEECG 


9031 


22932 


A 


9098 


350 


142 


LCLSCPKGGGYRAMAPWRGNFCSFGKNG 
GFPMLGRWLKWPQMGPLPQPPKRVGFQ 
GLSPPPGPPPFLV 


9032 


22933 


A 


9099 


494 


342 


YFYFYFVEMVSHHVTQAGLELIiASSNPP 
ALISQHAVITSGSHHTQPKIiYF 


9033 


22934 


A 


9100 


30 


232 


GGIMLCHGRCACFPFAFCHDCKFPKVFQ 
PCFLYS LQNCS FLYKI PSLRS CRLQLAE 
LLSIIYLENPL 


9034 


22935 


A 


9101 


26 


171 


RNSRRIiRHKNRLSLGGGGCSEPRSCQCT 
PAWQTARLSLKNKKQQQKKP 


9035 


22936 


A 


9102 


486 


239 


HHTCPHQVCTCVCVCVYRVCVCVCVCVY 
IYMYMCIYIYIYI YICI YICRERERBRE 
REMWFHHVAQAGLELLGSRDLPAPAS 


9036 


22937 


A 


9103 


487 


344 


TGFHHVGQAGIELLTSSDLPASASQSAR 
ITGVSHHAQPLFMI FCAKC 


9037 


22938 


A 


9104 


470 


221 


TPGLKRYTCLSLPKCWDYRCEPHARQGA 
FLCAFSPLSSQQPWGKRPHFHFSYETNK 
VERGKRLPLRSPSTEQGFDFRCVLFQT 


9038 


22939 


A 


9105 


468 


259 


YVFFCRDGGLTISSRLVLNSCPQQSPCL 
SLPKCWDYRHEPPCPSCCSNIiLLMFHFT 
NVFLIDSECSHFTP 


9039 


22940 


A 


9106 


466 


329 


RVAGTTGTHHHAW L INI PSVETGFHHVG 
PQVIQPPRPPKALGLQA 


9040 


22941 


A 


9107 


409 


306 


AGFELLASSDLPASASQSAE I TGVSHHA 
RPSLIF 


9041 


22942 


A 


9108 


417 


269 


MWAQADMESSKEPGLFNWI INDSLVQ 
AYPELKEALYEEIKKAQRRGA 


9042 


22943 


A 


9109 f 


533 


399 


RGSLRHETRLNAEARRCSELNSRQCTPA 
WATBRDAVSKKTKQKA 


9043 


22944 


A 


9110 


16 


239 


LVHFEVRWNSHEVSLTADSRKSHLIFPQ 
PNYADTLINQESYEKSEPLLITQDLLBT 
KGEPRQLQVSFFP PKREE 


9044 


22945 


A 


9111 


68 


219 


EPPHLANFLQRLVFTMLLTLILNSWPGV 
ILPPGPPTVLGLQDKLPVLLPS 


9045 


22946 


A 


9112 


192 


57 


DRVPLCPPDWSAVARSWLTAASVSQVKG 
ILSPQPTECAPPGNST 


9046 


22947 


A 


9113 


21 


201 


TIVSWEFMNLGGGTCGEPRWCHCTPDW 
GTERDSVSGEKKEDKDVARRHWVGSFYN 
GPQ 


9047 


22948 


A 


9114 


2 


144 


TBTIDRCAKDLFVAKQVGTKLSVNEPLS 
FSVESILKRPSSAITRVSQ 


9048 


22949 


A 


9115 


156 


316 


MILNKKKKKKKKKKKKKKKKKKKKKKKK 
KGGGPFKKTIX3GAKITPGWKNYFFFFMG 
GV 


9049 


22950 


A 


9116 


1 


84 


TKPTPFLPTLITLTTLLLPISPFILIIL 


9050 


22951 


A 


9117 


1 


176 


HLNLGGGVCSEPRSCHCTPVWATEILSQ 
KKKKKNGFKFWEPRPQKEGGGGRKKPPG 
VP 


9051 


22952 


A 


9118 


1 


184 


AAAG Y PGQQQNFHS VREMFES QR I GLNN 
SPVNGNSSCQMAFPSSQSLYRTSGAFVY 
DCSKF 


9052 


22953 


A 


9119 


69 


374 


1 1 FSKKKKXKKKICKKKKKKKGGPPKKKP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

tng to first 

residue of 
peptide 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

I Colli UC Ul 

peptide 
sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
&=GIu tamic Acid, ^Phenylalanine, 
G=Glycine, HHHistidine, Msoleucine, 
K=Lysine, L=Leucine, ^Methionine, 
N=Asparagine, P=Proline, 
0=T2 lu mine. R=Areinine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X— TTnknnwn *=Stoo codon. /=DOSSible 
nucleotide deletion, V=possible 
nucleotide insertion 














WGGPKIFVLRRVLSPFIRGEKKNTFGDF 
LKKTFFWGGEKLGKPPPKKKTPWGKKKI 
FKGERGKKPPFLGGKKK 


9053 


22954 


A 


9120 


405 


93 


EPSPLIiLTTNQKRGPFPPSPPKI FFFPK 
RIYFLGGGGPHFPPPQKRGFPQKNQKGF 
FFSPPKEKKYNLPPPGKFCPPQGFFFPP 
PPLFFFS FFPFFFFRCQIK 


9054 


22955 


A 


9121 


2 


380 


WDYRCPPPHPTNFCIFSGDGVSPHWPGW 
S WPQVI CLPQPPKVLLTPWVRDRTRDLE 
GGDIiWYGACLGACLHLGPQAASFGHGII* 
GPASLLCCIPKHVHMDTLSPGVKLQSCD 
TGVALRVGEKREGS 


9055 


22956 


A 


9122 


2 


534 


DMIREQKIYHKYIiAQRREEEKAQEKEFD 

VDEVMCTRKLQVQEKLQREAKEQEERAM 
EQKHINESIiKELNCEEKENFARRQRLAQ 
EYRKQIxQMQIAYQQQSQEAEKEEKRREF 
EAGVAANKMCLDKVQEVLSTHQVIiAQNI 
HPMPOGMPO 


9056 


22957 


A 


9123 


389 


46 


GPNFLPPPIFGFFSPFSPLKFFFSPKTL 
IFLGGFFPIFPPPKKSFFPKNSPGVFFC 
PPFKKKNFLFPSPLNFAPPRVFFKPPPP 

VW 


9057 


22958 


A 


9124 


422 


59 


AQQKKKKKKIKKKKKiaCBCKKKKKKKKKra 
K 


9058 


22959 


A 


9125 


333 


168 


KGPPCIVVCMVEFHKLEAGSHSVAQAGA 
QWCDGNLLQTLTLGLKRSPHFS LPSR 


nncn 




A 
£\ 




1 ID 


oy 


KPPPHFFFFFFFFFFFFYFFFFFFFIFC 


9060 


22961 


A 


9127 


368 


250 


CFNYLSRDRGLPLLPGIiVLKSWPQAILL 
PQPPKMVGVQG 


9061 


22962 


A 


9128 


392 


157 


FFSPKRLNFGGGGGPKMPPPKKKVSPQK 
PHPGFFS PPQKKKKKFFPP PVKMGPPKS 
FLKSPPPFFFFFLISS IRGLFL 


9062 


22963 


A 


9129 


396 


13 


FPPPPPPIFFFSPRRFFFRGGGSVALSP 
PPKVFFLUttPPPVFFFSPPFKEKTSFFP 
PPFFCPPPQILFLAPPPYFFFFFFFFLF 
VF FFLFFFFLLPLLFLGKTS FT .T J ,LFLF 
IYLETGSCSVAWARVQ 


9063 


22964 


A 


9130 


55 


180 


LLYNKLIFI'LKKKKICKKKKKKKKKKKTO 
KKKKKEKYEKTKAY 


9064 


22965 


A 


9131 


2 


95 


SFVHMEHEKAVLLiaCSFQNTVDIiGIQRE 
LTV 


9065 


22966 


A 


9132 


172 


404 


KKKKKKNKKKKKKKKKK3CKKKKNW 
FKNS WGGPKKRGGGKI FFFSLLGGKKKT 
LGVFLEKPLFLGGRKMAPPPP 


9066 


22967 


A 


9133 


1 


125 


QPVSYPFTI IGQVAYEIjYFTTILILIPT 
MSLIENKILKWAY 


yuof 




A 


yi J*r 


i 
i 




QPVSYPFTI IGQVASVLYFTTILILIPT 
ISLIENKILKWA 


9068 


22969 


A 


9135 


7 


269 


lATGQPRLQKLTMPTI^LKRFLLTKKK 

KKPPGGPKTYPVKKKIFFFFKGKIKKPP 
WGI 


9069 


22970 


A 


9136 


1 


64 


LVSISNTYGLDYVHNLNLLQC 


9070 


22971 


A 


9137 


400 


259 


SQPLRRLRHKNILNPGGGHYSELGSCHC 
TPAWDTEQDSIHSWLLSSV 


9071 


22972 


A 


9138 


3 


J 310 


ECFPNSPNMNGHCNIjIiVQSTSPS ISTMP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/5154 
26 


Predicted* 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine j 
C-Cystei ne, D== Aspart ic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, HNHsstidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glatamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=»Tryptophan, Y=Tyrosine, 
X— Unknown *=Ston cod on /=rm«ihlp 
nucleotide deletion, V=possible 
nucleotide insertion 














CPKWALEKVFLLNVLWFFYLKYRDNRP 
GSVAHTCNPSTLGGRGGWITGAQEPETS 
LGNIARPSLIKRKKYKLF 


9072 


22973 


A 


9139 


34 


142 


NCKKLNTHTHTLTHSHTHMLTHSDTFTH 
THTHTKYK 


9073 


22974 


A 


9140 


146 


2 


PPPPPPEGVSLCGPGWSAVARFRLTASS 
ASRVHAILPPQPPQQLPTRP 


9074 


22975 


A 


9141 


3 


102 


PP AS AS Q S AG VTGMSHHAQ P I E PLRCS S 
HRTT 


9075 


22976 


A 


9142 


245 


51 


NREGDLLCCPRWSQTFGLKQSCLGFPKC 
WEYRRGPPHLAYIMSMRKNILIQMIFYY 
NNPCLLYS 


9076 


22977 


A 


9143 


2 


108 


GAKCVSDRIKHAFLIKEQKI IVKVLKAQ 
AGSQKAK 


9077 


22978 


A 


9144 


3 


200 


MGFLHVGOAGLKLPTSGDI.PAS A550<? A« 

ITGVSHRAWPVSGFFCSEAKECFWEEQL 
XNLDKNSQK 


9078 


22979 


A 


9145 


405 


227 


WFFLGFPLKGKLGPPKMFFRAPRVYSPF 
FPPFFFWPPPPRI IFPPGFFKFFVFIiAG 
PIW 


9079 


22980 


A 


9146 


387 


220 


PLFFVGCFFFFFFFFYYFFFFFFFFFFF 
FFFFFFFFFFFLGGCERNFIIRKNINN 


9080 


22981 


A 


9147 


539 


220 


VINFKHI LIKTVIQSLPFFERGSCLPPR 
LECNSMIMAHCSLNLLGSSHPPASASQV 
AGTTGAYQHTQLIFIFPAETGSCYVAQA 
GLQI»LGS SNPPPWPPEVLGIjQV 


9081 


22982 


A 


9148 


3 


141 


GGGGCCEPRWSHYTPAWATSAKLCLKKK 
KKKRFLEKSCLTPLLKA 


9082 


22983 


A 


9149 


413 


240 


F I FCRDGGLTMLPGLTSNTWS OVILP PO 
P P KVLGLQAYHADELPHLATWGVL YGLT 
V 


9083 


22984 


A 


9150 


395 


193 


LLFGPCLLNLITQFVSCHLIiDIKLQMTL 
SEGYFPLNIQESPFYRAPLDDPSVRHAR 
GEILPISLGPG 


9084 


22985 


A 


9151 


1 


77 


LVAAIVLGGTGWGFKNIRRGRSGGA 


9085 


22986 


A 


9152 


365 


181 


IVDRPDPCRDADAQREVFTLSARYPKIW 
VPPGDPAREFPFIQSAPVMLLAAPDELV 
SAQLA 


9086 j 


22987 


A 


9153 


373 


251 


FKKI F FCRDRALNMS PRLVSNSRTQAI L 
PPQPLKVLRLQA 


9087 


22988 


A 


9154 


204 


95 


DYHWSQPGMVAHACNPSTLRGRGRWITR 
SGDRDHPG 


9088 


22989 


A 


9155 


44 


280 


AELFGTIiMREDGLSLGGGGCKEPKSCHC 
NPAWATE PDCLRKKRKESP YLLWTP PD 
SSYSPFFFLLLPAPDNNDYFLS 


9089 


22990 


A 


9156 


2 


117 


ERLRHENCLNLKGRGCSEPGWCHCTPAW 
ETKQDSISKK 


9090 


22991 


A 


9157 


3 


207 


HASAHASADAWDYRRVPPCPANFCILYV 
SHVGKAGLNLLTS SDLPTSASQSAGITG 
VSHHARTPTYV 


9091 


22992 


A 


9158 


117 


267 


NRVLLCCPDWPQTPGLKGSSLSLLSSWD 
YRHIPPRLAARSSYLGPGFYH 


9092 


22993 


A 


9159 


415 


167 


LFFFPI KNXFFFS P PLFFFFFFFPPPPX 
XXFFFFFFPFFFFFFFFFFFFFFFFFFF 
FFFFFFFFFFFFFFFFFFFFLELKQNF 


9093 


22994 


A 


9160 


217 


41 


SSQIHRDRNRMVRQGAVAHACNPSTFGG 
KGEQITRAQEFKTSLANMVKPRLYYKPC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 

1 1 d o m i n A 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, P=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PNProIine, 

V^— v»iuiainincj iv - /vrgiiwic, & oci iuc, 
T=Threnninf* V=Vnlinc 

A i III vUlllUCj W T Oalllw, 

W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

nucleotide insertion 














LY 


9094 


22995 


A 


9161 


341 


1 CA 

164 


FPPFFFFPPFFFFPFPFPFFFPFPFFFF 

FFFFFFPFFFPFFFFFFFFFPFFFYVCF 
VSVELKKWQNYS FDVRI ILRVEP 


9095 


22996 


A 


9162 


190 


402 


IHTGKHPKCYRNSYSVLFLASKKKKKKK 
KKKKKKKGGGFKKKKNLNRGGGEI IFFF 
WGPKKI FGGGGLKKG 


9096 


22997 


A 


9163 


108 


209 


LS VKRCAKNPKKGKVAPG I KE IRTRSHQ 
DSEDI 


9097 


22998 


A 


9164 


2 


145 


KHCGRPRPQGPLPRKTRDLCSLVYLLTF 
PPLLS YDPAKS PS ARNTQE 


9098 


22999 


A 


9165 


170 


402 


ISRCHFPSLILSYVKITKTIKAPKKSKK 
YVYYYYYYY^YYYYYTCY'YWCXlCl'PVYYKN'P 

LGGGGGE I FFFLGAPKKKRGG 


9099 


23000 


A 


9166 


399 


113 


VTEQDQS SGYEGFQRPSSLS KARI PSVA 
APT? <5MCOP ACVCNI.SVEHI1SL YS CLLAO 
QGKALYKLLNK 


9100 


23001 


A 


9167 


3 


138 


SWITG VCHYAWL I FAFLVETGFHRVGQA 


9101 


23002 


A 


9168 


134 


301 


EKLTQSHSVTQAGVQRLNVHSLRPPPPR 
FKRFSCLSIiPSSWDYRNLETVLQEETH 


9102 


23003 


A 


9169 


201 


l 


CZHT TT»R WYT YKYJWP P YYY KYYY TO? K KK 

GKGWSGTESRWNLSGEGSSGLRRRKLQA 
PAGPQEFEKLA 


9103 


23004 


A 


9170 


2 


191 


PHLANFCGFFFIiKFFFFFFKKKNFFSPR 


9104 


23005 


A 


9171 


2 


191 


GGTE S CFVAQAAVQGCDIiGS LQS P PTG F 
SETASQTVDPDKVKDPGKLRTMFVAELP 


9105 


23006 


A 


9172 


3 


143 


SWFLYVGQAGLELLTSGDPPALASHSAG 
T TGVSHRARIjOTG I FRAV 


9106 


23007 


A 


9173 


361 


174 


VLGI ILSMSETE I FNTKCLDQSGTVAHT 
Y7JP<3TT«ftC10f5fiRVT R AORFETSIiGNTGR 
PHLFKI 


9107 


23008 


A 


9174 


493 


351 


LVETGFNHVGQAGLELLTSSDPPASASQ 
<5 AGTTGVS HIjATjVIiISRKSC 


9108 


23009 


A 


9175 


1 


115 


FRKIWYIYTMEYCIAVKKKEMLSFTITW 
MNLEDFMLVR 


9109 


23010 


A 


9176 


286 


96 


PLFFFNRWGLAICLGWSAVAIHRRDHSA 
LQSQTPGLKRSSCLSLLSSWDYRRVPPC 
LANESHY 


9110 


23011 


TV 


cm nn 


jyy 


on 


RNTKWljFFLRRSLTLSTGlaECSGAlSAH 
CNLRLPGS S DSLANG I FKDLALLDM 


QUI 

yl 11 




TV 

r\ 


one 
y\ Jo 


All 


1U1 


GRTGRQGRGAPHI SHDGRPGRDAPRPPD 
VMAAGKRRS S LPR WIX5GRAETLLTFQTG 
QPGRGAPHI PDDGRPGGDAPHFPDGWA 
GQRLQSQHFGRPRQAAGRWRL 


9112 


23013 


A 


9179 


193 


41 


IVFIiFIYLFCFFVGTGPCYYAQAGLKLL 
VSSGPIiASAPQHVG ITGMSHHT 


9113 


23014 


A 


9180 


292 


119 


KITKINKQ I IjNLS YTLDQMDLTD I CRI F 
HPTATEYRFFSRAHGTFFRSGTKNSAWH 
I 


9114 


23015 


A 


9181 


3 


279 


AIiS PP YSSECCKLGS FNRELSS PS PGGQ 
KSEIKVSAGPRPLPRVQGDPSPPLPASG 
GPRCSWACGQITAVSASGVTHPSPPLLH 
VCCVCLI 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
OSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

aiiuuv mvIU 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

dinir te\ 

last amino 
acid 

residue of 
peptide 

can iipnrft 


. Amino acid sequence (A=A!anine 
C=Cysteine, D^Aspartic Acid, 
E=Glutamic Acid, (^Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P^Proline, 

0={* III famine R= X roininp Q=QArtnA 

T=Threonine, V=Valine, 
W=Trvntnnhfln Y=Tvrn«in*» ! 

X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 

liu^-iCTjiiue iiuci nun 


9115 


23016 


A 


9182 


402 


238 


QMQWHNLWPLQSLPPEVTPLFCLNLPNN 
WGYRWPPPPPTNFFFLKGVST/3TJPGSV \ 


9116 


23017 


A 


9183 


3 


207 


EFHHVGQDGLELLTSGDPPASASESAGI 
TGVSHRADOACRFLHFFEGTCFTiSOET»S 
YLKLGRDTIIC 


9117 


23018 


A 


9184 


3 


106 


YPHYVKS I VASTFI ISLFPTTIFMCLDQ 
EVIISN 


9118 


23019 


A 


9385 


110 


248 


GVLVGSPLRTRFHSQLQAEGAEEIGWT 
WMGDGVL PVCWTiE VDV 


9119 


23020 


A 


9186 




1 115 

1 IJA 


KCPSAWAAHPTS E X R T SOGGS C\ ZV&WT .V 
LFFLLKNPGDVI PALWEAEAGGSQGQQF 
ETSLANMAKPRLY 


9120 


23021 


A 


9187 


399 


292 


ASQVAGTTGAHHHARLIFFVFLVETGFH 
HVSQDENF 


9121 


23022 


A 


9188 


7 


SO 


PCD PP A55 V55 fJS AG T TRV<?WWAWnT 17 


9122 


23023 


A 


9189 


1 


117 


LPISLPSFLPSFLPSIiLPSFLPSSFLPF 
FFPSSVTPYLK 


9123 


23024 


A 


9190 


238 


46 


NKAFRIKAFVS PSNKKFSS FFFFLNEMK 
GVSHHVRP 


9124 




A 




*> 




TiTRV17VnMMT.'VrPR^2T.l?T.T.CCGl^T D7VT 7VT5 

KVXJ^FTGVSHHTRPELYFQRLljAPGTLL 
EVWGTSGLR 


9125 


23026 


A 


9192 


A A 


too 


DV 


9126 


23027 


A 


9193 


AAQ 




PCPANFVFI»VKTGFTiWVRHaGT.KT.T.TQG 

DLPNSDQKFYVN 


9127 


23028 


A 


9194 


427 


238 


PPARPTAFPYIEFFVEMVFRYFGKAGLK 
VLISGDLCASASPGVGITGVSHHAQPK 


9128 




A 




j*t 


1 /o 


GGSRGQEMKTILANTDFWTG 


9129 


23030 


A 




00 


J JO 


CKiPVSMWWKTQ T fCD VO VT7GVTTP TTVTTDD'D V 

EQKERERKERKERREGGREGKERGRREG 
KKRKKNKRKKVGL IGS PCIVFTWQAAYC 
EESGGVWPGGETEHTOEGT.VO 


9130 


23031 


A 


9197 


3 


91 


QGLSLGGGGCSEPGSCHCTPAWATERDS 
I 


9131 


23032 


A 


9198 


1 


203 


RTRGNKS IDDTSNFDDFPESDILQPVPN 
TTEPD YKS KDWVFUJV T V KkFEGI/TORG 
SIPTYMKAGKL 


9132 


23033 


A 


9199 


2 


144 


PTRTMPCCCCCCCCCCCCCRCLFMTGVL 
FTCCHNQFENHSGCLPRQI 


9133 


23034 


A 


9200 


1 


99 


YAAKE L VHDRYRPT I EMERRRGLW WLVP 
RLSLE 


9134 


23035 


A 


9201 


j 


149 


LNAVORME WOUCVO E T.D PAGH K" <3 T .PVNTC 
VPSFYVPMVDVNDDFVLLPA 


9135 


23036 


A 


9202 


284 


21 


KGPPLFFFFFFLRQGLVLWHRLEYSGMT 
TAHCNLELLVSSDPPTLSSQNAETTGVS 
HCAOPASLILSKPYIGFLHOIOKfiCT.PV 
TFL 


9136 


23037 


A 


9203 - 


1 


77 


LRLVLNS WRQAILLLWPPKVLGLQA 


9137 


23038 


A 


9204 


529 


340 


GVLLWRQAEAGELSLGGGGCSELKVRHC 
TPAWATRAETPSQKINEIiIKSMAVELTD 
DFIT 


9138 


23039 


A 


9205 


413 


241 


HPPPPELKLFSCISLISRWCHRHRPPCP 
YVFVLLAETGFHVGQAGHFCHTYCCVSLP 
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Amino acid sequence (A^AIanine 
OCysteine, D=Aspartic Acid, 
B=Glutamic Acid, F=Phenylalanine, 
G=Glycine, EMHistidine, Msoleucine, 
K=Lysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Profine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonme, A^Valine, 
W=Tryptopban, Y=Tyrosme, 
X=Un known, *^top codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














T 


9139 


23040 


A 


9206 


33 


152 


QLQGIiESWGMERCALPGYPVVi-rNljUKY 
RSWIBETMRDK 


9140 


23041 


A 


9207 


70 


402 


HVDWEALGNDPRLVNLGICKLKASGIHE 
WVHTRTCAHTHMHTHTHTCTHCRS PRW 
VLNLGCTFELPGALKNARAYPHPNATKP 
MTSESVGIRPTRQVFKAPRAEQHCSTA 


9141 


23042 


A 


9208 


46 


262 


GGIPPAAVRGGRSWLRDVLSRIiAAAGRR 
RRGTRRGRGRAGGAGGGRDRVGSRREPA 
AARTAAEGERAVRGKC 


9142 


23043 


A 


9209 


15 


206 


PIEPLSERTNSAGAPPPASLPPCSLISD 
CCASNQRDSVGVGPSBPGDSDMQTSLWA 
KVPVSKD 


9143 


23044 


A 


9210 


475 


110 


NLLAPHLLVSLKKHS I RMGVAQFS RCHL 
SQLPLARKENSLTPCASGVRRCLALLRL 
MLGGLHPLSCTHCPTGPSEMNLVSQLEM 
QKSPVFCVAHAGS CRLELFLFGHLPL 


9144 


23045 


A 


9211 


420 


132 


PPPPNYFGGAGFFPPPPFFLRPPPPFFF 
LPPKKKKIFPPRGPKKIFFIiKAPPPSFF 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
LVRSHFILTIM 


9145 


23046 


A 


9212 


353 


87 


FFFXKQXFFFPSLFFFSPPXXFSFYPF* 
YLKYPLLPPSFFLFFFTPPPPXFFFFFF 
FFFLFPPPFFFFFFFFFFFFFFFFFFFF 
FFFFD 


9146 


23047 


A 


9213 


2 


276 


FIQLiLKRRLCELGYEVLPHPSYSHDLLP 
TNYHFFKHLDNFLQRKFFHNQQDAESAF 
QEFAESQSMEFYATGINTLISPWQKCVD 
CSVSYFD 


9147 


23048 


A 


9214 


402 


276 


LSCLSLPSSWDYRCPPPKPGYFFYFFLF 
LVKTGFPHLAQAG 


9148 


23049 


A 


9215 


386 


37 


GFKKRVCSPFFFFFFFFFFFFFFFGREG 
FFLQKKKOTC^PSSNKLLTLGLQGKKK 
IQPALMKKN5ISQS 


9149 


23050 


A 


9216 


1 


142 


KDFKVQVWYNHSSLQPQPPGFKQSSRLS 
LPKYWDYRHEYLHCIQLAP 


9150 


23051 


A 


9217 


402 


181 


FFFRXFFFFXXFFXXXX± r l?'l r l? if AAAAAA 
XXXXXXXXXXFFFFXFFFFFFFFFFFXM 
FQQEKLLFIFYVNIWTF 


9151 


23052 


A 


9218 


2 


87 


SGSLIFGLIYICMCVCVCVCVCVCVYPC 


9152 


23053 


A 


9219 


420 


271 


YYCXFRETGFHHVGQAGLELLTSGEPPA 
SASQSAGITATAPSPFLTFLLV 


9153 


23054 


A 


9220 


139 


413 


SGYriSKTbrQKKKKiuSitKKK^ 

IFFRGGGGNFFFFIiGPQKNIGGGGLKKR 
GGGKJxP PF try ± if r r V 


9154 


23055 


A 


9221 


134 


236 


LHRWGPAMLPRI ILNTWPQVILLPRPPK 
VLELQA 


9155 


23056 


A 


9222 


364 


280 


STHLSLPKYWU YKfcUS Vfc*UJ-iAJUUJAUJ>I? i> 


9156 


23057 


A 


9223 


1 


147 


TIJaPMLESSGTIMAHCSLNYLGSSDPST 
SASQSARITSVSHCTQPPLPF 


9157 


23058 


A 


9224 


1 


208 


AGAPPPASIiPPCSLISDCCASNQRDSVG 
VGPS E PGAG YHLWRRLSQS EKRS IRVG 
VTRFSSLLItSSHA 


9158 


23059 


A 


9225 


1 


145 


FQHVGQAGPKLLTSGDLPASASQSAGIT 
VSNRVRPKTKNFYTGNTRPN 


9159 


23060 


A 


9226 


396 


210 


KCWDYRHEPPYPALTNLLFIEMGSYGVA 
RAGLKLLGSSNPP I S ASQSAGVPGLSHH 
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OCysteine, D=Aspartic Acid, 
E^GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, IHkoleucine, 
K=Lysine, L=Leucine, M=MethionIne, 
N=Asparagine, P= Proline, 

vjriu lamme, jtv — /vrginine, o — oenne, 

TsThrpnninP VsVolino 
a — jl urvuiiiiic, v — valine, 

W=Tryptophan, Y=Tyrosine, 

X— I In If n OWn *— Qfr»n ffttitxrt I — nncciKIa 

r%. viiiviiurvii, — oiu|j vouoii)/ — puss I Die 
nucleotide deletion, ^possible 
nucleotide insertion 














AQPIF 


9160 


23061 


A 


9227 


2 


1S4 


GMSHHTQPRMVFFSNFSSTAQC 


9161 


23062 


A 


9228 




91 ft 


FFTjIAIjG T F T T .W 21 T VT . T YT . Q T WD D CMU 

PSLSIYLSI YLS I YLSIYLS ISLSLSLS 
IYLSI YLSIYLS IYLSDFLSSAR 


9162 


23063 


A 


9229 


3 


OJ 


RC3D Ii I TF P FT .P Tf T .T .T JCWC C 


9163 


23064 


A 


9230 


25 


188 


YFRTAWNQAWW YVP I VSATWEAEAGGLb 
EPTSSRLRCAVIAPVNSYCTSTWAVS 


9164 


23065 


A 


9211 


AA 
**** 




VT'JT.PrPWMW\7\n?VT7r.i7is ,r rc'i2 r ETC'T ov^renH 
xuijivr iNjMrLV vr I r xjrlilisr HoIjoKVKCN 

GAISAHHNLCLCDGARLHHRRGGKKTKK 
OTTTT.DQ 


9165 


23066 


A 


9232 


3 


195 


FFVGLGS PYWQTGLELLCSGNPPASAS " 

TiMST? TTY5MQ'PnVO'DM T TY2DT CWT/3QOTV 

i-uv/uc x iunoriy v yrw xvjKXjo r i JjUo oil 
GSIHAMI 1 


9166 


23067 


A 


9233 


JOU 


94A 


QVPT.AFMCSRHlTW^TiaQDnT.T DDUTUOOA ' 
o w J-i«-c»i*iw»r ritr v is Xj Li Irlr Vlnoby 

PPKAIGLQARATAPSLH 


9167 


23068 


A 






101 


IJ.T.TWTr.MPTUT COTT T T GT WDnTTTPD 
XjXjXjXN IXiTlr XxiJbolrXljXjJjfaJjJSJJb'UXX l\jr 

EG 


9168 


23069 


A 


9235 


411 


226 


FFFFFATNSQGGPTSFWGPRNNAQVAGP 

TiRWTCTP*rPVfSQP'irVTTyPCWT?T'P\7T? 
■UXVLXXvJL r x zr v vjolr XVIvXI? «roVil!iXxrVJr 


9169 


23070 


A 


9236 


404 


262 


QGLAMLPRLVSNSWTQAILLPWPPNQSA 
GGSGVSHHTQPIFISENIH 


9170 


93071 






ooy 


1 

1 JZ 


tUWtr Vfa X WKSFHVSNS YM TV x vQMYHNIi 
SlPLPDPHFFFRGGGLAMLPRIiVSNPWP 
QAILPPQPPKVLGLQA 


9171 


23072 


A 


9238 


389 


190 


RPKKKKKFFPPPGGKGVFFKGAPPPYFL 
wr ±rr r rr c tf err r r rrrLi x xHNrxtXX* 
FLNCDKLYIT 


9172 


23073 


A 


Q71Q 
y^jy 






v3rt k i£ w U5i J t J r irrx^IXjAyWvjxrCjbxjQNSPPEF 
KQFLCLRLPZSIVIiEYKSlPPCPTNFCSFY 
KNGIFPCWPGWPFKFQKKKKISWVWWCA 
LAEAGGS LEPRWR WRLQSAKMAPL 


9173 


23074 


A 


Q7 /in 


199 




IfWlfWDfyir .T'DTT'T DDT UP7\lT/v»/ , 'ODrimr 
ivjvinjvzvjcvsV^Ij lr VXr r* XjnJ&AIUjrijvsortUSc **> 

IQTIIiAGGGKAPLY 


9174 


23075 


A 


0941 




9RA 
ZOO 


TJ*C*\7/VT , irT4 , UT-T\7A/ r ^7V 7V T ./"^VT T7Q O T>7~\nri'T»r» "A o 

r c Vyl Cir ilrtVAyj/\AlAjljV^> jKIJPP 1 HA.S 
OSVD I TG VSHPAO P V T V<;rTjnf>VTT .WYP 
RQVTVTSLIAFCQVAYPKGAKSQWPQSF 
FFSSPFLFSL 


9175 


23076 


A 


9242 


2 


95 


HMKKCSSSLAIREMLIKTTMRFFFFKLl 
TCY 


9176 


23077 


A 


9243 


377 


293 


PL^PRLVXSSWAQMILLPLPPKVLGSQA 


9177 


23078 


A 


9244 


422 


147 


KKGGTQGFSPPRVFKRGPRNFFFGPPKK 
■*vivL^^ crrivuivi err jjinxsjtxto r r r c r c r 

KKNYSFF 


9178 


23079 


A 


9245 


402 


208 


TIGRCHHTQLNLKFFVEMGSHYWQAGL 
KIjIiDS TNP PS S AFQS AGISGVNP YAWTE 
IMFLYSIPI 


9179 


23080 


A 


9246 


314 


168 


E P YH L H FIj P P HHYYYC R WR T . T VT . P MT . V Q 

*Jt limn rnni i J. V«C\.«\rv7JJ X V Xl.fr I IXl vo 

NSWAQVILLPWPPKVQGLQV 


9180 


23081 


A 


9247 


3 


320 


WQAGLKLLSSCDQPASASQSAVITGVN 
HHARPLIFLYEMDYGSRTVLIIRKWDDN 
EQSNRTIFLKKDELTGNRSACLKYTPMD 
INVNSGLGWRYRFGMHHQCRW 


9181 


23082 


A 


9248 


22 


303 


KGSYFSFTYEAQTESCSIARLECRGAIS 
AHCNLRLLDSSNS PAS ASRVAETTDHKT 
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last amino 
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np nl"i rtp 

|ICU UU6 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F— Phenylalanine, 
G=Glycine, H-Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N^Asparagine, P=Proline, 

0=01 11 til mini* R~At*ointnf* S^sfrinfL 

T=Threonine, V=Valine, 

W=Trvntnnhan V=T*vrosinc 

X=Un known, *=Stop codon,/=possib!e 
nucleotide deletion V=nossible 
nucleotide insertion 














LVPERSCFTPKERNVTQKGQEESEEMSN 
HLFTWLPILH 


9182 


23083 


A 


9249 


13 


174 


IjGLKVRHRLWL I FVLS VBTE FHHLGQAG 
LEIiLTSGNPSAG ITGVSHCARPTSHS 


9183 


23084 


A 


9250 


3 


141 


HVGQTVLRLLTSGDPPTSASESAGITGV 
SHRPRQIFISLFTICIR 


9184 


23085 


A 


9251 


56 


262 


LRIPLTFGGWKGEEEDRAVMHLRLCGD 
DSPMSESLHFIKLPRTVLVDEDARPEKK 


9185 


23086 


A 


9252 


401 


275 


PADLF I F I WEDGGLTMS PRLVMNS WS QA 
TLQPWTPKVLGIiQV 


9186 


23087 


A 


9253 


411 


258 


PKVFKYRCDPPRIAWFLFFVQSLLPRLG 
LNSWAQAFLPPWPPRVLGLQA 


CM 0*7 


zJUoo 




yZDj 


iyi 


10/ 


GTjKTTiT55(^TjPAi3AQn<?AGNTGM<3HC!AR 
PLMSFKIRYFLFTIKMKLKLLWFWSA 
GVHPPHFSCPIVITNKPRI 






7\ 
A 




0 




FFCRYGGIAVTiPRLiVSNS WTOAI T.S P W P 
LKVLGLQACTHHFFLMVNSNAARIKTIE 
MLLSEDVNFFVALLCPSFTISHFHL 


nt on 

9189 


23090 


A 


yZjf 


45/ 


4Z 


GSSNPPAIAFQSARITAVSHRAQLTCIK 
YTFLFTHIP 


9190 


23091 


A 


9258 


1 


243 


ILSSGGSFISLTAVILIIFMIGEAFASK 
RKVLIVEEPSINLEWLYGCPPPYHTFEE 
POP KKSETYS KCFFFSVFVHFGGWG 


9191 


23092 


A 


9259 


411 


2 


FFFFSPPVSLSLCVLESSPPPFFFFSPP 
PKKGFFSPPFFFFSPRVFPPPVFFFPPP 
PFFFFPPTjKKNFSPPPPGTCT FFFFFRPP 
PFFFFFFFLFFFFFRPFFFFSFFFLRWE 
EVLYFIMCHHISDAYPPNVDMNTKE 


y\y£ 




TV 


yzou 


All 
41 / 


ziy 


CVRMflFT .WVfiO AS T KT.PTSGDP «1 ASfiflO 

SAG ITGIiSHCARPHS QLS IVCCYSVPVQ 
CLHSLAARY 


VIVJ 




TV 

x\ 


7Z01 




1 /Z 


f!PNH FR KWGHRGC P P PRVF F FVGR DGVY 

LUlL IU\n WUxwVif IT JT *\ till V \3l\XJ\J V X 

VAQAGLEIiLDLSNS PASASQSETGTTGMS 
HLGOPLLFILNDVIAVI PLQGS F 










41/1 




HODNKPAPPGFFFFSPPPKKGFFSKP FY 
FVS PRF FPFP PFLNP APLF YFWG PKKKI 
FFFPPPAIKFFFFKRAPPLFFFFFFFFF 




z5\jyo 










IiFFLNKI PFFPPGGSKGGDFGPHTNPPP 
GVKPIFRPHPPEEGGPKGPPPRPREFFL 
GF 


9196 


23097 


A 


9264 


206 


438 


FPILBCTHFRx^TVAHTYNFTxljGGPGGR 
NTCSKFKTSLGTWGDPFFHKSENCIIiGWG 
VPNCGPCFWGAQQKSAWSEGL 


9197 


23098 


A 


9265 


333 


48 


ALLAGPTPFFFLGKKKTRFGNRAIjFFFF 
FFFIWVI^CNPGWIMVV7aHRCDHGTL 
QPPTPGLKRSSCLSLSSSWNYKLKPLHP 
AGVRFLDFVS 


9198 


23099 


A 


9266 


3 


275 


TOFHHIGQAGLKLLTLGDPPLSASHSAG 
T TCVSHHAWPMHLES YWRVOYPFMPOVT, 
YFWAPSGAQHCAQ I IENAI PTLLSRPQE 
EPASEA 


9199 


23100 


A 


9267 


49 


226 


HGPEPCMDADAQREGINVSARYPKNWES 
TVDPAREFTOIYSAPLMLIADPDEFASV 
QLA 


9200 


23101 


A 


9268 


159 


236 


PRLVLISWPQAILPPWPIKWGLQA 


9201 


23102 


A 


9269 


1 


208 


I FNYCVFSSGVALAYG I YKQDLP ALKEK 



1103 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Ala nine 

OCysteine, D-Aspartic Acid, 

E=Glutamic Acid, ^Phenylalanine, 

G=Glycine, ENHistidine, I=Isoleucine, 

K=Lysine, L=Leucine, M=Methionine, 

N=Asparagine, P=Pro!ine, 

0=*Glii famine R= A. roinino c — c*.;«> 
>c ^»im minims) iv t\.\ gin in £9 0 — serine, 

T=Threonine, V«Valine, 

W~Trvnto n h a n Y=TvrrKzino 

X^Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 

nucleotide incprtifm 














PRGWFVDMGHSAYQVSACAFNRGQLKV 
S IYFFSRRLLWW 


9202 


23103 


A 


9270 


1 


209 


IFNFCVFSSWAIiAYGIYKQDLGALEEK 
PRNVVFVDMGHSAYQVSVCAFNRGKLKV 
S IYFFSRRLLWW 


9203 


23104 


A 


9271 






GSPASSSRLC 


9204 


23105 


A 


9272 




J 


RKKERKKERKKERKKEDGATADGISEAE 
VIRVHPYPVGFGVLSEGDIWTEDMRAGR 
RPHAGESGDTVMLPOAfH? 


9205 


23106 


A 


9273 


405 


246 


PTKKNKYLHAPPLFHFFFI I PNPNFFFF 
FFFFFFNFFFFFKTKGCBFLYFSY 


9206 


23107 


A 


9274 


it 


777 


SVGITGVSHQPGPSFHFERKSLGLKTLL 
LCVKVGVRACFALFFQGSS 


9207 


23108 


A 


9275 


70 


304 


NLFLKASTYPSEHLYHLPPTPIVYIRKR 
S AYRGLS JCCUTDYP P7J? PPPDUT.V 


9208 


23109 


A 


9276 


2 


260 


TVF YHVGQAALBLLTSGGP PALAS ES AG 

SPYRFSFTQLNHGFNTVLYPLVINISEI 
FF 


9209 


23110 


A 


9277 


406 


330 


PRLVLNSWTQTILLPWPPKVLGLDV 


9210 


23111 


A 


9278 


2 


lOV 


YXVYRVSRKSKNIPLKHIGIQSFP 


9211 


23112 


A 


9279 


] 


91 

✓ i 


GLTVFPPJjVWN^MAnMTT.PPWPPTfMT r^T 

QV 


9212 


23113 


A 


9280 


3 


107 


LGETGFHHVGRAGLELLNKVMHPPWAPR 
VLGLQA 


9213 


23114 


A 


9281 


336 


234 


AHYKLLLKI KLPGQAQWLTP VI PTLWEA 
EAGRS 


9214 


23115 


A 


9282 


3 


241 


DRPE PCVfDADAQREGINASARYPKNWVT 
TGDPAREFTMTHTA Pr.Mr.r.a npnp nrcjir 

QLAQSWPFGAIASLGRSPCRKRN 


9215 


23116 


A 


9283 


404 


277 


I FF VETRSHH IAOAGLE r ,T £l <2 enp p vt? p 
QVFTEMSLSQRALF 


9216 


23117 


A 


9284 


399 


231 


HYVAOAS LKLLYSGDP PAS AFO <3 AF T AR 
VSHCTWPTGSLNKIQNNIHSRISSM 


9217 


23118 


A 


9286 


394 


231 


AFFFFFFFGERGFHHVARTVLKLLDSSN 
PPAWSQSAGITDVSHDFLISLYRIY 


9218 


23119 


A 


9287 


420 


311 


IOTCMK5LAKLPRLVLNSWAQAVLSPQP 
PKVMGLHA 


9219 


23120 


A 


9288 


398 


173 


LLSFFYLFFYFFFFTIjFPr.PPPFPPPP'P 
LMFKALLFSCHYFPAMVTFKFSLS ISGV 
VRTSGYIMLFSYLCLLKTM 


9220 


23121 


A 


9289 


270 


23 


TPEOCRGI KLFI YSL ILWRrjGVT.T J r , r'<; 
GWPWAIHKCDHS TLQHQI PS LKQASCL 
S LLSS WE YR YAPLHLAS ANFIiKVTW 


9221 


23122 


A 


9290 


273 


46 


HLLNMPLFTEIWTERLFLFYILCFVFLV 
E TGFFYI TQAGLKLLTSSDP PAYASQS A 
EIIGTSHCTRPICYVNVTNI 


9222 


23123 


A 


9291 


424 


230 


CLPFTHSLSLSLSLSLSHTHTHTHTHTH 
TIYMNK 


9223 


23124 


A 


9292 


2 


146 


NLFLVETRSHHVARAS FKLLSSSNPPTS 
ASQSSEITGASHRTWPEVFF 


9224 


23125 


A 


9293 


393 


202 


GRERERKREREREKERKKERKKERKPER 
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nucle- 
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location 
correspon 
ding to 
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peptide 

sequence 


Amino acid sequence (A~Alanine 
C=Cysteine, D=Aspartic Acid, 
EXJlutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

0=f»Hi famine R— Artnnine- S=Serine 
T=Threonine, V=Valine, 
W=Trvntnnhfin Y=Tvro«;ine. 
X=Un known, *=Stop codon, A=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PSGSLTYCVREGITAANLGKRRHKASKT 
LISHKNKN 


Q99^ 


9119£ 

ZJ IZ\) 


A 


QOCM 


7^9 




GFCCCCCCCYCCCSDKRHPG 


9226 


23127 


A 


9295 


409 


269 


SQYFGRLRRVDCIiSSGVQDQPGQQGETP 
SLLKYKIYTHISWAGWHV 


OOT7 

7zz/ 






0/7 OA 


z 


911 i 

Zl j 


FCCGLLIHLIYCKTEYSGOSTVAIHRCN 
HSVLQPQTLGFKQFSHLSLLSSWDYRYS 
PLCPAYIFNLS IGY 


9228 


23129 


A 


9297 


169 


408 


RFARHMN-SPHEERLPPPAGIRVRPPPQP 
AASLGPPRMAPS ISRRFQMTP YYKHNDF 
AGQAOWIiMPWPALWEAEAGGSSE 


9229 


23130 


A 


9298 


199 


51 


LHPPPPSLSI^LSHTHTHTHSHTHTHTH 
TLWLGRPGPAQDWLVWGNLT 


VzJU 


ZJ 131 


ZL 


0000 


1 
1 


17£ 


PSPFSLPSSWDYRHVPPCLASIFFLKLN 
YH 


9231 


23132 


A 


9300 


412 


209 


MGFHHVGQSGLELLTSGDPPTSASQSAG 
ITGVSHRNWPHCGFNLVFLMPDDVKHLF 
MCILAICVSSL 


9232 


23133 


A 


9301 


195 


347 


KIKGFRPGATAHAGNPSALGHQGGRTAI 
AWE FT I S MSNMGRPCLYRKYKN 


9233 


23134 


A 


9302 


386 


190 


GFLFFFPPPKKBVFPHPFILGFPGFFPP 
PPFFKRAPEFFFWGPKKKKNSPPPRALH 
FFFFKGAPP 




ZJ1-5J 






77 
Z/ 


171 
1 to 


S FOLWGHLKVPRIRPKRADHIiRSGVODO 
PDQHGKTLSLLKKYKISRAW 




ZJIJO 


a 


Ol (\A 


1*7/1 


171 

izi 


OWCDHGSLOPOTPGIiKOSSCFSIjPKRWD 
YRREPPHPTSLSPYIVAWPCTLCPPPLS 
ALWKHLIHQKCIX5VMVIiQVPGRTHNVTL 
T 


9236 


23137 


A 


9305 


70 


196 


PQVIHPGQAGLELLTSGDLPASASKSAG 
ITGVSHHAQPISHF 


9237 


23138 


A 


9306 


378 


256 


QKWVS I CCPGWSQTFGLQES S CLGIiPKS 
WDYRRQPQPCLTIi 


0/7-20 


7111Q 




Q107 




1 


PCHLGWSAWQSWLTAASNS WPHAI LPP 
QPPKVLGLQASLKFLKAGITQVTFSDYN 
KILNKEIKRGR 












t Af 
lOl 


HFKKKI YI YI YVYIHTYI YI YTHIHI YT 
HTYTYIYRYIHIYIHTYIYMYVYIYIPS 
LPF 


9240 


23141 


A 


9309 


503 


288 


FFLNLDKSQEWWIiMPVI PTLREAEAGGS 
LEVRSSRPASSDPPTSASQSAGI XGMRH 
HARPWLRENGRIQN 


9241 


71149 


A 


9310 


418 


309 


FFCKNGI S I CCLPKLVSNSWPQAILQPQ 
PPKVQGTL 


9242 


23143 


A 


9311 


442 


275 


VSPPPLGGGFLSPPFKPPPFFFNLKGGP 
PFPTIPFLIIKiGKKFCPPPPFFFPGLF 


9243 


23144 


A 


9312 


2 


95 


HENRLSLGGGGCSELRLCHCTPAWATER 
DFI 




ZJ Afj 


A 


qui 


lift 




FFFFFFFFFFFFFFFFFFFFFFFFPFPP 
FFFFFFFFFFFFFFFFFFFFFFFFFFFI 
FILFIYSLLYWCVGVLGS 


9245 


23146 


A 


9314 


388 


199 


EQPKCVHRI KQI WHICALE YYAAFKKKK 
VLPYAVIQIKLEDTMLSEINQSWGKYCM 
IPLIGGI 


9246 


23147 


A 


9315 


2 


97 


STPGLKRSSHISLPKYWDYRRGPPGPAS 
ILF 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartk Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGIydne, H=Histidine, Wsoleucine, 
KHLysine, LHLeucine, M=Metfaionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
•^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^TJn known, *=Stop cod on, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


9247 


23148 


A 


9316 


3 


235 


TMWNAEGIPIRTTLDNSTTVQYACLLH 
HLTMKAKSTVRDIDPQNDLTPLRIRSKK 
HEIMVAPDKEYLLIVTQNPCE 


9248 


23149 


A 


9317 


253 


71 


LFFFFLPFVDTKFHHLAQDGFQIiLSSSN 
SPTSASQSGGITGRSHQAWPYVHLLTLQ 
KLNY ! 


9249 


23150 


A 


9318 


425 


95 


ClaLFCPKKKKKKKKKECKKKKKKKKKKKK 
KKKGGP I KKKKKQP 


9250 


23151 


A 


9319 


25 


214 


VRIDDRPEPCMDADAQREGINSSARYPK~" 
NWWTGDPARECGMIHSAPLMIiLADPDE 
FVSVQLA \ 


9251 


23152 


A 


9320 


50 


231 


GHRPEPGRDADAQREGI IASARYPKNWV 
TTGD PARS CTR I QS APLMLLAD PDE FVS 
VQLA 


9252 


23153 


A 


9321 


401 


226 


LAATTGTRHHAQPS KFRNLKGWIiCAKRA 
RT | 


9253 


23154 


A 


9322 


410 


159 


IiGSLQIiPPPGSKRFSRLSLPSSWDYRGC 
SELRLRHCTPAWVTEODS VP P KK3CKO I P 
RPFPFSYGESLGVGLRHLCFSKPSRRF 


9254 


23155 


A 


9323 


310 


25 


KWGGLFFFSPPKKRG FFKQ VFFCVPP VF 
FFPPFFKTGPGDIFLGPLKKKI IFPPPG 
gkkfffkrgpppffffffffffffffff 
FSEVECVCIjCR 


9255 


23156 


A 


9324 


406 


289 


PGFYHAGQAGLKLLTSGNPSASASQSAR 
ITSISQRTRPH 


9256 * 


23157 


A 


9325 


1 


116 


LHSVLLPFFYFCVCTCVCVCVCVCVKHK 
YVRAGGAPGR 


9257 


23158 


A 


9326 


2 


158 


FIFLFTVGGIiTGIVLANSSLDIVIiHDTY 
YWAHFHYVIjS IGAVFAI IGGF ih 


9258 


23159 


A 


9327 


3 


107 


RGCS E PRLRHCTP AWVTEGDCVS KKKKTC 
VPKAWG 


9259 


23160 


A 


9328 


257 


2 


GFFSNISASVIIPPPISRKNLFFPFRVY 
FGPPPKTFKGPPPFFFFFFLS PRLECSG 
VXMAOCS LTLPGS S DLPLSAS CVAMTV5? 

r 


9260 


23161 


A 


9329 


66 


384 


GNIiCGWMKSMAWDNPWRKADWGQIRBS 

KRGGP FLKI SWGGP ILGGRGKI FFFFFG 
GGIIiTPRGVFWKSPFFWGGKKK 














9261 


23162 


A 


9330 


1 


98 


I IGQVASVLYFTTILILIPTISLIENKI 
LKWA 


9262 


23163 


A 


9331 


33 


412 


NLFYVMAE ILHTHIQLCCQVKLHLSVPR 
PSLPLLCCWNFSLVLHEESQPASEKKKK 
KKKKNKKKKKKKKKKRGGVFKKKPGGAP 
FOGGGENFFFFFFGGGLKKPRGVFGKNT 
FFWGGBKRPPPPPY 


9263 


23164 


A 


9332 


401 


311 


PTRPVDLLVDDSVKESLIGYGPTEDSKE 
FS 


9264 


23165 


A 


9333 


5730 


452 


TRSAPRAASWLEDPREVRSVCLSATFFT 
IiSLQKPRLIjLFSPSWHLGVPLSVGVQL 
QDVPRGQWKGSVFLRNPSRNNVPCSPK 
VDFTLSSERDFALLSLQVPLKDAKSCGI* 
HQLLRGPEVQLYAHSPWLKDSLSRTTNI 
QGINLLFSSRRGHLFLQTDQP I YNPGQR 
VRYRVFALDQKMRPSTDTITVMVENSHG 
LRVRKKEVYMPSSIFQDDFVIPDISEPG 
TWKISARFSDGLESNSSTQFEVKKYVLP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
fling to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCystetne, D=Aspartjc Acid, 
E^Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
V£=Hjriuiaraine, K— Argimne, o— oenne, 
T=Tfareonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














NFEVKITPGKPYILTVPGHLDEMQLDIQ 
ARYIYGKPVQGVAYVRFGLLDEDGKKTF 
FRGLESQTKLVNGQSHI SLSKAEFQDAL 
EKLNMG I TDLQGIaRIj YVAAAI I EYPGGE 
MEEAELTS WYFVSS PFSLDLSKTKRHLV 
PGAPFLLQALVREMSGSPASGI PVKVSA 
WSSPGSVPEVQDIQQNTDGSGQVSIPI 
I IPQTISELQLS VSAGS PHPAIARLTVA 
APPSGGPGFLS IERPDSRPPRVGDTLNL 
NLRAVGSGATFSHYYYMILSRGQIVFMN 
RE P KRTLTSVS VFVDHHLAPS FYFVAFY 
YHGDHPVANSLRVDVQAGACEGKLELSV 
DGAKQYRNGESVKLHLETDSLALVALGA 
LDTAL YAAGS KSHKPIiNMGKVFEAMNS Y 
DLGCGPGGGDSALQVFQAAGLAFSDGDQ 
WTI^RKRI^CPKEKTTRKKRNVNFQKAI 
NBKI^YASPTAKRCCQDGVTRLPflMRS 
CEQRAARVQQLDCREPFLSCCQFAESLR 
KKSRDKGQAGLQRALEILQEEDLIDEDD 
I PVRS FFPENWLWRVETVDRFQ I LTLWL 
PDSLTTWEIHGLSLSKTKGLCVATPVQL 
RVFREFHLHLRLPMSVRRFEQLELRPVL 
YNYLDKNLTVSVHVSPVEGLCLAGGGGL 
AQQVL VPAGS ARP VAFSWPTAAAAVS L 
KWARGSFEFPVGDAVSKVIjQIEKEGAI 
HREELVYELNPLDHRGRTLE I PGNSDPN 
MI PDGDFNS YVRVTASDPIjDTLGSEGAL 
SPGGVASLLRLPRGCGEQTMIYLAPTLA 
ASRYLDKTEQWSTLPPETKDHAVDLIQK 
GYMRIQQFRKADGSYAAWLSRDSSTWLT 
AFVLKVLSLAQEQVGGSPEKLQETSNWL 
LSQQQADGSFQDPCPVLDRSMQGGLVGN 
DETVALTAFVT IALHHGLAVFQDEGAE P 
LKQRVEAS ISKANS FLGEKASAGLLGAH 
AAAI TAYALTLTKAP VDLLGVAHNNLMA 
MAQBTGDNIiYWGSVTGSQSNAVSPTPAP 
RNPS DPMPQAPALW IETTAYALLHLLLH 
EGKAEMADQAAAWLTRQGSFQGGFRSTQ 
DTVIALDALSAYW IASHTTE ERGLNVTL 
SSTGRNGFKSHALQLNNRQIRGLEEELQ 
FSLGSKINVKVGGNSKGTLKVLRTYNVL 
DMKNTTCQDLQ I EVTVKGHVB YTMEANE 
DYEDYEYBELPAKDDPDAPLQPVTPLQI* 
FEGRRNRRRREAP KWEEQES RVHYTVC 
IWRNGKVGLSGMAIADVTLLSGFHALRA 
DLEKLTSLSDRYVSHFETEGPHVLLYFD 
SVPTSRECVGFEAVQEVPVGLVQPASAT 
LYDYYNPERRCSVFYGAPSKSRLLATLC 
SAEVCQCABGKCPRQRRALERGIjQDEDG 
YRMKFACYYPRVEYGFQVKVLREDSRAA 
FRLFETKI TQVLHFTKDVKAAANQMRNF 
LVRAS CRLRIiE PGKEYLIMGMGLDGATY 

RQRAACAQLNDFLQEYGTQGCQV 


9265 


23166 


A 


9334 


46 


324 


SKYLPPVITFQHEIWVGTNIQTISDWVL 
ICCTGWSAVTQSRLAAASTSWAQAMSSS 
SCLGLSSSCYYRHTPPCLANFVAFVETK 
LYTMLPGLS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQ ID 
NO: in 
VSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

nentide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

nentide 

sequence 


Amino acid sequence (A=A!anine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Le urine, M=Methionine, 
N=Asparagine, P^Proline, 
v vj| u i*tiu nit, iv— /^.rgininc, o— 'oenne, 
T=Tbreonine, V=Valine, 

W^TTrvnJ'nnh a n Y=Tvrn<iiii a 

X^Un known, *=*Stop codon, /-possible 

iiuwicviiiuc ucicuuii, y— p 11391 U1C 

nucleotide insertion 


9266 


23167 


A 


9335 


338 


178 


EVD 1AVSHRRKOAFMLCLL KCK P UTtfNP T . 
NLEGGGS S ELKLCYC I PAWATDQSL 


9267 


23168 


A 


9336 


396 


254 


KLSTDCTSKEI INRHPTEWEKI FANYAP 
NKG PTS TIHSS CTLMP P YIjP 


9268 


23169 


A 


9337 


454 


359 


SSWDYRHAPPCPANFVFLVETI FIQGE I 
GLL 


9269 


231 70 


A 








TKERVSFLFVAPGCIQGGKKKKPPTKKK 
KKKKKKKKKKKKKKK 


9270 


23171 


A 


9339 


21 


209 


AEEFPDFKLYYKANQNTYRGQWNRTETS 
EITPDHfLGIQLVSNSWPQVILLPWPPK 


9271 


23172 


A 


9340 


345 


2 


KIFFFLIYILFFFNFFFFFFFFFFFFFF 
FFFFFLENRWFLMENFIKGRGHVKGPLG 

SLLQVQAGGPVHIRI SGLQDARQGPRVA 
GA 


9272 


£~J 1 /-> 


& 




1 c 
ID 




FETGS PCVSQAGLKLLGSRDPP VSASQS 

ARI IGVSHRAOP AT.TDFP QTGT? 
ru\A XV7 v omxtv^trnxi J> x>J? ro Xor 


9273 


23174 


A 


9342 


484 


276 


RHAPPCQANFFVQTRFCHVAQSGLKLIR 
SRYLFASAFQSVRITCMSHCARPLYFLM 


9274 


23175 


A 


9343 


464 


326 


LVETGPHHVGQAGPELLAPCDPPAPASQ 


9275 


23176 


A 


9344 


486 


280 


SFSHSHSCSlVPSSWDYRCTSSIilANFFF 

KRWOFT T T.PPT . TT.WQWDnVTT.T.OftCUPW 

EPLCLETLKTLNV 


9276 


23177 


A 




776 


I J7 


DRDFTMT .PP T »VT ,1UQ Iff D ITT TT.T.P T .D DPT T T 

RLQS 


9277 


23178 


A 


9346 


45 


143 


RWNSRQENCLNLGGGDCSEPSLCHCPPA 
WATE ** 


9278 


23179 


A 


9347 


229 


73 


PPPHMIFFFFFFLFFFYFFFFFFFFFFF 

FFFPFFFFS RTjOVG WPG GVT.TOTTMVn 

KLKEEKEE OlUjy VOV V 1 "l X\XN V XJ 


9279 


23180 


A 


9348 


391 


139 


IPSLKRSSHLSIiAKCWDYRHICPPCQVTF 
S FLNVR I YATKFFTJ3 TP LAAW£ KVWVT , V 
FLLS FS SKYYLTP IS FLFLTHNVSSSI F 


9280 


23181 


A 


9349 


53 
j j 


191 

171 


IYVTYI WYS E VSHAYNS STTidflnfifiR T T 
GAQEFKTSLVNIAKPLLY 


9281 


23182 


A 


9350 


373 


235 


EEGELLEPGVOGCSEljSMCHCTPAWtSTE 
QDHCVSYKNSNTHNNVNY 


9282 


23183 


A 


9351 


476 


82 


YLATYLPTYIjSIYLPTYLPIYLPIYIiYL 
GSHY IAQAGLKFLGWVGHNGS CL 


9283 


23184 


A 


9352 


286 


142 


ELQFKMRFGWRHSQTISWGIiSljLPRLIS 
NSWAQVILLTWPPKVLGIQA 


9284 


23185 


A 


9353 


3 


102 


FFETESHS VARLE YSGAI SAHCTP VT KL 
YLYT 


9285 


23186 


A 




*j 
j 


77ft 


TAGAPPRPANFCT FGPnfWGPCWPfJWGP 

T PGLRCD P PAS AS QLLGRIiRHENCLNLG 
GEGCTCAIlxAHCTSTLRAQAILQPQPSC 
PAPQ 


9286 


23187 


A 


9355 


1 


112 


GEGSAHCNLCLPGSSNSPASTSQVAGIK 
GVSHRARPN 


9287 


23188 


A 


9356 


1 


63 


VALIQAYVFTLLVS LYLHDNP ! 


9288 


23189 


A 


9357 


244 


140 


QRRHTHTHSHTYTHTHTHIPNKKI PILT 
ILKCIA 


9289 


23190 


A 


9358 


464 


181 


I^AGGMTGILFSLGASMVU5GVAQMLAP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Giytine, H=Histtdtne, I=Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Criutamine, K=Arginwe, s— serine, 
Threonine, V^Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Un known, *=5top codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KARTKK J.U'lTDNGKQNTyFSSLDNMVAQ 
GNVLPVLYGEMRVGSRWSQEISTADEG 
DGGQVWIGR 


9290 


23191 


A 


9359 


11 


469 


XUIjRURFRJ^iSRERERERERERKRKRlSK 
EREREREREREREREGRAPPPPLFFGGG 
GPPARAHAIiSPPHIiGRPPIj x ii r r £> x VK 
AP PTERS ARTQREARRALKS VCWRERE 
THSLPRARCVCSHPTGCGGAPGARVSLS 
QSAHCFLCVCGAG 


9291 


23192 


A 


9360 


22 


172 


GLSHEERERERERERERERKRERERDTV 
CPLFCPTGGFILREARVKSGCLCF 


9292 


23193 


A 


9361 


15 


345 


PPNRPPAEFRTRHLISSLLCLEGIILSL 
FIIATLITIjNxHSIJjANIVPIAX^ 
CEAAVGLALLVS I SNTYGLDYVHNQKKK 
KKTWGGGRAPNSPYGGRNNNPRPAV 


9293 


23194 


A 


9362 


61 


298 


SCDSPLQECCIHTHTHTHTHTHTHTHTH 
IiFSDIKTHTWREWALSLFIiLTEEEGGGT 
LFQGTLI FGHTLFWDKPLWQKEF 


9294 


23195 


A 


9363 


430 


2 


GGAGPPPPCWTQGPGVPPPTLSPRVNGP 
SIiCSDFPSPKFYKEKRKKKGNKTPKRKG 
KTIiLPTETIVLHVS PGCESKPGAGSADR 
RAPGVRDGGSDASS SGTRASELGDGPVR 
VNFVI/jDFoBfVbKwr o MXAwlVlrtjl OoIrK 
TS 


9295 


23196 


A 


9364 


432 


291 


GITGLRHHSQLIFLFLVETGFHHVGQAG 
LELLIHQHRPSKVLRLQV 


9296 


23197 


A 


9365 


343 


507 * 


GQGIjTLVPQvuVyWGNLiSSIjQIjijPPGijK 
QSTCLSLPKCWDYKCX5LPPPASFCFVL 


9297 


23198 


A 


9366 


417 


128 


MG WWPGRGAP PFLDEI4AAGQRRS S LSRV 
GSQAEGLLTSHTMGGPAETIjijroU rGWK 
LGRGCNLGSLEGQGRRIiGGGGCNE PSSR 
HCTPAWAPLSTE 


9298 


23199 


A 


9367 


500 


285 


SLPRSWD Y RRAP PKHAN fc"X»FXi V KMUfc'HH. 
VGKAVLELLTSGVELLTSGDLPASASQS 
AGITGVSHRAWQIRE 


9299 


23200 


A 


9368 


1 


229 


IVDLTTWGIPTHTIHKYTHHIHKYIHTH 
AYTHHTHRHKHIYTYIHTrMWWLSPHPN 

t tt t vrcc* c?tixTnTiT rQTJcv dxt/^d 
JjXIiNr SSHN PHvfa WrA»,FNtjK 


9300 


23201 


A 


9369 


451 


88 


KQTNIYTCIYMYVYIYTCIYTYVYIYTR 
TYTCIYVCVYTCIYVCVYTCIYVCVYTC 

ilVLVIiUi I vL.VIIKVIlV.Vvl I vXiK 

VCOTTCIYVYVYIRVYTYICIYTCISTY 
mr 1 TVTHTYin/Tif r*so 

iyu i. x im i 1\. v io,wv3 


9301 


23202 


A 


9370 


1 


208 


ADETIEQRGG I PKFVHDNYVI RRTAVDR 
GIPLLTNFQETKLFAEAVQKSRKVDSKS 
LFHYRQYSAGKAA 


9302 


23203 


A 


9371 


354 


202 


FAAUKJSvj X lAoAK 1 o J\JM n V X rKjUr ntt& r 
TMIQSAPLMIiLADPDEFVSVQIA 


9303 


23204 


A 


9372 


1 


336 


KSL I KYKGYQVAPAELES ILLQHPNI FD 

■y, f-ri TTV /"»T TYTYrYr\TV/"*E*T T>7V TV^TTTXTT ©TI/^WMn* 

AGVAGLPUUDAGfcujPAAVVVijKHUixTMl 
EKEIVDYVASQVTTAKKLRGGWFVDEV 
P KGLi 1 o JSXiUAKKJ-KlS X it X KAKAovvaoKXi 


9304 






yi ti 


Z 




NI LVTALYSliY I FTTTQWGSLTHH INN I 
KPSFTRENTLMFimS PI LLLSLNPD 1 1 
TGFSS 


9305 


23206 


A 


9374 


96 


240 


ACWLMPVI PALSEAEAGRLVQPRRLRAA 
WATWQDLISTKKKKKGGPF 


9306 


23207 


A 


9375 


3 


293 


STISAFTTSTTSATTTSTTSATTTSTIS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=rsoleucine, 
K=Lysine, L=Leucine, M=Methtonine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=*Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














AP1TSTTLS PTTSTTSTTITSTTSAPIS 
SNKLHKKKKKKKKQKKKKKRKNDDTKQD 
TITKIHSGGGSL 


9307 


23208 


A 


9376 


69 


160 


EHTHTHTHTHSHTHTHSHLPERFPCVLN 
SL 


9308 


23209 


A 


9377 


336 


199 


EGITASARYSKNWVTTGDPAREFPMIQS 
APLMLLADPDEFVSVQLA 


9309 


23210 


A 


9378 


321 


128 


EPGFTMWASLAPTSSLQVIHRLGLPKCW 
DHRRBPPHPAYHSLYESGPSVQNGEEKG 
INPIIKMG 


9310 


23211 


A 


9379 


440 


286 


SSDPSASAS KSAGFTGMSHHTQLLTS YS 
EGECLSYLVFLVLV 


9311 


23212 


A 


9380 


392 


154 


KGSSPLSPPISIiAGGKPPPPRKFFLNFF 
FFFNRDRVSPCLPRLVSNSWPQAILPPQ 
PHKALGLQAYATTAS FI FLIVTV 


9312 


23213 


A 


9381 


403 


178 


GRRHEPCRDADAHLEGIIFSAFYPKNWE 
TTGDGAREWTMSNSAPLMLLADPDEFVS 
VQLA 


9313 


23214 


A 


9382 


303 


206 


DPVSTKICSLQPPPPGFKRFSCLSIoPSS 
WDHR 


9314 


23215 


A 


9383 


215 


116 


SNNF VVCCCCCCCCCCLRWSLTLS PRLH 
SCGTM 


9315 


23216 


A 


9384 


2 


285 


ARHLLMQLSGGATLAISTIDLPSTLI IF 
TTiLILLTILEIAVALIQAYDITLLGSLY 
LGDNTYKKKKRGAAFEETKFTSAGMQSK 
NFFLGTLNWR 


9316 


23217 


A 


9385 


12 


123 


GYENFLLPTIiHTQTHAHTHTHTHTHrHI 
HSKDPMEY 


9317 


23218 


A 


9386 


3 


230 


KCWGYRCEPLRLAPLRCFFTPGGISLTS 
GPALLSNASAIiAEVRGQQLCPPTPFLPT 
LIALTTLLLPISPFILIIL 


9318 


23219 


A 


9387 


590 


396 


ARVQWPDLGSDQPPPPGFKRFSCIiSLLN 
SWDYKCAP PCRADFCIFSRDGVLSCWPG 
WSQTPYLK 


9319 


23220 


A 


9388 


224 


79 


GLGPVGHACPTLWEGEAGGSRGQEIEAL 
PANTVKPRFLKRKKKGKKKG 


9320 


23221 


A 


9389 


249 


72 


LVYLITVKKILFPVHIRTFNKIDHMIGH 
KTCI^KLQMTEILQSVFSDYNRI KLEAS 
NN 


9321 


23222 


A 


9390 


400 


254 


LRQPGSTNS PVSASRVAGITGTHHHTQI 
I FVFLVEMWHACNPS YSGG ! 


9322 


23223 


A 


9391 


3 


64 


PPQKSDPPYKLVPVHLDSIL 


9323 


23224 


A 


9392 


266 [ 


106 


LTANFLFLVETGFPHVTRVGLELLASSD 
LPTLASQRAGITGVNHLQNFWPGNF 


9324 


23225 


A 


9393 


430 


348 


FGKPQPP PKGKS PPPRGNFPPQKAPKG 


9325 


23226 


A 


9394 


481 | 


311 


LSLSEKRSIRVGVTRFSRCCPSPLSLTG ' 
KGNSLTPGVSQEIHS PPWFGIAYSVCTH 


9326 


23227 


A 


9395 


3 


136 


VFPRLVLNSWAQMIIiLCWPPKVLGLQAC 
TTRPSHFLPLLNSKYK 


9327 


23228 


A 


9396 


143 


347 


NFVFLVBTRFHHVGQAGLTLJjTSSDPAS 
QSAGITRVRPEILTLKRYHFPSGILGTE 
GMVTEMAYLLKV 


9328 


23229 


A 


9397 


2 ; 


231 


LELLTSSDLPASASQSAG ITRGSHHARP 
S FTFRCVIHLE FNFMRGAREERLHFYCR 
WVIRRLTTFGRGYPSPAASD 


9329 


23230 


A 


9398 


383 


200 


QHYLNRRLSLAQWWMPWPATQEAEAGG 
LLEPRSSSLQCAMMLPVNSHCTLAWATH 
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REPHL 


9330 


23231 


A 


9399 


3 


416 


GFLHVGQAGLKLLTS GDLPTS ASQSAGI 
I6VSHRASCLKS I YLTFPTRKFFVQMS I 
CDFKVEIEGKIGFISVPISLQSILHIVN 
NFFFFLETGSFPPGWRAWEKFGLIAPLA 
SGVKGIFLPLPRENIiGWRPSPIPPA 








74UU 


4.nn 

*tUU 


151 


FGYPQWPLPPFLKPGPGIFFLGPLKKN 
FPFPPPGLKI FS FKRAPPFFFFVRKGLI 
MLLRLVSNSWAQVI I PPPPPEVLGLQA 


9332 


23233 


A 


9401 


324 


154 


FFFFFFKXXXXXLLFFFFFFFLFYIFFF 
FFFXFXFXFFFFFFFFFFFFFFFFFFFF 
F 


9333 


23234 


A 


9402 


413 


312 


PKLVLNSWLQALLLPRPPKVLGLQARAT 
ITQPK 


9334 


23235 


A 


9403 


363 


3 


FKKKNFCLGGAGVVFFFFYDPPPKKNIF 
YFPSKXIYFLSPASSSFFFFLCFPFPIF 
FFFFFFFFFFLENRWFLMENFIKGRGHV 
KGPLGQARDHDISGRRWWRREQCRASQL 
PGVRPRV 


OIK 






QATiA 


401 


OA 


selphdpinpttgyiypkkmksvcftdi 
ctpmfttalfs iakiwnqpkclskriti 
fmflvtysqlfcthypilpypffpapls 
sshnyhytlylyenilldstye: 


9336 


23237 


A 


9405 


382 


219 


WARWHGFLVPTS PGGEAGGSLE PGKRRV 
QLTQISPLPSSLGGKVGLPLKXKKNN 


9337 


23238 


A 


9406 


3 


195 


YLYSLVETGFHHVGQAGLELLI SRDPS A 
LASBSAEITGMSHSIQPKSAFYIQIPBT 
FWQPAFK 


9338 


23239 


A 


9407 


371 


1 


FRRLRLKNRLNLGNGGCSE PRLCHCI PA 
WETRAKLGRKKKEERKREKGTKMYNKMY 
KMYLELNLTCPEPASPKQMTRQIjEVI*SS 
NPSPGTFQFICKKQFKFSCCFALRVSIY 
LLMSVFEYCSC 






A 


9408 


326 


229 


RQGLTMLPRLVSNSWAQVILPPWPPKVL 
GLQA 


9340 


23241 


A 


9409 


81 


231 


LESASFPPGKVAHTCNPNTLRGCAGWIA 
WAQKFKTSLGNMAKPCLYKIK 


9341 


23242 


A 


9410 


367 


256 


HPGWE WWLVSVI PTLREAKVGRSLEPRS 
LRLAWDTWG 


9342 


23243 


A 


9411 


2 


192 


VFLVEMGFLHVGQAGIiELPI SGDPPASQ 
SAGITGVSHHAQALTFLNIPINPSHPPK 
ACCSKQL 


9343 


23244 


A 


9412 


3 


83 


KMCLNLGGGGCSELRIiCHCTLTWATE 


9344 


23245 


A 


9413 


371 


301 


SDHPRMVILRLLRYIVRLVWRMH 


9345 


23246 


A 


9414 


486 


337 


AGLELLGS SNLPALTSKIAE ITGMCHHT 
WLAYI FVC^FGWVLLLiDLCLAF 


9346 


23247 


A 


9415 


378 


254 


IVAHCSLDLLGSSDPLTSASRVAGPTGM 
CHHVWLIWFFVC 


9347 


23248 


A 


9416 


160 


309 


WRYSSNEFLLFLLFYLFKSNFTVFCCC 
YCCCCCCWRQSLTLPPKLECGV 


9348 


23249 


A 


9417 


162 


339 


RPSSFCEFLHDSVPELAIiPMKARFEKSR 
RSQVSWLMPVIPTFWEAKVGRYQARSLI 
PA 


9349 


23250 


A 


9418 


1 


147 


GGGGCSEPRSCHCTPAWATRVKLGLFKK 
KKKIKFLFVPPFIFLKSKTGV 


9350 


23251 


A 


9419 


283 


126 


KMMYKGQ AQ WL I P I IPAFWEAKERGLLE 
VRSLRPAP^TEQDSTATKNKNKCXF 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
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W=^Tryptopnan, Y^Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Y=possibIe 
nucleotide insertion 


OKI 


152.31 


A 


9420 


133 


3 


KiiHr b WAW WIMP VI PALWfiAEAGGS RGQ 
EFKTSLPNMVKPRLN 


9352 


23253 


A 


9421 


3 


133 


LRGLYSGAGI IRFEAKKWNAIAIiWAWDI 
AGDNCAIGRNHIMDL 


9353 


23254 


A 


9422 


409 


291 


ILILGGGSCSELRSCHCTPAWATRAKHR 
LuSa &o P A W V x 


9354 


23255 


A 


9423 


I 


210 


RF FHLGLS KHWD YRCES PHAGTRNE VL I 
YATIWMNSENIVPNEI IQTQRTNIVS FY 
uYQVGRIGKFIDRR 


9355 


23256 


A 


9424 


479 


306 


GVPTNQPWSNRVKTPSPTKNIKISWPWW 
CMPCNPSCSEGQGRRIAtiNPGRWRLQ 


9356 


23257 


A 


9425 


1 


209 


GFFLFQGGGHHAGQAGLELIiTSGDLPAS 
ASQSAGI TGAGPNQGI FVALES PMESAA 
LLHLHWLSSPCS 


9357 


23258 


A 


9426 


628 


433 


RQGVAHLPGMECSGTITTHCSLELLGSS 
DPPASASQSSRLAWSHCAQPSLLFSVL 
CLKPIFCCA 


9358 | 


23259 


A 


9427 


417 


125 


FLRQSLTSSPRLEYNGAIPAHCNLCLPG 
SSDSSIIiASQSIRITGVNHCTWPDAAVL 
RQNFFFSEKPQFYSEGFQQTHI IKSNLL 
YLKSTKCTCYPYL 


9359 


23260 


A 


9428 


366 


256 


SFFYRDRSLATLPRLGSNAWSQAILPPG 
PPKVLRLLA 


9360 


23261 


A 


9429 


3 


140 


GFCHVGQAALKLL I SSDLPTSPSQSAGI 
TGVSHCARLTRLFS SHE 


9361 


23262 


A 


9430 


368 


3 


VRASQLLCLPTQASAMAGTPPPASLPPC 
SS I SDCCASSEQGSMGVGPSEPGAGYNL 
WCSSLRTLEKCS IRVGVPQFSRCHLSQ 
I^tiARKGNSPTPCASWVRRCLTLLQLML 
RGLYPLSCT 


9362 


23263 


A 


9431 


3 


131 


LTMLPRLVSNSWAQGSSHIiSPPKCWDYR 
CEPLHSGLKDI FLE 


9363 


23264 


A 


9432 


384 


274 


LSFAMLPRLVLNSWAQESAHLGLPKCWD 
YRHEPLGLA 


9364 


23265 


A 


9433 


431 


273 


PPP LGYFF I VKME FPP WQDGFKLLNS S 
DCSTSASPSGRISGVNHSTRPRIF 


9365 


23266 


A 


9434 


211 


473 


CVCTFFS FLSMHI FFL FLVGEAESHCVT 
EAGVQWHDLGSWQPPPPRLKRFSCLSLP 
SSWDNRRVPPYPANFCIFSRTRVWSCWP 
GWS 












loo 


FE FKDN I R WI KLVE KAHTQ YGN I Y I CV I 

WTUTDT T7VDVXTTJ 


9367 


23268 


A 


9436 


331 


115 


SRDRGFAMLPRLVLNSWTQA1 CSLLDSQ 
bArluo li P lU-WN Y KKE FXjCPAovjE I LiSFI 
KPSSSKSFHNSTKKRQ 


9368 


23269 


A 


9437 


3 


109 


IYGSYSLPACFFPSFLPSFLPSFLPSFP 
PFFLSFF 


9369 


23270 


A 


9438 


2 


167 


ECNGAISGHCNLCLSGSSDSPAPAWATR 
VKJjRIiI KKKKKKKNQILFFYPPRKTLF 


9370 


23271 


A 


9439 


648 


310 


VCPC ILRHPAFLFAKSGNPLESSLLDHS 
S PTFKLTDRENLRHV KCTSHSQS L PTSG 

GYRAPVGRRGPGSRNSWRLHWTPITSVG 
D 


9371 


23272 


A 


9440 


1 


170 


YE I WFIiDVGQAGLELLTSGDLS ASGSQS 
AGTTGMSHHARPLKL FTI SVS SNKKVKC 


9372 


23273 


A 


9441 


133 


244 


LFGYQTRGLAMLLKLDSNSWPQAVLPPQ 
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PPKVLGIxLA 


9373 


23274 


A 


9442 


365 


176 


RGSMDMVPSESCTGYNLIMCPIiLRPLEK 
RS IRVGVTRFSRCHLSQLPIiARKGNSLT 
PCASQVR 


9374 


23275 


A 


9443 


368 


231 


VEAGFCHIAQAGLTFLSSTDPSALASRS 
ARITGVSHHARPIFLKE 


9375 


23276 


A 


9444 


88 


194 


KMLEAKKKIEQQGGFTFENKGVL5AFNF 
GTVPSNN 


9376 


23277 


A 


9445 


3 


349 


IRSGSYACYFCYPRECYALFHCYSMIVS 
YGPYYFMNIDQRHESRLSTIYIFKFEIG 

SKHYAQAGLEUjCS ssppaftsqsagit 

SVSHHTRPQKI FNVT YSCDLHFHPGIEG 
PQR 


9377 


23278 


A 


9446 


3 


141 


HAECSSMMIAHCELELLGSTDPPASASQ 
SAETTGISYHAQPYSTF 


9378 


23279 


A 


9447 


2 


282 


ICPTRSRFPPRVLKNGRKKDRLECSGAI 
SANCSLCIiPGSSDSPASASRYAGSTVAE 
EDSTRRAVRHQGS SRSAPSGRGSWAGKQ 
ERADRAQQS 


9379 


23280 


A 


9448 


307 


430 


GLAWWLTPGIPILWEAEVGGSLEVEFET 
SLASMBKPSLYYK 


9380 


23281 


A 


9449 


2 


253 


IQCVCVCVCVCVCVCLSVCLHFIAILGC 
SPWGSRLPCSVSSPSGRGPGGEDLSPLP 
PTMSVSPEADFPPLAAAAPGNIMTAIT 


9381 


23282 


A 


9450 


1 


166 


NTCGGGE E I RVFS S CFAEDRLQKGKDRP 
GAVAHACNRSTLGGRGGWITRSQIFFC 


9382 


23283 


A 


9451 


1 


123 


NTRGLVMLPRLVSNSWAQVILQPLPPKV 
LGLQASVTVHINN 


9383 


23284 


A 


9452 


396 


291 


SLCIKRVCACACVCVCVCVSGAKLRKLV 
LDFMRS 


9384 


23285 


A 


9453 


1 


147 


FAIRARFSNTYTKIGTIHTYIYIYTYIY 
TYIYTHTH I YILSQVLMEYE F 


9385 


23286 


A 


9454 


394 


235 


TRSCYIAQSWQWLFTGTLI IHYNLQLL 
SSRDTPVSASQYLGLQAHTTTPDSC 


9386 


23287 


A 


9455 


1 


85 


TNPVATLLEWQDALCFI PVGSAAPQGS P 


9387 


23288 


A 


9456 


339 


165 


KTHWWGKKSFLKNPFFFFFFFETKSRSV 
TRLECSGVILAPKQKTKKQNDKTTINKT 
I 


9388 


23289 


A 


9457 


3 


209 


LLILFSIKMRNRPDSPALLSTQIiPPIFI 
YIHTHIYLYIYTHIYIFIYIYLYIYLFA 
YIHIYVSLFPLK 


9389 


23290 


A 


9458 


374 


156 


QAGLEIiLTSGDPPISASQSARITGLSQR 
AQLSELTLTVFTTRHCTCXCAHFLYYTV 
FLEGRNDDEFISIPST 


9390 


23291 


A 


9459 


287 


3 


EAGLIiWRKATHKYAtiTHl'HTHTHTHTHI 
RGLQNM INCKYITYQE I MTLAQAKNKGV 
IPDSFPFTSSPPLTYPPALSFHFQKHIP 
NKTTTYHGHC 


9391 


23292 


A 


9460 


3 


72 


KTELKMKFTNILESSFLMNNKS 


9392 


23293 


A 


9461 


327 


92 


ISLFLVWGALFFPKKI I YPPKKKSPLFF 
PPPPPQENFNPPSFLAPPVFPPQENPKF 
AGGRKI FCPPPPVFFFYFFFFF 


9393 


23294 


A 


9462 


3 


173 


ACRGDSGGPLVTRDLKDTWYLIGIVSWG 
DNCGQKDKPGVYTQVTYYRNWIASKTGI 


9394 


23295 


A 


9463 


258 


39 


LKVTLPSDPS SI FVCLFLENESRSVTQA 
GVQWHDLGFKRFSCIiSLLSSWDYRRVPP 
PPAKFCI FRKHGVSPC 
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9395 


23296 


A 


9464 


104 


336 


NHVS QCEE I FLFDEYYI FSRAVASMFWV 
FFLGKKVFFFSQNGGLGRPGQ 


7J7v 


7*5907 


A 


QA£< 


410 


0 


IjN JjGGGUC£> eprlcf ctpawate 


9397 


23298 


A 


9466 


254 


61 


KKKKDAFFTI PLYPSSQNLFAFTQMDPD 
TNISQQLTWTVLPQGFRHSPPTFLDKPF 
KQTSINFA 


9398 


23299 


A 


9467 


403 


255 


PAPQIiGGPPMMPMMAPPSSWMMPMGCAP 
AMRLPIGGDI IWLCVPTQI SP 


9399 


23300 


A 


9468 


1 


1428 


P ARFKI) LNLAGTAEVGLAG Y FMDHTVAF 
RDLPVRMVCSSTCYRAETNTGQEPRGI*Y 
RVHHFTKVEMFGVTGPGLEQSSQLLEEF 
LSLQMEILTELGLHFRVLDMPTQELGLP 
AYRKFDIEAWMPGRGRFGEVTSASNCTD 
FQSRRLHIMFQTEAGBLQFAHTVNATAC 
AVPRLLIALLESNQQKDGSVLVPPALQS 
YLGTDRITAPTHVPIiQyiGPNQPRKPGL 
PGQPAGPRVGYKALQPWPAQRLGPVGRP 
PCRPEGQPLSCCCCGCSCWGRtiTRCLAE 
VVTEVLTVGQVQRGPCTALLHKELCGTE 
PKGCASTEEKGLLLGDFKKQEAGKMRSS 
QEVRDEEEEBVAERTHKSEVQEQAIRMQ 
GHRQLHQEEDEEEEKEERKRGPMETFED 
LWQRHLENGGDLQKRVABKASDKETAQF 
QAEEKGVRVLGGDRSLWQGAERGGGERR 
EDLPHHHHHHHQPEAE PRQEKEEAS ERE 


9400 


23301 


A 


9469 


265 


170 


GTESHSVTQAGGQWLtNLSSLQPLPPGFK 
QFS 


9401 


23302 


A 


9470 


289 


110 


RDLELAS FLS FFPS FLPSFFPSFLPSFL 
PSFLSFSFLISFFPLPSFFSFISFSLSL 
FFE 








y4/i 


67 


210 


TVTTCQALTHAYNPSTLGGRGGQI TRGQ 
Ural i Jj V f ljUixNr bKVLAbb 


9403 


23304 


A 


9472 


2 


186 


TSLWRQAKKNVEKKTDKYTEVLKTHGLL 
V V- i :y j\^> Lor iuuS J\iM iN-ftn l/jJxyANAPARK. 
VEIRC 


9404 


23305 


A 


9473 


417 


265 


FLSSFLFFFFFFFFFFFFFFFFFTV 


9405 








'toil 


1 /X) 




9406 


23307 


A 


9475 


1 


150 


PTRPWQAGFKVLSSGNPPALVSQCFRI 


9407 


23308 


A 


9476 


404 


109 


PPPLFFNFGPFFIiVEARLHHVGQAGLKIi 
TLY 


9408 


2330Q 


A 


Q&17 

y*tf i 


jyi 




DD PUPPPD nDfiT.TMT.DDT. T CMOU AmrTT 

xr fee r r IJKUJK&xu lTwJrKJbJLoJNo ViftyVXli 
LPWSPKMLGLQARATVSGIiNKRYS 


9409 


23310 


A 


9478 


438 


277 


KRAPFFFFCRDRGLTMLPRIiISNSWAQV 
1 UJjFW £> F KMLfc JjQAKA I VbCjUJ NRYS 


9410 


23311 


A 


9479 


399 


279 


PPPPVFKNPPPKKNKGAPKKKNISPPPG 
RKKYFFFKGPP 


041 1 




A 


C\A OA 

9480 


247 


26 


LKAGDDSRTVLGKVHQSPCFSLLLFFIQ 
VSCRDEGLTVLPRLVLNSRPOAILLPQL 
PKLLDHRCEPPKLANHG 


9412 


23313 


A 


9481 


2 


176 


rLtXJT CCltTniri C IXC C t>r«XVaI»JWX V JL \£ V ^ 1, Mill 

DSSNP PALTSQSAGITGENQHGHPMNCL 
SG 


9413 


23314 


A 


9482 


2 


178 


HAPPHPANFLFFVEKSSCYVTQVSIKLL 
DSSNPPALTSQSAGITGVSHHGQPMNCL 
SG 



1114 



WO 01/64835 



PCT/USO 1/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

Od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nude- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
E=Glutamic Acid, ^Phenylalanine, 
G=Gtydne, H=Histidine, I=Isoteucine, 
K=Lysine, L=Leudne, M=Methionine, 
N=Asparagioe, P=Pro!ine, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


9414 


23315 


A 


9483 


2 


151 


GRVGEGPELVSLLVLDQHPVGTSDHRDA 
TQLGFQPLMSDLGSTGSTEGA 


9415 


23316 


A 


9484 


420 


325 


FFVFLLETGFHHVSQDGLDLVICPPRPP 
NTFV 


9416 


23317 


A 


9485 


400 


104 


LFFFLFFFFFFFFFFFFWNPPKTFFKKV 
FGLNFPTLLKLKFNPPPPGPPKKKKKEL 


9417 


23318 


A 


9486 


117 


280 


SSTNQANNCFFFFKKRGIiAILPRGALNP 
WAKGILLPWPPKGLKLRGGAIGPGQK 


9418 


23319 


A 


9487 


76 


353 


VLWNTLEFCLI YLDLIiFLLFFNKEALI R 
KKKKKKKKKXKRGGNKKEKKKKTPWQEK 
AGLFLGDT I I YSRGAVLKNSGGETPRHT 
PSGGLWPR 


9419 


23320 


A 


9488 


3 


122 


HASEDNVFLHCSHSDLGYVIiIiAGTASLQ 
DFSFFQKSPPP 


9420 


23321 


A 


9489 


1 


155 


PTRPTVCSGAMLAHCNLCLPGSSESIiAS 
ASQVAGTTACTSTPDPFFFFFSL 


9421 


23322 


A 


9490 


3 


212 


BTGFHHVGLSGFKLLTSGDLPTSASQSA 
GITGMCHCVWPQFAYFCLLLVYLDHLKF 
TIIMLVLCLLFYD 


9422 


23323 


A 


9491 


2 


163 


ETLFRHVGQVS FE PLTSGDLPAS ASQSA 
GITGVSAMPSWMIKFFIAFYSCINC 


9423 


23324 


A 


9492 


474 


296 


SQPAGTADCLLPLPPGSCFPTI PCPAAL 
VPLPSFPSARLWPPQSLHQLLPLPGTCS 
SCT 


9424 


23325 


A 


9493 


7 


151 


LCAVEFVYMKPPAHEEAKAPSRGFWRY 
AFWTASSSEKGRDDGMGPRA 


9425 


23326 


A 


9494 


52 


363 


TTAPSLSVFLFTFTGELFVYWWGVTLE 
TSFISFFLFFFS FLEKDFS FS PAGRPWP 
NFGLIQPLASGVGGISRPNPPSRWELRA 
LPPPPGGLGFFFWKKGGPPG 


9426 


23327 


A 


9495 


383 


118 


WKPRPWVPPGAGVQTPPGPQGETPLFLK 
KKKKRGRGGPLFGPPSFGGGGGEIPLTP 
APKGQEIYFPPPPPPPQGKKKSLFQKKK 
KRIR 


9427 


23328 


A 


9496 


554 


419 


FFFETKPCSVSQAGVLWRDFGSLQPPPS 
QVQRDSPCPQASQVAWD 


9428 


23329 


A 


9497 


428 


281 


GSYVQLTFKYFS I Y IYVHTKLHVYLYTH 
IYVYVHICIYKEGYITGSVLA 


9429 


23330 


A 


9498 


144 


16 


KIV1CKKVSQCPGIVAHNCNPSTLGGRGGR 
IAWGQEFETLSLQKN 


9430 


23331 


A 


9499 


2 


155 


NIALLGSSDPPVSASPVSGIRGVHHHAQ 
LIFFFFFGKRAWFNPPGGRKGEE 


9431 


23332 


A 


9500 


2 


184 


FIYGRDSLTMLPRLVLNSWPQAILPPRP 
PKVLGFRCEPPHLASIjILFYRDILFTIiG 
LKSV 


9432 


23333 


A 


9501 


465 


153 


PHKVFLNKGPFETPQGELSRGEWKTRPP 
PPRERSRPFHRKGGFGSPPKLRQGPPFV 
FFSWPPWCPPPKKGARPQQKGFFYFPPP 
FFNPPFFFFFCLFFIFEIV 


9433 


23334 


A 


9502 


429 


11 


CRTLLGADCVOTCVCVCVCVOnCAFPPI 
SYSCHLVPSDLPLLGVQFKCWVHFGLSF 
LTCPFPSPPWPGGAQPGSCYV7GLGCLCL 
FACKIAjARIjIisQa JuGlPASGPG rWATPli 
GS CAS VLRRLPGGS ERDTDLKVS VIiSG 


9434 


23335 


A 


9503 


436 


202 


LPGGYIFQGEGVEKTKFSPLLRENNFFS 
NFFPPKKNNGGGPRIPPPPPPPLFKKGV 
FFIPKFPAPPRPKKKFPKKKKK 


9435 


23336 


A 


9504 


3 


112 


S FDLLS SS DPPALASQSAG ITGVSHSHG 
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SEQ ED 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, L= Leucine, M=M ethionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, R-Argtnine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QKVYICNM 


9436 


23337 


A 


9505 


426 


208 


VGHNLLVPRFLSPSEKRS IQVGVTRSSR 
CRLS PLSLTRKGNSLTP FASRVRQCLTL 
LRLARAPTDLRPLSGTP 


9437 


23338 


A 


9506 


44J 


189 


CPANFFVFLREMGPHHVGQAGLLTSSDP 
PASASQSAGITGLSHCARPTVGSDVLSL 
PSGQCTSALTGFMLGSTTVHSNAAARA 


9438 


23339 


A 


9507 


117 


278 


NKALLCHPGWNH I KKVNGRLGMETHTCN 
PTTLGGQGQEFENSLGNIGRPHLYK 


9439 


23340 


A 


9508 


3 


126 


FFVEMGFRCVGRAGLRLLTSGGLPVLAS 
QGAGITGVSHCI 


9440 


23341 


A 


9509 


421 


134 


FLVERRSHHAAQASLEILSSSDSPTSAS 
QCAGITGMSHCARPKYTSFFRLHRHMPM 
ATPTKLPPTLASAS PQVKVCLYPLLPST 
SKGEIFTLQLY 


9441 


23342 


A 


9510 


2 


200 


PQPPTFYPSFLPSTPEPPAFYPRLPPST 
PASRLLPQPPAFYPSACHLLPQPPTFYP 
SLGNRARLHL 


9442 


23343 


A 


9511 


414 


204 


ERDRKKERKKEGRKERERERKKEKKRKK 
KKERRKEGRKEGRKEASKGQRVWIMVD 
T 


9443 


23344 


A 


9512 


1 


211 


FHHVGWAGLGIiLTSGDL PASASQSAR I T 
GMSHHARHRHGFDNLSLCALAPSPTPLW 
RRHRRRGHFTVPEK 


9444 


23345 


A 


9513 


390 


265 


VFLVEMGFHHVGQAGLELLTSGDLPPLE 
SQSAGITGVSQQFS 


9445 


23346 


A 


9514 


438 


180 


IGFRHVGHAGLDFLPLGDLPSLASQNAR 
I TG VS HRAWP AMAFVNCHGAGG S VAMRT 
TRGHSCRHLGIGGFWPASLLQPVLSAGS 
L 


9446 


23347 


A 


9515 


126 


11 


RNLDFFbSLGLAMLSRIiVNSWPQVIFQA 
WPPKALRFQV 


9447 


23348 


A 


9516 


2 


249 


FLHVGQAGFVLPTSDDFRPTHTPQSSGI 
TGVSHRTRPLRCYFLRLNNI PLYVYAYY 
ILFIHSVISGYVGGCTAHYCHPRKW 


9448 


23349 


A 


9517 


403 


57 


PEGALVPQKKKIOaaaaCKKKKKKKKKKK 
NIKI 


9449 


23350 


A 


9518 


406 


140 


PHYiraTFVYNGVRLEDLQHMCLELrVWD 
REPIiASNDFLVGVRLGVGTGEIPSPILT 
CLKAS CLFYS YYHS RP PTQS S SVMTWF 
FPFS 


9450 


23351 


A 


9519 


3 


75 


GIEWMVNCVVRNVHRPPRQRDIT 


9451 


23352 


A 


9520 


183 


6 


LLTHLILTSALIGEIMVAPILQKNSGSE 
RECRLPRLVLNSWPQGILLPWPPKVLGS 
QV 


9452 


23353 


A 


9521 


89 


339 


LQCLQKSGKLEPGMTYTKLIDADVNVGN 
ITSVQFIWKKHLFEDSQNKLGAEMVUNT 
SGKYGYKSTFCSQDIMGPNILQNLKPC 


9453 


23354 


A 


9522 


3 


329 


PSLMFPWLKLSFPGRPRGRQTYSRYQTL 
ELKKB FLFNPYWTRKRRME VFHALGLTE 
RQGKIWFQNRRMKGKKENTRDNLPEAQN 
EEKVEEKGIEEKEKEEEEKEENRD 


9454 


23355 


A 


9523 


1 


156 


AKTSMNVMEIFMAIAKKLPKNEPQMPGA 
NSARGRGVDLTE PTQPTRNQCCSN 


9455 


23356 


A 


9524 


1 


272 


AKNSARALLGILGYRSHLISSLLCLEGI 
ILSLVIIATLITLNTHSLLANIGPIAIL 
VFAACEAAVGLALL VS I SNT YGLDYVHN 
LNLLQC 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
eorrespon 
ding to 
lasx amino 
acid 

residue oi 
peptide 


Amino acid sequence (A=Alanine 
C-Cysteine, D==Aspartic Acid, 
EMJlutamic Acid, F=PnenylaIanine, 
G=Glycine, H=Histidine, I=Iso!eucine, 
K=Lysine, LpLeucine, M=Methionine, 
N=Asparagine, P=Proltne, 
v^f=\jriuiamine, iv— Arginincj ^— oerinv, 

T^rTh rAnninf* V=Valinf* 
a~ l urcvniiicj t — t miiic, 

W=Tryptophan, Y^Tyrosine, 

Y^flnLnnwn *=.^tr»n rnHnn A=nn<*cihle 

nucleotide deletion, V=possibIe 

nitr1t*ntif1f* incprHnn 

IIUL.ICUI.IUC *M Jti UUll 


9456 


23357 


A 


9525 


1 


62 


GNDAWVCTNDMFRLALHNFG 


9457 


23358 


A 


9526 


1 


111 


GDI IHNENMRQGRDKYKTLRQIRQGNTK 
QRIDEFEAL 


9458 


23359 


A 


9527 


427 


340 


I PIGNI MSCDSHSHHELVMS AQLNAI FH 


9459 


23360 


A 


9528 


439 


27 


PRRTDKWLTVEMIQDGHQVS LLSGELTV 
DQRASIIQRFRDGKEKVLIPTNVCARGI 
r\xn/r\\7T i T\n7T , T7T>T.P\7vrv^T^PPnVR r T t VT.P 

IJ V ts\i V JL X V V 1 c JJJjir v j\yvxE*iiJr UlCi i J jjxv 

RIGRTGRPGKKGI1AFNMIEVDEI1PSPMK 
IQAIiFDGRNKQIjNABVPCIQAAMED 


9460 


23361 


A 


9529 


1 


236 


K rjfXi 1 j-L J. 1 Si vLAKvlUvAyv 11V ViNCUbr 

VKQGEEPDYETYLHSIGRTGRFGKKGLA 


9461 


23362 


A 


9530 


437 


240 


QQVSSARPEPEAGEVSPPVGAGVNSNSW 
AFKYGPGNPKQSGPEPKKQTQVSFIiLRR 


9462 


23363 


A 


9531 


413 


260 


LFDQWKVASPSGLRTSTVRYGDPEKLK 


9463 


23364 


A 


9532 


431 


260 


ASSGQMFSVKQQAPPVRDLPPFGISLFG 
GQIYDNISPRPPQGIVAPPGGRGNIPNL 
G 


9464 


23365 


A 


9533 


2 


134 


I HGVCVHMCV CACV CVYACV CVGGYDGR 

T17T TVT.QOT^rDCMXTWTM 
LcLxH 1 too IVroilVWAPl 


9465 


23366 


A 


9534 


419 


53 


LLLLTNGDRQTQREKI EACACQS YFDAV 
WGGEQREEKPAPSIFYYCC3JLLGVQPG 
DCVMVGDTLETD IQGGLNAGLKATVWIN 

XVNo J. V f li i\oo It v It XT ill VoO V liCiUrrtiJUv 

SIDCKVRMST 


9466 


23367 


A 


9535 


439 


109 


NCI VI IDSTPYRQWYESHYALPLGRKKG 
NAKISSLVEEQFQQGKVLAC IASRPGQC 

AtAI/aI V 4>uvfwEiliQr OJiRftA l\/>f\t\\Ji\ 


9467 


23368 


A 


9536 


1 


185 


NTTSSHHFTIPCLVLYSCTSTHTHIHTH 
THTHTHTHVRVFS YI PSQDVKEGCIPAP 

Q\IT7QW 
0 vrow 


9468 


23369 


A 


9537 


168 


280 


NPNS IRLTSCCCCCCCCCWKSPYNVYNR 


9469 


23370 


A 


9538 


142 




KIVKMGFPHAGQAGLKLLTSGI>TPGSTS 
QSAG ITGVSHRSRARS I FLYPYLPSVYL 
PWRRV 


9470 


23371 


A 


9539 


3 


139 


YIGGMGQAWWLMPGKPRQNDYIiSPGVQD 
QPGQHSETPSLQKGFKN 


9471 


23372 


A 


9540 


383 


318 


MSTQNNIQS YP IGEDEES BSD 


9472 


23373 


A 


9541 


372 


207 


YGLNMCRQCFRQYAKDIGFIKVGVCRRT 
PCVIVJP FGCdCl VPFR KROTDRSS LEW 


9473 


23374 


A 


9542 


2 


360 


ARGDDDHRSSDRKDGVLEEQIERLQEKV 
ESAHSEQKNLFLVTFQRVIMILTEHLVR 
CETDGTS VLTPWYKNCIERIiQQI FLQHH 

QFCALQA 


9474 


23375 


A 


9543 


379 


75 


TNLQQFAMS HGS AQVKGHGKKVADALTN 
AV2^iVDDMPNTLSALSDLHAHKLRVDPV 
NFKLI SHCLLVTIiAAHLPAEFTPAVHAS 
LDEFLASVSTVLTSKYR 


9475 


23376 


A 


9544 


380 


56 


RDLSMVRMKSMFAIGFCFTALMGMFNSI 
FIX^VVAKLPFTPLSYIQGLSHRNLLGD 
DTTDCSFIFLYILCTMSIRQNIQKILGIi 
APSRAATKQAGGFLGPPPPSGKFS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

aniino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 1 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, HHHistidine, I=IsoJeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=Glutaniine~ R=Arainine S=S**t- in*» 
Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X = Un known *=Stnn rod on /=nncctKlo 
nucleotide deletion, \=possible 
nucleotide insertion 


9476 


23377 


A 


9545 


304 


92 


SKSRVTNNTLRSTRKDTMITLNTHSLIiA 
NIVP I AI I VFTACEAAVGTiATjTjVf? T <?NP 
YGLDYVHNLNLLQC 


9477 


23378 


A 


9546 


394 


108 


KEIDFSLVDVTADDAGNYSCMYYQTKSP 
FWASEPSDQLE ILVTVPPGTTSSNYSLG 
NPVRLGLAAVTVVIMnAPT.VEAWY^RMV 

■i* ± ▼ *\ i ixj i Iran v ^ v V -U i ivjfu i-I ▼ XZaxWi J* OXvIjI V 

SPGESEAFKPB 


9478 


23379 


A 


9547 


1 


335 


GNAVGLNLHR I NFDKYHPG YFE KDRM KTT 
YHLHRNQSFCPTVNLDKLRTLFSEQTRL 
NAAKNKTGTAP 1 1 DAARS VYYKVLGKGK 
LPKQPVIVKAKFFGSLIPCFRIWILID 


9479 


23380 


A 


9548 


326 


188 


NFLRMGTVAHASNPS ILGGLGGRIAGAQ 
EFETS LGNITRPCL YFKK 


9480 


23381 


A 


9549 


381 


152 


VTLISLSLI^QWIGVLLWIARLNLNE 
VEKQWRLNQLNNAATILVFFTVVTNVF I 
TAFGAHKTGFLAARASRNPL 


9481 


23382 


A 


9550 


170 


65 


IHRS KNKWKFHLKDGIMNLNGRD YI FSK 
AIGDAEW 


9482 


23383 


A 


9551 


544 




ATEQDSSSKEKKERKNLKGNTVKSVPLP 
LTSHA 


9483 


23384 


A 


9552 


369 


181 


GVLGATSGDMQIHTFGSMIIGSITCMVS 
ELGYKFLTVSTADVMIQTLNLHLFSHMS 
HQQKKSY 


9484 


23385 


A 


9553 


1 


516 


S LVHWE FGQAIARRLKP FGVQRFL YTG 

WACSLTPATEGLCNKDFFQKMKETAVF 
INISRGDWNQDDLYQALASGKIAAAGL 
DVTSPEPLPTNHPLLTLKNCVILPHIGS 
ATHRTRNTMSLLAANNIiIiAGLRGEPMP <? 
ELKL 


9485 


23386 


A 


9554 


386 


56 


LFPACVLRWQFASDGGDIGFGVFLKTKM 
GEOOS AREMTEVLP SOR YNAHMVP RDG <? 
LTCLQAGVYVI^FDNTYSFJiHAKKIjSYT 
VEVLLPDKASBETLQSLKAMRPSPTQ 


9486 


23387 


A 


9555 


375 


150 


I WGKTSFHVS I PLS LFFPGD VLYDLLQ 
HILKQSKPRILFSPFFHPGNSIHTQPBV 
I LHQTHEEGTGRGFS FLA 


9487 


23388 


A 


9556 


3 


354 


CLPTQASAMVDTP P PALLP PCS LI SDCC . 
ASNEQGS VGIGPS E PGAGYNFLVCRLLR 
PLEKHS IRVGVTQFSRCXLSPPSLTRKG 
NSLTPCTSRVRRCLALLQLTLSELPHCP 
VPSV 


9488 


23389 


A 


9557 


463 


40 


PXKVEDPEPVKKPPKPriR c ?TTn?MPl?TTr* 
DE FNGVPS YMKS RLT YNQ IND VI KEINK 
AVTSKYKILHQPKKSMNSVTRNLYHRFI 
DEETTKDTKGRYFIVEADIKEFTTLKZUDK 
KFHVLLNI LRHCRRLS EVRGGGLTRYVI 
T 


9489 


23390 


A 


9558 


338 


104 


NFKKKKKRRTKKKE I RTEEKGRGANKKT 
FRGPKKKETRGSNRSPEGKKNHGAKKEG 
ERRPRKKKKKKKKLARYDFTCL 1 


9490 


23391 


A 


9559 


1 


152 


NTCVC VCVYTG FCHVAQ VAHE LS SSNLP 
SSASQSAGITGISHHAWPGFGF 


9491 


23392 


A 


9560 


517 


238 


SGPGBSVPHPWSAVRTRPAKKPG3QCLL 
QSAFPLWCSSRRTTWAEERKLNTETFGV 
SGRFLRGRSSRGGFRGGRGNGTPRRNPT 
SHRAGTGRV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
OGlycine, H=Histidine, Msoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopnao, Y=0>rosine, 
X-Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


9492 


23393 


A 


9561 


709 


280 


NTSPALTAPCGRPRGIRPMQCWARVPAP 
SSACPAWQLPSIiCLRSTVPAPHPSPAAH 
QPCBLSQHHRGGIiEPALPGTRQRDCSRT 
WHRCVRFLPGAQEAGELGPAACATGRLP 
VSHSTDMEQAPGLTSLHQEPHSTARGGT 
ATH 


9493 


23394 


A 


9562 


3 


130 


YRGFCHVGQAGLELLTSGDR ITS ASQS A 
EITGVSHRAQPQMS 


9494 


23395 


A 


9563 


409 


160 


NPGVKRSPCLSLSSSGGYWSMPPCLAYY 
KKKKRRFLKDGGLTMLPRLASNSCAQQF 
SHFGLPQSWDYRHEPLHIAQFFLFSLL 


9495 


23396 


A 


9564 


1 


241 


GICLGSNIGSGWMGTSSLCNPYSNTLNQ 
NLLSTTKPFPVPSVGTNCGIEPWDSGVT 
SGLGKMLFTjYYYYLLLFSQMTPQS 


9496 


23397 


A 


9565 


2 


215 


IHWmGWMDGWMDGWMGSCVRAVTGPGT 
SSPHTSTCGHAVCVCVCVCVYTCAHVFL 
ALFGVEKNNYFLWKA 


9497 


23398 


A 


9566 


244 


117 


DHQHLNWSVFISFSFKVIKARKNKKGIL 
NPDSSMETSPDFFF 


9498 


23399 


A 


9567 


414 


77 


FGGVPPGTLVP PTWI»PPVRVP P PPAFCL 
GELSPPPFFWRKGLAPPLLNFFFSPGQR 
GFPSPGGKGPRPLPRPAGFPGFVAKPPF 
RPPGFSSTPSDCGGLPPFFFFFFKKKVF 


9499 


23400 


A 


9568 


418 


198 


PPVSIiTEVIVSKSHKAELVFWNMPGTLE 
HPEGDNNYMEF I E VLTEALERVLLVRGG 
GSEVITIYSYPTLNARA 


9500 


23401 


A 


9569 


549 


415 


RQENRLNPGGRGCS EPRSCCCIAAWATE 
GDCLKKINNNKPKNEQK 


9501 


23402 


A 


9570 


546 


1 


KLECRGAITAHCSLNLPGPGDPPTSATQ 
AAGTTGTCH 


9502 


23403 


A 


9571 


415 


215 


SRLRKKIYIYIYMCVCVYVYICVYICIY 
VCVYICIWCICVYICIYVYICVYMYIY 
MHEFITFFSI 


9503 


23404 


A 


9572 


374 


138 


LAASAAIQELFKRI SEQFTPMFRRKAFL 
HWYTGEGMDEMBFAEAESNMNGLVSEYQ 
QYQDATAEEEGEFEEEAEEEVA 


9504 


23405 


A 


9573 


370 


233 


GSPPVRSVHQSGVSLSGTQVDEGVRSGS 
KRMVAPPGGRYNITSLS 


9505 


23406 


A 


9574 


3 


399 


YSSPGPIAIALRDPCAHGGPSPNSSLSP 
AQGHPARAQQGPQPQQGPPSHLGSDLRP 
HVAHTRGALHGPAPAPASPAPGGSGSGV 
AASMCSAPRPGRAGGTLGSVLTAHHPAS 
AGCAHRDAYVCKAPGRAVF 


9506 


23407 


A 


9575 


314 


48 


S ILPHSDPERCFS IQPEEGT IHTAAPLD 
REARAWHNLTVLATELDSSAQASRVQVA 
IQTLDENDNAPQLAE PYDTFVCDSAAPG 
QVSN 


9507 


23408 


A 


9576 


505 


264 


XFWEMEFHHFGQAGLRLLTSGDMPASA 
SQSAGVTDMSHCTRPDTPFNFKELPGAV 
VTFLLC^MQVGRDELEDDWMGAWS 


9508 


23409 


A 


9577 


508 


314 


PATWKGCLSNFSS YLTKPADFI EHQVLS 
WEQVPDG FFI FNERFKS FTWVLNNVAE 

orrnrvtrr T 
FVCKYKliLi 


9509 


23410 


A 


9578 


512 


1 


RKREAIFSPGRLPRGAAPLRGPAGTLVP 
WMPTPGECWLSLSVSAAGBKPYKCSVCE 
SAFNRKDKLKRHMLIHEPFKKYKCPFST 
HTGCSKEFNRPDKLKAHILSHSGMKLHK 
CALCS KS FS RRAHIAEHQRAHTGNYKFR 
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SEQID 
NO: of 
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M 
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nucleotide 
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ing to first 
amino acta 
residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G*=Glycine, H=Histidine, I-lsoleucine, 
K-Lysine, L=Leudne, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginioe, S=Serine, 
T=Threonine, V=Valine, 
w— irypiopnan, x— lyrosine, 
X= Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














CAGCAKGFSRHKYLKDHRCRLGPQKDKD 
PV 




01/11 1 




9579 


3 


487 


WDL I YVGRKRMQ VEHPEKAVPR VRNL VE 
ADYSYWTLAYVISLQGARKLIiAAEPLSK 
MLPVDE FLP VMFDKHP VSE YKAHFSLRN 
LHAFS VEPLL I YPTHYTGDDGYVSDTET 
SWWNNEHVKTDWDRAKSQKMREQQALS 
REAKNSDVLQSPLDSAARDEL 


9511 


23412 


A 


9580 


1 


840 


SKYLYDEGEEEEVFNDEWGAAGGVAVPT 
LNALERGFLSAMDWHLYTDPREIFEVLS 
WLESCVAEQQGRWRGWYTYTDLCVLLBQ 
PTWQIiALGSLCQRLVKliSCLIiAVAYVS S 
VALAVASVAVIHQSLGLSCTPTPGPPDL 
GLTSRCLLEPCIPSVPQCLPSIiANVSSC 
JjEGSMGUcSLWGSLLASLTPP PLPPPDP 
PAPPTLLHNCHLCQKLQRDSPTCHACLH 
FNKT V PTAIoSS PWYHTYGLAPP WPWS P V 
PLSLPQPQQCSLFSVMEIARLKSFVFPG 


9512 


23413 


A 


9581 


2 


158 


IHAGFIS VSVLDNI IYICMCVCVCVCVC 
AHMCACISEHITHAPRLPEVFYTP 


9513 


23414 


A 


9582 


3 


144 


YTAGVIS VSVLDNI IYICMCVCVCVCVC 
AHMSACFSEHITHAPMLP 






JV 

A 


9583 


503 


3 


GKCV CLS AVWGS LLSAFRGS LS VCCLG<2 
FVCLSAVRGSFSVCCPGQRVCLSAVRGS 
VSVCLSAVQGNVSVCLSAVRGSIiFVCLL 
SGAVCLSFCaj^GAVOiliCRAPTPCliSR 
SLAHRRCSVNVCEMTQLISVSYFFCWGW 
WRHRNNIHLQKCRLRILMAYTGQGPP 


9515 


23416 


A 


9584 


2 


1358 


AGLDS LHKFQVKI EKEKVYVRAS KQALQ 
LQRRTKVMAKCISPSAGYSSSTNVLIVG 
AETLRQEGFSDRIVLCTLDRHLSYDRPK 
IiS KSLDTQPEQIALRPKEFFRAYG I EVL 
TEAQWTVDVRTKKWFKDGFKLEYSKL 
LLAPGSS PKTLSCKGKEVENVFTIRTPE 
DANRVVRLARGRNVVVVGAGFLGMEVAA 
YLTEKAHSVSWELEETPFRRFLGERVG 
RALMKMFENNRVKFYMQTEVSELRGQEG 
KXiKEVVLKSSKVVRADVCVVG IGAVPAT 
GFLRQSGIGLDSRGFIPVNKMMQTNVPG 
VFAAGDAVTFPLAWRNNRKVNI PHWQMA 
HAQGRVAAQNMLAQEAEMSTVPYLWTAM 

VAFYTKGDEVIAVASMNYDPIVSKVAEV 
LASGRAIRKREVELFVLHSKTGDMSWLT 
GKGS 


9516 


23417 


A 


9585 


360 


239 


Q CRKGLGKLVSFGGLG I WSLIDVLL IGV 
GYVGPVHGSSYV 


9517 


23418 


A 


9586 


160 


1156 


MPRLTFAPKGWPHPPTSLHPGQVTDQTT 
WWLPQBLPTPSENSMPPGLSTPTASQEG 
AGPVPDPSQPTRRQIRLSSPERQRLSSL 
NLTPDPEMEPPPKPPRSCSALARHALES 
S FVGWGL PVQS PQALVAME KEEKES PFS 
S EEEEEDVPTjDQD VEOAT ,OTF a KT «?mTSfl 
NNYPTWRRTLLRRAKEEEMKRFCKAQTI 
QRRLNE I EAALRELEAEGVKLELALRRQ 
SSS PEQQKKLWVGQLLQLVDKKNSLVAE 
EAELMITVQELNLEEKQWQLDQELRGYM 
NREENLKTAADRQAEDQVLRKLVDLVNQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etfa 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue oi 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

UIDg W 
IU51 4UII11V 

acid 

residue of 
peptide 


Amino acid sequence (A—Alantne 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, ENHistidine, I=Isoleucine, 
K=Lysine y L=Leucine, M-Metbionine, 
N=Asparagine, P=Proline, 

0=/~21nf q minp R= A t*oin in<* SssSf^rine. 

^£ VfIU lallllltC, XV tXM gill 11117} u Cm Itlw, 

T=?Threnfiine V= Valine. 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 

ntif*lf*nti Hf* incprtiotl 

llULICUlIUv IllaCi Utfll 














RDAL I RFQEBRRLSELALGTGAQG 


9518 


23419 


A 


9587 


419 


130 


RRDPHLPRIMGRTRLNRKNAETRINAQIi 
PLTDKGRMARHVLDNSGBWSVTKRQVIL 
LHTELERSLKYLPLRCGVLTGLAAIASL 

xi x njj x r\ x Liiir i*i 


9519 


23420 


A 


9588 


2 


399 


FSCEFNMEANQCPLWBPSYPDLVINVG 
EVTLGEENRKKLQKIQRDQEKERVMRAA 
PAT .T .WQGGGVT RMAKKVEHP VEMGLDLE 
QSLRELIQSSDIiQAFFETKQQGRCFYIF 
VKSWSSGPFPEDRSVKPRHV 


9520 


23421 


A 


9589 


19 


205 


LYCLVEWHDLCSLQPLS PGFRQFSCUJL 
LSSWDYRHMPPRPTIKLSSYSS IKLIYY 
LFMLVL 


9521 


23422 


A 


9590 


500 | 


407 


KLTNEVRSVKRSQLFPES FMCKEKLGNT 
Nil 


9522 


23423 


A 


9591 


727 


236 


Kjs.oK. t JLJo 1 JFrs nftl trt ' t*- n itiiv.Kit i u 

SSFS FMAFFFTFMAQLVISI IQAVGI PG 
WGVCGW I AT I S FFGTN IGS AWML I PTV 
MFTVMAVFSFIALSMVHKFYRGSGGSFS 
KAQEEWTTGAWKNPHVQQAAQNAAMGAA 
QGAMNQPQTQYSATPNYTYSNEM 


9523 


23424 


A 


9592 


1 O 1 

381 


Z/o 


HPSVYKVASGLKEGIiSLFGILNRCHCKW 
GEKLLR 


9524 


23425 


A 


9593 


412 


156 


SQRCLSLGCHEHLANAYAI I ICDNGFSA 
txI.GPPL.PGWIYDIAOKYDFSFYICXSLLY 
MIGILFLLIQPCIRIIEQSRRKYMDGAH 

v 


9525 


23426 


A 


9594 


263 


50 


KHAAPPASLSLS LLLHHGQKRACFP FAF 
CRlX:QLLEGSPAMLPVQPAKIiLVLIiBQV 
CLLCALLIPPSGSRI 


9526 


23427 


A 


9595 


432 


309 


GTFSERGPPI^PRSQTVTVDFCQEMTDK 
CTTDEQPRKDYT 


9527 


23428 


A 


9596 


457 


29 


YLPSWRCSPSGALTHPAACPTGQRSSSA 
LLPSLPQAPPGFCGMTGLRPLPSPSVGC 
CSAPSPQLLCPTQLPPAPESSEGGCSES 
RCVANVTCx^n^LGDFIiEKCTNGEASALE 
YP 


9528 


23429 


A 


9597 


864 




P <3R R T, <? PTfTR TTHG.S S AACAPP P PRIiER 
RSRTSRPPWGPPHSASARTQRACTTWYS 
RSWACSCSWITSRPSWMTTKRTCMIFST 
I PTTPRTALWRGLSLWKIIAWLLTPJjKLA 
PSS PTSMPPTTTCTACSSTWMTCGS PAR 
WASTPMPRSSTT 


9529 


23430 


A 


9598 


421 


145 


LTQHNGDAAAS LTVAEQ YVS AFS KLAKD 
PNTILLPSNPGDVTSMVAQAMGVYGALT 
KAPVPGTPDSLSSGSSRDVQGTDASLDE 
ELDRVKMS 


9530 


23431 


A 


9599 


1013 


604 


PGRPTRPDICI^LERMEEQVKNVMKTFR 
EELYNIEKAFEVERQELIJ^NKKKWEQA 
LQAHNAKELEYLNNRMKKVEDYEKQLNR 
QRIWDCEEYNM I KI KLEQDVQILEQQLQ 
QRKAIYQLNQEKLEYNtAGAEEER 


9531 


23432 


A 


9600 


409 


191 


HHVGQAfiLELMTSSDLPSLASQGAGITG 
VSHCARTVYGLS I VFFSSHRGFASVRTP 
PSNENFPDRPISIAHP 


9532 


23433 


A 


9601 


367 


1 


PPRYSPPIiGGVRGGVPPGPGVLAPPFPK 
GKPPFFFKNPNLPRPGGGAPYSPFFGGV 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 
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26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Pheny!alanine, 
G=Glycine, H=Histidine, I=IsoJeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y==Tyrosine, 
X-Un known, *=Stop codon,/=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














GRGISFNPGGQGSPGQKKPPGPPRRGTK 
GFFSKKKKKESTKLPVTSHTTHCWPIjPE 
ALDLELDPLY 


9533 


23434 


A 


9602 


1722 


394 


GWNGSWNDNLVDTSPLKRDPLQDICRRY 
MEDLKKICFYRELNSKTTLKFVHTSFHG 
VGHDYVQLAFKVFGFKPPIPVPEQKDPD 
PDFSTVKCPNPEEGESYLELSLRLAEKE 
NARWLATDPDADRLAAAELQENGCWKV 
FTGNELAALFGWWMFDCWKKNKSRNADV 
KNVYMLATTVSSKXLKAIALKEGFHFEE 
TLPGFKWIGSRI IDLIiENGKEVLFAFEE 
SIGFLCGTSVLDKDGVSAAVWAEMASY 
LETMNITLKQQLVKVYEKYGYHISKTSY 
FLCYEPPTIKS IFERLRNFDSPKEYPKF 
CGTFAILHVRDVTTGYDSSQPNKKSVLP 
VSKNSQMITFTFQNGCVATLRTSGTEPK 
IKYYAEMCASPDQSDTALLEEELKKLID 
ALIENFLQPSKNGTGSGRSCLGVPPNTV 
MTLCX3AYGNRATRRNCHTLEPCG 


9534 


23435 


A 


9603 


14 


356 


DFVERTQYTHTQTHTHTHSHAVRLSPRP 
VQSRPEASQTGPTRTKPALGPAHPSPRG 
APCPDDPGSLRMLGYGSRASHNSRRGRW 
GHGRCKDQRAMGPHSHMWEAAIiPPGALS . 
SP 


9535 


23436 


A 


9604 


477 


264 


LVETGFLCVGQAGLELLTSGDPPTSASQ 
SAGITGVSHRTWAALSTFTVLCNYHYLF 
LEVFHHPKLKLSPH 


9536 


23437 


A 


9605 


1 


378 


EGINFSDNLRRCVVMVGMPFPNIRSAEIj 
QEKMAYLDQTLPRAPGQAPPGKALVENL 
CMKAVNQS IGRAI RHQKDFAS I VLLDQR 
YARPPVIiAKLPAWIRARVEVKATFGPAI 
AAVQKFHREKSASS 


9537 


23438 


A 


9606 


417 


254 


MVSLTQELCPVAMRVAEGHNKMLSNVAE 
RVTVPRNFIRGALLEQAGQDIQNKLE 


9538 


23439 


A 


9607 


404 


272 


PPKFLIiESGPIRVVDWWSLGALMYDMLT 
GAVGAQLKAACI IGLC 


9539 


23440 


A 


9608 


409 


205 


HMNGSLGSGDIDGLPKNSLNNISGISNP 
PGTPRDDGELGGNFLHSFQNDNYSPSSL 
PCSWSLFYSLPF 


9540 


23441 


A 


9609 


449 


263 


VWICPCDPPASASQSAGITGLSHHLPKC 
WDYRLEPPRPAHCFCFYSFTMRNRNLLS 
LVKYSR 


9541 


23442 


A 


9610 


366 


75 


TPGLKRSSCLSLMSRWGYRIEKKRTI IR 
SWEAIKEQDGREVDWEYFYGLLFTSEN 
LNLVHIVCHKKTTHKLTCDSSS I YYPQT 
RliKRKQPVRKRQ 


9542 


23443 


A 


9611 


1 


801 


PGYCGSWVFTCGALRQLSGGRDLSRGAR 
MGNSALRAHVETAQKTGVFQLKDRGLTE 
FPADLQKLTSNLRT I D LSNNKI ES LPPL 
LIGKFTliKSLSLNNNKLTVLPDEICNL 
KKLETLSLNNNHLRELPSTFGQLSALKT 
LSLSGNQIjGALPPQLCSLRHLDVMDLSK 
nqirs i pds vgelqvielnlnqnqisqi 
SVKISCCPRLKILRLEENCLELSMLPQS 
I LSDSQ I CLLAVEGNL FE I KKLRELEGY 
DKYMERFTATKKKFA 


9543 


23444 


A 


9612 


179 


329 


KGLAFCPPPGRGGGPPPPLWPPPPSYKK 
IFGLKTPKSGALSPPPPPPGNF 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQfl) 
NO: io 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acta 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alantne 
OCysteine, D=Aspartic Acid, 
^Glutamic Acid, F=Phenyialanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine t 
N=A$paragine, P=Proline, 
1^=1*1 uxanune, tv— Arginine, o— oenne, 
^Threonine, V=Valine, 

W — Trvnt/inhan VrrT \sm c i n A 

it — lrypiopiian, * — ijrru»mc, 

X=Un known, *=Stop codon, /^possible 

minliw\4'ir1 a italof inn i^tlflccinlo 

nuLievuiie uvicuuu, v — i>UdMute 
nucleotide insertion 


9544 


23445 


A 


9613 


A 1 A 

410 


Zy<s 


PP FTP PS HTiSTjRS s wd yrcp p prpanf l 
SLSSVIiHGT 


9545 


23446 


A 


9614 


1 


130 


NPGSHCVAQAGLQLI>SSGNPSASASQSA 
RITGVSHCAWPNTTI 


9546 


23447 


A 


9615 


1 


146 


NTWRLHMVPRLVSNFWAPGILLSWPPKI 
LGLQKRATTMPGPEIYTSIN 


9547 


23448 


A 


9616 


552 


26 


RliSIRNLLTHAFFAEDTGLRVEIiAEEDD 

CSNSSIiA±RIjWVJ5JL>FIUUjK^ 

EFSFNLETDTPBEVAYEMVKSGFFHESD 

CKAVAKS I RDRVTP I KKTRBKKPAGCLK 

ERRDSQCKSMGNVFPQPQNTALPIAPAQ 

QTGAECEATEVDQHVRQQLIQREPQQHC 

SSVTGDJj 


9548 


23449 


A 


9617 


420 


48 


FSQILRASYEFDSPFWDDISESAKDFIR 
nPTIT ROTRKNFARTHWKRAFNAT 

LULU -L 1 IVy J V JDy IXvIUM I. nI\A£lMl\IVAL IV* * 

SFLRHIRKLGQIPEGEGASEQGMARHSH 
<5RT .PAGOPPKW 


9549 


23450 


A 


9618 


190 


285 


RQGLTMLPWLVSNSWPQWLLPWPLKVL 
GLQV 


9550 


23451 


A 


9619 


392 


308 


SPIKPKIPLSAPRKNTNSVKYRLKFRFG 


9551 


23452 


A 


9620 


166 


23 


C LJNlr X ir CJOJjfOoJjror lorrlirur J_j J. ir r 

LPCFLPPFLPFLSFMTLTF 


9552 


23453 


A 


9621 


49 


374 


DRRGIRIMAAALFVPLGFSLLGTHGPSG 
AAGTVFTTA* YLGS KILLTCSLNDSATQ 
VTGHRWLKGRWLKEDS\LPGRKPEFNV 


9553 


23454 


A 


9622 


85 


472 


SHVFPPLRIiTLTLHSMAVPETRPNHTIY 
txtxtt KrnirTirpn^PT.TrirQr.VTVTPQOFrJWMT, 

4. JMNXuN UjB^L &AUfiiiAAO u ini r a Si " onnu 
DILVSRIIJN^QAFVIFKEVSSATNSL 
RSMQGFPFYDKPMRIQYAKTYSYIIAKN 
KGTFLSC/DRKLIK*TPQ 


9554 


23455 


A 


9623 


170 


2 


I FFFFCFERGSCS YA* AGVQWRS YS LL * 
S * TPG\ S SNTAS ATS VAGTTGL \HHHAB 


9555 


23456 


A 


9624 


2 


469 


PADPGKAGVPGVAAPGAPAAAP PAKE I P 
EVLEDPRSRRRYVRGRFLGKGGFAKCFE 
ILDADTKEEFAGKIVPKSTtTiTiKPYHR* K 
MFME I F IHRSLGHQHVXi * FHG YF * EHDF 
MS EELEL\ crprs fl f 


9556 


23457 


A 


9625 


661 


1393 


ASPPPSQTTPAAPGCSAGEVSLA\GIjCL 

qerrlvdvagfs I fi psryldhpqpska 
eqdasippgtheallqtalsppppptrp 

rnpg*cs*sgsphqwplphlp*grl*a* 
pspccpg*vg/pvhspqarrrp ITTSIS 
qrswkfwpkptvfretsggpwamprpsm 

hvvhtlx3eratcgdvrrahhspr 


9557 


23458 


A 


9626 


1 


286 


PTMAKLIALTI^GMELAIiFWNHQASYQT 
RLNALREVQPA*LPNCNLVKGF*TGSED 

T?TT DXfr*T.TPTQQf^T.'B''KIDMT*DT»'rVDVM 
\£iXJjirrfv7JJl T XOOVjJjX^JirWJ. RUUr \M 

SPGKMLLMDLNE 


9558 


23459 


A 


9627 


173 


491 


EGPLPLBSSSNWGQADLDKKSRELLWECT 
VYIiL*LNIQLPYNPVNPILGINPRETHT 
C\HQEMCTRI FIATLFTKAKA* K* PRCP 
*AREW/IK* / IWNIHI IEHY 


9559 


23460 


A 


9628 


351 


1 


VLGDAIQSRRGSSRKAGSYLLSRSP/CS 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E>=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, MMMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamme, R«Arginine, S=Senne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=TVrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \-possible 
nucleotide insertion 














EGGRGGCWAWNLGTMSSVASGHGAWVPR 
ELTS * AWPEPGPAGQREGRPPPTPCPPT 
CAHGP PAQP * PRGKAAYPG/ CLGXiQWPD 
*SAFNP 


9560 


23461 


A 


9629 


599 


11 


KMWR* *H*AKSQIRMAIPFTIAHK/RDL 
KSLKIQLTKEAKDIYNENYKIPL/NEIR 
DVTNKWKNI P\CS + IGYLYSTYKYKCIC 
LVNIYL 


9561 


23462 


A 


9630 


226 


I 


KCDNLGYSTFFFFFFPFFFFEIGSYFVA 
QVGV* WHN YSNS/ DASAFQVAGTTGVCH 
HAQLI FNLFVEVGSHYVAQA 


9562 


23463 


A 


9631 


241 ; 


1 


VSGCPLPRG* RPPPQPQSPPPSTPGTSD 
HPCTPCR/PALRR*SP**LPCLDLPPAS 
GPPSTSGNSTGTMQPT*PKLNSSSS 


9563 


23464 | 


A 


9632 


201 


1027 


MTLTPAPGQGQQSSQCLWRRPL/ PLPPG 
PDSEBPGSGAAG/PGSSL*SPDLPPAAP 
PKP*TSS *EMNTVGSTQEGSGLAQRTTQ 
HCPVPPST/RVDILGTO^PFPALCLTS 
APWLVRMPPSQRSSREIP*RLEAF/SLR 
TPIPAHSAQPSHQEKEP/PS PR* DHGD* 
P P P * P PKPTHQTLAGQAPRSGQGIiARPT 
LP YPPS PLRVTPRI CRVHV*GHLRRPRG 
PPPSSARRPTWDWRVGLPWEDLEWLCR 
GPAWLLARTSAPFSSLPTHACTGKKTTM 
PF 


9564 


23465 


A 


9633 


103 


379 


YHCITIIVLKCIFFFFPLKPCLILSPRL 
EARGIILGH*NLCLT\SQLLRRLIRKSP 
LTLEGGVCSDPRS PHCS PAWPTKGNFFS 
KGKKKILD 


9565 


23466 


A 


9634 


155 


405 


VPAFRSVTNISISVSFFFFETEPGFVTQ 
AEMQGLDIGNGQPPCPKLSSFA*ASQVM 
GTTGPRH/HC*ALIFLVKTGFPHVAKTW 
F 


9566 


23467 


A 


9635 


4 


423 


GAMRGDRCRGRGGRFG3RGGPRGRFRPF 
VPHIPLDFYLCEMAFPRVKPAPDETSFS 
EALLNRNQDLDPNSGEQASILSLVTKMT 
YE I DNLTYAPGTLD VL I EERQQGGP YIK 
GTMTT*RH/SVADLTVILQILTTWESVA 
S 


9567 


23468 

• 


A 


9637 


2 


492 


GSRAIGFGIiAHELERHLIKIWLAS IVR* 
DLA*LMNWSG\SHKDLAGKYRP I LEKAI 
QLSGSEQLQAFKAFEESMVNENVSLVTS 
RQMLTDFCTHTiPNLPDSTVKEIYHFTLE 
KIQPIVISFEEQVASIRQHFASIYEKEE 
DWRNAAQVLVG IP L * TGQKQYNVD YKLE 
T YLKIARL YLEDDD P VPAY 


9568 


23469 


A 


9638 


1 


417 


WQGLGLVMGSPSRRLQTKFVITCFKSV 
LLIYTFI FWITGVILLADGIWGKVSLEN 
YFSIiLKDKATIVPFl\LIASGSAI ILPG 
I LLCIFSV*HYPV*LLLVTFLI * IjLVFL 
NKWWISPHFLILTFNYLLFDIIIDNSS 


9569 


23470 


A 


9639 


125 


451 


IYRKPLKSIRRVGGLHLNKIFFFFLKQI 
WG WARGJjMP C * HFGRLKKGDCLTLG IGD 
/SRG/RYDHDPALQPRQQSKTLSLKINK 
IGWAQWLMPG I PAPWEG*AGRS FGAGI 


9570 


23471 


A 


9640 


670 


927 


GHVXiLPRRAGCAQPGTGPARAAPAAVQR 
LPGGGVQPSGPGGRAGQGQGLRGT*AGG 
NPGTGF/GGPHASALLPPSEAPGSLLPS 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glatamine, R=Arginine, S=Serine, 
T=Threonine, V=Vafine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














QPR 


9571 


23472 


A 


9641 


115 


460 


ISWIFLI ICVGFFGGGGGARVPLCHLSP 
RLECSS AIRARCGFDFPGSSDP * ASAS * 
VAGTT/G/SCHHVQLIF 










i 
i 


*tj i 


AMRAEVLVYCCRGAIjASGCVQAFRARY\ 
WVL 1 1 VD VPDNEQANAS 1 1 VKLTDS FTE 
QADQIiTAEVGKLLGE* KVDAILCVAGGW 
AWGNAKFKSLFKNCDLMRKQSIWTWTIF 
SHIATMHIiKEGGIJ^TIAVAMAVLDGTPG 
MIGD 


9573 


23474 


A 


9643 


154 


408 


IIMNAQSVEBDSILIIPTPDEEEKILRV 
KL*EDSDGKYGSKIPWNHLPEPENFRLL 
FKHVGYLD*SGLY\EIIiTKF* ILCVLCY 
Y 


9574 


23475 


A 


9644 


3 


403 


YSQFFGGPKLEDPIRPGVFSQPGEHRGH 
PFQIF*KVGLGNGARPGFPVLGKVKCKD 
NLKLGGQGCSRVEMHPRTSAWAIERDPV 
LKKKKRQR\EHQCSNEQKHTN 


9575 


23476 


A 


9646 


3 


399 


ARAMVLSPADKTNGKAAWGKVGAHAGEY 
GAEALERMFLCFLTTKTYFPHFDLSHGS 
AQVKGHGKKGADSLTNAVAHVDDMPSAL 
VALSDLHAHKLEVGPVNCKLLRH*LLVT 
LGG\HFPAWLTPGGIAFLEKF 


9576 


23477 


A 


9647 


22 


407 


APSAWGMCHFTEEDKATITSLWGKV\NV 
EDAGGETLGRLLWYPWTQRFSDSFGNL 
SCAS AIMGI PKVKAHGKKALTSLGDAI * 
HLDDL*GTFAQLS ELHCDKLHVDPENFK 
LLGNVLVTVLAIHFGKE 


9577 


23478 


A 


9648 


243 


2 


QIATTTLGPGSEGVGKRDLGSSSRHHSQ 
VES VKH\ PRDLAQDA*GHGSAPRPLCP P 
PAQSWHCLRGPCLAPHHPTLCEPSR 


9578 


23479 


A 


9649 


283 


196 


SYFI * IITTTHSLI \IS IIPLLFFNQIN 
NNLFSCSPTFSSDPLTTPLLILTT*LLP 
LTIMASQRHLS SE PLSRKKLYLS ILI SL 
QIS LI ITFTATELI IFYIFFETTLIPTL 
AIITR*GKPTPLIQ 


9579 


23480 


A 


9650 


10 


354 


QLRVITRLTWNN* TSA* SRGVS YP*AH 
\ SSDALHL *RELTEQRFI VSVNCAIAHC 
RHQSASEDHVKLRNEVTEFAQTODADBS 
AENCDKSLHTLFGDKICTVATLiRETYGE 
MAD 


9580 


23481 


A 


9651 


2 


404 


AFGTTKWVTYISVLFLCSSAYSRGVLRL 
DAHKSEAAHRFKDLGEENFKALALIAFA 
QYLQQCPF\EDHVDLVNEVTEFAKTCVA 
DESAENWDKI \LHTLFGY* LCAVACFRE 
T\YGBTAAC*AKHEPVEDB\CFLR 


9581 


23482 


A 


9652 


83 


396 


NIPGVGGGGVPIPLVWKYNISHTQR*RE 
PYIQSLRSGNWSBRMSHMVAR* RQRHHP 
YYI \LLQAQLSMA\SDPANVMMDS VITA 
GVLVA/SRRARRISKLGHRI 


QS89 








V f 




REGNHNTERNCKRP PQDTG PTQ / RPGP P 
PPEVPWQDASSAPTPAAAPVGKLLPS*G 
PPGSASCQGR*TLGSVHWL*GSPLPSPS 
AGGRALPYGPAQGQSQGHWENTGRGTPL 
PGSRBT* GHELTTGVEHQRHKAPREPQA 
GQPRLCPWVQSLPPT\ARSPSWSQGERP 


9583 


23484 


A 


9654 


1 


252 


RPPQLLASDSLPPSRPPSFPPSLriLSLR 
PPI*SRSLCIiSHYLCPPI*SLIQQLTSFPD 
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SEQ ID 
NO: of 
nucleotide 
sequence 




SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A 8 Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla!anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TFSPPKSTTSGPPQPEDSPSS*PPTPSP 
P\PPSLLSSLPPSI*SSAPPF/YVHSVSP 
TISAPLYP * YNS * PHFPI PFPPRKVQHL 
ARPSPKTARPPWTIRRI 


9584 


23485 


A 


9655 


1 


394 


SLRIYCFVIEVICFLYLL/FIMYSIS*G 
I*LLMNFTILCIGY*LLQYFFFVWKSF 
YFNFIRIPFILFLYLICLLVF*LIFFDL 
SYF/MCLRLI I FNLFY* LFLLYS I CLWN 
MVLLLISKLF* F* FCGRFALIQ 


9585 


23486 


A 


9656 


122 


371 


SPYLLUjQGGVSWIWLLNVESTNSSFL 
FPETGSHCAVQGGVQWCGRGSLQS*TPG 
/ SQS * VAG ITGMCHHAS * LEKFFFVEMG 


9586 


23487 


A 


9657 


425 


3 


KTSMMNSIVPHISIPTLNVNGLNVPLKR 
YRIAE*IKIHQPSICCLQETHPTHKDSH 
KLKVKGWE * I FHANG\QGVAI LISDK/T 
DFKATTVK/RDKEGHYI 1 1 KGLVQQENV 
TVI*NLCAPNTGAIJCFIK\QLLLDLRNEI 
NANT 


9587 


23488 


A 


9658 


179 


373 


VHSWLLWGSQSFVLFRPLTD*MRLTH/ 1 
MKGNLLESKSTSLSVNLIQKHPHRNIQN 
V/AQNIWVSWtfS 


9588 


23489 


A 


9659 


103 


373 


LKHVLRLCLQSERKLHYLFNIDQWGKFY 
TWDDLYECISSKFTTHFFFSFL/FFSF* 
SQGLALSPRLDCSDAIIAHCILEFLGSS 
HAPTSAS 


9589 


23490 


A 


9660 


216 


424 


TDINVTTKTIKPLDENIGITLYDLGLGS 
GFLDMT*T/AQVANEKIG*WDFIRNLKL 
TCIGHYHEVKKTNPW 


9590 


23491 


A 


9661 


276 


2 


PRIGKLPTNMI EPL 1 1 KEML INLAMRFH 
LSPVRMAI/IKIKKSNRCV7*GCREKGI\ 
IYHYWWECKLVQPL/WKSSLK/FIK*LP 
IDLSFVPAIPLL 


9591 


23492 


A 


9662 


239 


356 


ICIKRRK/WSGTVAHTCNPSSLGGQGGR 
IA*VQEFQPGQQ 


9592 


23493 


A 


9663 


1 


340 


RHEIiMTPHAFGAMKRVTFISLLFLFNLA 
YSRGVISRDAHRSEDAHP IKDLV*ETFK 
ALVM I AFAQHLQQR PFEDHV* LLNE \ VT 
EFAKTOTAEESAEKCDKSLHTLFGDK* C 
TV 


9593 


23494 


A 


9664 


759 


1039 


KRIPFGRPRRVDHLRSGVRDQPGIiHDET 
PS\LKIQKIAGQGGICL*SQLLRRIiR*E 
NHLNPGS RGCS E PRSCHCTPAWATE * DS 
ISKKKNTKIS 


9594 


23495 


A 


9665 


1 


355 


ITDLYSMFHFG*VDIiATITSLWGKGNEQ 
KAGR * TLERLLDVYP WTHRFFDS FGNLT 
SASAIMGNPIVKAHGKKVLTSLGDAIMH 
LDD/ LKGTFAQLSELHCDTLHVDPENFH 
ENPCDGS 


9595 


23496 


A 


9666 


364 


3 


APNI PPSVTRI PALLAFRSQCLCGSCPL 
PLATEPVGLLLSMSFQAAPCFTWWVTEA 
WLTGISWPPDFLISLLCI/HINYNAVIF 
RKYRPRVGAVAHICNPSTLGG*GGWIA* 
AUh FETS LGN 


9596 


23497 


A 


9667 


I 


357 


PISNAMRHIjGQEETATITSLWGKVNVBD 
AG * ETLGRLL VI YP WTHRLFDS FGKFLS S 
ASAIMGMP+V\KAHVKKALTSLGDAIKH 
LDDLKGTFAQLIELHCDKLFVDPENFKL 
LGNVLAT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQD) 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ10 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 

1 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
EXJlutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutaraine, R— Arginine, S^erine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9597 


23498 


A 


9668 


24 


369 


APRPDAMGHFTEE DKAT I TSLWGKGNVE 
DAGGETLARLLGVYPWTQRLFDSFGNLS 
SASAIMGNPKVKAHGKKVLTSLGDAI *H 
LDDLKGTFAQ\LSELH*DKLHVDPKNFK 
LliGK 


9598 


23499 


A 


9669 ! 


12 


365 


LLU4DGERK* FLETDSAPCBBAMNTVEM 
TTKNLE*SINLVDKAVA/ SGFERIGFKF 
EKSSTLGKMPSNSIACCREIFHERKS/S 
MWQTSFLSYFKKLRQA/PPAATTLNDHQ 
PSALRQDPSP 


9599 


23500 


A 


9670 


346 


2 


YPYDPWSLINNMFFLFKRFQNFFLKEGT 
LTFPFKGFRGDLSLFFWPPYKLFNLKSF 
L/CKFLEI*RYFFG*IIFFFPFFFFFFL 
RQSLTLSPRLECNGTISAHCNLCIjPD*S 
NSP 


9600 


23501 


A 


9671 


121 


364 


HKKKTAGRVQWPLLVIPALWEAKEADCL 
SSGVQGQPGQHSKTPSLP* PSKSWDYRC 
TPPHLASFCNFFSFFFGRDGVLLCCPGW 
P*TPELKQSA/ CLGLPKCWDY / RARATA 
PGLLFF\FMLKYVLVKPAVNIANAN*KE 
LSEPDVKNEICFV 


9601 


23502 


A 


9672 


437 


223 


EME/ FSLI*LiPGI£CNGTCIiAE*NIjRIjIiG 
S SDFP AS ASPVAG I TGVRLHAQLELYFL 


9602 


23503 


A 


9673 


100 


338 


KSRPSAVANLTPVIPALWEAEAGGSPEL 
KQFSLSLLSSRDHRFATPC/RG*FFIFF 
VEMEFHCVAQTOFEUiDSSSPPTSA 


9603 


23504 


A 


9674 


161 


1 


S FiiW KiiCiiRGAPS CMRCQIiA \ LL * DVS Q 
LGY/SGVRDPLEEAVCPFSDLKLRAGR 


9604 


23505 


A 


9675 


304 


1 


VPPLASQIjGDKRRI FFQKKKKKKKNKDK 
IHI 1 1 S I ILKKFDKX * YSL 1 1 K\TIj* KI* 
GME*TYLNI IKVI YDRPTAS I ILSGEKL 
KSFPLKSGR*QECPLL 


9605 


23506 


A 


9676 


111 


346 


SKEGSKEGSLEIWGFLVXiAFPFPFPFPF 
S FFFF\ * RQAGRPRiiECRSMI TAHCNLE 
LLGSSDPPASAS \QVAGTTGPxSP 


9606 


23507 


A 


9677 


3 


349 


ARAGRIIKDLWLGFFFFFFFFYPKKTPL 
FTRDKGGGKLQTAKKNNPHPVKREKKKP 
PPGGTRGGKNPPGNPFWGFGPQRFFFGG 
KTFGGGAHPPDPTLG/ PSPQS * KRPFGK 
RGPPP 


9607 


23508 

■ 


A 


9678 


38 


368 


QASLLKESESERKSPTSLTLSQKLGMIR 

KEKLFKEIKSAISVNTU1IRK/RNSLTA 
DMDKVWAWIEDLSSQNNPLSQSLIHSK 
A 


9608 


23509 


A 


9679 


187 


345 


Fr JLLic r S 1 ifio i ia v A^iUl^UWLlJlAjJalj^/l 
PPPG\SSNPPVSALWEARA3GSRSQ 


9609 


23510 


A 


9680 


192 


468 


ERTSLFTIHLUCLLTITOSAAMSNHV*V 
FV * T* IFIjSI/GQMPKS VTAGSWGQSIjFR 
FFK\NCQTLFHGGCAMLH/SHTQLNAIP 
LFRNLWQHLS 


9610 


23511 


A 


9681 


141 


367 


a<^vnVA5?P(^C^rMDDFAKESFTVVDYVIiIj 
ENCPNKGD YWAPQ FMTDNYVRVTQLNW 
DGVGTQ* KDYI /SSERNL 


9611 


23512 


A 


9682 


193 


3 


I FGERR* I LKI YKLTGHVGLRL* SQILR 
RI / RLNLGGGGCSEPRSCHCTPVWATER 
DSISEKKEK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

aknino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Argimne, S=Senne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop cod on, /^possible 
nucleotide deletion, Y=possible 
nucleotide insertion 


9612 


23513 


A 


9683 


1900 

• 


1988 


NLVHFEEHFPSLFTPSPSLCHIF/CSRD 
RVSPY*PGWSRSPDL/RYLP/CLDLPKC % 
WDYRRESPCPASSRFLMNFLYF*NTTLW 
Q*KTIPLLF*SGDC/ S*FF*YNFLAHTS 
STMVNSGISRNSRPVFKLADKAPSHSPL 
S ILAFGVE FF/NLGSLQPQPLKLK* ASH 
LNCLSSWDNRHAPPCPDNFFFNIL*RWG 
FPILSRMVLNSW/ PLCSLSTS ASQSSG / 
ITGVSHCAGPILVF 


9613 


23514 


A 


9684 


69 


358 


ILIVKVFFPHTQKPNFVLRLSLQS IQQM 
LLPERKWTHAQKTCVSSENTERCHEKQV 
I TSHWLGMVAHS CNPSTLGGGGGWIT * G 
QVFET/SLANTVKP 


9614 


23515 


A 


9685 


66 


344 


IGEKLLLTDKHRNWVIiETE *APGKDSVN 
I DEMT /TR / DLEYY IN \TDKAVTG FERT 
DFNFKS S IRGQSL* YS Il/CAKGSQQKQ 
QISLVPYFKKLPQ 


9615 


23516 


A 


9686 


269 


1 


PKIQNGHCKTLPKNPSQSDTKTFPLFTS 
GSSPQPHKLSFCFSAGPTLPSKTQLKIH 
L\WQGTVPHAYNPSTLGSRGGR±T*GQE 
FKTIQ 


9616 


23517 


A 


9687 


229 


361 


DRYYYKSHFKPGAEAHAC* LVIiASFFMC 
YRAIMINK* HL YHRKL/ VYQVI RH* KYR 
D/ 1 * THVTLTTTL * DRYYYKS HFKPGAE 
AHACNPSNLGG*GG* II*GQEFKTSLAN 
MA 


9617 


23518 


A 


9688 


318 


46 


KLYALNDMASNTCKEYDNLIQYYRYKIY 
KRLIQHD*VEFIPEME/NWFNI*/RKSI 
NVIHYIHSLWEEKNYMIISLDVADVSDK 
IKLTFLIF 


yolo 


23519 


A 


A/* OA 

9689 


168 


3 


GNSNTMI FFTILYLFFLYFF/liFFIFDM 
KSHSVARLECSGVILAHCNV*LPGSSNS 
P 


9619 


23520 


A 


9690 


192 


3 


WHKASLSNPQPAGCMQPRMAIjHEAQHTF 
VNFLKTLWAG\ PVAHA.YNPNTLRGRGGR 
IT*GQEFK 


9620 


23521 


A 


9691 


388 


1 


CRSAGVCWRSTPDPVCLGITSSGCRTAE 
IVACS FLWKLHSRGAPARCQPELSCMRC 
\R* TIiIi*GVSQSGGTGVRDPLKEAVCPL 
AELENGARRSTALFRASRQDSLSLLKLR 
PQLPLLPGALSQIEGI 




loo 11 


TV 

n 


yoyZ 


lib 


i 


Wr LjLiFsJV JU-iAJ-MI FKJjlr PNJjPr rc cc * 

DRV/CTVAQAGVQWRDIiSSLLIiSLLGSS 
SSPTSASRLTGTIILQHHAE 


9622 


23523 


A 


9693 


305 


1 


NHAITTVN* FGLIRHLVTKAAFNSGEVD 

TT7 / O TlTT^'nT T AT XTVTnn/JUt'CtllTrXO'TTJ/""* vo 

IV/ o liUJPij J.AJ_iN I IV xMrHHDSTHGKr 
RGTVK 


9623 


23524 


A 


9694 


320 


I 


PRDPPASASQSAGITGISHQAWPDMILY 
IENPKDSSKNPLGLINKYSKVAGYKXOT 
QKSAAFJj * TNNYLKN * P/ MRTI PFT I AT 
KKKYLETYLTMEVKDLYTENYKM 


9624 


23525 


A 


9695 


263 


1 


VRILLFSESLALSPRLECSGVISAHCTL 

V^lJtr^O*\VXrUJOJLjJLJOl\. V JJ I / I*Vf LlVAfluc 

RHV/GQAGLEFLSSGDPTVPSLPDFWDY 
RCKPR 


9625 


23526 


A 


9696 


262 


359 


IRKIHKIWSGVVAHTCNPSTIiGGR*TKA 
HEGSLYCLRVQKI VSNKVGTRS * FFRTQ 
K*V/HLFFS*VYHINGSH*SRTSL*IRK 
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SEQID 
NO: of 
nucleotide 
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SEQID 
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M 
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od 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

pepuue 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

/liner t"f% 

last amino 

dLlU 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
rv=r21iitnmine_ R=Ar**inine- S=Serine. 

T=Threonine, V=Valine, 
W= Trvntnnhan Y=Tvrosine* 
X=OJn known, *=Stop codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














IHKIWSGWAHTCNPSTLGGRVGWIT*G 

Q 


9626 


23527 


A 


9697 


166 


473 


tqdtsqpwkniwisfgslkvfenvdihk 
lyle ldnrhrs wmtd i llffyf * dgs ft 
vtVrlecngailahcnlhlpgssnspas 

AS * VAE VGGTLE PRRLRLQ 


9627 


23528 


A 


9698 


456 


59 


LFLWKIiHIiRGAPGYVRCRLA\LLGGASQ 
LGYSGVRDPLEEAVCPFSDLKLHAGRTT 
TLFKAVRYGHLCLQKFLLPFVRLCPAPT 
GGVFTVRQASMSCDGLLI V* \VSERH* h 
PITPOOPWEIjVPKFVRPGLKSL 


9628 


23529 


A 


9699 


4 


375 


VFCSSVTIVSEIAFLMWLSAQLIiljVYRN 
A c! nPCMS STiYPRTSLiTKVTOGOALi /H*N 
G*LT* *EQYWVKTDQKAVI/ CQQSTGSS 
GQS S Q ARE I KG PGAVAHT CNP S TIiGGRG 
RWIT*GQEFETSLA 


9629 


23530 


A 


9700 


239 


361 


RNDTFFFFETESNSVA\RLQCTGVTLAH 
*NLCLPGSSNSPA 


9630 


23531 


A 


9701 


169 


1 


t fywkat ffeteshsvayagvo* snlgs 
IjQLPSS\GSSDSPISASQEAGTTDAHH 


9631 


23532 


A 


9702 


37 


348 


HAKNFDVSLIjPYTKTHEHFVKEDIWXAN 

khvkrcsatlvtkeippqs/tlkwlkh* 

PPC*OGNGRTGSP IP /C /WWECKI VOLW 
KIV*PFPKKLSIYL/PY/DPPILLLGI 


9632 


23533 


A 


9703 


16 


339 


arlntfamaakc*serks/rn/sltlnh 
klqm i kl i eegmlkavngqklrll yq /t 

^nPANAKDEIiIiMDI IfiATPATV*MIG*0 
NSLIADLEKVRW*LDDQ/TSHNIHLSH 


9633 


23534 


A 


9704 


276 


2 


GISRGGLSKPPFFP*DFPLDYLGQSPFC 
KVKRNS /RFRTTKSPLDFPIPPLFFFFF 
BKVSQAI SAHCKLHLPG* CHSPASASRV 

nui JLVJlIUl 


9634 


23535 


A 


9705 


352 


2 


CNSRSDFFQLMMNCFLRWAVSPRVECS/ 
GVITAHC3JLNIiPDSVDPPTSTS * VAGTT 
GHTP\HTWLIII I I IVAGIFLFLFFLFS 
FFGRDGVSPCCPSWSQTSEFRQSACLSL 
PKCWDY 


9635 


23536 


A 


9706 


6 


351 


IWNSRPRRPLGGQGGRIQDWPKLHGKTR 
SL/LKKLQKLARRGRASLWSQVLKRLRL 
TWVDCLSIiGG*GCSQP*SH/IHCSPAW 


9636 


23537 


A 


9707 


76 


359 


CFLFVCLDFVIFIAYFOT*YLFPFLISH 
TFFFFLKKGLIFAPGVNLRDQNKFYFTL 
FDVFFNIYYC*IFFIF/CLVLLFFIFFF 
FFYCFVMRFIVL 


9637 


23538 


A 


9708 


342 


3 


GEELLLMDEQSKWFLE \TPGBNATNI VD 
LTTKDLEYS ISWDKAAAGFERIGSN*N 
STLELKTVN/SVPHAR* IFCKRLQSVWQ 
TSLLPILRN\LP*SPHPLVSAMLIIQQS 
STLK 


9638 


23539 




9/uy 


21 1 




RODIALLPRLE* SGTITTH/CDPOLSGS 
SEPPTSASLVAGTTGACHYV 


9639 


23540 


A 


9710 


253 


3 


VKCFS REHMLI RRS AIGGKKH I E KGTL I 
MSFWWYVSQSEH/NLCNHYAY\SHIKK 
VWLGTVADVYNPSTLGGQGGRIT*GQEF 
E 


9640 


23541 


A 


9711 


80 


324 


LITLTOASMCSNERKSHTSLTSNQKLEM 
SKPTEEGMLKAKTS * KLGLFHQT / SQLV 
KTKKKFLEEIKSTTPLNAPLMRK*NSL 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

n m Inn o^iH 

residue of 
peptide 

OCIjUCIIUt; 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
uing to 
last amino 
acid 

i-oqIHiia rtf 
JTCslUUC Ul 

peptide 
sequence 


Amino acid sequence (A*=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine 1 P=Proline, 
\£ wiuioiiiiiic, i\r B A.rgiiiinc, oennc, 
T=Threonine, V-Valine, 
W==Tryptophan, Y=Tyrosine, ! 

YsTTnknnwn *=QtAn ^rtrinn / — nnpc!Hla 

j%— 'Uiiiviiuwii, — oiop couuiij /— possiDie 
nucleotide deletion, \=possible 
nucleotide insertion 


9641 


23542 


A 


9712 


246 


3 


QLAKHGI*LFPI#FFEMGSH*LRIjBCSL\T 
ALLS KISAHCNIiNL PGP \SDPPTSASQV 
± lurtLuurtyuir isixxr v VDi luonl vii 


9642 


23543 


A 


9713 


2 


359 


AAASTTIKEQDMLKFYFLFYFFETESHP 
VTQVGVQWCNLCSLQPPPP\GPSDSPAW 

UTRHriQVQ VT? * trp tfKT.T VTMWT r\\T T V r VU 
v layuavoAa " t\»rcru\jj ±j\x w v z jjy v x let 

P*HSPQHIVRVLSVLN*LNEKINRRVTK 

ooo JCV-Lj r\xJt\. 


9643 


23544 


A 


9714 


278 


53 


KLFRGLNNHRKFQTRGGGLFFFLFIYF* 
lV?a /YATiQTiTiRrQfiTTMAHTTqT.DTT/^qQ 

HPPISASLAA\GWRTCSPSC 


9644 


23545 


A 


9715 


166 


402 


TLSLKKNENISRAWWHVPWSGVWKAEA 
R*SLVPRSLRL\HCTPAWA 


9645 


23546 


A 


9716 


168 


1 


VDCIIMRRSIAKSPRRKCRGTISARCKL 
RLSG\SGHSPASAS * EAGVTGVGHDPR 


9646 


23547 


A 


9717 


1 


553 


TSIRLFFLLS IFFFSRARNRILFKIYCN 
r>\TD 7\ irr*p cine wrrivp fTJD kth^mh tj iv^ t>/-vt 

LLSWGRGMKRQDQGWSWGGRGEGDTGCI 
PPPPGSTSRPWTPRSPWPLRLHLFSASG 
PPGSSSCSP*LFPLTDSQVGRCSGPDPN 
PPYHLMKVQPLPALLLPLLKPHRFPSLG 


9647 


23548 


A 


9718 


13 


364 


PPNR*RNPLIBLTNHSLIYLPTPS\GIS 
A* *NFGSLLGACLILQITTGLFLAMHYS 
PXRLQLAFSS IAHITRDVNYG* IIRYLH 
nTiVjAo j.r r J.v.ljr xxriXuxtVjxjX luoriiiQ^ 
T*NIGI I LtiLAT 


9648 


23549 


A 


9719 


82 


351 


GEALEPELPMPALSH/HAATLLAPLPTT 
tfE VVj 1 IV V 1 X VVjUAU" / rlUALt ^£>U*lr £&RP 
RRVDYLHLFSRNGQNPISTENTRISWAW 
WRVPVIPA 


9649 


23550 


A 


9720 


179 


365 


FSLFLASLKAQIIDSKPFFSNTLENI * K 
XCV XJsUr uX J. \xi\Xjjlri^Jjxoljv-ir xvJxio 
VCTILIi j 


9650 


23551 


A 


9721 


2 


320 


PIPPGNESSPPTAQEDMQAANKH/ IRRY 
STSIAIRET/QL/KTTTREPYAPITMAT 
TETSANTTCWREYGESGSYCWVGCKTLR 


9651 


23552 


A 


9722 


1 


344 


PLPTQNWPQGQS FLHYPVLLPAV3 VLWS 

QPRPFPW'K'OTT.T.TiPOPT.Rr: /NfmitfQn* U 
orrrrr wx\\/xjjxxJUlrlrrvL»JoVJ/ noyiiau w 

KFPLPGPSLFSCPSFQTSGNYGPPCX3AR 
AIFWKFKIKTGFHGVTREGLNFLTSGSA 
PLCS 


9652 


23553 . 


A 


9723 


156 


3 


VTEAYFILFYFETESHSV* WHDLGSLKS 
PPPG\SSNSPASASQV7VSTTGALH 


9653 


23554 


A 


9724 


76 


337 


YLSIYLSIYLAIYVSIIYLPIYHPSISI 

V\f<I<lTV<lTMHT.DT.WI7QTHT.<IQTPCQTV 
X voo J. X oXiNnXilrlj I if olflujoinooj. X 

LLSISILSINYLLLSISLSIYLSI*LSI 

YDSS IHLS IHLSHLYIYLI * YLS I YLS I 

SIIYLCIIFLPIY/LSTOSSIYLLSISI 


9654 


23555 


A 


9725 


249 


436 


ELAG * I QDHP / GQHHKTLSLQQ I KNLAR 
RGGRHL* SELFRLRQEDHLTPGVRGCSB 
L*SHHSTP 


9655 


23556 


A 


9726 


177 


3 


HLN**FSNLIFFETESPSVT\RGIISAH 
RNPRLPGSSDSPTSASRVAGTTDT 


9656 


23557 


A 


9727 


272 


233 


RKNQRIYQ I ARKRLNEMAR I S PLRSMI I 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A= Ala nine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Acid, ^Phenylalanine, 
G=Glycine, HNHistidine, I=Isoleucine, 
K=Lysine, D=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S^erine, 
T=Threonine, V=Valine, 
W=^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon ,/=possible 
nucleotide deletion, V=po$sibIe 
nucleotide insertion 














LNVSGLNFPLKRCRIAEWTEKKK\DPII 
CCL/ QKxHFTG/ KDIYRLKX KGWKKJ. t H 
ANGSQ*Q 


9657 


23558 


A 


9728 


368 


380 


RKGQRISEIAIKRLNKMARISPLISI II 
LNVSGLNFPLKRYRRAEWTEKKK\DPI I 

r*rrr I mmrtronv /imTVDT trTvr t r»7WTT?lJ 
CCli/ QKTHFIvj/ KJJX I KJjKI KLtWKKJL c n 

TNGSQ*RR 


9658 


23559 


A 


9729 


294 


2 


KGNLS PKKKKNLKNFKVTPQNFGLKDQK 
TQGKPTLiKVS QNQKKPKF KT NJjKG KKF Jj 

klfpkkkklnpsrswfybkkkk\inkld 
gtlvrqinkkrknq 


9659 


23560 


A 


9730 


7 


284 


SQGLGRIiRRENYL* FETE S HS VARTGVQ 
WRNLSSIjQSPPPG \ SSNS PASASQVPVI 
T 


9660 


23561 


A 


9731 


1 


322 


RS IjfUUU&RKrS KJtti JUMvj fUy ttfi JVjVJj If Jj V IS. 
TPKI * KIKKI YPG/RGGTHLNSQLLRNIi 
RKENHLNSGGKGCMDPKLGHC I P AQMTK 
TPFKKKGGAKQNGKKLIiQLIYGIK 


9661 


23562 


A 


9732 


168 


2 


GF YRKSIiSCIiiJ V W 1 UFKDF FFFFxSM V 5K 
SVA*AGVQWHDLGSMRPPPPG\SGDCPA 


9662 


23563 


A 


9733 


43 


317 


LKIjQPRDTHCIWLSTTSPFIYFFFETES 
CSVAQA*MQWHDLGSLQPPSPG\SGDSP 
AoAb v- IXzMrixul 1 W LirLK-L. JUN o F JJ v lr JJMJrl-i 

GLGLNSHI 


9663 


23564 


A 


9734 


8 


296 


NWSACSLNGLVKGRLS CLTLNQIL* MI 
TLTTERFLKAERGQKLGLLHQTA/QVVN 
/AKEIFjjNEVSSATPVYEGI IKiSiiNbJjJ. 
ATMEKACWWIEDQT 


9664 


23565 


A 


9735 


3 


300 


TRPSNS PAALS KCS SRRKT/ R/ TCLTLN 
QKLEMIKLSKEGILKAERG^m5LLHQT 
A/QWN/AKEKFLKEVRSATPVYEGI IR 
KQNSLIANMEKA* VVW1EDQ 


9665 


23566 


A 


9736 


352 


1 


PFFYLIiNFPTFFKGPPLIHFFLESFLEN 

IViTI/VTlWin' /MtfT?l?7\I7T DOT T Vfr»OI7T 'E r KT 

FGFQGAF FKLrQNFF Ar 1»F S JjL»NGFr iiSN 
PFYFKFPINC\KFLGQNTFYCPYIX3FF* 

AHCKLG 


9666 


23567 


A 


9737 


130 


3 


KIFFFFFFFFLRQSLSVAQAGVQWC/N* 
IiGSLQPPPAGLKRFLV 


9667 


23568 


A 


9738 


167 


3 


loKovrLr rJ?is inoLo v iu» CiUniuNxu 
SRRPQPP\GSSNYCASACRVARFTGLV 


9668 


23569 


A 


9739 


140 


419 


INVKYLSRHFFKGN I QS PNKHRKAIRE I 
QIS\*TDNNKCW*GCGEVQYFIHCCMCE 

M\7lTQT.UTTT\7M*T.T.T , /3T. /TITUT.QTYTYD21TT. 
W VJtio JJ Yt ivJ. V W w JjJj 1 VxD/ XI J. nLiaUUtrJt\±XJ 

LGGIPPEEMKT 


9669 


23570 


A 


9740 


93 


371 


CISKITKEIjQLNNKNS /NPTQKWAKDLD 
ISPQTl^QlAKNQVKRCSTSLVIREiMQI 
YTKSIPLHIH 


9670 


23571 


A 


9741 


1 


288 


CiXKJJJtiiaKSliVKJjy \ FUy ti{3 1STKS lobJKJLy 

N*GCX3HGGSCXSSQ\LTWRLRQVNCLS 


9671 


23572 


A 


9742 


296 


563 


FIKIol^SimX^NI^TKI^EICKLPFYI 

LGGGGCSEPKI \HCCTPAWATDGGSVSK 
KKKSRR 


9672 


23573 


A 


9743 


32 


400 


DALWHWSSYSPICLDL1WLGIYLFSDR 
VSLLVPRLECSDAIMAQHSLDLPRLR*S 
SHLSLPRNWluMFVI LVETGFCHVA* DGR 
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SEQD) 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
h nil no acta 
residue of 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C^Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H^Histidine, I=Isoleucine, 
K=Lysine, ^Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S-Serine, 
T=Threonine, V=Valine, 
W-Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, Impossible * 
nucleotide insertion 














KLLGSSNPPALA/ FLKC/WRNMRKSQL+ 
PLLLGLLSADHSD 


"9673 


23574 


A 


9744 


60 


356 


HSEVDS I I I PILQGRKQISEVRPQRMCE 
REQGLFISETLLPLCPAPSCS /HFLTPA 
Q*SYF * KWVS WLGAVGDTCNPSTLG6RG 
GWI \ RGQE IKTILANMV 


"9674 


23575 


A 


9745 


2 


446 


PMVNKHVKRCSASLVIRAEQIKTIMRSH 
CTPNS * M * KTDNTKS W/ *GCGATGTP IH 
C/WMKYKMVQPLWKKAQQFL\KHIHLPY 
DPINLLLGIS*EK*NHLC/YKKTYVRIF 
IAALF 


"9675 


23576 






OCT 

257 


1 


VWLRLSFPSFLLPILSPSSQGDLVCCFL 
MVF*RDGVSLPIPAGVQWLDRSSL*P /P 
S PGS SS PPASAS * VAGTTGMHLHAEPYA 
SC 


9676 


23577 


A 


9747 


148 


3 


I KLCEQ FHK* T I HVLNHK / L W PGMVAH 
AYNPSTIiGGRGEWIT*GQEFDSC 


9677 


23578 


A 


9748 


769 


1 


YPQLFKIAKKKKKSNVPIKSIMVSQYGH 
* ENKKWVLNHKGATLQ/EGKGNFGDTFK 
VTLKDKIAVAVKTRQERLPQELKLKFLL 
EAKILKQYNHPSNVKSIGVHTQRRPIYV 
IKELVPGGDFLSFQRKKNELKb\VKFSIi 
DADSGWCI SSKKC IHRDLAVRNCL /VGE 
NNVLKIS0FGMSRQEDGGVYSSSDLKQI 
P IKWTAPEAIiNYGRYHSESDARS FGILL 
WETFGLG\ VCP * PGMTNQQPQEQVERRY 
WMSVSWQC 


9678 


23579 


A 


9749 


607 


986 


SFSSSSSPEFISYGCDLLPMRDPHVFIT 
HPAFNIiPLFLALGMFFSLFSPS PLFIYF 
LPPPPRPPC*HSPP\PPPPLTOSLFVFL 
PPPPGVPSSPPNTSPPHLPVPRPPPSPP 
TSTHLSSPPPPSPTH 


"9679 


15580 


A 


9750 


313 


339 


AR*GLSCSG*S*TPGFMQSSCLNLPKCW 
DYRC/RATVPSLCFVF 


"9680 


23581 


A 


9751 


213 


453 


NLAVNVDE INLFFFPLFLIVTLASKFCS 
/HLLVGR*FIQSCHYFr J FFIFDTESRSV 
T\RAEMS ATVLAHYNLCLPGS PDSSA 


~968l 


23582 


A 


9752 


368 


3 


PLSSPRLPFPLFDPSYIFLISISSIRFL 
SPPCYFLCVFLSISWGFSLVSQDGSDLL 
SS*SAFLGLPKCWDYRI I *PPHPAWRELS 
FFFS/SFFYLR*GSLSPSLBCSGMIIGH 
CStiDII,VSSARA 


"9682 


23583 


A 


9753 


2 


335 


ARAGFSGGDWNVGMQSTASGLSLFHCFI " 
S *GELFI ILYILLKCYITL* IN* LMFGS 
F YFRDLQHFRLP/TS I VGKKRPGTVAHA 
YNPNTLRGRGGW I TQGQEFETTLANMVK 


9683 


23584 


A 


9754 


467 


120 


SQLFGRPRQTDHLKSGV*DQPG \QHCET 
PSPLKIQKLAGRGWV/ CAPIGPRLLGRP 
R* ENGLNSGGEVCSEPRSRHCTPAWVRE 

RDCLKNQNKTKSGVLFHSPHQCFLISNT 
SRNLL 


"9684 

"9685 ~" 


23585 


A 


9755 


137 


368 


DGVYLWTHRPYCGLGSUTFGSVI I VLP * " ' 

VKAYGWMVLTSLGDAIQPLADPECSF\G 
QIiRELRCDMLHVDPEDFRLLGK 




23586 


A 


9756 


172 


267 


S KSGNKPNDYHLMNGS VRWCIRTV* QLL ' 
KRLTTELPYGPAVAFLGVYAREMKAYVH 
MKTYSLQKMCT* IFIAALFGI VKKWK*A 
K*LSPDEWISK/M/WCIRTVEYYLTIRE 



1132 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 
ne n tide 

K/V }/ VI VI V> 

sequence 


Amino acid sequence (A= Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGIycine, HNHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

O^Crlutamine- H=Araininp 5v=&*rin<» 

T=Threonine, V=Vaiine, 

W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stop codon,/=possible 

niirfftrtflff P. rlplpHnn \=nn cciKlo 

UUVIvU UU6 UCICUUHj \ — UUSolUlC 

nucleotide insertion 


96*86 


23587 


A 


9757 


56 


318 


QFWGFSFFFFFFCFIDTETCS VSQAGVQ 
WHNOS WLTA\ P * SDPP A <5\ A <5 * VATZTTrt 
MCHHDQLI FFFFKQALCPRGWSAMAQSQ 
LTSSS 


9687 


23588 


A 


9758 


179 


i 
i 


SAS+VARIIGVHHNTWLIFCILVEMRFH 
PRA ! 


9688 


23589 


A 


9759 


219 


xuo 


NFCNFOMPLYOKTIiFWVRV^T.T^PPT 
SGTI ITYCSL/RTPKLK* SS * LSLGVQG 
CSKL 


9689 


23590 


A 


9760 


133 


371 


AMAYQLYRNTTLGNSLQESLDEI,IQ\LR 
NRVNFRGSLNTYRFCDNVRTFVIiNDVEF 
REVTELI * VDKVKI VSIi\DGKQTGF 


"9690 


23591 


A 


9761 


813 


3 


CGEGWAAGQDQGSRMGSCLPVPRAGRQL 
GSLFPAFQVYCSS CGTQRSSQSQWHKTE 
RSLPGNNPAPQDS / ASAPKYQTPGPVLG 
VQS PLNQQSCHTALSPRAS SS AFFHTTP 

A*RPRESRH\GR* EWTKRSQARSQEAIN 
ARPRKALGLSDPAPSLHPGSIiEDSPKFS 
LGAPKAPPLPASSPTAQETQQSHISQQP 
TATGFIjGARIjPP ^ port cc\B cznw i orz\r r rv> 

LLLLGE WGVGEHS FPDVLV 


"9691 


23592 


A 


9762 




377 


RYHTNMAAQI PESDQI KQTGFHHVGQAG 

LiELLTS \ PKBPI/5TYNKT.T13T * PT.TVa V 
DFTTRERK£EETTLSEHCLPKYSTMT\H 
RISLT\FTDNHI 


"9692 


23593 


A 


9763 




jyy 


TDKRTiTjTiPD'Rn /P1fWTi , T.T(TTv l ^Tnf2an»tr 

Ai/ri r> i ii II ii\XJa\j/ fvJXVVl? JjXVXuO A/vj/al/r\.V 

NITVEMTTGDLE * C INLVDKVAARLERTE 
TNFE/RCSTTOQ/Ml^SKSIACCTE 


9693 


23594 


A 


9764 


225 


25 


KCKI TMNS T * I E/ TVFKD YYEHL YVCJCL 
AKMYKIYNDIE 


"9694 


23595 


A 


9765 


128 


218 


FFKDLPDCFPKWSHCFTLLLAVYEGFTF 
ST*LVTRl^IKTP\l*YHFSP1SrRMVIM 
BCKSNINKCWQGPjGESKALIHC* *ecktv 
♦plwktvwqilkkliig 


9695 


23596 


A 


9766 


194 


551 


KNFFFloEMEF/SVLIjPRLECNGVISAHR 
RLRLPLS SYS PASS SQVAGDYRACTTTA 
G*ILYF* *ROGFHHVGGAGLET.PT<V;nP 
PAS ASQS AGI TGV SHCPQLKKS ILHETP 
KGLTGVTS 


"96% 


23597 


A 


9767 


238 


3 


SFLWKLCPRGAPACLRCLSA\LLGGVS * 
SGYMGVRDPLEEAVCPFSELEHHAERTT 
AiFRAVPQGCLSLQKIiSAAFCSC 


"9697 


23598 


A 


9768 


343 


1 


RGAliHSRGIXrVRFTGFCAPIPVIjNHPKP 
LFPFFV*KSPIERSSLLGPPLKRSKNRF 
PFLR/VPRFFKNHPEFFFKKIFLPPPPP 
FTFEMESHSI^\RLESHGAILAHCNLSL 
PGSC 


"9698 


23599 


A 


9769 


191 


2 


FRim*TSH/LRGQKKKKKTNSKAlGRK 
QITKIRA*LNBTQPPESIQRINET*S*F 
SGKIKLI 


9699 


23600 


A 


9770 


165 


2 


GKP PNCFYILNFKKKKGQD VFNRT/ WLG 
VVAHACNASTLGSQGGWIT * AQECKTS 


9700 


23601 


A 


9771 


97 


912 


VILSTGCSSGPLPGWQSVLHHSLTRCPF 
SFFLSSPTPEME/FSLLLTRLECNGEIS 
AHCNXjLLIjGSSNSPASASGVAGNTGI /c 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
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Amino acid sequence (A=Alanine 
OCysteine, D-Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KNLysine, Lr=Leucine, M=Methionine, 
N=Asparagine, P=Proiine, 
0=Glutamine- R=Arainin^ %=<!orin» 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GLHAC * AS * FL YFHS VETGFHHVGSGLG 
LELLTSDD\ RP A <? TV ^nnan t tysitppo \r* 
PLNS FSRHGLS CFFKETRVLS CCPS WT * 
TPGLK*SSCLNS*E/C*GYRHTPAHPAQ 
DMVFIFITY/CPFFVFPLL/VYIPRFIiF 
LRWS FTLI VQAGVQWRNLGSLQPPPG\H 
KOSETPSOI * INKY*LLN*LTTRPF5T1C 


9701 


23602 


A 


9772 


3 


363 


HELRSSANEHVFASFIGPTILGLPGVLL 
IILYPPLLIRTSIYLISN\RI,\VTTQQ* 
LIKLTSKLMITIHNSIGRS*SLI1jGSLI 

AIPL*AGAV 


9702 


23603 - 


A 


9773 


38 


385 


YLIILDS PEKGL I CGCLCNCMHS SLLDH ™ 
LNLT*SS*MCLYANIHIYVAISCIYFI+ " 
F*SI/YIFII*YFHYSIAYVYIY/IYIY 
MYTHAHIHMCI 


9703 


23604 


A 


9774 






DYPASASQVAGTIGVHHHTRPTFALFLV 
ETGFHHVGQAGIiELPTSGDPPPL\ASQS 
AGITGVSHCTWPHLSTTGKILSSGTICP 
GIWGQRWWAFLIL\SLPWL*LWHPCLSS 


9704 


23605 


A 


9775 


376 


1 


KYFELMYIPVIAICLFNSBVFLNK*QFF 
NLTSSSGIQNLNRIjI ISNEVES 1 1 KSLP 
TVKNTiG PDC5TA ATF \ TYTfPfVT .TT. T T .VTT.v 
QKHBEAR I LSNLTSETS ITLILRKQQPK 
KVPNEHTVKILV 


9705 


23606 


A 


9776 


192 


2 


NVYQPPP VFLYPPGVAGGV* PF * S PPVF 

RVK /TjGF + f3^T ■RPPPPPTTUTTT QD CJT7TD T 

ECSGVILA 


9706 


23607 


A 


9777 


77 


350 


WLRXFGSRNSGKLTLYLNVYYLQTSFFF " 
FFFKIGV* LS PTIiEPR f?P T WVN * MT /^W P 

G*RDFPTLTS*VL\GPRVILEFGFLEKT 
GFSHVPQL 


9707 * 


23608 


A 


9778 


133 


325 


PRWVRFP YliGDCTPRVSF*tIjGIiOfl\A/P <? 
S SP FFVC FET * S CS VAQAGVQ / WWCDLG 
SLQPLPLG 


9708 


23609 


A 


9779 


145 


362 


RLAFGSCSTCFPLT*TVLY YL I * FNI F F" " 

VFEMESLSVSVPXECSGAJSAHCNLrTP 

AWTTERVDAAANSRSMN 


9709 


23610 


A 


9780 


260 


1 


QKQNQEKKRKKI FRNI KHQRD YNEQ* YG 
/NKLDNPEDMNKF* ETRYT* /PNLHQEE 
IGMx^PVTNKVL+LVIKIPLTKKSPEP 
DGFV 


9710 


23611 


A 


9781 


230 


1 


SSDSKTGS S WLVACRFFFFLETGSGS V 
TQAEVQWHDHSSLQPPPPOPPK* /PC3TT 
GA*STIAS *NAEITGVSHHAR 


9711 


23612 


A 


9782 


20 


322 


SQHFGNPKGNPKI FRNLE INF FRLRNAD 
HMI ISTDAQKVFDKI QHLFI I * TS * QT * 
TERIFLNLI KAS * KKPTANI ILNG*RLN 
I\FPKTGETKHIC 


9712 


23613 


A 


9783 


158 


379 


LVKIICNSFQYISFFFFFFLETKFCFFP ' 
QVEVQGGNFG*LNLPLPGLNHFS\ASAS 
REPEIPGPPHHPG* ILVF 


9713 


23614 


A 


9785 


54 


312 


KVDKSTKIGRNQSKMAKNSKNQNASSPP 
KDHNSSPARKQNWTENELDELTEVGFRR 
WVITNSS KLKEHV* PN/GKETKNLEKRL 
DQ 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amiao acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, ^Methionine, 
N=Asparagine, P=Proline, - 
V=Oiutaniine, K— Arginine, ^—oenne, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *^Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9714 


23615 


A 


9786 


13 


613 


PGAGWARQHHGV*DQPGQHG\KRPSLLK 
IEKLAGHGGRCLLSPAIGGGLKQGESLE 
TCGRDCSEPRLCHC\MPAWGNRNKTLSQ 
/QNKTR 


9715 


23616 


A 


9787 


794 


164 


FSFLFCFFFLRWSLTLSPSLECSCVTSP 
HCNLQLPGLSHS PATVS * VAGTI GTCHY 
TKLS * FF\FDDSL* APFLGWDTETYYP 
DPPSSKGLIiPHSGQHGQQHP PAISRAWL 
SC*VLPHLRS/STSLGQP/RIW*LSKVE 
V*RGFTFS PTQNTPTNNNCS IAPGRVD* 
SUjID* LQWCDHSSIiQPQTPVLK\HPPT 
tiASQSAGITGXSHHIRPD 


9716 


23617 


A 


9788 


256 


380 


YIFRQAKTQGLPPCALFREAVGENTP*V 
FAC/PKNVLFYVHP*LFG* I * IYQIiKIT 
F*NYEGMVT/FVFRFLPETRSHLSPRLE 
CSGTI TTHCSLELVGS TOSS ISAS* VDG 
NTGVRHNASC 


9717 


23618 


A 


9789 


22 


226 


TKINSKGNKELNVRAKTIKLLEENIDRN 
LCDLGLGNCFLD*YCIL*TKINSKCaiKE 
LNVRAKTIKLLBENIDRNLCDLGLGNCF 
LDMTPNAQTTKGR \ X DKXiDF I KI * NFWIi 
DT 


9718 


23619 


A 


9790 


288 


2 


QVWQE*AETESAIHCWWERKAIQLL*KM 
RWQr JjKM IN V \KJuF ilaaAJL FiiViaXWbKJS 
LER*ST*NIMFTAPLFVTGK/T*NLFKW 
PSPYEQITKMHSC 


9719 


23620 


A 


9791 


142 


361 


PPGEGKRVPFTTE PS PRI PS KAKS VSQG 
DTCTPMFTAALFT IARMWNRSKCPSFEE 
* IKKMRCI TT/ WBC I SA 


9720 


23621 


A 


9792 


305 


346 


TYEYTDYGGLIL* *LITFYDHAVLIIFL 
ICFLFL/YALFLTLTTELTKTTGGDAQE 
IETV*TIIiPGIILGLIALPSLRRLYITD 

ILNS 


9721 


23622 


A 


9793 


2 


317 


SRDRPRVRDR* LFSTNHKDIGTRYLLFF 
A* AGVLGTAIjSLL I RAEIjGQ PGNLLGND 

UT WlfT^TTA r 7\ T7\rT T tPDTT^T DTT T\ OCT? 

nlilVIV lALiAr VXXr rXJJljfc'XIjX \bar 
G\SVYVLIiILGVPCTAVFLHSS 


9722 


23623 


A 


9794 


384 


1182 


RIGKI KG/ LCFLFV* WFLKFFFKMEFLL 

VAGITGMC * F\VFL IEMRFFHVGQAGLK 
LLTLGD/PPRPPKVLGI 


9723 


23624 


A 


9795 


205 


1 


GLQI KCTMRYHFAPTKMAITYFFIiKKGK 
G\NNKCW*GS/GTEIGILIHCWWECGMV 


9724 


23625 


A 


9796 


307 


348 


QSARL* EAVCPFSDLQLRTGRTTALFKA 
VRQGHLSLQRIJjLS f\vclypaprggay 

TJPBna CT CrV2/^T U H D"\TDttCDT.T.r*T.t>Vr\7\W 
XoKUrlo jbOUOOJjxlir V KHoKlJjjlJjr' J\UAW 

AMVGAPTPASLPPCSS ISRCCASNQRDS 
VGYDPSSP 


9725 


23626 


A 


9797 


193 


381 


X XjLiX * XHX XiJbSM X o b ±> c r c nxGbHSo/v 
SRLECSGTASAHCSLDLPGSGGSPTSAF 


9726 


23627 


A 


9799 


153 


359 


FGTYMCFADSYQLSYLFWGTFSWIiEEE 
GV* WCICGSLQPQPPRVK* /FLPSLSLL 
SSWDYRCAPSC 


9727 


23628 


A 


9800 


2 


396 


ARAARAARELEELIKIFFFFFFFWGQIL 
ALMPKGGGQGGILTYPNPPLPG*NNFPG 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenytalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R^Arginine, S=Senne, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














LTPPRT6INGLGPPGRVNPGIFKKKGGF 
PPGARGV*NPGPRGTPPPGFPKGR\DKA 
PHPGPL*TFEKVPRPPNFPQ 


9728 


23629 


A 


9801 


256 


347 


LYI LLLNSLK* LVEKLR*NWLASLI IR 
YKVTTKRF* * FSKPAQSEELQLLTSAPG 
LNF*RYI ISFFFWIQSFALVA* VGVQWC 
DQGS PQPLPPGFKR FS /CRS WDYRHEPP 
R 


9729 


23630 


A 


9802 


109 


401 


HLRRPCPTPPFALRTSGDQPLSPEARPD 
SGLLHSPLTS PRGLNGQCP PRGSTLNFN 
P*AS IPASPFPSGQPQASQEPPPVASEH 
PPDT*GDPAQPPP\RPQDFWGPATVPRS 
QAGQWPSPLPSDFSKGAQWPVPPTR/PP 
PSTSTQSRWT 


9730 


23631 


A 


9803 


371 


2 


PGFP LFS FPEGNGPS KRQTGD * IRCLF* 
DGKVWECSPPKKKKKTKKRKKAVTF/ CV 
PVQTKCI\A7EGGEETLVGDV*V*P*\GS 
FKHVVAMFPEK/DCLCTLYEASFKTKES 
RRVDGFVCVRVGT 


9731 


23632 


A 


9804 


188 


1 


FLFFFFSET*SQSVTHAGVQGLEQSIjPP 
G\SGNSRASASQVAGITSRRKHSWI I FV 
FFVETG 


9732 


23633 


A 


9805 


112 


383 


VFINIRVFRLFTFSFFFKRGFNLSPRVK 
OTG*TKGHCIPDLPGVR*SFPFSIiLTOW 
DYRCAPPRPNKFFF/ SFETGSHSVTQA 


9733 


23634 


A 


9806 


599 


237 


FRDRSLTLLPRLHCHCVITAHCSLKLIiG 
SSELPQASPLSSWDYRHSPPCLANFLFF 
VETRSHY/ASRNSLGSSNPT*A/ fpkcw 
DWQV*ATAPSLAYWFSEAEKLIIiRLIiAL 
KVSDENICTNF 


9734 


23635 


A 


9807 


168 


2 


PQQDVFFFFLETGSYSVTKAEVQWCDHG 
SVQL* PPG\STDPPTSASQEATGTHHRA 


9735 


23636 


A 


9808 


225 


3 


GPLGYI 1 PPYFPQPLISRG*NPSDFLKIS 
RVFYGT / HYKFEFLFLFFFETESRSVA\ 
RIlECSGVISAHCKLCLPGS 


9736 


23637 


A 


9809 


541 


3 


RQLTGINSGRRQFQPQMSNYVI P YMMDM 
I ILKTKKSNNKYWQGCEKTBLLIHCXWE 
CKMVQLVWKTV* QFLNRSD I E FPHDPG I 
PPL/GYKRKKMKTCLPKKLCVLVEP/RM 
FTAALFEVAK* *KQ/PQTPITR*INTMW 
RI YTME YYSAI PRKTPLIHP IA* LDTES 

r i *tekiqsqettyy 


9737 


23638 


A 


9810 


213 


1 


FLCVDFVPCNFAELKLALVGLFVCLLFG 
SVTQAGVQWCNL\GSL*PPPPGLSDLPT 
S AF * VAGTTGV SHH 


9738 


23639 


A 


9811 


54 


388 


PARP LPRQ / WDER PNQQPTKKKKKRGRN 
LPTKKKTQNYSHQNPVAQRNPGGiCTNQK 
KPKTNPKKKKGGAL*KKTQRGPKQTGGR 
K*KISPNKGGK*NPCGKLLEKNPFSGGE 
K 


9739 

Y 


23640 


A 


9812 


429 


24 


RSKGAEFPICPCRSSKFSVLQQRQVRIV 
FQHPELGESVAR/ CRYSQLLGRLRQ*SH 
LiNWGGRGS SEPKS CHCI PAWATRASS I F 
CNFQASSVEVRRSARKKLFSDILKRHNT 
INWRVSGLLLVDSYFGRIATPVRTQ 


9740 


23641 


A 


9813 


149 


350 


CHFPCFHTTL/ CE*EPGEDVTGPIGTKF 
FPAPGRTDGHTHTHTHTHTHTHTHSDPH 
HLCVWNTLIHFV 
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last amino 
acid 

residue of 
peptide 

dvvlUCUkv 


Amino acid sequence (A— Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

0=filntnininp ntrinin** Q^^f^rinf 

V£ MIUMIUIIIW) AX /TLI gJlllllVy >J AjGl 111 W, 

T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 

niidpnfirfo ind*f"tfftn 

IIUUCVUUC 1113d IIVSU 


9741 


23642 


A 


9814 


66 


401 


PENKKTVYRKPIATIIFNCERLNALPLK 
LETRP+YLLY+LFNIMLEVLASWRQEK 
EINLS IFT*DIVIK/MKIPKNLENKLLK 
IjLS PFS KITGYKI S IQKLTSS FLYYGFT 

v 


9742 


23643 


A 


9815 


82 


I 


APPPARFFFFFFF* EKFSS FPPGEVQRG 
VFCPCNT/CFP*G*KIFFL\KPIiKKVGF 
WPPPPGPKNFFYF * KKGGFPFWARGFFI 
PTPFVPPItHP FKKWGVRRG * APPPARFF 
FFFFFL*GRVSLCHPGWNA 


9743 


23644 


A 


9816 


256 


2 


QHFP* LPIPHI PIQT*QHFPQTRSPFQG 
PPLCQKPFSSPWGSK\A/FIliLSFFFFF* 
DRVSLCHPGWNAVM*SQLTAASNTW\VK 
*SSHIiSIVr»SS *DYR.CVIiS 

iJiJXl 1 11 f III MTItJ JUJ>I\V«*iJw 


9744 


23645 


A 


9817 


53 


411 


TIYCSNYVYNFVSQHFRSLYS * KNCGLC 
L»* FHLFVLQS *NFMQFHM* FCLKI * VDP 
IITFEV* FTYI * FF*TYSS VSFTDAYNF 
VKPHR / VYRGPGAVAH I CNPSTlJGGQGG 
WIT*GQEFK 






A 


70lo 




4.17 


DEHSE KODE PLORPRS KIiIjPFPGTP PSS 
lXLLNHIGQESGLTHVlUiASCIiRlSnRGFC 
FDMKERLCljLPMlaECRGMI SDHCSL l/R 
PG*DKPPASAYQVGETTGTYHRA 


9746 


23647 


A 


9819 


136 


2 


IYMGMQPGAVAHACNPSTLGGQGG/WWI 
T*GQEFKTRIiANMVKPC 


9747 


23648 


A 


9820 


257 


426 


ivpglflgtgscsfthagvq*s;ncssvq 

S*TPGP\ SDPPASASRVAGTTCAHHKAW 
L 


9748 


23649 


A 


9821 


151 


416 


LHKLCLMVMESGTAKTPSLSPI FNSVGL 
ILLI,T,EYSIDSLC»SVLCCYKEIPBAG* 
FIKKRGL/WLGVVAHACHLSTLGGRGGW 
ITRSGN 


Q*MQ 

yf*ty 


ZJOjU 


TV 


VoZZ 


50 




LDLLHHTISQVVNSKEKFLKEMKSATSV 
NTRMIRE*NSLIADMEKVLVV*DQTSHN 
IFLVQNI IKSKGIGLFKLMRAKRDEES / 
AEGKLDASKGRPVRLK/EKRSHLHNMKY 
QDEASVDIETTASYLKV 


9750 


23651 


A 


9823 


48 


406 


LIIWYTIiTTRWIGHSAVSYPIiLRPPYS 
IxRHNIEIRPINNPTMASKCSSERKSLTS 
LTFHLKIKMIKLSEEGVSKAKTG*KLGL 
L/R/QTVSQWNAMKKYLKEIKSATPVN 
T*MIRKQNNL 


9751 


23652 


A 


9824 


143 


444 


WNKKNNKNEIADMILWGCGDKTIFMRKL 
TRDETKYGI PQIiALRGHSHFVRDGVT \N 
LEX3QVDLS\G*WDGTIiHLWDLTSGSTTR 
RFVNK/TKDVLSADFS 


9752 


23653 


A 


9825 


297 


2 


HRSPTKETKIIAS IKKKTMITKDLNRQF 
SKQDKHLTSEYMKRCPTS /LREM* AKTI 
MRYHLTPIRRAPIKNKNK/NKSKCW*GC 
GETEILVHCW*ECKMVP 










1 IJ 




GGGTNGANPPPKI KGSKEKKNF* VFFPP 
FPFKNFFFP*NL*FLGGGWPHLSPPQK* 
V/CFPKIPKLVFFSPPLREKFFPPLPP* 
NLGPPGF/SFWGPPLFFFFFFFF*RDKF 
SFYHPGWKAVIAAHFSLELLGSSNPPTS 


9754 


23655 


A 


9827 


64 


622 


MFSFFLFDLIIiNHLFFCFV*MFSFFLFD 
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peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S^Sertne, 
T^Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LILNHLFFCFVIRQQFSFLFF/ ISFFFS 
FFHCNVNTPELCDFSVCIHSDNSFRFLH 
LF/NF*HMTLCISELR*TLKQSCYSYTK 
IC* * FYS VQLS FLT ILY* * CKS *QALSI 
ALQNE FLS V\Q YHKNMNY I TLT I EFSRN 
KFLSVTFLLFSIFSHNFGISASISIFIL 
HVYIKNNPIGFFK 


9755 


23656 


A 


9828 


298 


1 


MRNCLTDDERKWFPEMASTPGEDAVNTV 
EMTAKDLE/Y* * YINLVDKAASDFEMID 
SNFERCSTV/NKMLSNSTACYRENFHER 
* GQS VQQTFVRVGERV 


9756 


23657 


A 


9829 


361 


I 


FLTGTQWGK/DS PFNTWC* DH/ WNIHRN 
LDLYL IPYI KINLKQLTGPNLRAKTI KL 
PEQNIGENIiCDL*LSRERYSTKSITRIR 
KL/DTLGFIKIKNKCISKDTIRKR*ATD 
WEKI FANHVLKG 


9757 


23658 


A 


9830 


402 


2 


RFHHRFSFVLFGPFAKVAFPIGAKNFIS 
NHYWAGVFPFWEKKNKIDSFLPPFSKNP 
ILGELKA* F*NFLF*GFQIFF*SLFFCP 
FGGK* IPGVFSFFFFFLVFFE IG3HS VP 
\RLKCSGTITAHCSLDLIiGSSN 


9758 


23659 


A 


9831 


2749 


3215 


FCQ*IKMYVMCICIYNFRRVYLCIYTYL 
HTHTFTHTHKHTPTQI PEKDS /QCSLSD 
LKGHSL 


9759 


23660 


A 


9832 


3 


386 


KIJUiGQLNTSAMAAKC*SERKS/R/THL 
TJ^QKLEMIKLSEEGMUCAVIGQICLGIiL 
YQ/TSQWNAKEEI*LKDIKSATPVTA*M 
IGKQNSLIVDIiEKV*W*IEDQ/TSHNI 
SLSQSLIQN 


9760 


23661 


A 


9833 


1 


370 


RRCRWPPDPSRTVGRQIGKLVTHRPTVF 
QERGCPFPLTRQAGSHHGGGAFAQVISP 
TKS I S PCGRGGSRL * SQHFGRPMQVDHL 
R/LGVQDQ/RWPICGQYGETPA\LLKLP 
KIS WAWWLAPVI PA 


9761 


23662 


A 


9834 


125 


409 


GQENRET WKIVHLV * VLYT * HI KRLNC F 
SHFI*CYQPTASQAHVHHDSNDSSTHV\ 
N*NSRWPGTVAHACNPSILEG*GGWIT* 
GQEFETSIiANM 


9762 


23663 


A 


9835 


223 


1 


PKPQKMGTF*PPMEGS*MGPPHWEKMGP 
FLKKKEGHPFFFFFFEMESRSVT\RLEC 
SGA/TISAHCNLCLPGSSNS 




23664 


A 


9836 


377 


515 


FILFLRQVLTIiIjPRLEYSGAIVAHCSLX 
AF*G*SDPPASASRV 


9764 


23665 


A 


9837 


274 


1 


AGEWHDQ IC VFKI PLMKNGLHGARVGQB 
SWKERNQQV*DRCQKQNLSDS*IQT*R/ 
m/wpgtvahayn/ PSTLGGQGEWIT*GQ 
EFETSIiANMVK 


9765 


23666 


A 


9838 


553 


114 


GAGVKTHPGHKGKTLFFFKFFFQKLPGV 
GGKTLPS PLSRGLGRENS FHPGGKGSNK 
QSSPPS FPG/WGKKGGGLPFQKKKKKEK 
RKKGKKNGVAPSEPPLKYSN/ WQATWG* 
RKLIiNDTMRVQPTKSRMQKIIiQDKGPVN 
VjJLr 1 KJAjKLi 


9766 


23667 


A 


9839 


36 


434 


LPFPQCVTEFIIVLISWCYIREDACKNL 
KHVAITI INVCIYICVCVCPSTLYVYMY 
MLP/HLSDTLLTLN*VTLVEMLSSFQCL 
MFWYHGHI *K*HELDWAHACNPSTMGG 
QGGWIT*CQEYKTSLANMVKPR 
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ammo aciu 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 

rltnnr in 
UlUg IU 

last amino 
acid 

residue of 
sequence 


Amino acid sequence (A^Atanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine» 
G=Grycine, H=Histidine, I=Isoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

(\ — (~l 1 iifq m iri*> t>— : A roininp S=Serinft_ 
\^ wiuuiiiiiiic, i% — Argmmc, k7~wci itit, 

T=Threonine, V=Valine, 

W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stop codon, /^possible 

nni*lckniifl«» HplA+inn V=nn«y:ih!ft 

nucleotide insertion 


9767 


23668 


A 


9840 


268 


1 


RFPPFFCFLTILFWAPGWFSNFIjGVRA 
PQPP*IFFPI*KKPSPLIFFFFFFEKES 
RSVS\RLEYSGTISAHCKLCVSGSQHSP 
AS AS 


9768 


23669 


A 


9841 


463 


86 


LYMHEFISGLFLLFCWSI / WSVLCQYKN 
CFDDKVYGLDYGNTSQVFT*NSKLIKLY 
TLTMYS FLCINQTTI KPSEM IQATKANI 
DK*NYMKLENFCTAKSTVNKMKRQLKKW 


9769 


23670 


A 


9842 


37 


400 


VHSPPFKFWSLYVSVTiKFPFCSLYFKFL 
ALYS IFLQMCQHLLV* AFFF *KRGLNS V 
T\RLBCRGAISPNCNLCL/ SGSSDNWAS 
LPRKAEAGNSFT/ CR* SQVAGNP I PS I s 

TP T tNTIP TTT P RR Q 

X X\ » IH^IvlvX CK IjO 


9770 


23671 


A 


9843 


42 


407 


NCLSFLITFFFFFLFENKVSFCPQG*G/ 
RGGPFWPHGTLPPRG*GNNPP*LPGRGE 
*RGAPPPPGYFWLFGKKRGFPL\GQGGS 
KPPAI.KEPPPIiGPPKGG\NYKRNPPPPP 
GNFF * LP P YQW 


9771 


23672 


A 


9844 


113 


379 


LGPWATSLiRGRFLGYQtiQQSKKH*NYP 

VWQFLKXLNTGLPFDAETPL/LAETPKE 
LKA 


9772 


23673 


A 


9845 


284 


528 


PLPKTME IMLDKKQIQTIFLFKFKMGHK 
IAETTRNIKNAFGPGTANE\KCTAQWWF 
KTTFrKA* F\ QTiEDTCEAV *GHPSEVDTN 


9773 


23674 


A 


9846 


888 


1295 


LEQGCNLFLHKHS *IjGEVFFFVCFRRSF 
TLVAQAGVKWRDLGSL/HKLPSSLSLPS 
SWDYRPPLPRLA/NFFVFLVEMGF\TVL 
APMV<5 T Q * PRDPPTSASQSAGI IGVSHR 
AGP VAG I LMFCRWNI SNKPKGAVFKKK 


9774 


23675 


A 


9847 


88 


387 


AYRMKI IDRISLSLYTAALWVTliFLFLF 
LFFFFKTEFCFAPQAGGQ/WGQFKLMDP 
NPP PPG* K\DFLVSHPRDLG I KGAP PQC 
GQNFVF YKE KGGWLLTAT 


9775 


23676 


A 


9b4o 


298 




ITT PPrtaDTJPPnrWTTT FQPPLGGPTKTPR 
GALEKKTLF*VRGPWPGFVKKESPRPKK 
LG*QIFWPPGKKPNPPMGG/SLDPPPPL 
ILSRPDPP*KKKKKKKKKKKGGGPIxKNP 
PGGPKFPRGGKNFFPPFRGAYKNTPGGS 
* EKNPFLGGGKKRKQP 


9776 


23677 ' 


A 


9849 


251 


498 


AI IKKMESKKFW*RHGEIGTLIHCYWEF 
KMV*PFWKTV*QFLK/D/IiNMGLPFDSA 
FQLPDICLGFiKTYVHTENCTQMFMAAL 
F 


9777 


23678 


A 


yojv 




j t 


ESRNKHSHT.WS TNFOKGLR*LNRGKOPL 
NK* CRHNS I STCKRMKLDPYLTPYIKI\ 
rai»rrKDLNVRAKTIKSx^ 
LG*GNDFLDMTPKAQTTK*NID*LDIIK 
I *NFCGSKDTINKVRRQSTE*BKI FANH 
VADKT.K 


9778 


23679 


A 


9851 


2 


378 


RIxEGLFLCALFCS IAI CMFFFFFFFF*K 
KRGPFGFQGGDPGGNFRQLDPLPPGIKQ 

f^gpXppfjcagktgggpppganlgfwgk 
krvphggqkgfkp /rnprgsprpgppkg 

GVAFVFRPKP PGLEQ 


9779 


23680 


A 


9852 


229 


3 


FGPYKIFFK3CKGAPCPLENPPVF\HQKL 
GLGFF F FS FF FF * DGVS LFLLRLKCS AA 
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■ CjIUUv Ul 
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peptide 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=lso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
v^=v»iuxaniine, k— Arginine, o— oerme, 

T— Thr*»nninf» V— Voting 

* — i utcuuuic, ▼ — valine, 
W<=Tryptophan, Y=Tyrosine, 
X=Unknovrn, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
iiuLicunue inscruou 














ILAHYKFLLPG* SDSAASAS 


9780 


23681 


A 


9853 


108 


420' 


GPRVCTRLSLRDDSDLLATEDSLAHTEG 
DPVTASSHSQAPSLSVHPGAALEGVCWG 
IbAIAui PKFQKPPQbTPGGRPA\PIAP 
GLTWEPPSHPPPQ*RSPPPPP 


9781 


23682 


A 


9854 


2 


256 


FFIFPLRYYIYCARFQFLSPILYLGK*K " 
MD* *RRVFQEKWEQAYFFVEV\SPMCLI 
UNyU i»o VSKjB xJH * RCHYETNHGENFD * F 
TEKMHDENLLHLTTRQP 


9782 


23683 


A 


9855 


323 


3 


I IQMANKHRKRH \QSQ * * VRVMQKKI I M 

TITO VT nrtfilT vrndVPrniTtm/innimincnnT 

XHRYI PQWLKTS KS DM I KCCS GYGTTE I 
FTHYIiWECKLLHIiLWRMWL\ ILLKFKT 
DTPHDPAIPLLGTYPAECMHMYPRA 


9783 


23684 


A 


9856 


28 


385 


DGVTQAGTQWGS KFTAA* TA+ YQGS I PN 
AWSFFSPPPVLKTPPPPQKK/YPPPKKK 
KIPPPKKK 


9784 


23685 


A 


9857 


436 


518 


aanrleniyrhliydkd/ehyrivgkdd 

LFNKWCWVNWI/ASNKEKNLDSYLIPHT 
K/LNSR* I IQLDVKD* I IKLLEDNLYLH 
DLGDRQKFLGRI /PVFTIKKKIGKFDPM 
LKLSTFVHQKTPC+RLNSA 


9785 


23686 


A 


9858 


153 


492 


RLAGSDPGVADVSVQLQRQEKS PCPS WK 
AVSQAESSSSSAGVSLVLFRLLADWMRP 
IPIKEENKL\SQSTDLNFNFIQKHCHRN 
TQNS V* PGGVGPPGTOGPFRRQSSP 


y /oo 


zjoo/ 


A 




2 


419 


r ruKLQ VbHKKbT YSHFSKKPTHTTNKD 
MKRYSPSL/ AIREMQSKTTG\MRYHFTP 
KYG\NNKCWLGCGETETLI YGW * ECKMV 
QPLWKAVWHFLK* IiNI ES / P I Y * *NYS * 
xISYSWRKaCITIYDFVILLLGIYPREV 
KTH 


9787 


23688 


A 


9860 


88 


419 


tfffffgfwflktkplfvpqlfegpgp i 
lg*wnppprd*knfs/gpnppggge*ra 

QPPFPGYFLFFKKK\GVSPWGGGGSKPP 
TPGNPPPGPPKIRVIRGGPPPPGGRKHF 


9788 


23689 


A 


9861 


301 


401 


KRA*GGQEPIKVCPFFYNRKVGNGSPPK 
KFKKGRRAAFFGLRANKRGLFVKKGK$I 
WVGKVGEPIT/DPFKEFGGR\LPEKDGP 
LV , 


9789 


23690 


A 


9862 


220 


415 


KIMGGAQI FRGGGGFFFFLEGWEKNNFG 
VbrKKiUrFr\3VjC3Vr JjPPPP *HKKNYFSS 
QRQYISLGGGGRKTPPPKKNFIiLKDTPK 
LFFSHPSKKKKNPPPPRKIWAPPMIF* I 

FFFNFKKP IFKTFLS PPFKVFPPPKKKK 
KKNP P I S YRRWPLAI 


9790 


23691 


A 


9863 


116 


366 


GQEFETSPAMFCFETESCS IA* AWQWH 
EPRIRRLQSVEITPLHSSLGNRVRLHL 


9791 


23692 


A 


9864 


619 


295 


FFFEMEFSLLLPRLECNGAISAHRNLRL 
PGSSDSPASAGGLL* SQVAGITRLRHHD 
+ LILY/FLVEMRFHRVG*AGLELLTSGD 
PPSSASQGAGITGMSHSAGHYGKIF 


9792 


23693 


A 


9865 


12 


432 


IADRRLFFTNHKDIGTLYLLFGA*AGVL 
STALSLliIRAELGQPGNLLGNDHlYNGI 
GTAHAFG I ILVI VI P II IGGFGN* LVTL 
IIGAPDMAFPRINNrSPGLLPTSLLLLL 
TSAILEARAGTC* TVYPCLA\GGYSHLG 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
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residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lso leucine, 
K=Lysine, L= Leu cine, M=Metbionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














A 


9793 


23694 


A 


9866 


30 


380 


LFSTNRRGIGPLYLLFGA* SGVLGPALT 
LLI*AEDGQPGYIiLGDDHTYNDIDTAHA 
FVIILFIVIPIIIGGFGD*LVPLIIGAP 

DMAFPRIN/NI sf*llpaslllllasgi 

REGRSK 


9794 


23695 


A 


9867 


256 


395 


EVLRKS EKS FETVCI INNKC / WPGMVAH 
ACNPMTLGGRGGRIT*GQEF 


9795 


23696 


A 


9868 


458 


2 


PKKRFFSQKPPRGFYS APLKGKNY I FPP 
PVNFGPPKDFFKGPPLFFFFFFFFFFFP 
RFFFFFLEKG*KF*NFFFT*NMVFFYIS 
/CSKKFPFQFVLTPIPF+KVPKPVWFFW 
DFPIFFKKIFFFFFFCDRVSLCHPGWSA 
VARSRLTASPRV 


9796 


23697 


A 


9869 


144 


425 


IPLCSRIYSFGPC*FSLISSSSCLSPFI J 
SI PLCVLS * * T* * YLFFYISMREI YKTR 
TKGIYPGAQRNSLYTH\FSIEVQSTIKA 
ERFWPGAVARACNPSTLGGRGGQITWGQ 
EFE 


9797 


23698 


A 


9870 


138 


401 


DLRLKLNLSKNIIRLQGTVESSVLTIKE 
MQIKAKVRYNLIPVKIKR/ SGNDRWW*G 
CGERGTLIRC* *ECKtiVQPLWKrVWSFL 
KKKKK 


9798 


23699 


A 


9871 


179 


441 


PSGKGGIKGPPPPPKKIFGNFFFFLKKK 
GVPPGPPGGPKPRPLETPPPNPPKGGEY 
GGGP\WPPPPIP*FFKGPFKKKGPPWTN 
RGNP 


9799 


23700 


A 


9872 


57 


435 


FTQMRKNLKNNSGNMKK*GS ITPLKDHS 
NFPAVDPNQNEMFEIPGKEFKK/SDY*V 
TQGDTRERCKPT*VLKTILDMDEKCSKE 
MD I LMKNQS E LLEKKDTFRELQNAMES F 
NNRLDQVEERITELE 


9800 


23701 


A 


9873 


228 


443 


FSSKFILISVFLFLFLETGSCSVAQAGV 
QL*DYSSLQPQSPG\SGNPPISASQIAT 
TAGACLHALLI FVFFV 


9801 


23702 


A 


9874 


349 


2 


KKPKKKNFLPPKNFGFFSPFSP*KFFFF 
LKGFNFFRGFFPNFPPPKKKFFSKNSQL 
VFFLPPLKKKIF/CFPTPVKFGP/SQRF 
FLKGPPLFFFFFFFFTjDRVWLCYPGWSA 
VARSRIS 


9802 


23703 


A 


9875 


405 


2 


kkasrmenvrkkkkiiemiklseegmlka 
nigqkls llakqqvvnakenllkeirsa 

IPLNTQMIRMQNSI*IADMENIL/WWI / 
EDPTNYMIPLSQSLIQSHALT\FNSMKT 
ERGEEAA*EKFDAI SG * FMTFKERS I 


9803 


23704 


A 


9876 


355 


394 t 


THPYYSHQEYQS P * P/LTGALSALLKTS 
GLAM*GHFHSITIiI,ILGIiLTNTLTIYQ* 
WRDVTR* SAYQGHHTPPVQKGLRYGI I h 
FITSEVFFFAGLF+AFYHSSLAPTPQLG 
GHWPPTGITPLNPLEDPLLNTSVLLASG 
VSIT 


9804 


23705 


A 


9877 


47 


85 


TIYIHICVRIYACECVCVCAYIHICGVP 
AHLTEQNlA KPARTQS P PAPQKNPELPV 
DLVS ISLDREAQDVEIaNHYRLGNTCRTD 
VLPKVNTLCLRL/NFFPCLPILEELQEL 
P*LENIPLFPQPIDYIYTYMRAYICV 


9805 


23706 


A 


9878 


2 


389 


GRGQFP PEFFYFFWGGSLVPSTPNKKKP 
LGEVTiGGGWSNRAP KGVJGGG F P P P ILAF 
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E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P^Proline, 
Q=G)utamine, R=Arginine, S^Serine, 
Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














KPKGPFGPKFVLKPSNVPPLLFFPKMRG 
PLPKGGGFLPFFFFFETKSRSVTQPGVQ 
WCNFGSL*PPP\TGSSNS 


9806 


23707 


A 


9879 


351 


1 


FLGPSKNFNPFPPQKWPFAFFPKSHRAR 
VNEI FS /RLSVINF *GLTKPQRLFFSLL 
NFSS\PIPPFQFPSRLIKEIRFSTFLTP 
ENSKIKKPLFFF* FFFF*DRVTLCCPGW 
SAWQSR 


9807 


23708 


A 


9880 


123 


387 


S FLWKLRPSGAPTCMRCLLAXLLGGVSQ 
LGYTGVRD PLE E AVCP FSELQRHAGKTT 
ALFRAVRQGCFFFAYF* IELFPCY/ CSG 
RVGG 


9808 


23709 


A 


9881 


341 


659 


S FLGRVQWLHACKSQHFGRLRQVDHLRS 
GV*DLPGQHDETLSLIiKMPK/QKLARHG 
GMCL* SQLLRRLRQENHLNLGGRGCNEL 
RSQHCTPAWATEQDS ISKTKTKK 


9809 


23710 


A 


9882 


216 


1 


PKFFWVLRDSQIFKPGFGPQGFF/GGFG 
NPQGKKKPGFWGLG * KKKGFFFFFFFFF 
LRDGISFCCPGWSIQW 


9810 


23711 


A 


9883 


2 


244 


GRVGAVGRREGENFI KCDELPPPPPEGK 
QQNLCKFLTGFSLPCQDKKP\CFFV*KK 
KKKKKKKKKKKKKKKKKKEKGGGGA 


9811 


23712 


A 


9884 


310 


2 


IFLQTLFPNTFENSLSEYCI IFSEYSFI 
**KQ\TNFILSSILLYYI*PSGSYISNL 
*Y+ELFSQNLSYQHFLWI*KKWGLGAVA 
HTYNPSTLGGQGRWIAWSPG 


9812 


23713 


A 


9885 


242 


358 


KTVWIiYMNI *MANNHME*CSLPLVIREM 
*IKTPVTVH\TVGMAAI*NKK*NITR/C 
W * ACRETAVLVYCWGE PYTQPL * KTVWL 
YMNIE/IPCDPAIPLLGMYIiKE*RPGTV 
AHTCNPR 


9813 


23714 


A 


9886 


360 


2 


LLKLGFPFWAKKGKKKRIiWVEGPLGKPP 
GGFCQKLFFSPGGGPKGFSPS IIIIiPKG 
* PFVWGPFREVF* KKRFKLKKKFRPTFS 
/Y/LFFFFETESHSVA\RMECSGAILAH 
CNLCRLGSSD 


9814 


23715 


A 


9887 


236 


347 


IKGFFFF*TESHSVT\RLECTST/SAHC 
NLCLPGSSNSP 


9815 


23716 


A 


9888 


310 


3 


NFFFFFETGSCSVTQARVQWSDQ/GSLQ 
PRSPGL\SDPPTSAS * VAGNLKFLSGLC 
SSL*SLEIS*DTKK5QVQITDDLCYTVV 
LCLF FEMES HSWQAGVQWH 


9816 


23717 


A 


9889 


119 


391 


APAYWNPHARDSPFFFF*TNFPFFPQVG 
GQWPNSGSPQAPPPG/SPPP/YSASASK 
ISGAPGAPPPPGPNFFFSFFSSPTFFSR 
DGVSPYEP 


9817 


23718 


A 


£890 


195 


3 


DGVS LLLPRLE CNGT I LTHCCLS PV/ S W 
DYRRLP*RL/V* FFLFLVEMGFHHVAQA 
GLELLTSGDHP 


9818 


23719 


A 


9891 


3 


378 


RDGERNMLADGTASAKALKYSLALS LPR 
LECSGTI ISHSSLNPP/GLKWSSCLSST 
SSWDFRPAPPHLAKKTKQQ/HATFF* EG 

YKL\DRHAW 


9819 


23720 


A 


9892 


231 


1 


PKPFFPKFFSP*N*KPFFQISPAKIQKI 
RVYQS\HKKFFSFFFFFHGVSLLLPRIiE 
CNGTISAQCNLCLSGSSDSP 


9820 


23721 


A 


9893 


2 


347 


APARQENWCVYVCVCMCVCIVVLHRHC 
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K=Lysine, L=Leucine,M=Methionine, 
N-Asparagine, P=Proline, 
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T=Threonine, V==Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














EVLCPYKMLGKKRSIYAY*LVy*YIYIH 
MCVYIYI / YI YV/YMYTYTYVCIYTH 


9821 


23722 


A 


9894 


28 


382 


TAHLEAGATGYRTPLIiPAIGLLCLPWVL 
DIX3AGQTAQ*SMIiLDHAMLLAHRAHELF 
I /DTYYEFEETYIPKDQKYSFLHDSQTS 
FCFSDS IATPSNMEETQQKSNLKIaLRI S 
IiLLIESW 


9822 


23723 


A 


9895 


93 


386 | 


FWKI ISHYYFKYFS CS I LIS PSDLPITC 
ICMFHLLKLCQSSEFLHLLLFVWLVFET 
QSHS IKQPECS* * LSAA/CHLPGS SDP P 


9823 


23724 


A 


9896 


174 


1 


TLDWGPRKKNPFFFFFETRSRSVTEAGV 
Q*HNHGPLQPQPAG\SSDSPTSASQADA 
W 


9824 


23725 


A 


9897 


251 


386 


LQRLSCSGTI/SAHCNLCIjLGSSNPLAS 
AS * IAGTTGTLTGDVDST 


9825 


23726 


A 


9898 


127 


393 


GHMGTLTGSPATLS ILAIiFFWGGRISTQ 
LDGINKS \FSLFLYFLRHCSLSLRLE*S 
GAIITYCRLELLGSGDTPASASQETGTA 
GTCHT 


9826 


23727 


A 


9899 


3 


334 


KLRLGQLNTSAMAAKC*SERKS/R/THL 

tlnqklemiklreegmlkavigqklgll 
yq/tsqf™akeellkdiksatpvta*m 
igkqnslncdlekscwliqdqtg/hni 

FLS 


9827 


23728 


A 


9900 


356 


1 


GRKPFSHLPKATLLPMGPVLGGRRALMG 
PDSRPGPVPSCSLVLLTPLAPLPLTARE 
SLCPCPPS *TPQPSVNP/H/GKLARRSP 
CWSGRQSLP*AEIVPLHAPPALGDRDB 
TPSQKKKK 


9828 


23729 


A 


9901 


118 


343 


I PYAKEKKKQEWVLRFIHVNLCISNWFW 
FETGSHFVTQG*VHWC35I*SSLQP/PNLQ 
GSGDPPTSASRAAGATGVQH 


9829 


23730 


A 


9902 


321 


50 


slrtrv*rppsapg*npfclkpqkifqg 
ggXesplsqilkrvkqensywlggkgfn 

* PKLPPCPIjTWATKKTSPPKKKKKKNYR 
TWETSVS 


9830 


23731 


A 


9903 


317 


8 


ncydpnsgivtpllgilpkstk\t*vrt 
ktct*mlivalciitkkwk*skclstdk 

QVNKI * YIHIMEYYSPIKGRKY*HTLQH 
/WMNLBNIS *KRLDIXKPHII 


9831 


23732 


A 


9904 


287 


3 


RGALNGRGGPRGVKSPGFAPFKLQLKPP 
GTFIjNPK\QFPFFPNPPF*KKGAFFHFF 
LPIiF*SFRWTEPYSVT\KLECGGAISAH 
CNLCLS CSSDSRA 


9832 


23733 


A 


9905 


2 


406 


PRVRTSSRSRATALFFFFFFFVFFWGKG 
EI WAPPIiKNC/ IPPEKFCYFGGGAPGAG 
LPPPPGVIFFSLCRVVKKKKKKGAPPGG 
VLGKKKKTPFTPKGGTFLTLGGF* KKSL 
FGKKTLIiWVGGPLI*LKNFF* EKFR 


9833 


23734 


A 


9906 


1 


296 


I WVGATE C I FKNNPT I WGHQETHFTCKD 
T YRLKV/ KGWKKI FHTFHTNGNQK* AG I 
AI VISDKTDFKSKT IKR* KGHY/ IMTNG 


9834 


23735 


A 


9907 


474 


41 


FMEYLTLS FIHVLLQ* FIHCFVHLFIHS 
FFYALMKSLIQ*FIHLYSCF/ICIDIiLI 
SS CI YS F I YALVKS LANSLTHS F IHTFL 
Q* FVHLFTPLLICS FTHSS IH* PSHSLT 
FIQLHLFFHALVSGFIWSFIYLFCKINV 
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sequence 


Amino acid sequence (A= Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleuctne, 
K=Lysine, L=Le urine, M=Methionine, 
N=Asparagine, IMProline, 

f~^ == f»lll ti\ minp PeAroinina C C A ci H ft 

viuunumc) *v-ai gin me, o — oenne, 

* M ui cvmiiiic, v — Valine, 

W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

nucleAfidf* rfol<»tmn V — mwdkia 

iiuvicuuuc ucicuuil) r^JJvoSIDrc 

nucleotide insertion 














FLIDK 


9835 


23736 


A 


9908 


286 


84 


lkppppfffffqanflcfird/gfhhvg 
pasplisgphqti 


9836 


23737 


A 


9909 


316 


63 


DQLLVRSIGFBALMSPIFFFQMEF/SLS 

AV* /PMANLLKLCAS VSYCIELVLNELH 
WIK 


9837 


23738 


A 


9910 


370 


461 


F* F* flfseteshsva\qlecsamisgr 

SEPGLCHCTAAW 


9838 


23739 


A 


QQ1 1 
✓✓I 1 


994 




Jruiuio 1 vyivlSr'VJlV.Wr IJSMKlrTPGENAVN 
IV* PTTKDLEY/WINI /DKTVAGCERTD 
SNFERSSTMGKML\ SNS IAWYTEI FRGR 
KSQLMGI 


9839 


23740 


A 


9912 


307 


463 


CKL * TLSNND VI VGS LIVT / TCTTLVGV 

UXFIkjXJ / fCUUM tu Ji KJi X IjGWJjGT FLiS I Ii 


9840 


23741 


A 


9913 


377 


28 


REMLTVKPLAPSFFRWSRAYRAIFFITP 
TWVSKSGKVTGPLFFIjHIFFPGP/CPGL 
PIFFFIKTKGTRFFFGWFFFFERVSLCC 

SEFHR 


9841 


23742 


A 


9914 


384 


725 


HLTAIRMTVIKDTENSKCERGC \KTLMH 

OWVYJB X JSKVUJbMjnA.1 KW'r/ IiQQVE/IjPS 

PCDPSIP/LLRMYP^LNTD/CKRMC/C 


9842 


23743 


A 


9915 


187 


3 


QPHKHLGLDNI YI FFEIGFCSVAQTGVQ 
PRQENG 


9843 


23744 


A 


9916 


15 


167 


DGVSLSLPRLECNGAILAHCNLR/LPGL 
K*FSCLRLPSSWDYRIjPPPHPAHV 


9844 


23745 


A 


9917 


345 


477 


YCSRTSLPL/ TLKKGSA* PGAVAHAYNP 
SILGGRGRWIT*GQEFK 


9845 


23746 


A 


77 IO 


*frvv 




1 J\Mvj]£ l\J<i>KKTGNS KNQS A 
S/APPKERSSSPAMEQSWTENDFDELRQ 
EGFR * SNYSELKEEVRTHGKEVKNLEKK 
LDEWLTRI /SNAEKSLRDLMELTPKA* Q 
LPPERTNLSS* FNQLEERVS VMEDEINK 
MK 


9846 


23747 


A 


9919 


282 


21 


AAGYLQPSEFWLHSLGSRRGLPISVFPV 
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Predict- 
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ding to 
last amino 
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residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
OGIycine, H=Histidine, I=Isoteucine, 
K-Lysine, L=Leucine, M=Metfaionine, 
N=Asparagine, ^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














RGRQAWLSCDGYHPVRPSRLIjCLPTQA 


9851 


23752 


A 


9924 


254 


3 


PGGFYFPGFLPPPLGFPKGFL*RLWPPC 
PATFFFLYQ I SKFPF F FFGKYQKNF FFF 
FFFFPESKSRSVT\RLECNGAISAHYNIi 


9852 


23753 


A 


9925 


309 


1 


YTPTIMLLGIYSNELKTNVHMKT* I *MF 
TATLFLIAETWKQPRCSSLSEWI/QQLW 
YIQTI/EYYSWRRNIK/LMKTW/MKLK 
YILQSEKTQSEKTTYI * FQLYDI 


9853 


23754 


A 


9926 


237 


2 


RWNFNFTQLILKC I FLRTKFFS*VS K*N 
FKYFLGFWFFFFLRQGLTPVT\RLECSG 
TISAQCNLRLPGSTDFSASASQE 


9854 


23755 


A 


9927 


244 


1 


KPQCLWRK*D* WHNTQKLDVEKGKPLIH 
YWWERKLV*PLWKTVWRLMR/NLKIEIiP 
YDPVI PLWGNYSKEMK* LCQREDSRA 


9855 


23756 


A 


9928 


2 


644 


IVQVDQKIRLNIHCLQBTHFKCKYTYRIi 
KVNR*R*VYHVNINQEKALYAIDGAYFR 
ARKITRDKERHY/ IMIMGSMLQKD 


9856 


23757 


A 


9929 


1 


364 


GTSGTRPINNTTVFSKCSSERKT/R/TH 
LTLNQKLEM I KLSGGGI S KTDMG *NLGL 
LHQTVSQWNTKEMFLKEI KSATPLNT* 
MIRHQNSLIAAIGKVLWWIDQ/TQHNT 
LLSQRLIWNKALT 


9857 


23758 


A 


9930 


88 


374 


ILKACGPCPGNPHPWGPL*GQGGLGIiNP 
SKSPPHPHPRGDPTWGGTPIGDYGPKPP 
KLPHGAGLPKKPCSSGGQNN/ PKISGP 


9858 


23759 


A 


9931 


384 


2 


LCSPRFFYRKIWGGGQGSRIiLPPPFWKP 
GGKDCFSPGVLDPPGEQRGAPCFP*KKI 
RWG*GAAPWPPPQEG* /VGRAPLTPKI 
PAPVSFGCSPALPGEGQSETPFFLKKKK 
KEKSEAMIPPPPGPRA 


9859 


23760 


A 


9932 


32 


359 


IFFFLFFF/C*KGGPPPVPQAGGQGPKT 
PPWPPIWAKKTKPLSKKKKKKKKGAGPP 
PPPKGGPPPLKKGEGPLLKRKFLKKNGG 
ENFKKKKIWAGGGGNPREPPPFGGAR 


9860 


23761 


A 


9933 


138 


1 


TGFFCCC* SFALVAKGGVQWCDDGSLQP 
PPPG\SSDSPTYASCSC 


9861 


23762 


A 


9934 


103 


355 


LAGQACCS * STS ANRCVNHA\ PG YANGR • 
CDTITCILVFIRNSWLITPIALITMFCY 
HVG*VRPQEQAPGNRVSIiSDLI J L\PPVT 
C 


9862 


23763 


A 


9935 


29 


363 


AWSHNAFLLFFFKRELAFVPQRGERENN 
FVSLKPPPPG*SPFSAPNPPKGGNKGPP 
PPPQLI FFYF* /GEKGFS PYNPEGPKP P 
TUGKPGLF*NTKKICPQPRGLKKGGQHI 
W 


9863 


23764 


A 


9936 


173 


363 


PKKRPIISLKRAFNSNPGD*QKFLKRGL 
PPMGSQFKRGNYPYDKNLEKLCPGGGNK 
GSG/YHYGL 


9864 


23765 


A 


9937 


360 


3 


NQKMRKNQSSKTGI CKNQSACPPPKERS 
SSLAMEQSWMENVFDE tiREEGSR* *nys 
ELKEEVRTHGKEVKNLEK/NLDEWLSRI 
S/N*EKSLKDLMELKPKAQELHGECPSL 


9865 


23766 


A 


9938 


194 


370 


VSNFFFFFETEFCSFAQVGAQGQNFR*L 
KAPPP\NCCLFPT+ASRWGTTGTCHHS 
WLI 


9866 


23767 


A 


9939 


426 


1 


SQEFKVAVSYDLTMHSNLSNSETLSQKE 
KTKQTKTKQKKTY* EGRKHIKRCPTPLI 
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Amino acid sequence (A=Alanine 
OCysteioe, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, MHMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *«Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














IREIQIQTTVRYHFTHIRI / VLLPQKQN 
KCWCRVE KLKPLVRC * WECKT VQPLWI T 
VWW / FLKI LNI KLP FDLATPVVC I YQME 
FSC 


9867 


23768 


A 


9940 


255 


16 


RVTYNIYIMYILCIYTPSRLLVEIGFHH 
VGQDGLDLLTS * S IRLSLPK\CWDYRYE 
PPCTNIYIYLHIYLIYMSDGVKY 


9868 


23769 


A 


9941 


206 


355 


FLQGHTAI KNGRG/ W PGAVAHTCNPSTL 
GG*GGRI I *GQBFKTSIVTNMVEP 


9869 


23770 


A 


9942 


3 


402 


HEELNPRS ATS S PALVLHVTI AHVTIAF 
AEGCFQHDFCFSQPENRHIPLVFTTASK 
VRNIFYLLCRGRGILRFFLVCLKIDPAI 
PLMGIYPK\KSLYHKDTYIHMFIAABFT 
VAKIRYQSKCPSTDD* IKKICV 


9870 


23771 


A 


9943 


79 


328 


CIKILLKFLCLFCILPPCLNFFLRQSCS 
VARAGVQW\SLQP+ PPRLKHTPTSAS* S 
AETRGTHHHAHLIFSYNFL*R*VSHVA 


9871 


23772 


A 


9944 


321 


414 


VPSWAG\PVAHACNPSTLGGRGGRIT*G 
QEFE 


9872 


23773 


A 


9945 


142 


480 


QEAKSFDFSDSTTMEVMLDK^IRVIFL 
FEFK\MGRKAAETTCNIENAFAPGAAND 
♦QMRWWLEKFCKGEBSLEDBECSGRPSE 
VDDDQLRAIIEADPLTTIQBVAKEIiSID 
H 


9873 


23774 


A 


9946 


191 


3 


FFLPRGGGDSVFKKPPKKKNGFPKIFPF " 

CPGFPLAT/INFPPQIFFFFFFF*DGVS 

LCHPGWNAV 


9874 


23775 


A 


9947 


451 


461 


K* INK* KVKGW/RKEHHANINQKKVDVT 
I FISDNMNFRKKNKI TGDREGRYI I IKQ 
PIHQENIAILNVYVPNNRVAKYVK*KLI 
ELKKEIHKFriliVGDFNTIIiSTIDRTTR 
*KVS 


9875 


23776 


A 


9948 


186 


405 


NHTLAISLF*PYKP*AILKLGIYRKERK 
LVC*RDI CALMFITELVTAAKLWNQPKC 
PSTDK/WKMWYICTMEYYS 


9876 


23777 


A 


9949 


18 


412 


PEFRITTILPEKKNLIj*LTN*VSKFSGY 
NINI *TSVAFF\YSNNKISETGIKIVPF 
TILSNRMKYLGM/TLTKNMKDLHTENYK 
MLRKENEEYQMCTIMGSWILKINIVKISI 
LHKVI YRVKFLSNFNAI LLKK 


9877 


23778 


A 


9950 


1 


396 


LAFDGRAERLKTGFHRVTQDGLELLTS* 
SVRLGLLKCWDYWREP/PVPGLWD 


9878 


23779 


A 


9951 


307 


405 


GIRVRR* PQGLRPPNPDALVFKKMVWPP 
PTN\PRGWGGKGWEPPQGPQDPPPQAAP 
GFTjPBTWRGPWNSLPTRGAWPPNSNLPP 
WPPAPSRHPAQRAPGPSTAAPPPGGGGE 
GWDPPGRV 


9879 


23780 


A 


9952 


2 


402 


YGRPKEGGSLRFWIiTTPGPKGETPFFL 
KTQKITGGGGGGP* FPLPRRVRQENS FY 
SRGGGFN* PKRAPCALTWGKKQKFFLKK 
KKNTPQKPKKLPGCRIQY* TQK/SSSLF 
MGGTPGKRQ 


9880 


23781 


A 


9953 


249 


1070 


SSFISRFYFICIYLFETRSHSVTKARVQ 
CCDYN*IiQPQPPGAQEILPLQPP*SAG\ 
ITGVAPNTPQLIFYFFVE/SGGSTHVAP 
RLALNSWAQNIAL/RLPKVLYFTF 


9881 


23782 


A 


9954 


75 


396 


GFKGRKRGLPLPLNQKLKTLNLSEKGLS 
KANTGQKLGLL/RPVSQVVNPNGKFLRE 
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peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny1alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N^Asparagine, P=Proline, 
Vr^vjiuiaraine, k— Arginine, ^— oerinc, 
T=Threonine, V=Valine, 

X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














IRc/iT / rrXCJx* VI rO^/IN oi-»i trUrllLCi v uvj 

V/WKTKKYSHNIPLGLSLTQKRVLTLF 


9882 


23783 


A 


9955 


181 


458 


RLWNDGLYNI FDLKLQ/ SIHV* SLLLLF 
ILEPESHSVS\RLECSGAVTTCCSLNIP 
GWSDS PPQAFQVAGTTGVCPNAWPSAF 

Li 


9883 


23784 


A 


9956 


67 


430 


LAPRLIYSPIKFQWSFMSVRQIQVQSLA 
I GNFHNPRLP I L I CHCYAFHR I LLFQMV 
RNFIIiYSISISANKQESKCILKYILLYI 
NIiRNWLGTVAHAYN/PSTLGV*GGRIS* 


9884 


23785 


A 


9957 


77 


422 


LPLALWASCRPQLCIiDTDFIHGPPTSGA 
ASQFLCFQTMQPPTCSSSRLYNIi*PALV 
IiDHTTSNWCFRSP/ SFFFLLPLPLPPA 
PLP\ FFFFFFFDRVLLCHPGWSAVARFQ 
LTAAS 


9885 


23786 


A 


9958 


3 


422 


LiHSb'ii V'L'&QUS I £i JUUUUUUL1 VLA&3'XRJ&. 
GGPSLGGFFFGPPGFPKIVKKKRGALIG 
FPFFCPKKKPKKPLFFPKPLGKPPRKKG 
F* IGEPWKT/EGPGFPKKLNFVFFSPWG 
GNFLFPPQKPGGKVGKIIiLiQPKKKPPGK 
r* 


9886 


23787 


A 


9959 


370 


2 


PFSSWVANFREGDGEKCSVLQGIRFCMF 
CRYINVTPLVHFPFPISKGTSPFPLYFF 
SPEVM/HFP*LPY*FGWAIiLSRP*/MSL 

CNGAI SAHCNLCLP 


9887 


23788 


A 


9960 


256 


2 


AWEEETAPLHSHCLGNQARPWNHRHAP \ 
P*IiIVFVFSVETGFHHVAQAVLELLASS 

EG 


9888 


23789 


A 


9961 


269 


3 


HRQACGQVRSPIQCWWACKIAQPSLQIV 
RSSLKG* I * SPYDPVI PLLVI YPRELKT 
\SLHTKTCT*RFTAALFIIARRWKQLKC 

JroJLUJS 


9889 


23790 


A 


9962 


268 


324 


SGTLSH/WIFS*VGIEITMWLKVPIDQTK 
KISLRSAEAAIKYFLTQATASI I IRRAI 
LFNNRIiSEQSSITNTTNQYS SLI I IMAI 
AIKVGMAPFHF* VPEVTQGS PVTSGLLV 
L 


9890 


23791 


A 


9963 


341 


2 


ITFLPTKI IRQSNFKTGKRKTDSNDLNR 
HFKREDTQMANKYM * F I * KMQ I \ KTTMR 
YYFTSKRTTISRKTSNIOC*DTWSPGTF 
INCLLECKMVRLLWKMTWQFLTEVHIHIi 
VY 


9891 


23792 


A 


9964 


250 


2 


YHPVNLYFKYISPIDV*FGCFIYIFDYP 
LCSCFSLHPLVYLQ*L/LFFFFEAESHC 
VMT.FCSGVVSAHCTTLRLPGSSDSPAS 


9892 


23793 


A 


9965 


3 


334 


RTLRHHIWLIDPSFE*HLSCFHIiIiATVN 
SAAINIPC/HSYV*THF*YS+MYRGRTA 

YH/WE*VPISPQSCQKIiDFFIFIiIIAIS 
EG 


9893 


23794 


A 


9966 


179 


352 


NQE*TEILNRPMSNQIESVIKYLPTRKS 
HRPHRFTAELYQTY\ ILLKLFQKVEGEG 
LL 


9894 


23795 


A 


9967 


2 


350 


THPSINSFIHVF1HASMNSLIHLPTHSF 
S I YLFTKS F IHLFIHFFS FLFFLRPSFT 
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correspon 
ding to 
last amino 
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peptide 

sequence 


Amino acid sequence (A=Alanine 1 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G-Glycine, H=Histidine, I=Iso!eucine, 
KHLysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














PVAIYSFIHPFTYSSIH*FIYPCVYSCI 
DEPTHS FTHAFIQHLFIH* II/IFIYSF 
ISFLFF 


9895 


23796 


A 


9968 


228 


558 


CHQLRQELAI FTS FVI LQLFSGHLDVYM 
C3AWAQRPDKYEYDNK*FIEIKKIIQFTL 
I S KRMK/ YVG I MLTR * VKDLHNENYKTL 
MKEIEEDTSEWKDISCSWNGRINSVKT 


9896 


23797 


A 


9969 


196 


2 


DRKVFFRPLFI I FGLKKKIFPLFWFFPS 
/IRGRG* IPFFFFFFFETESPSPKLECS 
ETISCHCTP 


9897 


23798 


A 


9970 


245 


1 


TQCDMCVFIYTHIYISHICYMVYIYIHT 
RTHTGILFTPKREKNSNTCTCATKDNP / 
CMTLC* / HKRRQTHTHTHTHTHTHTHT 


9898 


23799 


A 


9971 


163 


380 


NHRRQKKNGR/ HNGNKYKTVTKMINSNP ' 
P I TRNNLNVNGLKTP I KRQKLSDS I KKQ 
YPSIYCIi*KT/YLKYKD 


9899 


23800 


A 


9972 


328 


1 


AKCGFLKKLNIELFNS I PGHSCKT * TQM 
FAAALYIISKKYKQPSCPSNDE*MNEIW 
HILTIGYSAVKKISWG\RWMNLENIARL 
SGRQRGHILQDSVDMKSPE*AIPQ 


9900 


23801 


A 


9973 


200 


3 


MAS KCS S ERKSHR / S I THNQKLEM I KLS~ 
EEGMS KAKIG* KLGFLCQ\ VIHWTAKG 
KFLKE IKSATP 


9901 


23802 


A 


9974 


111 


517 


NLBQPIilKRTIANKPIKKINTTILI ITI 
NVNGLNTLTK/RQRWSD* IFKNQDPTMI 
/C* KKN/HCKYKDTNTLKIKG* KNIYDA 
NMLTQIKKKAGVAILI/DKIDFRA/TDI 
TRAKEGHFIM I KGSVHQEYVTILNVSAH 
AS 


9902 


23803 


A 


9975 


342 


3 


FIiATKKI FFFFP VFPPQGPPPVLVAP FF 
WAFGGGPPKWAPKKKFFGPRGPPFENFF 
F*GFFFFKTGGFFFIjRFSKKVFFFFFFP 
SETESCSVA\RLECSGTISAYCNLCLPG 
SR 


9903 j 


23804 


A 


9976 


13 


326 


ILDHSNRPS SSTLKTPNAGKDVEQKQFL 
/LCCW*EC3(IVNYIjGR*L/WQFLTKLN1i 
Ii*PYDPBILIiLGIYQKELKTCIHTVACT 
*MFIEALFVIAKT*KQPRYSSVG 


9904 


23805 


A 


9977 


74 


396 


LAPRTQPLPCSPAPLRQCHTQSLLPPFI* 
LRLI FYFFCFLKTRAHSVAQAGAQ*RDH 
GSLQT * TPTLACTPASA/ VARTTGTHHH 
AWLIVFFL/IEKVSHSLVCDKT 


9905 


23806 


A 


9978 


287 | 


2 


QPHNSSYTQI FTPKI 1 1 KTPP PNLKQNN 
TTPIKKKTQ*FP/PSTNP/SYPSTHPPT 
HLFIYPLIHISPPPSPTHPRIHPPTYLS 
IHPSTHPSIHPPSR 


9906 


23807 


A 


9979 


97 


332 


GNNDLFLYFFFFLETGSHSYAQARVQWC 
DHSSL*PQPPGSSGPPISTC/RLGLQAC 
MCHHAWL I FN F FCKGRLSLFGLLR 


9907 


23808 


A 


9980 


168 


2 


STWLRWDYWKTFFKTSKS/WPGTVALTY 
NPYTLGGQGRHI T* GQEFKTSLANTVKS 


9908 


23809 


A 


9981 


267 


1 


LKRERR * RRAN/ S KTSRRKE 1 1 KI RAE i"~ 
TATENRKP IEK/ SNKTNG*FFEKTDK/ 1 
DNPLERLRKKERD* I TKVRNGREDI L * N 
NDLIKVKKD 


9909 


23810 


A 


9982 


230 


412 


CHIYLC/D*PIDGIV*CHIYIYFGHKHT 
HTYTHTHTHTNTHPTC*VSKILLKKKRR 
TCSVCA 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


9910 


23811 


A 


9983 


229 


415 


SYVFCHIRFCPLDSLNKBIVA/SQVQWC 
VSWSTT*VAEAGGSLEPRSLRPDWASQ 
RDPISKN 


9911 


23812 


A 


9984 


2 


288 


RGYDSTSSAYTVGHFT*BDKATITRLSG 
KENVKHAGGBTIiRRIiLDVYP*TQRIIDS 
VGI^SA\SAMM\GPPRLPKVIiGIiQAWT 
TMPGP* KYIHI FF 


9912 


23813 


A 


9985 


246 


3 


FLLDYQYCALCCHCFLFNFANCGHFNI L 
LGGVKHTKFPKYS * KFSVL/ WLDAVAHS 
CNSSTLSGLGWI TRGQEFETSLANMVK 


9913 


23814 


A 


9986 


342 


1 


FLNFGLTGFFPKGPPKPLGLRDGALPLD 
PPQFLGVFKRGAFFLGTQ*FHF\LEGFK 
FWFWEPKAFFFFF/ CCETISAHCNIiCLP 
GSSDSAASASWIAGITGAHHYARLSRRL 
RQQN 


9914 


23815 


A 


9987 


330 


1 


NRPLNNLVMAS KCS SERKG YTS FVLS RK 
/LRKMIKLSEESMLTVTTG*KLGLLCQT 
/ VQVNTKEKFLME I KSATPGNKGTVTKS 
NTLTADREKV* VIWIKNKTNHKI PLSQ 


9915 


23816 


A 


9988 


209 


2 


IKGSHFPVLI*YRKIIIIFGGVKGKGGF 
YSY\LFFFFEKGYHSVTQLKCSGMILAP 
CNLCLPGS*DSPTS 


9916 


23817 


A 


9989 


256 


357 


QHRWMNTEMLNYLC*IIYVHF/HTHTHT 


9917 


23818 


A 


9990 


263 


1 


IQIRVFENSHLWPSYVHFPHGHSYLFCS 
L* *AFFRH\G* FVLSVHFLKI FLNLKKI 
FFETG\SHSVAQACSGMISAHOILCLLG 
SSDR 


9918 


23819 


A 


9991 


3 


368 


SLDPRRSRLHLAKI VPPRS SLGDRARPC 
LLKKLELFFETGFCCVAQPGVRWCSHGP 
LQPRI PG\SSNPLTSAT*VAG\ IAGMCY 
HTQLQLAF 


9919 


23820 


A 


9992 


294 


391 


SGTVAHACNSSTLGGRGGWIT*GQ/ESQ 
TSLTNM 


9920 


23821 


A 


9993 


317 


3 


TDKELLLRDEQRKCFVaiEFSPGKHAMS 
NVDMTKDLEYS INLVDKAAVELERADSN 
FERSSTLGKMLSNSISCYKEIFLERKSQ 
LM*QIPLLIi*/FKKLPQPP*PSA 


9921 


23822 


A 


9994 


193 


395 


IFWITCCSSLSVLAASPCTFMLWRQLLS 
LNLMNQ*DHSLSQSLIQSKSATLFNSVT . 
AERGDESAEEKLEASRSWFMRFKERSCL 
HNI KVQGEAARTDR/ EAAAS YPENIjR / V 
KDEGGYTK*RIFWDEIAFCWKTMPSKT 
FIAREKS I PGFKVSKDRMTARCGRSCL 


9922 


23823 


A 


9995 


82 


420 


S FLWKLC PRGAAAYVRCVSA \liLEDVSQ 
LGYTGVRDPIEEAVS PFSELKRCRERTT 
AVFTAVRQGHLSLQK\FLLPFVQLCPAH 
RGGV*RQ*ALLCCGGIiSPVQAPLPLCLP 
TQ 


9923 


23824 


A 


9996 


376 


1 


CGRTDAPASTS P YAMTVRFLRP PQP CVL 
DGLHNCPARAPQEQWSLGEVDKRGPREQ 
TR\ PATAAP PRPLGS PSWICPGDW* *CR 
NS LQG FS LAASVARGRD WALAGL PATVS 
ARFQEQQKMNTV 


9924 


23825 


A 


9997 


495 


982 


VQKFLRPNLAPKKHQRKLAPNSLQGRLR " 
SLPSPTTOCTMAPPTGVI^SLLLLVTIA 
VCLWRMHS WQKNHW * AS FKKTHD * TGLA 
EP\SCARKQCSEGRTYSNAVISPNLETT 
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E>Glutamic Acid, ^Phenylalanine, 
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N=Asparagine, P=Proline, 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














RIMRVSHTFPWDCTAACCDLSSCDLAW 
/WFEGRCYLVSCPHKENCEPKKMG 


"9925 


23826 


A 


9998 


401 


341 


IASKCSTERV/SRSHTIiKQKLERFKL* E 
KGM/SKNEIGQK*ALLYQ/VSQWNAKE 
KFLKEIKVD\LHAQMIRK*NSLTADVKK 
VLWKIDQTL/ PLGHS VIQSKALTLFSS 
MKAQ*GEEAA*EKS 1 


"9926 


23827 


A 


9999 


30 


393 


DTTI LKLRQLI IGAVATKHWSERKSLTS 
LNLNQKLGTIKLTEEGMLKAKKGQKRGIi 
SCQTVGQVMN/SKEKFLKEIKHAIPGNT 
*MISK*NSL/IIADMEKVLVG/WSKVIF 
*DQTSHNIP*SQN 


9927 


23828 


A 


10000 


133 


1420 


EQRRLPVTPGVASLQTLCCFLPRRGCSH 

G*ESAATHPPGPELVLLLLQGH*AGYLG 

/ DFG/PRTATGS PG\PAGVQPGPCLVPQ 

LSRAAAAG*PDPT* *GAIFSYHCKALRA 

TAARHCPPGC*WPG\PAGSPPGTSRPLS 

HPSSPSPPWTX3VSSFLIiNLVLCP*ASDA 

QGHACYPGPVPALAGQGHDSAGGRLGDP 

GHHWQQAALHG/DQGPPGHQAHAGTQPS 

AQGI/3C* ATHPGEGG* VGYQGLPAG*GR ' 

SHPRFPIVTRPMGRSCPYATVRPGPGSQ 

GQTPPGLV\PRGEAGQNPGGAAPGLSCI 

QEVTORWWW*GKKPLGHYPPCGAVRCPS 

S * PHPDTMFKGTGREGCLPPTS PVGVTG 

QMS\EGSRPCEWTLTMSPWGE*SPRASC 

AMLHFCPGSRVGGYHGCPPLHHMGPQST 

AGQAGQPHTLDIKAS 


"9928 


23829 


A 


10001 


1818 


6682 


IKFPEAEEEEIQEVLVLLFRVAEEKENP 
SRVGNSQREYENQWLYSICNQLRYRNN 
L/RHVKKDERGYYEELLNYSRDHLMLYP 
YRLLDIMVKGFSITPFSYYTGIMENIMN 
SGHNFTAADCLRLLGIGRNQYIDLMIQC 
RSSKKFFRRKTAGDUiPIKFVEIAIEAW 
WWQAGYITEDDIKICTWPEKCATDKTV 
DSGPQLSGSLDYNWHSLYKKGFIYLDV 
PMSDDSCIAVAPLEGFVMNRVL/NFDYF 
ETLLYKI FVS VDEHTNVAEIiANVLEIDL 
SLVKNAVSMYCRLGFAHKKGQVINLDQL 
HSSWKNVPSINRLKSTLDP*IO«iLSWGG 
GESRRPVQEASSATDTDTNSQEDPADTA 
S VRSLS LS AGHTKH IAFLFDSTLTAFLM 
MGNLSPVQSTGEGKAQRYFDHALTLRNT 
ILFLRHNKDLVAQTAQPDQPNYGFPLDL 
LRCESLLGLDPATCSRVIiNKNYTLLVSM 
APLTNBIRPVSSCTPQHIGPAIPEVSSV 
WFKLYI YHVTGQGPPSLLLSKGTRLRKL 
PDIFQSY\DRLLITSSG\HDPGWPTSH 
VLTMLNDALTHSAVL IQGHGLHGI GETV 
HVPFPFDETELQEDSC\NMGVHKALQIL 
RNRVDLQHLCGYVTMLNASSQLANRKLS 
DASDERGKPDLASGSDV\NGSTESFEMV 
IEEATIDSATKQTSGATTEADWVPLELC 
FGIPIiFSSELNRKVYRKIATHGL\*EKR 
AFKNLLHSSRKLSLQVLNFVHSFQEGAS 
TLDIHTEPSFSSLLSQSSFADMGVPLPA 
KKFN/SLK1VSY 


9929 


23830 


A 


10002 


439 


1135 


MAVDFYNFVTKLWTTGYLRISFLAYKF 
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W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














FSFPFLDSLS/LIiSPRLECSGVIPASLQ 
PLLPGFKRFL/CLSLPSS *GYRRAPPPC 
PALLYF*VETGFHHVGQAGLELLTSGKL 
TRLGLPKCWDYR 


9930 ! 


23831 


A 


10003 


216 


572 


LTLVS IFTAKQSVTLPIICMIiEJyiRKWLL 
NDTSAF/PI I * KVLFPKIFFFFFF* KGV 
LFCPPGGRBGAQFGLTKPSPSGLKEIFIi 
PNPPGGWKQRRVP P FAGYFCFFIKRRVS 
LLWRGLPN 


9931 


23832 


A 


10004 


3 


354 


YSNNPKDRKGETGINEKTNNKMVDLSSY 
TSVI TLNF*ML * /D /KIN* KFVKSP * KK 
PTLCCT J iKTYFKYGIGGLKVKRWKKI *H 
ANTIGKKTLVAILITKQTSELS * YPNKE 
EHYILIKE 


9932 


23833 


A 


10005 


187 


455 


ASIHYSLGVRICIYIPGCIHPGEKSQKC 
NT CGKNSTQKPELVN/ HR * LKGHRCMK * 
NEGGKTFTRVQSL/ ACHKRVYTERKSYK 

c 


9933 


23834 


A 


10006 


134 


456 


IWNLPLGKPCAWKQVTSDLISPDAIFIN 
AAI HSLS FFFFFFETKFLFVPQVGGPWH 
NLN*LKLLPPKL\SDFLVA*VSKKVGIT 
GPPHHPGLVIWGFKKKGGSPMLPR 


9934 


23835 


A 


10007 


210 


451 


CYYTIQTTBQPANDSMLGIKPHVSIIiTL 
NVNNIiNTPPKRLGVADW IK/TQDPAFCC 
LQETSLTCNRTHRFKVKG* /WKKIYH 


9935 


23836 


A 


10008 


169 


462 


YPCWVDMAKNTS YNKCWQGCGE I RILPH 
CRWEYKMASPLWK/SQFLNK*NMELP*D 
LP I PLLD IDPKEWKTGVQTKTCR/ RMF I 
AALFPTSRS 


9936 


23837 


A 


10009 


490 


724 


EMPP * FMEKNFLAPGG\ VTPPGVNTTRV 
NPKTAFDPKRSSSPFLEKSQKPGLIEGL 
THLKTLILNPLKKDESGEIPVLF 


9937 


23838 


A 


10010 


306 


588 


KQFI RAFI EMSWWLTPVIPVLWDTKADG 
SGIRDQPSQHGETPSLLKIEKLAGHGGA 
GL*/SQLLERLRQENHLNPGGGGCSEPR 
SCYCIAAWVTE 


9938 


23839 


A 


1001 1 


473 


53 


ISLGQDSTDISTINIPRMAFKLSSKRKS 
HMS FSLNQK/L/ EM IRLS / EEAC * KPRS 
QEOiGLLSQS* PSCESKFLKETKSATPVN 
TQMI *K*NSLITNK\LKEVLMVWIKDQI 
KHTISLRQS VI * CKALPLFNSMKAERGE 
ESAKC 


9939 


23840 


A 


10012 


184 


455 


FWLSLI KKPLS S VTQKKINRET \ I LLTT 
KKKKKDCNFLEGGLIi IGGS VCRLVYRHY 
ATLYS VFCVDSS KSERGSVDPTQVFVBT 
*DKCFDY 


9940 


23841 


A 


10013 


203 


453 


KEELYISRERKSSTSLLLNQSQK*LIKL 
SEQAMPNANIGDRLKARPLSPNSHLIKT 
KQKFLKE V / KCATP 


9941 


23842 


A 


10014 


237 


483 


TLIRYI ILQGLFS YI PPFFFETESRSVA 
QAGVQW*DLGSLVPGSRHSPSSASQVAG 
TTGFHY/HAWLI FCLFCATGS 


9942 


23843 


A 


10015 


157 


875 


DDPVRGRGEESAMPSGGRRNRSVSSSWV 
GSMAGITTIEAVKRKIQVLQHQADDAEE 
* AEHLQ * EAEGKRWAWEQAEAEVAS VNG 
RIQLVEEEIiDCAQECIATAIiQKLEEAGK 
AADESERDTKVIEIWALKD/E/EKMELQ 
EIQLKEAKHIADEADGKYEEVACKLVII 
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EGDMGCTEERAELAESRC'EMDEQIRLM 
DQNLKCLSAAEEKYSQKEA* CEKERKIL 
TDNLNKSPMR 


9943 


23844 


A 


10016 


579 


30 


CX3ISTKKNKWMIKQNSSELSTCVHNQIT 
LHKI AMATQWG\ KNSFFNK* C /W/BN* V 
ST/CKKIKLSPSVTCTI*KQYLKWIKYL 
NIKI * QTS * IS \NGGKYDIGLGTNFLLQ 
KTKNLMSFS*L*NQMHKEQIIEKWDYIK 
LQNFCTSKEKFSGVKMSPKKW/DEIFEN 
HIFDRS*FSQYINNV*NSTKSE 


9944 


23845 


A 


I00I7 


44 


228 


EFVCRVRGLWW\CWLASQLLWEAEAGG 
SLEPKSLRLQ*AMIM\DCTPPW | 


9945 


23846 


A 


10018 


471 


40 


PGEGRFQQTRFPPFLPPWGPKGGPFFKK 
PKKKKRKEM I KLS * KPRQANQ * LGLLCQ 
/TSQWNAKEKFLKEIKNATPVNTQRVR 
KLNSLIADIETVLVVGTKDQTSHNIPPG 
QSPIQINILAVFHSMKAEQSKEATEEKF 
EARS 


9946 


23847 


A 


10019 


217 


486 


KKIS * PVYLSLYLS I YLSVCLSVYYLS I 
YLSIYIiSIYLCIYLSTIIYYYLSIYLPV 
YLS INLLSI/CLS P IYIiLTYHLSSYLLS 
YLPTYLP 


9947 


23848 


A 


10020 


327 


1245 


TENQQGVLALAALS KIRMMPTRLKWFLN 
IDYTKFCEAFHVFKKRKKKKINKNIKQL 
KMPLTKRIYLAFTEYSTQ*QQNTH\FFL 
SADETHGKIE /HVLGPKTRHNKFKR 


9948 


23849 


A 


10021 


329 


490 


GES PTDNHCRS PKSQVEKVI YTFFETES 
RSVTPAGV*WCNIiGSIiQPPPP\GTSD 


9949 


23850 


A 


10022 


3 


394 


YRVLEVLGYMLNIRFWLGCSSCGQAAQ 
FREGSRCTTACRHPAHTLHAAKMSRRKV 
SSAKCR*RETPKKRSARWSAKPAP\QSE 
TKPKKAAGKDKSSTSS\FSSSSSIiKAEK 
GGKEKQAEVADQETKDLSAE 


9950 


23851 


A 


10023 


448 


477 


SQVFETSLAS *MS KNIL/ WAGTGAHTCN 
PSTLRGQGGQIT*SQVFETSLiAS 


9951 


23852 


A 


10024 


198 


455 ' 


SLKTNRREIPEDFHGQPCGKIjVHSEKPR 
VRMGQTESKYASYLSFIKVILRRGGVRA 
S /TRNLIMLFQTI EQFCLWFPE *gtldl 
KDW 


9952 


23853 


A 


10025 


469 


26 


PPRAVQPPFLPSSSIiHTCVPLCRTVFSA 
TSRPDSLLPALQVSAKMSPVL*SSLLLP 
SLGQGFPMASIAPRL/PHRSLRSCVVAA 
AHSRAELDRHEAFLQCSLGQWPRNDFLF 
S FFRDRVLLCHPGWS TAA* S * LTAASNS 
GAQVMEFQ 


9953 


23854 


A 


10026 


222 


489 


KKKKKKKKFCFI ARVKSNGAI SAPCNLC 
LSGSNYFPASTSLVSGNTGACPLARVNF 
FY*NL\ FLVKMGFHH IGQES I 


9954 


23855 


A 


10027 


313 


2 


QDWE IS KLYSYCHS * VRRL/ 1 FLKIRLS 
LYYILQHSIiNCNIIG*LQFESCYIKLAG 
DKIAFRFLTLKTNHQLGLVAHAYNSSTL 
GGRDGWIT*GQEFKPSLANM 






A 


10028 


49 


388 


T P KJ. RGVFP FKG W FLG VFGAF P FPQKD I 
FYFPRTPMGTRLVS PGKKKKKEV*RVLY 
GLKKAKKI FVLKVKFSHKRGFFGEI QFK 
KLFKW/HLKRNPPPKKKKR 


9956 


23857 


A 


10029 


394 


231 


GRQIASAQEFKTSWGNMAKLCLFI*RDR 
VSPCCPSWS* IPELKQSACLSLPKY*DY 
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X=Un known, *=Stop codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














RC/RNTTPGL* HLLFRCLLI ILPSYLFF 
CKSFKLLYMDI 


9957 


23858 


A 


10030 


265 I 


2 


HDNMRGSDYFPASNGLAVPFKRQVFTIA 
SNFLEKSPPRLAFT* LQVD*ERKWAG\ P 
VAHTCNPSTLGGQGGWITRGQEFKTSIiA 
NMPKP 


9958 


23859 


A 


10031 


235 


413 


KAGKLFFFTE ITFKKGKKAGGGGSFLKP 
. HLFGRPGGGNHRVR I L* KRGFYKMGRW 
LKSLPCDSPPLASQKGGVSRKSPPPRPF 
CPF*RLFP*KKKAFRLFIPKKV+*ALV/ 
SLPFIGEGKTLFFFF*DRALLCRPGWNA 
VAPSGLKLSSCLSIiLRS*DYKHVPPCIiA 
MF*KFF/CRN 


9959 


23860 


A 


10032 


167 


424 


GREFDLRMGHMVKPPLSFFFFPFSETMS 
LSVTQAGVQW*DITALSASWTPVISDSP 
VSASRVAG\IGTTGVRHHTQLIFL 


9960 


23861 


A 


10033 


252 


3 


VEGCIiSSGVQDQPEQHSETPSIiQKKI FF 
\ *LAGHGSMCL* SQLLGRLRWEDHSS PG 
G*SCSEP*SCHCSPAWATE*DPISKKKK 


9961 


23862 


A 


10034 


175 


492 


CPTCPFVWI PLCSEQPVQLYMWHTHTH 
THTHTHTHTHTHA/ HLLFSLSFS FLRQG 
LSLS PTLWRSGM I / CGSTQS QS PRLKQ F 
CSPRYSRG*GWRMA* /DPGGRGCSE 


9962 


23863 


A 


10035 


200 


448 


LYLHLVQLSPLDFPTPIiFRVECPVSLSL 
CLSFS*FP*VCFFFLVSLGLF*GRVLLC 
RPGWSAWQSQLTAD/ SDFPVLKHSSCL 


9963 


23864 


A 


10036 


229 


463 


MILGISKISFSLAITLANFPPTLNLFIF 
LDTGSCSVAQADVQWHNQGSL* /SELLG 
SSDPPTSAS 


9964 


23865 


A 


10037 


164 


422 


GIjSSQIjTDGHTiT.iATiTiRVGGSEVGSREES 
EREEAGREREGERRDSRGG*REREREES 
GAREEREKKEKRER/ERDREEK 


9965 


23866 


A 


10038 


1 


490 


PNQIQNAVLIMISFSICFVSVIFFKSBI 
YNGQPKFKSFFFF*DESCSVAQAGVQ/W 
CG/LQSQQPRPPG\SSNP\PTSASGVAG 
ST 


9966 


23867 


A 


10039 


255 


446 


FFLRQSCYVAQARVQ/ YAIHRCSHSTLQ 
LPALAS SNPSALLAPRVAGTAGMQHHNQ 
L*YFFRIHS 


9967 


23868 


A 


10040 


157 


390 


DHTCPPPSTADYRLS * FLYFLLLLFRRS 
LALPPMLECSGAISAHCTLCVQETRETD 
AAHFKAARTIRAPHPAQLTIAFQVQTHV 
YQDGH/DGSWWCDLRTHPPRS 


9968 


23869 


A 


10041 


290 


30 


TLKKRKEFSSVLCPRGYIVLLLLVHQFL 
FFQTGSCSVTQAGMQ/W/CDQSSLQP*T 
PG\SNNPPVSASQVAGPTGMHHGFLHTE 
YESRGPP 


9969 


23870 


A 


10042 


190 


929 


NHRYISYLTRLVKIKNFANKSWLESLSP 
TLKHS* / W*CK*V*SLWKTVWHYLVNL\ 
KDDLTFDPAI PFLMI YLIEMCAQVHPE I 
YARMFIAVLFE I FKHQ\NNPNKLWYIHT 
RESYKTIKIKE 


9970 


23871 


A 


10043 


457 


724 


HFGRPRQVDCLSSGVQDQPGQHGEVQSV 
LKL\KKLAGHGGAHL*SQLLWRLRHENH 
LNLGGRGCSEPRWCHFTPAWTTEQDSV* 
NNNNNN 


9971 


23872 


A 


10044 


111 


429 


KPDEDTTGRENYRPVFFTNTDANLSKIK 
CMYPCISVH/HVNMYGKIPEMMFT*Y** 
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E=Glutamic Acid, ^Phenylalanine, 
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nucleotide insertion 














*LFLSFFFYEMESGSVPRLECSGVISAH 
CNLSLPASSNPPPGFPR 


9972 


23873 


A 


10045 


30 


220 


LGVL YKKNS VFNKCYWDN * I S I WKRM \G ' 
LDP YLTPHTKINFKWIKDLNI ITKTLI C 
CSWMKTKA 


9973 


23874 


A 


10046 


203 


361 


TYKLLVIFIVIDHF/LFSFLSFFFRDRV 
SFLLPRLECSGAISTHRNL*LPGSSD 


9974 


23875 


A 


10047 


198 


358 


IQRKCFTFFFFFEV* SCFVAQAEVQYCG 
SPG\SSNLPFSASEVSKTIGWILPS 


9975 


23876 


A 


10048 


270 


272 


RKNQRFFKIARKRLNKMTRISPLISIII 
LNVSGLNFPLKRYRIiAEWTKKKK\DPI I 
CCL/QKTHFAG/RDiyRLKIKGWKKIFH 
TNGSQ'QRSEFF 


9976 


23877 


A 


10050 


256 


1 


GVTTNLYFWKNFLNPPFLVKKCENCP I N 
KVFP * KPPLLKKTLFFSKKKFCGV/HPT 
EKFFFFFFESLALSPRLQCNGTILAHCN 
LR 


9977 - 


23878 


A 


10051 


112 


359 


SKVSBEPTENEVEHNLKV*SFILPIiKQY 
SM**QKKNIHIH\ISFYKK*ELTWPGMV 
AHACNPGRLGTEAGRSPEGQEFETSLA 


9978 


23879 


A 


10052 


346 


2 


RVSSYTLITLNLNRLNVALKRYRRLNGF 
FFLKKP*PN\ICCIQKTNLIYKSTYSLK 
VKGWK»IH\HANGLKKQAEVAI \LFISD 
KKDFKSKIVKRDKEGHYVMTKGS IQQ*D 
RTIL 


9979 


23880 


A 


10053 


376 


6 


TRPSQHPDFILLNKCLIjFEATPFLVICY 
SGHRKLRPVAPPIRGSKSQIIPMPPAVL 
PRWHLSCLSFRIPSSFF*LRLPRLCKSR 
/SAWGPVSPHLPQFLFFQAISFFFFL*D 
GILLCHPGWSAVA 


9980 


23881 


A 


10054 


102 


347 


FI^FFVIX5KIiCLEAKIWVLAPAVFIVTR 
VLLLLSTL/RQVELRNIYLCSNVHIHFH 
LYFCIYLSAYILQTMSLY*HL*YHSNV 


9981 


23882 


A 


10055 


251 


51 


ISLVNWLFILENMKLDPYLSPYTKINIR 
/WQ*DLNVKNGTTKVLEEN*GTIYVMVG 
WKRPILACQQPKS 


9982 


23883 


A 


10056 


169 


342 


SQKQYSTCQNVFCFVLFLETRSHLV\TT 
LECSGAIMAYCRLDLPG*RHPPTSAS*V 
DG 


9983 


23884 


A 


10057 


1 


586 


AAARPAQGKARPGLLLRRGRWELRAGF 
LTAKGAFRWWLSQKHVKMSYPPQSROGC 
GGMGAAGPPSLTVHQELCGAYS PDGTTE 
ALPLASWPRARPS PAKAPAYDTAKLPAI* 
IGCGSRRPPGVNPGASSLKPGACVSEGA 
GPTGTLESAGSRPPTPIiPPPV/CCPGPP 
ARADH*FCHVTPQCR* /PPPRPSP 


9984 


23885 


A 


10058 


393 


43 


HARPGAECERPAEGEAARADAFSGRHAC 
PHSLYCRL*RGFFVHKLHLNKA/RLIKR 
RKADNI KCWQRCGAPGTLMHC * WD YETM 
GLLEQTVFS FEIRSHCAAQAGVQRNNHG 
SLQEF 


9985 


23886 


A 


10059 


228 


466 


GLLHGPPIYDLPEPVSTTPSLYHPPIVP 
* ASPLGPSAILTLPFHS PPPA\PLFS PP 
HSVLGSSGPRSLPT/SPTHLPTLTPH 


9986 


23887 


A 


10060 


293 


9 


DILQFYTFSPIQ\CPVYILHIi*YISLQR 
L*FHQKYLIGIYFIETWSCSVARLECSG 
RITPRCWLNFPGPTYPPTSASGVAGTTG 
VCDFLGLPWSR 
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Amino acid sequence (A=Alanine 
C=Cysteine, 0=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Grycine, ENHistidine, Msoleucine, 
K=Lysine, LHL«uane,M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T-Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


9987 


23888 


A 


10061 


199 


428 j 


TAYCRTLPLKKSNPTISGSFFFFLETGP 

RS VPG* ragg\dlmahcs fdlpgsrds p 

ASPRQWGATGPCHHPWLIFE I 


9988 


23889 


A 


10062 


191 


2281 


LQTRGKLFKYKEKGICTKLHRSEIIIGK 
QWKEAVCPPTGQAQSQGGLDPKSRKAIjR 

gghgfvpgggvcghqel*lhsdglkavh 
pagsepsdpggvqaaacqdaatggtplq 
pmpetargc/tatas sreagpgmdp/ re 
a/gpeawqlgkarsswggsstsggpawv 
firdtpswwgi pgtvsppagprcppptr 
pqweqlgsgrpg*mqttaeetnaaprag 
qertpsrgpsadpgvsqhvlshrrscpk 
cerrsvppng\agptlwtelhpsdasvs 
altpl*gdvevgpyg\cegyikpgrrth 
qhnwldldlrpaas rtcrkmlelprv* f 
llrlpelaetysstppacaqrprgtggg 
/ ppqrcvlmsgshqggas * kgrdapslg 

SGLERPREGGPGWIiS /RQPSPHAVQPCK 
HMLCLGPAF+RGRGARGVPVPVRASHFP 
FHGAAWWPGDHMSQP * WQMLSRPARGHR 
PSWQVRRQCTVLGALKPGLPDLKCPGPT 
AFLPG*GVS/EENGQIPNVSTQLYLQNL 
PIiPREQN*RRFPRDLSSGGAGPGTG* CG 
GGGGHALS PTPSRRLRAHS KDNSWAQPL 
CCWMPAASGC\PGIGCNGVPPVAAS*QA 
AA*TPPWSEGSLPAG\*QPSAHRCVTRV 
PGGRI PHLLAQAVPPDDRAFPNCQASGQ 
LLGVHSRPSLSAAGCRQPLAVSGIGCNG 
VPFVAAS 


9989 


23890 


A 


10063 


65 


453 


ISREFLFLETKSLYIMLKGS IQQEDI I F 
VNIYTLNTGAPRY/ 1 K* ILLELKRQIES 
DTVIPEDDNTPLSALNRPSRQK/VDLIC 
TIEQMDIilDIYRTFHPTASEYLFFSAHG 
SFSKKYYMLGHKICLETFLN 


9990 


23891 


A 


10064 


100 


466 


IjGILPLQSRPSGPGGQCTPTLPFWVSLP 
LPVRGCFIiHLCWCLPYSGALGS PCHTGL 
VCWHIPCF*CCGVSP*R\SCLEVHLYTH 
THTHTHTHTHTRFVLM*MRTPLRKGKRQ 
VMRLCGGGGRA 


9991 


23892 


A 


10065 


226 


377 


GIiAGSPCISGSVSFCSGCIY\RGHGIMF 
ICNDCKVFRFCKS K*NCFKEHNP 


9992 


23893 


A 


10066 


292 


458 


KvSS ICLKMSFLFIiFFFRTGSHCH* kys 
S ICLKMSFLFLFFFRTGSH/ SSPRLECS 
GMI IAHCSLKLLDSSDLTASAS * VAG 


9993 


23894 


A 


10067 


252 


36 


TGFNPLHFIYYYYYYYY\FESKSCSHAQ 
AGVQCCDHGSLKPQP\PSSSHPPTSAP* 
VAGSRNAHQANSTAWPMVS 


9994 


23895 


A 


10068 


56 


481 


G I LKNCHNLMIKLERMRSCVLMGEQRK/ 
WIY*DESTPGEDTVNTVBITINDLEYFI 

NLVEKTVA/ gvetvdsnfergptvgkm\ 

I VCYRE I FHEKKS PSMQQL\ YF* KFE^P 
LQPSAATTLVS*QPS\SKQDPLSARRLQ 


9995 


23896 


A 


10069 


263 


435 


hfsflsfff*kteshsvt/rvegrggis 
vdcklcrpgsrds fasasrvpgtrgisq 
grg 


9996 


23897 


A 


10070 


116 


474 


RGAGAWGVSGRSS PPLSALP IQP PPHLS 
TPPPIiETP/PPVIiH*PLKDLGGWGALIiA 
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C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K^Lysine, h=heudne, M=Metfa ionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
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W=Tryptophan, Y=Tyrosrne, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V-possible 
nucleotide insertion 














Q*RLPG*HSPKKKKKKKKAPRPPYK*GP 
PKTPVDPPDGPFLDPPLFGWPPPP\VFL 
PGERPPPL 


9997 


23898 


A 


10071 


159 


377 


KSHMPLTLNQKLEMIKLSBE/GLMSTAE 
IGQKL/GLLLPNSQWNAKEKFLKEMKS 
TS PGNT * TI I KW PGAVADA 


9998 


23899 


A 


10072 


180 


473 


AIEIQLLSRPLCLWPSGRFPDAQVANIR 
VRVAV*GNPLR * VGPAEPGEQRPSPWGF 
PSIS/WDISPTWSSSASPPGLSADCKFT 
CHYRCRALVCLDCCG 


9999 


23900 


A 


10073 


98 


440 


GQLNKLSGPYPENVGYTLPVLINPLAQP 
VTYSTIFAGTLITALSSH*FFT*VGLEI 
NMLAF I PVTjTKK\ INPRSTEAAI KYFLT 
QATAS I ILIiIAILFNNILSGQKKKKKGR 
PF 


10000 


23901 


A 


10074 


358 


404 


PCLYTYHLHNSNST*SIiRLVSIMIEIVT 
SNLHRTVPHSYTPPTQLS INLVMAIHLS 
AGAWI ISYPSMIKNALVHFLPQGSPTPL 
IPILVIIETISLIiIQXPRPIiAVRLTCNI 
TAGHLLMHIiIGSATIiAIST INLASTL I I 
FTILILLT 


10001 


23902 


A 


10076 


3 


419 


KWRKKMWCLQKMEYYSS * KGKKLLSHVT 
IWMDPEEVMLS/EIS/SVTEGQILFNST 
YV 


10002 


23903 


A 


10077 


335 


414 


DKSPF/D*RFFLPPNSNGFLVKMPPSVK 
RIiPFFFIFPFSFFFFETESRSVT\RTjEC 
SGTGLAHCNLRLPGSRDS 


10003 


23904 


A 


10078 


279 


1 


kdtqlpfiqfffetgscsvpeagvqwcd 
hsslqpppp\gssgssplsl*vagtigm 
lifkifsrnplnfffflrwslalspdws 
avars wft 


10004 


23905 


A 


10079 


397 


3 


KSSS LFQNHLWANSST YSHL I PTLGY I L 
KHGNQFDPDTL/RDKM*NFFCTIVCPHY 
ELPSLEQW/VSLGSLNYDTILPLDLFCK 
RQGKWSEIPYAQDFMTLYQNLTICQT/ P 
QNPPPPPKESSKVELDIIDDPPH 


10005 


23906 


A 


10080 


281 


1 


I FLGEWGPFFPPQKKSFFPKIPQWVFFT 
PPYRKKI FFFLPR*NWA/HPKI FFKRPP 
PFFFFFFFFFFFGFKENCFILLPPKEHI 
TQAPAWGFAG 


10006 


23907 


A 


10081 


207 


35 


QENRRC* RGRGES /GTLIHGWRECQTVQ 
PLWKTVWQLLKRLNTEFPYDLAILIiLGE 
FH 


10007 


23908 


A 


10082 


2 


408 


IAPLHSSLGDRARLRLKKKKKKGKIRPL 
LALFFKNQNPSFKKNSHSFYGALMEP\ P 
PPHIPPFFFWKVQPPLFPFRGTIiEFPGP 
IGTCKLQTLGSPG*WDPPCPLPHQSPGR 
GGLIPGRKGEDPPSFRGLPPPHF 


10008 


23909 


A 


10083 


195 


2 


ECKQRPQLEMVSFRRRSHEPFFFETGPR 
SVAQAGMQWH/DIiSSLKPPAPG\SGDLP 
TSAS*VAGTI 


10009 


23910 


A 


10084 


219 


1 


RLKI KLLKI I ITTT IDSGICS FKS CKL * 
H/ J^IQXWWIJjKIiAHAxNPSTSGGRGRQ 
IT*GQEFKTSLGNMVKP 


10010 


23911 


A 


10085 


364 


413 


KKKGGEKGPLLKKER* KTNGQ FLVHTNF 
/RLPGLKYFF/CPPPPSKWGLRAPPPKP 
GDFFFFFFFFLVFLVETGFHHVGQAGLE 
LLTS * PI PLGFPKCWDYRRE 
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OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
OGIycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
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X=Un known, *=Stop codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1001 1 


23912 


A 


10086 


200 


3 


INF ILLLFI FVLLI FETGPCS IAQATVQ 
WRDHGTLQPQPLGLKQAIIiP / ASAP*VP 
RITDVHHHARLN 


10012 


23913 


A 


10087 


154 


329 


FSLSDFKT YYKAAGHRGSCL * YQHFGKP 
\RRTDPLRPAVQDQPDQRGETPSLLKKK 
KKK 


10013 


23914 


A 


10088 


157 


390 


NSSSFFPPSOGAQ/KSEVKKSSGPHSSS 
SSG*RGE/CLPLLVSGGSWHFLAYGSLT 
PVSACVFT*PFSMSLCVYIQISVS 


10014 


23915 


A 


10089 


305 


1 


KKKRAQSLPGQKGGRGFSQPVTAGDRSF 
RPKIRKEALGLGQLGLTDLCRTFPPTAT 
G*TOFSSAPFPG\yPPV*PTKKKNTNIiN 
♦FTRIKIISRIFSGHEK 


10015 


23916 


A 


10090 


329 


2 


RGFFQPLGFWDPGPISNIjGGPPKNLLIiK 
AEIMGYNFCVPPFPPGGIiPTTHFFFPPR 
NPPF/CFPFTPSLGFFSPNPPGFFFFFF 
FFLSRVSIiCHLGWSAVA*SWLSAAS 


10016 


23917 


A 


10091 


84 


408 


EKKGENKLFWGFLIiLGTLFIGGYRFKI I 
FCLVGYFIFIYLFGEME/FSLLLPGL*C 
NGTCSVHHNLPLPGASGSPASTSGEGGI 
TGMRRHPRLY* I YLGRWSFALVARAVMQ 
WHVLGSPQPSPAWCKRFSCLNLRRGWYY 
RHAPAPPSLLNIQKI I LGFWLAPVY 


10017 


23918 


A 


10092 


184 


359 


FQINS SVTRRSFVljAAQAGVQWRDLGS P 
LLPPP\GSSDSPASAS *VAGITGLHVGL 
SHK 


10018 


23919 


A 


10093 


298 


386 


KLCEK* * ITPIRMAI I * KKREGENS \KC 
W*GCGEIGTLVYCL*EYKMVQPLWKIW 
QFLKILGIEL 


10019 


23920 


A 


10094 


310 


1 


I CMNEMILYRQDFGRTI I IRKI VI YFWI 
FN*FGH* IRlY*NIiLRYKTOCL*RG* * I 
NLPQRSFKNLRLKI * * W/WPGTVAHAYN 
PNTSGGRGGRIT* DREFETS 


10020 


23921 


A 


10095 


40 


407 


EHTHTTACEPGGI* IRPVD*MVLISWL* 
HGTITCKMVTLGETGQKPQGISLIIIFF 
FFG\TQSLFVPRNGVHCRGPISAR* KLC 
LPGPPLSPT * AS * VAETTGACYCTWFEN 
NFFLRLPLHCY 


10021 


23922 


A 


10096 


189 


2 


PPFFFFSNLFKKS FYEQGLALS PRLKCR 
GIIIA/HCS/L*TPG/YK*SSCLSLP\S 
SWDDRCAPPRV 


10022 


23923 


A 


10097 


260 


401 


DHIiRSGVQD*DGQHGETSSLLKVQKLDG 
RGGLQL I P \RRLMHENHLN 


10023 


23924 


A 


10098 


187 


464 


WYMTRCGSLGPSSAAHESHHPPPQQLMN 
P\PSPSSAAHESHHPPPQQLMNPTIPLL 
SS*IPPSPSSAHESPGRVGRPEFTGGAH 
GYLHLSPP 


10024 


23925 


A 


10099 


397 


1 


FFGGAPCPPPRRKGKTPPPPRKLFPPFF 
PKKPPLPKKNC^NKNPLLGPKKKNKRPP 
PGRFFKKGFF*NPP KAPLTNFWEKKKKK 
\SPFFKGVFFPPGEPKKPPPFFFFFFFF 
FFFFPTRPPRVRPRVRPRV 


10025 


23926 


A 


10100 


253 


3 


NAINRNKMQ*TKTSEWKVYADHNATIKQ 
KQKRKCQVL/ WRGCGETETLYACWW5CQ 
MEQPAPVEN/W*HFLENIKIQLPYDPAr 
HE 


10026 


23927 


A 


10101 


121 


404 


VLASWS IFFFFFFWKGGLIFSPRLEGG 
G/ LILG* LKPPLSGLGPFS CLTLPEG WE 
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OCysteine, D=Aspartic Acid, 
EXJJutaraic Acid, ^Phenylalanine, 
(^Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, L=Leucine, M==Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Argmine, S=Serine, 
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W^Tryptophan, Y^Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














LKPPPPDPGFFFFF*GKTGFPRGGQEGL 
NFLTWEPPPLAL 


10027 


23928 


A 


10102 


468 


1829 


I RDTYTQNPDKAKGYFSCI FFE I EETTM " 
SRKQNQKDSSGFI FDLQSNTVLAQGGAF 
ENMKEKINAVRAIVPNKSNNE I ILVLQH 
FDNCVDKTVQAFMEGSASEVLKEWTVTG 
KKKNKKKKNKPKPAABPSNGIPDSSKSV 
S IQEEQSAPSSEKGGMNGYHVNGAINDT 
ESVDShS EGLETLS IDARELEDPESAMIj 
DMLDRTGSMLQNGVSDFETKSLTMHSIH 
NSQQPRNAAKSLSRPTTETQFSNMGMED 
V/RPRHQ* KS * VP I FEKSVKDLQRCTVS 
LARYRVWKEEMDAS I KKMKQAFAELET 
LINGSRKWALLAEMDKVKAEAMEFLLSR 
QKKAEIjLKKMTHVAVQMSEQQLVELRAD 
IKHFVSERKYDEDLGRVARFTCDVETLK 
KS IDS FGQVSHPKNS YSTRS RCS S VTS V 
S LSRPSDASAASSSTCASPPSHTSANKK 
NFAPGVY 


1 00? J? 




71 
*\ 


10103 


254 


2 


KKKDFPLTPLGFGGLtKIQLFFFLQKKKN" 
FYFPP* IFPKI FFFFFFWFGTQSCPVT\ 
RLECSDRI PAHYNLHLQSSSHSRASVSH 
E 


10029 




TV 


1U104 


256 


419 


KVIELHTGR/ CAVGQHVGRPRRVDCIiSS 
GIGDQPEQHGEPHLY*KNAQSSQAWW 


10030 


23931 


A 


10105 


153 


439 


TTVTSLCIYYTIYYYFRLLSTHAYTHIY 
THIYV*KLTIKQPQAGPSEGF*KKEF* * 
RRW/WVGTVZVHTCNPSTLRCQGGWriT*G 
QEFETSLANMVKL 


10031 | 


23932 


A 


10106 


424 


157 


SSHLGRPRWVDHLSPGVPDQPWQHGETP 
SL/LKLAGGACIi * SQLIGRM/RLSSGGR 
GCSEP*SHHCIPVWETEQDPISKNKQTI 
FRMFKN 


10032 


23933 


A 


10107 


213 


3 


NYKCPPTPNIiNTLKALYFKKLILSPKA* 
AIKEKID*/IIKIKHFYASNIINRMKRQ 
I/TEWKKIFANHVSDKG 


10033 


23934 


A 


10108 


300 


1 


TPKQRGKKGSPREKPFPVNLPGKTLF/Y " 
EV* FMNPLK*QKKKNFQGFFFFFI*IOGT 
VYHQFTSK* ERGVPGTVAHTCNP S TLGG 
RGGWIT*DQEFETSLG 


10034 


23935 


A 


10109 


120 


421 


KKIKKKNKNHMI I \S IDAE+ AFDKIQHR 
rmx is. 1 JjW JUjuPEGKHI/ KIMKAI YGRGG 
QDGQLHTAKK/ EPLPLRENKISGKP 


10035 


23936 


A 


10110 


102 


418 


tlvcvcvcvcvcvcvslsvflslslcls 
qagvqwcdhsslqprpsg\sgds ptias 
wggitgvhchiwptfyflflirlkslm 
3j* *tssemaltkmgvhshglfn 


~T6036 


23937 


A 


10111 


382 


2 


VNNATMPSKCSTBKSHMSLTSNQKLEMI 
KLSEKGMLKAKIGLKLGLLC/QVSQWN 
AKKKF+REI *NPTPVNIR/DMQKVLV/V 
KTEDQTSHNI PLNQ I LIQNKSLTLFNS I 
KAfc*Kb£*EAA* EKCEDSRG 


10037 


23938 


A 


10112 


61 


427 


ENTI YQVQCS I KEEYLQRRTLKYAS LFQ 
KHICGSLAFLHLL*PKLYHSRSNAESDW 
I IQLFSVQPNVKEISKHKIGW*RGHACN 
/RQHFERPRHKDHLRLGVRDHPCQHGET 
PSLLSLLKIHI 


10038 


23939 


A 


10113 


71 


399 


NVLGKEARSVGWEAVRGQISRVSIQMKW | 
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KGLIKGEKLKMEKSQACICIYFFFFETE 
TNFIAQAGGQGHDLGFMEPPSPG\*SDS 
PA* APHEQNGL/TGMHHNPRVI FV 


10039 


23940 


A 


10114 


362 


1 


SKKKS WGSHSGS KCHTS FLS YPTKEQKS 
YLSLFYKCNLQSIiTFKELLDHFRFVCLP 
TIY*NKNWP/RDLPLKPSFKTLML*NQT 
L/WPGWAHACNPSILGGQGGWITRGQE 
LKTIKANTVK 


10040 


23941 


A 


10115 


314 


3 


wrerrcgahlmrcasnmahdkpsrmiinl 
*kpkmat/rntdhdnfw*ghgeteilnh 
ywwdykim*plwkti*qfpkmfniklpy 

VSYI QLLGIYPRNIjKINIHTKT 


10041 


23942 


A 


10116 


44 


394 


PGFQRVSQDGLDLLTS * SAHLGLPKCWD 
YRHEPLRPAGIYSYPAVLFVLSTYRASN 
IT*MLMTPKPTSG\YSFLLSFRFSSFSY 
LRDISAYMSLSCLKLNVFKSKPIIFLSP 
LWLLV 


10042 


23943 


A 


10117 


398 


3 


MESCCFMGIEFVLKEEVLEIGCRTM* I Y 
LHY*SIiHLRMTOMVTSIJXOTYHN*TIF 
KKSS VFLYAGNKQL * I * ILKNYIYVMIP 
FTITSRNMKRNNLTKDGKDMSTETYKTQ 
LKEITD\*NKWRDTTFMHQN 


10043 


23944 


A 


10118 


104 


386 


KVPWATCGVKMHRLAGHGGWRIiWTRLP 
WRLRRDNRIiSSGSRGA\AAEILPtjCHCT 
PTWVTEHDSITYKY*KKGAPCWRASS*C 
TJjSGHALFFFA 


10044 


23945 


A 


10119 


258 


401 


YNYPFLTCVMKEKIGAGMAAHACNPSTP 
GGRGG* \TRGQHLETSPTNMV 


10045 


23946 


A 


10120 


310 


384 


GVFPFLTFLYGGIKGGPPPPRFFLF*FA 
FLFFFF /ETGSHS VAQANCSGS VSAHRS 
FHLPGPGDPPTSAS * VTGTTGICHHS 


10046 


23947 


A 


10121 


245 


479 


TSSLYRKIQKRFLYMLALEVLKICNTYF 
MNTERFPLKLFGQFS /FRI ISI * FFPFL 
FPKI YHYFLRDRV* LCHPGWS AV 


10047 


23948 


A 


10122 


268 


469 


NSVFWIiGWAHA/W*VNPSTLGGRGGWI 
TRGQELETTPANILPP 


10048 


23949 


A 


10123 


221 


423 


GQAGAVAHACNPNALGRPRREDSLRPGV 
GDQPGQHSKTPSL/LKSDNF+KMGWLGP 
WAWWLTPIIPALW 


10049 


23950 


A 


10124 


73 


183 


AGGGWALTNTVINPT\PPSTPFTPCLSY 
SCTISTASAFRFPG\STSTAVLTHVRIL 
KSTPDAPHPPLKIPRARRNIQRDHTLSS 
NLiFYYNHQLNVTEELTYYDLT * INPTAT 
IHAIHTMPLLFLHNLNCIS I 


10050 


23951 


A 


10125 


342 


2 


PNLGPCPPPRGTKGGPFFPSFHPSI \XG 
PPNFFSF*RP*FF*MLKSLGVP /SPTGP 
IPKFPFFPGFKNYFSPPL\PKGPLLKPP 
RALFFFFFFNRHRVLL\CPGWS PTAGLK 
RSSC 


10051 


23952 


A 


10126 


85 


442 


KKVINIKCW*RVWRTGSLIHCSWG* KMM 
PHWRAVWQSFR/SLSMYLSYEPALPLPG 
SYPRALRTTCVCTHSRCFSLLELPQA/W 
NH 


10052 


23953 


A 


10127 


180 


484 


KSFFFFSQSAQPIKVIPAP*NFHFFGVK 
RSSCLSFLRKWGPRWVPPPPFFFFFFFL 
IFVEMRSHYAAQAGLKLLASS\VPPPSG 
IPRTLD*SYPD 


10053 


23954 


A 


10128 


376 


1 


WS PGAQLLF I S CL YYNLWSLHTGRTPAK 
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PP*GWTLQKTTPIiSCPAP*LPPPLPSPL 
QQIDIiKEBCLVFTAPACSL/PHSPNSSPW 
PQRRWPS VS / P/ PVAHPAAPRLPPPP VC 
ARPLWLPEAPLLGI PPH 


10054 


23955 


A 


10129 


140 


454 


CLIRSSINSVHNNIYIHSTPDRHLGCFQ 
FLWTKNAAVSRLAYVF* * I * I PTYDHI 
HTHTPAGY I PRHGIAGSQLTNIFNFSS * 
IjSSNSF\QVGCTSYFLFPSHI 


10055 


23956 


A 


10130 


340 


473 


AGARHSVIiRPRPPItAGSSCPSS/CGGHC 
VALGVSCPGGGCRAACVPGCCCLLCGLP 
SDVCVC/SVCWGGSMGSRLGGWAAPGF* 
GNSREGPGNCS SGRKAGLAGGGLGPAWA 
EQAPDILCCGPGPHWLGAPAQVPLAPPL 
SPPPCLYGCLYSDRGDL 


10056 


23957 


A 


10131 


280 


2 


DSIiALLPRLEYNGVNVAHCKLKSSWIRR 
SSHLSLPKHLDYRSMSH\LPGLEIF*YS 
YCK*CLLTSPYFEFLMNSLEDISSCFIP 
IRVMPKCRMK 


10057 


23958 


A 


10132 


244 


3 


IISTDLISHSWTLYLISSIiVYVFLKQHS 
LAVLPRLDCSGMIIAHC/KLQTPGLKR/ 
FL/CIiSLPST*DYRHTPPNRANFYYYYF 


10058 


23959 • 


A 


10133 


272 


3 


VGKPRVFPYR IAFNFGRVTS I LGPH IKK 
YHFQCRGGKHCFF* KAPPFFFF\FNETG 
FCSVPRLECSGTI IALCSLNLQGSCNSP 
TSVSQV 


10059 


23960 


A 


10134 


217 


3 


FPRFRPLVSPALTEIiGQF/ HFLTRGL* K 
KKPFPFLFFFFSETESCSIA\RLWGSGM 
ISAHCNLCLPGSSRTRG 


10060 


23961 


A 


10135 


229 


3 


KNDYLKCWQGCGRTETQMLGCWDIK/LI 
QP/LWKI I WQFLLKLNMHLLCNPTLPLL 
GIYP* EMNVYVHTQTCICTDAW 


10061 


23962 


A 


10136 


84 


352 


RREFKTGLATWQNIjVSTKKYKTTVRYHL 
TSTRMDI I KGKR * QVCW/RKC /GDKGTL 
LDC/WMRMLVQPIWKIV*VWKFCRK*KL 
NLFFPPPPQ 


10062 


23963 


A 


10137 


280 


2 


KCWVPEERI LREKSQHLGKI KQEGRLN 
AWLIQPGQHGETLSLQKI / QLAGHGGA 
PLRSQLHRRMRREDHLSPGVRSRSEP* * 
LSLHWVTEQ 


10063 


23964 


A 


10138 


2 


386 


HIQEKQLPNKDSTLNPYFLLIS ILKRKD 
EKIQMPSKHMKRCSTSLVIREMQIKIRS 
HFPH * VGKI LRS ANI KRW * MCRKRG\TL 
ITCW*TYK*TYKLVKTF*RIILQHPLLK 
THQS YYPAI PHLKETQA 


10064 


23965 


A 


10139 


229 


I 


TRS YHJSSLTILHTNHHLNCS PFPQAITP 
*SRFPYSNRA\WLGSVAHTYNPSTLGGR 
GRRNA*GQEFKTSbGNLPK 


10065 


23966 


A 


10140 


261 


429 


LSHLFHVALI*LGLVW/SFVFVFFETGS 
LPRLECSGVITAHCSLDLLGLSGLPTSA 
F 


10066 


23967 


A 


10141 


377 


34 


WVFGSSGPPPPGFKKFFSPPPPKNKNPR 
GPPPPPIFFFFFCKRGVSPLFLPPGGGP 

KIFSRPPPPGGKQNFFFLKKKKKKKEHQ 
IKR 


10067 


23968 


A 


10142 


208 


424 


FFSLFMQTFHIHIRFYFFLAGGGGAGVD 
RVS/LLSPRLECRDTISVHCYIHLPGSK 
QFSCLSLI*SN*NYRHAP 
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10068 


23969 


A 


10143 


i 


297 


PTRSPQVSNLNHKSFISFYLFIDtiliSVC 
LSVSVYFETSLTLLPRLECSGAITTHLS 
LLSS*GHRHAPPCQV\FFVBTRFHNVSQ 
SGIELLSSSHLPTPA 


10069 


23970 


A 


10144 


185 


488 


SGFPRGTELIGAVCVCVCVCACVCVCIK 
LGYI*NHKRIY I *LLYIYSTYYIFNLSL 
*LSVY\SYVNENSM*DRSQSI+IFILPK 
LKTCLENTHTLKNTHEGG 


10070 


23971 


A 


10145 


283 


3 


LGWPGRKADGSGKLTIDYGRLDKLVSP \ 
I * SAI SDMASTTKAVLQAQRDQYSVLDL 
AHAFFSIR\NQSQFSFIRDSPQYAFTLP 
SEEHLNSLDGQ 


10071 


23972 


A 


10146 


146 


380 


CFLWDNFFFFI FTEKTGSCS IAQAGAGT 
Q/WVGIISYCSLKLMGSRDPPTSLSCVA 
GTTGMPHHAQVI YFFIFL* RQGL 


10072 


23973 


A 


10147 


275 


3 


I KGMLKAKTG * KLGLLCQ / TSQVENAKE 
EFLKE IKSATPVNT/ RMIRKRNCLIAEM 
EKV*VTIiIDQTGHNIPIjSQSLPQSKCLT 
LFNSKKAER 


10073 


23974 


A 


10148 


281 


2 


KI FLPQP PQEMVFQGFAPPGG* IFLVF* 
♦KQPFPPLGRFVFK/LPGGIPFFFFFFF 
FFFFFETESHAIT\RLECSGTISAHCHL 
RIiSGSSDPTRP 


10074 


23975 


A 


10149 


413 


139 


S LRVWP I FGLLQANKKKFFW ILFT*S / IR 
PLFFFSRERGGTPLFPPFFLRNPGKKSF 
PPG* PPPPLKIQFPGKYFFTSRPAPFFP 
LFCQPPGKNLNYWGLGPGFPKIFPPPPL 
FFFFF*DRVSLCRSGWSKVARS*LTWGK 
AFFTWVS 


10075 


23976 


A 


10150 


100 


431 


GPRLTDHLRSGVPNHPGQHGETSFLLI/ 
HNYSGG*GRKIANRLNPKGGGCSEPKSC 
HCTPSWGKKRNFVSKKKKKKRE 


10076 


23977 


A 


10151 


134 


417 


GLAAPLVG/ WGKRNPPP*PFG*GEKTGW 
GPHPGKI FLKKKLLFFFFF* KGGLS FPP 
GGRERAHFWLKKTPPSRKKEILPPPPPR 
RGGGGPPPPPFL 


10077 


23978 


A 


10152 


403 


2 


LAFPILQEVPQCFCNLLHSLRMYSSTGS 
KLFPLAS PQPHSLLYHLTLARAPNT LEE 
LQSPNTALLLKYL*EAALKKKSR*HITS 
SIi*KVDLSFFLETKPRSIA\RLKCSGTI 
SAHCSLRLLGPSNSCATASQVAG 


10078 


23979 


A 


10153 


382 


297 


GLN* BSWSGEKGTL/ LHCWWECKFVQPL 
WKMV*RL 


10079 


23980 


A 


10154 


209 


15 


SSPKF*FFFLSPKKIPLPFF*PIFFFTK 
TP/HFFF/SFFFFVCVSGSHSVPRLECS 
GAITAHCSSTS 


10080 


23981 


A 


10155 


153 


422 


RTSGKNGNRGQI ILVTMVTNWLFS FFFF 
PLKTGPHFVIQAGGQGRAHGSLQA* IFG 
\SGDLLAIiVSLQAGTAGFHHHAQLSFFF 
KKTGLF 


10081 


23982 


A 


10156 


257 


3 


RRLQNSKDCCLFFPLEAPRGATARCQPE 

LS cmrclsa\llggvs qsgykgvrdphk 

EAV* PLAELKR* AGRSSAPFKAVRQGCL 
S 


10082 


23983 


A 


10157 


37 


386 


DATRFR I NSVGTAEGGLDRIEPNDLQ * Y 
INLVGTAVRRPDRIESNFESSTVGKKLS 
NNI VCYRS FFCERESQLTQQTSMLYYFK 
KLPQIiPQPSAITTLH/SRQDPPSARRLQ 
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LGQ 


10083 


23984 


A 


10158 


407 


272 


HKGGKRERKRKKKE KKEGGRE KGQERRK "" 
ERE/RERKKGRKREMKKVNRYSKCTQ* * 
LLSN 


10084 


23985 


A 


10159 


248 


478 


VKWHRLLLDPSVEDPWKKSLKGPtiQP* 
/RI>* I FFF*NSGLLCHPSWRAVAQSRFT 
GVSTSQAQAILPSFSLPSSWDR 


10085 


23986 


A 


10160 


458 


129 


GFSVLNRLEDEDLRQENLLMPAV*DQPG 
QHSKTPISTKNNMNPAGHGGVYP+SQLL 
RTVRPEDHLY/SNKPCSYLCAPAWATET 
LS LKNAFS C CSNHYGLS FQTS FHR I LMF 


10086 


23987 


A 


10161 


82 


291 


YSKTKSLFFAGHGGTCIYSQLFRRQVQV 
RQEDHANPGV*GCS /DTMIMPMNSHCTP 
AWAA* * DPVSKKKKK j 


10087 


23988 


A 


10162 


6 


409 


LRINI LPE P I SLS PLTAPAVFMPWTNYS 
SCOTFSLASMTLLPLHFLSTSLALPWSL 
FIAPSLFQISKHQLYLSPVLLTLSSCYF 
LNPCFRNHILMNLGRPRQANYLRSGV*D 
QPGQRGETPSL/LKNTKITWP9TO 


10088 


23989 


A 


10163 


249 


415 


APPECSLPLLVTiTJ.LTjTJJ.LFLRWEDHL 
GPGG*GCSEPKSYHCTSAWVTR\GDPVS 


10089 


23990 


A 


10164 


I 


405 


RSEWRLRQPERQSETVKKKKKGFFFFSR 
GGGGGKIFGLMEPSPSGLGQFFLFTPLE 
NGGYRPPPLGGPFFFFFKKGGFPLCGPG 
GFEFPALGTPPLLFPKGLNFRGGPTPPA 
RFFFFWGPVFSVFS PKN*NPPVFRGKNG 
KNRPPKKXK^GRG\WGPPRKFNPFGNKR 
GG VPRAGNS KP PGP QRGNP P FLKKKKKG 
PPRGGGR* PPFSRGVNKKNCPKPEGEGS 
INPKI FPPPPPREKKKKPFFFFLTVSLC 
RSGWRNLHSL 


10090 


23991 


A 


10165 


175 


404 


AI KMVW I QAVPS QCLHGVGF YLPRS VCV 
KTP*NKRHRQGVVAHGCNPSTLGGRGGW 
I \RGQE FETS LTNMVKP 


10091 


23992 


A 


10166 


317 


2 


PFKNSPLLLNEMTNIHFLEPTCNYVNVS 
PFLK*HKFLFF*EMESCSVT\RLECSVT 
I IAHCCLKLLGTSNPPTSASWAPGMI IG 
MRHCPAP KVCSHAS AHAS AHAS 


10092 


23993 


A 


10167 


325 


2 


KMGEKQGHFIKEEIRKANKHIERCSRLL 
AIEEM* IBCATVTI/RKHQTKC*QG*RES 
GSLVHG * * KHKTVHPLWKILS VS YKTKH 
S 1 1 1 IGYSS VLLGTVLVHSGSDAW 


10093 


23994 


A 


10168 


245 


3 


GEP FLFCFP I KKPKVFFCPYFNQAVFFL 
ETHRGFFEFFFLF*KKKKKS/LGMVAHT 
YNPSTIiGSRGGQIA*AQEFKTILGNMS 


10094 


23995 


A 


10169 


164 


I 


MNKMSQYTFPYKN*QWGG\SVAHPYNPS 
TLGGQAMQIT*AKEFETSLSNMAKPC 


10095 


23996 


A 


10170 


214 


419 


LS * PPPPFPGFKKFNRVLRFYGKSKPGP 
/FMFVALFQKGGPFYQNIPPPVKLEPFP 
PT I LRVSFFFL* DGVWLCHPGWNAVA* S 
QLTASSTSRVH 


10096 


23997 


A 


10171 


168 


1 


WTWWLFPGFSITNNYCSHFSSLHLAIW/ 
LGTVAHACNPSTLGG * GARGQE FETS LT 


10097 


23998 


A 


10172 


139 


3 


PAH/CES*PPGLKRSSYVSLPSG*DYR/ 
HTPPH\QIiIFVFWEMGFHQPA/SA/FP 
KC*DYRS\DHHKWPT*HLK/SFHFIKK* 
N*LKRYLACSRLWYLFCLIRCMV* *NLD 
DLS ILYLHLNKFQFLFIYIiF FEMES 
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nucleotide insertion 


10098 


23999 


A 


10173 


253 


2 


FPLKICNTLPPQIFSTQRGLIiSLPFTIF 
LLSI*ILLRPFPFFF*TESHSVPECSGMI 
SAHCSLRIjPGSSDSCAoAS \UViU3iCTCA 
R 


10099 


24000 


A 


10174 


300 


3 


GRAFF FKGLVKWVFLTPNP I PFLLKNFL 
KRVFFLAPSEKFVFFKNWPP*LSFLIFG 
EKNFFFFFFFFSEMESCSVA\RLECNGM 
ISAHCNICLPGSSDSP 


10100 


24001 


A 


10175 


117 


381 


E IXjRY I VS I RIRRTNLKFQQEELRVNI K 
NNLSSLGNMVRPRL/ECSGVITAPTAVS 
APGFK* SSHPSLPSSWDHRRAPLHLGDF 
Fill ETRSHHVAQAGQVI FNViSKvLtLhK 
FKISPSDPYRNNISQDFLAIGPTNMVPIi 
QPYI VLCACRTKNNKI IDLDSEQSSRT 


10101 


24002 


A 


10176 


194 


412 


TLKNliLIiSIiHCIjS FVPTDLYQPERLSKP 
PRPAK3CYKFLIL/WLGVVAHACNPSTLG 
GRGGW I T* AHE FDINJjTK 


10102 


24003 


A 


10177 


100 


339 


VPVKS LLLCVCVCVLLL * RISDHLPTL Y 
ANLG*CFFSYLVISYS\NSTYCILQRHL 
LAI QKQP / CS F JjRQ/ HIjVGWIjHKKKHTH. 
THTHTHrHTHKSNTHTHTHKS KDLTGTQ 
NTTEFLLK 


10103 


24004 


A 


10178 


229 


2 


KEKYIKSKLHFEARHNGSHL*SQHFGRP 
RLMDCIi\VQDQPGKCGENPYLQKNTKIS 
QAWWIj V PVVPVx SHRAb KTIiS 


10104 


24005 


A 


10179 


368 


2 


SGSSPSTSPQGRPDPLGLFSAKIVLGQL 
ATI PLPLCFLLKI FHPLTLP PLLLLGGE 
FNLSPQQQDS ISLPWFLCYRYGPTYKAF 
FYV*ASVPEKI \FFFFFF*DTVSLCHPG 
WTTVAQS RLTA 


10105 


24006 


A 


10180 


398 


1 


QNIMLYTINTYNFICQLKKKTIQWGENF 
FFWKWC*ENWI FTCKRKNI1HPYI1IPYTK 
TNSKWISELNL\KSLKVNRGGNLHDFQF 
GNGFLGDTKSKTK/AKISKLEFIKIKNF 
CVLNDI I KKVKRQLIEWKKYLQ 


10106 


24007 


A 


10181 


252 


419 


G^RFKGSNrT SAbGU^I Ir rMGPPKli JLi> 
RPVF*HRGEGKTPGVTQLNRLEEHPLFA 


10107 


24008 


A 


10182 


101 


380 


FCWIYNSWLIGYLFFPFRTLNILAMOGG 

0 UJj *byi ro Ks2 Kvj AUHijb Jro V KLKJ *r Viy X 

GETPS /LTKI S WAWWHVPWPATQEAEV 

V-3V30 UEj ITXvCi V IV 


10108 


24009 


A 


10183 


2 


377 


PGQVDCFSSGVQDQPDQ YGE I LFLIiKLQ 
KLGGHGG/ IR* EDHLSLGGQGCBCEPRLY 
HCTPAWVTEQYLVSQKKKKJCKKK 


10109 


24010 


A 


10184 


140 


3 


DGVIiLSLPRLECNGAISAHRN\SPSCDS 


10110 


24011 


A 


10185 


364 


3 


SVCPPRLGGVSQSGDMGVRDPFEEAV* P 
liAELERCAGRSTAFFRAIR*AHLSLLK\ 

l?T T D'NTJ"D'I?DW\7DPPPX7MQP*'E , r!T T.DT7WiB 

CPAQRGE I WRRSIiAAVALRS CGGRLQVR 
TCWQLCLHR 


10111 


24012 


A 


10186 


2 


249 


SVRLNKNKENTQIRNKKGDTINLTEIKR 
NPLRPGAEAQRLVS * RFWRATAI DHMRP 


10112 


24013 


A 


10187 


270 


2 


PWAQRGCVFKTNQKI FFGFFE IPHLD* P 
CVFQRPPS\F*VRLIFIGGGFKPFFFFF 
FEMKSCSVARLECSTISAHCMLRLPGSS 
DSPASA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
(XJIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10113 


24014 


A 


10188 


428 


28 


PIFDKG/E*GN*WGKKSIFNKWCWEKCI 
C/ KKLRDSSLTS Y/ TRINLKWI KDLNVR 
TK/STKLLEENIDIV/IYDLGLGIGFLD 
MTLKARATK/AKIEKLDKIKIGNFYASK 
DPKKMKR*FIEFIEQGKIFTNHVSDDTS 
I 


10114 


24015 


A 


10189 


85 


243 


DR VS / LLSPRLECS GM I S AHCS FDLPG * 
SDPFTSVPKSSWNYRCPPARKLSFT 


10115 


24016 


A 


10190 


96 


440 


I^SNHINTLNVNRLSSSMKRFRLSDEIF 
\KKHGPALCCL* EMYYKDKGTCTLKIKG 
WKNI YHANANNNKT/ GMVLLSGKIRLES 
N/TVTRVKEG * LVI / INGRL I / QEDLIL 
LFILKTVYAP 


10116 


24017 


A 


10191 


137 


423 


AKDLDRQFSKEDIHMTNRYMKTC/SGRQ 
LIRETQIKTTMRYPLTPLDLRKSKDKKS 
** * YEQKGTPAHC* *ECKLVQPLQKII* 
TFPKLLKINL 


10117 


24018 


A 


10192 


221 


72 


AKVK*VWFFCFLFFF*DGVS/HWVAQ 
ECSGAI SAHCNLCLPGSSDS VRVG 


10118 


24019 


A 


10193 


460 


121 


SDKTKYW*RCRAIKTLTHF*LKSKLVPL 
L*KTI*CYIAKIiKIHIVFDSAIPL*DTY 
FR/E* FCICAPKTYSNMVIEALFVPANQ 
TS KKKNWK* LS CPQ I VDNFLNI HLMKYH 
LE 


10119 


24020 


A 


10194 


393 


2 


ATMPS YQRLLLTAVHELES PE * MDQFLD 
TYNLPRFNQEEIENLNSPIMSNEIQSVI 
KCLPFGPNPGPNGFI IEFYHTYKEE/LP 
ILL*LFQKIEG\ILPNSFHEVSITLIRK 
TDKDAT * ETHKP I S LMNTDA 


10120 


24021 


A 


10195 


3 


353 


eaapmpvkffqllepsgppglsmsclgd 
cngyacfsvslhqsanqsidq/siyqsa 
ihlliy/vsiiy*si\ylll*sli*pmy 

VPIYPLSCLSIYLSFMPLSCLSFNNLSI 
TRHGSMCL 


10121 


24022 


A 


10196 


118 


395 


dfnagllpsyqvlslyfcf*ngv\chct 

P/RLGDRPKLR/ PPKKKKKKTPPPPI/3G 
SPPGGEPYTPRPGVGPPPPPFEHPPREN 
PGCFHPPPKAWP 


10122 


24023 


A 


10197 


437 


160 


FGYPRVFPLPALLKRGPGILFWGPNKKN 
YSSPARGSKICFFKRGPPFFFFFYFFF* 
QTESCFVAPVGVQWCDHGSLQPQPPG\S 
SGPPPSTS 


10123 


24024 


A 


10198 


259 | 


3 


RQGNHVTHVYSSRLQLILSPPASLAVFI 
QKIYVNLCTYIHIVNLYA*MCICVCICY 
TCI L\ I CI YGL VAVAHTYNPTTLGAKLG 
GS 


10124 


24025 


A 


10199 


1 


416 


I ISAHCNLHFPSSWD YSRTPP YPANFCI 
FSRDRVSPCWSGV*HHLEQHSKMI*SKK* 
NKIDMHGGMCL*S *LLK/RAE*DVLRQE 
NCLNPGGRGCGE PGSRHCT PAWATRAKT 
LSKAS YASSSSSPKKKKKKS PS 


10125 


24026 


A 


10200 


369 


3 


TAPPFFFFCRDGLRTSLI IFPSRATPYS 
LVTSGS FLPAGAGGRGLSLEI RVIjDLVQ 
DGCPCSQAWAAALGGPGWWAPFLCFFET 
ASCSVAQEGMQ*CNLGSLQPPSPG\SSD 
SPASWLWTSK 


10126 


24027 


A 


10201 


314 


2 


LHHMRPEV* PTPSKLGVTGYMVSNISYP 
WRCRAHK* SQVLVSLRHKHLFPLEMESG 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine, L=Leucine, MHVIethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Tbreomne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














S BVQL \DTNRC I SANCNFCIiPGS SNS PT 
SAS * EAGTTDSRQ PDAWADAW 


10127 


24028 


A 


10202 


68 


413 


KIGILLKTH* PS I C/ VS ITNF* YNDTGE 
LKVKG* KNI YYANINQK/KAGMVMFVLD 
ETDFRAKK/DYQGH* IMIEVSVHKE/E/ 
IiVILNLYASNKRVTKYMK * KDTDIKGEI 
DKSTTIVGDL 


10128 


24029 


A 


10203 


351 


1 


PKFPPLNPAWGTTIKIFFKKKNAHNWFS 
PRKEKALQKIHKKG/RIiLAKEISFLGGF 
* TDYKTNPDSQS IRTLGEKKKKIRPGTV 
AYAYNPSTFGG*GRWIT*GQELXTSIiAN 
MVGRV 


10129 


24030 


A 


10204 


2 


392 


FIjEGVIiLLLPRLECNGAI PAHHNLHLPS 
PRFKRLSCLSLPSSWDHRHAPTRPANFV 
FLVETGFHHVGQDGLDLLTL*SPCLGLP 
KCWDYRH\DHHAQPSF 


10130 


24031 


A 


10205 


264 


2 


PLSLSNLFPIPIiNSILKIFFILK*L*IH 
GKLHKMYRKV/LV*TSPSPSQC*HLR*V 
* HN IQNKKLMGLGAVAHASNPSTLGGRG 
GWIT 


10131 


24032 


A 


10206 


281 


2 


ISPPLFFLFSPPKKIFFSQKHIFLGPPL 
FYPPPLFFSPPP\ PHFLWGFRPVPNPMP 
KIDPQTWAKNKDKGPNFPPPFFFILPPK 
KNIFFTKTYFFGPPPFIiPPPPFFFSPPK 
KK/S YPFPIiKKNS PNPPPS * K* AAISLP 
PPFFYFFFFFFFS FFLENTWKMADAW 


10132 


24033 


A 


10207 


340 


1 


STKKLVQKLGEHTQCWQGCEATGTLIHC 
RWECKLVQPLWK/ SAWQILRQL*NFMHL 
P YNPT I QLQGS YSRE \QHTCP YKDLYVA 
ALFT IAKNC * QAKFLS TGE * INKLWC I H 
ITE 


10133 


24034 


A 


10208 


136 


3 


KNRGPPPRSGKFFPFIiPLFFFFFF*DRV 
\YPSHPGWSAMVPSQLT 


10134 


24035 


A 


10209 


327 


2 


INKKYYEQLYA*KFDNLDEIYQFPGRYN 
LPKLTQEKANNLNRPVSLRETKSTINNL 
PKRKVPDPDGBTGELYQTFKEEITP\FS 
TEAKGILPNSFCEVTI IVRLFLLRQ 


10135 


24036 


A 


10210 


89 


485 


E IVPLLRFPLVI F IAS PTTFYTYGFVSG 
EKSQYKKRE KLCDKTLEQSRSIiLGMHST 
CGIALHPKREGQMCTLCACFY* FFLKES 
HSAARAGVQWHNLSSLQLPP\PSSSNS 


10136 


24037 


A 


10211 


361 


462 


IQQIGSNTGNIPAMELALKRNSKNINRL 
AGWNGP*VFAC/PKNVLFYLHP*LFG* I 
* IYQLKITF *NYEGMVP/FVFRFLFETR 
SLLSPRLECSGTITTHCSLELVGSINSS 
ISARAE FLQRGANL 


10137 


24038 


A 


10212 


62 


469 


SUjWKLRSSGASTSLRCLSA\IiIjGDVSQ 
SGYTGVRDPLEEAVCLLSELERGAGRTT 
ALFRAVRQGRLSLEKL\liIiPFVQlCPAP 
GGGIYR/GQ*ALLSCGGLCPV* AS* * /M 
CLPC/EA*TRLLKPPPGS 


10138 


24039 


A 


10213 


197 


1 


WTQPNEWCVTNKDAS FQQRCTKKIKT/ 
WPGMVAYTCNPSTLGGRDGWIT*GQQFK 
TSLANMVKPH 


10139 


24040 


A 


10214 


31 


213 


HASLDEGFHHVGQDGLNIiLTS *STCLGL 
PKCWDYRHEP/PAPNWWKKFLSSKAFKR 
♦LGCY 


10140 


24041 


A 


10215 


442 


29 


PTINPTMACKCLSKRKSHTSFTLNQKLE 



1165 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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Predict- 
ed end 
nucle- 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 1 
OCysteine, D=Aspartic Acid, | 
E=G1utamic Acid, ^Phenylalanine, 
G=Glycine, KMHistidine, Msoleucine, 
K=Lysine, L=Leucine, M=M eth ion in e, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TIKLSEEGMLKAEAG*KLDHVH/QVSQI 
VNAKEKFLKEIKSATSLNI*MIRKQKRF 
IVNME/KKVLMFWTEDQTSHNIPLSQSL 
IRKRP * LFNSLKAERSEEAPEKLEASSG 
G * J 


10141 


24042 


A 


10216 


195 


533 


LWJ FWLRVTYG VAAKMSSGTT I IGRLS W 
GWRLHF\ KMAHL * G YWWEAS VPLHMKLS 
SCHGTG\SLRVKDPRERLRQKLQCPYDL 
VLEVTYWHFYCILFIRSKSLSPAHIQSK 
RI 


10142 


24043 


A 


10217 


274 


494. 


CVLKGLPFALPWFFFFF*DKVSPSPRLE 
CSVAI /SAHCSLNLLGSSDPPASASQLA 
GPPGLRR 


10143 


24044 


A 


10218 


142 


480 


VTIKIYLHPHDTTTPLRQGGSRIiKEILV 
SY*QQKVETRPSHVIFFTVISTQFKKTV 
TTRSGAVAHTCNPSTLGGREGQI/T/RG 
QEFETSMTNPVK 


10144 


24045 


A 


10219 


163 


464 


AVYFS FWRVCEHDLASLYFENSGKRDNK 
FVFLVLQAQGPAWRQGEVHKHLLPCVYH 
FAFWLTQQPQS FF * DG/QSHSLAR /LE 
CGGVI SAHCNLYLPGSSNS 


10145 


24046 


A 


10220 


379 


469 


V*KVQM/WPGMVAHTCHPSTLGG*GGWI 
T*DQEFETSLADVAKPC 


10146 


24047 


A 


10221 


211 


473 


VTWLLLFFSIFRASSLSTDSVSYVTAFQ 
FYFYLFI YFYLFI FETESCSVA* AGAQW 
CDLCSPQPPPRGS \SEEPSTLQMTSGSS 
NL 


10147 


2404S 


A 


10222 


142 


394 


GVCQS LLGS LCQLGGTGVRDQLEEAVCP 
LAELE P CAWRTLLLR I CCSLQS QQAGI F 
/RFC*SYIHICPFPQVLLSSEMGVLSIR 
P 


10148 


24049 


A 


10224 


77 


396 


KRKTKKS FWTPSGFGLRPP I PKKKKFFN 
SRGKQGGFGRGSGSRGGGGSTSGRGGGY 
MGKVFGMGQ*NFTGDQGLAKGGFVFVFL 
GRTTIGIKGALKRLFA/NNEHDLR* 


10149 


24050 


A 


10225 


155 


443 


GQARBGTLKSQFPFPS PPGWPL ILPGGV 
FSPKFFFFFFETRV/ CAVAQAGVQWCNI 
DSSNSCASSS*IARITGAHHH\TELIFV 

F*ISPPSQ 


10150 


24051 


A 


10226 


290 


485 


KRGEGPVQKKKFFFFFFETESCTVAQAG 
VQWHNLSSL* PPPPG\SSDSPAEFPR 


10151 


24052 


A 


10227 


228 


424 


KL YQNLHDTLFGFELSHYEE VH* KMLKG 
SSWEFVMHSS/VSNKCILICSAKKKKKK 
KKKKKKKKKKKKK 


1 A1 CI 

IU152 


24053 


A 


10228 


269 


454 


DLFAASQGHAQLLEELMATTLSLTHTHT 
HTHTHTHTPWSLF*SR/SALSLVC 


10153 


24054 


A 


10229 


255 


1 


FYI FS KFLYVWLNVSHLDSHICFCIQSV 
I ISCFGVFYYYYY\FF*RQSCSWHTGV 
QWCDRSSMQPQLQGQNDPPTSASQVAGS 
A 


10154 


24055 


A 


10230 


18 


472 


GPEEFGTRRERDRERERETRDSRERVER 
VRERER* /RQRERERERERERER 


10155 


Z4UJO 


A 
n 






5oi 


WKiuN 1 1 ry liviTVi JjENTKX/F XQKLHTK 
CW*G*KEIGRAIHCWWECKM/VIQPVWK 
R/VWQFLRKLNMLLPHNPAVELLDIYPN 
ELKTHVCTKALYEAGRGGSRL 


10156 


24057 


A 


10232 


3 


402 


APRLPLGGWISFSSPMGAMGLNRATGIM 
GSFFFFFLKRKSPLGPQGEGRGQNLG*G 



1166 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
C=Cysteine, D=Aspartic Acid, 
^Glutamic Acid, F=Phenylaianine, 
G=Glycine, HHttistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *«Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














NPGPRG*THFPG * TP * KAGNNGEGPPPR 
GNFGFLKKKGVPQKGQGGSKP/RDPRGT 
PRLGPPKGLE*RGGPPPPAPQK 


10157 


24058 


A 


10233 


48 


470 


PGRRFTTAGLGKQEPAASGPRAPRPAQT 
PLKREAARPSLRHLPEGA+APRSSARSP 
LLI PAASLRSPRRRTCRRRliRKFGI ITR 
PFQPGVRLPDSGPGLA/RWACAAVPAAS 
RALP ALRRPL* RRPGHGAPS SKSHGHS R 
SP 


10158 


24059 


A 


10234 


169 


446 


CPPGPAPSHLLEDRDDPQAGRQSSFYAG 
WWDRWREKTI1FLFI1RQSLTLSPTTPFFF 
\FEAESHSVTHSGMPGVISAH*NLRLPG 
SNNPPTSAS 


10159 


24060 


A 


10235 


264 


2 


KLKPIFKMVFLKIMFMSYIMEVFFFIET 
GSRS VTQAGVRWHDLDSLQP * PP\GSSD 
PPTSVSQVGKWHKTWEAELAVSRDHATP 
RAKL 


10160 


24061 


A 


10236 


306 


3 


SQDMVKSPKKKFP*fcV*GLTEI*KIRAIi 
VFNKKFFFFFFFPETGSTGSCSVA\RLE 
CSGTI IAHCNLQLI/3SSNPPASASRVIN 
SIKKLFANFIVSANPRPT 


10161 


24062 


A 


10237 


20 


469 


" QCGAIPRRGDHLRSGV*DQPGQHGETPS 
LLKIQKLSRHDGH/CRRLSHKNGLNPGG 
RGCSELRSCHCTPAWATR\GNSVSKKKK 
S 


10162 


24063 


A 


10238 


313 


3 


TRPRPTQGP PHSGGRACLC I FS PHG PPQ 
TRLSTPVSPSVRPIiGFQVLLSARHGFIiL 
CQHGMF\PSLFTMSNITLWPGAVAHTCN 
PNTLGGRGGWIT*GQEFQTS 


10163 


24064 


A 


10239 


274 


49 


KGNPFFFVETGSHS I AQDGVQ WCNHSSL 
QP * PP\G\SSDPPTSASRVAGTTGVCPC 
EDKLSHILRRCXAQVFMLAG 


10164 


24065 


A 


10240 


144 


454 


VCKIKIIYIiIVLLEDYIN* FVKSSL*I*K 
S*K/LIQQMWAIISLPECLNSWCLGKV 
G*KRCSQQGTVAHTCNPSTLGGRGGMIP 
R 


10165 


24066 


A 


10241 


289 


178 


FNYS S I KNNTDGWVWWOiLPVI PALWEV 
EAIESF/CSRDEIPLSCPGWSPVPKLKR 
FYCIiNLP * CWDYRQEAP PHPAI S 1 1 FDA 
TVIELFS 


10166 


24067 


A 


10242 


352 


294 


PPRTTKMW*DQPGQHVEAVTLQKMQKL 
VERGGKHPQSQLC /REDH* SMGGGGCS E 
PRP*PCTPAWVTERDHVSKIK+NKIK 


10167 


24068 


A 


10243 


293 


1 


VSLCRESSNSFIWQCPWSHILALCFQR 
EKKRLDIiP/S *MVGK* EIPFLVPLIiFIiP 
HWPLLLMK/YM* I FFFGNRVSLCHPGWS 
AVAQSQLTAASVPRR 


10168 


24069 


A 


10244 


218 


3 


LERHI PVRPI IICDYKWVWRF/HFLRPL 
HN* FLFSFFSFFFFFFETGSYSVT\RLE 
CSGTI SAHCNLCVHLIjL 


10169 


24070 


A 


10245 


185 


1 


KKQPQQTKNPLNNVKKEGKGQHFFFFLE 
T*SRSVAQAGVQWCDIiGSLQPPPPG\SS 

Urrro 


10170 


24071 


A 


10246 


2 


265 


AHKKKPAKHIAPSTFLYGETEARRSMPK 
AAQLKCGRAGI ILFVFKTES CSVAQDAV 
LWCNHGSLQP*PPG\SSDPADSACRVAR 
TTSI 


10171 


24072 


A 


10247 


2 


430 


EAGGREALDG CGLTGRESGVQRRRDSMG 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=JL»ysine, L=Leucine, M^Metbionine, 
N=Asparagine, PHProline, 
Q=G!utaraine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=^Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, V=possibte 
nucleotide insertion 














CQGTAGRGTRAGKEGPWRPEHLSRRPPA 
SS VLLS WAIHLAWVEGS * TGSLCDLEAP 
GPTLS PWCS WS PELLQLHV/HHHTHTHT 
HTHS/HPNTHTDTQSLCHQPSFSVSPRA 
RTVL 


10172 


24073 


A 


10248 


201 


16 


TPPGGVFFLGFFFIiFFFFETESCCVAQA 
GVQWCDLGS PHP PAS RRPGNRS S I /AAA 
♦GYHFK 


10173 


24074 


A 


10249 


167 


2 


GFDP FEGMLS YLEKEET/ WPGTRAHTCN 
PSSLEG*GGWIT*GREFKTSLTNMEKP 


10174 


24075 


A 


10250 


92 


389 


FQPGQHGEILSLYEIKKI * /ELAGCIDM 
CLWSY/LIiRRLKQEDC3ST*VQGCSBP* 
CXSCIPSWVTEQDPDSGRQI*KKKKISL 
GRRGGI FLFLKI WHVFFL 


10175 


24076 


A 


10251 


221 


415 


KKKKRGGRFKGSNFTSPGWRGN I FFMGP 
PKLNSRAGV*QRRD/WEKPGGPQFNCFG 
SNPPFPPGGK 


10176 


24077 


A 


10252 


94 


383 


GKLNROTIWLLGRCQRSPAVSNALWLHR 
VAPEEEAGEAWAGTLSLQLSCKSKITPN 
KIVY* KKNKT/WLGTVAHVCNPSTIjGGR 
GGWII*GREFBTSL 


10177 


24078 


A 


10253 


249 


2 


YPTMIFKCSSKRKSPTSLTLNQKLEMMK 
LSEEGMSKAELS * KLGLLH/QVSQWNA 
KEKFLKE I KSATPANTGMI RKRNGL I A j 


10178 


24079 


A 


10254 


132 


414 


NNTYFQGKIFILNSQVKGMLICCWWECK 
*VEPLWKAVWRFLK\DLEQNSKEYKL*T 
WNQPRYPAWDWIKKMWYIYFMEYHA\A 
IKKNEIMFFAA 


10179 


24080 


A 


10255 


359 


3 


YSLRHNIEMRPVSNPTRASKCSHERRSV 
ISLTLNGKRE I IQEASKEGTLKANKPKA 
KLFVPVRQWNAKEKFLKEAKSAAPLN\ 
RMIRKQNSLTADTEQV*VIWIEY*TSHS 
IPIiSQSL 


10180 


24081 


A 


10256 


283 


2 


SLQPWKLYVHVFLPKYATYPPFGTLPKT 
HFFHKILPSKAGSDLNSIYAMTYSHRHI 
TIFFETESHSVTRIiBCSGAI \SAH*NLH 
LPGSSNS PATK 


10181 


24082 


A 


10257 


216 


386 


PSKNPPPLINPFFFFETDSCSFA*AGVQ 
WRNLSSLQTPPP\GSNNSCASAS/RIAG 
I 


10182 


24083 


A 


10258 


263 


1 


FSQWLILPSPLSYSFLFLRQGFALLPRL 
ECSGEI TA\PSSAS * VAE ITGACHHABF 
LNFFVEMEFHHWQAALKPLGSSEPPTP 
ATK 


10183 


24084 


A 


10259 


248 


47 


IPKSRVICRENRLNPGGKGCN*PRSPPCS 
PPWVAKQNSLSKK/NKVKIYQNLHTETQ 
TVHGDIRGQEK 


10184 


24085 


A 


10260 


159 


2 


KRDMSSLKLEY* KKE/RIWPGAVAHTYS 
LSTLGGREWIT*GQEFEISLTNMVK 


10185 


24086 


A 


10261 


190 


405 


ASLHMFMYFFFFFFPKQIFFFVPQAENQ 
WGNLGKRNPPPPGLRDF\PASPPKEAGF 
RGGPH* PGLFLVF+GE 


10186 


24087 


A 


10262 


39 


293 


PSSSTSRIYPEDTPLITQKYICKILFIV 
ALFIFPKHWKLCKCPHIG/E*IKLQCIY 
TWRYYVAVQKNEENLYQLMYNDYQEILV 
SD 


10187 


24088 


A 


10263 


282 


3 


AFYNRFFLKIFFKRGPRGKEKKGGKKEK 
KKWGKKKKKKFFFFSPGQIFFLGGVFFF 
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sequence 



Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
OGIycine, H=Histidine, I=lsoIeucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 



^agvqwcdqVsssqp 



FLFFFEAESRSVT 
RPPRLKPFFQ 



10188 



24089 



10264 



196 



376 



KKGGGHI INFPPHKGYKPGPEKKKFFFF 
FFEMESCSVA\RL* CSGMI / SQAHCNLC 
LPGSSN 



10189 



24090 



10265 



514 



157 



GLTLSFRliEYSGMMSb*P*LPG/ FR*FS 
RLSIiPSSWVFPLMYVYHYAQQSFCIFFV 
ETGFHRWQAGLKVTiGLRQSAHLGLPBC 
W\DYRCEPL*LTKliICFTYGLR*RSKPP 
FFVDGIQLF 



RSRIW*DSVILCS*LVSV/VDQSYFY*V 
TEILBQIKLLGSKIMCFCILQLHFLCFL 
CGSFVGST\IFIFSRNIWLGTVAHACNP 
STLGGRGRWIT*DQEFETCLASMVK 



10190 



24091 



10266 57 



378 



10191 



24092 



10267 



212 



LIQKDTNI YYIKEAL* I PNKHI / KKISS 
I VSHYKMQIKIMR/ YHTRMAI IKRTDKC 
L YECRETRTMHC W * GC 



MANKH I KI FSASL IRMEI * IRATMRE/ H 
FIPIKWIiVIiKSKRESNKCWQGCGRTGI 
VIHFQRECKIVLLL *KTV*QFIiKKLNIR 
IiGAVAHSCOTSTLGCLRPRVQDQPGQHG 
KTLSLIiEIQKLAWHGIiHLSPS 



10192 



24093 



10268 



399 



10193 



24094 



10269 190 



10194 



24095 



10270 



10195 



24096 



10271 



386 



FSPFLPLVTILECDKAICRRGKSFDFFL 
LGPSFYFFKLLL/ F I FYF*BRVLLCGPG 
WSTVAQSWLTA 



166 



327 



KTNSKTAEVCPLSVITLNINGLNSPIKR 
QR»TDF»NPl\ICCLQETHFRFKGTN 



180 



I FFLFEVETCSVA* AGVQWNDLGSQQPP 
PP\GSSDFQLIFAFLVETGY* IMVPSSS 



CRT 



10196 



24097 



10272 



273 



FLGLFSGNPbKANKPVOILTLPIIjQRIF 
F FFES E FHS \ VAQ AGVQWCDLGS LKPP P 
PG\SSDFPGRRR*Q*AMITPLHSSRGDR 
VRLtRLKKK 



10197 



24098 



A 10273 



399 



27 



YHFVPTRMAI IRRKNN* CW *TCGEIGNL 
VYCWWECK/LV*LLRKIVWQFLK/DVKQ 
NYHPVILLIiSIYRI * FDCLFLCKSHVEM 
* LISGVRVGAWWEVIGSREWFLMSGFT I 
SSCHQDCEFSQDLIV 



10198 



24099 



10274 



275 



462 



KINLVLQLDFFFVEMESHSVTQAGVQWC 
HLSSPQPPPP/GSSGDSRASDS*VAGIT 
SVHQHAGL 



10199 



24100 



10275 



379 



RGPGGRGLGSGTCFHPGRGGB P I VYQHL 
F* FFGP\ PENYILILPGFGI ISH/I VTY 
YSGKKEPFGYIGMA*AMISIGFLGFIV 
AHHIFTVGIDVDTRAYFTPPYHTFEEPG 
YIISRRRRKTKGFLF 



10200 



24101 



10276 



407 



FPSPSPSPFASVSLSLSFHGLPLPEHTW 
FIDGSPTKPNRHSQAKAGYAIVPS ISII 
EATA/PAPFTTC*QAELIALTRALTIAK 
GLCINIYTD/SHI3jHHHAVMG*K/GFLT 
TQGSS I IDVSLIETLLKAFLLPKEAGA 



10201 



24102 



10277 



203 



545 



10202 



24103 



A 10278 I 250 



FLWRGQSLALLLRIiECSGSILL/AHCNL 
C?PHRFKRFSLPQPPE*LSNW\DYKRLPP 
CPANFCTFSRDGFTTLGQAGLQLLTSSD 
LPTLAS PKCW \DYRHE PLVHGLTFNRS F 

NFFF 

AKWFIiEMKSTPGEDAANI I EM/ TKDLDY 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














D I NLVDEAVAG FGR I DSNFERS CTWKM 
VSNS ITCYRELFHERKSRLMCQM* LSYF 


10203 


24104 


A 


10279 


333 


3 


SLILLLFQSMNHTHVRVTPIDQFCVCSN 
YSTDWLFPHLS PIS LRTP YS LR YNI KI R 
PINHPTMAMASKCSSERESRISLTLNQK 
\LETI KL / SEEGMS KAQIG* KLGLLCH 


10204 


24105 


A 


10280 


251 


3 


ITSKVIDSNPTTS I ITLNIiNGLNTSF* R 
QRLL/DWASSPK*DSIICYLQEMNFKYK 
I FKKIGRRI YYANI SQKKVGVAVLTPDK 
V 


10205 


24106 


A 


10281 


22 


226 


TKINSKGNKELNVRAKTIKLLEENIDRN 
LCDLGLGNCFLD*YCIL*TKINSKGNKE 
LNVRAKTIKLLEENIDRNLCDLGIiGNCF 
IiDMTPNAQTTKGR\ IDKLDFI KI *NFWL 
DT 


10206 


24107 


A 


10282 


238 


383 


DAKRKVSSTEGDSEGVAHERRSAHLSAK 
PCPAEVE/AKPQKAAEKG*SSE 


10207 


24108 


A 


10283 


127 


377 


RIPGGSALGKEIRQCFGVKGQAEIGKIE 
MSQPTQEREWEGIGSQKAAQVDHLRSG 
VQDQPGQYGETPTLIjKIH* LAGHG/GRC 
L 


10208 


24109 


A 


10284 


328 


370 


VGPGPGF*SSGRVSSCCPGWS \PSELK* 
SAHISLPKCWDYRCQPLRPATFFFKTAS 
HYVAQAKVQWCNYS SLQPGTLRLKCCI F 
KKIQWQW 


10209 


24110 


A 


10285 


341 


1 


KWCVLTAPLSGRFSVCLPFLGPPYFLQ 
HSDIEIKPINYPTVASKCSNERRSHTSL 
TLN*KLETIKLIEEGGMSKAETGRKLSL 
L/QSVSQWNAKEKFLKEIRSITPVNT* 
MI 


10210 


24111 


A 


10286 


199 


329 


LFFSFLNCriSPHRIEESY*KEIKSAPPI 
TPG I PRK* TSLIAGKKNVFMDWKDQTNH 
MIP/LSQSKTITLFNSMRAETVEEAEEK 
L/ERGCFMRFKERTHLHNIKMKGPGWA 
HTCNPNTLGGQGRQ I T 


10211 


24112 


A 


10287 


358 


387 


YTNAIiIiLYV*DQPGHIjMGKDSLFNKWC* 
KNEYSHAKKKQNH\ P * FSLYTKTNSKWS 
KDLNVRAKPIKILEKK 


10212 


24113 


A 


10288 


111 


373 


SAQCQPDQHGKTPTLIiRIQKLAGPDGAC 
L*SQLI*R\ * GOGE PRSRHCTPAWAT/EG 
KTPSQKKKKK 


10213 


24114 


A 


10289 


197 


375 


IRIGLCKFI I * FLETKLHTVAKVECSSA 
I S AYGTLS LVDSGDS PAS AS / RWGTTG 


10214 


24115 


A 


10290 


118 


371 


QCLLMMS PGLKT* MTGRLRQBNRLNPGG 
GGCGE PRS CNCTPAWATK\GNS AS KNKQ 
T\KNKQQQQQN 


10215 


24116 


A 


10291 


214 


368 


CGYHQKNKKIWPKNSKCWQGCGEIGTLM 
HYWKDHKQV/ Q/PL*TIVQQFLKTRI 


10216 


24117 


A 


10292 


2 


376 


PFPLMAQRPTHLVPIiSASSGTSLLMPST 
IiLWLPWLYKAHAVPTVPLSKLFDHTKPY 
AVSKNNLGALSI*SY*VIPVPPRSRF/S 
FIRDKPTS 


10217 


24118 


A 


10293 


1 


379 


NI LHNLDFFKNTTLE PQSSRLQRAVI AP 
LHFSLGNIVRRCFKKKSGWAQWliTPVIS 
ALWEAD VAGSLEFKS ETTW3NMMRPCLN 
FKKKKKKAFLFGAPGFFFFMGGKILFGI 
AAPF*SSGRVSSCCPGWS\PFELK*SGH 
I SLPKC* DYRCQPLRPATFFFKTASHYV 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
KMLysine, L=Leucine, M=Methiontne, 
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T=Threonine, V=Valine, 
w— lrypiopnan, i— tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














AQAKVQWCNYSSLQPGTLRIiKCCIFKKI 


10218 


24119 


A 


10294 


201 


384 


GEERLSVQCSAFK*IiR*HINISIjQTFFF 
LRRESHSVA\RLECSGVISAHCNLCLPG 
SSDSPA 


10219 


24120 


A 


10295 


161 


400 


VCVIESVKANHLVDEHMjKSLSFHLSLL 
FGV*GVCVCVCVCVYMCF/CVCIYVC 


10220 


24121 


A 


10296 


1 


433 


GEGGGGCSGWSGVETASSKFAFvjAlsUUij 
LPAPALGRWQGLNSVCYPAWPQPNRRQP 
LALLCAAGLGPGQRLECSGMIMAHHSLD 

FHHVAQGGLELLGPSDWPPTVLGLQM*A 
TTPGH 


10221 


24122 


A 


10297 


209 


1 


Lilr ciJUlixrUli lliir c w jsooocov*iyi\rcw 
QWCDLGLIjQPKALG\SGDPPASAS*DA* 

DluVUI IdnniV.J. 


10222 


24123 


A 


10298 


225 


1 


RPGMPLSALPCTGHAPARQPHSAACSGP 
KCPWCQSPQPAVGWTV*VCPLPLSPCLC 
/FCVPQPWHGGLPLPCPPV 


10223 


24124 


A 


10299 


212 


367 


vlAGSGHWUx 1 WWlSV»iCLiisyoJj w 1XA V Wxtr 
IiK*LKVELRFDPAVPLL/SIYPED 


10224 


24125 


A 


10300 


97 


406 


LHRCrTPAWVTERGCLKTTNQNNGQHNVE 
GEACS SRPFTTIC/ EEKIHLVCALTEED 
♦RLAAETTANTTDIPIGSAYRILIKKLK 
PS KLSTC/YVAKQLSPGQLQTK 


10225 


24126 


A 


10301 


3 


422 


TLSTLSHFQKGWVKLFFFFFFKKGFPFW 
PPGGGGRAKP* F/HGNPPPPG* KNFPAI* 
PRQGGGKKG\GPPPPGQFWFFKKKGVSP 
WGGGAKTPELGNPPPGPPKGGGITGGPP 
PP 


10226 


24127 


A 


10302 


330 


1421 


RTKGRPLIJALPVLLLljFIiHLAFFSAPP 
GPSSLSSTEDLKWTLLSPAQSRKLQPQS 
LHRAGVRDLQWQGRAGAELTAAC/WPAP 
PRPETAVIjIjQGLPC/ qegrllleslngf 
ALWSAEGTI FYASATIVDYLGFHQTDV 
MHQNIYDY1HVDDRQDFCRQLHWAMDPP 
QVVTGQPPPLETGDDAIIjGRIjIiRAQEWG 

tgtpteysafltrcficrvrclldstsg 

ct TiwroTsrunirr irirT.'CV!iovTfTf2it3CrtZiMT»T5D 
C Ju 1 Wy fyijJVLiJXJ? Jj V VJV JXJCVIvrvr' o\Ji"U r iiJ xr tr 

RLSLFCIAAPVLLPSAAEMKMRSALLRA 
KPRADTAATAD /S KMRS * LTLS ESRLPN 
* PR T WD<3 AAPGTLPQLPFFGAFDGOHI F 
CIHTRALRGSTRTRQCIAAALEGASFRN 
HVEL 


10227 


24128 


A 


10303 


172 


383 


JViUTi J\.V Hyul aa±Et luv iV»Ylnlj^< XVI 1 v \£U 

LWKTTWWLKI LKLGP P * \DSI IPIiLGI 
NQGTEIRFCKRYSNT 


10228 


24129 


A 


10304 


205 


2 


NFYPA*KTDFFLVGSLFIYFE*ASHPIT 
\RI^CSGO^mCSLDLIiGSCNPPASAS 

ti\ 7 c r* t a n i\ trmi#^ T 
ri Vbvj iiVaArllulv* J. 


10229 


24130 


A 


10305 


423 


30 


I^EGKLTNRKDIRTKTPSVRRIlHQSPKV 
DRTAKMGKKQSRKSGNSKNQS AYP PPKE 
HSSSPAMEQSWMENDELREEGFRQSS/M 
SKLKEETOVNGK*VKTREKRLAKSLTRI 
TNMK^KDLI^LKTSVLRPL 


10230 


24131 


A 


10306 


211 


387 


IFFFF* IKACSVAQAGGQGRHLGSLQPP 
PPGLKQDSC/SPSRGVGTTGMTYHAQFI 
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X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














FFGF 


10231 


24132 


A 


10307 


229 


382 


ERAWQPG*HGBT/ PSLKKL* KLSRCX3G 
MCL YS KLLRRLR * EDHLNPRS WGC 


10232 


24133 


A 


10308 


168 


417 


GSYCFFVRRNLLIVKYMRSVHLLFIMLL 
YRGD * WYVTTQ YN FC I LQKQ E KL I YR I A 
/ HVRNPSSLRGGGG* IT*GREFKTSLAN 


10233 


24134 


A 


10309 


253 


402 


KGGGGGVKTPARGNPPPGPPKGGEKRGG 
PPPPTQKGFF/CFFFKKGTRPPGGKTGB 
RAKPNEIPAPGKKEFFLPKPPEKGEIKG 
/ PRPPGQINF* FLKKTGFKKGGGGGSKP 
RPGGTPPPGPQRGGKNGGGPPPPPKKGF 
FGGKKKKKNPPR 


10234 


24135 


A 


10310 


182 


400 


LHVTMI I FWLYL I KNMLLNFTYF/ CYIF 
SVATSTFKIVCV/WLGAVAHICCPHTLG 
G*DGWIT*GQEFKTRLANM 


10235 


24136 


A 


10311 


371 


400 


FIWGNI *ASNKHIKKT\STSLAFRKIQV 
K\TTMRYLSPPTPIRMT/I/I*KSYSSK 
CW*GCEEIRSLIHCKMIQPI*KT/WQFP 
KIINIQLLYNPAIALLGNRTRG 


10236 


24137 


A 


10312 


113 


404 


QROGKIGTFTHCWWEYKMVQPLWKTDWN 
LLQRLNI ELPCDLAI PLLNLYPREMKTS 
IYICT*MFIAALFII/AKWRQP/KCSSA 
DE/IGKMWHICIMK 


10237 


24138 


A 


10313 


146 


1 


FLTQKFFFFFETESRS * AVVQWLDLGSL 
QPPSP\GSSDSPASAF*VAG 


10238 


24139 


A 


10314 


157 


434 


FFFSEIFFFGPGVFSPPPFFIGGPGIFF 
FCPLKKKILPPPRGKKFFFFKKGP/HPF 
FFFFFF * EGVSLLLPRLECNGAILAYCK 
IHLPGSSDLP 


10239 


24140 


A 


10315 


277 


2 


FFFTAFFLGVPGVFPPPRFFKPGPGFFF 
LGP*KKNFFPPPRGK/ IKFFFLKGPPSF 
FFFFFF*EGVSLLLPRLECNGAIIiAYCN 
LHLPGSSF 


10240 


24141 


A 


10316 


1 


388 


KTITHEVI TECI WLKVLLLYFPNCMCI Y 
VRMHICVCVCVHISI*MDHIVSYKCIHI 
\LCIY 


10241 


24142 


A 


10317 


134 


399 


LGLPGGDTRGGGTIENPPFERALNSQNG 
GK\NPLLGPKTRPLGWGLCSTPKFWEKP 
RVRWDLN*TTPCLEKRAPWLKAKKTPA 
FEIRI 


10242 


24143 


A 


10318 


233 


3 


E FLFSL I CAI CL * CALKQFS S C F FCLAL 
IELLGSMGIiLFH/SSLETFQPFFLQIFF 
/SFFPLFFGANYTYNKPFEYSLI 


10243 


24144 


A 


10319 


221 


8 


CDQPKSPPLSGVQMGDILWLVFFLFLF 
AF * R*GSHCVAQAGLELPGSSDP / PQAT 
SQVAGTTGMHCTWLVF* F 


10244 


24145 


A 


10320 


1 


407 


RSTPDPVYLGFTSGGCKTEKRAGCFFLW 
KIiHTKGAPARCQPELSGMRC\R* PLLGG 
V/SQSRYMAVKDSLEEAGCPLSELKRCA 
GRYTGLFKGGRLDHLNLLRLCPQTTLCP 
GALSQEDGSFIYKPLTGAAAFLSYM 


10245 


24146 


A 


10321 


115 


398 


ADEELLLMNKQIKYLLEMESTPGKDDVN 
TVEMTKE SE YS I / NLKAVAG FER I DSNF 
ERLSAVGKML*NSITCYKEIFH/RKSQL 
MSQTSLLSYFKLP 


10246 


24147 


A 


10322 


137 


454 


FFSLYFQGFFFFFFLGEKFFFSPGGKGI 
CPKPPFGPPPPPGKGFSPPPPPGKAKKK 
GPPPPPP\GFLFF* KKPGGGFLGGFFFF 
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nucleotide deletion, \=possible 
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LGGKI FFSPAGGKGGPQKPPFP 


10247 


24148 


A 


10323 


246 


372 


ISIKSYQLW\LGGVAYACNSSTLGGRGK 
WIT*GQKFKTSMAKM 


10248 


24149 


A 


10324 


415 


1 


IWGAPRVFPPPPF*NPPPEKNFWGPKKK 
KKFPPPPG* KIFFFKSPPPLFFFFFFFF 
SETESCSVT\RLGCSGGISAHCNIiRLPA 
SSNSPPPSSLRMPSTPVIiRPHGDLLSPQ 
LSAWVLSWPFLPSLLLPPDLDFIVQ 


10249 


24150 


A 


10325 


3 


559 


GDGITGLSPTISKFIiFFVFESISIiS PRL 
GSSGAIMAHCSLTSRREPPCRSINFFVS 
VITIiNVNNLTNGKIGRMDFF*MM*YTIQ 
ICNLQNTHCRP / KDTNCFKMKRRKE I IH 
ANSNQKRAGVAILI PDKIDFKPKNVTER 
HRMLVRES / I SQEDVTI INTY 


10250 


24151 


A 


10326 


24 


387 


NQGLGARNQKSQRVFLAGFFFFESSSCS 
VAQAGVHW*DHSSV/PGSSAPPASATW 
VGTTGVHNHPWLI F/VFFVETGSH 


10251 


24152 


A 


10327 


145 


396 


CDYFGEMFSYLLLNIYLCDDPAIPLLPY 
VTYENENTDSLKDMTDAHSSFIHNSPKL 
BTKCPGTDEWI *MTKKSDYCKHW*GYRV 
TGTLL YYEGECKI V* LLWGNV* LF\LLN 
IYLCDDPAIPU,PYVPYE\MKTRIHSKT 
♦PMPIAALFIIAPNWK/PKCPGTDEWIN 
TL* HTHAVQYHVAVER 


10252 


24153 


A 


10328 


94 


639 


ASISSSIKWDNISFLVGLLGELNEIRNI 
CKAVNMKQALNKWKLILLFVIENNNVSA 
SSNNFKI FYAYY IAKAVLLI ITEHITLS 

rrrr -tt yyr.r rNUT> inn i ht»iiiit/*'«i>t\ no/in (Ml/ U»l»T 

I VL I LKKQNNTRAVTKCW \ RGGREKETL 
IWLVG* ECKLVQPLWRRVWRFLIK7KIE 
LPYDPT/ ILLGI YSKERKS VYRK\NI CT 
PMF \ VAALFTTAKTWKH 


10253 


24154 


A 


10329 


143 


3 


SAWFFFQDRVTiL* / S PRLECTVS ISNHC 
NLRVPGSSKNYCIiSIiISIWDR 


10254 


24155 


A 


10330 


153 


389 


KjQGli JlLFPGIj V If W JjP Viiii SHlteJj V XJS 
WGYQCYHAWPS*CF*NftSl/WSGTVTHA 
YDPSTLGGQGKPIT*GQEFETSLA 


10255 


24156 


A 


10331 


382 


422 


RYKliDli* KLNTEJjP YDPTVPXiLGNYPKY 

*t*af\sktct*mfiaaysqppkgrskc 
sspne*inkmwpiyimr/yysai k 


10256 


24157 


A 


10332 


194 


1 


CKRQCNKEDRNI / CS K* *NPT ILF* TTK 
RKG/WLVAVAHACNLSTLGG*GGRAS*A 
QEFJKTSRANM 


10257 


24158 


A 


10333 


348 


1 


NSGQSWIMYPLLKKGVGRVSPPDPDIVN 
S ISMVWCGGRLFSNERFASLLT I KKKLA 
GHGGTCLQSLPLGRLRREDRLSQGGQG/ 
C*GYSE*LCC31CIPAWAAEGDLWSWSPN 
SVAY 


10258 


24159 


A 


10334 


388 


2 


IPGS PKRGPGGEKKKTFFHYLFIiRAAP P 
LPRFIFFPPKNPPLKGGGGFFFPPPPKL 

RGRFFFFFFLRQS FAVAQAGVQWCDLGS 
LQPS PP \G * S YHPARTRG 


10259 


24160 


A 


10335 


3 




SKKKSCISLTLSQKLDVTKLSGKGMLKA 
KTGQNL/GLGHLYRTVRHAVNVKEKCLK 
EIKNAALLNTQMTRK*NSLIADTEKM*V 
V* IQE * AS /HSMPLRQNLIQNK 


10260 


24161 


A 


10336 


165 


442 


GQMELLILARNFGILLHFNLSIYYRRLF 
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FSIPRYLFIFTYLFIYLFIYFESVAHSD 
AQVECSGTIMALT * KKNASLP / ASAPRA 
AGATG 


10261 


24162 


A 


10337 


33 


430 


TGSTHASANFCVSLVWTQLISRSSGQGL 
ARLQLRCLPCCILFFSFFSLVIYRLHSY 
LRLPWRRICLQVPSGC*QNSFPCNFRIQ 
GSLLL*/MQQW 


10262 


24163 


A 


10338 


284 


3 


SQHFGRPRWVDHLRSGVRDQPGQHGETL 
SRBTP\SLLKIQKLSGHGSACL*SQLFG 
RLRQENCLNPRGGGCSBWRQHQCTPV*V 
TEQDSVPKKK 


10263 


24164 


A 


10339 


414 


2 


ERAGPFSRKIFFSPPLAKKVKYPFPPTP 
RARAHKRPIFSRYVKKERGNSPDLFWPP 
FFPSLW* KQ*GPSLGRIiFYPV/SWGI FP 
SKP*GFPGPRPKVFCFLG/LPPGPFGGP 
FFFFFFGDRVSLCHSRWSAVAQSQLTAR 


10264 


24165 


A 


10340 . 


211 


422 


ASLGNGGRPSPSIAAALQFDL*LRAGII 
FALFKAARQGDLSL*RLLLSF\VCLCPA 
PRGGAYRGRQAFLSWGGLHPDRASRLFC 
IiPKQAWAMAGAPPPALLLPCSIiISYCRA 
SNQQDSGGIGPSEPSAGYKLLVPGFLLP 
AEK\RTIYGGVTRFFRGP 


10265 


24166 


A 


10341 


116 


432 


SRQRMPSTYHVQGSGDTKTGEKPCANGH 
KGCGSSILRSCLHRKRFSF*SLKALVWC 
GGACIiEAQLLRRLRQEAHLSQGVPGCNE 
L* PGYCTPAWATEP\DSVSKKK 


10266 


24167 


A 


10342 


103 


458 


SLNVCVLTTLPSSCSPISLPLRGPPYSIi 
RHRNIEIRPINNPTMASKCSSKRKS*MS 
LTLNQKLE\MINIiHEECMLX?UCTG*NLG 
IXHQIVDQVVNAKKKFLKKIKSAIPVNT 
QMVRKQN 


10267 


24168 


A 


10343 


53 


408 


SFQFFCSVFSPSLWFYLLLVFDDGDVQM 
GFWCGYP FCLLVFLLTDKTLS CRSVGVP 
CRVRCQCA\LLGGASQLG\SRGSGVRDP 
LEEAVCLLSDLQLRAGRTTTLF*AVRQG 
HLSLQRLL 


10268 


24169 


A 


10344 


265 


421 


NNFPPSPPRGGGNPGPPPPPRFFFFFLK 
NPGLPPF*VIYI»l4/YFTLTPSFCLWCYT 
ISVYGCGFVFFFFSFFFFFFEKKIFFFP 
RVEPKNRIPVYPKILFPG*WNFPPSPPR 

GGGNPGPPPPPRFFFFFLKNPGL/HPF* 

w^'R vrnr\T TTvrr>r>T>i»Tr> 
rUvijAivr t'UuLNyyyYik' 


10269 


24170 


A 


10345 


207 


420 


WQKGCPVTFMELGWGPLYPCFLHLWIQ 
PTSHRKYIHL/WLGMVVHTCNPNTLGGQ 
GGWIN+GQEIETSLTD 


10270 


24171 


A 


10346 


115 


3 


APRGAVKKR/ R P WMG AATHVC KP S TLGG 
RGG* IT*GQEY 


10271 


24172 


A 


10347 


41 


421 . 


FTFVS VCVHFKG I FLS FLS FFFETEVPG 
SPLGPKRGGGGGNSGPWNPPPPGLRGSP 
PPPPQRTGKGGPYPPPPGNFFFKFFFRK 
RGV/PPL*HPGGPQSVT*GTPPPGPPKF 
WGAQSGGPVWGKNPLF 


\\jJLlL 


Oil 1 T~> 

241 A3 


A 


1 f\1 AO 

10348 


228 


1 


S KQ VKifi K. i ub> V LtHJs. i Q KQAT * WK JlecR V 
TKKHVE I TKK/WAGHWAHAYNASTLGGQ 
GGWITLGQEFETSLANMVKP 


10273 


24174 


A 


10349 


405 


1 


GGFLKRGPPFFFFLRQGLSIjLPRIjECSG 
MTVAHCSPNILDSSNLSLRRSWDYSHQA 
QLIFLVSF*VLGSHSVAQAGVELIiAS/P 
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KCWDYRHEPL/WPSQSQNIiNSFFLDGFS 
LLIiSRLECNAAISAHCKLRLLGSS 


10274 


24175 


A 


10350 


301 


546 


RHIFMNLPFHNLVKIIFHCRGKTLFLDI 
SNPYLKKQVSSSKPNCK/HLKE I * LFLE 
AKDYWiiGVvPHAl NPb 1 I/jGOfciKb 1 1\> 


10275 


24176 


A 


10351 


873 


1258 


FCTQCTFHLLHHLHHHRPSPRIiEPPPPG 
AALPPVPEGRI I PLPCPSSGPRQTTGPG 
KGRRHPAPHRTPAASAAsLPARRTLPAA 
PW* PI»SFQNTTPGSA\PGGGNQPASGAG 
GNSRT 


10276 


24177 


A 


10352 


184 


576 


GLSPLSHLKMAASSSEISEMKGVEESPK 
VPGBGPGHSEABTGPPQVLAGVPDQPEA 
PQPGPNTTAAPVDSGPKAGLAPETTETP 
AG \ PQKjQPRPQTSA*AQEGNQRPTAAPK 
THAQKTVS KPEVS KEATAD 


10277 


24178 


A 


10353 


110 


384 


VPGSPRKTRSPQQMERVLAPKTAKKGCP 
RPGDHRSRLQNCRDTRARIKWADSVERP 
PWAG \ PVAHACNPin/ JUGGRGGW IT*GQK 
FKTSLVNM 


10278 


24179 


A 


10354 


123 


422 


RKQIANDKSKLFI INKYLNVNELNSPIK 
/RERLAGWIRKEKHDPSICHL*KTHLT\ 
KDTNSIjKM£G*KK.1 YnANNi / y KJUWaVA 
ILISDKIDFKFKKKVVREK 


10279 


24180 


A 


10355 


36 


426 


PPERPQY SARVCV CVFYTECRGLAPKPE 
CRGAIIAHCLJjNvXjGSQ*SSN1iR 
DYTQLIFKLFVQTGS^CIAQAGFELL/S 
SKDPPTLA/FPKHWDYRH\DHHSW 


10280 


24181 


A 


10356 


193 


3 


FQTRIFKISFFFKEFLKFKKFFFFILI* 
ES RS I AQAGVQWRDliGSLQP P P P \GSJb b 
PPASASLY 


10281 


24182 


A 


10357 


205 


377 


SPWFQPYYRLTI PYQKC* LATVCI*VY 
♦FFKLILYPAILLNSIIIWINFTSDPL* 
FSSHITG*LSLIKNVWVGPGMIiAHA\ *N 
PSTLGG/ RC3GWIT*GQ1£FBT61jANMVKP 


10282 


24183 


A 


10358 


51 


376 


LCRPFQLCHGDRTDMWCPFPEELQDRCS 
CRSTSMRVAKGTTPCFLKVLQNIYTV*M 
FCRTFRKHGVVPXiATRMJjV \ I YKN5GP V 
TLQGRGTTYLSCHHGKAGRVCRATQHAA 
GVWTKQRACI 


10283 


24184 


A 


10359 


295 


376 


ASELTSQSPGSIiFENQKKGRCIAFLPC* 
KCNSE ILVHCWWGCKM\WKIVW* FLKTL 
\RLELVYDPASPLLAIYPRCI 


10284 


24185 


A 


10360 


451 


1 


CQDIKTI IrftNEQE Vii V X PS VTIvY VGK 
SFVSRRYTjWSGCVLCYAFVTR^KTWHH 
FPYRSPGFWGRSGGRQFDS* PKSQADFR 
APQGPPTPEPEGAERREPEIAPPPTADP 
SPKPT/HSPPRSGAPPSAPFPLLGRAER 
GVG\PGQGRPRP 


10285 


24186 


A 


10361 


2 


776 


FFFLRWSLALSPRLECSGAISAHCKLRL 
AGSRHS PASASRVAGTTGARYHARLl/L 
/YLFLVETGFHHVSQDGLNLLTS * SACL 
GLPK\C\WI YRREPPRPAYFI FFLRRSL 

PASASLVAGTTGAHHHTRLVLILL\FYY 
FLRWSVTLS CRLE \ AGVQWHNLGS LQAP 
PPGFTPFSCLSLPSSWDYRCPPPCIiANF 
LYF* *RRGFTMLARMVS IS * PRDPP PSA 
SQSAHDLFIVGQ 
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W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 


10286 


24187 


A 


10362 


188 


365 


CPQLHTQSNLINMDASQRYQFELKIQAL 
/ WAGTVAHACNPSTLGSRGGWI T*GLEY 
ATSL 


10287 


24188 


A 


10363 


146 


1036 


GWAVYQTLQAAWLGPRTESGVS FHGRFS 
LPHHP PQ P TLHAW PAPPLQ PEP CVPLRE 
DPFMPES W* LPPS I PAEGVS F* ECDFSR 
RMMGKGSPVPTVAPPLGPAPEPFCVTAG 
CAPMHTPTHTQ\PPHCSIFLPKISFKSK 
HFY*LLFCKCLSSSVFSIiPESILLLFIV 
ESVCQPGERCLALNMGWDGKGSGRRWAT 
KSQGVMDIAATQTPA/VFSPV/PATSCP 
AGVSGGK 


10288 


24189 


A 


10364 


229 


1 


PLENAKS I YWGKGRLLKNNARLFTVA* S 
WNQPQFSLMVDWIKKK\MWRW5TTTESCA 
AI*R/DE I VSCAGTWMELEAMY 


10289 


24190 


A 


10365 


186 


449 


RFFFFFPKFFWVFGLKSPFFFFG/RLGI 
* EKDGGFFFFFFFFPEMASGS VA\RLEC 
SGAVSAHCNLQFPGSGSLPASAPRVA 


10290 


24191 


A 


10366 


445 


3 


RVDNG VRGRYRGKRGERGR * TRWGWGHP 
QRWDQDRS RERPRETGRERAAGRRRETE 
AAETGRRRRERGRSDRNRN/ RGRPRERM 
IEAGRDAGRRER*ARQESWTEKKTARQE 
EGDRD/RQGHQETRAHKGWGNN*D*GRQ 
AERSENRRMY 


10291 


24192 


A 


10367 


1 


491 


NTAP FPTS LESAFWVAVQDRVS / LLS P 
GPECRGAI \LTHCC/IjRTPG/FKPISHL 
S/LPS/SWDYRHVPPPLDNLILCRDNVS 
QSGLKLLASSNPLTSAFPST/ YDYRRET 
PGPRASP*TQMQPCFMP/RS*SQP*TQI 
A/R*QSNWGSQ*VLNNRELAFGTMFFGQ 
RPKAFF 


10292 


24193 


A 


10368 


572 


40 


LHSFRQHFSNTYPQCTPGTP * VTQSLAA 
RPAGLPEPTSASGGAGAVTPPPRTD\PP 
SVRWSGKKGLDPVLRGGAPRPS PANPAP 
APQGPAPRKGTBCVPTASKTPRPGGRWE 
PAAP PHAHPGKDLQ P PSTLVKE CPEARC 
PRALTGDLP SSGENRNGLSAFQGQRRPA 
QKSGENCEN 


10293 


24194 


A 


10370 


36 


517 


PFLFSDIDFLFSFYFLSFFFF*KRGLIL 
PP/GPEGQGGNKG*WPLHPRG*REPPNL 
TPQTRGNKGNRQTGPVIPGILRKNGFPP 
FGQGGFKPPGLREPPGLALQRGGNNGQN 
PYP/P/PGKFF*RTKRGEKSPQPPYNKP 
NKKFNPAPMEEFSS FKEGRGPSLN 


10294 


24195 


A 


10371 


60 


403 


DVEQPKLRSYPTLKCKPRSAILTICPWA 
LQYRSHGLI FGFS WGFLSFETES CSVSQ 
AGMQWCDHGSLQT/ PTDSTSQAQAIDPP 
ASAP \QAAGTTGAt;tlllTCIiI FVFL*RWG 
LAKP 


10295 


24196 


A 


10372 


358 


33 


DSLGVPGAPLLSPGVTPHKQWFFRPYTQ 
I KKLGFFPQI FFGFKV* KAPFFFWGFGN 
LKKKGGFFFFFFFSEMASGSVA\RLECS 

ft AT *5 A WfTlT .OFPfS <5 ^ ^ T . DTQ APP VA 


10296 


24197 


A 


10373 


264 


3 


RQTSVSGSNDGKSWPPADIDTI \KYVIG 
NRLITTQQ* LIKLTSKQMITIHNTKGRT 
* SLILVSL 1 1 FIATTNLIJGLLPHS FTPT 
THCI 


10297 


24198 


A 


10374 


298 


384 


TLVI *KDGVHTTPHHQQHAYPGPKEYML 
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HKAPLLHSVSRDPSPFAQSSNCYNRSIK 
QSPVDPLTQAEPVPRDAGKMGKTPLSEV 
SQNGGPS /HK*GQ YSGDV 


10298 


24199 


A 


10375 


84 


518 


IMKLRQKSRSSLKIMRSKLQHTRAS/GT 

qlr/ pmiiggnf ialn/ahikklgrtqin 
nltsqlkelekqd/ rnpkargrqeit\k 
i * aelketet* kke iqkiybfrv/ wfu2 

NINKP\PASRLIQRMIERNTIRNDKGNV 
TSAPPDV 


10299 


24200 


A 


10376 


70 


363 


FHQKNLVPGLWVQLLGRRTLQSQHPPAP 
PPSSSPEVNVPSEVQAAD*LCCIFYDVK 
VRKCS TPQKKTRKRKKGCLF / CAPGQTR 
/CLC * KRGEKNLVGKT 


10300 


24201 


A 


10377 


299 


2 


Y P VAGRGNR* G CGAHALGVGGAAMP LG A 
TISSVSLTISSLCAFPRRGGLEPSIiPAP 
HTPPPPPE/P*ASSPSRGEWHLVPPCMQ 
PGLPPLPPPRLSPLV 


10301 


24202 


A 


10378 


302 


3 


WPTVATTCSCERKSHTSljT*NQKIiEGTQ 
LSEEGMSKAEIGQKPGLCHQT/VQWNA 
KKKLLKEIKNATPVNTQITRK\QNSLIA 
DT\EKV*VIQIEDQTSHV 


10302 


24203 


A 


10379 


125 


375 


NEYVFLEKSLKLAGRVGSCLLGRPRQVD 
HLSSGAQD* PGQRGKTPSLQNIQKLSRH 
GGMCP/RRLRWEDHI*S/SRGG+GCSETR 
ST 


10303 


24204 


A 


10380 


458 


3 


I T ISTI SCKATSFERAPRASRPP PDEWV 
SQAGPAPLRGWEN* GTRRRDDWPEASLK 
DPRRSGEPEAPL/PLRLML* SRLPPPLR 
WRRRPPPPDSHTPSPGNGKSPRVPRTGS 
KDKRPEALPP* FAAGEPLEANVP PVQHG 
GGSVRSRGSYLTAT 


10304 


24205 


A 


10381 


327 


2 


-LSAPKKFFFQKSLFLGANFRSPPFPGRP 
FTNRFSDLNVNPLTKKIYFSHKSSNPPT 
FFFFFFETGSNSVT* SGVEWRNHNSLRP 
ES PG\ SGDRP* V* HTTGAPYHTWLV 


10305 


24206 


A 


10382 


754 


1 


NTALETDTTFSGGGGGHQLCVRSRVSGG 
G*LIMPDLHNHPLQQTSGRNQRLQGEVT 
DPSPGMSGGAQTGPKAPPRPPSALRHSG 
SLQGQP * FAHLYCGTI P YPCYQTAPLPR 
APPPTPTGH* CPGEA\PQDELPVCGCSP 
WPRPPGQAQAGRNKH* FSGQRSSRDTTA 
CSPCSQSKQSPSRPPCVNDS/PGQGPGP 
KAPAGPPPALPEASLGSVHCSP*VGVMT 
PTRPGHFLVPGKVTLEQAPGGGASGGPV 
P 


10306 


24207 


A 


10383 


39 


433 


SHDQTYTEEELLFMDEQRKWFLLMESTP 
GEDAVNWE ITTKNLE YCINLVDSTVA/ 
GVSNFERRSTVGQMLSNS IVCYGKI F/R 
* KR/ S QSMWQTSLLF \NKLSQPP * P STP 
TILISQQPSMWRQDPPSEERL 


10307 


24208 


A 


10384 


283 


1 


NVFSCKPTSSCLISPDSPTPDRRFSKAD 
TYRNFMFPLRRHTHMHTHTYMHTHVHVC 
TCtf JTT .HT.P PH / 1 THGLS T . /TCAP PVTj*P 
PSFKPLHPTPV 


10308 


24209 


A 


10385 


12 


471 


SPPWWTHGFRYWBPTPPRFSVPHDALGQ 
FHTALFRLTRRFQLLSSCINISPHTCLP 
PCLiGLCVGGPIiEPAVCWAR/CPP +RGMV 
SGWKASPLG/CPPTLLPPC 
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nucleotide deletion, V=possible j 
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10309 


24210 


A 


10386 


287 


1 


SFCVRSRPAPTSGRTPFPAAPPFPAAPP 
FPAATTPSTAEDGVPG* G/ PAPAANCS P 
Ho SQKGo AAMG I P LPQAS / PPAPTPAP P 
SP*SLPFPPAPMY 


10310 


24211 


A 


10387 


336 • 


3 


LIEGKLTNRKDIRSKTASVRHHHQSPKV " 
DKPTKMGRNQSRKAENS \KIKSASSPPK 
ECSSSPATEBSWMENDBLREEGFR*LVI 
TNFSELKEKTGKLNPAALVCGRSEDQTT 


1 A "3 1 1 

1031 J 


24212 


A 


10388 


15 


470 


APSPDAMGR\ FTBEDKATITSLRCKVNV 
EDAGGETLVRLQWYPWTQRYFDSFGNL 
SSASAIMGNPKVKVHGKKELTSLVBAIK 
HLDDLKSTFAQLSELHCDKLHVDPBNFK 
LLVNVLVTVLAIHFGKE FTPEVQAYLHK 
MVTWANSLRYK 


10312 


24213 


A 


10389 


3 


438 


QTQRE PTMVLS PADKTNVKAAWGKVGAH 

SHGSAQVKGHGKK3/MALNNAVAHVDDM 
PNTLS DLNDLHAHKLLVD P VNFKLLNHC 
LLETLDGHLSAE FTP\AVNASLEQFTAF 
EITWLT 


10313 


24214 


A 


10390 


2 


444 


TMS FNTLAI CLDCLCSTLQ PTRS IPGYP 
SSPLPGNPTPPMTPSSSVPYMSPNQEVK 
SPFLPDLKPNLNSLHSSPSGSGPCDELR 
LTFPVRDGVV/ SGALPPAAQPGC 


10314 


24215 


A 


10391 


405 


3 


KMILTOiALPAQHCPPSACPQTMAQPPL 
PLSIKGAASMSDKLPYKVADIGLATWGH 
KALDIVEN/EMPGLMGMQEIjYSASRPLK 
GACIADCLQITVETAILIETLFSLGVQE 
QWSSCSIFSTQEHAVAVFAEAGMP 


10315 


24216 


A 


10392 


248 


5 


RLNKVGGHGGMYEHIiYVNKIiETLQEMYK 
FLD I YNIiPNLS LE E IE \NLNRP ITSNKF 
ESVIKHFPTKKISGLDGFTAEFLTNN 


1 AO %H 

10316 


24217 


A 


10393 


262 


3 


LPRDRQAGRSQGPWPQAVGRARKMPPF 
ISYQVYSKNFFEIESCSVTQAGVQWRNI 
SSMHPPPP\GSSDSPASASQAPGIPGAH 
HHA , 


10317 


24218 


A 


10394 


287 


410 


ALKGSSRILIMLGAWLTPVIPA/LWEAK 
AGRSSEVRS 


10318 


24219 


A 


10395 


175 


1 


GGREVDDEIPLVRESTIFHIFFFFFETE 
bno VAUAlaLaUW RDIjGSLPSS \PPG\SCH 
SPA 


10319 


24220 


A 


10396 


661 


963 


STLIAFIVISTLFPLLDMTEIYFSUiDE 
IVDTLGEGAFGKWECIDHKAGGRHVAV 

JxJLVJRJW vUKXUSAAKolSlU vJjtuiJLiNi Ili-P 

NSTFRCVQ\MLEWFEHH 


10320 


24221 


A 


10397 


1 


268 


PQTHREATMGLSTADWTNVKAGWGKGGA 
HAGEDGARALERMFLS F PTTKTYFPHFD 

JjorlLjo>iyiJi\JLxrivji\Jv v AilAli 1 W \AVAn V U 

DMRNAL 


10321 


24222 


A 


10398 


33 


302 


S RGATL I YVDNENGE PGTR WAKDGLKL 
ESGFb ±KAtiDGKSQVSTPRFGQTFDAPP 
AIiPKATRKASGTVNRATEES V\ KTKGHIi 
VQEHP 


10322 


24223 


A 


10399 


16 


233 


PTDYEFNTTTYRECGPRRPEFSTSLDLL 
SQPCRAVYM\VGEKNDIPFELRIVDLIT 
GKS I LGFGEPKSQEGDR ! 


10323 


24224 


A 


10400 


177 


1 


KLLDLGQPQSPALAACLEYSGAIAAM\C 
NLCIiLGSSNPPTSASRIARITGVHHAEP 
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peptide 
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Predict- 
ed end 
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correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenyla1anine, 
G=Glycine, ENHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
1 = 1 nreonine, V=V aline, 
W=Tryptophan, Y=Tyrosine, ! 
X=Unknown, *«Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














YA 


10324 


24225 


A 


10401 


185 


419 


HVNKDTLESHLVWKLKSALLGRKSESVT 
GPSCHPEPSDKTVEL/WQGAVAHACHLS 
TLGGQGGWITLAQEFETSLDNIV 


10325 


24226 


A 


10402 


104 


3 


GIILIPKPGRDTTKKENF/RPLIPIDAK 
ILNKILAN 


10326 


24227 


A 


10403 


3 


344 


SQVMAVAGPAPGAGARPRLDLQFLQRFL 
QILKVLFPSWSSQNALMFLTLLC/LTLL 
EQFGNYQVGLI PNQ YYGVLGNKDLEGFK 
TLTFLAVMLIVLNSTVRSFPLSVSLVIiS 
SPV 


10327 


24228 


A 


10404 


3 


404 


ARAQRRGDLSATGRNWSPLPPAGLPATV 
VLRHSGSLMAATCEISNIFSNYFSAMYR 
S \ED YTLASVP PAATBXaADDJjVJjXbSN P 
QTSLEGTEKANWMG\EQSQFWPKTQGLD 
WISFQAEKNKYEASAIGFPRWDK 


10328 


24229 


A 


10405 


254 


443 


NQLSSIMVMFKKIRSFEEDFNDPEKVYG 
SGDKVAG\RL I VEVCEDTRVKAVRILAC 
GVAKELRM 


10329 


24230 


A 


10406 


26 


461 


GEVARRS CCGAMACSTAJUKKLMAE YKQLT 
LNPPEG I VAGPMNEENFFEWEALIMGPE 
DTCFEFGVFPAILSFPLDYPIjSPPKMRF 
TCEMFHPNT YPDGRVCIS I LHAPGD/ DP 
HGLREQ\ PERWS P VQSVEKI lls wsml 
AEPNDES 


10330 * 


24231 


A 


10407 


1 


463 


QQAAKMAENSGRAGKSSG/ T/ PARGRGR 
C/ PAEQVI AGFNRIiRQEQRGLAS KAAEL 
EMELNEHSLVIDTLiKEVDETRKCYRMVG 
GVLVERTVKEVLPALENNKEQ IQKIIET 
LTQQLQAKGKELNEFREKHNIRLMGEDE 
KPAAKENSEGAGAKASSP 


10331 


24232 


A 


10408 


180 


446 


KKAAAS VSAASGSHLSNS FAEPSRSNGS 
MVRHSSSPYVRYPSDKPFLNSDLRRSPS 
KPTLGYPESNSRAIFSAL/XNLQD/KIR 
RLNTj 


10332 


24233 


A 


10409 


1 


417 


EAGLVTMEEIGILGEKAQDE I PALSVSR 
PQTGLSFLGPEPEDLEDLYSRFXKKLQQ 
ELEFLEVQEE YI KDEQKNIjKKE FLiHAQE 
EVKRIQSI PLVIGQFLEAVDQNTAXVGS 
TTGSHYYVRILSTIDRELLKPNASVALY 


10333 


24234 


A 


10410 


305 


53 


LEAQPHAPPLLGHHLPCWRIPSHLGTCR 
RPRAMRVQGSEEEGVA/GQAPPLPWVHR 
GSLPLVRRRPAGKPTGGNAREEKGKVEG 


10334 


24235 


A 


10411 


419 


2 


NNSRKFGNGTLKLiVi!; VE I YPL I S E S I PD 
KSELLLTPQDKKHAPLPSAVPNPSA/CP 
DLRPHTTPASPVS\PAKTVETRPSAPQG 
PLPGPVRRPRGRr SVRAASPQTbGRPGA 
PRPPGPALREATDAPRAATPPIAALAGH 
S 


10335 


24236 


A 


10412 


74 


322 


MDEMATTQ I SKDELDE L I EAFAKVDLS S 
NGFICDYELHELFMEANMPLPGYKARET 
I QKLJiLIX3DRNTDGKI s \yde FAYVSM 


10336 


24237 


A 


10415 


1 


415 


PfYTHRE PTMWAS ADKTNVMAAWGKVGA 
HAGEYGAEALERMFLTFPTTKTYFPHLD 
LSHGSAQVKGHGKMVADALTNAVAHVDD 
MPNALYALSDLNAHKLW^P/VNFMIjLS 
HCLLVTLV7UILPDE FTPAVHACLDKLL 


10337 


24238 


A 


10416 


19 


454 


APSPNAMGHCTEEDKATITSLWGMVNVE 
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peptide 
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residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
OCysteine, D=Aspartic Acid, 
&=G!utamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














DAGRETLGRLLWYP WTQRF FD S FGN \ L 
S SDSAIMGNPKVKAHGKKVLTS LGYAIM 
HLDDLKGTFAQLSELLCDKLHVDPENFK 
LLGNVLVTVIAIHFGIEFTPEVQASWPK 
KMNAVA 


10338 


24239 


A 


10417 


2 


395 


AIjSSSQPSAPMHPFYTRAATMIGEIAAA 
VSFISKFLRTKGLTSERQLQTFSQSLQE 
LLAEHYKHHWFPEKPCKGSGYSMIGVNP 
\KMISLILQACIEGGI 


10339 


24240 


A 


10418 


2 


374 


LSTPHAFGTGKWGTFISLLFLFSSAYSR 

gvfrrdahkseaahr i kdlgeedftalv 
viafaqhlqqwpfedhWqlandvtefa 
ttcaadessencdqslhtlfgdllcsra 
tldeshcvqdgrp 


10340 


24241 


A 


10419 


48 


347 


DPKAQLPEPLRVLWQLTYAMAAGSR/TS 
LL/LAFALLCLPCFKAGPPNRPLSRLFD 
HAMLQAHRRTTAIDTYQEFENLYPKD/Q 
QYS/FLMTPTSSALDSIPTPS 


10341 


24242 


A 


10420 


166 


1 


NFLKKPGPFQKFFQRGLPLIIiTFFFFF\ 
FETKPHSVSL\LECGGTVSAHCNLCLPV 


10342 


24243 


A 


10421 


2 


251 


LGCTQHRSQELVAAATSHQTCIQASEDV 
KE I / FARARNGKYRPLfdS I ENGQIMIG 
SY/SSQPSDSWDNDYDSFVLPLLEDKQL 
CY 


10343 


24244 


A 


10422 


251 


1271 


KEDLSPRAPMSGTQSTITDRFPLKKPIR 
HGSILNRESPTDKKQKVERIASHDFDPT 
DSSSKKTKSSSEESRSEIYGLy\QRCVI 
IQKDDNGFGLTVSGDNP\VFVQSVKEDG 
AAMRA\GVQTGDRI I KVNGTLVTHSNHL 
EWKL I KSGSYVAIiTVQGRPPGS PQIPL 
ADSEVEPSVTGHMSPIMTSPHSPGASGN 
MERITS PVLMGEENNWHNQKVE I IiRKM 
IiQKEQERLQLLQEDYNRTPAQRLLKEIQ 
EAKKHI PQLQEQLS KATGSAQDGAWTP 
SRPLGDTLTVSEAETDPGDVLGRTDCSS 
GDASRPSSDNADSPKSGPKERIYLEENP 
EEQKKG 


10344 


24245 


A 


10423 


198 


569 


QRNMVGQRLVEPRRLKPGFINVKSYNGD 
WEWHQGFFLVCFFFIjRWSLALS PRLECS 
GAISTHCNLCLPGFKQFSCXSIiPSSWDY 
RHAHHCTQL I FVFLVEMG \ FHHVGQAGQ 
GFFSLEKSLTII 


10345 


24246 


A 


10424 


2 


343 


PQTQRE PTMVLS PADGTNVYAAWGKVGA 
HAGEYGAEALERMFLSFPTTKTYFPHFD 
LSHGSAQVKGHGNKVS\DALTNAVAHVD 
DMPNALSALSDLHAHKLLVDPVNFKLLS 
HC 


10346 


24247 


A 


10425 


31 


342 


RAAVMPREDRATWKSNYFLKI IQLLDDY 
PKCFI VGADNEGS KQMQQI RMSLRGKAV 
VLMGKNTMMRKAI \ RGHLENNPALEKLL 
PHIRGNVGFEITKEDLTEIR 


10347 


24248 


A 


10426 


196 


2 ! 


KNLIISQTKKVICPNTLFLVFPKRVFFF 
FFFFETESRS/VSPRLECSA\SISAHYN 
LCLPGLSDSS 


10348 


24249 


A 


10427 


1 


405 


RLECWLEPPHGAGLQGLGWVWSCSVSTG 
PTMQALV\ LLLC I GALLGHMS CQNP AS P 
PEEGYPDPDSTRALEDDEDPIFIDPENK 
LPEAGSNFGYDLYRLRAIMSPTSNVLLP 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D==Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, j 
G=Glycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














P LMVATDLWDLS LEAQQLTE S 1 1 


10349 


24250 


A 


10428 ! 


223 


361 


EGPTEENMAAKVFES/ IGKFGLAIAVAG 
/GPVNSALYNVDVGHRAVI FE 


10350 


24251 


A 


10430 


3 


359 


IiTQREPTMALSPGDMTNVKAAWGBCVCAH 
AGDYGAEALERMFLYFPTT\KTYFPHFD 
LSHGFAQVKGHGKKWDALTNAVAHVDD 
MPNALAAGSDLHAHiajRVDPVNFKLLSH 
CLLVTLA 


10351 


24252 


A 


10431 


273 


1 


NHKDGKKQSGKTEKSKNQSASPPPKERS 
SSPATEQSWTENDFDEFREEGF I \RQSN 
YSELKEE IRTHGKEVKNLEKKLDDRLTR 
ITNAQK 


10352 


24253 


A 


10432 


211 


3 


SFLWKFCIiRGIPSHVSCQSA/LLlJGGAS 
QLGYSGVRDPLEEAVCLFSDLKIiHAGRT 
TTLFNAVRQGHLSLQ 


10353 


24254 


A 


10433 


281 


2 


TNQBKKKGDPNKIRSGKGDTTTDNTVTI 
IRDYYG /HTYTNKLKN/LEEVDKFLDTY 
YLPRFNQREIENMNQPITSNEIESVIKS 
LQTKKSPGPMAS 


10354 


24255 


A 


10434 


219 


1 


SFLWKLRLRGAPGCMRCQSA\LLGGVSH 
LGYSGVRDPLEBAVCPYSDLNLCAERTT 
TLFKAVREGCLSLQKF 


10355 


24256 


A 


10435 


133 


2 


FPKPPGNFFFFFETESHFVT \RLECSGA 
ISAHCNIiHLPGSSDSPA 


10356 


24257 


A 


10436 


248 


376 


KGGVFFFFFFFFBTESCSITQTGAQWHT 
\YGSLQPRPP 


10357 


24258 


A 


10437 


250 


1 


KGTRKELTFIEQLLSAKYWAAGCMHYFR 
GSGKYSNGYPVFVFVFIFSETESCS\VA 
QAGGQWCDLVSLQPPPP\GSSDSPGASF 


10358 


24259 


A 


10438 


354 


2 


KRGLKKNPFFNPRNMGEFLKLWGPNPKK 
I FKNCKI FKMGLKKKVLGFYKNSKNIiKR 
GKLNPFVIFFFFFFFFFFFLETGSYS IA 
\RLKCSNSGTNTAHCSPELPGSRDLPDS 
TTQVAG 


10359 


24260 


A 


10439 


126 


1 


gmydcvclfcfaikegrerbkerekerk 
rerqreker/ererdkerkreerk 


10360 


24261 


A 


10440 


29 


347 


IQKPTAOTKKFFKKKLKYTTRENHLHNK 
EDRKEGKKEEKTTKQHGSSRSLLINKTL 
NINGLKSPIKTHRVAE/WNKNKDPMLYC 
IQETPFTYKDTHRL/RIQGWKKI fh 


10361 


24262 


A 


10442 


405 


1 


IFFPHPFFFKKSFRIjSPQVEIQGIFLGP 
WKLFPPGFNNFS/CPNtiPNNLGFKGLPP 
RPSYFFRIFCKKKGF/H/RVYPEGFPFL 
ALGFFPLFPFKSWGFQGDPPSWGPFFFF 
FFFVLVAEGTLSEGCRLIRGRWKNLVP 


10362 


24263 


A 


10443 


122 


372 


LYNDP IS I YQVTLYRRVFFWVLYSANWS 
LRWKANFLFFSFFKMASDPVAQAGVQWC 
DLDSLQPQPP\GL\SDTPTLASQVSETT 
G 


10363 


24264 


A 


10444 


171 


3 


RHPTMAGSSEMFLGSGFGCRLTVFVLFB 
TESCSVT/RECSGAISAHCYLSLPGSSN 
LP 


10364 


24265 


A 


10445 


173 


3 


CMN WCfb 1 FXlfJjK..LN F ILjETESHS VSQ 
AGVQW\LITVHCSLKLMGSSDPPSSASQ 
V 


10365 


24266 


A 


10446 


187 


329 


S FLWKFCLRGVPGHVRCQS A\ LLG VASQ 
LGYSRVRNPLEEAVCPFSDL 


10366 


24267 


A 


10447 


27 


226 


YS YPVYLLLHIFLCWKCSLYF1 YFVFEM 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Phenylalanine, 
OGIycine, H=Histidioe, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R^Arginine, S=Serine, 
T=Threonine, V=Valine, 
w— iryptopnan, y=iyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ES CS FAQAGMKWCDLQPLPPG \SSDS LA 
PASQVAGVDAA 


10367 


24268 


A 


10448 


119 


296 


SPIHYILVI I ICFIHVILLI I IFSEVGS 
His V APAJjV \ SCjAI I AHCn HKL LGS S S P P 

vsvs 


10368 


24269 


A 


10449 


111 


363 


SliKKLVFFFSIYTHNFFPFFDPTLKNST 
RHS JjYIjSIjS I*S IiFLSHSV \SRLECGGT I 
IAHCSLNLPGSSDPHTSMSQV\AGPTGS 
C 




7yO*7A 


tT 






1 


UKr WGPGPNIwFKNIaSSAFFIjGRGKLFF 
LE I WGGFFFFFLVFFLRQSHS IAQAGVQ 
WCU \l*NSLCPJbPPRFK 


10370 


24271 


A 


10451 


179 


3 


GPFKLTLFFMGKFALGREKCHFFFFETE 
SCTY APAGLQWRDIXSSIjQPPP p \gs sds 
PAS 


1UJ / 1 




a 

A 




71 


232 


ooot onr t mv.TT'xrT Tyirr 7v vT\Tiur/"\m its 1 tta 
&bPIjSDl^rWiVIiEVIiAKDXMQEVB/ IK 

LILIGKDEVKWSLIVDKI ILYVKNSKH 


l\JJ 14. 




r\ 






3 


7\ r> Z™ 1 T VLM P T /*»/"* f* l^i i^i I i ijiji tin tjT n/Mirrn 

AKVjIsJjx i? JUciXvjViVar r r r riaVr FIjRQSHS 

iaqagvqwcd\lnslqplpprfk 


IUj / j 




n 
-rl 




■51© 

338 


221 


MvjriiHVGQASIjKJjLTS ICP\IiGLPKCWD 
YRHEPPRPVIPS 


10374 


24275 


A 


10455 


214 


366 


YNYKYLIHYYSYLFFIFIFF/NIIHYIF 
FILFFYLTSFIFLSYFFFFFITLF 


i rime 


24276 


A 


10456 


155 


2 


DRVSLLLPRLECNGAIIjAHCNIxCIjLGSR 
HS/PRVAGITVAHHHARLIFCIF 


IU.J70 


242/7 


A 


10457 


870 


1249 


EGPRWADHLRPGVQDHLG \qhgetps l \ 
QKIQKLARRGGTC/LCQSQLLGRLSQEN 
CLNLGDGGCSB PRLHHCI PTWGDE\GDS 


10377 


24278 


A 


10458 


222 


404 


KQRPGRAQWLTPVIPKLWEAELGR\MLE 
AK riiRP AWAT 


10378 


24279 


A 


10459 


122 


362 


LSEDQLNCYPQRLHHFI FLPVITRVQMS 
HYLCQQSCLFHFF/ SFFFLYFEMESSS V 
i \Kli£ul\^lSAHSNIiYXiPGSSQSSG 


10379 


24280 


A 


10460 


3 


102 


AASTLALSPRLECNSAILAHCKLH/LPA 
FTPFSCL 


llfJoU 




A 


10461 


133 


339 


KDS KFQGS KKPE I KANSLDUUiFYHNY I 
TRQCFTS YSNFFGDRVSLLS PRLECNGV 


10381 


24282 


A 


10462 


3 


193 


ASNMMTELKS Q I P I LTLNVNGLNAPLKR 
HRVAGWINDPTICYLQBTYFT/ CKNNHR 
LTAK 


10382 


24283 


A 


10463 


126 


314 


RIX3GST I KVQRDLSSDRSKPGRFL YTSN 
I VKKLjKicWKACl? 1 ISRVP/ PVNDI I 


10383 


24284 


A 


10464 


1 


332 


KKNLFLSKDCLSSLFRFSKSPPAILGPP 

HFGLKPVP/ CPPFLSKTRFLPGVPFFFF 
FFFETESHS VA\RLECSG/ AI S AHCKLR 
L 


10384 


24285 


A 


10465 


173 


365 


RFVCSTIKVLRDLSSDRSNPGRFLSLQI 
PP \ijKJuJKKWKAi FTK/RPSPVNDIIST 


10385 


24286 


A 


10466 


335 


3 


PFKKFYFKISTRALLEPVFFGKKIRVPP 
ORLS P I WAPLFKRGP PNTS FOVNOVPT .FT? 

FFFFVRHGLALSSGVECSGTRLECSGAI 
/ STAHCHLCHLDASHPPTSASWVARPTG 
T 


10386 


24287 


A 


10467 


818 


1008 


VHTDVLADI LKS INNAEKRQT/ RRCFLS 
\WLQGHCPFLTVMMQHGLCGVELGGLPG 
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SEQID 
NO: Of 
nucleotide 
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SEQID 
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M 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutaroic Acid, F=Phenyialanine, 
G=Glycine, H=Htstidine, Msoieucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

A-Plntamina D — A tfTl n inO C — Cafina 

V^FHjiuianiine, Jtv— Argiuine, o— oeriiic, 
W=Tryptophan t Y=Tyrosine, 

V- — l ]nlr nrwxrn A=fitnn Ci\i\ fin /=r»rkccj Kje 

A. — u n tvu own j — o u> y wu uuj / pvNi u 
nucleotide deletion, \-possible 
nucleotide insertion 














xtkjUC >— rv-Zvirrs. 


10387 


24288 


A 


10468 


111 


1 


MVS FGRPRQADHKV\GVRDQPGQHGETP 

CT T ITMTPT 


10388 


24289 


A 


10469 


204 


1 


LWGPILKIFCQILKTICEGNFRGPTFFI 
wt? PPMP VTO APMOWCDLS SLOPLP P 
GS\SNSRASAS 


10389 


24290 


A 


10470 


190 


330 


irpTif mnrj^TnCT/WTHFTFKDSORLKV 
KGWKK\IFHTNKNQKRIWT 


10390 


24291 


A 


10471 


162 


2 j 


IKGPKRAPPANKGRAFPFFFFFFETESC 
SCLQAAVQWHDLSSPQPPTP\GSSDS 


10391 


24292 


A 


10472 


120 


3 


SWGr r c err \1SVJloioViiyvv3vywwiw 
SLQAPHVMILVK 


10392 


24293 


A 


10473 


298 


1 


RRALPCPANFLYFYFLSLLLFFETEFHS 
\VAQAGVRWHEliGSLQTPPPG\SSDSPA 
SGBYSANYTGEYSANYTGEYSANYTGEY 

SVKi I\3fiiXOVr*I JLOEii 


10393 


24294 


A 


10474 


1 


380 


AVTGRAGSMVAPR \ PLRRLVLFYOGKLR 
SMAGNFWQSSHYIiQRILDKHDLIJJERQK 
DLKVLSEEEYWKLQRFFTKVIQALCBHL 

VLMAPTCVCLASTAN 


10394 


24295 


A 


10475 


289 


2 


TQGFPP INNS FYALYPKQKI P FFSDKLF 

FEMASGSVAQAGVQWRNLGSLQPPI\PG 
SSSLTTSAPRVAS 


10395 


24296 


A 


10476 


138 


402 


ITGRAPPKPFPLKRGYLSGGLTNIGFSF 
EST/SPDSSNS/PAPASRVAAIT 


10396 


24297 


A 


10477 


2 


336 


ARETNDTTQTRLLPANNTMTRSDPYISI 

t t»t KnTvrr'T kttdtITPV TfPini QWnvrKTYTT.M 
IjTIjK V WvjJjN 1 "a J\J1 \J\JK. v/4o W lltAWwiwi 

GCCLQETHLSSHETHNDTHKLKIKTWRK 

I YQANGKQKKARW/ 1 L ISDKTDFKP 


10397 


24298 


A 


10478 


51 


250 


RJ\V 0X1.P JxJSJW^Wl* xXilfX^lvJV. \iiouui xitr 

AKINFEWITDLNVREKT/IKLLKGNIKE 
NICDLGLGKDFF 


10398 


24299 


A 


10479 


216 


2 


tjij XxtlxAXiooJ J? JV ium? Vive JrHic oiujcc iv 

KNFSERFFFFFFFEMESHSVT\RLECSG 
PISAHCNLHLPGSSLV 


10399 


24300 


A 


10480 


191 


3 


FSTIX3NVNCLVKYS IYIYI YFFFFLJ.FF 
CTTMTTQfQ /VQPPT/RnNfiATw<lAWCTCLCLP 
GS CHS PARA 


10400 


24301 


A 


10481 


135 


768 


ijGGQVIaSCASQPVSIiFPLFFFIilSFLGC 
T.T.AFT.S ACF MTFS CVLGS PETS FSPCL 
HPPPKVLSSHP\PPCSRP/CPQPPKG/P 
PLPKHACPP 


10401 


24302 


A 


10482 


126 


333 


uyunno YPVMPFVT \ VLCLFLLV /WWW 
FLW / CS VYTVYWYGYAVVLF I FFFFFFS 
T ,T .rFAVCIiFVWliLDRF 


10402 


24303 


A 


10483 


240 


2 


" ATEKNMGLGTSVRGIjSGSNPSYGRVRER 
Trsri/n/rT.sffQrjTT.T.PfiPX t,pwedhTiGT/3G 

X \J*Jv7 V V - ' * fw >J ^ ■*■ 1 1 1 1 IxAJ*— \iXTvrf Ol/ilUVAlAJV? 

GGCSE prs SHCTPTWVTEQD P VL V 


10403 


24304 


A 


10484 


136 


2 


NKruiiforriCii ojnu vri^xivj v iiniu/u 
SSLQPIiPP\GSSDSPARA 


10404 


24305 


A 


10485 


161 


328 


EKNFKPYGSCRHFYIiFIYFETESRSVA\ 
RLECSGTISAHCaJIiRLPGSSN/SPAS 


10405 


24306 


A 


10486 


193 


347 


ILFSFIjLFIYFFETESCSVAQAGVQWHD 
HGSLRSRAPG\SYNAPALDSQGTE 


10406 


24307 


A 


10487 


114 


2 


GAWQS EVQWRDSGSLQAP PPG \SRHS P 
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nucle- 
otide 
location 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutatnic Acid, ^Phenylalanine, 
G=Grycine, H=Htstidine, Hso leu cine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine 7 S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X"Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














ASASQVAGLV 


10407 


24308 


A 


10488. 


314 


49 


IRRSTERSEPRYSMVCRRQPRQGGPCL\ 
RSQLLGRLRQEDHLSPIAGDCSEPRLH\ 
LACTPAWATEQE P VLKKMS PTLFCLFLW 
PLPCQ 


10408 


24309 


A 


10489 


291 


1 


KSPPPPNRRGLTLRSRGQIGNGVSFYYE 
APVTDFI FFFQSLVLSTQAGLQWCNLSS 
LQLPPPG\SSDSPASASRVAPVSQGHAT 
APQPGRQTLSSC 


10409 


24310 


A 


10490 


226 


3 


GSRCTMPRRRLAHAFPAARMPKRKVSST 
EWATEEEPKRRSARLSAEPAPAKVETK/ 
PEKAAGKDKSSDKKVQTKLV 


10410 


24311 


A 


10491 


2 


153 


ARGLVFFVYLVEMTLCHVAQAAVKPLTS 
SEPP\LGLPKCWDYRREPPRPT 


10411 


24312 


A 


10492 


123 


342 


THYLCLLRFFFIiDRGFLLLPRLDC/NAP 
ISPNLNIiRLPASSDSPASASRVTGITW 
NHYARLTLLFIFQNETIiF 


10412 


24313 


A 


10493 


194 


2 


IPGWPFKNFSLFPLFFFFFEMESCSVAH 
AGV/GSSDFPASASQVAGTTSMRINAWL 
IFVFFSRN 


10413 


24314 


A 


10494 


373 


1156 


ICVQtJCVYLFFFLRLSIiALIjPRLECSGTI 
LAHCNLPPSRFEQFSCLSLPSSWNYRRP 
PPHHARLIFVL/SVETGFHHVGQAGLEL 
LTSGDPPASA/FPKCWDYRHE 


10414 


24315 


A 


10495 


181 


3 


S FLWKFS LRGVPGRVROQSA\LLGGASQ 
LGYSGLRDPLQEAVCPFSDLKLCAGRTT 
TLFK 


10415 


24316 


A 


10496 


156 


2 


AASTTRLGDRDQPGQHSETPSLIjKIQKL 
G\SGGGHLRLRQENCLNP\GGRGC 


10416 


24317 


A 


10497 


157 


300 


IQSWFNILKSIHVIHHSNRLK/EVNHIV 
STDAEKAADKIQHPFLIKKQ 


10417 


24318 


A 


10498 


3 


289 


LVKKMLTGSTLGKSYRHSPFSINQGHNA 
LRKAAG\PLPRKAGY\LQGFSPLRYGL\ 
WDGKDLT IHQPDTREGS VLS RI S KRGRP 
L/CSCLPLGTECLSI 


10418 


24319 


A 


10499 


138 


1 


WYGLIYLLLEMESCYVAQAGVQWHDLPS 
LQPPPP\NSSNSPASSC 


10419 


24320 


A 


10500 


138 


1 


WEGLIYLLLEMESCSVAQAGVQWHDLGS 
LQPPPP \NS SNS PAS S C 


10420 


24321 


A 


10501 


200 


343 


LEAHSFLSFFIFII I FFDTKSHSVT\RL 
ECSAAISAHCSLLLPGSSNS 


10421 


24322 


A 


10502 


84 


339 


PRDGKRRDKQKQETGGEHYKS QLRGRLR 
QEDHVS PGGRGCSEPRSRHCTPAWAT\E 
SNSVSKKS i 


10422 


24323 


A 


10503 


141 


362 


EWNCCVCC/CCCCCCRCF 


10423 


24324 


A 


10504 


198 


320 


TIKKQKNTGGLIFNSYILPPLFLEPGDL 
\RFLDVDNRVKLPV 


10424 


24325 


A 


10505 


30 


356 


GYPCNP ILYYMSTVGFTTSLLDFDI PLE 
HP I LKVCPTHTRALRDLSSDRSNPGRIL 
STSNS S LY/EKDPRNKAYFTK/ rps pvn 
DUST 


10425 


24326 


A 


10506 


16 


375 


KKFFKKKKKKIFFPPFPLKNFFFPKRVN 
FFGRGWPKKSPPKKKvFFKKFPGGFKKP 
PLKKKKIFFSPPVKMGPPKGFFKKGPPP 
FFFFFFFFFFFFFFFFFFFFL/ LCIiIPR 


10426 


24327 


A 


10507 


156 


2 


AASTTRLGDRDQPGQHSETPSLLKIQKL 
G\SGGGPLRLRQENRLNP\GGRGC 


10427 


24328 


A 


10508 


313 


2 


QSKWLLEEESTLGEDAVNIVEMTTKDLE 



1184 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQ 10 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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X=Un known, *=Stop codon, /^possible 
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YYRNLVDKAAAGFERIDSNFESSSTVAK 
ILSNCIVC/Y/REILHEGKSRPMWQTSL 
S YFKELPQTTQHSAI ITLTSPRA 


10428 


24329 


A 


10509 


325 


1 


ECAKEMNAETKRNIAIDATBTQRIIRDC 
YEHYNKMENLEAMDKFLDTHNLPRLNHE 
EIENLNEPIMSNETETV/l/NSIiPSKKN 
SGSEGFTAEFYQRYKTPLILKLFKK 


10429 


24330 


A 


10510 


126 


14 


DSSGQVQWLKP I IPV/ LWEPEAGRSPEV 
RSSRPACPTW 


10430 


24331 


A 


10511 


435 


769 


PFKFPPIiSHACDGGSLFCRLRLECSGVI 
T\AHCKPQLPG/SQSDFPASTSQELGP/ 
TGACHYTRLI CCREEVIi\HVAQAGLiELL 
CPKDPPTSASQHARDYRH\GHPAQL 


10431 


24332 


A 


10512 


147 


412 


R FVCS T I KVLRDIiS S DRSNPGRFLSTS N 
SSLY/EKDKRNKAYFTK/RPSPVNE 


10432 


24333 


A 


10513 


251 


466 


QTCPTGS PASLLELFNSIATQWELVRSL 
IAGNALKDEND\SAVKMLASLIMSYKAA 
AAEDYKADCPPRNPGP 


10433 


24334 


A 


10514 


286 


1 


SFLRKFCLRGVPSRVKCQSA\LLGGASQ 
LGYSGVRYPLEEAVCPFSHLHLHAGRTT 
TLFKAVRQEHLS LQRI LLPLVGRVGRPN 
DSRCDRVIGH 


10434 


24335 


A 


10515 


28 


428 


RFVCSTI KVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10435 


24336 


A 


10516 


263 


2 


I KKPADS ESAGALI I*DFPASQTVRNSFL 
LLTSHSVCYFCYSSPNGLRQQLKIIIFF 
FFLRQSCSV\SRLECSGMILAHCNLCLP 
GSNN 


10436 


24337 


A 


10517 


62 


292 


PATLFSGVILPPTNS INAFSGSDSKSAH 
K/CEMLGTENFNFFTCHTHI IAEHSNST 

imr ftyi r fiin iiiiAflirTI 111 IffHTHHl IT T DVT? 

HTHTHTHTQTHTHTHTPIiijrYE 


10437 


24338 


A 


10518 


172 


2 


RGKKI FFPLEKGKNLVWW I TALFFFFFE 
TDSHS V\ TRLKCSGTILYHCSLCLPGSS 
DF 


10438 


24339 


A 


10519 


42 


194 


I LVETF \ CLRFVS LENLVKMRFLKLKKK 

Ty T /fl ■ ■ mm mm mm mm mm f/ll mr l/lfl/l/l/l \ W IF W T 

KKKKKKKKKKKKKSXKKKKKJKJC1 


10439 


24340 


A 


10520 


120 


I 


I FAKRWPI KFFPFFFFFFETVFHS \VAQ 
AGVQWCDLGSP 


10440 


24341 


A 


10521 


202 


375 


SFLWKLHLSGVPGHVRCQSALTGG\ASQ 
LS YSGVTDTLEEAVC P FSDLKLRAGRTT 
TIi 


10441 


24342 


A 


10522 


272 


402 


PKIFYVTKKAWNYYPYTITEYTCSF/ IR 
KFFIHIETKYEDNKGSN 


10442 


24343 


A 


10523 


147 


439 


LI LYVNYI SRKSTV YDSfcUitf UT 1 M Lil FiS 
KNCYKYHCNFLCT/SSSIPFPTPSTPSR 


10443 


24344 


A 


10524 


7 


415 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10444 


24345 


A 


10525 


321 


442 


TSFSFLFFF/IjETEFHSVA\RLECRGTI 
LVHCNLCLPGSSNS 


10445 


24346 


A 


10526 


477 


3 


KTTKMGKKQSRKTGNSKNQSASPPPKER 
SSSPAMEQSWTEKDFDKLREEGFRGSIY 
SKL/REEIRTNGKEVKNFEKELDEWITR 
ITNAKKSLKDLMELKASAQELRDECTSR 
AASANCVAVPQAFDRSHGVQ 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
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X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, V=po$sibIe 
nucleotide insertion 


10446 


24347 


A 


10527 


265 


411 


RPVCSTIKVLRDLSSDRSNPGRCLSTSN 

q /cDVDvnTroknr»vi?TT? /ddcddkt 
£>/ 9lrVKJ\lJJ\KXv&AXr lrL/KJroFKJN 


10447 


24348 


A 


10528 


81 


390 


RFVCSTIKVLRDLSSDRSNPGRVLSTSN 


10448 


24349 


A 


10529 


253 


23 


RIFLFYFHSFLRQGLARAEGQPHNHSSL 
QPP JrJrQj \ovjDP r Ah AbKAAR L I S AYHH I 
PLFFFRSYCVAQAKHNMLSLL 


10449 


24350 


A 


10530 


10 


398 


RFVCSTI KVLLDLSSNRSNPGRFLSTSN 
ooJuJC / ISIvjUIvKWIvAi r XJv/KPoP 


10450 


24351 


A 


10531 


289 


423 


RFVCSTSKVLRDLSSDRSSPGRFLSTSN 
SSL/QRKDKRNKAYFTER 


10451 


24352 


A 


10532 


152 


401 


PHCS IHSQVRI QANVHKQHRQRTYGS VI 

ftftTT HT TTt T~T Trim igOT nr\m Mrs m-r *Mm.r* 

PHILPIiHVLKKTFSLRDFHFSVSLKKNL 
VLTCLHLFL/ GVRTPRNDPFVSMMLLFT 


1U4DZ 




A 


10533 


2 


515 


WRISIiliLPRLKCSGPVSAHRNLYLP/G 
FKRFCSLGLPSSWDYRRAS PRVAKFVFL 
VDTGFHHVGQAGLKLLTSSDPPSLTY/P 
QCWDYRREPHLAAF 




24J54 


A 


10534 


469 


566 


PTKNTKISREWwCVPV/IWEAEARESLE 
PGKQRL 


10454 


24355 


A 


10535 


354 


47 


KPKNLPGGGGPPFYSSPSKGKKIPLPWK 
AKVPINQILPLPPHPGGGKKTPFPKKKK 
KKERKKKKMATVQKGM/ PHKYYHGKTGS 
YNVI QHAVGNAVTNRTRGS 


10455 


24356 


A 


10536 


1 


409 


TPKKRFFPKKPKGFFFPPPKKKKKIFSP 
PP KFW PPQKNFKKAPP PFF FFFFFF FFF 
FFFFLKSNPFFFFF/SKNP I LKPPL PPL 
FRVFPPPKKKKKKDP 


i n/t<£ 
1U4jo 




-A 


10537 


204 


435 


TSGSIiYKIjIiGMGDjjGIajKTTI FKQGGHQ 
NSS\GWHPEATTG/VNWAFKGLHWDPKA 
LAHLQLWNIAL/QER/FGKMTKACY 


10457 


24358 


A 


10538 


208 


382 


rfvcsti kvxirdlssdrsnpgrflftsn 
sslfer/dqrnkaffpk/rpspvndiis 

T 


10458 


24359 


A 


10539 


63 


409 


GWRLTNFPVGKTPSTSGTGLLFIPLLLN 
LMGRLKPSSISMKREFRKQNRGQKC/EF 

LKI FRGGPNFPGGEKI FFFFLGGGI KSR 
GGGFG 


10459 


24360 


A 


10540 


3 


1322 


RGYAWPNGALPASTVPCGFAACPGE FLC 
SVNGRQAPAGSSGFLPPVPSLCPHTVCR 

atfqckedstcislrrfpwgsethllcp 
AP I PS vpgvpcgtftpqcedrscvktls 

ASPLPLLLCPPHHLPPSPDCGLQGPSSR 
IVGGAVSSBGBWPWQASLQVRGRHICGG 
AL I ADRWV ITAAHC FQEDSMASTVLWTV 
FLGKVWQNSRWPGE VSFKVS RLLLHP YH 
ht. JJb HD x u VAJjJjs2 JjUHPWRS AAVRP VC 
LPAR/ SENREQGLQ/ CSWITGWGAliREG 
GPISNALQKVDVQLIPQDLCSEAYRYQV 
TPRMLCAG YRKGKKDAC /QGES PG 


10460 


24361 


A 


10541 


154 


3 


GVGNFFFFFLRWSLTLAQAGVQWRSLGS 


10461 


24362 


A 


10542 


3 


406 


LMVLWTAHLPALAPGSRTSLLLAF\ALL 
CLPWLQE\AGAVQTVPLSRLFDHAMLQA 
HHAHQLAIDTYQEFEETYIPKDQKYSFL 
HDSHTSFCFSDS I PTPSNMEETQQKCNL 
ELIiRISLLLIKSRIiEPVQVLTSMF 
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Predict- 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
OGlycine, H=Histidine, I=IsoIeucine, 
K=Lysine, LHLeucine, M=Methionine, 
N-Asparagioe, P=Proline, 

/"> * r> a ,.»!ninA C— Carina 

Q=Glutamine, R=Arginine, j>-oenne, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
XHUn known, *=Stop codon, possible 
nucleotide deletion, V=possible 
nucleotide insertion 


10462 


24363 


A 


10543 


3 


414 


HSSAHAYGTYESSGKRKYLQRPSRTAEL 
KKALKEKKNRLLLQQRSKSVTSSSSSSR 

nnnTiT>t\PPni7m?T?*POTOOGOPT^crvrnT7C 

PSSARDS S S Kt K rs rv » S 1 t>oooJilJi>LrilJJtSo 
SSSSSSSAYSTNSSSSFDSDSDSSS\SR 
GRSTSTDSSSADD/STNEEESELE 


10463 


24364 


A 


10544 


240 


509 


TFTSFI IGTPNLGEFCKCNNFGI /GKFR 
MKVACQ I ETLG I LS U'lf iffiTilS SR5 V I KJjjv 
CGGAILAHCKL 


10464 


24365 


A 


10545 


275 


2 


GCSRP FDTAI PLMGP YPEEKKLLFEKNK 
/ DTCTCMF IAAHFP VAKMW I QPKCPb IW 
EWIKNPVTHIYIHTHTYMCIYIYTHICV 
CVIHIYI 


10465 


24366 


A 


10546 


233 


2 


WSWMAIQSVLPQTAFTELKNKYSCIQET 
PILFYFLKQSLPIiSPSLECSGVIIAHCN 
/LQ/RPGSTTFPASASKVERTTGA 


10466 


24367 


A 


10547 


2 


420 


KKKIPNRGGCLKRDTKAFLKTPPMIiQRR 
TPPAFPKGPGGFLPDFNPLPPSKGNPPI 
YSWERFFPRFFKNI WAFVPKANI*LjS c r r 
FSIFWRTGSRYIAQAGLELLGSSYPPAS 
CIiPESWDSRR\DHRARr r l 1 


10467 


24368 


A 


10548 


190 


1 


AASTFFACVCFRLFFVFVFETES PSVAQ 
AGLQWHSHSSLQPPPNLG\SGNPPALAS 
PVAGNC 


10468 


24369 


A 


10549 


155 


1 


ARGEKYFLEIGGGFFFFLLVFFLRQSHS 
IAQAGVQWCD\LNSMQPLPPRVK 


10469 


24370 


A 


10550 


223 


392 


RFVCST I KVLRDIiS SDRRNP LJOf Lib X JjM 
/SLAVRKDERYKAYFTK/RPDPVYDIIS 
T 


10470 


24371 


A 


10551 


183 


3 


WQKLLFLFGTES CS VARVGVQWRHLSS P 
KPPPP \GSSNS / PASASRvA\ rUAHJUnA 
RLIFVFL 


10471 


24372 


A 


10552 


297 


10 


TDEEIiLCMEEKRKWFF \DMAS I PGEDTV 
NTDEMTKT / DLEYYINLVDKVSAGFERT 
DTNF\ERSSTVSKMLSNSITCYRKI/F/ 
RERKSQQMWQTS FLS F 


10472 


24373 


A 


10553 


509 


3 


LLTDRTIjSCRSVGVPCSVRCQCA\LIiEG 
ASQIX5\SRGSGVRDPLEEAVCPFPDIa2Ii 
RTX3RTTAIiFKAVRQGHLSLQRLLLSF\V 
CLCPAPRGGAYRGTQASLSCX^5LHPVRA 

I SDCCAS SQRDS / vgvgpskpgagynlv 
vrrf 


10473 


24374 


A 


10554 


262 


453 


GWGILTANIiLVFGGGSLSISERIFFSPL 
SD 


10474 


24375 


A 


10555 


112 


372 


KKKGGGPPFFFFFIFFLIRQRKAKLWL 
ACMT5T3nT D7Dn / t TTVVRPwnifTirkr&rtrvQ 

ArNKKW-iKiV.fc'U/ lc I lALrVV7JS.JL V<RJMVIJ> 

GFFIELVTTCGKNYLMCALAVIDPRDSN 

TTD<3 


10475 


24376 


A 


10556 


50 


358 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSIiY/EKDKRNKAYFTK/RPSPVWDIIS 

rp 


10476 


24377 




lUJJ / 




jjyj 


QLHADKVSRFATHAECRGVIMAYFNI^L 
PH/FKRSSCLSLWSSWDYRHAP 


10477 


24378 


A 


10558 


98 


387 


I QDTDL IRGVYRYRD KKRARE KDS P PER 
EGERERE IEKQRKKER/ERKKERKKEST 
HQKE 


10478 


24379 


A 


10559 


237 


3 


EVFFKNIFEIESCSVTQARAQGQDHGSL 
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SEQ ID 
NO: of 
nucleotide 
sequence 
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M 
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od 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=€lutamic Acid, ^Phenylalanine, 
Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, t=Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 
Q=Glutamine» R~Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=*Stop codon, /=possib!e 
nucleotide deletion, ^possible 
nucleotide insertion 














QPPTPG \ SSS PPTLASQSAVT TGVNHHT 
QSGLTFQSPKQKALGKMCISSGG 


10479 


24380 


A 


10560 


37 


359 


RLKCGGPVIAYCKLEILGSSNTPTSTSW 
VAGTT/TCHHAQ 


10480 


24381 


A 


10561 


170 


3 


RPFFLPWGGKKFQGVSFFFFFFLVFFEI 
GSHSVP\RLKCSGTITAHCSLDLLGSSN 


10481 


24382 


A 


10562 


1 


388 


LPEPLMVLRTANLVAMAPGSRTTLL\LA'" 

FALLCLPWLQBADAVQTBPLSMLYDHAM 

LQAHRAHQLDIDTYHELEETYIPEDQKH 

SLIiHDSQTSFCLSDSIATPSNMEETHQK 

SNLEMVRISLLLIETWLE 


10482 


24383 


A 

4 


10563 


224 


392 


CVDPILKWFPLRTATRQGCPLS \LFNI~ 
LEVLATAIRQEKEIKDP/QIGKKEVK 


10483 


24384 


A 


10564 


185 


2 


RYFGGPDSPQNGKTVFNPLFFFFETRV\ 
HSCCPNESNGTISTHCNLCLPGSSNSPA 
SASQVA 


10484 


24385 


A 


10565 


175 


324 


APGYGTKYRVI LCKII4I1LRRLRHKNHLN 
PGGRGCSEPRSHHC/ITPAWVTE 


10485 


24386 


A 


10566 


3 


367 


HELLWSTIKVLPYLRSDRSNPGGVISTS 
NSSLC/EKDKKNKAYFTK/RPSPVNDMI 
ST 


10486 


24387 


A 


10567 


128 


391 


GETCEGVYGPQQGGKQTWKVFHSKKDEG 
PCS KTWDVGLGSS PQFI FLKKVRFFFFF 
FETESHSVTRLBCSGTPGT/HCKLCYPG 
TS 


10487 


24388 


A 


10568 


251 


485 


SFLWRFPLKGVPGHVRCQSA\LLGGASQ 
LGYSGVRDPLEDAVCLLSDLKLCAGRTT 
ALFKAVRQGHLSLQRFLLPLFSY 


10488 


24389 


A 


10569 


176 


3 


FVPPILKKMGPFLKKKEGKPFFFFFSEM" 

ESRSVT\RLECSGA/TISAHCNIiCLPGS 

SNS 


10489 


24390 


A 


10570 


212 


386 


RFVCSTI KVLRDLS SDRSNPGRFLETSN 
SSLY/EKDKRNKAFFTK/RPSPVNDIIS 
T 


10490 


24391 


A 


10571 


134 


489 


SLFFLPSPRDRWSLTQSTEMPPSWKFPG 
REIRSPLLAMLHSLLLQPTAFLCPSSTL 
LLSLKPDSLRTGSLFLSFFSFLYFEACS 
VA\RLKPSGTISAHCNliCLPSSSDSPAS 
AYIVSG 


10491 


24392 


A 


10572 


1 


470 


GQSRGIPIiLLTLDLEKPVSLLLSVTNLY 
SKNSAQFSTILQTLSFPATFTPSPSIPL 
S SAYFFFFSDRVS /LLSPRQECSGLNLG 
AHCRPQPPRFKRFS CLS PSS \SWGLTRR 
HTTTPGL I FCI F/ S VEAGFHHVAQTGLE 
LLSLSNPPASA/FPKCWDHRR 


10492 


24393 


A 


10573 


242 


410 


VMLRMQ I YFSLHNP I FFFFETGSHFIA\ 
RLECSGMIPAHCNLCFPGSSNAPASASQ 
V 


10493 


24394 


A 


10574 


236 


1 


ARTFFIH I LKASCGDNAITGVLWHRATW 
CPSVLLDRDSLSERVCEILCIHAC/VMR 
IHEHTHTY I YAHTHTHTHTHTHT 


10494 


24395 


A 


10575 


241 


35 


RSSYLFI FNFFVETRS VAQPGLKLCCSA 
FLQWL I I DQ AGNTN 


10495 


24396 


A 


10576 


294 


72 


FKJJFKSPLFSLFFFLILGTRFCYVAQAG 
LK\SWPQGDPSALASRVAGTRGVHRHTQ 
LQVSFNYYKVLAMHSGSQL 


10496 


24397 


A 


10577 


1200 


1625 


PDGWSYSSGHSWIiPEEASNGRKAWSSCW 
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NO: of 
nucleotide 
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SEQ ID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Y=possibIe 
nucleotide insertion 














VPPQLSPPQAPPSPATHIjGrTirPi-iA/ of 
PQAPPPPATHU3PAPPPPLSSAP\PPPD 
TLLGPAAPPPPSSGPDPFRYSP/EVPPQ 
LSPPQAPP 


10497 


24398 


A 


10578 


226 


325 


ERHTHTPHSHTRSHTRLTHTHA/YTHTH 
AHTHA/YTHTHAHTHFPS 


10498 


24399 


A 


10579 


108 


320 


TINVFLKVNVMKLSYIiRFNTRNCSFFFL 
AGVQGCDHGSLQLQPPG/ SSCJ->PF;b£jA£> 
RVAGTTGAYQHTQLIF 


10499 


24400 


A 


10580 


1 


288 


ARGERERERERERERERERERERERERE 
REREREREREREGLS FFFSRSRGGGKLW 
CVC / RPS ARPRERERHAVFS AP PSLQKK 
KKVFGGREGTHSLFYPPCGNSIiRR 


10500 


24401 


A 


10581 


1 


476 


REWGLALSPRLECS WI JAHC-o Ubb IAjIj 
KRFSOjRLPSWKDYRCAPPCPSLNLFFV 
ET\GPHx VAlrGVfoljlji/jioot»/ ffFJjAof 
KCRDYRHE 


10501 


24402 


A 


10582 


210 


346 


PHCS IHSQVR I QANVHKGHRQRTYGS VI 

ntITT TIT TTT 7T V Wi* / 17CT 


10502 


24403 


A 


10583 


316 


461 


LPNILGWLFFVSETGFCSVTHAGVQWC 
DEHGo LtQ byi r\» \o oUJjlr 1 


10503 


24404 


A 


10584 


172 


383 


NLDLYLTSYIKINSKWIIDLNVIAKTIK 
LLE \ KNVAGENIPDLGLGKEFLD\KTQK 
AL»S \TNKKiUKXi£4r IK 


10504 


24405 


A 


10585 


273 


374 


GPGTEAHTRNPSTLGGRGGAIT/R/GQE 
FKTSLAKM 


10505 


24406 


A 


10586 


150 


344 


KLKCKI 1 1 KLG I KI S 1? A V X V V V a XVarHJr 
CYLGWVQW\PIRTHPGSSDPPASASQSA 
GITSMSHHT 


10506 


24407 


A 


10587 


199 


3 


SFS YIRVHLOHX J^TRHKAbCJfi V XA1N UK. 
AKGIITGIKKK/RFWPGTVAHACNPNTL 
GGRGRW ITRLV 


10507 


24408 


A 


10588 


216 


1 


PRGAPTCMRCQSAL \LajKj\J JJuLJjl l\» \ V 
RDPLEEAVCPISELKRCAGRTTALFRAV 
RQGRLSLQKFUjPFLV 


10508 


24409 


A 


10589 


190 


3 


ISAQPWAAFEKRFIFIYRAPA/IiKFMIF 
FSPFKGVP r FFTF F F rFBMK bKb V/uCL li 
CSGV3SAH 


10509 


24410 


A 


10590 


369 


40 


KTERNS ininkndvhtktps KGRQHQRP 

KADKSRBMkKNQHJUWUSWo ti£t y tttfoatftr 
RDHNSSPARKQNWMENEFDELTEVG/FR 


10510 


24411 


A 


10591 


114 


398 


RFVCSTIKVLRDLS SNRSNPGRFLSTSN 
SSLC/EKDKRNKAYFTK/RPSPVNDIIS 

rp 

1 


10511 


24412 


A 


10592 


132 


1 


I L YLWCVCVCVCVC VCVYLL ILV/ CKVL 
S CLGE PALSVGAFQMS 


10512 


24413 


A - 


10593 


3 


287 


IGFITLNVGLYIiYLLCYIYVYFFQVNYI 
VFILVLAVFLLKNRLLRYNTALYNEEVT 
REKISHYF I / NS 1 1H I vlalU. JS JSJNJJ 


10513 


24414 


A 


10594 


2 


363 


RAEVGVMAPLYLLIiIiSGALGLTDTWAG 
SHSLRYFSTAVSRPGRGEPRY XAVE xVD 
DTHFLRFDSDTAI PRMEPRKPWVEQEGP 

qnweXwttgyakanaqtdrvalrnllrr 
ynqseage 


10514 


24415 


A 


10595 


156 


3 


NGLELDS DDGC IALLF F YFETEFCS \va 
QAGVQWCDLGSLKPQPLGFKPTRP 


10515 


24416 


A 


10596 


80 


306 


MDEELLFRDEQRKWFXjEMESTGEVAVNI 



1189 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Giutaraic Acid, {^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
KNLysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=vTyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














V/BMPI KDLG/ YYRNLVDKAVPGCERVD 
FNFEISSTGWVWWLTPAIPVL 


10516 


24417 


A 


10597 


232 


389 


ILCWVFCCCCCCCPETE FLVIQAGVHWC 
DLDSLQ/SPPPGSSDSPASAS 


10517 


24418 


A 


10598 


463 


154 


MGQGKPRFPLFFHPFYI FFFLKQGFSLS 
PRVQGRAKKVPGTLASWGQKI LLPQPPR 
/RVRQENCLNPGGGGFSEPKI PPSFPAW 
GEKGGSLLKNIiKKKKTKHIC 


10518 


24419 


A 


10599 


208 


38 


LGPPQGFFKTAP PFFFFFETESRSLAQV 
GVQWWDLGSLQLPPPG\SSDSPGLSKKF 
LL 


10519 


24420 


A 


10600 


200 


3 


CQPELSCMRC\RQSLLGGIiSPSGGMEVR 
DPVEEAVCPLAELKRCAGRALLVRI CCS 
LQSQQAGTFK 


10520 


24421 


A 


10601 


73 


427 


RWCSTIKVLRDLSSDRSNPGRFljSTSN 
SSLY/EKDKRNKAYFTK/RPSPGNDIIS 
T 


10521 


24422 


A 


10602 


193 


401 


GEVSLLSPRLECNGVISAHCNFR/LPGF 
KRFSFFFL 


10522 


24423. 


A 


10603 


181 


395 


RFVCSTI KVLRDLS SDRSNPGRFLSTSN 
SSLY/EKDKRNKAYFTK/RPSPVNDIIS 
T 


10523 


24424 


A 


10604 


384 


10 


PRRPGGECVHCLGDSLVTDRRYSGLRSS 
DQTLSGDPMEAVPWSNSLSCSHYRIiKPA 
SGITWGCKQAGFFS FFLFFKMESHS V\T 
RLBCSGAISVPCNLCLPGPSNS PAPGGS 
LGPRSSRPPWTT 


10524 


24425 


A 


10605 


215 


379 


RFVCSTI KVLRDLSSDRSNPGRFLSTSN 
SSLC/EKDKRNKADFTK/RPSPVNDII 


10525 


24426 


A 


10606 


1 


163 


QTHREPAMVLS PADKTNVKAD\WGNVDA 
HAGEYGAEALERMFLSFPTTKTYFPR 


10526 


24427 


A 


10608 


283 


2 


IQKHGLI IRCPQETPFKNNDVGMQKVKG 
RGEKS IMQIL/ YFFFFLNS/HLSPRLE/ 
CSGAISAYCNLRLSHSSDSPA/VDGTIG 
ACHHTQLI FLFLVE 


10527 


24428 


A 


10609 


144 


327 


CSWSWFCGCVRVSTLIKKKNFFFEMEFR 
S \VAQAGVQGRDFGIIiKPPPPG\SSDSP 
ASASRV 


10528 


24429 


A 


10610 


137 


1 


HKCFS 1 UhWUisR V / WCMYTKEYCSAIiTK 
EEIMSFGKTWMEIiENIMI 


10529 


24430 


A 


10611 


201 


334 


SFFWRFCLRGVPSRVKCQCA\LLGGASQ 
LGYSGVRDPLEEAVCPF 


10530 


24431 


A 


10612 


1 A£. 

146 


2 


FIjKJUaljJvKYJt'r r rr r B j. BSCSVTQAG 
VQWHNIGSLQVPPP\GSRHSP 


10531 


24432 


A 


10613 


205 


259 


TTMPGLSFCTRTKSNKNIjS FIiTLDGSAR 
VDLFLYLLLFRTIFQNF/CGRDKVLLCC 
PGWSQTPELKQLICLDLPKYWDHRYEPL 

r*T TV VT) T\ T rWKTT T" V T 

CLiAlF Jf Alxivrl it 1 K± 


10532 


24433 


A 


10614 


153 


2 


GVLMVSDGVLSMLANLFFSFLFF/LFET 
QSHS1 \SKMliLb(iV ISAHCNIjCLP 


10533 


24434 


A 


10615 


213 


8 


KPPFFFPFFFLFFFLRWGSHCWSGY/C 
KGTI IVHCGLELLGSSCS PPAFQVSGIT 


10534 


24435 


A 


10616 


146 


1 


ILTIRDYWVKITLFFFFEPKFHS/VLPR 
LECSGAI SAHCNLCLPASS DS 


10535 


24436 


A 


10617 


208 


2 


NNFWFSSSGKYGQYFSSETESRSVTWGH 
DLYSLQPPPP\GSSDSPASASQVAGITG 
SCHHAQLILVFIVE 
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residue of 

peptide 

sequence 


\mino acid sequence (A=Atanine 
C=Cysteine, D=Aspartic Acid, 
C=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methioninc, 
N=Asparagine, P^Proline, 
i£=vriutaniine, K-rArginiue, a— owmc, 
T=Threonine, V=Valine, 
vy — i rypiopnan, i— *i.yri»iirc» 
X=Unknown, *=3top codon, /=possible 
nucieonae ueienon, i— pu»iuic 
nucleotide insertion 




10536 


24437 


A 


10618 


120 


2 


qfTPPPFFTR FR WAOAGVOWHDLSSIOP 
PPPG\SSDSPSL 


10537 


24438 


A 


10619 


162 


1 


QpmAna",PT,RRAPfiCIMRCHSA\IiLGGVSQ 
LGYSGFWHPLEEAVCPFSDLTLCAG 


10538 


24439 


A 


10620 


193 


419 


TFFFFETESRSVAQAGVQWRDIjGSIiH/ s 
QPGQQSETP 


10539 


24440 


A 


10621 


313 


460 


ECGGTISAHCNFCLPRSSDFSA 


10540 


24441 


A 


10622 


243 


505 


PTGCPKQEQVQAMLRPWFFFVFETQFN 
S / VPRVECSGTI SAHCNLRFPGSRDSPA 
IiASRVAGI CRR / CATTAQLI FVFLVETG 
FCHLVG 


10541 j 


24442 


A 


10623 


67 


430 


LARRTWKNQHSSTHGCDLFFFFFFFFFF 
LIKKGWFFPPEGGHGHRFFFINIKFLN 
vtmoucnT T?Kfn\rr.MTQ\/RPPIiAFFFFFF 

YKJvT r J? tJJjc C»JJ VX4IT1 J-O VVJlr it uiu c c c c ^ 

WRGGFSYLLLRCV /LTFLALGAPLFAPM 
FFLFLFGESL 


10542 | 


24443 


A 


10624 


3 


207 


ESGVGKSSLIiLRFTHDTFDPELAATIDP 
SSWNPARNAATP 


10543 


24444 


A 


10626 


151 


380 


KM! it*\iyjV<tjWi-ii. It X XJrJ^WJCti-Mi^vJVgJJJfa \ 

LRSSRPALKRSETSSPPSLPOQKKKKKK 
KFPRAGGQMLEVP FPGRVGAG 


10544 


24445 


A 


10627 


210 


3 


" xT^rv^^7Trr , irwv , ni?QQr»TKGFGKNKK]CKGG 
SFFFFFFETESHSVT\RLECSGTIIVHC 
SLDFPGSSDPPTSA 


10545 


24446 


A 


10628 


237 


2 


"t put epaorcDVTnWTRVNGVNPTPII 
LTNKTNTFGPLFLI IFFFGETESLSIA\ 


10546 


24447 


A 


10629 


h — 


352 


RGPLSIQDYISKALFFLLLFGFIiIJ^IiLF 
PG\RS PSDPPASAS 




10547 


24448 


A 


10630 


48 


200 


tTt itditmi?t .t?I?qt?TK55SS VTQAGVOWRDL 
SSPQPPPP\SSSDSPAPAN 




10548 


24449 


A 


10631 


209 


376 


SFLWKFCIiKGVPGCVRCQSA\LLWGASR 
LGYLGVRDPLYEAVCPFSDLKLCAGRTT 




10549 


24450 


A 


10632 


141 


1 


" AGFFFFFLFFFLETESRFVA\RL\QCSG 
AI TAHCNLCLWGS SNYHAS 




10550 


24451 


A 


10633 


18 


522 


" PLYSLTKRHTPREA/ KDNLKSTQLLSVI 

EGNTNISGVTWFRAGEQEQTPPEGFES 
c r* qt?t\7T /STRVTrVTYTP I TRT ITS AN XDR 
IiRFTFGVQALVETTS KGDRNPSEVRLLV 
Q IQRNGG WVTE KD I TI KGKTT SQ YLAS V 
V 




10551 


24452 


A 


10634 


127 


2 


" NLSPLFFFFETGSRSVTQAGVQWSDLGS 
LQTPPP\GSRHSPTK 




10552 


24453 


A 


10635 


336 


452 


SPPP\GSSESPA 




10553 


24454 


A 


10636 


259 


3 


- GTRIFFPFNSSPLSFPPVQKKSVSPFFF 
SF/LFFFFETESRSVT\RLECSGAISPH 
CKLHLPGSRHSLASATRNVLRPLV?RIHA 
SCA 




10554 
10555 


24455 
24456 


A 
A 


10637 
10638 


96 
1 180 


413 

473 


GDNTSQHS AILINYFI FFFTFLFF / ETG 
SHS VAQAVRIiE YS S VITAHCNFRLRGS S 
NPPKKKKKGGAVLKDPWGGQSLPGLATY 
YFFPYRGANKNLLGDFWEGPLFV 
CYMGRKAVa^VIjSNLLTLKKKKKKKKKKK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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od 


SEQID 
NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny1alanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, V=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














KKKKKKKKKKK/ EMPVKQKNRGRALYKK 
KKGGRPFPIFWPFFFFFFLGGGKKHPGG 
FFRKKFFFGGGKKKK 


10556 


24457 


A 


10639 


312 


480 


ETVDSTSGEDDVNIAEMATKDSEDYXNL 
V\GKAVAEFERTDSNFE\RGKMMSKSI 


10557 


24458 


A j 


10640 


2 


516 


MKPLENLSQTASLARGATLLRPVLRRLC 
GLPGLQRPAAEMPLRARSDGAGPLYSHH 
LPTSPLQKALLAAGSAAMALYNPYRH/E 
ERPRISTSTLDLGKLQSLPEGSLGREYL 
RFLDVNRVSPDTRAPTRFVDDEELAYVI 
QRYREVHDMLHTLLGMPTNILGEIWKW 
FEAVQ 


10558 


24459 i 


A 


10641 


56 


455 


PLCSYIQFIYHSKFNSNQISLVHTIPIiS 
KLFGYFSQLYS I YS YLCQHFIHLLS I FI 
TVFCIVFYWVFVHLISVPMYICVCl^CTN 
VCVCV/CCLW 


10559 


24460 


A 


10642 


272 


33 


GRWFLFLGPAKYFLTGGRFPSDFFGPLK 
INPPFFFFFFFDTESHSVAQAGVHWHNL 
GYLQ/SPPPG\SSDSPAPAEFHHTV 


10560 


24461 


A 


10643 


285 


1 


MGNFLKRFFKNEKKFFFPILRAHPFNFF 
PGKSPFKTPRRALPLGVPPKHPFFFFFF 
FSETESRSVARLECSGMISAHCNL\NLH 
LPGSGYSPAL 


10561 


24462 


A 


10644 


100 

* 


466 


FLLKFRIiREVPSRVRCQSAMiLGPTSQL 
GYSGVRDTLEEAVCPFSDLKLCARRTNT 
LFKTVRQGHLSLQRFLLPFVQLCPVPRG 
GVYRGRQASLSCSGVHPVRASPPLCLPK 
VPPPS 


10562 


24463 


A 


10645 


260 


460 


LKPHAERETIDKGRLPYYSFFFFETESC 
FVAQAGVQWHDRYSIiPPPP\ PGSEGSRA 


10563 


24464 


A 


10646 


360 


504 


Q IGH I CAYVEKTELRL I FFFLFFF FRQ S 
CSVT\RLECSGTILAHCNLCL 


10564 


24465 


A 


10647 


241 


45 


WE VEVPRVAPCHI LKRD YYFI ILFLFLF 
FEMES YSFT\ELECSSAISTHCNLCPPG 
SRDSPGIPPH 


10565 


24466 


A 


10648 


76 


462 


FLWEYQEKAHILWSLFSKLiIIjSRSRFTV 
KEKVRQKPGILFIYIiFIFEMESCSVAQA 
GVQWRDLGSLS PQKKKDS/DQS KAI TFV 
EGINSKRGGWTGPHF/HCSLKMIFLI 


10566 


24467 


A 


10649 


3 


359 


QTQRE PTMELTPADKTNVKAAWGKVGAH 
AGEYGAEALERMFLS FPTTKTYFPHFDL 
SHGSAQVKGHGKKVADSLTNAEAHEDYM 
LNALTALNDIjHAHKLWVDQVNFK1i\LSH 
CLLVTLA 


10567 


24468 


A 


10650 


266 


3 


TPEKKKKIGGLGAPFPGYPKKIWPPNFF 
FFKKTKFFFFQRVGWAPPPKSFFFFFFF 
EVESCSVAQAGVQWRDLDSLQPSPP\GS 
RGSP ; 


10568 


24469 


A 


10651 


278 


461 


KKIICGFFFDTGSCFVAQAGQQWRDLRS 
LQSPPPG\SSDSLTSASRWGMSRHMRA 
VF 


10569 


24470 


A 


10652 


161 


456 


VFFFPGENIAFVENKTTLSYHFMFTRMT 
AAKKMENYKGCWG WET I GS FMP CWLE S K 
RVQSLW\KVWQFFKWLNVKLIiYGLAI PL 
TGICPKKLKKYVQGY 


10570 


24471 


A 


10653 


91 


486 


PCFNHGHTTWYMYFFSLSLILFFVFSLK 
PVI LAYKTFS FPITIRVLLQRRHRQKFV 
LDGFNGVDQNNILGS ICVSLIKI FFLTE 
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peptide 

sequence 



Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
^Glutamic Acid, F=Phenylalanine, 
OGIycine, H=Histidine, Msoleucine, 
KHLysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argjnine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, Nppossible 
nucleotide insertion 



S CFVAEAGMQWCNIGSLQAPRP \GS CHS 



10571 



24472 



10572 



24473 



10573 



24474 



PGISPPVLG 



10654 



267 



470 



NXiHMRCE KEQNFNKLS FKDLPGLGWLWF 
FIFCFWRRII*TLSPRLEC/RG/MIFAHC 
NLCLPGSGHSPASA 



10655 



775 



1401 



TFPSQIRYGLRKRPNSLTLSPRLECSGA 
ISAHMQRPPPGFTPFSCLSIiPSSWHYRR 
PPPRPAICVCVCVCVCVCLVETGFHRVN 
QDGLDLLTS/S/IPPASA/FPKCWDYRR 
E 



10656 



287 



454 



LPLIYFLYFYETECHPIAQAGVQWRDLS 
SLQSPPP\GSNDSSASASPTVPTSAM 



LFLSKFYFFEMESCSVAQAALiQWCDLGS 
VQPLP\PGSEDSPASAWGYLD 



10574 



24475 



10657 



212 



64 



10575 



24476 



10576 



24477 



10577 



24478 



10578 



24479 



10579 



24480 



10658 



368 



473 



GFIDHTRRERPECP/LTDEWIKKMWHIH 
TTEYYSAL, 



10659 



358 



10660 



10661 



10662 



IIiAVFCFCGSAPGFEGGKLTPFFNRGRE 
TFLALVKTPPPGKARGPVFISKNKKIPE 
FKQPPNPNPNSFFFFFFETESHTIARAG 
VQWHNLCSLQSPPPG\SSDSPGRWSLQR 
TKIAPL 



235 



KRDLIRHYPKEDIYMANKYILK\CSTLT 
MHTETLIRTTMRYHLILIKVTIFKKTDN 
MTAVGVHI CNPNTLTGHGRG IA 



538 



CVTVRIPSRPTRPIiSSDRSNPGRFLSTS 
NSSLY/EKDKRNKAYFTK/RPSPVNDII 
ST 



374 



38 



SFLWKLHLRGAPGCIRCQSA\LLGGVSQ 
LGYSGVRDPLEEAVCPFSDLKPRAGRTT 
TLFKAVRQGRLSLQKFLLPFVQLCPAAR 
GGVYRGRQAS LS CSGLHPVRASRPLCSR 
R 



10580 



24481 



10663 



268 



47 



ALIPLSSLTISAFHLIiLLETVSTSSPlOl 
ECSGAITSHCSFNLPDSSPSPTSASR/V 
IGTIGARHQAQLMFIYFC 



10581 



24482 



10664 



64 



451 



FSSERKSHMSLTLNQKLEMIRLSEEGLS 
KAKVGQKLVRLHQTVSQWDAKEKLLKL 
IKSATPVNIGMIKQHNLIADIEKFGMIW 
TDCQTSHRTVLCQRLIQSK/ALTLFNSM 
KABRGKEAADEKLEVRRG 



10582 



24483 



10665 



355 



474 



ILFYFIFFETESCSVAQAGVQWRDLGSL 
QAPPPG\SRDSP 



10583 



24484 



10666 



317 



481 



GHTCPWQTFFFFLFRDRVLLHHPGWSAV 
TQSWLTAALT\FGPKRSSCLNLLNDWD 



10584 



24485 



10667 



57 



329 



10585 



24486 



10668 



86 



10586 



24487 



10669 



254 



VKNTQWG KDS L FNKRV \ FKNWAS I YRR I 
KLD\LTSYAKINSKWIKDLNVRIiEIVKV 
LQVEYPSFKILGNGSVLDFVFFYSGIFA 
IjHLMGEHP 



468 



EaJYKIPLMGGKNFLLFPSIPPYFFFFSR 
LGLTLL PRLKCSGDHCSLQPRP PGLKRS 
SCL\GFPKCWDYRNEP/CVPR 



25 



GSHI CKVPAAI YSNTCTS S EHGDGGWG 
GCSSGSTTHPSPVADSFFVWETVSL/S 
VTQAGVRWCDLSSLQPPPPGI 



10587 



24488 



10670 



206 



YGRPRKLCKIGLBCSGFLDPTHWGQHRVI 
SFFFIMETDSRSVAQAGVQWHNLGSLHP 

PPPG\SSDSPAS 

GFTSQS EIiLYS I DP ITKPTDQVTIGI S A 



10588 



24489 



A 10671 



20 



355 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionioe, 
N~Asparagine, P=Proline, 
Vr z Olotamine, K=Arginine, o=oenne, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, Y=possib!e 
nucleotide insertion 














QSYCRVHVHNRVYDLDVESGHPNGAPAI 
KfcioVuo IJLiWPoJJVoovjKoiNFvjKr V515 

NSSL Y/EKDKKNKACFTK/ RPS PVNDI I 
ST 


10589 


24490 


A 


10673 


112 


359 


NKAQQCVHENHFKLKDANTLNIKVWRNI 
CHS S PNQKKx GIiAI IiNIjDKjSGFRSRKDT 
GDEE/HFIKIKKSVTQEDFSI INIYA 


10590 


24491 


A 


10674 


317 


2 


TCPGFFPQIWKVFPPFPLKIFFTQKFLV 
SFWGFKTIKVGPFVFFPKGPQPRFFSPF 
DGVQAEKFFFFPFFFFFFFFCERGSHSV 
A\RLECSGAISAHCNLCPPGFK 


10591 


24492 


A 


10675 


240 . 


495 


DHRPE KTKS STCQAGEVP LLGLi FV P FRS 
STDWVRPAFI / MEG /HI CPTQS TNSNIN 
PVWKHPHRHTQDNVWPNAWSPHGPVKLM 
HKLX 


10592 


24493 


A 


10676 


271 


484 


NPAGQTCRIKSFFFSFFBTESCSVTQTG 
VQWHDHGSMQPQSLGP\ SDP PTSANSS A 


10593 


24494 


A 


10677 


364 


121 


KQE WRS iiSHNALCNDQAS PLPGSGHWK 
SKQKLSKAAPCAGSSKHKHL/HKECWKH 
THTHTHTHIHTHTHTHIHAHTKDRFA 


10594 


24495 


A 


10678 


354 


539 


FFFFFFIVFEESHAIAQAGVQWCHLSSP 
HLR \S PPPGFN \DS PASASSAGITTLSS 
SVRL 


10595 


24496 


A 


10679 


123 


853 


RWSLCHPRLE \ CSGTISAHCK/ L/RAPG 
FT P FS CLiS LPS S WD YSARHHARI>IFFVF 
LVETGFHHVSQDGLDLLT/SG/IPPALA 
/ FPKGWDYRRB 


10596 


24497 


A 


10680 


251 


30 


ASLGVSAPLCIHVS PCPPNGDLGKTWWV 
GGGFFFFEMEFHS/VLPRLECNGTISAQ 


10597 


24498 


A 


10681 


203 

< 


1 


NFPAPGKLGPPRDSLKTAPPFFFFFETG 
SRSVA/EAAVP/W/CDLDSLQPPTLPDS 
SDSPNSASGVAGITG 


10598 


24499 


A 


10682 


237 


397 


DSLTLSPRIiECNGS ISAHYNLC/RLGSS 
NSPASAS 


10599 


24500 


A 


10683 


8 


388 


LYMCWFRPGFIiAHNSHDHGYSLTLS CWG 
ASGIjKKQP/ CRIjSEKxUvKKKKKKKKKKK 


IUoUU 




A 


lUoo4 


201 


463 


LMFGQGFLFFEMDSHSAARAGVQWRSLG 
QT.PPT.PPfi/QfiVTT/T.PQQnQPAT.ZlQPVn 

EIAGV 


1UOU1 




£\ 


1A/COC 

1U05D 




i 
1 


ppttwp trrriKTSJi p p p pppprtyi q oqud o a 
GIjQWRSHSSIQNEPP\GSSDP 


10602 


24503 


A 


10686 


165 


3 


CVFVEDIISNFFFFFMETESRSVAQAGV 


10603 


24504 


A 


10687 


292 


384 


NTE IGWVWWLTTVI PALWEAEAG/E ITR 
SG 


10604 


24505 


A 


10688 


161 


365 


RPPCWIRTSRWCWRYYRFDGSTIKVLRD 
AYFTK/RPSPVNDII 


10605 


24506 


A 


10689 


286 


3 


QPSSVWSFRSVICAPAPFSGMGLSPELQ 

FKLEE L/REAPASLVS FLPPQMLRI KWE 
SEQWLVLLCDRG 


10606 


24507 


A 


10690 


169 


2 


SDTPAWWPRKHVFPPGFFFFFETESRFV 
T\RLECSSTISAHCNLCLPGSSDSPASG 
S 
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M 
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sequence 


Predict- 
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nucle- 
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ding to 
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acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leutine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaniine, R=Argimne, ^-serine, 
T=Threonine, V=Valine, 
W= tryptophan, y— tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


10607 


24508 


A 


10691 


240 


406 


DGIRRLSFFFFFFFETKFGFVA\RLEGR 
GTlvrvNVNIjHLPGSRDSPi^ i 


10608 


24509 


A 


10692 


343 


3 


QVPLNLSLHLKLGMLFPGCINGSLSQFL 
QFFAQISPYQTGFPSFYKTSIYHNSLHQ 
YSLS FLPY F IFLFF FSF r s F LiKUbiib VS 
\RLB CSGMVSAHCNIiRLTATS TTQVQV I 
LV 


10609 


24510 


A 


10693 


196 


3 


YDAGHTKKTLHFPTVYPFVLFCWWECKM 

t rr\i "\ htjtt" Tt 70TD / T7T VTTT WTI/LLVUDS Tnp 
VQFRW KJL VKK/ rulSJSJJN A IS-Lir* I U fAJL if is 

LSVHQKELRT 


10610 


24511 


A 


10694 


245 


391 


KTDYQPGAVAHFGRPRREDHPRLGVQDK 
PDQLEKPVSTKNTKLVW/SWWL 


10611 


24512 


A 


10695 


252 


389 


KTGGKVLFFFFETESRSVAQVGVQWRAL 
RSLQPP XQPGTSDCPASAb 


10612 


24513 


A 


10696 


385 


1 


PPNKAKMISSKDNKNLHWGKDTLLNKWC 

VraSWIVTCITMKLDSHI^ 

KD/LKTIKILGENIKKTVLDIGIiHK\NM 

VNRKSTEWKKVFASYLSN 


10613 


24514 


A 


10697 


296 


430 


KHIQARRGGS /CGN/ RQHSGRPRRADHL 
RSGVREQPEQPGEKPHL 


10614 


24515 


A 


10698 


280 


468 


DYLC L/ SLSI YLS I YLSIYLS I IHQFIY 
HLFIYHIiSNXiSSIS Irr iK.Wl*b 


10615 


24516 


A 


10699 


15 


393 


RSVGVLGPVRCXiCA\LLGGDSQLG\SQG 
SGVRDPLEEAVCRFPYIjQLCTGRTTALF 
KATOQGHIjSIjQRIjIiLS F XVWIiCPAPXOjCt 
AYRGRQASWSCGGLHPVRA/ SMLLCLPK 
EAWAMAGAPPPASLPPLS 


10616 


24517 


A 


10700 


176 


1 


DWTTNFTTFLYNFKPSSIMPYLSHLFICT 
LR/MWPGAVAHARNPSTIiGGRGGWIMRS 
GDE 


10617 


24518 


A 


10701 


494 


80 


FNKKDIHSDTPSEGHQLQRPNVETIiKKM 
GRNQCKKGENPKNQNAJS spkdhns stpr 
EQNWMKNESDELIEVGFRRWVITNSSEL 
YK3 \DVLivCKi£AKWiji^Ki-^rvVij i KJ. i 
S LEKNGHGLMEVKNIAQELCEASAGWR 


10618 


24519 


A 


10702 


264 


410 


KKGPLFTPPGGGGG/PQKKPPGPLNPGG 
QRL/oor 1? JrJrtrvivjwl lVjJSAJrrvjv | 


10619 


24520 


A 


10703 


125 


3 


NRGNKGQV/ QWLMP VI PALWEAEAGRSP 
EIRSRDQPRQHGETL 


10620 


24521 


A 


10704 


184 


2 


RIoRVliAFCKiiijPKAJrKl WPiNxCrr r rot: if 
IjFFPQTQSHSVA\RLECSGAISAHSNLC 
FPPTRP 


10621 


24522 


A 


10705 


187 


406 


LFLWKFCLRGVPGHVRCQSAL\IiGGASQ 
TLFKAVRQGHLRLQRILL 


10622 


24523 


A 


10706 


82 


410 


ILRGKFGKHYFNRIDWREALRQSLSLFN 
FIIFSNFLASLHKPEMETELKGSFIELR 
KALFQLNARDASLLSTVDSDFS FCRKFS 
R/CSKCGQ 


10623 


24524 


A 


10707 


390 


2 


S KDCRTAKIAACS FLWKLRSRGAPARCK 
PELiSCMRCLiSA \1jLiGGVSQSGG rGIRDP 
LEEAVFPIiAELERCVGRSAALFRASRQE 
HLSLLKMHPQLPLPSGALSQADGSFIYK 
PLTGASAFLSEMPCQERR 


10624 


24525 


A 


10708 


229 


392 


YSWWRQMHSVAHAGVQWCD/LVSLQPQP 
PG\SSDPPASASLAARTTGAGHHNQLI F 


10625 


24526 | A 


10709 


193 


2 


RYI^YQDHVILLLLLLLLFLRWSLTLSL 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possiblc 
nucleotide insertion 














RLECSGTIMAHCSLDS \SSNPP I SASQA 
AGTTGACYH 


10626 


24527 


A 


10710 


180 


11 


LLFFFETRS/HVSQTWQCGMISGHCNL 
RLSGSSDP PPLASRVAGTTGKHQNIWLS 
R 


10627 


24528 


A 


10711 


234 


408 


MKLIMLKVILFPQIHLQIQPNFYQNIFF 
FETELCSVAQSGVQWHNLSSLQHPPP\G 
SSD 


10628 


24529 


A 


10712 


312 


470 


TGPHCVT \RLECSGAITAHCSLDFAGLS 
TSPTSVSQVSGTTGTWMKLETI ILS 


10629 


24530 


A 


10713 


64 


392 


PKMVIRISSETSLYASLPLQMKGQRQK/ 
G 














10630 


24531 


A 


10714 


169 


427 


NNQKTNNKMVGVS FYLSI 1 1 LNVNELNA 
P IQRHRVAEWIKKEKKKKK/DPGICX3LQ 
QTLFFYEDP/HDPLRLKIGGW/RKYYPS 
RGTQIK 


10631 


24532 


A 


10715 


176 


410 


ARSSWGIiHTAVPVAFFSIiRFLLEIFLDR 
DTRCSPPAAFFFETGSCCVA\RIiEGRGA 
I TAQCSLNKLGS SNP PTSASRVA 


10632 


24533 


A 


10716 


234 


389 


NSGNMDRYKDVQNI IQNP I CWPGTVAHA 
CNPSTLGGRGGQIL/RGQEFETSLA 


10633 


24534 


A 


10717 


190 


2 


GPFPPHPGGFLRGFFVPNTIPPPTFFFF 
FFLRRSIiSVAQAGVQWCX>LSSLQPLPPG 
P\SNSPCQ 


10634 


24535 


A 


10718 


201 


1 


WPLFLPKMFFFFKRFPHMGPSPWVGAQA 
KKKKIFFFLTGSHSVT\RIECSGTISAH 
CSLNLPGPSH 


10635 


24536 


A 


10719 


195 


419 


EYHTSLVTCGNPCVYRSNNKLNQTSRRK 
VITKIRAELNEIETEK/LQGSGETKIWF 
FEKINKTGLEIiLGSSDPPW 


10636 


24537 


A 


10720 


218 


407 


GKKNLAFKKKKRKEKKRKKSPSQSNMNS 
AKIEARTNI KLWKHGWKNCE 1 1 DA\LQ 
KATODNA 


10637 


24538 


A 


10721 


231 


409 


GTQLHLGGFFFSEKELSFCFWF/MFALF 
EMEACSVT\RMECSGTVLAHCNIiRLPGT 
SNSSA 


10638 


24539 


A 


10722 


40 


401 


PLCPSESSGNTLMASSDPSTPAVPPPNT 
THPPLCLSKSHLPLRPKQGLPSGNLLQXi 
PLTLLI PLIiGAPVACWQLPQQCTLSTFF 
FETKSHPVAQAGVQWCCDGSLQPLSPG\ 
SRDSPDSAC 


10639 


24540 


A 


10723 


210 


14 


HVMGLLLLFLNKLTVNNFGWLAGFWCLG 
FFWFFSFFETRSGSVT\RLECSGMISAH 
YKLCIPGSSH | 


10640 


24541 


A 


10724 


347 


462 


TFFFFFFETKSRCDIQAGVQWCDLCSLP 
PSS\PDSSDCP 


10641 


24542 


A 


10725 


169 


386 


dkkqakt i kwgkns fsnkwcwnnciatg 
krmkldp\yltpykk/ inskwikdlsi 


10642 


24543 


A 


10726 


257 


54 


PLFFFFKRQGLTLS PRLECNGTVTAHYN 
LKLLGSRDLPTSASP\SAGITGISNPAR 
LFTYLNPTHLQT 


10643 


24544 


A 


10727 


2 


401 


NNYDRAETQ I YQYMCLNPTFYCLQETHL 
TCNDIYRLKVKGRREIMQIENKRVGVAI 
LVSDKTDFKSTTVKKKLHYIIKGSIQPE 
DlillLCTYSPNIRASRFIKRIPDLRKEI 
A/HTVKVGD FS IS LNRLS RQNT 


10644 


24545 


A 


10728 


154 


1 


PMCSLLGLSKGGI I FFFFFEAE FCSVAQ 
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AGVQWRDLGSLQPPPPG\SSDS 


10645 


24546 


A 


10729 


34 


151 


PRPPVPSLLDRGRLQLWRQRGLRHRAHS 
NGFIGGKQQIMKLLKNYVRRPVG/VAVA 
IMFDPDPRYPRSWIEDDFNYGGSVASAT 
VHIRMGSLVENNKS 


10646 


24547 


A 


10730 


2 


408 


VFLLTVRTLICRSVGVWWIISTPDLVCLG 
ISSGGCRTANIGLYQMLLPDHSSGSFCIi 
RGVPGHVRCQSA\LLGGPSQLGYSRVRD 
PLBBAVCPPSDLQIiHAGRTTTLFKAVRQ 
VHLILQRFLLRFVWLCPAPR6/GVYR 


10647 


24548 


A 


10731 


274 


89 


ENIPIVFNPPVCSPLLWQPQE/CEYPKL 

cklclklhkhrvfyXflffffetephsv 

ARLECSG 


10648 


24549 


A 


10732 


251 
i 


1 


TFQMMQKCFSHRKIFHNLLDKASYKIVY 
KEDPFSPSSLSSSVSLKNNFFFLETBSC 
SVTQAGVLWCHLSSLQPPPPG\SRDSP 


10649 


24550 


A 


10733 


375 


1 


APFPPLWVRGSPPFSPLVFPHPKLKPPL 
ABILGFFKERKWGSIRKPCLFKVKKLVS 
VWPGLELQIFKFIGEFPFSFPSIiVGRKP 
NFFLGPVFFFFFEKESRSVA\RMECKGT 
ISAHCNLHLPGP 


10650 


24551 


A 


10734 


82 


398 


SFLWKLRLRGVPSHVRCQSA\IJjGGASQ 
LGYLGVRDPLBEAVCPFSDIiQLPAGRTT 
TLFKAVRQGHLNLQRFIiLPFVQLCPASR 
GGVYRGRQAS LSCGGLHPVRAS 


10651 


24552 


A 


10735 


2 


341 


TFCAISWLESGVEDGPRSRISYRISTFF 
FFFKGAPEPKEVRGRPKPGLAPTSLPGA 
HGTTGLSGWTTPS IGGWQPPPPPRENPK 
GEHPPAPVAGDTFQ\PKKPPPILKVFFP 
K 


10652 


24553 


A 


10736 


171 


1 


RVFYLIiAFALFVDTGS PFVSQAGVQWCD 
HSSI QSQTSG\SNDPPAS AYRVAGTTGV 


10653 


24554 


A 


10737 


190 


1 


EHKTTVRKI PKMEEMADSGSNMLRI I I F 
LFLFFETKSCSVT\RLECN/GSISAHCN 
LHLPGSSNS 


10654 


24555 


A 


10738 


252 


413 


GLLGIjQNFSYKVHLTKAHLKKCSWLDAV 
AHAYNPITLGGRGEW/ IQEFQTSLTNV 


10655 


24556 


A 


10739 


101 


249 


AGSTSRCI /QBLSDLLEHLEQENCLNPG 
GRGCSEPQSCHCTPAWVTETQKK 


10656 


24557 


A 


10740 


185 


403 


l^LPVHTMKSNYHYHLCHHHHHIiQHHHH 
HHHCHCHYHS/HHHNSQHPPPPPPPPPP 
HHHHHHHHHLP 


10657 


24558 


A 


10741 


119 


425 


IJ^HljHbl^RTillirliWWCwAHKOKjNCFK 
ADFVITDDIKQLCPPQSWWTRAGKLPIiG 
W3RGGSHMLSHHFWRPRHADILYLGVLD 
QPGQQGETPSL/ IjKNTKSSW 


10658 


24559 


A 


10742 


112 


2 


GETFFFETGSSSVA\RLECSGAISAHHN 
LHLPGSSKSP 


10659 


24560 


A 


10743 


188 


3 


PPPLFFFFFFFFFKTGSGSAT \RLECTA 
HCNCCLPGSSHHPTSAYQVARTTDVCNH 
AWLIFV 


10660 


24561 


A 


10744 


176 


3 


SLYSKNKNPSHLLFLPI PI KFFFFFFBT 

Et c Ki> V Pi\j£vo V \i Wj±Ul_t£jo Jby if if if \f\a\Z301j 

SP 


10661 


24562 


A 


10745 


250 


1 


GPRRI FFLKEFYPRFVFGKNPAPGGFFS 
GGKKPGPFPFNPRPI KFFFFFFETEFRS 
VAQAGVQWHDLSSLQPPPPG\SSDSP 


10662 


24563 


A 


10746 


155 


3 


PPHPFPFNPLPIKFFFFFFETEFRSVAQ 
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AGVQWHDLSSQQPPPPG\SSDSP ! 


10663 


24564 


A 


10747 


157 


I 


TPRPFPFNPPPLKFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10664 


24565 


A 


10748 


157 


1 


TPRPFPFNPHPIKFFFFFFETEFRSVAQ 
AGAQWHDLSSLQPPPPGXSSDSP 


10665 


24566 


A 


10749 


153 


2 


PPPLSWTPAPLRFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10666 


24567 


A 


10750 


157 


1 


PPPPFLFFPRPLKFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10667 


24568 


A 


10751 


155 


3 


NTFPFFFTPLTPKFFFFFFETEFRSVAQ 
AGVQWHDFSSLQPPPPG\SSDSP 


10668 


24569 


A 


10752 


86 


406 


SFLWiOLiHLRRAPGCMRCRLA\LLGGVSQ 
LDYSGVRDPLEEAVCPFSGPKLHAGRTT 
TLFKAVRQGHLSLQKFLLPFVQIiCPAPS 
TGVQEGRQASLS FGGLHPVQSSR 


10669 


24570 


A 


10753 


364 


469 


GQFIaHSLD/SHWKKSEDFCFIiWFPHSEN 
VSAIHQDH 


10670 


24571 


A 


10754 


250 


61 


GEKKKKfCPRVFFFFFFFETEPGSVT \RL 
ECSGVLSAHCNLRLPNPNDSPASASRVA 
ASAKLG ! 


10671 


24572 


A 


10755 


66 


446 


SFLWK^CLKGVPGCVROQSA\LLGGASQ 
LGYSGVRYPLEETVCPFSDI1KI1RAGRTT 
TLFKAVRRGHLSLQRLLPPSVCLCPAPR 
GEAYRGRQASLSCGGLHPVRASRPLCLP 
TQALAMVGAPPPGSL 


10672 


24573 


A 


10756 


252 


482 


RLPRQPVRKWYAGVRGCCVWGWSKSLQ 
RSTTLDWQGPPQRGGPILLFFFFETLCH 
/TRLEGNGEI SAHCDLCLPGSN 


10673 


24574 


A 


10757 


205 


488 


PLESLRSPIiGPIiSWRTASSVSPVICIYY 














QYI FVCVVFLKDRVLLCHPRQSAGARSW 
PTRTSASQRQEM 


10674 


24575 


A 


10758 


272 


471 


YSYVLFFIFLGIESCSV/AFSAGVQWHN 
HSSLQLQTPGLKQSSHLSLP/ASASPVA 
GTTGMR YHARL I F 


10675 


24576 


A 


10759 


17 


342 


GTLSSGAQVLIGRIESIVWGIiKPWALG 
GCPSPRAVHWIiLASSDWRPSLQDGAEGW 
KKGEANGNHKR/ GIAVISDQIDFKTKTI 
KGD\KKSHYVMIKGPIQQEAITIINI 


10676 


24577 


A 


10760 


42 


498 


EFRERGREKEREKERKTEERMEDRERKA 
EREREKERQEGREREREKGRQRERERKR 
ERERKRERE/RHEPGSL 


10677 


24578 


A 


10761 


1 


305 


ASWDDPAHNNNFHIPGGWAHFFFCRLL 
DCPPWAPRPPAPR/VLL/TPAAAAAAAA 
AASRPEKKRAEENVGAIPPRKRPPRWEE 
RRGPRKGSAPPGRGAGRRAR 


10678 


24579 


A 


10762 


364 


2 


FPQKQNGQYLPLFPKPTGLCGKGKTRLG 
FFLNYFGKKKKKRGREDGFPPFCQIWGT 
SIYRGSMGXFFFFFFWRQVSLALLPMLE 
C\RGMIRVHCSLDLVGPSNPSTSASQIA 
RTTGTCMCHH 


10679 


24580 


A 


10763 


152 


484 


RFVCS TI KVLRDLS SDRSNPGRFLSTSN 
SSLY/EKDKRNKAYBTK/RPSPVNDIIS 
T 


10680 


24581 


A 


10764 


208 


420 


SFLWRFPLRGVPGHVRCQSA\LLGGASQ 
LGYSGVRDPLEDAVCLLSDLKLCAGRTT 
ALFKAVRQGHLSLQR 


10681 


24582 


A 


10765 


319 


85 


GKLLNNNRCFGS KFQGIMFFFGETES P Y 
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t tosgvr whdfgslopp'\ppgsrdspas 
ASXRGITGARKHTQPIFLIjVSNS 


10682 


24583 


A 


10766 


122 


3 


KFFFF FETEFRS YAQAGVQWHDLS SLQP 
PPPG\SSDSPSI» 


10683 


24584 


A 


10767 


240 


86 


RAPPFFFFEMESCS IAQAGVQWLSLGLL 
QAPPP\GSCHSPASASCELMFSKI 


10684 


24585 


A 


10768 


244 


85 


QAPPP\GSCHSPASASCELMFPKI 


10685 


24586 


A 


10769 


35 


296 


EVKSPSARQPPRLGSEEHLRPAAAPSGR 

EVRS PSARQPPRLGGEPNS SLRTGHDDD 
wr v 


10686 


24587 


A 


10770 


260 


484 


FIDEEIjLLMNEQRTWFLEVESSPGEDAVS 
I IQLATRDLEYDLNIiVEKGAAG/LERKH 
VQFPP QQTVnKTTiSHNTACY 


10687 


24588 


A 


10771 


248 


393 


TQEGKKLINWPGTVAHACNPSTLGARGG 
RTTRGQELETK\LANKIKPCL 


10688 


24589 


A 


10772 


239 


2 


SFLWKFCLRGVPGRVRCQSA\LLGGASQ 
RTSALFKAVRQGHLSLQRLLLS F 


10689 


24590 


A 


10773 


158 


455 


" LFFTLCPSLLQHIAVMLELGLKGPKSIQ 
PIFWVFLQGTEP/HFIiVTPVR/CCLPLL 
KLFCLLVFGMESPSVPHAGECSGVISAH 


10690 

* 


24591 


A 


10774 


166 


403 


KKTFLGEPLFWGGAKKKKPGKKNPGFFP 
RG/ I KPRVFFSRFFFFGPP PKKG FPQKS 

WFT.TfQT.TJfZFFT.FtTfSr'PPPFFFFFFFFEM 

ESCSNTRLECSGVILAHCNLCLPGSSDS 

-n 
ir 


10691 


24592 


A 


10775 


208 


1 


RLCF FYFRKALLGKAQ I KN I AF FPRKGS 
FFFFSETESRSVAQAGL/LDCSGAISAH 
flTtiRFPGSROSPAS 


10692 


24593 


A 


10776 


192 


29 


IFRKEFPCLNFFLLFFETESRSLAQAGV 
QWRDLGSLKAPPPG\SRRSPASRDTGV 


10693 


24594 


A 


10777 


275 


490 


KPGFFLLQMMCVYM IFFFFEMEFCS \VA 
QAGVQWHDLGSMQHRPPG\SGDSPSCLP 

OX/ rlul iro 


10694 


24595 


A 


10778 


340 


3 


LBCVPVGKPRSTIiIjGVKKVPFFGNKRKKF 
IiALFFFPRPPPGEGFSTAFIiAQKPTPRV 

vpalgfpknqsplspffffffetkscsv 
toagvowcglrspopmppgVssdspasc 

L 


10695 


24596 


A 


10779 


313 


1 


ANPFGGPSGGDPFSSRVFPPPGPKNETP 
FFKNKTTATKKTGNTCGGWAPSPHRGGPK 
KG PALWDKKGKNLWTPFFFFETESHS VT\ 
RLQCSNT I LAHCNQ CLPGS 


10696 


24597 


A 


10780 


116 


422 


ILEDTNIQTIETLI^REVQIQTTLiRNH 
FTLTGMAI I / RKTDNNKCWRE CGKI ETL 

NIELSYDLAMPGFI IF PRE 


10697 


24598 


A 


10781 


120 




' TOTTGAPOI1HLASRWI1SRSGGLTSS POE 
IPKLFWSIESPLGSSKHLSLQVYVCLFV 
CSFVFEMESCSVARLECSIVISAH\CTL 
HLWGSSHFHASASRVA 


10698 


24599 


A 


10782 


134 


t 


SSFFFLCQTESCFGVQWHDLSSLQPSTS 
RAQA\LSLPSSWDHRR 


10699 


24600 


A 


10783 


2 


435 


CSHRGDSSSYSQLSGIRAGDLGGGGKDI 
FRLLPTTLNI FAGKES YD WCVTHERMC 
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LC/SCLFIYMCLCMCVHV/HYIHEACVF 
MCLCL | 


10700 

* 


24601 


A 


10784 


432 


161 


FSRAGFH/ RVSQDGFDLLPS \ CLPPLGL 
PKCWDYKREPPRPAWKTJVJCRQVHRKCTW 
LFIQIGSTLFKTNGGLSAVAHAYNPNTW 
GGRGERIA 


10701 


24602 ! 


A 


10785 


270 


442 


NVRLRLGLALSPTLECRGTIMAHCRLDF 
PGLMQS S HLS HRVAGTTG /TCHHAWL I F 
KFL 


10702 


24603 


A 


10786 


12 


424 


LIQLPRLECSGALIARCNFTY/SLGSGD 
PPTSASQ/VLKTTGVCHHAQL 


10703 


24604 


A 


10787 


350 | 


3 


DSSSVQTNKGIPGQVLSDILSCAVKASV 
LVTI PLiHNALS SWLVP AL FLWKS WQVGK 
SQQAHSSVSGLC/MHYRSHTHAHTHSPH 
RHRHTYTYARAHTHTHTQMLSAYLPSKQ 
PSGSLS 


10704 


24605 


A 


10788 


155 


3 


HAFFALCIRNRLBCNGVILAHCNLC/RL 
LGSSDSPVSASQVNGIAGACHHAQL 


10705 


24606 


A 


10789 


188 


2 


RRDLS S LQPPRGGQ / MRGC I YTHTHTHT 
HTHTHTHTVHWGWGKRHHVPKGKMSANG 
ESGRAK 


10706 


24607 


A 


10790 


19 


405 


I RPT I SRVERG INS LVASEGQRL P WDG I 
ACSQGLVWQQTWGPLGPFPSLLGMPHR 
PTFRDLNSEPAPGVANVSGTLSTPLPGA 
SHGLLVFFFETESHFVAQAGVSWGDLRS 
LPPPPPG\SSNSPVSAS 


10707 


24608 


A 


10791 


157 


1 


KPGPFPFYPGPLKI FFFFFETEFRSVA\ 
RLECSGTISAHCNLHLPGSSDSP 


10708 


24609 


A 


10792 


157 


1 


QPRPFPFNPRPPKFFFFFFETEFRSVAQ 
AGVQWHDLSSLQPPPPG\SSDSP 


10709 


24610 


A 


10793 


83 


387 


S FLWKLHPGGAPACMRCQLAXLLGGVSQ 
LGYTGFRDLLEEAVCPFSELKHHAGRTT 
AVFSAVRQGCLSLQKFLLPFVQLCPAPR 
GGVYRG/RQALLS CHRLHPV 


10710 


24611 


A 


10794 


204 


419 


KGVYGHSGS FSPPAPLiACFRDKTLFFVS 
LWKEFVHS PWCKCTLPQPLWKTVWRYLK 
NFKMB\lPYVPEIPliIiG 


10711 


24612 


A 


10795 


207 


2 


RRGFTMLVGQNSLDPSTSRSAHLSLPKC 
WDYRCKP/PAPS/ LREGFSYTiTiHSTiTiHP 
TPAPGSHHLWAALIC 


10712 


24613 


A 


10796 


2 


237 


FFFLRERILLALSPRLECSDANMSHCSL 
NLPG/FSQSFCLSHP\SRWDHRHMPPYP 
VKFFGI FVGDR VLALFPKLI S YI I 


10713 


24614 


A 


10797 


342 


40 


DRVFFCS PRLECSGAI IARCS / LRTPGL 
KQSSHLSLPKCWNHRCKPPRPARCSLNE 
SHSAKKWPGQSTDSSACEEVQPTSPFPT 
VPYPTLLSPSAGHGRGR 


10714 


24615 


A 


10798 


209 


2 


CSVLAVIIiNPECIYWPWIiELLT^HDPIP 
RLVCPKSFFFETKFCSVAQAGVQWHDLG 
SLQPPPPG\SSNSP 


10715 


24616 


A 


10799 


142 


2 


IFFFFFEMESRSVARPGVQWSDLGSLPP 
PPP\GSSDSPASATPSPMQS 


10716 


24617 


A 


10800 


276 


2 


I LP I I IRNTCCCFFLKESRSCSVTQARV 
QWHI I / GS LQPQTPG\ SSNP PASAFQVA 
TGAHHHTSLIRNTLIIINWEKNTHKIiSG 
CTSMHFRFL 


10717 


24618 


A 


10801 


12 


364 


LHHYKTVSGIYKCLCVLNGPPTRFFPDF 
LPPLGLPYSLRHNNIEISPINNPPIASK 
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YSRERMS CISLTLNQTLVMI KLNEESJblj 
KANSDHKLOiLR/SVYQVVNAYEKFL 


10718 


24619 


A 


10802 


235 


3 


HRP S EDLDFPPHWAVMRAPQ Y PS WDTRK 
GGPRSFLFLFLFFEMESYSVAQL/EGTI 
SAHCNIiCLPS SSDS PTSRALCSL 


10719 


24620 


A 


10803 


349 


1 


NQTPFFFFFFGGTETTSTTIiCS \YGLLI 
LLKYPEVA/ESASQRDPEWEAAVWRWLE 
GPGSAQPPSAPAKGQELDPWGQRPVPS 
PDDHVQWPYTNAVLLEIQRFISVVKRTL 
TLDTLY 


10720 


24621 


A 


10804 


365 


3 


IDVCNVCVRKQYRGFCDQKYRAWISPVY 
PHKCGWHRVYNTPTPHCETEWIiWVIIjHA 
QEH/TFSLTGRHTHTHTHTHTHraTSKL 
APPASRAIiFGVAHVEAQKALAS PSSGRY 
LAITMFVQPCI 


10721 


24622 


A 


10805 


406 


96 


CPPEFSEESPRLLKFRVGGYLTPQVSKC 
GLGWRIFKVFFWSPPKVQTSLFFFSKT 
GSHSVT\RLECGDTIIjAHCDLCAPGSGD 
PPASATRVTVTVGIiPPCPAR 


10722 


24623 


A 


10806 


58 


369 


FFFFKGDRAQNNS / WGERCLLNKGYWD I 
WISTCKKMKSTPYIjTIiHTKISSKGLKDL 
I IRAKRIHLLKKYIGINIiHDIjGIjK\DFIj 
NMTPKTLATKEKIDTIiDFI kj. jk. 


10723 


24624 


A 


10807 


126 


1 


KEPFFFFFFETESCS VAQAGVQRCNIiS S 
LQHQPTG\SSDFP 


10724 


24625 


A 


10808 


204 


415 


HLGFDSLTTCISLSDGLKYKATVFLVFF 
FERESNC\AVWAECNGPISVNCNLRLPG 
SGSSPASPSRGVEIT 


10725 


24626 


A 


10809 


168 


2 


KISKRPFFFFFFETGSHYVA\KLiECSGV 
I1AHCSLDLPGSSNPPTSASWVAGTTGT 


10726 


24627 


A 


10810 


97 


389 


lavspslslslslslslsylglpysijrr 
snieitpintpaegsvcsserkghmsls 
fnekievitlsaqdmsntkig/rkldll 
ch/tsqwnaeekflk 


10727 


24628 


A 


10811 


24 


416 


leyiarrylgwwlffffflnrqgeksr 

GPFKFFFPRGFFSTRNGAPPGGPWGPLP 
WGGGPPVGFQKQGKGGAP\PPKKNRFPK 
GGPLTQPNLPKTPIENPKGPPTRGFFPS 
GPPPKKGAGPPPI FRVGPG 


10728 


24629 


A 


10812 


60 


435 


KKRKNFPQKKISPYFYPLKWFKTPPLWV 
KNQTPPV/CCFFEAPFSLKKPPRGLKKG 
/ WGKNFPPPLVPQKKKKKDTAGVAILVS 
DKKDFNPTKIKKDKGHYIMVKGSMQQBG 
LGCPDAWVPS 


10729 


24630 


A 


10813 


618 


1519 


FWFGVNCKSVCVPFLPVLCFVVCVWCER 
G VijSyKSMo'vAv o fcf IjIu iTfLuArlt? dm* 

FKGDYS VTMTPGKLRTLCE I DWPALEVG 
WPS EGSMDRSLVS KVWHKVTCKPGCPDQ 
FPYIDTWIiQLV/YRPPPY 


10730 


24631 


A 


10814 


179 


15 


BCVKRLKTPFFFFFFETRTCSVAQARVOW 
HNHSSLQPQPPGPKRS\PTSVSREAGT 


10731 


24632 


A 


10815 


61 


422 


NCFFLKGPPiiFFFFFiyFFFFljJ-ty 1 1 V If 4. 
YKFFAFLFQMEFRSLPRLDKCNGAISAH 
CNLSLPSSWDYRNLPPRLANFFLFLVET 
RQPAS A/ FLTCWD YR 


10732 


24633 


A 


10816 


170 


440 


RQGLSLLPRLECSSMTTAHCDLKLLiSSS 
NPPISAPHIAUJ/LTGLCHHTQL 


10733 


24634 


A 


10817 


322 


443 


FIFIFYFFETRSHAVAQAGVQWRDLAI*L 
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OCysteine, D=Aspartic Acid, 
E^Glutamic Acid, F=Pheny lata nine, 
G=Glycine, ENHistidine, Msoleucine, 
KNLysine, L=Leucine, MHVlethionine, 
N=Asparagine, P=Proline, 
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X^Un known, *= s Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














Urrrr \V3rS0D0FA 


10734 


24635 


A 


10818 


134 


3 


KKAHLPFFFFLEGTESHSVA\RLECSGT 
ISAHCNLCLPGSSVSV 


10735 


24636 


A 


10819 


76 


245 


FLLRKGTRQGGPIiS S / LLFNI VRDVLAR 
AIRL\EKKIKDIQIGKEEAELSLFADMI 
IF 


10736 


24637 


A 


10820 


233 


440 


FFEFGALKKWKSMTHKKVERGKRTTLNG 
GG I KAAKTAS LKKHliQG WGKVMLMANKIj 
LRGERAW/FPPRIRG 


10737 


24638 


A 


10821 


293 


402 


rnpfffletephsvaqgevqwcnlsslq 
ppp\pgs 


1073S 


24639 


A 


10822 


168 


1 


plrglsdsslgimkrmatdlsslqplpp 
g\ssdspasasrvagitdshhhawvif 


10739 


24640 


A 


10823 


97 


405 


LCVNICFNFPWLGVEWLNHI IGVGI/TFF 
FEMESCSVARLECSGAISAHCNLL/LPP 
SFGSSDSPASVT 


10740 


24641 


A 


10824 


186 


2 


EPGTISLVMjKLQKWPRR/SDHLRLAVR 
DQPGQHDETPSIiLKNTKISWAQWQAPVI 
PALEPV 


10741 


24642 


A 


10825 


199 


1 


KKTQIGGAPGGALFFFFFFETGFPFVA\ 
RLQCSGTNQAQCSLNLLSSSNPSFPALQ 
VTGTPSACP 


10742 


24643 


A 


10826 


252 


381 


GQQEFFFPCSVPQAGGQWCNIiSSIjQAPP 

pg\shhs pasasrvag 


10743 


24644 


A 


10827 


77 


427 


IPQVHCPMSPPVPMACIPRVSSFTSWVF 
HNLLPPSECPLGPLVPASSHPRPCVCCR 
PCTSWS\CPLWPRPPCSNSPV/TCVPCL 
PCIiCISEIPSCVPWP/WTYSSLCPMSHV 
PDSPCPLP 


10744 


24645 


A 


10829 


20 


518 


SFAFSLLQHLTETS FAINS CSEATLLFL 
SVFL/RAQTLTAPCQTRGPRRGKDRGSG 
SSSPSGPKATKESSVERRKSFDSWGHRF 
AA\QRLMDNQAERESEAGVGLQRDEDDA 
PLCEDVE LQDGDLS PEEKI FLREFPRLK 
EDLKGNIDKLRALADDIDKTHKKFTKAV 


10745 


24646 


A 


10830 


351 


3 


LHFS PFIjQHRQNIKVWLMRASMQRHSRT 
HGAF PLHQDE I QTRPRIiRSAASSGPSLL 
SDHILSTLPAFTCANLSSRFCSSSPCSC 
SCLRA/CCTCHFDLSTLIHAHTRTHTHT 
HTHTMY 


10746 


24647 


A 


10831 


8 


376 


GMLPAnljADVIaRnNoVGRPKHNKVMAGA 
IiEGDLFIGPKAE\EHRG 


10747 


24648 


A 


10832 


336 


38 


GVATEGVGEAAQGGEPRQPEQ/ PPPQPY 
P PP U Hcl£KIVlAAi!»ARQ A/ AG AP MDDG 
FLSLDSPSYVLYSDRAEWADIDLVLQNV 


10748 


24649 


A 


10833 


206 


1 


TYFFPFPPGLFFIAGIFFFFFLETGSH/ 
SSLRLECSGIITASCSLNLPDPNDPPAS 

AoyVAJSl lv-AlMI 


10749 


24650 


A 


10834 


226 


376 


RISQAI ISFYFYFLFETESRSVAQAGAQ 
WSD Jub S JjQ P PS PGV \ S OS PALP S 


10750 


24651 


A 


10835 


213 


1 


DRVI1LLSPRLGCSGMITAHCNI1HLPWFK 

PT7QPTir r PP1? /CCWTYVPPDT.D /PPPT/TQV 

FLVETGFHHVGQAGLR 


10751 


24652 


A 


10836 


351 


447 


RENLWLTPVIPALREAKAGI \LEPRSSR 
PAWAT 


10752 


24653 


A 


10837 


71 


472 


SASTAPMAPVKKLWKGG/ K/ KKKQVLK 
FTLDCIHPIEDGIMDAA/NSTNYEQFLQ 
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Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
EF=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, Msoleucine, 
KNLysine, L=Leucioe, M^ethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine > K— Arginine, a— oerine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X—Unknown, *=Stop cod on, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 















ERIKVNRKAVNI^VVTIBRSKSKLTVT 
PEVPFSRRYLKYLTKKYLKNNNLCDWLC 


10753 


24654 


A 


10838 


37 


382 


SRCIMAFYLYGIRS FPELWKSPYI/3VGP 

GHS YVSIirv AGRCG V KDtvU KliL*> V IS. X Ho 

PQNTKATNVIAKARYIiRKDEGSNKQVYS 
V Prir L iAuno JSJSiCoy L v lw o W^j H-unrv/ ua/ 
PVKNAV 


10754 


24655 


A 


10839 


313 


617 


AFFLIVFQYVARERERVSRS / VNCGLYQ 
HCYFRTL 


10755 


24656 


A 


10840 


309 


511 


WEQGSWEHVLQAPCSLIAPRITHAYTHT 
HTHTHRYTHTHLYPH/APTS IL 


10756 


24657 


A 


10841 


269 


450 


TPWPLKKEFFFFIiKWSLALVAQAGVQWR 
DLLGSSDSP\LQPSRFKQFSCISIiPSSW 
DYR/PC 


10757 


24658 


A 


10842 


420 


1 


LERGENGTRRDRRKGIiSHCHQPMlJbVljF 
PLCHPPPLLVTMEEBITMLFIDIGSSMY 
KAGFAG/DDASRAMFPS IVRCPWHHGVM 
VGMSQKDS YVGDEAQTKHS I LTLKYP I K 
HDIITNE\WDNMEKIWHHTFYNKLHVAP 
CI 


10758 


24659 


A 


10843 


431 


1 


GEHSWASDL1AEDVTK.V IKUl^iAsir Jur WLjJj 

CSGNLSFSGCVPGLPAGAVPPWVPVPFQ 

gga/swvwkgpspwtlhlllrtwglagg 

VGGRSLGKWRAWPGN PtwyoyvxHJrl^iirlu 
ATCTPRSRTGSTGGIJSTVTIIiEGSHVSG 
MGIR 


10759 


24660 


A 


10844 


55 


297 


QRWPGTLLGVQPG/APPDSTSASGSGGA 
RGGPVPTlJcKiGN l\joKJvWXSJJlr^.WVaoU«o 

PSSDGSKPPPGPLTSKVCDDSRLSB 


10760 


24661 


A 


10845 


109 


427 


' QTGPSAAGLLEFARGPLQTLFAWVPAAV 
AAEQQIFVNRECCCLIVPLEFCLRGVPC 
CVRCQSA\LLGGASQLG\SRGSGVRDPL 
EGGSCPFoDIjQIiHAAKI lAUrlA 


10761 


24662 


A 


10846 


17 


416 


sflwkfclrevpgrarclsa\llggasq 
lgysgvrdpleeavcpfsdlqlragrtt 
tlfkavrqghlslqrillpfvwqcpapi 
ggvyrdrqasiiscsglhpira/sraavp 
tqasamagapppdslppcslssn 


10762 


24663 


A 


10847 


265 


429 


LSGASCCLFSLPIVLFALLLLSVLSFSF 

CF*VNSNPKNLTFMVNFQGKGMKIGLAL 
ERIC ICGCS LVFPEAI LETS CQNLFCTY 
ACAGVLSSVYDYI1ALI1AVSFLFL* FFSL 
FFFFLYYLFLFALGASLPGLYLLLYFLL 
LCCISYSLF 


10763 


24664 


A 


10848 


12 


462 


\2 1 ULj 1 JKWlN iSvjJLjr Al r XAJrJL JL J-iv^Jjf i/ttJUi* 
IILFPPLLIPTS*YLINNRLIITQH*LV 
KLTSKQMITIHNTKGRT*SLILISLIII 
IVTTNLLGLLPYS FTPTTQLS INLAMAI 
PL* AGAEVIGFRSKI KNALAHFLPQCTP 
TPLIPILVII 


10764 


24665 


A 


10849 


I 


" ACT* 


TTLHAFGTMKRKAFITLLCLFTSANSRG 
VYARDAHKSEAAHRI KDLAEKDFLALVL 
IAYAQYLQQCPFBDHVKL*NEVTEFAKT 
CVADESAETCDKSIiHTLFGDSLCTVATL 
RETYGEMVDCCAKHEPVRYECFLQHWDD 
CPNLP*WRPEVD 
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correspond 

ing to first 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glu tannic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleuctne, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


10765 


24666 


A 


10850 


164 


1 


RGFQVRGE*NSGRETER*RERE *KLARE 
RAIGQEREREKSREAGRERERERKKB 


10766 


24667 


A 


10851 


417 


100 


MMLWGNLETRLWSQAQTIPCTKQRSSL 
FFPVLNAAIFRSKSKVLNCLEYFLGNTL 
FSWDLIPBILFPLSHYHHYHHHHHNHQY 
*KS*QWSASNTRFQGILDKNSY 


10767 


24668 


A 


10852 


339 


3 


ETTHSEARRGRSAAASCRGSAIiRRGRFP 
ESRRGREAAPVCPRHVLL*GAQSKQAAV 
AGKRSGTRHASRWPKSLFTPRRRRISLK 
RALHFWQQSADPSPSVSRIPPHGVGSS 


10768 


24669 


A 


10853 


1 


373 


I>QQ KGMRKRAGQS EMAPAGVS LRATI LC 
LLAWAGLAARDRMYLHPFHLVIHNEST* 
EQLTKANAGKPKDPTFIPAPIQAMTSPV 
DEEAMDQLVLVAAKLDTVDKLMAAMVT 
MLAIFLGFRIYG 


10769 


24670 


A 


10854 


1 


423 


VSCS FLKLKTMKHGL.LLLLCGFLI1KSQG 
VNYTEEGFFRARGHRPLDKKREEAPSLR 
PAP P P I TGGGYRARPAKAAATQKKVER * 
APDAGGCLHADPDLWV*SPTGCQLQEAL 
LQLERP I TNTVDE LNNNVEADSQTSSS F 
L 


10770 


24671 


A 


10855 


343 


3 


rgceahplprs egpagsalaq pvmyct i 
fagtlitaissh*fft*vgleinmlafi 
pvltkkinprsteaaikyfltqatasi i 
l\h:ailfnnilsgq*tltnttnq*sssi 

I 


10771 


24672 


A 


10856 


147 


1 


TRTPTGQCVSPKSMFLGAVAKSCNPITL 
GG*GRRIT*GPEFDPSLANMV 


10772 


24673 


A 


10859 


189 


422 


NHTMDDFERRRELIRHKREEMRLEAERI 
AYQRNDDDEEEAAR*RRRRARQERLRQK 
QEEESLGQVTDQVEVNAHNSVP 


10773 


24674 


A 


10860 


90 


273 


shqeieqnsamaprkrggrgisfmfycl 
rnndqrymt*rl*sgigwmllssgrmgy 

ALPG 


10774 


24675 


A 


10861 


25 


411 


APCAKPCGDWRSRGSLVWAMSGCNARKG 
DCCSRRCGSHL*N* IPTDWPLN*FFLTS 
AKVKBCFPKKESIYSQTVY*SPG*KM*T 
*DKEHPRYLIP*LCIQFYHLCWVTGTGG 
GIILKHGDEIYIAPSGV 


10775 


24676 


A 


10862 


33 


302 


SRRATLIYVDMENGEPGTRVVAKDGLKL 
GSGPSMLALNGRSQVSAPRFGRTFNAPP 
SLPIATIRALGTVNRATEKFVKTNGPLR 
Q*QPRS 


10776 

• 


24677 


A 


10863 


1075 


1521 


YCHTGKGEQLGERFCEGVSRRGPAERGS 
DSOTPWPWPLCAAAGTSAGTSHSGSSSG 
AF?SWPCWTAABPAARKRGRPAGSWSSP 
ATGAPGRCRHR I LS RGAGGS AGFVCSGL 
AESGL*ESSSPGRSQG*PQGQRHPQFNG 
LPAPPSTSV 


10777 


24678 


A 


10864 


245 


418 


TSKLAFPIS IPVI YANKVCP * FSKKKKK 














FFEGPGFFF 


10778 


24679 


A 


10865 


132 


398 


LNMGKGDPKKPRGKMAS YAFFVQT *REE 
HKKKHPDAS VNFS E FS KKCS * RWKTMSA 
KEKGKFEDMAKADKARYEREMKTYIPPK 
GETF 


10779 


24680 


A 


10866 


115 


455 


LLTRNMDRLLILAGGMPGLGQGPPTDAP 
AVDTAEHVYISYIiALLKMLKHGRAGVPM 
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peptide 
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Predict- 
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nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Atanine 
0=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possibte 
nucleotide insertion 














EVW3LMLGEFDDDYTVRVIDVIAMPQS * 
TGVSAEAVDPVTQAKMSDMLKQTGKPEM 
D 


10780 


24681 


A 


10867 


64 


483 


QGERPAAAMKI *SLTLL3FLIiLAAHVLL 
VEGKKKVKKGLYIKVDSE * KDTLGNTHI 
KQKSRPGNKGKFVTKDHTNCTWAVTEQE 
EGIS LKVECTQLDHE FSWAYAGNPTLCL 
JCLTDE I VYWKPVARNLRIiQNDIMI YSPQ 


10781 


24682 


A 


10869 


229 


1 


DTTILKLGRLI ILQWLLSEGKSHMSFTL 
NQK*EIIKLSKECMSKAELSQKIiGCFCQ 
VGSQ VMHAKEKLLKE I KSAT 


10782 


24683 


A 


10870 


934 


515 


KKVLLCLPGLGCGG VILAHGS LALPGS S 
SLHLSLQSSWGLTGM*YDARLIFVYFFR 
EMGSCHICQAGLNSYNSSIPPTLVSQNV 
GTTCVSHRAQLAPVFCFCFVTLVDVKAL 
HFGKQENRIFFFFIIiFPGGSRDEDRVA 


10783 


24684 


A 


10871 


373 


424 


EFKTNLANMG*CLKSQHLGRPRCANHMR 
LGVQNQLGQHGE I P FLLK 


10784 


24685 


A 


10872 


1 


469 


RSRSGDSLPACDRPSGASALATCETIFS 
AISCFWDLPAPSLRLTPSCQPTMSSQIR 
QNYSTDTEAAVNS LDNWYLQAS YTYLCL 
GFYFDRDDVALEGVSQFFRELAEENREG 
YDRFLKMQNQRGGRALFQDIMTPAEDD* 
GKTHNAMTAAMGLETK 


10785 


24686 


A 


10873 


244 


2 


NIYSCKETFSVPLIiAIHVYHFLVGRGGS 
QGTEMLWHRVDLRYREQAGHSGSHL * S Q 
HFGSPRQVDQLRYGVSDQRGQHGE 


10786 


24687 


A 


10874 


83 


427 


ISLNMIRIAALNASSTIDDDHE* SFTRH 
NTQTKEAQEAKAFALYH * ALDLQKHDRI 
EESAKAYHELLEASLLREAVSSGDENEG 
LKHPGLILKYSTYKNLAQLASQREDLET 
AME 


10787 


24688 


A 


10875 


94 


1 


KSQKACNPSTLGG*GGWIT*AQEFTTSL 
ANT 


10788 


24689 


A 


10876 


2 


413 


GVTRGFNMRIEKCYFCSGPIYPGHGMMF 
VRNDCKVTRFCKSKCHKNFKMKRNPCKV 
RWTKAFRKAAGKEIiTVDNSFEFEKRRNE 
P IKYQLELWNKTIDAMNRVEEIKQKRQA 
KFIMNRL*KT*ELPKVQDIQEVQLN 


10789 


24690 


A 


10877 


65 


417 


RFAGAGAI PEARAWPTDVHAAEEEKEMD 
LPDLASRVFCGRILSMVNTDDVNAI ILV 
QKNMLDRFEKmEMLIiNFNNIiASARIjBQ 
MSERLLRHTRTIi*DMKPDLGSLFRPIRT 
LEWKL 


10790 


24691 


A 


10878 


4 


442 


APTPDAMGHifJL'EEDKVTITGLWGKVNVE 
NAGRETLGRLLVDYP WTHRFFDS FGNLT 
SGSVIMGNPKVKAHGKNVLTSLGDAIKH 
LDDLKGTFA*LTELHCDKLDVDPENFKL 
LGNELETDMAIHFGQDFTPEVYAYLQNM 
VTVYAN 


10791 


24692 


A 


10880 


1 


419 


GKHIRQYHEEKETGQRINIHEYYLGNML 
AKNLLFEKEREAEEKEKSYEIPTKNIQG 

MYTCHPESKHEI LSVAEVTTCEYEAVIL 
TPLLCSHPKYRFRASAVND IFCQSLPG j 


10792 


24693 


A 


10881 


54 


335 


REIFTMSGALDVLQMNEEDVI1KNI1AVIT 
HLCCTKTDSPMEQI IYGS*TEYIYSIKL 
NRT*QNLMLAOT*NYDI*NHDEVTVMTF 
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nucle- 
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residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Add, 
E=GKutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, IMProline, 
Q=Glutamine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














NNTLQMFEP 


10793 


24694 


A 


10882 


216 


3 


MS I KSMS I CCLRKKGKHGFQ IRGK*NSG 
RETERERERERKRARERARGQERERQKT 
SEGGRERERERKKE 


10794 


24695 


A 


10883 


3 


390 


GELKCSKEKCQSMSAKSRTI WI IGAPFL 
KGQPRGGVEEGPTVLIKAGLLEKLKEHE 
CDVNDYGDLPFADI PNDSAFRI VKNPRS 
VGKASEQLAGKVAEVKRSG* ISLVLCVN 
LGGGLECLCGHAKVY SR 


10795 


24696 


A 


10884 


3 


375 


STMRAWIFFLLCRAGRALAVPQQESLPD 
ETEVVEETVAEVTDVSVGANPVQGEVGE 
FDDGA* ETE * DWAENPCQNHHCKHGKV 
CELDENNTLMCVCQDPTSCPALIGEVEK 
VCSYDTLTFYSS 


10796 


24697 


A 


10885 


95 


2 


KCTQGP SAVAHACNPNTRGGRGGW IT * G 
QEF 


10797 


24698 


A 


10886 


1 


114 


MGFHHVSQDGLDLVTL*GACFSLPKCWD 
YRREPTHLAH 


10798 


24699 


A 


10887 


1 


368 


E PTMELTTAEKTNGKAAWRKVGAHAGEY 
GAEALERMFLTFPTT* TYFPHFDLSHGS 
AHVKGHCKKVADALTNAVAHVDDMPNAL 
SALNYLHAHKLRLDPVNFKLLSH*LLWT 
LSAHLHVEFT 


10799 


24700 


A 


10888 


336 


9 


FRGQVI PAI FFPFQTTSLSYFQTGQPN* 
TPFFQKILKLTRGGGKLLDFQTPGRLRQ 
KREDPLFPGV*NCN* P*SHPWWGTKQNP 
VSKKKKKKLKTSSIFVHRVSMSPI 


10800 


24701 


A 


10889 


3 


362 


GFLIiAPLEMQELGTPMNRIIjQIjT iaeq£ 

tfltpalllpiphqtystasavplakpd 
t*pkdvgilalqvhfpaqyedqtdleny 
nnvkalkytedlgqtlmgfcsvqedins 

LCLTVEQP 


10801 


24702 


A 


10890 


66 


333 


TLPGNIGISFVERVMEVLRPQLIRIDGR 
NYRKNPVQEQTYQHEED**DFYQGSMEC 
ADEPCDAY* VEHTPQGFRTTLRAPS *LY 
TPIVG 


10802 


24703 


A 


10891 


405 


49 


I PWMGTERTRASAQRDVAGPARNPACPI* 
PEQCFPGPAEWTGPLTPPGS C* LRPGER 
ERHGPGCWGPRERLSSGQTHPPSPQGCA 
OTCPCHKHQQELLAGGVAVFYKLFYRYG 
NSPKWDL 


10803 


24704 


A 


10892 


2 


197 


PQPLRVLWTAHLAAMAPSSRTSLLXiAFA 
LICLPWLQKAGAVQTVPLCTLFYHAMLQ 
THRAHQLVT DTYQE LE ETYI P KDHKYS F 
LHDSQTCL * FSDS I PTPSNMEETQQTSN 
L*LPDIPAPGFCPDLPALASKGWCRPNR 
SAMHAFLPRYAPHASRAPTGH 


10804 


24705 


A 


10893 


3 


334 


DQLPEPLKVLWTAHLLAMAPGSRTSLLP 
PYALL*IiPWIiQEAGAVQTLPVSRLFDHA 
MLQAHRAHQliAIDTYHEFDETYIP*DHK 
* SFLHDSQTSFCLPDS IPTPSNMEET 


10805 


24706 


A 


10894 


3 


341 


LLTPGVSDAICYILDSGYI IMSDTFTAY 
ViG^Kr JSV«Var w riA.I VKifAAV VFP VAIitrW 

LGV*WDNPERGTYDGTHEWTVYFKCRHS 
TGGFFIRSNKVNLGTDFVTADKNLYVLD 
Y 


10806 


24707 


A 


10895 


25 


351 


AMIQTRDLQGGRAFGLLKAQQD* RLDEI 
CTQLLDDLKYSNDEDLPSRLEGFKEKYM 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGiycine, EHHistidine, IHkoleucine, 
K=Lysine, L=Leucine, M=M ethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, Iv=Arginine, a=oenne, 
T=Threonine, V=Valine, 
W=Iryptoplian, i— tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














K frDJj IGNGHI DIMxIiKLMlj I KloAv Fu 1 JJ 
LQLPRSIGEVYSGSGETISYPDFLM 


10807 


24708 


A 


10896 


364 


38 ; 


FSQFKIAQFJLKK.L± , £ WaiTbW J?y«Vi<jrJ>ur 
LKFSKFFPNKRGFFPHPFF* KRGPFFCP 
KWGPQGHIGAHGPPAPQGQKIPPFQVPK 
KMGIKGPTPPPGFFFFFFFFFFSW 


10808 


24709 


A 


10897 


136 


3 


STLKHR* RPGMVAHACN FSTI/iGKijUKi 
T*GQQFKTSLTNMVKP 


10809 


24710 


A 


10898 


193 


269 


GNKYCLT ILLTiTililtliS FKRQS LTliljPi 1» 
ECSG*TRTPGPKGSSIiIOTS*VAGHGGV 
CIj * SQXtX RRLiKWlilJii-l-io fu V ^ vxtlo Utf t\lt 
HHCTLTWATE*DSVS*KTTTTTTTK 


10810 


24711 


A 


10899 


316 


374 


CCCX:CCCCCC*CCCCCCCCCCYKETNFB 
QQ 


10811 


24712 


A 


10900 


38 


206 


VYCVLVFTICTLLCNTSLGLFHPEFFFF 
ETESCCVAQAGLQWHHC* SItRTiTiPnRI -L 


10812 


24713 


A 


10901 


364 


2 


D YQHI S PEKHCRPEGSGMVYLMCRKKKK 
RKKKNLSTKIIjNPQPSFIL* kprnsakk 
S*QKEITKIGADSI»I IENRKRIEKIYIN 

TKFRNERG 


10813 


24714 


A 


10902 


132 


3 


ATSSPWGYSAYRVAIiJ^ijxFbMJUjA^ 
VLYGFV*NYFLQEI 


10814 


24715 


A 


10903 


234 


352 


NFCFFETGSCSVT*AGVIMAHCSliDLPG 
SSNSLTSVSEE. 


10815 


24716 


A 


10904 


250 


356 


TMEMMLDKKQ1 *A1 if UK"* b KityUKAAii i 
TLNINNAL 


10816 


24717 


A 


10905 


280 


322 


QT *SL I LVSLI IC I ATTNIJjGLIiF Y £5 FT 
PTTQLS INIiAMAI PL* AGAWIGFRSKI 
KNALVLCTSTPLPTSTMIiPVHMMDTSSD 
IHPKI * S IR*QTCIQHK1J^W JjKa- 


10817 


24718 


A 


10906 


271 


365 


KWRPG VVAHT YNP S I LGSRGG W IT* GQE 
FKT 


10818 


24719 


A 


10907 


135 


358 


IiRYKLTNPKLTLLTCGLIFLKGNIVNIG 
QCNGVHM* YQHFGRPGQDCLS PGIQGQH 

GQHJcUSo lio Jj w KolifLX o Wo 


10819 


24720 


A 


10908 


107 


212 


~I KNEGMGQVRWLMP VI PALWEA* VGGTP 
EVSHSWL 


10820 


24721 


A 


10909 


165 


383 


PKNRPI I PLKRSS ISNPGDFQKSIiRPGL 
TPMGPHFKRGNYSYDENLENLCPVGGNK 

/17VOU4PT T TVI^TPD/IT? 


10821 


24722 


A 


10910 


119 


291 


GGYRFYVKDTF*K*GFWPLTLFI ILKPL 
AOK3GACLWS WRLMRLK* EDHLSLGGRG 
CSELWLCTIiA* ATE *DPVS *KVRLLALN 

SLESGRSRLQ 


10822 


24723 


A 


10911 


283 


2 


SDNTTDTFLPFIYKYIFPITWEQGETWK 
KNPRNSRLKKTIjRSETIAQIPLKCNLWP 
GMVAHACNPYTIjGD*GGWVT*GQEFETS 
LANMARPbL. 


10823 


24724 


A 


10912 


1 


360 


PHAFGTMKWVTFMSLLFFLSWANSRGGF 
RQNAPKS EVAHRLKDWGKKjNr KAWG * MA 
LAQNLKQGPFENHGKLGKEVPEFAKPGV 
ADDAAENGDKWLNTLLGNYLAPVAAVRE 
TYGERAEC 


10824 


24725 


A 


10913 


270 


361 


SKTWPGTVAQTCNPRTLGSQGGWIT*GQ 
EF 


10825 


24726 


A 


10914 


157 


1 


VFFQLSRLAWKGFSRFFFVFETBSHSVA 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Add, ^Phenylalanine, 
G=Glycine, HNHistidine, I=Isoleudne, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
1—1 nreonine, V=Valjne, 
W=Tryptophan, Y^yrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nudeotide insertion 














* AAVQWRDLGSLiQAPP PGFTP FSS 


10826 


24727 


A 


10915 


1 


357 


LEEWGPEREFAEEEKEKKNTKKKKIKNS 
TPPKKPARKGEEKPGPFKRAITTIFVAR 
IPLFCLKGFLWPSC*IIGRSSPTPGSRG 
HTGAPNCPP CPLAAFP KDVPNFNPKMVE 
AVKNPXIi 


10827 


24728 


A 


10916 


102 


471 


PSTPI ILTSSYPHYVKSSVTSTFIISLF 
PTTI FMCLYQEFI ISN*HLPTTQTTHLS 
LS FKLDYFLI IFILLTFLVTWS I IEFSL 
♦YINSNPNINQFFKYLLIFLITILILLT 

TVTvTMT 17AT TPY/** 
AWJNijr y Lit J.Vj 


10828 


24729 


A 


10917 


2 


138 


REPTMVLSPADKTNVKAVWGKVGAHAG+ 
YGAWLFTWLTLNVTSD 


10829 


24730 


A 


10918 


178 


393 


LVLCT VLLMWRFHFHDSLS ILKAQDVTE 
NLSNPTVLRGETGPSAVAHTSNPSTLGG 
♦GKWITQGQEFETSLA 


10830 


24731 


A 


10919 


278 


124 


TAWADWGSNETPFLIiKLPKKLT * GGGVC 
LEFQVLGRWPKNPFNLGNQGFN*PKFR 
PWTSTWGAKQNFV* KKKKKNKNKKP * NS 
RHTPPP 


10831 


24732 


A 


10920 


221 


243 


IIIGSPLIDIKISKCFLKIVAFGQAQWP 
V I P AL WEAEAGRS P EVGS P RPA * PGS LK 
VHVDNNWESIN 


10832 


24733 


A 


10921 


33 


292 


GTMWLE IRHAVEVQCVLVSELVI PTSG0 
KPBQC*DHYLIT* YLILGKWGI * * ISGA 

FI 














10833 


24734 


A 


10922 


108 


3 


KLAVYGS VCL*SQLLGRLR* EDKLNLGH 


10834 


24735 


A 


10923 


106 


2 


KLAVYGS VCL* SQLIjGRLR + EDHXiNLGH 


10835 


24736 


A 


10924 


195 


2 


ASLFKSQLNRAFIYLFIYVFIFETGSL* 
PMLVCSGVTTAHCNLHVLGSGDP PTSAS 
QVAETMVL 


10836 


24737 


A 


10925 


127 


2 


YVCILKNLKGRPGMVAYACNPSTIjGGQG 
gw i t* gqefkts l 


10837 


24738 


A 


10926 


639 


282 


FFLSIKG WVQQFMPI IPTPWGIiKQEDHL 
RPGLRDQPVQNSKTPSLLKI *KLARRGG 
ACL*SQLFRKPR*ENCLSTGELKPKGGI 
FIRIiVQNKI PTTRGEKRKQSHTGS YQCP 


10838 


24739 


A 


10927 


1 


363 


ALLTQALTCRQAGAEAPHAGATPSLMPP 
SLPQGFRDCSPSDAAYTMEMTIDKKQNQ 
VTIIjFKFKMGHKAAQTTRNINNAFGPEI 

PSKVATDQL 


10839 


24740 


A 


10928 


53 


379 


TEAELLTLYLLPNALLNHFTSPPLMFAD 
KRXiFL. rNHIDXGTIiYIiLFGA*AVVljGTA 
LSLLIRAELCXJPGNLLCNDHIYNVrVTA 
HAr V X 1 rr XCjJjP I X XoGFGW * IiF FXi 


10840 


24741 


A 


10929 


180 


350 


EPMAKGKTESPGPKRCGP* I *WVISQRG 
TLRFRGAGLFFMGEFLRLGENLLEIPRG 


10841 


24742 


A 


10930 


361 


379 


RRYWWLG*VQWLMPVIPALWEAKAGRSP 
KARNL 


10842 


24743 


A 


10931 


121 


543 


HRNTGSTHASAHAYH I VHTNP * PLTGAL 
SALLMTSSLAMGIHFHSITLLILGLLTN 
TUT I YH * WRDVTRES T YQGHHTP PVQKG 
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SEQID 
NO: of 
nucleotide 
sequence 

• 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/^15,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Gtycine, H^Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Vf=OIutamine, K— Arginine, a=oenne, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
A= Unknown, =otop coaon, ^possiDie 
nucleotide deletion, \=possib!e 
nucleotide insertion 














LQYGI ILYIT* * VFFFAGFF*AFYHS IL 
SPTPQIiGGHWSPTGITPIWPLKDPLLNT 
S 


10843 


24744 


A 


10932 


227 


339 


KKKKKKKKKKKKK 


10844 


24745 


A 


10933 


64 


409 


DQRNKSAHLRAHLKKKKKKKKKKKKKKK 

WAFFLPLLKRKFVGENFLKKNFFWGGNI 
GGQ I SSKKx RGGGK. 


10845 


24746 


A 


10934 


2 


362 


NKLSGPYPEKVGYTLPVLMMPLAQLVIY 
STlFAGTijITALSoH*r r x vyjU&xuSMi 
AF I PVLTKKINPRSTEAAI KYFLTQATA 
SxlxxulAJJjtWrJXJjoijV * 1 xxox loyio 

SIiIILRAM 


10846 


24747 


A 


10935 


154 


3 


IQIGLEVK* SliLTDNMILYLGN 


10847 


24748 


A 


10936 


250 


363 


XAWFvjxljxoir orxjx a 1 1JXJL wxjww/\iiti 
CDPSTLGGRG 


10848 


24749 


A 


10937 


199 


339 


NVLGIIiIiVFRI IVSIYKCGRLWLGTVAH 

1 vJM iro i iAJ^-SJKVjHJ 1 J. w V3ivD 


10849 


24750 


A 


10938 


130 


240 


KNEQDPRDL*DNDKWPNIHVIGVPEKDK 

t"»xt/"» ti? o \ Tun 
UNulbKV r U 


10850 


24751 


A 


10939 


35 


235 


F I LVENTKKMC I CPLLNMIYAILYLFVP 
SVFL*EENKQ*GITEKKKKKKKKKKKKK 
KKKKKKKKIKPGGL 


10851 


24752 


A 


10940 


179 


12 


QLLGRLRWEDCLS PGGRSCEEPCSHSG 


10852 


24753 


A 


10941 


297 


278 


INQDNE L.TL INQSS KQ I KHKN INQTLRT 
KMNENLFA* 1 3LAATIIX3LSATVQI ILFP 
PIIiIPTSKYJjINNKXiII ItjV" UlisdjXZtK. 
QMITIHNTKGRT* SIxLGGD 


10853 


24754 


A 


10942 


3 


318 


FGGGRGVRNYPDAFVLSVLLPSRLLFPH 
LFPrtbr Pi-ifFr Jj^JUfU^xv^r xjxJj w If ^J\. T 
*LYYSQSIIFFFYREMKNKLLVFWATNL 
FFLYYYNYMEMWTLICYWKN 


10854 


24755 


A 


10943 


190 


3 


KLSHKQS VYFKTPFI * KNHVLFK* KNL W 

t TVTWJIMf T? U*t? 1? V W C T .H T . C Ot> T -rt fMlftlT T T . 

VY PWDIUf 1? rr r r EoUAJjoitxUjUUNvjI J.1j 
AHCNLR 


10855 


24756 


A 


10944 


281 


1 


KKWLFSSYSSLYGNGIiFLIPPFLRVSGF 
GKGFWKKFFFLRARGALFWGSPLKGFFL 

HCNLRFPGS 


10856 


24757 


A 


10945 


169 


319 


NGVEDAFKNMVLGWAQWVTSVI PALWE V 
KVGGSPEVRSFRPTRPI* KYGFRkGTVG 

VKPC 


10857 


24758 


A 


10946 


312 


1 


Cf\ttcKjLllji\J3ViLlt?i^r tri-tUC V rtyjC C C AXvT 

PENWLEFEIFPFLI IMAKRKKLSKSLFP 
FPFLG*FKD*GQKTPFFFFETVSLLLPK 
LECNGAISAHCNLCLPGSSD 


10858 


24759 


A 


10947 


208 


329 

— — — _- 


lx? i\jUKoKfJWK±i» T /U<Wx<lFVilrdJjWiS/V 

EAGGSLE PKSLR 


10859 


24760 


A 


10948 


llo 




FYFGYLLFFFCFFEMAPCSRG* SWSARG 
QSFCNLI FPGSSDLYVS AFRVARITGAR 
DHACI 1 1 VFVVDGG CVDT I TS GLAGWLR 
RG 


10860 


24761 


A 


10949 


264 


434 


QLAFCTDTLTS VREQCEQL* KCVKARKR 
IELCDEQ*SCRSHTD*DCTDELFDFIjHA 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino arid 
res id iip nf 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 

tact Amino 

acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Add, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, ENHistidine, JNIso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

0=fvliifaminp R^AraSnin^ G^Carino 
Vj yjiuuiiiihic, i\ t\i giiiiuc, o — oeriilc, 

T=Th rpon i n p V=Valin<> 

W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stop codon,/=possible 

UUvlCUllUC UCICIIUII, \ JJUoolDIC 

nucleotide insertion 














T 


10S61 


24762 


A 




996 




ill wi. ftlll /XV-L V £7 I. O IV X IX J* Vi£\Vl£\\£ VI W t T i JZ V 11 

PELWEAKAGGIT*AQELETSLSNIVRPW 
L YKKFKN* VWWC 


10862 


24763 


A 


10951 


188 


433 


YSAEWKIDLGIEVFWVGKMTYKQ*FFWF 
VFKFLFF*RQGRTLSSKLBWMSVITGHC 

liLLUUlAJOKu X rnOATyv X W X XV70L.T10 


10863 


24764 


A 


10952 


298 


445 


LFS KCS SKSI KEEG PGWAHACKPGTLG 
GRGGWIT*GSEFESSLANMVK 


10864 


9476^ 

*T i\J J 


A 


lu7JJ 


9A4 


i 
I 


ifTDrtnT.xnn?rT.i?irircDT ypt irrn/oMnn 

txl ryubV V r \~Lt K ISJvo tr Ju Jx\j A issjt V orjv? V lj 

ICFFLKSRINLFNPRGLGQRKNFFFFFF 


10865 


24766 


A 


10954 


199 


3 


QETKKEQNKENKQIK*RSTRKKHRC3GTN 

IVX A *r ±r V X 1 XAar A rlHJK.tr K. 

" RRATTSPRA 


10866 


24767 


A 


10955 


157 


3 


YS YTFS FI ITTS IL I IQTLSGHGGACL * 
SQLLGRLRQDNHLNLEHGGCTRA 


10X67 
1 uou / 


9/1768 
4sk /Oo 






ZOO 




rtr.OT V\TT TOUT / , RTraira<*'MDO'PT /v^npo 

*IT 


10868 


24769 


A 


10957 


333 


2 


KWKADMLIPLVIKSFFxjGIFFKMFTNSKT 
RYIFLLQLIiAQHSSFFSFIRCMEL*WPY 
f J. fJjv. I U%joJNU " i-iAx\-Kx^ljlrNcixjXr 
FFFFETEFHSCCPGGISAYCNLCLSC 


10869 


24770 


A 


10958 


130 


228 


gssflggspsvaqagvhlpdhgslq*dk 

o ro V y rvrlo i\.W xvjL^vxH,v-L> ^ o Qlil iKKl iKi,. 

QGRLS P * GQACSEP * SGRCTPAWATEGD 
PPKKLLPYTTDS 


IVO /VJ 


Arr III 


A 




zjU 


J 


TrOTvf c ttprt vriDDM&i?m?nrk'i»iiDnTeo 
vyflLol lwUiiUru<fuUSi;ci Jryi AKir JLoVj 

FTTS IAMRLMLLTCSGHIWPVADLAFSG 

IMP*GLFFPSRQNLALSPRLECSARA 


10871 


2*112 


A 


10960 


199 


326 


VLGRiWSNQKNYTLIiARM*KIK* *NNTR 

PWf^nrYiA 1*0 p t t hp * npnj t * irrvMio 0 t 
i_yvovyv_vjLrt.x kkj. xxlv, " ^ tSl^K. A « JKAjytryoXj 

WRTV*QFLNTLNIR 




94771 

it I/O 




11/701 




i 
j 


i» x x\^xj v v r v rr 1 n to rij x x J-iXvOii tr v^.fl c 0 

K*LSTVTFFFRQSR5VSRLEWSGAISAH 
CNLCLPDSSDSCASAS*YAGSC 


10873 


24774 


A 


10963 


120 


3 


P FFFFFF * EMRSCFYAQARVQ * CDQSSL 
* PSTPGLKQSSC 


10874 


24775 


A 


10964 


168 


441 




10875 


24776 


A 


10965 


19 


207 


APLKLNVAMFJjSLGQWDVSRSNLWEIPL 
KKGDTGRVQWLTPVI PALWEAEVDRTPE 

NLWEIPLKKGDTGRVQWLTPVIPALWEA 
EVDRTPEVTSVTRC 


10876 


24777 


A 


10966 


209 


1 


TKSALSNMVVTTHIGIiCKNFCIVIi*DRV 
SVTOIGVOWHDLSSLOSl^PRLKDPPTS 
ASRVAGITGMHLV 


10877 


24778 


A 


10967 


109 


1 


DYLRSGV* DQPGQHGKTSSLLOQKLAR 
LVGPIISC 


10878 


24779 


A 


10968 


141 


2 


GliAMLPRLENI I FRPGTVAHACNPNTLG 
GRGGOIT*GOEFKTSLSP 


10879 


24780 


A 


10969 


175 1 


348 


LVYECKHILCVFNTSLFFPH*TCYK*RG 
VLWLGEVAHACS PGTLGGRGGWIMASGD 
RD 


10880 


24781 


A 


10970 


108 


2 


KTKKLHMR5GWAHTCNPSTLGG*GGWI 
T*GQDSC 


10881 


24782 


A 


10971 


184 


2 


SFLWKLHLRGVPGRVRCQSAPTRGASQL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 

correspon 

j- _ Ad- 
ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A1anine 
C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, {^Phenylalanine, 
OGlycine, H=Histidine, £=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 

fW'liitomSna DsArainSnP Q— Qr^rinP 

v£ — ijiuianune, tv— Argininc, oci mc, 
T=Threonine, V=VaIine, 
w — i rypiopnan, i— jjiumwc, 
X=Unknown, *=Stop codon, /=possible 
nucieouue ucicliuii, v puawuic 
nucleotide insertion 














GYLGVRDPLEEAV* LFSDLKLCAGK'1 U 1T 
LFLV 


10882 


24783 


A 


10972 


245 


339 


QEF 


10883 


24784 


A 


10973 


295 


340 


IfOVTPP+APKTrMPFT.SPIjWOTjTljENPIjGP 

IXKVAT C JvTvXVXVLl XT JT JJOrUnVJU Jl UiiiLit mw* 

RLFLNP FLGLKI S QFFFPRQ I * PLAFFF 
WGTLPRPLNLKFGPFFFFFFFFFFF*DG 
uewp rpGWSO ARRY I EINI S LSTCPRA 


10884 


24785 


A 


10974 


186 


340 


RPYLTIRFKGHPPNEPIQ*SFFVLFCYC 


10885 


24786 


A 


10975 


253 


491 


FLECTS LLKRQRVHPGQHS KAP PTKELQ 
TTY^R(^AT.TiOSHIjIiRRLROEHCIiGPGVG 
SYSEP*LHHCTPAWVIEGDSSKK 


10886 


24787 


A 


10976 


141 


1 


«?F«? FFKFSP r K3DM IGFF*HFFFFLRRSH 
SVAQAGVQWCHLSSLKPRA 


10887 


24788 


A 


10977 


3 


138 


HEETGFHLVSQDGLNLLTS * STHtiGLPE 
/%jnVT?T?F P PPfiR F.GDF 


10888 


24789 


A 


10978 


186 


323 


YESRSATQAGVQWCELGSL* PSTSRFQ * 
FFCLNLSSTWDYSGLAPS 


10889 


24790 


A 


10979 


183 


3 


IKILFPFFFIiRLSIiALLPRLECSGrFSA" 
QLV 


10890 


24791 


A 


10980 


309 


55 


KNFGPNWVKFLGGKGGEMAFLGKFFFPF 

PGVTQISRIiGKIGGPIiFPKKKKKRERGN 
M 


10891 


24792 


A 


10981 


637 


838 


SQHLGRPMRVBQLRPGV*DQPGQHGEMP 
CLNSGGGGCSE 


10892 


24793 


A 


10982 


275 


2 


ROTTIUKMEFI^WFWGFWIWIiIaI^IRPKN 

VVnQT Q 1TQ VfFKTPT) ^ PWHFTiFICTrLdGKTNA 
FDTPDSLIiVRGKFSDSIHNTFDHM*RTK 
EYNEARA 


10893 


24794 


A 


10983 


98 


346 


GHGHATLRGLCVLTFS FH I TALS VSGTN 
DATOCCLCETOKPICGYIERNLLYIiLIK 

IWTT? VPH Vt7* WRP WQTi T ^ RY53IiWRD 


10894 


24795 


A 


10984 


30 


410 


LPEFTGRPKRTRTRGFSTNHTDIGTLYL 
IfFGA*AGWGTAVSLLIRAEIX3QPGNLL 
nTTOHMYWATVTAHAFEI IFFIVIiPIIIG 
GFGN+LVPLIIGAPDMAFPRINNICF*L 
LPPSLLLT .T iASAIAE 


10895 


24796 


A 


10985 


141 


360 


QTLRTKMNENLFAS F IAPT ILGLPAAVL 
I ILFPPLLI PTSKYLINNRIiITTQQ*LI 
VTjTS kom ITIHNTKGRT 


10896 


24797 


A 


10986 


3 


347 


HELRTBCVl^HLJASFMGPTSLGLPAALL 
IIL*PPLLIPTSKYLISNRLITTQQ*LI 
KLTSNC»1ITIHNTKGRT*SLILESLII I 
T ATTNTiT jGIjLPYS FTPTTOLS INLAMAI 
PIx 


10897 


24798 


A 


10987 


46 


317 


KSMTPIRK.INPLIKLINHSLIDLPTPSN 

YSPDA*TAFSSIAHITRDVNYG* I IRYL 
HANGAS 


10898 


24799 


A 


10988 


245 


1 


VSCLliEGKIiTORKDIOT 

PKVDKTTKMGKKQNRKTGNSKKQTASPP 

PKK*SSSPATEQSWMENDFDELREE 


10899 


24800 


A 


10989 


255 


1 


S CLPWANWIS VQNQHKKTFLVTRTFFP I 
GS FHVT * LLGPPLI LI S PPRI F F FFETG 



1211 



WO 01/64835 



PCT/US01/04927 



SEQID 
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SEQID 
NO: of 
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sequence 


M 
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od 
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NO: in 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A«Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycme, H=Histidine, I~Iso!eutine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P= Proline, 
0=G!utflmine R=A.rpinine 5v=5»erme 
T=Threonine, V=Valine, 
W=TrvDtODhan. Y=Tvrosine. 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














SFTVARAGVHSGAI SARCNLRLLGSSNS 
A 


10900 


24801 


A 


10990 


166 


307 


GEKPWGPLIWGGLVGRSSWASCQVLELL 
♦EMVFRHVAQAGLKLLSSSNPPASASQS 
AGITGVSH 


10901 


24802 


A 


10991 


109 


330 


TNQFKTKKEREAGKKKVKELERERERGK 
KRKEQRKNEKERR* P*NMD*RRIRBRNH 
SFDVYEFLICGLLNIiLHV 


10902 


24803 


A 


10992 


244 


326 


GRYLKGWLDGPA*AWLGTALSLLIRPE 
LGHPAILLGIDLI SSVX VTAHAWSILLV 
RVI P 1 1 I AGFGN* LVPLI IGAPDMAFPR 
INSISF*LLPPSLLLLLPPPI*QMRAGV 
GEREVRVR 


10903 


24804 


A 


10993 


101 


2 


KTFWARFVGTCL*SQUiRRPRQEDHLRL 
GGRGC 


10904 


24805 


A 


10994 


177 


1 


TPSLLFFVNI ICFCLNPGGGGYSEQKLC 
LCTPAWVTE *NS I SRPSONNSROSROVN 
HLF 


10905 


24806 


A 


10995 


214 


326 


KEERKNPRAIRW* PWGFFFPLFKEISL 
FLFVFRFWHG YP F PGAFFSTOKWL VF* T 
MVLFFFFFS * VSLLPRLECSGVISAHCN 
TFLPGSSDC 


10906 


24807 


A 


10996 


184 


379 


LCGMVYHFPS ILLLYI**CKLFFSPKLEF 
SSCCPLECNGTVLAHRSLRLPASGDSPT 
SASRVAGHGGTCL 


10907 


24808 


A 


10997 


136 


3 


GRTOGQHIMTHQSHAYHIVKPS P* PLTG 
ALSALLMTSGIiAM* FH 


10908 


24809 


A 


10998 


274 


363 


P WAYAE P LTI*HD AT YS CT * S L I RAS FTL 
FIA*TTRI^LP*SFTPTTQLCMNLAMA 
IPL*AVAPDIGYSSMITNALSHLLPQCT 
PTPLISILGIIETVSLLIOPITIjGVRRT 
ANIT * CHLLMHL I GS ATLS ISTI 


10909 


24810 


A 


10999 




1 


NLHTTPSOIPI/T* PLPNINIHNL1HT1TTT 
Q*IKYLILIPSNLFHNKPNLKTAYSVPP 
S PNLHLP S SSNS P AS ASQ VAGNAGARHY 
AWL I FVFLVE TGYS LLV 


10910 


24811 


A 


11000 


154 


2 


IWGILQSRRCGSRL*SQHFGRPRQADC 
FTPGVPDQEYFTLLPEWFFLV 


10911 


24812 


A 


11001 


142 


367 


GVCLYQRISIYIKKTKKLSQGLFYFLLL 
LLLF* EGGSHS VTKLECSGGVSAHCNLC 
LLASSHPPTSSSQVAGTTG 


10912 


24813 


A 


11002 


60 


385 


GNTLLSHTIJRAHL IENLHAEF IDRTILG 
LPAA*LIILFPPLLIPTF*YLINIRLMA 
NRH*LIKLTSERMITIRNSIGRT*SIIL 
VSLI ITIATANLLGLLPYS ITRTT 


10913 


24814 


A 


11003 


214 


83 


SKKSASPLTFNIVFQVLDNSVREEEIRY 
TEMEKEE I KLS * FVD 


10914 


24815 


A 


11004 


241 


368 


SYLL+PSP*PLTGALSALLMTSGLPMRF 
HFHS ITLL II/jLLTNTIiTI YQ * WRDVTR 
ESTYQGHHTPPVQKGLRYGI ILFITLEV 
FFFARFF*AFYHYSLAPTPQL*GHWPLT 
GINPLNA 


10915 . 


24816 


A 


11005 


3 


396 


HEPHALGMPLTADLPSMASCSQTSLLLL 
LHLLHIiPWI *EARAYQAAGCSKLFDHAM 
LQ AHRAHQLT IDT YQEVEETY I PEDHKY 
SFLHDFQTSFCFSDSIPTPSNTEETYQK 
SNLELLRISLLLIESWLEP 


10916 


24817 


A 


11006 


156 


335 


FGCCCCFLFF * LVYCC YVMMLYYI YVL I 
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NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

Amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pnenylalanine, 
OGIyrine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S^Serine, 
T=Threonine, V^Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














SFGFYF*YWIM*YFYRLVYNIYLVLFYL 
LYMS 


i Aon 


Z**0 1 o 




11007 

I lVA// 


107 


2 


KCSSSKHFTKEDSQITNKHIEKCSS *LL 
VREMQII 


1 AQIfi 






1 100ft 


129 


412 


KTRSGFKKTPGQKGKTPLLKKTPKITGP 
GGGALKPPFSGGFRPENPLNLRDKGSSG 
PKTPTFFSPWGKK*NFFQKKKKNLLYPG 
KIFFFWMKNI 


ivyly 


Z4ozU 


A 


1 100Q 




268 


CTFKFKNHCSTIQSTT *LDRSEKKKKKK 
KKKKKKKKKKKKKGGPL* KNFSLKGG 


10920 


24821 


A 


11010 


154 


422 


" NEFSFFFFFWKPKPPFFAQVGGQYRNWG 
♦LKPRPPWLKNFSGFASGEAG ITGGVPH 
PG* ILVF*VKAGFHIENLVWNSETGNQP 
ACASQ 


10921 


24822 


A 


11011 


23 


375 


TRYLTKI KKKKKKKKKKKKKKRGAPFLK 
TPWGAPFFPGSAIFIFFFFGGFF*TPLG 
FFWETLFFWGGKILGHLSPKILPFWGKK 
NFFWVKGGKNMiNSPFIiKIFFIiGVFFKK 
FFPPG 


10922 


24823 


A 


11012 


151 


361 


ACTTTPSSFFSNIDTVSIiCCPG*VPGHN 
NSSLINFPSFLRSWYYRHAPPHPAHFFP 
ILIRSRYVAQAGLEFIiALRDTPISDCPC 
TVHYP PS LAN* S VFFFFLTQSFVSLPRL 
KRK 








1 1011 


1 AA 

1 TV 


378 


NFWFSIEFLHICY* IYSFICLIKVFL* 
GGKKFLGGGKNNS FFRNGGCF 


10924 


24825 


A 


11014 


82 


264 


TELFLYTICNWSAILYILC*RNKVFLSF 
INTEKKKKKKKKKKKKKKKKKKKJCKKKK 
KKKKKKKKIERKI 


10925 


24826 


A 


11015 


381 


2 


GRTALFTLLRGP IHRPQMFFTPLRVFPG 
SYTGI S I RQSGFF F I KRVI FFRIi* * KAL 
FLLPFLLPFFS FFLKIGTYSVAQECNGM 
IIDN*NFELLGRRNPSTSASQIAGNTSA 
Y*HTQLVFLCKDGI 


1 f\CT)C 

I vv/o 


zhoz f 


A 


1 1016 




251 


PNCI VHS LRTGTLNFI ILLLLCFTFE * F 
LNKNLIFESQKKKKKKKKKKKKKKKKKR 
GAPLKKTPGGPKFYPASKKKISPQKGG 








1 1017 


234 


1 


DINIKGSWVKGIPKPSVLSLQLFCNSKI 
FPN* KLIKKKNPNEPGVMVHVCNPSTSG 
GRNGQMA* GQE FKAS LANMAKP 


10928 


24829 


A 


11018 


107 


342 


FQLCIGAIVHCFLFMKQHSEYKSTDHRA 
SSKCHTVQGSSQLSTS AFKQK* THKKKK 
KKKKKKKKKKKKKX*KKKGGASF 


10929 


24830 


A 


11019 


264 


1 


CFKPLLFKYPGI I KAVNLP VSTASATFH 
KF* YVTFS FS * KKFLCVGGKTGSHS ITQ 
AECSGTI KAHCSLELPGPGDTPTPTSRV 
AARA 


10930 


24831 


A 


11020 


1 


288 


RTRGTSSRSRAATLFFFFP FKKGGFFNG 
NLFTTFGKLGNKKPPLNLFPGGPPLPHG 
GLKKGPGAPQVPPLFLKKPQTSPP*GLR 
EGPPPLKKFYLF 


10931 


24832 


A 


11021 


115 


287 


STCBISFGCSRGCCWDYLMVWEFSHFYF 
*FLETECCSVA*AGVQWSNLSSLQSLHP 
E 


10932 


24833 


A 


11022 


186 


1 


RPKVDKTAKMGRDQSRKA*NFKNQKASS 
P PKEYNS S LAREQNWMENE FELTEVGFR 
KSVITN 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamie Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
0=Glutamine R=Arpinine. S=5?erine- 
T^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown *=Stoo cod on /=no«ible 
nucleotide deletion, V=possible 
nucleotide insertion 


10933 


24834 


A 


11023 


180 


353 


IFCFCIFSRDGVSVC+PGWSQSPELMIH 
♦NTKIS * AWQRVPKIPATQEAEAGESLB 
PG 


10934 


24835 


A 


11024 


155 


1 


PQARRGGTCLQS * LLRRLRLRQEDRLS P 
GV*GYSEL*SCPCTPAWETBRLV 


10935 


24836 


A 


11025 




Jc-J 


KLI LL YDPOLTFI FLNTRAVVAHC!APr!C 
WP IHFVFGNYGFVFSCLFTF I FVYSGKA 
DLFFKPKKKKKKKKKKtJKKKKKK* KKGG 
RGLKNSWGGPIFWGFPK 


10936 


24837 


A 


11027 


127 


259 


GQVQWLTPVIPTLWEAMVGGSTKLRSSR 
PFFSTP*PCIVMILYG 


10937 


24838 . 


A 


11028 


37 


408 


IASGALFFFFKAAGKRGDFKTEGAYQRQ 
RTIFKNKICiiARREKTGRENT.RGNYKNMf» 
RGLKTPRGALGGPYLDKKGPLRGKGPFQ 
GGTLLGGGPKIKIOGTL* IRRNYWPYIR 
RYNRFKXRQKNM 


10938 


24839 


A 


11029 


146 


3 


LLVLKDLPRIMVRSAPTYIYIYTYLYIY 
IYIYTHTHTKTFIT*VLFC 


10939 


24840 


A 


11030 


26X 

*>vO 


43 S 


LMCLRNETN* F * KVE * WT/5AGAHTC?NPIC 
TLGG* GGRIA*GQEFETRLRNI VRP 


10940 


24841 


A 


11031 


322 


2 


KLPHLO(X)CVKVNCYKPFIiKNGKTGHGG 
VCL*SQPIiRRLRQENHSNPGQCTSAWVT 
QRDSVS KNKKKRKMEKYFSS VRQNKPVK 
HTKRMS PTKADTKS THCGGGRV 


10941 


24842 


A 


11032 


251 


380 


HKRIiHTI YYFVPLKKXMGPGL VGHTYNP 
STLGGQGGQIT*GRE 


10942 


24843 


A 


11033 


SJ*T 


124 


FWFFFFFFFFFLNPPGfiGKICJCK"RrPPPPT. 
GKKKKRGKKGGEKFFSPPPRGGGGGPKK 
ISLKRVLKFF*NPPGGTKKGGGPFLIPP 
GSP 


10943 


24844 


A 


11034 


420 


534 


CPGWAHAYNPSTLGS *GRQITIDREIE 
TILANMAKPH 


10944 


24845 


A 


11035 


196 




KNGHSL P PGPOKGTP FP KKKKTP K" INTiS 
ELGKSKAYSWPGTVVHACNPSTLGG*GG 
WIN*GQEFE 


10945 


24846 


A 


11036 


10 


248 

i 


PSDR*LFSTNHKDIGTIiYLLFGA*AGVL 
GTALS LLI RAELGQPGNLLGNDH I YNVI 
VTAHAFVIIF 


10946 


24847 


A 


11037 


32 


405 


DYVSKRKEKREKKRNVILETSISSHLVE 
WMLCSRYHPLIKKRSSVRILIALLCPQK 
VRDMS *GIGSKKQCWDS WLSI WIKMESD 
PFLIPYPKINA*RLIO)I 4 CERINLKI UK 
REYLHDYRVKKV 


10947 


24848 


A 


11038 


337 


448 


KNPRGFFGKNPFFWGGPFGGPPPPKKMG 
FGEKKKF* FKRSKKKKKKKKKK1£KXKKK 
KKKTG 


10948 


24849 


A 


11039 


307 


415 


R FVCS T I KVLRDLS SDRSNPGRVLSTSN 
SSLK*KKK 


10949 


24850 


A 


11040 


213 


403 


VHRGIKYYFLNKLPDFPRQTFKK*G*RL 
GAVAHACDPSTLGG*GGWITLGQTFETS 
LTRMGKS 


10950 


24851 


A 


11041 


104 


2 


SAFFF* ETES CS VAQAG VQ WCDLS S LQL 
PPPGFK 


10951 


24852 


A 


11042 


177 


3 


AYTAY ICVCIHIYLYK* MYIC INVHVCK 
ITYLYI YKYLKYNAI YLHIGFFS FLHTI 
S 


10952 


24853 


A 


11043 


314 


395 


TVYILNLSINSVQSVLLSVFHRLRHQPR 
GRVQWLTPVI PALWEAQAGGSP * VRISQ 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGIycine, ENHistidine, I=lsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

\^-—VJlUlitIHllIC, IV giuim^, «J c»d ill**, 

W=Tryptopban, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














XXvXJTA 


10953 


24854 


A 


11044 


310 


403 


RGKGGKKPRNPGGGKS *ALTKKKKKKKK 
KKKKKKKKPGGALKKKPRGGQKKTGGGK 

RGKTPPKKKKP*GKKKNLKGKGGKKTPK 
SGGGKKFPPRW* KKKLPPGG 


10954 


24855 


A 


11046 


181 


3 


EGPLSLSLIjFFFF*RQISSIiLLRIiKySG 
T I IAHCS FKFPS SRNP PTSAS QAASTTG 

VTlXl 


10955 


24856 


A 


11047 


392 


41 


KBSRSLSQGGREKGPFKFLAPPAPKFKR 
Ti?oDuoT.K'in'/5Pi?'r;ppp55PS* iTjSIiKKK 

X I? tvlr fur illUvLiwXT IVVJc C rur thj AJJwUiuu\ 

GGPPIIARWFWNSCPQGFPPPGPPKRLG 
PTTRf?PPP P PIjF* KKKP P FliGGENOKVKN 
FFFFF 


10956 


24857 


A 


11048 


147 


307 


TVT FVFIjLRKGLTIiIiP rlesgm imahlk 
S * PPGIiKQS SCHSLLSSWDYRRRRG 


10957 


24858 


A 


11049 


118 


I 


HGKIPSPLQ*HK* IAGYGSACLQSQLLR 
RLRWESRSISI 


10958 


24859 


A 


11050 


54 


284 


RKIRRGGIiHLWSNLLGRLRWKDLLGRLR 
WT7T*iT?T APPPf^TiP^KPPTinHCPPAWATE * 
DPVSKKKKKSLFVEKPQGGG 


10959 


24860 


A 


U051 


130 


340 


WNMWFAAHGS RINFDFFFFFFFFERGFP 
FCPPVGRAGTHFGLLEPFPSRV* KQYYW 
LGTVAHVCNPSTIiGGHVGES * GQEFKIS 

±Ar\_T*X V*\. 


10960 


24861 


A 


11052 


140 


337 


NI IMFFFFFFETVSFIiLLGLEGKGTIWV 
N*NIiCLPG*GDSPLGCS 


10961 


24862 


A 


11053 


197 


42a 


TP QTNNPTVASOYSSEWKSHI PLILNGN 
LDMIKFSEEGMLKAKIGLLRQTVGQWN 
AKEKFLKKNQSATPLS I * IRK 


10962 


24863 


A 


11055 


180 


1 


AI I ALLYSRLGDYARRFHLRRKKEQWQG 
wr\rm\rv~nv <z TX jnaoaci'R TA* TOE FKS SL ■ 
GNIA 


10963 


24864 


A 


11056 


158 


426 


PWPT.Pl»PTiPTVSLFPRSPSDAEPKLDCT 
AAISAHCNLPA* FS CLSLPSACNCRRAP 
PPT iTASAS RGAGIADGVS FTOCSMVPRL 
ECSGV 


10964 


24865 


A 


11057 


220 




YALHLNMKNNS YF * MRKKKKKKKKKKKK 
KKKKKKKKQNKKDPGGAVYKKIP 


10965 


24866 


A 


11058 


162 


464 


SCSVGLKLFSMKTSLALSYYCLLLLLTL 
HYYYYYYYYY*SLVFM 


10966 


24867 


A 


11059 


1 




APENRVDPRVRKTLVPLI LPI ITTLANP 
CKKD*YPYYVKIS1AC 


10967 


24868 


A 


11060 


382 


273 


" SSCL*PQLLRRLRQEDYLS*GA*GCNEL 
* *CHCSPAWVTEQDPVSKNIHTYIHSQS 
INYGSMIGYIHEQKGI IADHKP I IAEPE 

U%J X X C 1WU X 


10968 


24869 


A 


11061 


298 


378 


SCSASPCCSSRHGWSPCX5*QTCLLIiAIA 
I^(^PWLREAGALHTGPGaiLFDHAMLQ 
AHKAHQLVIDTYQEIGENYIPQDQKNSF 
LLESHTSFCFSDS ITTPSNMEETQQKSN 
LKLLRI SLLLIETWLEP VRFLTRI VT^IN 


IU5/&9 




A 






486 


RPTRPDAYHIVKPSP* PLTGALSALIjMT 
SGLGMRFHFHS ITLL I LGLLTNTLTI YQ 
♦WRDVTRESTYQGHHTPPGQKGLQYRIM 
LFITSEVFFFAGFF*AFYHSSLAPTPQL 
GGHWPPTGITPLNPLEVPLLOT'FGILAI 
GGS IT* AHH 
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SEQID 
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sequence 
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Predicted 

beginning 

nucleotide 
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correspond 

ing to first 

allllUU UvJu 

residue of 

npntirip 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

ulug IV 

last amino 
acid 

residue of 
peptide 

JCUUCIRC 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=Isoleucine, 
K=Lysine, L=Leuctne, MNMethionine, 
N=»Asparagine, P=Proline, 
^vjiuutinincj iv— Arginine, o = oenne, 
T^Threonine, V=Valine, 

W=Xrvntrtnhftn V=Tv r rwc i n t» 

X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 

n it rlfofiHf itiQPi*iinn 
HUlildlUUv llloCI iiun 


10970 


24871 


A 


11063 


269 


429 


ATWQKLPLQIQKNYPGLGLVAHACNPST 
IiGGQGGRIT*GQEFLNQPANMVNTH 


10971 


24X72 


A 








I IEGOAOWLMPVI PALWKAFAGfiQPRVfi 

X J-oUyAyn i_*7iir v x cnXSf% *W£imj\jQ zr la V Vjr 

SSKPA* P CDP PAS ASQSAG I TGVSHRTQ 
LILNS 


10972 


24873 


A 


11065 


176 


1 


IiFPPPKKVPPWGNTQFKSICLYTTNFFP 
FF/TBSRS VARLECRGTISPHCKF* LPGS 

PR 


10973 


24874 


A 


11066 


2 


397 


MVLWTAHLRAMAPGSRTWGLLTFALLCL 

HQLAIDTYQEFKETYIPKDHKYSLLNDS 
QTSFCDSDS IPTPYNMEETQQKTNLEVL 

PMSTiT»R TDQ *T.RPAO<3MlfP 


10974 


24875 


A 


11067 


172 


399 


QILYL*KYAYTVTMQHLSHPIL*KKKKK 
KKKKKKKKKKK1CKXXKKKKKTGR 


10975 


24876 


A 


11068 


192 


3 


PKLTPMLKLIVPHI IIjLPLT*LSKKHI I 
* INTTTHS X 1 1 S 1 1 PLLFFNQ ITSNLFS 
CSPTFC 


10976 . 


24877 


A 


11069 


204 


345 


DKSGEHS KTSS IQKNIjRIXSAVTHTCNPR 


10977 


24878 


A 


11070 


3 


416 


HELPQPLRVLWTAHIX^TAAGSRTSLLL 
MLQAHRSHRLGIDTYQBWETYIPEDRiC 
TSNLELXiRICLVLIDSWLEPARILTS 


10978 






1 1PI71 
1 IU / 1 






pqp FP??FTP<?r<;VT * nOTT ,UP nT .r^QT /"> A 
rorrcrciiDiDWvi rto ViJMXvL'ijV^oiA^/\ 

PLPGSSC 


10979 


24880 


A 


11072 


400 


278 


KVK*LRQENQWNSRGRDCSEPRSRHGTL 
PWTTEQDS IPKNKTKPYKIS 


10980 


24881 


A 


11073 


139 


3 


KLAGCGRAHLSFQLLPJUiRFJBDHLSPGG 


10981 


24882 


A 


11074 


17 


421 


DHVIGQPRWLMPVI LALWEAE PGGUjE P 
KVQDQPGQNCKSLSLLTIKKKRRENFPG 
RGGTQLCSQLLRRLRVGCCIDPKLHNCL 
PAWMTERDPTFKRE * KKNDVLEAKNFFT 


10982 


24883 


A 


11075 


412 


1 


QFFFLGVCCPTQI FGVGKKGE KKKKKRG 

FPPR VRFRFWPO T F CPPVFPfiP WWT/ZV 

FPPFFGERFLFFSKSKFFFPGVFKRGFP 
FPPKNFFFF*RIF*KFWFSPPKPFFFFF 
FFF*DRASLC3IPGWSTVAQS*LTSC 


10983 


24884 


A 


11076 


279 


410 


VFCCVAGPJX3APGVKAHFPAAQKPWDGG 
FPGP * SAGVGPPP I PR 


10984 


24885 


A 


11077 


12 


386 


IAHLLLFS FYHKDTGTLYLLFGAGAGVL 
STALS LL I RAELGQ PG I LVGNDH I YNVI 
VTAHAFGI IFFI 1 1 PI I IGCYGN* LATL 
I IGAPDMAFPRINNISFGLLPTSLLJjLL 
AYATVDAGARTGW 


10985 


24886 


A 


11078 


412 


3 


LAIl^LSLSLGFPYSLRYKNIKIRPINN 
PTMAS KCS S KRKTRTS FTLNQNLEMLKL 
IEKSTAKAKRLKVRPLVPVSQWNAEEK 
FLKEIKSAIPLNIQMIR+QTSLIADMEK 
VSVL* IKDPTSYNIPI*SQS*IHSK 


10986 


24887 


A 


11079 


78 


443 


RHRLPSDTCIjTAAVP * PCKGSKKKKKKK 
KKKKKKKKKKKKKKKKKKKGGGGFKKNS 
GGGQNFAGGKKI FFFFFGGVFKNPLWI F 
WKKTFFWGGKISGQLSQKKLSLWGEKKI 
FWGOGGETSF 
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M 

eth 

Od 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding tu 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l=Iso1eucine, 
K=Lysine, D=Leucine, M=Metbionine, 
N=Asparagine, PHProline, 
0=Gliit»mine R— Areinine, S=Serine, 
^Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, A=possible 
nucleotide deletion, \=possib!e 


10987 


24888 


A 


11080 


395 


2 


KRVPKKKPKKNSPPGGRKGGSPP*SPPF 
GGPDQPIF*VGGFDPPWLTPENPFFFKK 
KNN* PGKGGMPIiILKSLGGWGKKKIFTP 
GGGLPKN*KETPPPPPGGKKKIPFPKKK 
KKKKKRAAARDLELVPTRP 


10988 j 


24889 


A 


11081 


24 


182 


TVSTTKCFSLLIPYYGRKDME * KI * I IQ 
fQQCKKKKKKKKKKKKKKKKKKKKKKRGG 
PP 


10989 


24890 


A 


11082 


173 


3 


NNVI CYILY I PS VPKYKTTIjNMRGSAFF 
FFFFF IF* DGVS LCHLGWSAVARSWLTA 


10990 j 


24891 


A 


11083 


284 


391 


SVAHACNPGTLGG*GEQITRGWEFETSL. 
TNMEKARL 


10991 


24892 


J\ 


1 IUo*t 


Z J 1 


3 


IKKLREVNMLGWLYYMREEARPVDYVHW 
RGPEYSSLTKTIRNVTVKGHQHY* kas a 
GRGGSRL* SQHFGRPRRADYLKSGVRDQ 


10992 j 


24893 


A 


11085 


42 


171 


ARRMRAGIHRVSQEGLDLLTS*SAHLGIi 
PKCWEDRRE PLRLA 


10993 ! 


24894 


A 


11086 


176 


325 ! 


FKRSKKKKKKKKKKK3QCKKKKKKRGGAL 














RGGALLKKFLGGPQISGGGKRFFFFF 


10994 


24895 


A 


11087 


67 


379 


KKKKKRKKPKT KKRTGPVRQNVRWGRS L 
<3PGRAPAEGPOGGSTESPPI*I*VGRCLIL 
GLREPGVPPAAYGRFLRDYMNSIQGKLE 
KQRPQ*GLPSLTGAAELIKH 


10995 


24896 


A 


11088 


128 


326 


KKKSGGGVFFFI IGGGGHFFGGGRFFFF 
FFLGGFFFKLFFFF*EKFFFLGGNIiVKH 
SSKKSSWVGW 


10996 


24897 


A 


11089 


109 


1 


KTHSRPGTVAHTCNLSTLGGQGEWIT* G 
QEFETSL 


10997 


24898 


A 


11090 


52 


347 


DHSLLELQMKLYASAQAAITKCHRMGGL 
NNTHFLFIiFLSLFFFFLKQSIiADPPRLK 
GRGPILVN*KLPLRGKRDSPASPPQCVG 
ITGLAHPPRLILFF 


10998 


24899 


A 


11091 


386 


AQ 


IHLQPPliQICTLKFFTNKNPHLFLTGCP 
KYALIFLCFGALVKTGFHHVGQAGLQLV 
PLCPPRLGLPKCSD* RREP PHPAKKQFF 
IDRVMLCYPGWCPALGINPSSPVGLPKR 


10999 


24900 


A 


11092 


1 A.A 
104 


J 


VYSLLbRCPFS * AUTAKF INRCTYIHVC 
MYMCEPYAYTCLYTSTHIHVCMYIY 


1 11000 


I 24901 


A 


1 lAOl 

11093 






(XKLQPFGLGVVAHACNPSTYFISYQ* L 
SPVIPALWEAKTGGSPEVRSSRPARP 


11001 


24902 


A 


11094 


162 


3 


UjGLKRPI FFFWGGKFFFFFFFFFE I ES 
PSVAQAGVQW* CNLGSLQPASGRV 


11002 


24903 


A 


11095 


259 


358 


LRPGTYADTCN*GQARWLIPVIPALWEA 
KAGGSLEVGSLRIiAWP 


11003 


24904 


A 


11096 


254 


2 


SWGFI PMEERERENRYWDRQQGHATVSN 
IVINAFLYRYTEREKPGVSLSPRVECSG 
MI VAHCNLSLVGS S HS P AAAS * VAGTTG 


11004 


24905 


A 


11097 


132 


3 


KKKKRQ PRWAHACNPSTLGGCGGW IT* 
GQKFRTSLVNMVKP 


11005 


24906 


A 


11098 


2 


A Z< 


rjtfLEQLIEPLEVLWTAHLEALDPSSRTF 
VLLAFALL*tiPWLQSAGAVQTVPLSRLF 
DHAMLQ AHRAHQLAI DTYQE FEETY IP* 
DQKYSFLHDSQTYFCFSDSITTPYSMBE 
TQQKSNLEIiLRIYIiLLIESWLEPVQFLM 
SMFGNWLEYDT 


11006 


24907 


A 


11099 


2 


375 


QLPEPU4DLWTGDLVAMAPGSRTSLLLA 
FVLLWLP* LQEAGAGQTVPVSRLFDHAM 
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nucle- 
otide 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, l^Iso leucine, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W==*Tryptophan, Y-Tyrosine, 
X s Unknown, * s= Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LQAHRAHHLAIDTYQEFEETYI PKDQKY 
SLLHDSQTSF* FSDyiPTPCSMEETQQK 
SNVELLR I CLL V 


11007 


24908 


A 


11100 


1 


359 


LQLPKPLTVLGTAHLPAMCLAYATAL I L 
GCALLCIiA* VE * ARAAQTVPQSNF* DHA 
MWAHRAHQLAIDTYHEFBETYIPQDQK 
YSFLHDYQTSFCLSYSIATPSNMEFTQQ 
KSNLELL 


11008 


24909 


A 


11101 


1 


370 


LPEPLTDLWTAHLVPMAPGSRTSLLLAF 
ALLCLPWLQQAGADQTVP * 3 WLFDRAML 
QAHRAHQIAIRTYQEFEETYI PKDQKDS 
FLHDSQTSFCF* DS IATPSNMGETPQKS 
NLELLP I SLLL 


11009 


24910 


A 


11102 


2 


361 


AHLIAMAPGYRTYLLLAFALLCXaPWXiQE 
AGAVQTVP* SRLFDHAMLQAHRAHQLAI 
DTYQEFEETYIPKDQKYSFLHDSQTSFC 
FSDSITTPCNMEETQQKSNL*LIRICLL 
LNESWMEP 


11010 


24911 


A 


11103 


8 


366 


PLRALWTAHLAARAPGSRTSLLLAIALL 
CLPWLQEAGA*QTVPISRLLDHAMLQAH 
RAHQLPIDTYHEFEKTYIPKDHKHSFLH 
DSQTSFCF*DSIATPSNMDETQQKSNLE 
LLRIFLL 


11011 


24912 


A 


11104 


3 


341 


EPLRDLRTAHLVAMAPGSRTYMLLAFAL 

HRAHQLYIDTYQEFEETYIPQDQKYSFIj 
HDSQTSFCFSDSISTPCNMEETHQRSNL 
E 


11012 


24913 


A 


11105 


240 


329 


RNWPGAEAHTYYPSTLGGRGRQI T*GRG 
FE 


11013 


24914 


A 


11106 




116 


HEERERERERERESAIVROIHTOGEGIiT 
ELVFPYRRPAPSGLKFTFVYVQEHILGV 
GGNGLSPLQIGVILRDGRGLAHIROGTR 
NKILRILKSQGRAPDLS*DLY 


11014 


24915 


A 


11107 


169 


1 


E DGHHSGLHS KQQ KQ PAW P SMVTHTCN P 
NTI/3GRGGWIT*GQEFETSLTNMEKPCIi 


11015 


24916 


A 


11108 


192 


355 


GMSLTGHSNY 1 1 LNLNFLWP CVVAHTCN 
PMTLGGQGGRFT*GQEFENNIiAYRGK 


11016 


24917 


A 


11109 


145 


383 


EVNKDKGKYHVSQGIQSFSYVAYVYFRK 
LIYTMMTIVNSALLYTELVI CQGRRVLV 
FDFGF* FWFFETESCS VIQAGEQ*CDLG 


11017 


24918 


A 


11110 


199 


400 


TRAGFFPPFKKFLKGPGPGAPPFNLNFL 
GGRGGGFF*IPKKKIYPLIjLPFLKVQKT 
PSKPPFFFFFFFLRQVLLCCPGWSAMAR 
SRLTRA 


11018 


24919 


A 


11111 


373 


177 


QFETSIjGTMSGPS fylkn* ks ARCGHTC 
MRFQLLQRQSREDHLSSGVHRCSNP*LH 
RCTTTWVTK 


11019 


24920 


A 


11112 


204 


399 


NHFWDWYHFSSPFFFFFFEKNLCSVPR 
AEDQGMNLS * LNPLPPGLRQFFA* PSQE 
AGIIGPGHH 


11020 


24921 


A 


11113 


192 


16 


KKTEKNKR*RGYGETETPAHCGRACKLG 
* PLWKPV*QFLKKLKMKLPYDPTI PLPA 
IFS ! 


11021 


24922 


A 


11114 


113 


369 


AGGQGPNLFLQKPGPPGLKGPPPLPLPG 
GGNYGGGPPSRGIF*FFEKKGFGGVPKV 
GPNLRG*RDLPAPPSXGAHPLGNGFFSR 
N 
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peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine t 
T=Threonine, V=Valine, 
W=Tryptopnan, Y=Tyrosine, 
X=Un known, *==Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11022 


24923 


A 


11115 


250 


387 


I LCDVEHS LWPGVEVHAYNPSTLGGQDG 
RiA*AQEPETSLNNIARL 


11023 


24924 


A 


11116 


192 


1 


VCDLFWKPSPHALTRRRTS VFHLKR* FS 
WIiGAVAHTCNPSTLGG*GFWRIT*AQEF 
ETSLGNMV 


11024 


24925 


A 


11117 


289 


202 


VFYCQ* KPIRKKKKKKKKKKKKKKKKKK 
KKLSKKI KKMDQQKLRSPTI K 


11025 


24926 


A 


11118 


412 


421 


RA* PTEQVTLG ITEQSYSRGH INNRVHD 
LDVGS*HPDGAAAIKGSFVQRLKGY 


11026 


24927 


A 


111 1 c\ 


1 A. 

Id 




DRLKPSATHAFADAYHTGKANP* PLTGA 
LLTLLMTSGLTM*VHLHSITLLIbGLLT 
NTLTIYQRWRBVTRESTYQGQHTPPVQK 
GLRYGIILFITSEALFFAGFF*AFYHSS 
LAPTPQLGGHWPPTGITPLNPLEDPLLN 
TSVLLASRVSIT 


11027 


24928 


A 


11120 


304 


346 


GLWHATGITP*PLTGSF*LLLMTSSLAM 
*FHYHSITLLILSLLSSTLTIYH*WREV 
TRESTYQGHHTPPVQKGLRYGI ILFITS 
EAVFLARFC*AFYHSSLAPTPQL*GLWH 
ATGITPLNPL 


11028 


24929 


A 


11121 


174 


420 


FISALLKSAIQRNNAMFSTYYALTCTVG 
TAVSWSSVS*PQRNMMMS*VHAPAMAGR 
LVCHY IRVYRRFEHVCKRAVHGQYP 


11029 


24930 


A 


11122 


299 


466 . 


IFILHEICITFFFETRHFCH*AEVHWCS 
LGLLKPPPPGLRCDPSTSPSRVAGTRPS 


11U3U 


my 31 






no 
1 Jo 


i 
i 


NPFNEKNPPPKGPGPKNAFFFFFF©4ES 
CSVAQARVQ*HDLGSLQP 








1 f 1Z4 




4QQ 
**yy 


NQRERS *HLYLL YNTVCLTHGLPILI I Y 
SAYQLAIYDLLLRRDALARLQADIiADRR 
RGLQPKKRFAFKTRRKDAASSTKVDAAP 
GI 


11032 


24933 


A 


11125 


252 


391 


PLSLDLQSNASLSHFTSPPLMFADR*LF 
STNHKNI * TLYLLFGAWA 


11033 


24934 


A 


11126 


45 


459 


AFLKNHPIiET*SNHSFIDLPTLSNISA* 
GNFG*LLGACLILQITTGLFLAMHYSPD 
ASTAFSS IAHITRDVNYG* I IRYLHANG 
ASIFFICLFLHSGRGLSYGAFLYSQT*N 
IGI ILLLATIATAFIGYVLP * GQISF 


11034 


24935 


A 


11127 


109 


301 


QDSIiMRKKSKNRSPSHER*RSKCLERKR 
SGDKKKKKGGPPKKKIGGFPFGGQMELK 
PFSILGGF 


11035 


24936 


A 


11128 


322 


2 


SCHHVSSLGDRARPCLQKEKKKERKKVG 
DFNTPLS IMDGYQHWYQARKI I IMDGTT 
K* KITRJC* KIRNQ* DI/TDVYRTLHPRVK 
YTFLSCAYGTFSRTHRIRGQAI 


11036 


24937 


A 


11129 


182 


2 


FLPRGFQPPPWVPGPLGFIiFPPPSSRD 
PGFPSMPFFFFFFFETESCSVS*AGEQW 
RNLG 


11037 


24938 


A 


11130 


138 


1 


GWVFFLWFSFSFLFFFLR*GLSLSPTL 
ECSGAI TAHCSLKLPGLS 


11038 


24939 


A 


11131 


304 


83 


WKKTDFPRKGRNSASRLPSDSNCNSSLS 
IjQGLSLLYRFWTYHAVTGKGSQSRPQEQ 
VLGYHARKNSR*VHRVK 


11039 


24940 


A 


11132 


247 


225 


KNFWGGPPLRGGKKNFFFFFGGGYKK*L 
LLCLVNMYI *CV* YI *HKRI ISDPKKKK 
KKKKKKKKKKKKKKHCKKKKKKKKKKKK 
K 


11040 


24941 


A 


11133 


242 


3 


KKKKIFFPPRYFWAPPYFFFLGPPPFFF 
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C=Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, HHHistidine, 1-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
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y=v»iu famine, K— Argimne, o— oerine, 
T=Threonine, V=Valine, 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFFFFFFFFFFFFFFFVF*TQFHIiS 


11041 


24942 


A 


11134 


361 


2 


ASQSAG I TGVS HRTRPTHGFKDT * RKPA 
EPBVTIH* 1*1 TLMSCNIESPEKVCADL 
X KGAK..fa KN LiK VKK FAGTP 1 KASRITTRK 
TPCEGSNTRDRFQMRIHKWLTDLHSPFD 
GRVGGRV 


11042 


24943 


A 


11135 


1 


370 


I RQT F FQKRE I VG I I LCYWNVGY YSQLH 

*CCUMT.UTT?1417'VT5'\/VTC3T7KT7TT \J~TT \roTJ 

EKKKKKKKKKKKKKKKKKKD 


11043 


24944 


A 


11136 


273 


2 


GGRAGQQILFCRGIWLGPPRSCQCTFRS 
P PGLiAXj * P WbC rb JjP PALKX Rr PS FIi PF 
FFEMEFSRLSPRLECNGVTVAHCNLRLP 
GSSNFP 


11044 


24945 


A 


11137 


130 


354 


GQIiNKLSGPYPENVGYTLPVLINPLAQP 
VIYSTIFAGTLITALSSH* FFT*VGT»EI 

XTMT 7\ t?TTYt7T WVlTWWflT 

WMIiAF X P V u rKKKKKGG PIj 


11045 


24946 


A 


11138 


366 


2 


LVEMGFLRVRQNGLYLLTS * SARLGLSK 
CWDYRREPPCPASDWVFILTS PL IHALD 
GKEHTHTHTHTHTHTGLGI CQS S DGKQS 
GGWGWLS ANRGQFS P FAVCLWS FLPEV 
PWTSALFT 


11046 


24947 


A 


11139 


228 


1 


CRAGRGLPAAVLI ILFPPLLI PTSKYLI 
NNRLITTQQ* LI KxVTSKQMIAI PNTKGR 
T*SLIIiVSLI I FKATTNLLG 


11047 


24948 


A 


11140 


157 


337 


HIPTPPHTTCPERPS IRHNS I YYIJCRYC 
L1^IILSLLPLQTSP*PLTGAIjSIjIjIjMT 
SGLSM*SHIjHSITIiIiALRLLTYTLTIYQ 
* VVKUVSRHS i iQRHHTPPVQKGIjRYGX X 
VFTTSld)IVFSGLF*AFYHSRLAPTTQL 
♦CHWPPSGITPLN 


1 1 t\A9. 


Z^y^y 


TV 

A 


11141 


188 


3 


X PT FUGPP PXjP PQKVLiFKPGNPRAPPGV 
FFFFFETDSRSWRLECSGVI *AH*NLH 


11049 


24950 


A 


11142 


224 


3 


SVSSPCISNTHKVDVRSSALLQESTSSR 

T7l7MTYTOT?tIVT TTVT7T*T.DOT ROCADT 7\r« 
Ei v in J. Vjxlr niJjll ir» j. " LiKoiiAyonKiiAL 

SGTISAHCNLRLPGSSN 


1 iUJv 


ZHyjl 


x\ 


1 1 i/ti 




ill 


OiUvjrVjV viUuiinro i JJobKVjluvX X vsy 

EFETSLTN" 


11051 


24952 


A 


11144 


244 


379 


TIGKQRCHAGIEEVSDSSQAPWLKPVIP 
AL* EAEAGGSLEPRSLR 






A 

A 


111 AS. 


1 jy l 


1 JOD 


UKiJiioijljr»riri± ir\j V XxrnUnJN Xac XJUg X X 

F t EAESHSVAQTGLQ*RHlX3SLQI,PPPQV 


11053 


24954 


A 


11146 


284 


1 


GRGRQGGDSSGKGEGNRAMDITGGLVGK 
KERWD*GL*GGRERERERERERERERE 

XVOJVOi XvCi XvA O U V 


11054 


24955 


A 


11147 


64 


286 


THALAWTRNQRIGSGHGAMCLYSHLLRK 
LiRQNDHLTPGV*KCLEL*LHH*TPTWTT 
KPTAKAIFFNLKKKDSWD 


I ivJJ 


^4iOo 




1 1 1 A O 


230 


358 


vi> ViaXiXXr X/ii liVXiXiGijXiPxbr i. Pi iyij 
SINIiSMAIPL*AGAV 


11056 


&tyj/ 


A 




R1 


JO/ 


OTLRTKMNKNWAS L I DPTI LGT.PAART, 
I ILFPPLLIPSCKDVINNRLMTTQQ*LI 
ELTWEQMITIHNTKGRT*CLILECLIMV 
IATTKLLGLLP 


11057 


24958 


A 


11150 


257 


2 


SVCAVITKYLRLGCFADAADTQRPLPPA 
TVKLTMFFNIATHGEPLGCIF*LFVDKV 
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PKTAENFHALSTGEK* FSYKGSCFHRIL 
P 


11058 


24959 


A 


11151 


257 


1 


WRCGEKETLIRCWWECKLVQSIjGRTVWR 
FVRELKLELPFNPAIPLLGIHPKEKK*L 
DPCTCMFIVALVTIAKMWNQPKCPSMDL 

v 


11059 


24960 


A 


11152 


17 


367 


NHKDGKKTEEKKQSTS PPQKDCSSSPAM 
EQSWTENDFDELREEGFRRSNYELQEBI 
QTKGKEVKNFEKN* DECITRITNTENCL 
KELMELIOUCARELCEECRSLSSRCDQLE 
ERVSA 


11060 


24961 


A 


11153 


213 


2 


TFCHFFFFFETESGSVAQPGVRWCDLGS 
LQAPLPGFTPFSWRQRLQ* AE I VPLHSR 
LKLHFKKKAGVFSC 


11061 


24962 


A 


11154 


1 


347 


GTSNHFPFIjYTYQVHNSTDYPRNRCCLN 
PSLRFHTSSKPLPARPHIMTHQSHAYHI 
VKPS P* PLTGALLALLMTSGLAM* FHFH 
SITLIjILCLLTNTLS i yq*wrdvtrest 
YQG 


11062 


24963 


A 


11155 


223 


3 


YSTNLKIIiRNHFILFF*DRVLLCIiPGWS 
AVASSRLTANSASCLSLPSSWDYRRAPP 
HPANFCTDRFHHVGQVG 


11063 


24964 


A 


11156 


143 


1 


RGFLNQCPSQNFPFFFFETESCSVTQAG 
MRWYDLS *MQPPPPRFKLV 


11064 


24965 


A 


11 ID/ 


1AO 


jjj 


LQPLPPGFKRFSCLRLLTSWDYRKDVDS 
ALLSNYVT*KYISKTIjCRHQSPEIVREF 
LiTAMKSHKLTKVG 


11065 


24966 


A 


11158 


156 


370 


RSDGCQKIPRNMIiGGWSVIjHHTVDEQ*G 
PNRY I KKCSASL I IREMHI INTVRRCLT 
PATMATVRRKTTSVG 


11066 


24967 


A 


11159 


298 


4 


hfqwimwrqlvkgrlqaqknaaftdsh 
thlcyi»kcvyfcvcvcmcicvfvcv*sr 
al**slyliflcikavkaghsvsglesqh 
ygrlrqadhlrsgv 


11067 


24968 


n 
A 


111 CCk 

1 HoU 






NSWGKIKILKGK* PPPPKKIFFPKYPQS 
FFLSPP*KKKTNFPHPRKYLSPPGIFLK 
APPPLI FFFFFFFFFFFFFFFFFWG 


11068 


24969 


A 


11161 


198 


406 


KSVYMTLKKKRGS FUERGDVPAICNFFF 
* IHIYMCVYIHICVYIYIYTHIYIYTHT 
QIYIYIYIYIYIY 


11069 


24970 


A 


11162 


95 


376 


KIITKHNIARTNPYTFCIMN*LEITLQG 
EPKIiRPPKPDKKKKKKKKKKECKKKKKKK 

KIRGGEKI FFFFLGGD 


11070 


24971 


A 


11163 


343 


391 


WR*LSSTNHNDILTLYLLFRARAGVLRS 
AVSLL I LAEQGQPGNLLGDDHI YS AIDP 
AHAFAIMFFIVIPMIIGGF r m*LIiAVRN 
GAP™AIPHINNISC*RLPPSLLLIiLAS 
AIYEAGACTG*TAYPPLE 


11071 


24972 


a 

.t\ 


1 1 104 


i 


1 JO 


QPMTPNRGPLSP*PLTGALSALLKTSGL 
AM*FHFHSITLLILGLLTNTLTIYQ*WR 
DVTRES 


11072 


24973 


A 


11165 


265 


466 


KIiLTS LS FFNLLRETNQVGLLE LSQAKY 
KKTPRI^TVAHVCNPSTLGGQGGWrT*A 
QEFETSIANMA 


11073 


24974 


A 


11166 


204 


458 


KRCLLPLPPDSNTQSSQCRREQIGRAIR 
KI LTFETQN I KRLKTE KQLLKLLE * SGK 
ITAHCNLDLPGSSS PPTSASQSAGITAG 
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S 


11074 


24975 


A 


11167 


256 


3 


KILFFKPGVLSPPWGGVNFFQLMPFQGG 
PPKKKRGPLSPPKKGPFFFFFFFF*ESR 
S LLPRLE CSGAI STHCNLCRLGS SRTRG 


11075 




A 


1 1 loo 


i 
l 


4^7 


nxviiro v n. i it L\\^.r\\j u r r\. vivo ii'sn xvi_/ iuiij 

YLLFGA* AGVLSTALTLLI RAELGQPGN 
LLGNDHIYNVIGTAHAFVI IFFIVTPII 
IGGFGN+LVPLI IGAPDMAFPRINNISC 
* IiLPPSLLLLRASAIVEAGAGTG*TVYP 
PLAGNYSHPGAS 


1 1076 


74077 


A 


1 1 169 




47 ft 


ALSLLIRAELGQPGNLLGNDHIYNVIGT 
GHAFVT I FFIE I P 1 1 IGGFGN * ItVPL I I 
GAPDMAFPRINNIS F*IiPTSLLLLLAS 
GIGEAGAGTG*TVYPPLAGNYSHPGAYV 
DLT 


11077 


24978 


A 


11170 


91 


2 


PRQDPSFFFFFF*TESGSVAQAGVQWCD 
L 


11078 


24979 


A 


11171 


219 


1 


IFKSKILCIiPPPEPNFFFFFFFFFFFFF 
FFMRHGGACCSPSYLGG*GERMT*ASRV 
AGTTGVCHYAWLI FVFL 


11079 


24980 


A 


11172 


292 


365 


RI *SLGQAQWLTPVI PTLWEAKVG 


11080 


74981 


A 


1 117^ 


7S1 


i 

i 


I CDTjO I T.THGT TTCVKMGLTTIj FVSG T T 
FLIHHSIxAIKFGV*KOTS*LGTVVHTCN 
LSS *GGRGGRIA* AREFETCLGNIVRP 


11081 


24982 


A 


11174 


200 


407 


KIFETDVYVKCSNIIiCLWASISFSNYFG 
GPFYLNL* *FLSYFSFLETESHIVAPLE 
CSGMILAHCNLCL 1 


11082 


24983 


A 


11175 


11 


155 


STNFFLFSETGSGSVLQAGVQWPGLDLL 
AS SHS PAS AS * VSGTTVDAA 


11083 


24984 


A 


11176 


140 


1 


PSTFPDWAMRRVS CIjSKGDKGQV* WLT 
PVTPAIWEAKAGGSPESI 


11084 


24985 


A 


11177 


210 


1 


TDEELLLMDEQRK* FIEMESTPDEDAVN 
IVEMPMKDLSYYINWD*WAGLKRTDC 
SFERSSTVGKMLSN 1 


11085 


24986 


A 


11178 


133 


148 


STSTR*LFSTSRia)IGALHLIiFGA*AGV 
LGTAVSLLIRAWLAQLGSNKEA 


11086 


24987 


A 


11179 


236 


] 


KFWKKDPLTKGGCPLKKKGGGGIFSKKK 
KNVFFPKKKFKGGKKKNFFFFKKKKKKK 
KXKKKKNKE*LFISFAHIAVHL» 


11087 


24988 


A 


11180 


210 


1 

> 


QSDGQENIjQVFFPLFGP* PNYLDKGEKW 
IRVGKIWGHtiFFFFETESRSVAQSPRLE 
CSGAIAAHCMLCLP 


11088 


24989 


A 


11181 


292 


348 


KKGALFFS PGG* KKNFPPGGKKKGPLFS 
KKKKNPP*KKPGEKEKTRFPPFLGGLGG 
GFPLAPGGKPPRNRVLAP PPNPGGKKKK 
KTPFSQKKKKKKKKP 


11089 


24990 


A 


11182 


168 


"3 

•j 


KPEASMLRMDFFFF* EAGSHS VAHAGVQ 
WYYCSSLQPPTPELSDPLAPSSKVGGIT 


11090 


24991 


A 


11183 


87 


351 


KGIKWSLNIQRGVQPHLRSYNYSFFSFF 
S *DWVLLRLECSGTTTAHRSLNLPGSSD 
P PTS AS QAAGTTGMHCHTRL I F * ETGFH 
HAVQ 


11091 


24992 


A 


11184 


55 


344 


TSLNLCFFFFFWGEKSPFLPPGGDEGPL 
FGQLDTRPWGIEKI FFAPPP * KGKRGVP 
PGGEE FKNP PAPKGE P PLS * KK* NQ PGQ 
GAPPCYPPFLGGGAKKIFSIPQGRVSN* 
P KRGPSS P PGGKKGDFS PQKKKKKQRFR 
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EV**PNSPSGAP 


11092 


24993 


A 


11185 


344 


190 


LVETRFHRVSQDGIiNHLLTL*STRLGLP 
KCWDYRCEPPRPANTANSLNFPM 


1 10Q^ 




A 


11 186 


135 


3 


IFPKVFGGPFFFFFFF * ETESRSVAQVG 
VQWCNLGSLQAPPPGFT 


11094 


24995 


A 


11187 


305 


14 


IMLITSIRRSRbSSTAEKGILSRLRICG 
RVAKYIilNNRIjITTRB* LI KLTSKQMIT 
H*NTKGRT* SLILVSLSWYGYHLF *G* 
FD*SLLIiY 


11095 


24996 


A 


11188 


198 


1 


LEQTFNMMFPPNKS I IS I PS FGFLFKIS 
FPFLLFFF*DEVFLLLSRLECNGTIIAH 
CNLCLPGSSD 


11096 


24997 


A 


11189 


352 


3 


QLCNTPLPKTNHAMLTSSNANTLIYRR5 
AIAAASSPPILFPASKILLNSRLITPHQ 
*LIKLASKQM1AIPNTKGRT*SLILVS3j 
IICFEVSLISHCWVVISRLIiMRYLKVGI 
NRGG 


l i\fy / 


24998 

ZH770 


A ' 


11190 


270 


137 


CIINRLYVCVY IYI YIYIYI * IYFYMYV 
YNTLCILGHRRESGVI 


1 10Q8 


94QQQ 


A 


11 191 

I 1 171 


18 


342 


ACRKRKMSSKFALRATGNCYYSCTFCSI 

KITV* K* SKTKKNT KGLGAR * ELSLA* L 
LGEPPIFAKGHPLMLKWVEICKRP 














11099 


25000 


A 


11192 


169 


3 


ERSGIIRVYQD*NQCLAWSLA* KIIHPW 
IAAVAHTCNPSPSGGRHGVaT*GQEFK 


11 100 


25001 


A 


11193 


407 


1 


KRACFRWLTLLISTLWEAKQENCLNAGV 
HDQPGQHGETPSPQKIKKSAWHGGPRIi* 
EVRWED* LSQGG*GYSEPCSHHCTPAWA 
TERDPV* KQKRSIPWKELASERKDSGRC 
RWLVREMWPSCQASVIQVRRISF 


1 1 lul 




A 


11194 


110 


412 


VOGFRLLHSPAI LMPLLMPS YFRRHSNT 
EIKPISNPTVASKRASERKSCTSLTLNQ 
KLEMI KLSEEAT* KAEVGQKLGLLHQKV 
SQWNAKEEFIiKEIESA 


11102 


25003 


A 


11195 


34 


155 


AWHEGMHHVSQDGXjNIiLTIi* dacuglpk 
SWDYRRKPPCPA 


1 1 101 


95004 


A 


11196 


289 


453 


NQPGKSFMISSTFHFMMEFTLVHAGV*W 
LDLGLLQPPPPRFKGFSCLHLPSGWDR 


1 1 104 




A 


11 197 


177 


432 


KWEKQAQTLGGGNFPKGPKTRVGKNFEK 
LKGNPPFLCL* KRGRHGDTGPWKKPAPS 
GERGIPGKKGGKTGPPGFKGPMEKIKRG 
K 


11105 


25006 


A 


11198 


139 


3 


AMVSLSPRLECSSMIMAHC5LGSLGSSD 
PPTSAP*VARITGMHHP 


11106 


25007 


A 


11199 


150 


492 


GDPLPGHHPV* EVRRPFARQLPRLRSEE 
PLRPVATPSGK*GASLSGSHPIQEGGGG 
GQPIiPGHLPHPGWRSGVKPP PGQSARPG 
GEGHLLPATPTGK*GGPQSGQPHLSGGR 
RE 


11107 


25008 


A 


11200 


239 


64 


YQKYYYYYFF*ETESRSVARVKSSGLIS 
AHCNLHL PGSRLQliCIiFPTSAVFGQESH 
IT 


11108 


25009 


A 


11201 


31 


396 


ILTMREIVHIQAGQCGNQIGAKFWEVIT 
DEHGMDPTGTYHGDSDDQLDRISVYYNE 
ATGGKYVPRAILVDL* PGTMDS VRSGP F 
GQIFRPDNFVFGQSGAGNNWAXCHYTKG 
AEIiGDSVIiDV 


11109 


25010 


A 


11202 


85 


407 


VLLPHSEALEGADTMPHSYPALSAEQKK 
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ELSDIALQI VS PGKGILAADECVGSMAK 
RLNQIGVENTQENRRLYRQVLFNADDRA 
KKRIGGVIFFHETL*QKDDDGVT 


11110 


25011 


A 


11203 


302 


115 


THLDHVAIYLS I YLSIYLSIYHLS I YHL 
SIYLCRF*LID*DRVSLYFSGWIiQMPGL 
KPSSCF 


11111 


25012 


A 


11204 


130 


348 


GNAYGGPCSVLVQRT** IPALTFPSTCL 
DSKFQRDLVTLRTDFSNFTSNTVAE IQA 
LTSQGESGLDGALGWGQ 


11112 


25013 


A 


11205 


232 


62 


ISARNEGFNTLN*RTFFFFFFSSHCLSR 
LECGGMILAHCNLQLPGBSLEPGRQSLQ 


11113 


25014 


A 


11206 


161 


2 


KKKGFLLPLIEPFCIQEIQLRLGAVAHT 
YNPSTLGGRGGWIT*GQEFKTSLAN 


11114 


25015 


A 


11207 


251 


3 


PNHLPKAPSPNTTIiRIRFQHLNFRGAQQ 
TFGIiQHKLTSKQMITIHNTKGRT*SLIL 
VSLI I F I ATTNLLGIiLPHS FTPTTQL 


11115 


25016 


A 


11208 


145 


287 


GFRLKL* INQKYS FLKKKKKKKKKKKKK 














KKKKK 


11116 


25017 


A 


11209 


436 


110 


ISGRGEQKFKYFS PAPFFRG INPPPLGG 
E*NQNLKPP * KKI FLGS PKVPKKKGGPQ 
KILKKGPGGGGPPLFPPPLGGQGGGFPF 
SQGFKTPPGKKKKPPPKKKKKKKK 


11117 


25018 


A 


11210 


217 


2 


DKWSVLHINYNILS SSLVHVLFKYLEN+ 
K*KKNLWPGMVAHACNTNTLGGQGGRIV 
*AQEFETSMGNTVKC 


11118 


25019 


A 


11211 


146 


3 


QKVSHAQYRCLSWGYFFFI LFLETRSRS 
IAQAGVQWRDLGSIj*PPPA 


11119 


25020 


A 


11212 


187 


384 


GL I CFYLTCF Y F I F YF I LF Y FS LLCYVM 
LCYVLCYVMLCYVMVCYVMLCYVI * DRV 
SFCHPDWSAA 


11120 


25021 


A 


11213 


373 


482 


MIKLRHRLGTVAHTCNPSTLGGRGQWIA 
*GQBYKTS 


11121 


25022 


A 


11214 


64 


362 


ITGVSHFTQPLPFIGGLALSPKLECRGM 
I IAYCNFELPGSGDSGHLSLPSSWNYRY 
MPCMPS *LKKNF* * RQGLTVLSRLEKDY 
LL*DFSALRSFPWGR 


11122 


25023 


A 


11215 


181 


368 


CDSAVPLLDIYPTEFKADLKKIRIPMFI 
IALFAVAKR*KQPTYPSIDKWIKKLWAG 
CGGSRL 


11123 


25024 


A 


11216 


268 


369 


TTRPDMVVHTYNSSTSGGQGGTIT*AQQ 
FRTSLT 


11124 


25025 


A 


11217 


90 


3 


IGQAQ*IiTPIIPAI J WEPKSGGSPEVRSL 


11125 


25026 


A 


11219 


277 


272 


S LTAPGL * SQLIiRRLRWEDHLS LGQS S F 
SEP*SCPCTPAWVTQ*DSVSKKKEKKKE 
KKRKEKEKVNQTPKATESPGQALIGSHS 
LQVQKRVYAHP IL 


11126 


25027 


A 


11220 


19 


343 


IiELEVEPEDVSELLQSLDKNYLSCFLKR 
FLEMKYTPGEDVVNTIKMTTEDLKYNTD 
LVDKAAAGLERINFNFEGSSSVGKMLSN 
S XAWYSEI FYERQSQFMWQT* LLA 


11127 


25028 


A 


11221 


266 


356 


FFEMESHTVA*AGVQWCDLGLLQSLPPG 
FK 


11128 


25029 


A 


11222 


105 


2 


RFVCSTI KVLRDLS SARSNPGRF I * TSN 
SRPRKS 


11129 


25030 


A 


11223 


145 


350 


RKGRQLLD * DLGWV* WI/TP 1 1 P ILWEAB 
LGGSLEARSSRPAW 


11130 


25031 


A 


11225 


155 


348 


PLFFFCFHRNTFY* KSVCRFPKKKKKKK 
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KKKKKKXKKKKKKKMEGGGGGANTKFLY 
GENRKPPLFFFF 


11131 


25032 


A 


11226 


271 


3 


N FI*S VFN FTR I CLGM CFLS I FLL I FGG S 
LKYQIiFLAQGFFFFIjHYFFTSSIHSSLL 
Y*MIiDFLDLFFLYLFFCNFNFSVFLFCI 
LSKFF 


11132 


25033 


A 


11227 


3 


413 


spaephrsytmwnvkvgvngfgrierlv 
traafnsgkvdivaindpfidlnymvym 
fqyds thgkfhgtvkaenaklvingnp i 

TIFHERDSCKIKRGDAGAEYW*STGVF 
TTMDNAGAHLHG * ATRGI I SAP SAD 


11133 


25034 


A 


11228 


48 


413 


VFVLVSCGFFFFFFWEKKICFFWPGGKK 
RGKIGGTGTFGPKGKGNFRGPPPKNKEK 
GGPPPPPNNFGFLKKKKFYQGGQGG*KT 
RNFGKPPAPNPPKGGDLGGGPPTQGQNG 
FLKKKKKQKK 








1 1 Az.y 






EFENSCSPGWQRRGVMIPGVTVEDMNQQ 
EFIRALVALLKKSRXLKVPKVfTDTTKLA 
KRKELTL*DEKWFYT*AASTT 


11135 


25036 


A 


11230 


247 


2 


QPKKKS VSKKKKKELPCDPAI PLPGI YP 
KEKKSVYKKDNLHLRVYCRAVFTIAKIW 
NQPKC*SGDK* IKKI * YIYTVDLRR 


11136 


25037 


A 


11231 


I 


372 


TETTVEYAWCELQDRKLTKSERQRFKEB, 
AEMLKGLQHPNIVRFYDSWESTVKGKKC* 
rVLVTELMTSGTLKT* VHQYYKS * PRSM 
REFDFLNFRSTS FTLWQPKKGTVLPTV 
TVLAYFLAHKVL 


11137 


25038 


A 


11232 


311 


464 


KKATPRNLCKWPGTVAHACNPSTLEGRD 
GWIT*GQEFETSLTNMVPP 


11138 


25039 


A 


11233 


64 


418 


VPVASRRRRRCGRVGGGKAMADLDKLNI 
DSIIQRLLEA*GAKPGKKVQLHENEITG 
LCLEPREIFLRQPILL*LVAPLKICGDI 
HGPYYDLLRLFEYRGLPPKSNYLFLEDY 
VDRRQH 


11139 


25040 


A 


11234 


233 


421 


PTTSNMRSRliLRGASS * KKTAGPQQRNL 
EPALPRRWGGRSADNPPSGSLRKSGKNM 
QKTPGTA 


11140 


25041 


A 


11235 


343 


1 


NNTHGLSHSS VGRNPGAVGSGS CMSGIK 
LSEGLRSLLEAVCISRAAVIKCHSWVAS 
NNRHFFSPSSGG*KSEAKVWAGPCSL*R 
RVPSLPLPASGGSRHCLVCGCVTPFSAF 
TW 


11141 


25042 


A 


11236 


122 


419 


RTPRGPKPNPGLQRKTFFLIGGRIKTYL 
GIFGKDLYFWGGKNWDTLLPKI *RSREN 
KNF*REKWVNLPPYPGCLKNGARQQHFA 
HLLILRDTKTYMLAF 


11142 


25043 


A 


11237 


1 


379 


AFNHLHAGHGLSGAAMKSLVLLLCLAQL 
WGCHSSPHGPWLIYRQPNCDDP*TEEAS 
LEAIDYINQNLPWGYKHTLNHIDEVKVW 
PQ*PSGELFDIEIDTLETTCHVLDPTPV 
A* CNVRQIiNEHAVQ 


11143 


25044 


A 


11238 


1 


397 


QTMTPNRGPLCPSKDLRSSHVISLPLHS 
ATHTRPTNQHTNHI PMMARRSTRKHIRR 
APHTTCPKRPSIRDNPIYYLRNFFLRRI 
FLSLLPLQPS PYPP IRRALAPNRHHPAK 
SPRRPTP*PLTGALSALLKTSGIiAM*FH 
FHSIALLILGLLTNTLTIYQ+WRDVALE 
STYEGHHTPPVQKGLRYGIILFITSEIF 
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OCysteine, DpAspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine } H=Histidine, l=Iso leucine, 
K=Lysine, Lr=Leucine,M=>Methionine, 
f^Asparagine, P^Proline, 
V=ijiutaraine, k— Argmine, o— oenne, 
T=Threonine, V=Valine, 
w— lrypropnan, i— lyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 














e r/ttjr r *Ar iHooljAPTPQljGGHWPPTG 
ITPLNPLEDPLLNTSVLLASGVSIT 


11144 


25045 


A 


11239 


267 


3 


FVSVRITPIYASAATWLSYVWVWASSPW 
VCPPGDRVALSPRVECSGT ISAVRNLRL 
PGSSDYPASA*PADGWAPPQYRCPPPRIi 
Ariv 


11145 


25046 


A 


11240 


391 


31 


VFFFFFFFFFFFFFFFFFFFFFFFPPPF 
UtSiPr JiPPPWrl 1 FFCPKF*IIjFIjN*I * F 
*IKF*I*KIPGFFt 1 NPPPFFPPFFFFPP 
GALFFFFFF*EGVLPCSPGWSAMSHLLS 
riiN-KlvJLj i rKWb x vi 


11146 


25047 


A 


11241 


371 


491 


KIFLCWARHDAHACNPSIFRGQGGHIT* 
Viy tSr i\ 1 oiiANM V 


11147 


25048 


A 


11242 


1 


221 


HRIDPLCRNSARAFFFLSTGCHRVSQDG 
LDLLTS * S AHLGLP KCWDYRCKP PRPAL 
NCFFRTI SQLLNLC I KD 


t 1 1/10 


25049 


A 


11243 


382 


333 


TYKI KDFKTF * R W FH PNI S RVEAEKLFJU 
SRGQRGDFbARPSESSPGGFTLSVR 


11149 


25050 


A 


11244 


169 


477 


TQKDEKESTTTTKNAENSKGQSAYSSPS 
DCNTSLARAQN* EEAEMDELTEVGFRRR 
V 1 MN r V KXjKGH 1 LTQCRKAKNHDK.TJbQ K 
LLTSITSLKWNINDLMELK 


11150 


25051 


A 


11245 


200 


462 


SLI YNELCTIHILYQYYVA* QRS * CFQQ 
LMSFY * RTLKNLKESLiCKVALLYPSFLK 
WKRGWLGTVADT*KPSTLGGQGGRIA*A 
QEF 


11151 


25052 


A 


11246 


484 


301 


I FCEKKGL * FQLLRRVRWEG P FNPGNQG 
CS * JjLSSPCPPGWITE*DPVSQKKKERK 
RNQVMELVSAQBNKNKPRSVGLQACARS 
MLiAr o x i PK 


11152 


25053 


A 


11247 


226 


458 


NTVAVD WGPRC F FS LIjLSHAENCDMWFT 
KIELFI FTNHLLLKQDLEMLW * GQAHAC 


11153 


25054 


A 


11248 


304 


436 


SPSRCRK*KIG*AQWLTPVISTLWEAKV 
GRSLDPRNSRSAWTMY 




25055 


A 


11249 


216 


451 


THKYIjFiYIiGYNPMLY I ERETGSCSIA* 
AGVQWYSHGLLHP * PSGLKRSSQVAGVI 
CLANFFFFFIiKKWSLWLPRLG 


111 «4T 

1 1 10 0 




7\ 

A 


1 1 OCA 

11250 


260 


474 


UKir liii i PIjKAIjNVVXjGAHXNX * KVSKI 
MC*K*DKNMYNFFFSF*DGVSLLLPRLB 


11156 


25057 


A 


11252 


193 


496 


MTS C YS FYL * I KDASMVKKKKKKKKKKK 
KKKKKKKKKKKRKKKKKASSSQKAQDYS 
KASDYK 


11157 


25058 


A 


11253 


348 


478 


DLGWLKLGTVAHTYNSSTSGGQGRRIT* 


11158 


25059 


A 


11254 


174 


475 


MHHWVLILQDIDDNT*NCNLHFSLCSQIj 


11159 


25060 


A 


11255 


292 


48 


LPLTCHRPAVSHTHLAFSNPTAGITIF* 
TKSSSWNAGTr IIFKIVTFFIjFFFF*ER 
VSLKLTRLERSGAI PAHCNLHLPEF 


11160 


25061 


A 


11256 


347 


16 


TQSADWCIYNPPARHKSSPSPHPTQ*PS 
WLHIjVDP APGPWVELPTS PAPS AfTPnP 
LGSRWDQPPQSKGQCPSGRLGPQGSPPQ 
VGLGKAGV7RSQALPVGLTRS IESHHW 


11161 


25062 


A 


11257 


238 


590 


APGAYI FPQQHNPTDSQVIFSLTASHLL 
I CSPLC I S VLHFNGMNIDDENRNAHVHA 
RACAHAHTHTHTHTHTHTLYLMNYRETL 
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A— un Known, — oiop couon, /— possioie 
nucleotide deletion, V=possible 
nucleotide insertion 














MLLKTRGRATGSHARCIFS * SLLQACP 


11162 


25063 


A 


11258 


310 


461 


LS FGDVLQJjESDRPUAV AK I UUilris 1 LAjKj 
*GGWIT*GREFETSLANMLKPR 


11163 


25064 


A 


11259 


461 


174 


MPPUDvcnnrT m t f r*QcauT/2T.DT«rr T wn 

MCjtHK vbyLKji-iUi-iijX w a&AU±teiJVJS.\-rlU 

Y * CE PLRLAPVGKLFKCVGYKLVLCHGS 
GYMSMVIKIRMPGLIWSLEGGSAAWHR 


11164 


25065 


A 


11260 


343 


472 


IiLRRLRQKDCLSSGGQGCSEPRIiDYCTP 
AWVTK*DPVSHKQE 


11165 


25066 


A 


11261 


229 


3 


VKMKJUQ * V P V Jw 3S.L * K. x I f JjJjOx xJUtt-ttL) 
BCAGFGRNTCTLMFIAALFTIANRWMQPR 


11166 


25067 


A 


11263 


163 


399 


KLLPGTRNLHAMEHPIFGWLRNPHATAQ 
G*HPLSQSSIALHGRADHICYPELCTSS 

do( L XAJa X rW BAsVUf Hoy CUUC V tl 


11167 


25068 


A 


11264 


316 


427 


TNKI0III*IIN1TTHSLIISIIPLLFFNQ 
INNNLFSCS 


11168 


25069 


A 


11265 


184 


418 


KTl if FVii?" FKKiM Jf J?"W tatrJMU^f f f f JVIM Jjf 

TjGGKKKFIjRGKGGKTFFFFFF*KIGSSS 

£* n TM5 Ti nr'T'T? fllV 15 TJ Q O D If O "O TO "D D 

bPP ir PPImJS yAPr err* ir iS*> lrlrKK. 


11169 


25070 


A 


11266 


219 


13 


KFFFSLEHQNFGGGGVPLFPPPKEGFLP 

trvmrru^CTTDDT V/MTTTTDD'DDnfTVQflPP 
tJSJrpJj^aC ± IJrJrJuAyftJJUr c f trwjX&vjirf 

RVL*KGPPHIFYK 


11170 


25071 


A 


11267 


3 


173 


SHHARPETGFHRYSQDGLDIiLTS *S ACIi 
\ju c r^XJti u x KiuS jf jf KyMxviuj i ^juupik ihc 


11171 


25072 


A 


11268 


157 


2 


PVD I THVKTTVGLGMVAKACNP 5TLGGR 
GGRIT*ARGSRFLLPPHWTSELR 


11172 


25073 


A 


11269 


291 


468 


ISEGKRSLFVILHFILFFETGSCSVAQA 
GVQWCDLDSL* PHI PGPK* SSHLGLHTG 
ACH 


11173 


25074 


A 


11270 


29 


478 


GWNPYNQQD IGTLYL* SGARAG V1/JT/Uj 
CLLIPJVELGQPGDIil/^HIYNVTVTAH 
AFAI IFFIVTPII IGGSGN*LVPLIIGA 
PDMAVPR1WISF*U^SLLLLLASAI 
AEAGA*TG*TVYPPLAGNYSHPGACVDL 
Tl F S JLiHIiAU V 


11174 


25075 


A 


11271 


61 


474 


PYNQGGNDAHAAWPHLTKWRSQGTTEIC 

ni?nt7nivmnDt)CnnDT llXPr^O OOTSnr DAT C 
REFriVxAJYUruroU FXiinJ\^Kolr\^IiKiUjo 

QNLPSYSGGSHLLSAYYVPDTISKHFIN 

TTnWT/lTVftTTAOWP QTTi *GETJR WTY * AO 

EFKTSLANVL 


11175 


25076 


A 


11272 


277 


478 


t\ L iy.tr V ivinvj ir XvlrvxrVVrtjri ± \*aic<o XJJVJL/xv 

DGWIT*GQEFETSLANMNSPSIj 


11176 


25077 


A 


11273 


350 


1 1 


TTFKTAfinRR WT.TPVT PALWEAEAGASPE 
VRSSRPA*PPP 


11177 


25078 


A 


11274 


138 


3 


NYINLYAWLGAVAHACNPSTLGGRGGWI 
A*AQEFKTSLGNMAKP 


11178 


25079 


A 


11275 


359 


219 


J? vliwuf ryVuKlAjrWrJjnO oArlAxUlr 
KWWD YRH * PPRLAL FFFF 


11179 


25080 


A 


11276 


262 


3 


" FCMSLVHLFSHSQDW YFTSFSL* IFNIP 
SGPFLFAFQDTLQ*P*DFIYFYFETESP 
SVTRIiECSGAXSAUCJJMLiKJjJjvb 
AS 


11180 


25081 


A 


11277 


285 


1 


VKKGDSVSKIFRKKKKRKDKIRLHFMIT 
TVNNtjGIEGNLLNMIKEIYKNPTANIIL 
NGERLKAFSL*SGTRMSFLTPTFF*ILV 
DTQQVYIFMSH 


11181 


25082 


A 


11278 


160 


3 


TDKEVPLINAQRKWFL.EMES IPGEDAMN 
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uiuutmiDCf iv— /\rginine t o^oennej 
T=Threomne_ V=Vfllinf» 

* * hi vvuiiiV) v Tall 
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X. = Unknown *ssS^An rn/fnn /aaywcciKl^ 

nucleotide deletion, V=possible 
nucleotide insertion 














IV*MTTKELiQYYINLVEKVLARPE 


11182 


25083 


A 


11279 


334 


488 


PLNTCGY*NLN* *nfqpgsvapacnpst 

LGGQDRWIT*GQEFETSLANMVK 


11183 


25084 


A 


11280 


527 




PFVFLVETAPHHVJVOn^T*DI.T.T<?* QQUT. 
NLP KCWD YRHE PQRQ AKNVGE FSETRS I 

o 


11184 


25085 


A 


11281 


486 


248 


VRQLFSLLIiPRLECNGVISAHCNLRLPG 
S CDS SASAS * VARTTGAS GSO AVVT jCWCI 
CLQPVQPGKLLRVDLFQLWLQR 


11185 


25086 


A 


11282 


38 


157 


STGLHHVSQDGVDLLTS * STLPKCWD YR 
HDPLRPASQRL 


11186 


25087 


A 


11283 


474 


339 


FVFLVQMGFHHVGQAGLELLTS* SACLG 
LPKCWD*RHEPTDPAK 


11187 


25088 


A 


1 izot 


£. 




T . t? HTTP TAD WM CUT AT T3 I_JTJ>T CPnu V~KM "X KTT 

IDKCVCSDYSKDWPFPIFLPLLRPPYCL 
RHNNMETRSINNPPWPSKCSSEKTNEEG 

KEIKSATPMNTSMIRKQNRLIHDMEKVXi 
VIRIEYQASHNIL 


11188 


25089 


A 


1 1 Z.O J 


□J 




oiuivamiiiil 1 ox\V-J^JjirlvL.ViL/lKt-JSr' 

PHPASP*FSKLLSSNLKTYIjSSL*DSHS 


11189 


25090 


A 


11286 


244 


488 


MRLGRVPSVLFCYVRMQREGI ILEAENK 
PSPDIESVGALILDFTASRTIRNKILLF 
IIFPG*GILDSPNVP 


11190 


25091 


A 


1 1987 


IAS 




WTILA 


11191 


25092 


A 


11288 




JVIU 


WGQSHPXGVQSQLCLCVLWDLTCSLSCS 
FQI IKLR*RPGVMVHTCNPSTLGGCGGR 


11192 


25093 


A 


11289 


157 


22 


LSCQEVKGEIGEAEICWAQWLPPI I PAL 
WGAKVG*SPEVRSSKPA 


11193 


25094 


A 


11290 


305 


497 


KLVNCI YFKGVS FMVREL YCNKI V * WLG 
TVAHIYNPSALGGACGWIT*GQEFETSL 
TNMAKPRL 


11194 


25095 


A 


11291 


478 


1145 


QHVQACPERPQMMGTLERSRAVASKIGH 
SYSLDSQPARAVGKPWPQQACTRVTELT 
R ATG1CT »T RT^WTRTrDHDOQ PADaa a twit 

LRPAS QQGVQMKTQGGASNPALQIGSHP 
MCKS SOFKSDOSNPSTVKHSOP KP PHS V 
PSQPKSSQTKSCQSQPSQTKPSPCKSTQ 
PKPSQPWPPQSKPSQPRPPQPKSSSTNP 
SQAKAHHSKAGQKRGGKH*RANSRDL 


11195 


25096 


A 


11292 


352 


468 


TFPGQHGKIPSI*IiKIQKLAGHGDACL*S 
RLSPR 


11196 


25097 


A 


11293 


295 




EN I OGS KKNEPOHMITOnVGTWNQMPG PV 
STWVYRNCLACLDISFSFIKWVKSHKDT 
SMGMFTAALFTIAKT *NQSTCPSMVDWN 
STTRTS 


11197 


25098 


A 


11294 


274 


429 


ECE PVOTLWKIRKFLKKLKIELPSDPAI 
PLKG INLKEVKS VP * SDTYTPMCI 


11198 


25099 


A 


11295 


168 


469 


PSI^NKSETPSPKKKKNLFLGFPYFREK 
LVGILLKTVLQKGAGGVFFGPKNPPEKN 
PDPPLDSPPKTFRRLGAFVKTFPKGLKA 
AWFPALV* PQGQMGG 


11199 


25100 


A 


11296 


37 


348 


IKIFNKTLKYFKKKKTIKKKKKKGGGPL 
KRPRGGPRIYPDGARKNLPHIGTAKKHW 
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fY=f2 In famine P= A raining S=Serine. 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
a.— un Known, oiujj tuuwu, / h uw u v 
nucleotide deletion, \=possib!e 
nucleotide insertion 














GG YWEKNPHGGGGNTGPTPPQKTRPLGG 

tTNTMT . TTD T TfWC? ITDP PPT fZCZCZVTB gf)T iF.Q f! 

HA* P * VQ6EKPRPRLGGGK 


11200 


25101 


A 


11297 


93 


388 


DEYASDNCCYFPGSSHDSQVKQ IGSQFT 
TOTnaXJT.KrT ATfPPP AT,*NH£ * PMEMRSK 
CLLLAYIOCKKKKKKKKKKKKKKK^^ 
PKYTjSSRGKDHF 


11201 


25102 


A 


11298 


61 


243 


YYIHRTTVFMCMN*GLKDNVDK*TIDLC 
TtprrKTmnoiK K K KXJU^iOCKJCKKKKKKKK 
KKKKKKKKPQGGGA 


11202 


25103 


A 


11299 


250 


341 


IYMSLTLGLQNL* E IHLTHKDSHKLQVIC 
GWKNIFHANVNQKQTGVAILISDNIDIK 

MKHLLLNLKNBRIX5TTVIX3 


11203 


25104 


A 


11300 


3 


285 


DAWERKKRSQMNQKTTNKVAGVS P YLS I 
NniWNVTTRTtKPT.TICRHRVA* CI KKTKSMI 
C YLQETNFIKDTHRLKIRGWKKI FHPNR 
N*KLAGWY 


11204 


25105 


A 


11301 


175 


407 


a KTMMP nVT .OT ,KA * T 5? RGTFM P * ILLRVK. 
RGSNSRNNRCWQGWRGTGMLLHC * QECK 
LVQ PIiWKTVWRI LPDLESEIL 


11205 


25106 


A 


11302 


75 


280 


YHLNLVLYPHP PKNRVC* KKKKKKKKKK 
FFPGGKKKNFFF 


11206 


25107 


A 


11303 


299 


3 


IHASFLFRGCITLAFTVRCSHRSITSSE 
T<;nT .TtTYYYYYYYYF * IiASOSYAIAGVG 
WDQLGSNHPPTSAS * VAETAGASHHARL 
I FNFFVEMEVSLYC 


11207 


25108 


A 


11304 


137 


323 


FFTVNGRHRKEKKSJHCDQIiNI^ 


11208 


25109 


A 


11305 


64 


481 


RPTRPHVYHIVKP IP* PLTGAbSALLMT 
CGLSM*FHFHS ITLLIIiGLLTNTLTIYQ 
* WRnRTPJESTYOGHHTPPVOKGLRYGI I 

lfitsedlffagff*afyhssltptpql 
gghwpptgitpij^levpllntsvlla 


11209 


25H0 


A 


11306 


142 


330 


FFTVNGRHRKEKKSNKDQJjISOj^^ 

NPLLLF*KYIjGREKKKKKXKK^ 

KKKKKKN 


11210 


25lll 


A 


11307 


259 


440 


RPWIRDNPIYYLKSCFLRTIFLSLLPLQ 
PP PYPPTPJRAtiS PNRHHPAKS PRS P * PL 
TGALSALLMTSDLTMGLHFHYITLLILG 
LXjlT^n , LTIYQ*WRDATRESTYQGHHTPP 
VHKGLGYGIILFITSKWFFARFF*AFY 
HSSLDPTPQLGGHWPPTGITPI^LEVP 
LLNTVRL 


11211 


25H2 


A 


11308 


433 


3 


V PPPn T P W«>R QT/3F PPS FGRAGFLNFAF 
SKGQEPALPGGFYYPSPKKKNLAFCSPP 
LFYWGKFGSKFS1VGVGPLFFP* *SPKSF 
CQFLBNKGFFPEKSFFKFFPPAGGCVFF 
FSEGGFPFCLFLFFFDRVWLCHPGWISV 


1 1111 

1 1212 




A 


11109 


2 


423 


GRVGDR* LFYTNHMDIGTLYLLFGA*SG 
VXVCTALSLLIRAELGQPGNIJX3roHI 
GIVTDHAFDIIFFIAI P 1 1 IGGFGN*LV 
PLIIGAPDMAVARI1WISV*LLPPSLI J L 
LLAYAI VEAGAGTG * TVYPALAGNYSHP 


H213 


25114 


A 


11310 


234 


1 


TFFGKFPFCLiKFKPAQRGGENTPGPFPF 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














LVFFFFEMESRSVAQTGVQWRDIiLGSSN 
SPTSAS * VAGTTGACHHARL I F 


11214 


25115 


A 


11311 


406 


211 


LSLFLHSLISTHGARPTDPVGLVPTQDG 
LDLLTS *SAHIGLPKCWDYRRAPPCPAC 
GFDLHLPND 


11215 


25116 


A 


11312 


379 


398 


KT*NFTVRGSIPLIiNNIPMASMTFSKKK 
KKKKKKKRGGPLKKNPGGAKFNGGRKRK 
IFSLKGGEKKNPGGILEKKPFFGGGKKG 
ENPPKKNKGLREKKKF*RGKGGKKPQKG 
GGKKMGSSSAKTPTWG 


11216 


25117 


A 


11313 


390 


489 


RYKLGWAHACNPSTLGSRGRWI I*GQE 
FETSV 


11217 


25118 


A 


11314 


286 


434 


ILWYMNYI S I KLLK 1 1 1 YKLGAVAH , ICN 
PSTLGGRGGWIP*GWEFETSL 


11218 


25119 


A 


11315 


322 


3 


FTFYGCRQFQVF* I VTKNDRI I FSSYTK 
NLLKIYLGIiY*FYYVPIJjRTKSLKKTGS 
FESNLSFK*VFLWSGTVAHTCNPSTLGG 
RRGWIA*GQELBTSLTNMERPR 


11219 


25120 


A 


11316 


251 


389 


TFFFFEREFHFVPQVGVQGGSLGSLQAP 
PPGSHSLA*ASPRCWDGG 


11220 


25121 


A 


11317 


313 


80 


EYICPRCLSPGFRD * PGQHGETPSLQKL 
QKSAGCGGAHLRSQLLERLGDRVRHCLK 
KRIYLSLFNSLVLSKNRFQII 


11221 


25122 


A 


11318 


145 


410 


APKTLKVALTSAAAARESICQIiGVLDEL 
VTFLELNRSTDQRPGAVAHTCNPRTLGG 
HGGW I T* AQEFKTNLANMVKTC 


11222 


25123 


A 


11319 


393 


3 


NFFKI I FTLSPPLPLIFTPPFFFFFKKI 
PPKPLFFGVFHSQTCPQISFFKNFYPFL 
FWFSNPFFLLFSPSRKGPPPFSRFFFFK 
VGNPLFFLFFFFFFFFFG*EGVSLLLPR 
LECNGTISAHYYLRPRV 


11223 


25124 


A 


11320 


135 


1 


HRDVKLEYAIRKMEVRPGVAAHACNPST 
LGSRGGWIT*GQEFETS 


11224 


25125 


A 


11321 


282 


2 


NSESVLIKEKGDEEEKVLSFLLFSKRKV 
LESKTFPTLHSGCKKRSYFLKITTSSFL 
* PRVBCSGVT IDHCKLKLLGSGDP PTSA 
FPVARTTGM 


11225 


25126 


A 


11322 


149 


30 


WGSGGQVRWLTP 1 1 PALWKAEAGGS PE V 
RSS*PVWPCLY 


11226 


25127 


A 


11323 i 


316 


399 


STLGGRGAQIT*GREFETSLANMPNPIS 


11227 


25128 


A 


11324 


223 


397 


1 1 KAYQ * DVNREQ I KT 1 1 1 YFQNKLKC I 
KKMIWPGAWHVYNPSTLGG*GGWIT*G 
QE 


11228 


.25129 


A 


11325 


3 


396 


NYSTAALFLRGHIWGWRWGTLYYGSSPI 
YCRGLNSTLGLYPLNASSNPHPRKLCKP 
KIYTHIAKCPIGHKIIYPYANTPIH*HH 
THTHTHTHTL*DSLLWPGVVAHTCNHNT 
LGSWGRQIT*VQEFKTSIA 


11229 


25130 


A 


11326 


113 


442 


KRYIQIRAHSEVLCGHELWGTLHNPVQT 
IQTWEPFPYLPLCSRVTSGKSHHRFVPL 
LFHL*NRNNSHHHHHHHHHHHHHRNPC* 
ECCRAHSDSYAVWSDKVSQEPVAAM 




25131 




11327 


145 


3 


E JjDx KP PNP FFCHTGVQ * CDLGS PQLLP 
PGFK* FCLSFPSSWDYRCA 


11231 


25132 


A 


11328 


189 


I 


KTNUjLIHS klrlhlqs svfqc * s ksek 

SNVCT * CAFSCLP KLFWLGKVAHACNPS 
TLGGRGG 


11232 


25133 


A 


11329 


193 


405 


ID*QTLTKKKKKKKKKKKKKKKKKKKKK 



1230 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ED 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: fn 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 
peptide 

LJ UCIILC 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=lsoleucine, 
K=Lysme, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=*Valine, 
W=Tryptophan, Y=Tyrostne, 
X=Un known, *=Stop codon, /possible 
nucleotide deletion, V=possible 
nucleotide insertion 














AYTSKKKT 


11233 


25134 


A 


11330 


209 


11 


GGKRGTPPTINNIFFNFFF*EKGSCCSS 
RAC* CNGEI * AHCPLRLPGSSNSPAAAS 
RVQETATTPS 


11234 


25135 


A 


11331 


38 


390 


VDQSTVCKMGQSAGCGWGSDKGIKAGCL 

LKKKLFLGPGGEKFFFFFWGPFFFFGGR 
FLKRGGGKKPGYKKNKISGHPPPFLGGG 
KKKKR 














11235 


25136 


A 


11332 


38 


326 


VDQSTVCKMGQSAGCGWGSDKGIKAGCIi 

FIKKIFFGPGGEKIFFFFWGGFFFFGGG 
FKKRGGEEKNLF 














11236 


25137 


A 


11333 


1 


234 


RTRGKNWMAFIKEGWVPTGRMGGSFIKH 
SFLESKNTWLGKKXKKKKKKKKKKKKKK 
PHDKI KDLFDPV*NKKQNI S I P 


11237 


25138 


A 


11334 


244 


398 


INHLYSNKNDLI FLLVFYIiFFFP*RRPG 


11238 


25139 


A 


11335 


174 


1 


HQ VYP YVKQEHYKQDSCQCY I KYMNVWP 
DAVAHGCNPGTLGAQGGWI I * GQKFERT 
RG ! 


11239 


25140 


A 


11336 


148 


411 


IiFFFPHCYLIiNA^SILYLNEPKVVYNLH 
LCVSAVCLLVPVHSPHRTCLY* KKKKKK 














KLLF 


11240 


25141 


A 


11337 


237 


453 


GQRPLLS FREKGS FFFLIiETE FCFAPQA 
EGQGHNIiG*LQLPLWG*SNSLV5SSQEV 
GITGAGHHPRLFFVF 


11241 


25142 


A 


11338 


65 


347 


VCRVDDFVPEARTRF FKS I * EAWNNKN I 
KPLLSTFSQVPGSENEKKCTLDQAFIGI 
I^EEIINHSSCENVIiAIISIiAIGGVTEG 
KYGSVLFCLK 


11242 


25143 


A 


11339 


158 


393 


VPCTHSEGIRKGRKCIiPKTRENAREERD 
KKI PPGLC* AGKLFPQEKSSTWPGKWH 
AWNPSTLEGRGGWIT*GLEFKTT 


11243 


25144 


A 


11340 


117 


3 


GIRPGAVAHVCNPSPLGG*GRWISRDWE 
L*TSLANIR 


11244 


25145 


A 


11341 


292 


60 


SGQGLRLRGAFRGKVQAKE WMP I PKLGC 
LVKDMKIKSLEEIYFFSPPIK*FEIIDF 
FLGASFKDEVFFLFVCLFVCF 


11245 


25146 


A 


11342 


149 


30 


TQIFPPPPPPGKKKXFFFPKKKKKKKKK 
ERKKDRKNKQK* KGPGVGVS PLFPPPLG 
GQGGGFPQGLGFKTPriAPKGKPWFP*KT 
QKFPRRGGGPPKIPLFGGGGEKKPFSPG 
GQGSNKPKFFPPPPPRGKKKNFFSPKKK 
KKKKRKXERKTEKTNKK 


11246 


25147 


A 


11343 


120 


504 


GVHDRCLWDI»frRRPPAAAATACPAGRP 
PPRREEEHSQLLLISFQGFRWD*DQDVN 
TPNLDHI1AREGVKAKYI/4PPLVTMTSPS 
HFTAITGKRHSAHFTRCPSNPQRPSFPV 
IRSKSSVSSREVEVA 


11247 


25148 


A 


11344 


213 


399 


ALVIHCLTTTIQVSGLSGSKYLMRQGMV 
a Tjp^TTjp-ij'pT y^r^rrr!XJTT , *cr\T7c'i? r r*CT ivw 

AilijlJxtirW 1 IAjoKLtLtW L L 'byKon 1 o LiAf-i 

TVKPCL 


11248 


25149 


A 


11345 


308 


3 


NPTPRSGDLWVEEGS PGAGNWGCLSEQT 
LRAI I KATTSYS FYCYFFETGSHSVAWA 
RVQWYNPGCSICGSSHLPTSAS*GAGTI 
GMHHHAWLI FLFFVENE 



1231 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/5154 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

fAf til I1A AT 

residue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
^Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoJeurine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
A—un Known, atop codon,/=possiote 
nucleotide deletion, V=possibIe 
nucleotide insertion 


I \4JY7 


ZDiDV 




1 U4o 


1 OA 


All 

433 


UHvJKACii SIjTIiTHIiGGSAMEL I IHLGLC 
LQRMKFIHTTCIIIiFLFFLSATQAGGQG 
GDLS *LQPPPLGFKQFSCFSLPRSWD 


11250 


25151 


A 


11347 


257 


3 


KWGELCFLTIKEERMGFPLAKKRKIKGV 
r * FGIiFFFFETESHPVAQAHCNIjSLLDS 
SYSPASAS * VAGITGACHHNQLIFI FYE 


J iZD 1 




71 
A 


11348 


159 


434 


JUSPS CRD VGTySARAHPPHxHTRVRAHT 
HACTHVRAHTHTHTTAHGSHLTKEQPGL 
CTLPGSSW* PGLGGRAIATAQAWVHCAG 

JLJ. lAAMri' 


11252 


25153 


A 


11349 


240 


406 


AFKKLAKNNSMGVNRSLSRPGVVAHAWNT 
PSTLGGRGRQIT*GQEFBTSIANMVKP 


11253 


25154 


A 


11350 


163 


360 


PQLLQLMKFLQN*LliERIiRPQNGLNLEG 
LACSESRSRPCAPVWRTBRDSVSSPQPR 
PGAGGRPSFF 


11254 


25155 


A 


11351 


145 


406 


PLNSGAKPTLPPWSPRRPELNAPPPPPd " 
EKE F F FFI/3RGFLFAPQB EAGGENI V*G 
KPPPPG*REFFAPPPGGDGNKGAPPPPP 

r dt 
LitXj 


11255 


25156 


A 


11352 


122 


337 


ILWVLLLVRESLKFFFFIiLIiLFFLFFET 
DVA* aAVQ*SDHSSRQP * PPGLRGS YTS 
AFRIAGAIGlCHHLY 


1 IZjO 


ZJ ID/ 


71 


11353 / 


325 


416 


KCVY I MGWA* WLTPVI PALWBAGAGRSL 
EV 


1 1 0^*7 
1 IZJ l 


ZjIDo 


Tl 
A 


1 1 tCA 


260 


415 


LEELQNTACQKLEPFLSRTETKQGCLTjS 
PRLFN I VLEVLA* AVRQEKE I KGI 


11258 


25159 


A 


11355 


160 


2 


ISEISTKRNNYFLKNWLGVVAHACNPST 
* KCj * CTiiy 1 rKGyDFQTSIiANMVKP 


11259 


25160 


A 


11356 


160 


3 


ENLKLYNGSPFLKI IWLGMWHTCNPST 
LGGQGGRI P* AQEFKTSLGNVRRP 


11260 


25161 


A 


11357 


235 


408 


GQPLPAALFFYQWHliHFKEIiLAHPWGKT 
VWRFL* PIiKKEGPHDPAIPIiIiDIYPKNK 
KW 


11261 


25162 


A 


11358 


11 


393 


QTERTPTPKPHLYVSFPPNSQDPQLQGG 
KLTNRKN THT KT PS VCHHHQRP KVDKTT 
KIGKKQSRKTGNSKNQSASPPPKECSSS 
PATEQS WTENDFDELREEGFR * SNYTLL 
KEEVRTHGKEVKNLE 


A 1 iUi 


OS 

ZJ IOj 




I 1 ISO 

I I joy 


zoJ 




AGVQWHDCLSFLC*VLASSPGNSSPVR 


11263 


25164 


A 


11360 


266 


407 


LEIHHSKTL*FWLGIVARACNPSTLGG* 

uKn JL JL w UUxSAlS JL JL llAM X V K. 


11264 


25165 


A 


11361 


93 


1 


DVIiS I LLPRLECNGA I SAH * NFRLPGS S 
NSS 




Zj lOO 


7\ 

A 


1 lioz 


1 Q 1 


inn 
5/1 


MiittPa J. iU^KLi JLiis Kir PALiboXJj xJb KS S * IiK 
RVRARRRLFLKSRFS LDLQVAG FFVCLF 

v r VV1JK v oijL^lirHjW o V V VyoKXi I AALi I 


11266 


25167 


A 


11363 


212 


392 


Q AEE CAP LFIGVKE P P KNLTHWLGTVAH 
TS NPS T liGG *GGQ I TWGQEFETS LANMA 
KPCV 


11267 


25168 


A 


11364 


141 


3 


NELWLNHQNVLEWLGTVAYTCNPSTVGR 
WIT*V*EFETSLGNMVK 


11268 


25169 


A 


1 I JUJ 


Z J J 




ANPN TRP R T S T i A Q VCC* P <? PR If QPTT .T .DT .M 
nl' v tv^ivrctLOiJndAUJrOuIUwtvl j_|j_iir J-lXM 

QKLEM1 * LSEEATSKAKTG* KLGLLHQI 


11269 


25170 


A 


11366 


48 


373 


EPWTRKEVLKISSSLKIHCYLSILCLIL 
MPTLECTAAIMAYSNLEI*PGTSCPPTCA 
S * VAGTTGACHRAPTVLKI FLPJDR I FF F 
FFTNVTYPWCHTPADPAIFKANAR 



1232 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D-Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
OGlycine, H-Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

•^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


11270 


25171 


A 


11367 


310 


2 


GlilCLNRENAYFFVIRQISMSCLEKEKP 

I PFLFF I * IV*AESRSVAQAGGQWLHFS 
SLQTPP*RRTSDSPASAS 


11271 


25172 


A 


11368 


303 


50 


MFALTNRGVQGS P VI FLGVFL I S KMGFT 
VLLCCPGWS PTLGI K* SSCLGLPQCVLT 


11272 


25173 


A 


11369 


109 


419 


^oJM/wiJorir XoJrJrJ-»o v^ujtv- uco ±urn\u 

IGALYLLFGA*AAVLCTAISLFILDWLG 
QPCNIiLRNDHIYYVIVTAHAFARIFFIG 
IPI I IGGFGN*LVPLIIGA 


11273 


25174 


A 


11370 


112 


258 


GbRr KJooNr 1 oflJjriytuN J. c r Plur r J\" li d 
RAGV* QRGEGKNPGVTKLiNRL 


11274 


25175 


A 


11371 


823 


1078 


SQHFGMPRWADHLRSGVQDQPGQHGETL 

CI*SIiGGRGCSEPRSHHCTPAWATQ*DSV 
S 


11275 


25176 


A 


11372 


184 


420 


TKrwirNrr wt ht ^nrPT.unT.T.WTrT'A/wn'PTiTr 
ltS.VylUNijr jjLrtJjVruiJUvvJ-i*J" JN.1 v n^r uja. 

KLNI ELLCDPAILLLGI YPRELKMYVHT 
KTCT*MFIVELFIKSKK*KQPTC 


11276 


25177 


A 


11373 


234 


44 


LAT I TDRLLALS DS AIPPLG I YAKEKKS 
VYQKDNCTYIFVAALFTVAQI *NQPKCP 
STRTRGS 


11277 


25178 


A 


11374 


73 


456 


PPIGSPIiVPYIPSPAI.SSPQAPRMGSHP 
CLIPSASCHPPLETDFIiSIjLPTHFCLAV 
FTKERFS PPPPSPYPRF*KFLRS * KFSF 
FLP P FLFFGGTVLLCLSGW S AVAQS WLT 


11278 


25179 


A 


11375 


43 


456 


E FFHHVGQDGLiDLIjTS * SAHLGLLKCWD 
* IRTKPSQAPASWGLWNLAHHPPRSHPS 
LGLSGSPARAVFKDSSCSLHPLATGI 


11279 


25180 


A 


11376 


288 


484 


EWVLSAVGGSQYGVCLPFLHCFI FFETE 
SRSVAQDGVQ*CDLGSGSSLPPG 


11280 


25181 


A 


11377 


129 


1 


V% f [m M i il\ Jj X rv r\* " t i~\ V ./"ii~Lf~l X il IT O flJjVJVJC* J-vOXv 

IA*GQEFETSLSNTA 


11281 


25182 


A 


11378 


307 


1 


DSEI PSLGLAKPWNHRREPPCIjAIaFLTL 
UOK7.PMT vT.QTTRf^T <5TTAKTP* ITAGFTjCO 
RVSQDVNAKEKFLKE I KRATLVKTQMMR 
HONS F T ADAEKVWMAWMK 


11282 


25183 


A 


11379 


200 


397 


CISLFSHYYKDLPETGSFIKKRGLIGSH 
WLGAVAHACNPS ILGGQGRRIT*GQEFE 
TSLASMVKPC 


11283 


25184 


A 


11380 


167 


1 


SNGLYSREAPJ^VQHTHIRreWAHTCNP 
STI^GHGGRIT*AOElxKTSLGNIVRPC 


11284 


25185 


A 


11382 


12 


395 


AHS S FLS IxDIiS VFATCPHPRAETQTGHR 
FSTLLPLSALYPK*DY FKKKKKKKKKKK 


11285 


25186 


A 


11383 


160 


2 


TMEMMLDIK*I*EIFLFEFKMGLKAAET 
TQNINNTFGPGTANEQTAQRWFNK 


11286 


25187 


A 


11384 


274 


389 


GQAQWLTPVIPALWETKAGGSIiE * GQAQ 
WLTPVT PALWETKAGGSLELKNSRPSLG 
NMVKPP 


11287 


25188 


A 


11385 


389 


37 


WSYESSWSFIFKSFLFSFMISAYDM*YA 
SQILFTPKLYRTHPYFLLVLFLILSILV 
ILETGSRFVSRLECX3GGISAYGSFK*LG 
SGNPPAAASLSSRIPYVHQSCWSDIAFN 



1233 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Asparttc Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Clycine, H=Histidine, I=lsoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop cod on, /^possible 
nucleotide deletion, \~possible 
nucleotide insertion 














PEFEM 


11288 


25189 


A 


11386 


214 


374 


KGGGRFKGSKFTSPGGGKKNFFIGAPK* 
NSRAGV* KRREGKN PGVTQLKXG VG 


11289 


25190 


A 


11387 


142 


410 


RFVCSTIKVLxRDLSSDRSNPGRFLSTSN 
*KXKIKKKKRGAPFKKKNFFYRGGGGKY 
FFFGAPKINFLGRVFQPGGGENPGRSPF 
KGAWG 


11290 


25191 


A 


11388 


144 


1 


CLSNEDS I KTPKGLGTVARTCNPSTWGG 
RNGWIT*GQEFKTSLANTVK 


11291 


25192 


A 


11389 


167 


2 


RNISSHL»LNMAITKQTNNNNCW * GYRE I 
ITHIHC*WEFKMMQLLQKLVPQKVKHT 


11292 


25193 


A 


11390 


2 


412 


SDQVQDAGRLISSEASLLGLPTATFLLC 
LPTGFALCTRVPGVFLLLRTSVILREGP 
TLVTS FYLNYLLKDLVS IQ SHWGLGLQP 
MNFGETQFSL+QKLV* KEKBSMPGMVAH 
ACNPRNLGGQGGLIT*GQEFKTILA 


11293 


25194 


A 


11391 


1 


206 


TGRIiAGHRVSSLPIRGGRAKAPLTSRTG 
RLNQDGLNLLTS * S ARLGLPKCWE YRRQ 
PLRLAVT SGFLT 


11294 


25195 


A 


11392 


262 


388 


VSK*KKKKKKKKKKXKKKKKKQ 


11295 


25196 


A 


11393 


78 


414 


ICFFLRAPPFFFFFFFF*TKIFFFFPAL 
NSMANFFFFWTPPFLGGIFLPPPFFLEG 
GGPGLASPPGGAFFF 


11296 


25197 


A 


11394 


238 


11 


I KWETAGRS SACL * S * LFERLRRENPLN 
PGVHG * DE I * *HHCTPAWVTVRPRLSKY 
INKNKK+ ARLAARCGGS RL 


11297 


25198 


A 


11395 


340 


1 


KHILVPPFINVSEIQRYSLV*HLLFVGV 
♦NLFFLFLSFKKPVLCSM* I ICHEI ISP 
PSKPKCKASHLSFV*KKNSNLWLGAVAH 
ACNPGTLGARGGR IT * GWE I ETSLTNME 
K 


11298 


25199 


A 


11396 


363 


245 


QMDLNRHLS KDDK* MANQCNNRCLTSLA 
IRQMQIKTIMI 


11299 


25200 


A 


11397 


292 


2 


DPKKEAGRGGAQL* FQLLRRFKHETHLN 
PGG*GCSESRSHSSPPAGATK*NGVSKK 
K* Y* APKEERHPRPVLVEPRVPQVPSPG 
SQTYRQDHS PRV 


11300 


25201 


A 


11398 


121 


321 


KKLYTHKS KSDVM I FLKIETVS WLYCPG 
WRVMAHCDLKLLGSS KPPTSAS *VAETS 
GTCHHARLIFL 


11301 


25202 


A 


11399 


118 


394 


QALPQPIPVLILLQFFFFFF*GTGFCFV 
AQIGGQGGHYG*LEPPIRGLKPSSRRTL 
RGGWD *GGHHNAQL I LGFFEKTGS CFW 
QAGLaKALA 


11302 


25203 


A 


11400 


273 


379 


AE PGWAHS YNPGTLGCRGGG I T * AQE F 
YTSLGNM 


11303 


25204 


A 


11401 


360 


38 


KI FGQARWLTPVI PAFWETEVDGS PEVG 
RSRPA*PS 


11304 


25205 


A 


11402 


254 


404 


GPAGHDFHMITPSKSTETGLGAVAHVCN 
PSTLGGRGGWII *RQDLETSLA 


11305 


25206 


A 


11403 


161 


3 


ILTPTLQYI KINLRGQAWWFTHVIPALW 
KAK*SGGLLELRSLRPAWVIEGDL 


11306 


25207 


h 


1 1404 


1 


170 


MGVSLLS PRJjECNGVISAHCNCHLL*FT 
FK*FSCLSLPSSWYRCVPPCLATTFLIY 


11307 


25208 


A 


11405 


249 


3 


NAKSRIQVISIWVFTVKFFHLYYLCNFS 
Q *NEMAGGQS IFRKKHLDFSLEHSGMMT 
THCSLDLPGSSDPPNSASSVGGTRH 


11308 


25209 


A 


11406 


70 


401 


CGKHGVNL I IHS FLNHLKYLLLLQMANL 



1234 



WO 01/64835 



PCT/US01/04927 
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X-Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
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FISQFIiLTCVSEPLIiF*YCHQIIKVGRIi 
HEVKQMRGTKKQEKMFP YWNNDKNVC PG 
VPAHACNPTTVGG * GGW IT* GQE FNT 


11309 


25210 


A 


11407 


257 


389 


GGSLLQKYLRPLVAFLFSFFFFS FETES 
♦SWRLECNGMISAHC 


11310 


25211 


A 


11408 


364 


420 


SQ P FGR PRQADCLS SG * AQWLT PWPAL 
WEAKAGRIiLELRSLR 


11311 


25212 


A 


11409 


312 


413 


TSGMGPGWAHACNPITLGGQGGWI T*G 
REFETS 


11312 


25213 


A 


11410 


283 


425 


HGHKDYVHWLGAVARACGPS I LGGRGMW 
IS*GQEFETSLANMVKPCL 


11313 


25214 


A 


11411 


389 


3 


CRIDGLFFNLVFIiRLWWKF*VIFFLSY 
FS FLVI I LQFLTYMLNCTF F * Q I LNLFP 
ILVLLTKILNTLLSTENYCHFSLLYCLE 
FWFHPI IKHKMYNLFLI * FILI FILRQS 
LARSPRLKYSGTISIH 


11314 


25215 


A 


11412 


267 


3 


QEFQQEDTSEIiDLEIRSLCKRKGGLKQG 
DTKDNLS KRVWETDKGKAAWWPGAVTH 
ACNPSTLGGQGGWIT*GQEFETTLTNRC 
PRV 


11315 


25216 


A 


11413 


3 


281 


GAWSCRI PGEGSLSASQAIQPFSLYPVN 
PQ* ISLSARKKKKKKKKKKKKKKKKKKK 
KKKGGGGVLKKI YFPPRGEKKFFFFWGP 
PKKTGGGGF 


11316 


25217 


A 


11414 


275 


2 


KTAHFQLFNSDFKFPGSRNPGQSWSPRA 
KQS FCTQALLKY FLIC* MNEMKLNKSRSG 
AVARTCNPSTLGGQGGWIT*GQEFETSL 
ANMAKPR 


11317 


25218 


A 


11415 


319 


1 


RNLGVQVYEYWLVYKYCSVSSNNKMKN 
KRDLLVI * VPGI VIHLIYRL I KENPELY 
ISLFIYSMRSIiKKLTYRPGAVAHACNPN 
TliGGRGGWIVCSQEFETSLGNM 


11318 


25219 


A 


11416 


121 


428 


LSCTKWGJSn^MTKMLSKELKKKKKKKKK 
iOCKKKKKKKKKKKKLGDEEEDKE* QSSW 
GGGGALIKNILFPSRAASGFFLSLLKYK 
KI/3AAVYI ILGEKTI LWL 


11319 


25220 


A 


11417 


79 


253 


KI FNFQFSNMLENNSHTTRINNYFPGRE 
RWLTPVIPPLWEAKAGRSLEVRS*RPSC 
P 


11320 


25221 


A 


11418 


180 


4 


EVGFMKPGLCGVFLEAGSHSVALTIVQW 
CDLGSPQHPPLGSRDPPP*AIiQIVGNTR 
RVP 


11321 


25222 


A 


11419 


154 


3 


STLKDKENQKLS IQPDTMAHVCNPSTLG 
GRGGWIT*VQEFEISLNNMVKP 


11322 


25223 


A 


11420 


436 


478 


AIPLPH*RKKILHAALFFFFLKGSFVLS 
PRLECSTKI WAHCSLPPPNI SNPPASSS 
QEGGTT 


11323 


25224 


A 


11421 


143 


398 


AAFGFWFVLIRFIiRQRIiALSPRLDCFFA 
KSASCQPRPP*FKRVCWLSLPKSWDYRF 
DLLCAAGLVWVYLSVSRLHTNEKNQPPP 
P 


11324 


25225 


A 


11422 


1 


138 


MVFHRVGQDGLDLLTS * SAHLGLPKCWD 
YRHE PPRPAAHSYFXiijrltV. 


11325 


25226 


A 


11423 


2 


390 


LELPILLLNTLILFLFLILYIYLVFSPN 
AFRILFITIVLESFSFILLVPSKPWKYS 
LILSNAFVLCMMISSPPFPLFSLSTTRI 
inOO^DFWLGAVAHAYNPSTLGSQGGVJI I 
♦GQEFETSLTNMAKPHL 
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Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 
Q=Glutaroine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


11326 


25227 


A 


11424 


267 


414 


VFQI TFC I KNFWPGWAHACS PSTLGG * 
GGRIT*AQEFBTSLGNMVKPC 


11327 


25228 


A 


11425 


189 


73 


HQDGLDLLTS *STRIiGLPKCWDYRHKPP 
RSARPNAWVAP 


11328 


25229 


A 


11426 


318 


33 


KKTISSHARGRKLGSFKTAAPFFFFF*T 
GSCSLAQAGVKWCSHSSMQPRPPGLK* F 
SASGSAGIIGMSHHTRPIAGFLLAKLKK 
KSNSTGYGGAHV 


11329 


25230 


A 


11427 


291 


393 


DRVSLCLPGWSAVLPS*LTAAQISWAQV 
IFLPQP 


11330 


25231 ! 


A 


11428 


51 


399 


KRVARQNPPNGQGFGWCSCS P IL YGQSR 
AQGENIYKLLGSGEWPGQIiI idvereql 
EDWRQGDLGRGMWMTKARVHKGPGLVAG 
TCNPSTLGGRGRWIT*GQBFKNSLANMV 
KPCF 


11331 


25232 


A 


11429 


242 


420 


VKKKKRGGRFKGSKFTSACLQGISFFKG 
APKLNS RAGF * QRWEGKNLGVPQFNRFG 
AHP 






a 








TKRGDSS FCLYRI WI IDSVMKWNQSKAA 
AAAAFFFFLRGTLLFLFRLKGKVGASVI 
♦NFCLRGWGGS PAS PSKEVGMEGACHNS 
GLFFVFLEKRRFPLVGQEFKTTRAN 


11333 


25234 


A 


11431 


121 


439 


LPLPTSGPEDPLALS FKPTPFPSTLSGM 
IYTGHPSPEHTSHLMPAPPVNNNKACRF 

KKKKI FAP PGGE I FF FFWGPKK 














11334 


25235 


A 


11432 


109 


283 


LTPWNLDSRCEIKGNGKPQENHLNLGGG 
GCSEPKSHHCTPAREKRQNSVSKK* INK 
IK 


11335 


25236 


A 


11433 


296 


1 


FLKKPSKGFSKLPHLPQNLNLFFPPK*K 
CPRAKKDI PPPS F* KISSLFGEG*GFTG 

RPGWSAMASSCPRV 


11336 


25237 


A 


11434 


385 


418 


QSETI»SQKQKTK*E * SWLGAVAHACNPS 
TSGGRGGWIT*GWEFENSLTNMVKPCL 


11337 


25238 


A 


11435 


3 


405 


LLHQGRLMLGQWGNGRTKTIiLMRENVGK 
FPAEPSCSAIX3VWLQALACYPGLVRRQ* 














G 


11338 


25239 


A 


11436 


221 


3 


KHEFQTIQMESMK*NTSIKICIYAS*IQ 
DCLRYQWQRPAAYAQAITEHFGRPRQAD 
HLRSGVSDQPGQHGET 


11339 


25240 


A 


11437 


288 


366 


KMFFS WA* WLMP 1 1 PELWEAKAGGS L 


11340 


25241 


A 


11438 


160 


370 


LWYHYFPVRDGLTILVTYQLKAI RRLLT 
LKSRSGAVAHTCNLNTLGGRGGW I T * GR 
E FETS LANMEKPHL 


11341 


25242 


A 


11439 


230 


404 


MQHGQVGIYSQGGGWRPVDKKNSQLGTV 
AHTCNLITLGGRGGWIT*GQEFKTSLTN 
MV 


11342 


25243 


A 


11440 


302 


54 


LSGVWFLTPPPKGGGFPTP*NGGAPGFF 
PPPPFKNPPPEGKLGGPKKKKNFPPPGG 
EKLVFLKGAPPFFFFFFFFFFFFFFL 


11343 


25244 


A 


11441 


271 


416 


KRTLSTELQ YMCCLLI KSKS I FK*KKKK 


11344 


25245 


A 


11442 


254 


392 


SVTGMTIMTSSCCYKIQKCTKIQKLARH 
GGTCI** SQLLRRLRQENS 


11345 


25246 


A 


11443 


390 


150 


LLKNQKTLGGPARNPPPLGGPNGGVPKA 
GNWGPPGPPGGTPFFFKNQKITPGGGGG 
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nucleotide deletion, V=possible 
nucleotide insertion 














P*FPPLRGGGPGNPLYPGGQGCH 


11346 


25247 


A 


11444 


122 


1 


PAPLFFFFFFETQSCSVA*AGVQWRNLG 
SLQPPPDDGTRL 


11347 


25248 


•A 

A 


1 1 A A C 

11445 


IT) 




RPKVDKTTKMGRNQSRKAENSKNQSASS 
PPKDHNSSPAME*SWM 


11348 


25249 


A 


11446 


70 


397 


LCHCAPAWATERDPVLGEKKRTMMSYCH 
PPMRMVEKKIVGDIKC* *GYGETGFLTH 
C*WECKMVQIL*NIV*CK^KNLNVHSPY 
ELAIVLLGIYPREMKTYIFTKKTSM 


11349 


25250 


A 


11447 


3 


401 


HASAHAS AHASAYYRLCAYP KS S FLYPS 
SFLLWSNFPLNILDTTIGPKATETLFI 
KHVMHDVTSMQLKSKTFYNYFFLLSIiNV 
"FYFVLSCCKRRKRSQIiGMVSHICNPSTL 
GGRGRWIT*GQEFETSLANMA 


11350 


25251 


A 


11448 


333 


1 


KSGCIPPKGQWLFFIjQKTAKIGS PKPV 
S PPRGKGQFFPQGLRGFY*FLPPPPGVP 
E IFFFFF* DSSHSVTLSPRLEYSGMNTA 
HCSLHLPGSSNPASDSHIAGSTGTCHH 


11351 


25252 


A 


11449 


104 


402 


WCRPLLrGGVSQLGYMGVMD P FKEAVdi 
FSELKHCAGRTTALFRAVRQGCLSLKNF 
LLP FFQLCPAHRGGVL*RQ* ALLS CGGL 
HPVRASRPLCLPTQA 


11352 


25253 


A 


11450 


198 


2 


KYKFMAGLGGSGL* SHHFARPRQQDGLS 
PGV*NQPGQHGETPSLQKKVSWAWWHAS 
VEAEAGEQP 


11353 


25254 


A 


11451 


1*71 
1 /I 


j 


TPPPRATSLFFFF * TESHSVSQECNGTI 
SAHCNLCLPGSSDS PAS ASRVAG I TGA 


11354 


25255 


A 


11452 


111 


461 


PMTGGKYISNSTCSKPNSLSPHLPLKTQ 
TKNMEHFGRLTRGDCLSPGV* DQPGQRG 
KTLLLQKISQALWDVTAHAWVVPGLPTR 
ERRRSLSFPTG 


11355 


25256 


A 


11453 


247 


3 


" EGECCGIESR* ILHLKGRICWHINYLKI 
TKI KN* VILKKMYKQKLRPGAVAHAYNP 
SALGG*GGRIARAQELETRLGNIVR 


11356 


25257 


A 


11454 


328 


57 


EVHSNICATLSTHQMKILTKGTRHQHIS 
*GFF*EGEVLTLSTKLECSGVIMAHCNL 
NLGGSSDPLALVSQVAETRGASYLLLAI 
CPKVNK 


11357 


25258 


A 


11455 


70 


435 


ATRAKLHLKKKKEGRKKEFFPHWQGQPT 
PALVFPQPRTHFI FKVSIFSLKKNREQL 
PYTVQTQSLLPLNSHWGQVA*AGVQWRN 
LRFSCLSLPSSWDY 


11358 


25259 


A 


11456 


112 


401 


KCWDYRRERPCPAPNICIF*LTFFFLGK 
KLNQSFFFFFFLKGSLVLPPRLKGGGAF 
L FN* NLTLQGKNNS PAS PS * RAETLGDG 
PHAKLWFVFLKK 


11359 


25260 


TV 




l>o 




" EKTSWQSGDSNKKAWLEMKTNKKSLKR 
♦GVYAHACNPSTLGGRGGWIS * GREFET 
SWTNMEKP 


11360 


25261 






1*7Q 


i 
i 


FFHMYNHIVCEYTITLSVNINSQGPGPR 
AHAYNPSTSEDRWI P * AQEFKTSLANMV 
KPH 


11361 


25262 


A 


11459 


210 


5 


RLIHYFFKNNLPSGGGSTA*GQEFEIRL 
DNTVRPHFLKK*KISQVWWCMVVVSATQ 
EVEVGNGLSPGV 


11362 


25263 


A 


11460 


139 


2 


RILALSKLFFFFFKQESHFFLPMLECSG 
VI SAHCNL* LLGSSDLS 


11363 


25264 


A 


11461 | 388 


272 


S * KKKKRGGRFKGSMFLS PAGQGN P FFM 
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GPLKSISLAW*QRREGKNLGAPHITRL 
GAHP Ij FAG WHKTQGAE P 


11364 


25265 


A 


11462 


2 


387 


CVWKSNNKKAPPCVAALRFRWKDDMDEM 
APGKHWGGLTSKVQSGGLMGRYTGPLTK 
TENS I LLKNPYFEAGHSGSRL*HQHLGR 
PRWEDYLRPGGQGCSEIi *SSHCTFT* VT 
EGDPVSKKLKRTIFGG 


11365 


25266 


A 


11463 


325 


398 


SI#PTWAVCAPRPSG*GSGIPELKTMLAG 
VILEDYLDIKNFGAKVVGLSCTLATGST 
LFLGKVGPPVHLSVMIAAYLGRVRTTTI 
GEPENKSKQNEM 


11366 


25267 


A 


11464 


174 


2 


YLEILPFFETGSRSITQAGVQWSHPGSL 
QTPPPCPSNSPASA*VAAHASADAWADA 

w 


i i inn 
1 1367 


25268 


A 


11465 


105 


411 


TDVE LLLMDE * RKWFLELETTPGEDDVN 
IVKMTTKDFGYS INLVDKAVSEFKRIHS 
YFERGSAVGKVLSHS I VC* GE I FHERKS 
PLMQQISLLTFILFYFSE 


11368 


25269 


A 


11466 


291 


404 


SQEGVKIKRQGTVAHTCDPSTLGG*GGR 
IT*GQEFKTS 


11369 


25270 


A 


11467 


64 


236 


THASGIiDbLTS*SSCLGLPKCWDYRHEP 

PRP ZVR FQ Q PYQfiQ T »T tNYT. a V T T WT\ a 1?P 

I 


11370 


25271 


A 


11468 


278 


416 


NPQI S CLKQHVWGAVAHICNPSTSGG*G 
GRIT*GQBLKTILVNMVK 


1 1371 


25272 


A 


11469 


178 


325 


RPEVKDQGAS CRDQPHRVGGLLPVCSDE 
RVPQK3CKKKKKKKKKKKKKKKSKKKKKK 
















11372 


25273 


A 


11470 


264 


60 


DINHQSS WLSGRWTPLS WVS I SQRBDRR 
GFSAS IYIYI YIYIYIYIYPPIHIYIHT 
YT*SYHFNQNT 


11373 


25274 


A 


11471 


95 


399 


DFPHLIAFLPLSQFCTACCPVPENRCFM 
YFFLI FSALCWEMNSRISHSNTARVEQC 
CKVFKHjQMKSVLGQVQWIiTPVIPALWE 
AEAS * liPB IRSRNQPGQ 


11374 


25275 


A 


11472 


139 


2 


NHVSTEKLSLHTMAHTCNPSTAGGQGGQ 
IT*GOEFKTSLATMAKP 


11375 


25276 


A 


11473 


179 


3 


RARWNPQVFC*GFLPVFPKILPIWPLRK 
GVPFPPNFFFFFLiRVSLCHPHWS AVARS 

RL | 


113/0 


zoz n 


A 


11474 


282 


12 


KYIRNE * I VLFVT YFCRNRISWCCPGWS 
♦TPELKRSSCLCLPSCWDYRHEPLCPAE 
MNKLDRLNKMDTTKWIQQK2CKNHRTQRQ 
DNKII 


1 1 J ft 


XJZ/O 


I\ 


1 1 yl*7C 

11475 


14j 


3 


VCMQLRTD FQ PGAVAHACN P S T LRC * GG 
WIT*GREFKTGLNNMEQPC 


i mo 




A 


11476 


61 


397 


PQTPRIiKQSYHVSLPSSWDYSWIYHERL 
VNPGRETDWATCYSGOJIQLQENKLNTP 
1 Lfo 1 1>W VS JjC.CFKi)BCSSAIPADCSIiNL 
PG*SDPPTSTSHVAETTGAC3YAQLI FG 


11379 


25280 


A 


11477 


156 


3 


FIFLKSSQNSYI * KCWPGAVAHACNLDT 
LGG*GGWIT*GHRFKPTRRPRV 


11380 


25281 


A 


1 14.7R 




JO J 


KNKKKKKKKKKKDSRGGG 


11381 


25282 


A 


11479 


339 


58 


NFKNKLCFCASGYLDSKDSQHYI IFFIR 
WSLALLPRIjRWEDCLNPGGGGCSEPRSY 
HRTPAWVIE*DPVSKK*IK*NKIK*KQQ 
DWLRLHDKIF 
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nucleotide insertion 


11382 


25283 


A 


11480 


244 


407 


SFGKKIGFLFGPKKFNGGSYGNYYKINR 
RF FFP ILGEGFSPHVYF*NLETRTTGGG 
IJLtElTKTTFNIiLGRKLGSFLGPRSLTAA 

IfXvj X 1 XlvLi XvjijJ/ r J? w ™ r Jrtu'i x jl s vt 

KDKWKKGF 


11383 


25284 


A 


11482 


308 


30 


SPQKNirJr 1 xo\jr JSJS^JjoAolrv^J\fVftJrK.r 

FLKNQFLPGIMGGGRQFPLLRRVRPPNC 
FNFGGKRCN*QKLGPCPPGGPKETVSKK 
KKKRERQQ 


11384 


25285 


A 


11483 


226 


407 


1JHWNKK* FW P KVAHTYNLSALGGQGGRI 
T*GQEFE J. bPoWMKk'riA© 


11385 . 


25286 


A 


11484 


10 


411 


QPPIYFIiSLRISLFWVFHINGIIQYVAI 
CVWLVSCHNFSRFICVAACDQDFIVFLW 

L1^IIIKL*TFVKKKKKKKKKKKKKKKK 
KR 


11386 


25287 


A 


11485 


184 


1 


rnnnriT f niUi^I7AT3XrMl?T.1?inrt>Tr , Qf2WnJVVM 
T DisiiLJjPC'TUlS yKixW r Li&rJsJtr X &\saUJ\.y£* 

I FEMTRKD IECYVH * VDKAAAMYERIDS 

NFLSN 


11387 


25288 


A 


11486 


435 


222 


AKNLNRHFSKKVI*MASRHIKRCSISLT 
FGSVSPLKSHHLKL 


11388 


25289 


A 


11487 


317 


408 


FRPGTVAHACNPSTLiGGOGGWIT*GKEF 
KT 


11389 


25290 


A 


11488 


340 


469 


GKGDTKCHLWIX5TVVHACDPNTLGGQGR 

n T 7\ -fr/"*/"YE , 17Tr*T l CT.ClVTM 

RIA* Gyifi r JX 1 o XjoJM w 


11390 


25291 


A 


11489 


38 


389 


KRPTSLKKSNFCS LQGF FFFGLSQKPGK 
SNRS PGKKKNPFFSGGKAP * KGF* KI FL 

£> i^tj Ir yJMr JaJjXAjVJlJVJSlUN X Xi X XjJB 

GCRDPK* CPCFPPWAGEGNPFS KKKKQK 
FPFFG 


11391 


25292 


A 


11490 


235 


2 


FVI^LMPKLHIPVTij:r J RRLFAFPEFL* 
BTjFCFVLSx^VLIiETGSHYVTYAECSGA 

XSa VHtJoiaTi LiFLsooU f rXo VXi v 


11392 


25293 


A 


11491 


25 


417 


GTLCLRIGFINISHHCMILKENSIjFFFF 

wetkfyfapqadgrgpnlg*whppspg* 

RGSPGP I oKi^vawVjol^ JrlrVxIr V XiroJr J-Xiv. 

KNGVPPGGLKGPKSIiAPGIGPPGPPKDR 
rr.urYiDDa prsT.T.TnrwPKvw 

VjLiKvjvj ir x^it^iaujajxw t ivvri 


11393 


25294 


A 


11492 


234 


438 


MWIIAQQCDCTECHY*LKMAKMINFML 
FIFYHTHTHTHTHTHxTnHTRERGPSYI 
YGKVS FKLTHL 


11394 


25295 


A 


11493 


344 


477 


CFTMFIS * IKT* KFNISQLVAHTCNPS I 
LGGQGGRIT*GQEFKT 


11395 


• 252% 


A 


11494 


35 


482 


J. Jro J? rill v uiiU j_Hj i. o * o jl nuvjijirxvv_ 
WHYRPGPPRPAISITPST*AYLLGNQKI 

VjV XXlC XlrVXXinXil? JC Ov7i. Zvrul JLOir XvX V > — U X 

FYl^VEEHlJGCFYFIiAINT!ATCVCVYKL 
SIRLYAVLL 


11396 


25297 


A 


11495 


307 


484 


LS PS KRHELKCVLW PGAVAHAYNQSTVG 

VjKXjrty JL x uriir 


11397 


25298 






**o / 




FLIKTEFHPIGQEGFDFLTL*FAElLGIiP 
KGWDYKGE PLRLAFWHPF 


11398 


25299 


A 


11497 


224 


417 


ASGASALKVELCSFSRYKIYPRHKRRYA 
RTDGKVF*FLSAKCEGITFFLSFFFFFL 
EVLFCSPG 


11399 


25300 


A 


11498 


17 


411 


KRLVFGGRGRTDRF I SQ YV* AE YVS INL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: In 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

AftiinA itcirl 
uiiiiiiv auu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

H in a in 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K-Lysine, L^Leucine, M=Methionine, 
N=Asparagine, PHProline, 

i u i<j in t u iv— *Arginine, o— oenne, 
T^=Threonine, V=Valine, 
W=Tryptophan, Y^yrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion) \=possible 
nucleotide insertion 














LI FLTKKKKKKKKKKKKKKRGGALKKKP 

KKKNFFWGGENWGKPPPKKKTPWGKKKI 
FKGKGGKTPPYPCFLKKMG 


11400 


25301 


A 


1 1400 


1 17 
11/ 




ETAF FLGGWTGSCS VVQPGV* WCNHSSL 


11401 


25302 


A 


11500 


244 


1 


EELFECYSCSPTFSSDPLTTPLLILTT* 
LLPLTIMASQRHIiSSEPLSRKKLYLSIL 

T QT.O T QT.T TTPTATWT .T TT7 VT 17X7I7T 


11402 


25303 


A 


11501 


130 


55 


RFVCSTIKVLRDLSSDRSNPGRFLSTSN 
SSLKKKKKKGRPF*RDNSAILF 


11403 






1 15fV> 




/171 


QPPT.VTanTT<^XrUTTr*T , IjflT XrTTT t T xtt t rr t 

o r t\±i tx u±\j x V xltlv* Irl Lt V Vii X IjNLVXjL 
K* ITFALKIMMFSKYSDIGQAQGLMPVI 
PALWEADAGGSLQPRSLSLGNK 


11404 


25305 


A 


11503 


1 


213 


GELLFLYIFA*WLFLLPKLLKFVTVCLC 
ELQFNERFIMVLCIIIILGVGGFFLFCF 
LL * FCFDFFFFFWRG 


1 1405 






11 3U<1 


o 


a 
0 


AGVQ WPRS WLRL P * LFFTQ VLM I FFP P F 
YHQNFFFP*KGFFFLGGLSHFFPPPNKG 
FFFNYPQGFFFSPSLKKKFFFFSSPIFL 

& D Dfl T T?T . * /ID DT5 1?T?17 171? 17 17 17 PT? * 15YT ro T 

CHLGWSAVAAIIiAPPAGFTPSSCLSLPS 
S 


11406 


25*?07 


A 


1 15fK 




44U 


KI I IRVQTTPDYS PQEAFTNAITDLISB 
LSLLEERFR 


11407 


25308 


A 


11506 


256 


370 


GKLM*GHHVGQ^6LKLIiTSSDPPALASR " 

GAfiTTYlVWDTJTiDli'DT XTW'IT wirnAWTTt 
Onwl ATjrVWKrlAirFKljWVliai 1 KGQTIIIj 

YHNQYNQ 


11408 






1 15H7 


i 




O/^^rLJOlD T'lTT .'CIS T?f"\TJ VCT 7VTOT vmr «*"«rNT5«'"«/"\ 

WJWWJrKi VJUc Ar ytllsx VNrliKTliGDFGQ 
VQWLMRVI PALWEAEAGKLPEARRWRPP 

LMRVIPALWEABAGKLPE 


11409 


25310 


A 


11508 


316 


420 


YTY I YE PG * VQ WLT P VT PAL WEAKAGG S 


11410 | 


25311 


A 


11509 


136 


2 


LDNRLWPGAVAHACN S RTLGDQSGWIT * 
GQEFETSIANMMKPHL 


11411 


25312 


A 


11510 


167 


420 


PTPRSGOTALQPPHHTHTHTHTHTHTHR 

MTH'l , HM , H , l , H'l , P2ir , T *• V\rDT7T3 7i T T?cavw 
ninxnininin/iLii ivvrC vrCrtJjj? o crlvl\.*v 

PLFCLSVSQIYVGRCLLKRLLFIFLYRH 


11412 


25313 




lull 


194 


415 


T YT QVAWT T T T . T T? T Ik. T K\rr .a T Q WT? WWirv 

EVLFLPPRREGGGAF*VN*NLCFtfG*GN 
FPA* PSLKKGITGALYTPGLFFVFLKKT 
GFRHVGQAWLDFL 


11413 


25314 


A 


11512 


121 


1 


PPVKFWAPFFFFF* IGSPSVAQAGVQWN 
NLSSLQSPPSCL 


11414 


25315 


A 


11513 


245 


13 


VLFILSGFFGPINYPTSPPPQLPFPASG 
NHPSTLYLHEFNCFYFYFLFIYFF*DRV 
SLCRPGWSAVARSQLLVPSDS 


11415 


95"? 1£ 
Z J J ID 


A 






z 


Kl/Jf JS r Jrv WtarbF-f Jfl? r r IfrGJsGIWGQKER 
KGASSPPWSRGILRGGGPFGGPGAPPT 
PLKKPPLFFFLAP PFLGVFPRFFS FPTV 
FFLPFFVCPKKRBALliGVPKGPRPFFFR 
SFLFFFSGD*VSI»CRPGWSAVARSQLTE 
SLAS 


11416 


25317 


A 


11515 


247 


390 


QGLRIYSQDFFFFLETKSRFVAQAGEQG 
GNIiG* LE PPPPS * SDFMASP 
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SEQ ED | 
NO: of 
nucleotide 
sequence 


SEQ ED 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=dycine, ENHistidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
QNGIutaraine, R-Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y==Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


11417 


25318 


A 


11516 


269 


368 


FFFERESCSVTRAGVQWHRLSSL* PLPP 
AFRRF 


11418 


25319 


A 


11517 


256 


404 


NILKI FYLRI * KKXNQLGVVAHACNLRT 
LGGRGGWITRGQ* FKTSKBKMVK 


11419 


25320 


A 


11518 


211 


404 


YLCFTRCSVHSHLK* VFLHKCVI *MYRP 
HT*TGIi* CMQVHTHTHTHTHAHTHTHTP 
LSQ66DCI 


11420 


25321 


A 


11519 


41 


232 


ESVE PRRRRVPRAQ I TPLHCS LGDRARP 
CL*SQLLKRLRHENLLNPGGAGCREPRS 
HHCTAAWATEQDPV* KKKKSLNFKTVRA 
INLKMLGALISFHLFLWGIWF 


11421 


25322 


A 


11520 


146 


2 


LIDQQGVNQLPRMECSGAITAHCCLELP 
GSSDPLTSAS * VAGRPPTRP 


11422 


25323 


A 


11521 


297 


437 


TDHFWNRILSSCGDTFRDTWLGWPHAY 
NPSTLGGRARRIT*AQEFB 


11423 


25324 


A 


11522 


244 


460 


NCATITTNSRIFLQIiQKKHCTKWIiSLSI 
SWQSLISF** ICFVLFFETNLAVSLRLE 
CSGAILAHCNLCLLSS 


11424 


25325 


A 


11523 


222 


1 


ESKDLDIFRCQLAPRNSSCFVLFCFVFP 
LEIGSCSVAQAGAQWCNHGSIjQPPPPIA 
S AS * YAGTTNVRHYVQLF 


11425 


25326 


A 


11524 


308 


424 


ITFLTMFKIYLQFDPAIPLLGIYPKDY* 
SCCYKDTCTRM 


11426 


25327 


A 


11525 


98 


2 


NIGGRPGAVAHAYNPSTLGGRGGQIT*G 
QEFE 


11427 


25328 


A 


11526 


113 


399 


LDRFLTLLPRLRREDCLSPGGQGCSEP* 
SHRCVPAWATE*DLLSKKKKKGVLEKPF 
LPPPSVGKPPFSPPERLFFF ILRPPLGG 
VFPPPSKKIIS 


11428 


25329 


A 


11527 


280 


3 


PKYCQI * VLQCIAQLFKLSI YNTYTHTH 
THSLSIiSLSLPPPKVNKWLMSSEFTLTF 
KMYTFGPGRVTHTCNPSTLGG*GGWIT* 
GQEFETNL 


11429 


25330 


A 


11528 


284 


2 


FWGPQKKNFSFPPPGLKIGS FKRAPPFF 
FFFFETKS CSVTQAGVRWCDLGS LQPLA 
WATERD* LNK* I KGMS KKLHT * VCMFRT 
RGRTRGRTRG 


11430 


25331 


A 


11529 


134 


2 


DTUjPRLECNGTI tahcslklpgsgdpp 

AS AS * IAGTTGMCSHA 


11431 


25332 


A 


11530 


104 


1 


GRGFFFFFFFFF*EVESCSVAQAGVQWL 
NLGSIQ 


11432 


25333 


A 


11531 


157 


373 


CVFYTYSTSQLRCYI FIGNTRS VFRCHE 
I YT*KNRFWASTVAHTCNFSTIjGSRGKW 
II*G*EFMTSLANWR 


11433 


25334 


A 


11532 


237 


389 


ICKCSKTGFG*DFQEFETSLGNIVRPCF 
YRKIKASQV*WHAPIVLATWEAE 


11434 


25335 


A 


11533 


259 


122 


H IS PS VQRHFFL F YF * TKPHLVAQAGVQ 
WCNuS SIjQS P PPGFKRF 


11435 


25336 


A 


11534 


164 


1 


SSWSQALILLFLKKTCFWPGAVAHACN 
PCTLGGRGERIT*GQKFBTSLAS INL 


11436 


25337 


A 


11535 


106 


310 


GGDKGTVWDTLHF FDGQKRLDHNKDHWM 
TIHAELPHELSA*CYAFKKECISGIGSI 

* 7\ PXTC 1 VYTfiPl v* 1 

* AEKB I K~iy rDG 


11437 


25338 


A 


11536 


167 


378 


ASFFGTTITYKTILVYVYMFEKFI IYIH 
MYTF*RQGLVLSKLECSNVI IGHCNLKH 
LGSSNLPTSASRVA 


11438 


25339 


A 


11537 


101 


396 


VNNLKQRIGI1SKRGEKKI4LVLISGFGGG 
FPGAGWIKKGKPPPGFPQGLGP*KRGPP 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Add, {^Phenylalanine, 
G=Grycine, H=Histidine, I=lsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=^Tyrosme, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=po5Sible 
nucleotide insertion 














GKKGNFNKMGFNPKTLVFKNQNFGPGGA 
KPGTPGVRGPLGDL 


11439 


25340 


A 


11538 


18 


361 


QIGPRLY*DQNGETPSLLKIQKKKIPGF 
GGGPLKSQLLGGLGWENH*NPGGGGCRD 
PK* PQGLPPWGTNQNFVKKKEGGREKGK 
EGRKEGGGEKKKTLPKGIGALNLGGLLT 
SP 


11440 


25341 


A 


11539 


134 


365 


PGFQNKKTGGGGPFRPKTKTNFKVF*KY 
LKICFGFWPKRPPPPGFFILETRSLFFI 
FFLGFFF*ETKSRSVTRLKCSGVTLAHC 
NPRLVGSSHYP 


11441 


25342 


A 


11540 


2 


318 


TTERFIYRITFGPKETEVNDLPGAPSEP 
VAALRLEARSFILKL*SFSLVFDFFFLP 
F*KTGSCLSPRLECRGEI ITVHCSRELP 
S*SHPPTSASQSSGSGGFHLG 


11442 


25343 


A 


11541 


294 


409 


GTISFFRTCRLGMVAHTWNPSTLGGQGG 
*II*GQEFQT 


11443 


25344 


A 


11542 


1 


123 


GKQRQE PALS YDHATVLLTS * SAHLGLP 
KCWDYRSDPPCLA 


11444 


25345 


A 


11543 


69 


350 


VGAKHW IHMDTKI ETADPGNYYSGERGK 
RPRAEKLPVCYYALYLGNGIICNPKPPC 
CWKGHDFIYIYVCGHIYIHIYICTHIYI 
YFF*DGGLLC 


11445 


25346 


A 


11544 


103 


340 


LFCAFFliSPSKLDFYHI*SFYV*TIiFLKD 
LISIiAFFFLFWEAGSCSVTQAGVQGPNS 
THCNLCLPGSNNPSTSASQAAGT 


11446 


25347 


A 


11545 


256 


354 


PYPFGKPKRGDHWGLGV*NQPGQQGETP 
SLLKI 


11447 


2534S 


A 


11546 


196 


335 


EIVARLKRSPVGRAQWU4PVI PALWKAK 
ESGSPEVRSS*VKKITCKPGTVAHACNP 
STLEGQGKWIT*GQEFKTSI*ANMVK 


11448 


25349 


A 


11547 


202 


377 


TIHTKGVMVGNFI LVVFYQNFFKTFF* D 
KILGWARWPMPLIPAFWESEMGGLLEPK 
SL 


11449 


25350 


A 


11548 


396 


26 


PPRI KKE KRSPNGAFYPFPCWNF* KKKN 
LSLAKRFPRDSVFFFVSW*GSYLPTTFL 
QUjRWYVLTSHIAVSFKHLCEPHKNFTIi 
QHSIPYLYIIEIEVYSYFINRNTIYKIN 
MDFS I VGFVFL 


11450 


25351 


A 


11549 


232 


371 


REVPFENI KIGQVQWLTLI I PALWBVKA 
GGSLBARSL*KYQNWPGAVAHAYNPSTL 
GGQGGR IA* GQKFE I S PANMAKP 


11451 


25352 


A 


11550 


174 


381 


NRDEGFRYADRAEVQRLIjTGTILEHCGL 
QRPGL * RS S CFTLP CGWD YRH * S P S HVA 
GTTGISHHTRLIKKAFFSQTGVSSC 


1 1 AC% 

11452 


25353 


A 


11551 


262 


359 


DYSNLGLVWWLTTVI PALWBAKAGRSSE 
VRSS*PALWEAKAGRSSEVRSS 


11453 


25354 


A 


11552 


70 


380 


LHKCSFIYXNHCNNPIYGLISLYPKQIjF 
S YENFKDFCNKI * P * I PNFBLKTSTP* L 
EKL*KWAMARHGGSRL* SQHFGRPRRAD 
HMRSGVPGQPGQRGESPST 


11454 


25355 


A 


11553 


319 


56 


TFFLGPPGFF* KI FFS VGPPLI FFLPKK 

TLSSALECSGAMSAHDNLCLPGSSNSPA 
SAF 


11455 


25356 


A 


11554 


230 


399 


MMGEI INQANIISVCQIIRRLRHESHLI 
PGGRG YS E LRS YHCTS TWVT E * DS CLICK 


11456 


25357 


A 


11555 


376 


396 


F*IFWRDGVSPCCPGWSQILRLQQSTCL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ 10 
NO: in 
VSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
aciu 

residue of 

peptide 

sequence 


Amino acid sequence (A- Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pnenylalanine, 
G=Gryctne, H=Histidine, Msoleucine, 
K-Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
V^vriutamine, K— Arginine, o— oenne, 
T=Threonine, V=VaBne, 

W — TnmfnnKon V — TvrfKinp 

w — iryptopnaU) x — lyruMiie, 
X^Un known, *=Stop codon, ^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GLPKCWDYRPPHPATAASCPYFYHYYLL 


11457 


25358 


A 


11556 


176 


384 


KAWPNVRNNYR*NFFFQCCLNNFRKRLK 
CFRCGADKFGKTGFSCLQYAIiKPTNWGR 
EQWLTVNPTNFGK 


11458 


25359 


A 


11557 


295 


391 


PYKIJjITHTHTHTHTHSRTHTI*HTTLiS 
LYTF 


11459 


25360 


A 


11558 


1210 


1753 


NTWCMERLAVAFGS FNRNS PLRGWALWL 
TPIILALWEAEAGGPLEPQEFETCLGNM 
VKxiCl* I KMJjKrl Vol* V w lJJjirijw»it»ilil JjoJj 
QKFKKCVRRGGTCL*SQRLRRLSWEDCL 

KTKTKKFFGLPS PFLFVCFLTHI YVNKE 

VAPVT.APRA QftlTPT CTTT .TMVT<?RNRT/5K 
XAf V JUnuIvwUI^i lOXUJinv X O [UivJiAjA 


11460 


25361 


A 


11559 


172 


3 


KQKFFFPPPSHQIKIFGGGAPEMPFFFF 
FFBKESRFVA*AGVQWRNLSLIiHPPPHP 


11461 


25362 


A 


11560 


143 


1 


GQAQWFMPVI PVLWEAKAG*SLEARSSR 
PVCATATPPQLI PK* YEGL 


11462 


25363 


A 


11561 


84 


2 




11463 


25364 


A 


11562 


12 


387 


QGILIiPCFSMSSEERGRIYSNTFSFFFF 
TLQGPGNGGP PP P PGQNFFFKKKGGYPG 
PGPGQKFL*NPKG 


11464 


25365 


A 


11563 


230 


375 


RAKGKLLSESLLSIQSFKKPSVHTCNPS 
TIiGG* GRWIT*GIiEFTTSLA 


11465 


25366 


A 


11564 


287 


55 


f JrlAxK. V UWJ-i IrvV l^ijWIS/iJVEAjijolJ KoxC 

SS*SQINPVGSVPYSLRVLR 


11466 


25367 


A 


11565 


115 


411 


FFFFFFFLGENSFFSFRGGPRGGGHI FW 

« Jr Jr Fv? w \i V r xf Jr F f r ir^^alSx3\3i\cirir\j 

FFFFFP*KTGFPPLGPGGPPPPLFGPPP 
FPPPKGYNFRRDPPP 


11467 


25368 


A 


11566 


139 


356 


CLLIGWYNTGCITQGSTRiCIGTTFFLRQ 
LPDSAS *VAGTTATYP 


11468 


25369 


A 


11567 


101 


410 


FIiFFFFFFFIiKKI FFFPPRGGGGGGN* K 
KQTPPLGGKKNFPPPPPKRGGGGTPPPP 
PLLFLVF * GEGGFPGGAGGGKKTPKGNP 
PPPPPTTfinFMfTRfTPPPPCiR 


11469 


25370 


A 


11568 


417 


56 


EDCLRTGVRDLLPNPHPNLQIAGRDGAR 
T .* .GHT /YP^TWT ,<? PrsPPnYRRT, * R PWPRPA. 
WATE+AYRKQNKTKQNYRSSAIiLHLYTP 
MATAWI KIAFLRAGDIFLSVLSFLPNS Y 
PLFSHPSQ 


11470 


25371 


A 


11569 


301 


443 


EDTTFVCFEVESCSVAQATDS PASAS * I 


11471 


25372 


A 


11570 


44 


408 


RGRGHGPHSPWRRRRRLRWEGRKATES 
KRRAYKYPLPALLCSHQWKPNEWIiDSQ I 

IAPMLECRGAVLAH*HLRLPRSSLSHAS 
ASQVAVTTG 






& 








AERPVTGPPVYAEPGLQGRQG* KNMFKN 
MQHPRWADHLRSGVQDQPGQYGETPVST 
KNTKITQAQCI 


11473 


25374 


A 


11572 


343 


1 


FGLLFAFTLEYKLYESINHFAHCCVCRF 
S SGTLHVWCNRHPNKLDEWNSGGGGRQ 
D LTLTPRLGCSGV I MAHCNL VLLGLSN P 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: io 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

pepnae 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, RpArginine, S=*Serine, 
T=Threonine, V=Valine, 
w— .tryptophan, Y— tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














VSAS * VAGTIGAHSHAWIDKLFFVKTGP 

T V 
Ul 


11474 


25375 


A 


11573 


209 


3 


VGPPPKSLPGGFPFFFCPKCFPRLKKKK 
JUUrOJj VAX 1 (_NL«N I SGVRGKKX J7* AQEF 
KKSPSNIVKPWY 


11475 


25376 


A 


11574 


312 


430 


QRYARPGPWHCYNSSTVGGRSRGIV*A 
QBFATSIiCNMT 


11476 


25377 


A 


11575 


125 


3 


NSLGPWHACNPRSLiGG*GGRITSAQEF 
BTSIjG KI RRPLY 


11477 


25378 


A 


11576 


190 


362 

• 


CVNNYID* EKKKKKKKRERK KKKKKKKK 
KKKKKK*GGGAPKKKKKFTPGGGKNFPF 
h 


1 14/0 


25379 


A 


11577 


231 


340 


KRITRVNQDGLDLLTS *SAPLGLPECWD 
YTMKPCLY 


1 1A1Q 

1 


25JoU 


A 


11578 


60 


3 


RMQVSV* IFTPMSLQAYLIYSSS VAAGA 
QSGIEECKYQFAWDRWNCPERALQ 


11480 


25381 


A 


11579 


285 


406 


KKGNWPGAMAHTCNPSTLRGRGGWIA*G 
QEFETNVGDMAK 


11481 


25382 


A 


11580 


121 


1 


NLNCPGPVAHASNPNTIGG*CRPIV*GQ 
E FYTRI ANTVKP 


11482 


25383 


A 


11581 


199 


384 


KKTPPGGI FFCPKEAGPAPPL* TPFC* T 
PLFFSPAPFKPKRALKFFLKPKRGFFQI 
PFFFF*DGVSLCHPGWSTVARSCPRV 


11483 


25384 


A 


11582 


677 


934 


YGSRHCICFLQAISEILFLKNPARHGGS 
QVQDQPGQQSKTLTTiRKKSK*ARHNGS C 
LSSQNFGRLRQEGHLSPGG*GCSEL*PR 
HC 


11484 


25385 


A 


11583 


104 


2 


KKIGGGPPLFFFFETESCSVARLE*SGA 
ISAHCN 


11485 


25386 


A 


11584 


349 


9 


GGRSLGPRKTRVQWGIPEFPPLHLGGKA 
KPCFPKKK3CKKVKGLNKHYSKEDIQMVN 
RYMKKCSTSLPRERKI KTIMRYHFPS I R 
MAlIKitIKJjPKDQ*GCIEKGGSvvRPM^ 
L 


11486 


25387 


A 


11585 


163 


1 


TATOIKHNSYTLTPCLRHSRYLINVTCW 
PGPVAHACNPSTLGGRGRWIT*GQEF 


1 1 Aon 


25388 


A 


11586 


192 


41 


S KSKYHI YGDSALQKSTVYKW ITHFKKG 
* DDVKDKAHSTRLSMS ICEB KE 


11488 


25389 


A 


11587 


30 


374 


GWSPBDLFPGSLPPALKGFSGEGG* PFL 
PFQKKKGGGGGSSPPFPVIjKRVRPENKL 

KKKGFGQKKKKKKVLKEKT 


11489 


25390 


A 


11588 


66 


245 


slcffvffvff*tescsvtqvgv*wynl 

RRL 


11490 


25391 


A 


11589 


243 


402 


FLHLKKKKKKKKKKKKKKKKKN* KDAG 




25392 


A 


11590 


205 


3 


IRLiLG 1 1 KNPTPAHCQ * K* KTGEPLW* V 
VW* FFRRLHTELS YDPAIPLLCINSTEV 
WLAPGIPPRFR 


11492 


25393 


A 


11591 


404 


152 


MGFHHVSQDGLDLLTS * STLLSLPKCWD 
* CWDYTREPLCPGHFAI SFNKWLNTTT 
WLYFYLI ILHCIYCPYKLLFLFFFYIY 








1 1 CQO 

lijyz 


J 1 1 


380 


1 V X IrFJjAtoWljAHAGA'AGXVGTAJjSILI 
RAELGQ PGAL LGDDQ I YNVI VTAHAFV I 
I FFIAI P 1 1 IGGFGN *LVPLI IGAPDIA 
FPR1WISF*1^PPSFLLLLASSIVEAG 
AGTG*TvTPPI^GNLAHAGASVDLTIFF 
LH 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
VSSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide - 
location 
correspond 
ing to first 
amino acid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GJutamic Acid, ^Phenylalanine, 
OGlycine, H=Histidine, JNIsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P^ProIine, 
\l — vijuianiinc, tv^Arginiac, o-ocnucj 
^Threonine, V^Valine, 
VtMTryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


11494 


25395 


A 


11593 


39 


397 


EGCPLFFFVFFFFFFFGGKKI FFFPPRG 
GGGAPHKIPGPPPPGGNFFFPPPPPLRG 

\jJ\Xv\3Xr if c if rur £Tf f XviWjVj v our irrvAJVj 

VFFFSFWRGGRGGFPPPPPQKKIFLKGP 
PPFYKKK' 


11495 


25396 


A 


11594 


243 


46 


GILSCCFHDSLSLNSL*CD*VWISLCLS 

XuVg X olxLJLj^VAJijX X7 C IWvDOr TV 1-UJUJXIXV 
MYLQPLSFIiS 


11496 


25397 


A 


11595 


327 


2 


trxvnXjXCvXU" uumrvU\jr x vi'inwyruoiiv,! 
CHTAKTVPTSRPLLIiGFFSLPGrAHISM 
WLVTi^S YGLGFS S KVTLWPGWAHVCNPS 
TLGG*GGWIT*GWEFKTSIiANMVK 


11497 


-25398 


TV 


11596 


2yl 


Jtr* 


SLR 


11498 


25399 


A 


11598 


125 


2 


PFQGQRSGGTRGPGAVAHTYNPSTLEG* 
GGRIT*GQEFBTS 


11499 


25400 


A 


11599 


302 




TftttWT WAHACNP S 1 I £GRGRO I T * GO 
ELETRIA 


11500 i 


25401 


A 


11600 


409 


131 


GRSGGVPKSGVLKPPGPPGGTPFFFKKP 

Xv X XK.Vjv3V3ry tr r lie rill lytv, V rvt J-ii. , l UiiOUUO 

GGCH * TKL PPCPS TWGAQ* KLVSKKKKC 
EGGGRIYS 


11501 


25402 


A 


11601 


196 




YICIYIYTHTRVYIYLLAI* IYIYVYMR 

TYTYTYTYVYIYTHTHGYXYIY 
in ix xi>i vixi miiiwi j. a _l a 


11502 


25403 


A 


11602 


41 


252 


TPGRARGQRLGHS S TGLDLLTS * SACLS 
LPKCWDYRREPPRPAEDIKFMF*AILSB 
TI LVRGVRS RATP S 


11503 


25404 


A 


11603 


271 


440 


KKKKRGGRFKESKFTSAGLQGNI FFIGP 
PKLIPRAAV*QRGDVTKNLGVTQIJ^CT 


11504 


25405 


A 


11604 


429 


2 


r oocr AaixoLLnv^Voy/Ujr x>JxUool\lJiJir JL 
SAPHLGIRLDGRLRQEDPPSPGV*GCSE 
t /^QpT.r^rpawTTTrRnp v<? * lCFKTrnKTTtT 

JjUOXUJVvX RinJLAXLIU/r VO IvJ- IXJv v aV X X\. _L 

HTH*RXPTKGQDHQYHCIiPPPHVMPLKV 
FRGSNVHGAVRTRGS RGS I P ELS HAVPH 
AP 


11505 


25406 


A 


11605 


113 


l 


X onlnuVUl X XvLi X\lA9nnn n <Lrir V irnli 

WEAEVGRLL 


11506 


25407 


A 


11606 


301 


3 


WViW<^VKNIi^GHACGKFHYEGRGBCPA 
EHPSWVLLVl^SI^KKLFI^YFLFTFLL 
OPPSPRLT*PVFRFFFFRDRIMLCCPGW 
SAWPERVGRAGNSE 


1 1 C/V7 

11507 


254*Jo 




1 1 £(\n 
lloU/ 


hi 


All 


SCDFOjS FS LSL FFFFFRKGI FFLGPRG 
ENKGGNMKWTLGPGGKC^SRPSPPKGG 
GKRGGPPPGAMiDF*KKTGFPQGGRGGE 
KPPPPGGRAGGAPQRGGKPAG 


11508 


25409 


A 


11608 


111 


88 


RRTTSS WLARNCIQGPAGYHTGS *TPDL 
KGSACLSFPKCWDYRWRIX)Ii 


11509 


25410 


A 


11609 


200 


27 


VPJiKPGDSRDPDMQWYYI*ENRPARCGG 
SI^*S*HFGRKRKADHLRIjGVQDTRSSA 
p_ 


11510 


25411 


A 


11610 


17 


414 


TVWLVMQDPENEWIVSGWGGKEEPGRYF 
MCEARSSSLMDRVFFFF* KKTLFFPPGGG 
GGAKIWVNGIPPPQGSPLSVAYPLGVWE 
KKGPPIHPHKKRVPPV* PGGVLFSEPGN 
PPPGPPKGVGKRSSPPSPVL 


11511 


25412 


A 


11611 


296 


428 


RI YEiWRGTVAHACNPSTIiGGQGGW IT* 
GQSLKTSTGTVAKP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
corrcspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine . 
C=Cysteine> D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, D=Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
Q=Glutamtne, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11512 


25413 


A 


11612 


20 


362 


NFVRGVARGSRGGATRCEWVQLRPVGMA 
YGRSPSWIiL*VKPSAASHRPPLPRAADT 
PGTAPAPTPTPAPAAAPALTPSSGPGSA 
ALTLEEELQEAIRRAQRWGLTMLPRLAS 
NS 


1 1513 


25414 


A 


11613 


158 


2 


LMLSRTPLQQL*YTIjLELELPRLiLAPDL 
LSNGSSLKDLKWTHSNYRASKETK 


11514 


25415 


A 


11614 


3 


410 


RSEVYGLVLTTWYQVNRVPWTHKAARFT 
RVSRWARPRSKISESCYWLPQAHRTKS* 
PLVRTLSGLCSLLGFHPLLFSLLATDLV 
PLPS FALLLLLELF FLPGMPKLAP * S PT 
KRSFS FYFFRDRVLLCCPGWSQTP 


11515 


25416 


A 


11615 


189 


397 


YKVLFLVEKNGFFFLFFETKPCCVAPTG 
VQGPNFG* LQAPPPGLPLFSALAAPEVG 
ITNGTPPPPRFFF 


11516 


25417 


A 


11616 


158 


1 


KGKPTPKTFFCQKQI FFLVFFF*TBSHS 
VARAGVQWRDLDSLQAPPPGFTPR 


11517 


25418 


A 


11617 


319 


425 


KVWLYSQAV* KVWLYSQAVAHTYNPNTL 
GGQGWRIA*AYBFKTSL 


11518 


25419 


A 


11618 


94 


2 


CLT IS WLGTVAHVCNPSTLGG * GGH ITR 
SR 


11519 


25420 


A 


11619 


381 


9 


FFPLPLPLPTVSLFPRSPSDAEPKLDCT 
AAISAHCNLPA* FS CLSLPSACNCRRAP 
PRLTAS ASRGAG I ADGVS FTQCSMVPRL 
ECSGVISAHYNLHLPATSLGLPKCRDCS 
LCPDEIVDPRI 


11520 


25421 


A 


11620 


144 


431 


IVNVIVGKGEKGRXIPE*IiIRCWGWN*S 
WDRWVAEDHVLREPSENRRLARVRCLTP 
VSQHFGRRRRLDHEVRSLRPACATWRNS 
VSTYNTKSGWAC 


11521 


25422 


A 


11621 


174 


449 


OTSLPLLGGASQLGYLEVRDPLEEAVCP 
FSDLKPHAGRTTTLFKAIRQGHLSLQRF 
LLPFVWLCPAPRGGVYRGRQASLS*GGL 
HPVRASRP 


11522 


25423 


A 


11622 


316 


412 


GGPSAVAHACNPSALGGRdGRIS*GQEF 
KTSL 


11523 


25424 


A 


11623 


183 


2 


P KQVYLRNGLLSHS KCNKI HH INRLKRE 
KIQPGWAHAYNANTLGGQGRWIT*GQE 
FENE 


11524 


25425 


A 


11624 


355 


168 


DLLI VCYD PNAINL * SLLLRRLKWEDCL 
SPGG*GYSEPRSCHCSPAWAAEPDPVSK 
KDRNVEF I YCC 


11525 


25426 


A 


11625 


154 


3 


TADVLFLGQAGRGNFWMKKNCRPGTVA 
HACNPS TLGGRGG * ITRGLEFK 


11526 


25427 


A 


11626 


412 


3 


I FFPPPKKDKRTRGGQKKN I PPAPPRNF 
LGRGGNFPPPQKPFFPPLF* IFPRPP*C 
KGS I PP FLFPLCFQKKKTPS FFFLPNG S 
P PICEKI FIiFPKKGFLKFKKPRFLKKKG 
PFFFFFGNAVLLCQPGWSAVAQSQ 


11527 


25428 


A 


11627 


290 


423 


VFEYFFKFFKKNNSWPGVVAHTYNPSSL 
GGQGGW IT* SQVLRP P 


11528 


25429 


A 


11628. 


213 


3 


LSGIYSGNRRLVRYSKSVSVIHHINRLK 
KLKWEDHLRPADA 


11529 


25430 


A 


11629 


152 


2 


FQKNRDLEPSKAGWPGMVAHACNPSTLG 
ARGGQIT*GQEFETSMVNMVKP 


11530 


25431 


A 


11630 


422 


189 


P LSG FC P S T WGAKQRALS KKKKRS EAGH 
SGSCL*SQYLGRLKQKDHPSPGGRGFSE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN ■ 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A—Atanine 
C=Cysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=PbenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^Giutamine, R=Arginine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 














LLNCTPAWVAKQDPVPKKRLK 


11531 


25432 


A 


11631 


239 


424 


KEDNNRVFVIGPIiGGLNEIiAHDKAFKTQ 
NHHWLGVGAHACNPSTLGGQGRWTT*GQ 
EFTTSL 


11532 


25433 


A 


11632 


343 


401 


AREREHKSAACQ* GMERSVIKHCHVATE 
*EREQICRERQQERASEKERQQESERDSK 
RERARESERERESTRAQLVRKIVNQIS 


11533 


25434 


A 


11633 


125 


371 


FQGCAKICLTNTI1L1PYAHGYI1I WRHDGK 
GCHEKMTAKIIHSCKLSPDAI** SQLGVP 
L+THTHTHTHTHTHTHTFSEKKGLVP 


11534 


25435 


A 


11634 


235 


433 


AALTSFSGPFSFCSYLINTLGKNFPNHE 
ISTT*FTLQRL*KIIFWPGMVAHACNPS 
TFGGQGRWIT 


11535 


25436 


A 


11635 


93 


385 


FTLKSERRKHS VPQGKKQNKI KLLS PDC 
KLENKRFGWARWLTPVIPAL*EAKAGES 
PEGNGEREEATEPFGSIILGFFFCTQIA 
HLHRSDYEPNEDS 


11536 


25437 


A 


11636 


190 


398 


NTKKSQKAFFKTQGKPPPLDPKKKASKK 
TKFFLPKKISTRKFRTGPWRGTNFSLKK 
AKKNLGKTEKKIFP*RPPFPSISSLFWP 
FFFFIiFF* KDKVPLCHRGWSAWQSQLT 
ATSISQMY 


11537 


25438 


A 


11637 


190 


380 


TPKKWPKAFFKTKGKPSPFHPRKKALRK 
TKFFLPQKFP*NPLSPLIPALFWPFFFF 
FFF * KDKVPLCHRGWSAWQSQLTATS I 
SQMY 


11538 


25439 


A 


11638 


261 


91 


EKTLWAGHGGTCL* SQLLGRLRWENHLN 
LGGRG * RS RHCT PAWVTERD SVSNK* IN 
K 


11539 


25440 


A 


11639 


1 


334 


FRANRTVKDAHS IHGTNPQYLVEKI IRT 
RIYES KYWKEECFGLTAELWDKAMELR 
FVGGVYGGNIKPTPFLCLTLKMLQIQPE 
KDIIVEFIKNEDFK*VQCSLANIRGMY 


11540 


25441 


A 


11640 


193 


457 


DGILLCGPGWSAVAPSQLCSGTISAYCN 
IXLPGSSESPVSAP*YLGLQAO^HAPLM 
FVFLVYTGTCMLPLSAHHSRAANSSHFS 
YPLL 


11541 


25442 


A 


11641 


257 


373 


NTISCFRSLRPVAHTCNPSTLGG*DGQI 
T*AQEFETCLG 


11542 


25443 


A 


11642 


211 


367 


DTTSHPFEWL* FKKSRK*RVLDRIWGN* 
MLVHCWWEHKMVQLL* KTVWWLFK 


11543 


25444 


A 


11643 


112 


367 


LQIKTIRYHHILTRMAKIQNTDTNKCYQ 
ECGATGTLIHCWWEFKI IPVI PALWEAE 
VGGLLE VRSSRP AW PTRENP I FTKI IKN 
Y*KYKNTPYIIR*LQIKTIRYHHILTRM 
AKIQNTDTNKCYQECGATGTLIHCWWEF 
KI IPVIPALWEAEVGGLLEVRSSRPAWP 
TRENPIFTKIIKNY 


11544 


25445 


A 


11644 


310 


335 


YHLIAGSLGDI ERIiSMb* IGKAVCHNKY 
* *GCGEPGFLLHCWWEWKMVQSL* KTVC 
QILQT 


11545 


25446 


A 


11645 


143 


455 


KRKGPSLWLLGKSMGGIRFF*KKVFFSG 

rwriT Til PPOM TJ T Til IT TIT UDTM* T V T CITVtlD 

AKKKGGQRKAQFFFFFFPRSFSLAGCSG 
S ISAHCNLCLLGLSDSPTS 


11546 


25447 


A 


11646 


182 


458 


SRGGIRFF*KRVFFSVTKSIWFFFWPLL 
XPLGRVN* IKISFNPPAKKRGWERKAQF 
FFFFLPRS FSLAGCSGS ISAHCNLCLLG 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

alillllU aCIO 

residup nf 

peptide 

sequence 


Predict- 
. ed end 
nucle- 
otide 
location 
correspon 
Qing to 

laat allium 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
OGlycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
v^^vjiuLaiii ine, i% = A.rginine* o — oennc, 
T=Thrennine V=Vnlinp 

W=^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 

IJUCICUllUC IllaCI UUU 














LSDSPTS 


11547 


25448 


A 


1 1 647 


j I y 


j 


RSTFCYS PRGWRPSMWFTtRT iMVT?<;T .PRP Q 
koj> ixi or Auncir onn it uruiivr oijtr EXvo 

ILGPPLLGTPLRGSPPPPPPFEGKGFIY 
GLTFPFP * EPPCB I FFFFFLDRVLLYCP 
G WS AWQS AHGS LE LKDKVC I 


1 1 ZAQ 
X 1 JHO 




A. 


1 10*fo 


10D 


Z 


QDPV*TOTHTHTHTHTPLYAAAQRIN 


11549 


25450 


A 


11649 


290 


3 


SKQNTTHWATLKSYIYPSQFWRIjGSPSS 

P WaX .TV5PT .VP ZXT.PT .VPT?OD DQT.T7TT .TTaTTjJ 

EKTEERKREREK* DRERERNE IERNERE 
RERERETQRMY 


11550 


25451 


A 


11650 


220 


2 


TPTLPKGTGLTALDRI PGRDPGPRRAGR 
PF*AGVAAENCPDPAEGGSTRPLGAPGR 
a q ap i> * pfTPrcp arjrtD o 


11551 


25452 


A 


11651 


29 


387 


FLIFDVTIVIDLACHEPHSYKMAHLINV 
VC^TAPPTSYFPISFIX3PPYSLRI1NDI 
EIRPINNPTRTSQC^SERKSHTSLTLNQ 

1TT.RMTTT.QKTVnMQirATfTr , *I2Dr'T *fHtT»rr 
MjpLII -L JUo IVLKjitIo IVfvft. i. o w 15 Jr ±j 1 V 

SQAVNAK 


11552 


25453 


A 


11652 


267 


2 


KFFFPVFSTSKLNFYPGTPFPLFIiFSFF 
FFFFFFFMREGFAVTWPGAVAPGAVLA 
HCSLELVGSGKPP I S AS * GAGTTGMHYH 

TPMV 


11553 


25454 


A 


11653 


50 


411 


EAVAGGMEKSMMNLPKGPDTLCFDKDEF 
WMT7nPTivnwpvcnrT3xrDnriT.T?'PT.DriTaT3r* 

rlLN EtUJC Li V 17X11? VolJLKMKL-iljijt'ir. 1 iK t J[M r**J 

LYYKLIiKTAMVELINKDYADFVNIaSTDL 
VGMEQALNQLSVPL * QLREEVLS L * S FV 
IERISPSS 


1 1554 




A 








♦GQEF 


11555 


25456 


A 


11655 


356 


653 


SVSPCPFPGPASTLS FLVADFRRRGVDV 
SQVAWQSKGDTPSSCCIINNSNGNRTIV 
LHDT*GPRASPCYNPPISWLNQFPHSPP 
WASQLPPSVASVYQK 


1 1 J JO 






llOJO 




z 


TIIDBWKACAKVQTLSAPINSASRMQS 
IRHWYILKDSSARPAGKGAI IGFIKVG 
YKKLFVLDDREAHNEVEPLCILDFYIHE 
SVQRHGHGRELFQYMLQKERVEPHQLAI 
DRPSLV 


1 15S7 




A 


1 1 AS7 






n T NMfY?P WT ,Vn RP M * IfP KP n JrYWVUET 

E * VL* KKVRHSQLQKENVAQNAPVVQNA 
AYXDQPS PAHVGQQGLS KLPSRPGDQGV 
EPQNI^TLQGRSVIP^ATNTTLPHMLMS 
ORVIAPNPAOIjOGORG P PKHG I CRTTTP 
NMNPANV 


11558 


25459 


A 


11658 


342 


3 


ENQLKNGEHSKSQS AS S PPNDHNTSLAR 

ELKEHVVTQOCEAKVmKTTQEL^ 
RNITYLM* LKITTREPHTAITS IHSRMA 


11559 


25460 


A 


11659 


204 


17 


STCTPLPTA*SQSRPHPPPTPTPPAPWR 
SSNGPLSRGSSSRNSTSSWSIAXATGPW 
THRSEL 


11560 


25461 


A 


11660 


505 


514 


GS*GNHFNPDGASHGGPQDSDRTGPRGP 
CLPSPWGAQILRDWYPQEEMGTLGKQ 


11561 


25462 


A 


11661 


80 


374 


FLFDIYNRCYLIFINII*YLSSNFD*IF 
1 1 1 YLVAI YRIxLCLGL FRAPRQCKHPRP 
QFSFNKIGTKR*AWPGAMAHTCNPNTIiG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
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GLGAQIM*GQBFET 


11562 


25463 


A 


11662 


142 


437 


GQRVQIFFFFWKRFPFFAQGGTEGAQFR 
IjIKPSPSGLKEIPPPNPPRKGE*RDGPP 
FRGDFVFLRKNGVPPCGQGGSKPPAPGE 
PPPFTPPRGENNRG 


11563 


25464 


A 


11663 


120 


353 


QMiQVMLKHVNFFFFLFFLFWKQSFVPL 
PRLECNGATSVHCNLCLPG*RNFPASAQ 
EFRIAGSYDHAICPPASGKGE 


11564 


25465 


A 


11664 


1 


404 


NTWSDL I ISYFHPRPSAFFLRQGLALS P 
RLEYRGTITGHYSLDLWGSSDSPASISR 
VAGTTYRCM*FQLLGRLRRENHLSPRGR 
GCSDP*LCPYTPAWVTERDPVSKKKLRG 
GGENSLIiSSQTMY 


11565 


25466 


A 


11665 


493 


3 


NIWIPDVLPGHLARQ*LGSPGKLSAGF* 
PLQFPQTLSSARAPPFCIiPRLQVRNTVG 
FLPSPHTSRPSHCLPACPLPSRTQPWVP 
VKPGPTACRGFLQHPPTPGSPSP*SFQG 
SSWWDTDLRALECLLRILRGRNCSVFVT 
TQGQLSHEGMNKEFLLTCAKKKK j 


11566 


25467 


A 


11666 


3 


401 

* 


CMERAVTVIiLPGSATQSPPVYAPRALAR 
LWLTAAMMISGFIADYEA*SSRCSSACP 
AGDSLSYYHSPADSFSIMGSPVGAQDFC 
ADLGVSRANF I PTDTDMSTS PDLQWLVQ 
PALVYSVGPSETIAPHPLGVP 


11567 


25468 


A 


11667 


1 


446 


LSCAKPPQRPIiRHGI IKIFNVFEETRAN 
KHLLGVS KLKPRGFPALFLSVSS FGQKK. 
HVPQWLQHSHSHILVISLDFFFFFFFEK 
KFPFFPPGGRERDQTPFN*TPPRGG*RE 
SPPFPSRGGGTQGWAPHCP 1 1 FRFFQKK 
GVFPLRAR 


11568 


25469 


A 


11668 


18 


403 


DPLQRPQ YRGGIiRSVGCDLQS PTPRGPS 
GRWPNPAQARGPGEAQAWAWPGGGPREA 
FSGQGRPPPLGLHPHRRKGAGPPGPMGP 
* HEGQGS C* GAS CRGGLACTAFPAASS P 
PIFGGLTPPNLNFPEE 


11569 


25470 


A 


11669 


157 


1 


PRPQGFVFVFFFFEMQFGRLECSGAISA 
H* NLHFLGS SNS PASASHVYAPLY 


11570 


25471 


A 


11670 


396 


7 


E KVGQARDKS ACEGHS CADARS CLHLRV 
DQGGQHGSHGACPGRRIiRGPDTHRQSCY 
QCRCNR5 P DRLQ PGRAGTP * CAEAAAE P 
DAAVHPGQPAPARLPPALPGRGLLPLPH 
PGtjoyKV X C VRC XCvoaJjR 


11571 


25472 


A 


11671 


209 


3 


DDLGTHRPGKSQVRVTPREAGENPSAVL 
PCYLHLVPFV*EDEVKPEDSIPDMPGNE 
XAKJtSr J_iAiiAh'Jjx 


11572 


25473 


A 


11672 


270 


101 


liSLNRWILGAYI I FFETESHS VAQAGVR 

* UN lAjLilAjb Jjl^iAaoitKlr JjAt Ix. V AJf Lilf XCj 


11573 


25474 


A 


11673 


421 


592 


SSCTPGVI I KIPCVRSNHMKLPGPGMVA 
HTCNPSTLGGQGGRITWSE*FKTRLGNM 
A 


11574 


25475 


A 


11674 


176 


6 


LTFVYMPCFCALSSLIMTYGLAM*FHFH 
SMTLFI IGVLSVALAI YQ*WLDVSRVSV 


11575 


25476 


A 


11675 


1 


418 


NTVCV CV CV CVCV CVERKAGHRLTDS FP 
IQAGPGAQARTLCLRESTGGALS SGLPA 
SRYQDPGRGALHPHCVWTPRGWNWRLNY 
P PARS PS REAVCYFRQTLKKWRS PRGQG 
GRILRVIRTASTFRDTS * TESACLGARP 
C±^GK*VCESVPCFSLHTHTHTHTHTHT 
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C=Cysteine, D=Aspartic Acid, 
E=Glotamic Acid, F-Pheny (alanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K-Lysine, L=Leucine, M=Methionine, 
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HCI 


11576 


25477 


A 


11676 


305 


698 


SFMGRS PRKIDQFCNSRNMVHGSVTFRD 
VAI DFSQEEWECLQPDQRTLYRDVMLEN 
YSHLISLGSSISKPDVITLLEQBKBPWV 
WSKETSRWYPGK* E*SRQGEAI IVPDS 
PCAQRGVTPLRCGLENFLQ 


11577 


25478 


A 


11677 


107 


397 


GGDGRETLRAPADDGCI *QERAETPLAI 
S CPRS S STCRGRARRYDGCRTRDGHQL I 
QRGQQQESSRRTAQPGSSKLCRGPGGHR 
PSLGTVGVHGCSL 


11578 


25479 


A 


11678 


514 


554 


RAKP TPP * GGGEGQNFGLLKP P VSGLP P 
LACPTPPKRWKPKPPPPAPPHFLFF* *K 
RGLIF* PGEVFNLRNWKLPPSPPQRGGN 
KGRNQPPPRGTFMF 


11579 


25480 


A 


11679 


266 


3 


GPPLFIiGLKN*TFFSPTKV* IFFSQVFS 
SFPQGLKGGVRE I FFFFFF* DRLECNGT 
I S AHCNLRL PGS S DS QAS AGTTGVCHYT 
RMY 


11580 


25481 


A 


11680 


1 i 


408 


NTCAJjRHRDHFS FWSDGKTKEGEERI VQ 
ENWLI SWNNFS VPETLLTLSCPLGKEGM 
PGEDGTAGAGKVPGEDKI PG* DGTAGED 
GTEDENGTTGEDETAGVGKTPGAGGTTG 
EDDTESEDGTTGEDETAGKGGTAG 


11581 


25482 


A 


11681 


244 


420 


KDSESTKAYIRDNISSTKKKKSWPGAQA 
HPCNPSTI*EGKGG*IT*SQEFETSLANM 
VKL 


11582 


25483 


A 


11682 


294 


443 


QMNTCTHIQPAYVH*EKYFRPSAMAHAL 
NPSTLGGRGGWIT* SQE FQTRP 


11583 


25484 


A 


11683 


346 


1 


YNTNQFTLRGTQAS VYTCLTAL I VLKLL 
NQPYTLSAVLLNAGNQPFHAFPSPPSLH 
PDAFHSKIMSHII ILLLGFIPFLPASQS 
LTLS PSLERSGTIS AQCNLCLPGSSNS P 
ISA 


11584 


25485 


A 


11684 


140 


423 


SSHQASSPPTTASHSMKISVAAIPFFLL 
ITITLGTKTESSSRGPYHPSECCFTYTT 
YKIPAQRIMDYYETNSQCFKPGIVFITK 
RGHSRWTNPK 


11585 


25486 


A 


11685 


163 


3 


IKI PCNRLLKFGPPSCCGFFFPIiLPPFF 
FFETLSRSVAQAGLQWRDLGSLIAL 


11586 


25487 


A 


11686 


179 


1 


P I VFLGMLFVAFAFRGPFBS VTKPPNF F 

v?x?i?T?ijn?Qr , cxrRf\fir*\Tr\iJTiT\T pot nnnnn 
r r r IS Mn, V AyAJb V yWKJJlAybjjyAKFP 

RST 


1 1 JO/ 


Zj4oo 


A 


f t cot 

11687 


192 


1 


tT•/'V'" , I7TV3IJT 7\ OCiITOT XTTW 7V X/TilZTKJr+r\ DT r>0 

tt\j\j r LKiHLiAii W 1 SJjiuTTAKW JKJNGDEIjCS 
YYQSCCTDYTAECKPQVTRGDVFTMPBD 

oil VI UU\J 


11588 


25489 


A 


11688 


89 


3 


SGGYCCCLCCRCSRYSCCCCRLREGPTK 


11589 


25490 


A 


11689 


168 


2 


QDRLQ PHESKRKLGPAPQLRRSESDTPS 
WNFQSTESQLMSKGDEDTKDDSKETV 


11590 


25491 


A 


11690 


301 


397 


HFVFSLVTGMNPLSPYLNVDPRYIiVQVR 
LRFY 


11591 


25492 


A 


11691 


18 


421 


TKALQITCYLHSTMSEESDGKLIEDSLI 
QLRCHFTWKLLIEAPEIPDLENRIWEEI 
y r JjD 1 Jv I JN ± nn LtunX V KribJUjQNcLHA 
LVSLKKAEDL I QKEHANQADI RSLETWG 
NFAWVYYHMGRLAEAQTYLDKV 


11592 


25493 


A 


11692 


188 


3 


PU3GKGFGAKERGQICKGFLRGGVRASRP 
RPPFGPSLFFFFFFETDSHSVAQDGVQW 
CDLGS 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
E=GIutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *= ! Stop codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


11593 


25494 


A 


11693 


24 


391 


APRADAMGHFTE EDMAT I TSLWGQGNVE 
DAGGETLGRLIiDVYPWTQRFFDS FGNLS 
SASAIMGNPKVKAHGKKGLTSLGDAIQH 
LDDLKCpTFSQLSELHCZDKIiHVDPENFKP 
LGNGLQTALQ 


11594 


25495 


A 


11694 


138 


425 


NSGVKAAFQLNPGNPHLEGKPGPGFKTH 
PGQLGETHFLLKIHYI^GLVGKPWESQL 
LGNLNRENHWTPERGGPRDLRHCHGIQP 
WATNGNFVLKKK 


11595 


25496 


A 


11695 


158 


838 


CGTVHSCDAGQRQRATAPSHPCDHGNQQ 
PILYRVLCCQLFWILCFVFSHSIMSTKC 
PS VS PVSGEMKKRKAITLBMKLKI IAQH 
EGGKPVMAIAREDGLWQSTISTIIiRDKK 
QISDAAKSSASVKSTVITKKRAGPIDDM 
EKLLVMWMEDQIQKRIPLSLUfllQAKAR 
SLFNMLKDRASDPTYTQMFKASHGWFQR 
FKKRHNFHNVKITGBAARAGNEGAIAFK 
EQL 


11596 


25497 


A 


11696 


834 


1431 


SSACQGGSQGWPPRQPWWMWGWSRRAPS 
VTHRTGERCSTSGSAPIiPLIiLSSIiCSSIi 
PRSWEHSTLRQPPHPPLLPPGCLPPGRE 
AQRLSGAGHGPGPRPAAASYIPDTQADR 
KQQQHHPGPDQHVGHREGLALEDAATHL 
GIiVAALSCWLLGRAEAGYQVPHGHDHPK 
DQHPQADGGQRIVRAIGLGLGHHVSGRR 
ARP 


11597 


25498 


A 


11697 


143 


1 


ISKERGASRFSGPWVFFFFFLESRSVAQ 
AGVQWCNLGSLQAPPPGFT 


11598 


25499 


A 


11698 


58 


459 


KGKEEKVKRKEAEQNFSP YAQDKQERI K 
GNENDEKTKQGKETIIDIELFKGLDETG 
ENMDSTLTRTPFEPLENNKQILVLGLDG 
AGKTS VLHS LASNRVQHS VAPTLGFHAV 
CINTEYSHMEFLE IGGSKPFRS 


11599 


25500 


A 


11699 


215 


1 


GTKKAIiWAGGGGFFPYLPPMGPLGPICG 
GVGKGS PGLGCP I FFFF FFFETES CSV A 
RLDAQ WPDLGS LQS P 


11600 


25501 


A 


11700 


300 


420 


KPKI LFGNVFAAPHMENLKCRGETVAKE 
ISEAMKVKAMC 


11601 


25502 


A 


11701 


351 


466 


R I KNADLSQAQWFTPVI PALWEAKVGRS 
LEVRSSRPVW 


11602 


25503 


A 


11702 


172 


400 


SNRLRNKIQGVFLNDSS ISPF I LRKQS I 
GQAWWLTPVI PALWKAEASGS PEMDSTT 
LLPSSSQVPSLVKMEKLNYS 


11603 


25504 


A 


11703 


284 


408 


ASVFSSFFFVCLFVFEMESRSFAQAGVL 
WRDLGSLQAPPPG 


11604- 


25505 


A 


11704 


20 


447 


LPGADYGGGHLSIiRLFHTiTJiASAAWVPD 
ESQVTLNSMCVLSTVLIMEFPDDGKHC 
SEKTCKQLDFLPVKCDACKQDFCKDHFP 
YAAHKCP FAFQKDVHVPVCPLCNTP I P V 
KKGQIPDWVGDHIDRDCDSHPGKKKEK 
IEA 


11605 


25506 


A 


11705 


1 


455 


HSCSLQFTPTAWDCTGSVSSEQGERPAA 

KNGLHSKVRSEQKDTLGNTQI KQRSRPG 
NKGKFVTKDQANCRWAATBQEEGISLKD 
ECTQIiDHEFSCVFAGNPTSCPKAQDERV 
YWKQVARKLiRS 


11606 


25507 


A 


11706 




428 


DAEADKMAAAAVRGGRSGGSGGCSGAGG 
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Amino acid sequence (A= A la nine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leu cine, 
K=Lysine, LHLeucine, M=Methionine, 
N==Asparagine, P=*ProIine, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on , /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














ASNCGTGSGRSGLLDKWKIDDKPVKIDK 
WIXJSAVKNSLDDSAKKyLLEKYKYVENF 
GLIDGRLTICTISCFFAIVALIWDYMHP 
FPESKPVLALCVISYFVMMGILTIYTSY 
RE 


11607 


25508 


A 


11707 


1 


422 


RSQARSSAAAAARASVPLRGSPGPSAIM 
PMFXVNTNVPRASVPDGFLSELTQQLAQ 
ATGKPPQYIAEHWPDQLMAFGGSSEPS 
ALCSLHS IGKIGGAQNRS YS KLLSGLLA 
ERIiRISPDRVYINYYDNINAANVGWNNST 


11608 


25509 


A 


11709 


88 


423 


AADAMKQAEEAMGAFMRIiAYQELQIDRL 
KEDKMLLNLEGNKREHAERLGMGLVSRS 
S VSHS VLS EMLVI EHETPVSAKS S RSQL 
DLFDDVGTFCSGPSKYTDNPFSLWESIG 


11609 


25510 


A 


11710 


393 


130 


NPSATAQSFHLAARKLSTLDRPGSQAPQ 
KPMPSPKPGWLPASTNAVPTPLAEATP 
SKAHPAI SLLSTEEGI FKAVPSPAS S CS 
FLHV 


11610 


25511 


A 


11711 


331 


440 


KIFFLNFLISRAWWCAPLVLATWEAEVG 
ESLYPRRS 


11611 


25512 


A 


11712 


122 


478 


SGLCPQQPPRANSCPPSSMASCAEPSEP 
SAPLPAGVPPLEDFEVLDGVEDAQGEEE 
EDEEEEEEDDLSBIiPPLEDMGQPPAEEA 
EQPGALAREFLAAMEPEPGPSPVPKEWL 
DILGNGL 


11612 


25513 


A 


11713 


102 


2 


TNIiGNPRRPPPFFFFETVSLIiAQAGVQW 
CDLGS 


11613 


25514 


A 


11714 


126 


2 


FFAPFLKI FFFFFFSEMESCSVIiQAGVQ 
WHDLGS LQNEGSQ 


11614 


25515 


A 


11715 


178 


449 


LSGGNLFGI I ILLYCAI IGGKLLGLIKL 
PTIiPPIiPSLLOILLAGILITNIPVINDN 
VQIMHRWSYSLRSIALAIILVRAGLVLB 
SKALEK 


11615 


25516 


A 


11716 


48 


417 


GSGGNHS VCCDTMEGGGGSGD KTTGVLA 
GFFGAGEAGYSHADLAGVPLTGMNPLS P 
YLNVDPRYLVQDTDE F I LPTGANKTWGR 
FELAFFTIGGCCMTGAAFGAMNGLRLGL 
KETQNMAWSKP 


11616 


25517 


A 


11717 


103 


2 


PKSPTQWLMAVIPALWEAKAGGSRESRS 
SRPAL 


11617 


25518 


A 


11718 


1 


413 


WPSGQVLVGCLSFSLYCWKRSLKRKNPG 
FEVLLKI FLKNHPRCHTNRMIQLTATPV 
SALADEPAHIRATGLI PFQMVSFQASLE 
DENGDMFYSQAHYRANEFAEADLNHAAS 
LGGDYLGDLALRLLCTLEPVFPQSY 


11618 


25519 


A 


11720 


116 


462 


AGMLPAVGSVDEEEDPAEEDCPELVPIE 
TTQSEEEEKSGLGAKI PVTI ITGYLGAG 
KTTLLNYILTEQHSKRVAVILNESGEGS 
ALEKSIiAVSQGGELYEEWLELRNGCLCC 
SVK 


11619 


25520 


A 


11721 


167 


407 


EIYSLTRFIEVKMSKKISGGTWEMLGD 
EMTRIIRETLKEKLIFPYVESHLHSYDL 
GIENRDATNDQATKDALEAFNKPY 


11620 


25521 


A 


11722 


254 


423 


NQLSS IMAMFKKI KS FEWFNDPEKVYG 
SGEKVAGRVI VEVCE VTR7KAVR ILACG 


11621 


25522 


A 


11723 


3 


424 


VSCDTMEGGGGSGDKTTGGLAGFFGAGG 
AGYSHADLAGVPLTGMNPLCPYLNVDPR 
YLEQDTDE F I LPTGANKTRGKFELALFT 
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OCysteine, D=Aspartic Add, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, MNHistidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Argrame, S=Senne, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














IGRCCMTGAAFGAMNGLRVGLKETQNMA 
WSKPRNVPI IiDMVTRQGALWANTDGAIiA 


11622 


25523 


A 


11724 


2 


343 


AFGTMKWVTLVS VLFLFS SAYSRGVFRR 
DAHKS EVAHRFKDLGEENFKALVMIAFA 
QDLQQCPFEDEAFSTSEVPEFATTCQDD 
DHFEDRCRRVHTLVVVQLCTPATLLETD 
Y 


11623 


25524 


A 


11725 


1 


359 


HAFGTMKWVTVI SLVFLFNS AYSRGVFR 
RDAHKSEVAHRFKDLGEENFKAL.VLIAF 
. AHYLHQCPFEDHVKLVNDVTEFAICrCDlA 
DESAENCIDiCSLHTLFGDKLCTVATLQET 
YGK4ADC 


11624 


25525 

- 


A 


11726 


1 


349 


GAMSDRKFSAPRHGSLGFLPRKRSIRHR 
GKAKSFPKDDPSKPGHLTGFLGYKAGMT 
HIVREVYRPGSKANTNEVAEAVTIVETP 
PMEVADIAGYMETPRGLRTFNTAFAEHM 
SDEC 


11625 


25526 


A 


11727 


81 


349 


TKGb V VSCU VULC VCLC"! WaHLURiiVTW 

LPDMPDDVLWLQWVTSQVFTRVLMCLLP 
ASRSQMPVSSQQASPCTPEQDWPCWTPC 
SPEGC 


11626 


25527 


A 


11728 


264 


388 


QADPKDIMKFPGPLENQRLSFLLEKAIT 
REAQMWKVNVRKM 


11627 


25528 


A 


11729 


2 


471 


PGCSASWSKRGSGPDMLSSMAAAGSVKA 
ALQVAEVIiKAIVSCCVGPEGRQVIiCTKP 
TGEVLLSRNGGRIiLEALHLEHP IARM IV 
DCTSSHIiKKTGDGAKTFIIFLCHLLRGL 
HAITDREKDPLMCENIQTHGRHWKNCSR 
WKFISQALLTFQTQIL 


11628 


25529 


A 


11730 


160 


377 


LQGFGRPSVYHAAIVIFLEFFAWGLLTT 
PMLTVS IASLGLCFVRERDKFLGTYHCV 
CLDTCLGVALDSDLKQ 


11629 


25530 


A 


11731 


45 


438 


KLGQRKEATVTKBS CKSCSRKVGS PDRF 
RSPQKRSKGRQDCFTLFCFWKVLDKNME 
LISPTVIIILGCLALFLLLQRKNIiRRPP 
CI KG W I PW IGVGFE FGKAPLEF I EKARI 
KVCGRGRRGLQRRQCFLF 


11630 


25531 


A 


11732 


169 


292 


DS ILLVNIiACSAVI S AHCNLRLPGSSDS 
PASASRAAGGAHL 


11631 


25532 


A 


11733 


473 


600 


KJ?' WEKKTiLURGKb Ki> KN S £ L\jK 1 VS ISA 
LSTTEVAMHTSTSS 


11632 


25533 


A 


11734 


19 


349 


DAGGETLGRLLGDYPWTQRFFDS FGNLT 
SASAIMGCTPKVKAHGTKVLTSLGDAIKH 
LDDLKGTFAQLSELHCDKLLVDPENF 


11633 


25534 


A 


11735 


234 


359 


FYHLKSFTVSS VQSRWLTFVT PALWEAE 
AGGSPKVKS SRPAS 


. 11634 


25535 


A 


11736 


15 


372 


KLPLKALTGEEKTHINIDIIGHVHSVKS 
TTTGHLIYRSRGIDERTIEIFEKEAAEM 
GKGSFKYACILDKLKAERERGITIDISM 
RKFETSKYYVTIIDAPGHRDFIKDMTTG 
TSHADCA 


11635 


25536 


A 


f l fit 

11737 


1 


357 


PTNAAEDQKLKLERIiMKSPDIiAVTI PEK 
MSEWSPGPPPEFDRDVMGSRAGAASGEF 
HVYRLLRRREYQRQDYflDAMAEKRIIiDA 
EFQRRLE 


11636 


25537 


A 


11738 


24 


335 


APNANAMGHYTEED RATI TS P WG KGNVE 
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NO: of 
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M 

eth 

od 
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beginning 

nucleotide 

location 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=lso leu cine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P^Prolme, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














DAGGKTLGRLLDVYPWTHRFFDRFGNLS 
SDSAI MGNPTDKAHGQKVLTS LGDATKH 
LDDLKGTFAQLS ELHCDKLH 


11637 


25538 


A 


11739 


141 


335 


MQFLPCIPILKSLKKSVASHSQTVHSDI 
ISTVEFNHTGEUjSTGDKGGRWIFQRE 
QESKNQVHR 


11638 


25539 


A 


11740 


7 


337 


APS PDAMGHFAEEDKAT I TS L WCKVNVE 
DAGGETLGRLLAVYPWTQRFFDSFGNLT 
SDSAI MGNRKVNAHGTKVLTS LGDAI KH 
LDDLKGTFAQLTELHCDKLHVDPENF 


11639 


25540 


A 


11741 


182 


360 


SRHS ISPPVNQ IQLGASVTEE LTWTKT 
ARVSRAQWLMP VI P ALWEAEAGES P EVR 
NSR 


11640 


25541 


A 


11742 


174 


1 


HFSDHSLFVCVRQDLTLLPRLECSSLIT 
AHCSLHLIiGSGDPPTAASWVSGNTGVHY 
HA 


11641 


25542 


A 


11743 


75 


218 


KI I LGRAQWLMTV I PALWEAKVGRS P VS 
ASQSAWDYRRBPPCPSCTI 


11642 


25543 


A 


11744 


168 


2 


LAPLWSLGPVLGGVGQGSPGPGFPFFFF 
FFFFETESCSVARLECSGPILAHCSLR 


11643 


25544 


A 


11745 


83 


2 


REQRFLLPFVQLCPAPRGGVYRGRQAS 


11644 


25545 


A 


11746 


156 


3 


FHSGSGRVENPSFFFSFFETGSCSVTQD 
GECTGATLAHCDLCFLGSSNSS 


11645 


25546 


A 


11747 


244 


330 


KDRAQWVTPVI PALWAAKAGRSLEVRSS 
R 


11646 


25547 


A 


11748 


201 


1 


TSQPLIRITLSTFFFFSRDGGLTMLPRL 
DPELPGS SNP PS CS ASRVDGTTGMCHHT 
RLIFFSQTDKK 


11647- 


25548 

- 


A 


11749 


262 


3 


VYTSLTPPPISVNLLTQVSPRSVSYEQN 
ILCLLFVTYFTCTFFVCLFVCLFVCFFK 
MESCSVAQAGVRWRDLGLLQAPPPGFTY 
TT 


11648 


25549 


A 


11750 


225 


3 


PIINFSVPQFLHLYNGIIIESTPPSCCI 
LKCQSLGQAQWFTPVITTLWEAEAGRSIj 
EAWTLKTILANMAKPHL 


11649 


25550 


A 


11751 


185 


2 


VSTPFNSFPSPPWDLVFLKGFFFFFFFF 
FFETEFCSSPRLERNGAPLAHCNIiRLPR 
SSNSP 


1 1 £<n 




A 


11752 


261 


336 


UK VbJjWS PRLECNGAJSSHCNIjRIjP 


11651 


25552 


A 


11753 


282 


381 


TDIiIiY KKKCXjLGAVAHAYNP SALGGQGG 
RITCS 


11652 


25553 


A 


11754 


202 


45 


NPRKVKIiQWALILPLPFNLGGQIiKSRFQ 
KKKKKKKKKKKEEKRKKKTIiVVI C 


11653 


25554 


A 


11755 


293 


357 


LTPVI PALWEAE VGGS PE VRS 


11654 


25555 


A 


11756 


264 


1 


TLPINWCCDHSTGPSLISIjPLIiSLPYS 
LRHNNIEIRLINNPTTACKYSCLKKSHK 
SLTLDQKLEMIKLGEEGLLKAQIGQKLG 
LLHQ 


11655 


25556 


A 


11757 


126 


1 


ILPGFLKELKIELPFNPAILPLGIYPKE 
KKLLYQKDTCTHMF 


11656 


25557 


A 


11758 


399 


226 


IEKGTGRVWWLTPMIPALSEAKAGGSPE 
vtColrKi'AUXVaijbJxJ? W I iKLjSFUHFAKlto 
EI 


11657 


25558 


A 


11759 


235 


380 


mTVLKLGQLITLQWASKCSSERKSRTS 
LTLTQKLEMVKLSEEGMSKGD 


11658 


25559 


A 


11760 


249 


357 


NRASGQTWCPTP IMPALWEVEAGGSLE P 
RSLRLTWA 
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M 
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nucleotide 
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ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alauine 
C=Cysteine, D=Aspartic Acid, 
E=GIutaraic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leudne, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


11659 


25560 


A 


11761 


168 


372 


SKCSGTVTTHCSLKLLCSSDPATSDSRV 
AETTGTGYHAGL 


11660 


25561 


A 


11762 


303 


377 


QFAGCRWI^PvXPAbWllAiUUATOlrr. * 


11661 


25562 j 


A 


11763 


135 


1 


ATTPGLFFIFYFFETESHS IAQAGVQCM 
I SVHCNIXILPGSSDSFV 


11662 


25563 


A 


11764 


219 j 


2 


KLRSSQIiSEPEVRPDSTLSMLMHARAHT 
HTHTHPHLFLPHPCSRPHTSTPTCMPIV 
THSPI YXiKFHSHTYIIj 


11663 


25564 


A 


11765 


102 


1 


NRP INNPTMASMCS SERKSHKSLTLNQK 
LEMIKL 


11664 


25565 


A 


11766 


245 


3 


PGEPEEDQLVKfKPSLLWV FMFLfc'tliRR 
SLALSPRLECSGVECSGVISAHCNLCLT 
GFKQFSCLSLIiSGWDYRHVPPCPS 


11665 


25566 


A 


11767 


350 


97 


GSVPAS PQVTRPTiiMTSLADKWFSLCLT 
SIiRVNFGVALI LSGSHS VSQDTLDLLTL 
FSTHLGLPKCWDYKRVPPCSAQt JjVTFIj 


11666 


25567 


A 


11768 


188 


391 


LGSVAGDLLCFFGGVTFPCSFMFPMSLC 
YYLY I WYNS CV FLjFFKTE PH FVTHSGVH 
WCDGliliUr P P PJN 


11667 


25568 


A 


11769 


131 


2 


YTLYIPNKSPIENIQLSWAQWLVPVIPA 
LCGAK IvjKJjl»iiPKi>i> 


11668 


25569 


A 


11770 


165 


1 


VLIPLIHLCIYIYIYTYICVYIYTRIYT 

ITT r* T IPPUTTV TtJT VXTVTtTFVT VTVMVTV 

Hii-.JLrilriV i XriX a V X-Lrl± nil xni x x 


11669 


25570 


A 


11771 


72 


1 


ILLFFFFFEAESCSVAQAGMQWCN 


11670 


25571 


A 


11772 


121 


3 


TSFVLFCFETEFCSCCCPGLECNGAILA 
HCNLRIiPGSSD 


11671 


25572 


A 


11773 


152 


3 


HVCLNLTLPFLFEKNIFSLCVCVCVCVC 
VCWLCKKKKCEKDFYFERTF 


11672 


25573 


A 


11774 


190 


2 


GFSPRQRGAPRVPPPLAGPFPP I FFFFF 
FFEMWSHSVAPAGVQWCNHSSLAALTSP 
DSGDPP 


11673 


25574 


A 


11775 


287 


1 


GAHRRKLFITPGESLiRDKQIAGrEHRGG 
EKKKTLFYKKKKKRKKKRKEKKGKEKKM 
LPKCNYTMKNHQVPxS IKrlKEuAUiiKV 
FYFIKFQIIiKY 


11674 


25575 


A 


11776 


121 


3 


KCASRDIiSKFFFFFFLETESRSVAQAGV 

/~vr,T/-TT^T POT 07V T 

QWCDJjGi?i»UAJj 


11675 


25576 


A 


11777 


142 


1 


EKTIiHVRNTIHNSEGSLVIKIHGRLGVV 


11676 


25577 


A 


11778 


179 


3 


SHQPVPGTIiDLPRGPQKLQSTSEAESEA 
SMSEASSEDLVPPIiEAGAAPYREEEEAA 


11677 


25578 


A 


11779 


147 


6 


KTPGLKKNSNFFFFFFFFETRSHSIARA 
GVRGCDLSSLQPPPLGLK 


11678 


25579 


A 


11780 


184 


3 


GFGFYIIPNYRLFFFSuLIGPrr Flilrr 
ETEPCSYARLECSGVISAHCNIRIiLGSC 
DSPA 


11679 


25580 


A 


11781 


120 


319 


VKSIiAKFLVNYQLHRQTCSGQAGRGALR 
QRFGRPRQVDHLRPGVKJjy PGyHtab i P£> 
LLEVRKLSSG 


11680 


25581 


A 


11782 


91 


5 


YICLSLI Y THTHTHTHTHTHTHI YIYVC 
V 


11681 


25582 


A 


11783 


221 


332 


S RLGMVDHPRSGVRDHPGQHGETP S LLK 
IQKLARRSG 


11682 


25583 


A 


11784 


243 


379 


LKYSVP I KNOKPWPGTWDHTRNPSTLGG 
QGGW I TLDQKFBTS LAN 
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SEQID 
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M 
eth 
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NO: in 
USSN 
09/515,1 
26 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
£=GIutamic Acid, ^Phenylalanine, 
G=G!ycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, ^Methionine, 
N=Asparagine, P=Pro!ine, 
V^Vjiutamine, K— Arginine, S=Senne, 
Threonine, V=Valine, 
W=Tryptopnan, Y=Tyrosine, 
X=Un known, *==Stop codon, /^possible 
nucleotide deletion, \~possible 
nucleotide insertion 


1 1AQ1 

1 iooJ 


zjjo4 


A 


11785 


327 


5 


txKV JJt>y P 1 1 PKJNPPG3FPKIY PGS FKTL 
FPYLQSRHS FSI PRS KLARFHQAKPLFC 
SKPSQSLS PHSELKKKBLSQARWLTPVT 
RALWES KAGRS PEVRSSRPTWPT 


1 1 6QA 




A 


ll 786 


245 


359 


FKCJjyLDRLRSGVRDQ PGQHS KTP S VLK I Q 
KLAGHGGMRL 


\ looO 


25556 


A 


ll 787 


356 


67 


RDI AS I TARLR YDRG VG I TRLEVLS KS D 
EYTKGSVIjDFSRETEPIGYIYIYIYrYI 
Y 1 1 JL Y liHS LiPJbNHMK I SIaSHPPLSNNKF 
VQNFDSPKLSYH 


i looo 


A J Jo / 


A 


11788 


103 


3 


KYIIPVCIFFFFETESCSVTQSGVQWCE 
LRRRG 


1 100 / 




a 
A 


11789 


136 


1 


VHQrTIRl IjNYYYYYFFETES YSVTQAG 
VRWCNLGS LQPLPPGFK 


11688 


25589 


A 


11790 


291 


163 


SLU^LPRLECNSTISAHRNPRLLdSSDS 
PASGRSLEPRSSKLQ 


1 1 £QG 


25590 


A 


11791 


107 


2 


I FMYVFFETESRS YAQAGVQWRYLGSVQ 
APPSGFS 


11690 


25591 


A 


11792 


329 


3 


KNPRNIIGKSRGPYITNFPGDKGPKNSS 
PRGNDLS I NKNWAQAFQPRGQKKLFFPK 
KKKKKETQSRS I PRLECNGPT IAQCNLK 
LLASSNPPTSASHSAAIISMSHSG 


11691 


25592 


A 


11793 


213 


3 


ISPWLFLLLQATLFRSQYPGPFPPFSLS 
PYLFPFKQKLGSGFFFFFLFFFKTESRF 
LARLECSGI ITAQ 


11692 


25593 


A 


11794 


1 


400 


KRAAPQPAPEQRDLKKKKKKPPLAPSSL 
FRFWEKLLPSFQFRQPRGPFLGTRGVS F 
AFHRRRFWQ YGNMGEKQTWGNPGS S SNP 
PPGTAGSLSGNRGPGWGGKIADRFWLYA 
RDSFGHSRGLKGSCCQAHLGK 


11693 


25594 


A 


11795 


3 


769 


RKEQTRNARAEVLRQAKANFEKEERRKE 
LKRLRGEOTWMLPDVNERIBQFSQEHSV 
KKKKKKDICHSKKAKKEKKKKSKKQKYEK 
NNESSDSSSSSEDEWVEAVPSQTPDKEK 
AWKVKDEKSGKDDTQ 1 1 KRDEWMTVDFM 
SVKTVSSSSLKAEKETMRKIEQEKNQAL 
EQSME I FQSKLEDAEKAASTKEDYRRER 
WRKPTYSDKAQNCQESRESDLVKYGFCS 
RDRYATTDTAKNSNNEKFIGDEKDKRPG 


11694 


25595 


A 


11796 


no • ; 


13 


HTHTHTHTHKHTHTHTHTHKDRSGKIKC 
VPTV 


11695 


25596 


A 


11797 


184 


2 


SEIFGKFHVFYILTVHFMPPTQVYVERD 
VLiGQr#QIjIjTP VI P AL WEAKAGRS PEVRS 
SRPA 


i ioyo 




a 

A 


1 1798 


83 


'JOT 

387 


GfcKi<KKRRRRJjWAPIjQKKKJCKKKKK^ 
KKKKKKKKKKKGGAPLKKPPGGPHFSGG 
Ky J\WI PPljM^liUjUiAPAGDr l^PGRGK 
IARGGFFEKNLSWGGEK 














i ivy / 




A 




i <i 


*> 
Z 


IPWJiLri 1 V loGYFQI? IjFSHSFFYFYFFE 
TGSCSLTQTGVQWCNHGSLQPR 








1 1 8AA 
1 loUU 


yj 


3 


V WoKAW W li lbv IdAljW&AtvVCxBO P£t VRS 

s 


11699 


25600 


A 


11801 


115 




NPLFFFFFFFEMESCSVAQAGVQWCNLG 
SLQPLPPRFK 


11700 


25601 


A 


11802 


295 


137 


RCFIFILFINKLYFVYLFIYFFEMESHT 
VAQAGVQWRDLGSLQAPPPNKLYF 


11701 


25602 


A 


11803 


287 


1 


MGPHAPFKGTCGGFQDFRFKNLEISAGL 
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peptide 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Pheny lalanine, 
G=G!ycine, H=Histidine, I=Iso)eucine, 
K=Lysine, L=Leucine,lV1=Methionine, 
N=Asparagine, P=Proline, 

C\ — Clin Hm inp R— /Vrcnnine S=Serine 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 

V=lTnknnwn *=Ston codon. /=nossibIe 

nucleotide deletion, V=possible 
nucleotide insertion 














THSPLLAQNFRVGAVPSPRYSPRFGSKG 
GGGFFFFFFFFETESHSVAQQAGVQWCD 
LGSLQALPPGF 


11702 


25603 


A 


11804 


202 


3 


WSVCCFKQGLGLS PRLECNGAPGFKQSF 
PT JQT.T »<?NWC3YRGPP PRLNFFFFF FEMES 
CSLAQAGVQR 


11703 


25604 


A 


11805 


3 


315 


PMTiT. T T iTT .FT.TiRNDRTjVOCDVRS SWCL 
GFLLGWSVILYPLMAAFMPTMWIFLFIL 
FSMLFFYVFFLFLLFFLLFLYS FPLFCF 
FFCFYLFRFFFFFLI FFSSP 


11704 


25605 


A 

A 


1 1 Of\C 


I/O 


i 

D 


KRGFFFFFFFFFFETESRSFAQAGVQWC 
DLGSLQALPPGV 


11705 


25606 


A 


11807 


130 


2 


QSIKMTCSSLYFFIKKNVAWWLTPWPA 
LWEAGAGGSHEPKS 


11706 


25607 


A 


11808 


149 


1 


GGKKAILFFFFSLS PRLECSGAI PTHCK 
T,C!L,PGS RHS PASAFR WGTAG 


11707 


25608 


A 


11809 


258 


3 


KYYYQGNRQIKFVNAKSNKNLPSYSQKA 
S WFS SRNI RMMGQHQPMHLDNPTS FI KK 
IiGQAWWLRPVTPALWEAEAGGSLESRSL 


. 1 1708 


25609 


A 


1 1 Of A 


\jj 




CAS P I RSHQKLNKNGWVQWLMPVT PVP W 
EAETGGSLQLRSSRLA 


11709 


25610 


A 


11811 


160 


3 


NHLFLT.T.NRYKS I FIGRVQWLMPVSSAF 
WEAKAGRS FEPRSSRPAGQHGKTL 


11710 


25611 


A 


11812 


89 


2 


AFFFFFETVSCSVAQGGVQLCHLHSLQL 
N 


11711 


25612 


A 


11813 


254 


336 


LLGQPQWLTPVIPALWEAEAGRPSEVR 


11712 


25613 


A 


11814 


108 


2 


CVGVLGFFVCLFFETESCSVAQAGMQWH 
DLTiGSLH 

xj \ ii >vjtj iiin 


11713 


25614 


A 


11815 


166 


289 


SHSGWNAVWRDGSLQPLTPGFKRFPCLR 
VPSSWDHKCAPPC 


11714 


25615 


A 


1 1 Of c 

1 Jolo 


ivz 




DKDPGKVTRPFIiLLCI FYLF I CLFLRRT 
IAVSPRLEFSGRGCSEPRSHRCTPSWVT 
EQDSCRK 


11715 


25616 


A 


11817 


228 


343 


LLQYS IQNEDAGWAWWLTPVI PTLWEAK 
AGRSLEVRSIi 


11716 


25617 


A 


11818 


322 


407 


VLRMLLHCLRECKIjVQPLWKTVWQFLKD 


11717 


25618 


A 


1 f Of A 

Holy 


i to 


i 
l 


WEKXVRGKKCOKEN THTHTHTHTHTHTH 
THTYRESKRERLV 


11718 


25619 


A 


11820 


269 


1 


FVQVFYILWTQSFCLTCCQWFLLGYHLS 
SVSNRFMRSGKSTFFFMTSEFFFFEMES 
RCVTQARVLECSGS ISAHCKLHLPGPRH 
SPALV 


11719 


25620 


A 




in * 
Zl / 




EHI ETVYGKLG I S EHWGKGGLFNKWLT I 
WKKD YS WQWWLMPV I PALWEAEVGGSRV 
H 


11720 


25621 


A 


11822 


188 


1 


GSHMPCRVISSVHESMNEFPAVPTSYPA 
NPQPRERAWRNQREKEDKKERSQRSVGR 
VQAGLV 


11721 


25622 


A 


11824 


126 


3 


KLQGQVQWLTPVI PAPWEAEAGRS PEVR 
SSRPASTWRNLV 


11722 


25623 


A 


11825 


224 


3 


ALI I KKEVSAMELEGSNRIQAYGVQS IT 
GARDYHAAASRVPAI KGTHHHARVI FVF 
LVETGFHKVGQAGLDSC 


11723 
i 


25624 


A 


11826 


3 


364 


HELPEPIiRVLWTAHRVAMAPGSRTSLLL 
AFALLCLPWLQEAGAVQTDPLSRIiFDHA 
MLQAHRAHQLAIDTYHE FEETYI PKDQK 
YS FLHDSQTYFCFS DS I PTPCNMEETHQ 
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26 
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nucleotide 
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ing to first 
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peptide 
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Predict- 
ed end 
nucle- 
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location 
correspon 
ding to 
last amino 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoteuctne, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Argmine, S=Serine, 
I^sTnreomne, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KSNLBLLR 


11724 


25625 


. A 


11827 


2 


376 


ARELPEPITVLRTAHLKAMAPGSRTYLL 
LAFALLCLPWLQEAGAAQTDTLYMLFDH 
AMLQAHRAHQLAIDT YHEYDETYI PKDH 
KYS I LHDYQTSFCFSNS ITTPYNKEKTQ 
QKSNLELLRISLL 


11725 


25626 


A 


11828 


288 


3 


IHTKNPSVHHHHQRPKVDKTTKMGKKQS 
RKTGNSKKWSASPPPKEHSSS PATEQSW 
TENDFDELREBGFRRSNYSELQEEIQTK 
GKEVENFEKN 


11726 


25627 


A 


11829 


107 


1 


KRSGVRLQRGLVRWLTPVI PTLWEAE VG 
GSFEVRC 


11727 


25628 


A 


11830 


220 


354 


QGNKS FKNI IAPWLGTVAHACNPSTLGG 
QGGRITGHE IET ILANM 


11728 


25629 


A 


11831 


273 


352 


ENLKTGQIQWLTPVIPALWEAKAGRS 


11729 


25630 


A 


11832 


239 


488 


SQTHCSGKTENSHGTNSDRNQVPTAIiHK 
RPSS PARFLTEAVP VPPVDYLQS VA7VFA 
VSAVASQWERTGKPFNPLLGETYELIR 


11730 


25631 


A 


11833 


189 


294 


DQTRWLTPVI PALWGAKASGS PSVRSSL 
HFPLLF 


11731 


25632 


A 


11834 


1 


332 


GTSPEPLTVLWTGDLLAMAPGSRTSLLL 
AFALLCLPWLQEAGAVQTDPLSRLFDHA 
MLQDHRAHHLAIDTYHEFEETyiPKDQK 
YSFLHDSQTFFCFSDSIPTPSNMEET 


11732 


25633 


A 


11835 


330 


175 


PRELAQLGSPNVRVSRVKPRGQPQMFFF 
FFWRQESCSVAQAGVQWHEPEQQE 


11733 


25634 


A 


11836 


96 


1 


W VKTGSRSVAQDGGQWCDLS S LQPQ P P 
RPRA 


11734 


25635 


A 


11837 


267 


334 


SWVQWLTPVILALWEAEAGGSP 


11735 


25636 


A 


11838 


180 


2 


SVIiEKKEKNLYKjnjFTYKLLKKVQGKTQ 
OEGRAQICSVCVCVCVSVCTCVHVCAYV 
SSC 


11736 


25637 


A 


11839 


183 


2 


AAPLTSSAPQAPGWKPHLAPNS PFKNFT 
GRGQAQWLMPVI PALWEAKVGRSPEVRS 
LSSC 


11737 


25638 


A 


11840 


54 


330 


DPNGQL PEPLKGLWTAHLVAMAPGSRTS 
LLLAFGT iT iCT .P WLQEAGAVQTVPLSRLF 
DHAMLQAHRAHQLAI DTYHE FE E P Y I PQ 
DQKNSFLD 


11738 


25639 


A 


11841 


117 


344 


IQYVKQ INETEMRNNESYLNHTSLTITI 
HTLCLMGSYLEHFQNQCKGRARRLMPVI 
PALWEAKAGRSPDVRSSKPA 


11739 


25640 


A 


11842 


750 


968 


nB/VITlTILTTT *T~ Tt TT> f"1 fl T\ IJI IT T~"T>T T T\T T7TTT 

RAQGPR W I LKI P FS S PFRLFP L I PLVFL 
YNb F F Li r F F c FK i W AJ/NJNb FFP PF liF P P 
LPFSPPPPPSPSRPPPS 


11740 


25641 


A 


11843 


757 


1000 


WELVWYCXILFSPFFPLPSPPLFPPPPP 
SPPHSFPPPSPFRPPLPLAPTRPPPPFS 
LFQAPFliLiForCoPPPPPPPSiior u 


11741 


25642 


A 


11844 


278 


361 


TLLFFFFFFEMESHFVARAEVQWRDLGS 


11742 


25643 


A 


11845 


260 


2 


GSCLLEGKLTNRKDIHTKNPSVHHHHQR 
PKVDKTTKMGKKQSRKTGNSKKQSTSPP 

RA 


11743 


25644 


A 


11846 


194 


2 


TKFAKEPSPPPCWQEVFKTSPLGFFFCP 
RSFPVNGFPRLKNLFFFFETESCSVAQA 
GVQWCARA 


11744 


25645 


A 


11847 


219 


1 


SLGHTLKQSESSLSFSPGGTPLLEFKPS 
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Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K~Lysine, ^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, SHSerine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X^Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














VYDPSVPPFQVSKGGC.VvjF rrr r y± CioU 
SVAQAGVQWHDIiGSPRA 


11745 


25646 


A 


11848 


126 


3 


RFFF KGLr JbHr F r r r r FFL-r KM ootid va 
QAGVQWCDLGLV 


11746 


25647 


A 


11849 


216 


339 


KCFKVGKVYKLNRETFYIiAQNFr DKN WA 
TQKMWKTLLEIil 


11747 


25648 


A 


11850 


190 


334 


VSYHIEMS FFEVYNEKIHDUjVCKDENG 
QRKQPVRLKQF I ICFEFLFL 


U748 


25649 


A 


11851 


198 


352 


SRQGTWGIHGLECFFVCLFVIiFFETESH 
FVPRLECSGAI S AHCTLHLPGEL 


11749 


25650 


A 


11852 


371 


1 


PPKKLGIQVFTPFAPRAFWPVLVFKKKG 
FPQGNLVLPGFFSKVSIQGVPPFCRKPF 
SGWGEAFQIGFF INRGKKFFFFPLGFFL 
KKVFLKFLAKFFFFFFFFFETESHSVAQ 
AGVQWRSLGLV 


11750 


25651 


A 


11853 


176 | 


3 


KKGFPLDWNSMVKNI IVKFGEDLQISKT 
LHTFFFLTESRSVAQAGMQWCDLGSLHS 
C 


11751 


25652 


A 


11854 


281 


375 


QLTFKKYFIiGWARWLTPVIPALWEAKAG 
RSP 


11752 


25653 


A 


11855 


146 


356 


KCGAL IAE I E V PIjFSEIjRUFJjIjAYSj Rflri 
ELCFKRCVPS LHHRALDAVDDDCLHS CS 

/tnlfWH TXTP C? T MYT7V V 

CTDY PIN Kb LiWRAjc 


11753 


25654 


A 


11856 


136 


1 


LS ILCEFFNWLIEMEYRSVSQAGVQWCD 
LSSIiKPPPPRLQQCHSC 


11754 


25655 


A 


11857 


277 


361 


IVLRGAMWLTP VT PTLWBAEEGGSPEVR 


11755 


25656 


A 


11858 


295 


152 


WFGFGFFETESHFVAQPGVQWCNLGSL 
CSLRLPGSSNSPASAFQVA 


11756 


25657 


A 


11859 


1 


342 


GTRLPEPLTVLWTAHLGAMAPGSRTSLL 
IiAFALLCLPWLQEAAAVQTAPVSRLFDH 
AMLQAHRAHQLAIDTYQESEETYIPKDH 
KYS LLHDSQTCFRFSDS I PTPYNMEDTQ 
HK 


11757 


25658 


A 


11860 


244 


332 


TIEMMJjDllvQlQVi r Lir or KnUKKXAIS 1 
T 


11758 


25659 


A 


11861 


1 


339 


(5TK V V T 1 Uy Vy littA! AtllflJ X J? JTJSWJEi/iKO 

VAQAGVRWCDLGSLQPPPPGSSSSSSSS 
S 


11759 


25660 


A 


11862 


311 


394 


GMLGAVAHACNPSTLGGLGGW I TCSQE F 


11760 


25661 


A" 


11863 


130 


1 


VPPSVKTFFFFFFtSTitoKta VSyAUVy WW 

YLGLLQAPPPGSTSC 


11761 


25662 


A 


11864 


3 


320 


IMFlXAbrijIjoVljIjVM^ X 
GGLVLIVSGWGCVIILNFGGGYMGLIV 
FLX XJjGGMMVVrGx rTAMAiKK x PKAWU 
SGVEVLVSVLVGLAMEVGLVLW 


11762 


25663 


A 


11866 


165 


309 


GuXljijPrluiSUoUKloAnvJoijI JjiAjolUf 
SDSGSHVAGTTCTCHYaWIjR 


11763 


25664 


A 


11867 


208 


314 


bbKN ILiiooyAy ni-> 1 r VI rHli WKAH vouo 

PEIRSSR 


11764 


25665 


A 


11868 


102 


I 


KKFFFFFhTESHSVAQARVQCNLGSLQA 
PPPRFT 


11765 


25666 


A 


11869 


126 


3 


MT^T/iLiRSSYCKKPFFFFLKTESRSLTK 
AGVQWCDLGLLQ 


11766 


25667 


A 


11870 


1 


273 


KQLPVNFLNWVRLELTGLLCTLNLSKPC 
MIFI ILVTVKYWFLLFCNIFKNHTFSQA 
QWPTPVI PALWEADMGGSHGYWITTI VD 
FMCATLT 
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otide 
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sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylaianme, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=»Asparagine, P=ProIine, 
Q^Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possibk 
nucleotide insertion 


11767 


25668 


A 


11871 


126 


2 


FPKALLWFFFFFFFFETESRSVAQARV 
QWHNPGS LQCPLR 


11768 


25669 


A 


11873 


269 


9 


QVSGKFWPDIFYKTGEKGLQCPNFPQGH 
PPRGLKKKSFSKKKKKKKKKS IAVGQAQ 
WLI PVI PALWEAKVGRSPEVKSSRPACP 
TW 


1 1769 


25670 


A 


11874 


114 


1 


LEAKPGVFQSGVLLVIFFFFETESHSVS 
QAGVRWCDLG 


11770 


25671 


A 


11875 


146 


1 


GNHLSSRVGVQDQPRQHGKTLSLQKIQK 
VAGPGGAHLQSQE VQVGGS L 


11//] 


25672 


A 


11876 


2 


159 


SLQFFFFFFRWS FVLVAQAGVQWHDLGS 
PQPPPPGFKHSPASASQVAGRGGR 


J 1772 


25673 


A 


11877 


124 


240 


FRAFSSGQAWWLTPVIPALWEAETGGSL 
EARSSRPDWRT 


11773 


25674 


A 


11878 


135 


5 


QVLFYFYFSDSFTLSPRLECSGVTLPHC 
NLCLPGSSDSCASAS 


11774 


25675 


A 


11879 


170 


2 


GFFFPEKWGYKNS FLCFFFFFFFEPEFC 
SVAQAGVQWLHLGSLQPPPPVFKVDAAP 


11775 


25676 


A 


11880 


97 


267 


GHGHATLRGLCVLSLFFHI PAPS VSGTS 
DAEECCLCVI YRLI CGYISRNCLYFSDH 
R 


11776 


25677 


A 


11881 


1 


292 


LPEPLRVTjWTAHLQAMAPRSRTTIiLiLAF 
ALLCLPWLQEAGADQTVPLSRLFDHAML 
HAHRAHQLAIDTYQEFEQTYIPEDQKYS 
FLHDSQTYFCFSD 


11777 


25678 


A 


11882 


93 


2 


KYQMGWAWWLTP VI LALWEAEAGRSPET 
SC 


11778 


25679 


A 


11883 


145 


2 


PPLGLRLQVQAPTPGFFFFFFETESRSP 
RIiKCKGAILAHCNVCLLLV 


11779 


25680 • 


A 


11884 


202 


1 


TWWRWG VTVL VRL VMNCRPCDRHKS AS Q 
LIGRVRQENGLNARVGGCSEPRSRHCTP 
VWVTSENPSSC 


11780 


25681 


A 


11885 


42 


155 


GERSGLSPGVQDBPGQHSGTSSLQKILK 
LAGHGGTCP 


11781 


25682 


A 


11886 


215 


1 


STMARHCPLSPMLFNSAMEVLVRAISQE 
KEIQGIQIGKEEVQLSLFTDDMIFNLEK 
RKDCSKNLLQLMNLV 


11782 


25683 


A 


11887 


214 


1 


GYFFGLNEVLGKLEKPSLKVPQNPSPKR 
PFFLGFFFFETESRIVARLECSGAISAH 
FNJjCLPGSSDSPVSC 


11783 


25684 


A 


11888 


215 


3 


WGPGFPFFSFFFFFLRQGLAVIRLKCSG 
TITAHCSLNFDGS SDPPASVSLVATTTG 
Uti PS ITQ FHS HGS C 


11784 


25685 


A 


11889 


118 


1 


RFFIMGENPTKFFFFFETESLLPRLECS 
GVI S AHCNIiS C 


U785 


25686 


A 


11890 


267 


83 


HCLRSGVQDQPGQHGKNPSIQKIQKLAT 
S FECKSLKI VT PPLPFKNDKTDSKS SCSS 


11786 


25687 


A 


11891 


134 


2 


DRLAVLPRLECSGMIFLLPLPSRFKRFS 
CLSLPSSWDYRGAPRA 


1 1 /57 


ZJ005 


A 


1 1892 


116 


1 


S KGVGHFL F F FFF FETE S RS VAKTG V QW 
CDLGS LIiCL V 


11788 


25689 


A 


11893 


397 


475 


RFVCST I KVLRDLS SDRSNPGRFLST 


11789 


25690 


A 


11894 


115 


331 


KNVCLFVFVEMKNHLNPGDEGCSKPRSR 
HCTPS WAAQQDS IS KYI CI YRYLYLD I Y 
LSKLWRLRINQLPNS 


11790 


25691 


A 


11895 


134 


251 


INPPVSRKKKKKKKKKKKKKKKKJCKKECK 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
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£=Glutamic Add, F=Phenylalanine, 
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W=Tryptophan, Y=Tyrosine, 
X-Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














KKKKKKKGGGL 


II791 


25692 


A 


11896 


186 


402 


PGKNLTIiENVPRENKVGKKAPGQNEPPV 
liGGGEYQDPGGNVKGvGP P PAE>»FGKjjG 
PKRIaADx FuIiX EGDGD 


11792 


25693 


A 


11897 


100 


2 


KGPPFFFFFFFFFFFFFFFGQSGQVKLK 
SPKCK 


11793 


25694 


A 


11898 


92 


3 


S VARI MPYALFIjIjS VGliVMGFV GF5 S KP 
S 


11794 


25695 


A 


11899 


283 


362 


MTYTOTIVHALWASVCLLLNHAHDPLD 


11795 


25696 


A 


11900 


248 


1 


VMSAQPGLSVUjRFMGKWSNWTGMQYl'ti 
SEVER YRDRGRARETERQRDRERERENP 
RNRKLFFYGRFKHSPQDFFMPQKVHF 


11796 


25697 


A 


11901 


253 


2 


KLQENPFKLLNFIINLCSVSIJSrVPI ILN 
INIKPSFIAPKPRCPSKFQRNYIYIYPS 
IYLSIYLSJYLSIYLPITISKVLKDVE 


11797 


25698 


A 


11902 


179 


3 


LAVTKNEIMGFSTTWMEMGDINFFFFFF 
ETKSHPVMRLECSGVISAHCNLRILVSS 
SC 


11798 


25699 


A 


11903 


1 


403 


GTSSQESFGGCCVSGLIAMGTKAQGERK 
LLCLFILAILLCSLALGSVTVHSSEPEV 
I I PENNFVKLI CAFYDLSFFSSS FSYFF 
HLS FFI YYP YYLFVS CFTF Y I VI JjLS FI 
YSHHLSSVSYYSYVLLIYTFIL 


11799 


25700 


A 


11904 


37 


242 


KGPTRDQLQHPKARLPAPLRVLWTAHLA 
AMAAGSRTSLLLAFALLCLPWIalYtXSAS 
HIVSLFMLFIIF 


11800 


25701 


A 


11905 


1 


298 


GTSQHRGRKUsRTGSHS SSDHPGAKLLS 
TEBKQAAETMRPPSAPPRRGCIPWQGLL 
LSSS 


11801 


25702 


A 


11906 


225 


30 


ACYLQKDGAFAILPDVLPDMFKGHSSLY 
PCQHAECDHI KNIYNCVCVCVCVCVCVC 
IVICKLNV 


11802 


25703 


A 


11907 


196 


320 


HSSSSTTPSQDTHTHTHTHTHTHTHTHT 
HKIPQRELLPSVPD 


11803 


25704 


A 


11908 


130 


3 


NGFVFSPFPFFFFFEMESHSVTQARVQW 
CDLGSLQSLPPGFK 


11804 


25705 


A 


11909 


381 


2 


ASICGQKLIFFQHWKKI ILVQQVGLQKK 
NSTLKRAWEFLSFFQPAPPLGS PLKGIN 
ILNKGAKKLKGKTNFPGPIVFFFSVLPP 
FFGLKKFQKKVFHQGKSPFFFFFFFETE 
SRSVAQAGVQWHDL 


11805 


25706 


A 


11910 


138 


2 


SVPLKEFIISQARWLTPI IPALWEAEAG 
RSPEVRSSRPASTWRNP 


11806 


25707 


A 


11911 


231 


54 


IRAS FGIQIRICLTIFFFFETEPHSVTQ 
AGVQWRDLGSLQPPPRRQEQNSVSKQTK 
TN 


11807 


25708 


A 


11912 


3 


461 


DAWGRVEGPPLRPPATSRRWAGPTLWRM 
EVTGDAGVPESGEIRTLKPOjLRRNYSR 
EQHGVAVSCLEDIjRSKACD I LAI DKSLT 
P VTLVLAEDGT I VDDDDYFLCLPSNTKF 
VAMAINEKWAYNNSNGGTGWISQESFDV 
FEAYSGATLLFFF 


11808 


25709 


A 


11913 


318 


407 


IjCGRIiWWIiMPVI PAIjWKAC/UjKjIjIjxUjKo 
MR 


11809 


25710 


A 


11914 


340 


5 


DRVAKFSEARLFRIiFFPFFPLKIFCFPR 
GFKIFRGVCPLLFPPEFWGLFQKGPGMR 
LFFPPLGGFFFFFFFFFFFFFFSFFFFF 
FFPFFFFFFFFFCHKNKTLLKKVHNSI 
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amino add 
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peptide 
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Predict- 
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ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Add, F=Pbeny (alanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop cod on, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


11810 


25711 


A 


11915 


274 


385 


IKPQRWGETRAEKLKSLKIRVPLLLQRN 
AAPHQQWNK 


11811 


25712 


A 


11916 


259 


441 


DTKLPKVNIKLNFHALKKKKKXKKKKKX 
KKKKKKKKSSSLRG 


11812 


25713 


A 


11917 


254 


402 


LIVSVIDFLRWRLPUjPKLECSGMIS^F 
CRLRL PQS YHS PAS GCQLLGR 


11813 


25714 


A 


11918 


223 


1 


NTNSPRKKFFYEEVSSVPVFFGVFPPSP 
LKGSPRAFLKLAWRPPLPLFFFLEMBSR 
SAAQPEVQWCDLGSLQPP 


11814 


25715 


A 


11919 


97 


3 


KPFFFPFFFFFFEIESRSVTQAGVQWRD 
LGS 


11815 


25716 


A 


11920 


154 


384 


KEFFLMLFFFLPPPPPFFFFFFLGKGLF 
FFPPGGGGGGQFFSIGPPPPRGKKDPPP 
SPPKMGGKRGTPPPPGYFFFF 


11816 


25717 


A 


11921 


225 


369 


GTLNLLTYKIKSWSGAVAHYNPSTLGGW 
GKW ITSGQEFETS LANWKP 


11817 


25718 


A 


11922 


116 


372 


MB X 1 WNKMAMa YNliHI YQTY KNSVIKHP 
TGVLPHSRVIMVWNIVLCISKQLKELLR 
RLMWEEHLSPGGGGCSEPRSRHCTPAWA 
T 


11818 


25719 


A 


11923 


271 


409 


KKKKKKKKEKKKKKKKMOCKTGGGA 


11819 


25720 


A 


11924 


121 


228 






GGGFKKKPWGGQK 


11820 


25721 


A 


11925 


55 


423 


NKPKKKNFLKKKKKNFFFPFYPLKFFFF 
PKS LNFFRRWPKI S PPKKKFFFKNS PS 
VFFFPPLKKKNFFFLTPLKFGPPKNFFK 
RPPPLFFFFFFFFFFFLSGNCIEKL 


11821 


25722 


A 


11926 


253 


443 


YQHQRPKVDKSMKMGRNQRKKSENSKNQ 
NTS S P PKDHNS S PARQQNWMENE FDGLT 
EVGFRRW 


11822 


25723 


A 


11928 


175 


413 


A 


11823 


25724 


A 


11929 


490 


182 


RKQKI RGCKKPNPLAQC3GVKKKGKKGPP 
FNFFLKQKGQDFLKNWNLGQKKKIPIPP 
QVFSLRQKGAFPGRIFFFFFFFETESCS 
VAQAGVQTLSYKNKNKIK 


11824 


25725 


A 


11930 


110 


2 


KKIPRKPPFFFFFFFFYKAGSHLVAQAG 
VQWHDLGS 


11825 


25726 


A 


11931 


169 


987 


YLEKIMSEHSRNSDQEELLDEEINEDEI 
LANLSAEELKELQSEMEVMAPDPSLPVG 
MIQKDQTOKPPTGNFNHKSLVDYMYWEK 
ASRRMLEEERVPVTFVKSEEKTQEEHEE 
IEKRNKNMAQYLKEKLNNE IVANKRESK 
GSSNI QETDEEDEE EEDDDDDDEGEDDG 
EESEETNREEEGKAKEQIRNCENNCQQV 
TDKAFKEQRDRPEAQEQSEKKISKLDPK 
KLALDTSFLKVSTRPSGNQTDLDGSLRR 
VRKNDPDMKELNLNNIENIRK 


11826 


25727 


A 


11932 


161 


389 


S VQTHPNIaRSCS VLKNAMHFYLLG YVI S 
GCTE PAKAI KP I DRKSVHQ ICSGP WLS 
LSTAVKKIVGNSLDAGATNI 


11827 


25728 


A 


11933 


134 


289 


AS TQNMGQMLRLMS S PTLLSLS LS HTH I 
HKHTRMHTHTHTHTHTYTPTKKV 


11828 


25729 


A 


11934 


128 


1 


I KKGE FFFFFFFFETESHSPRLQQSGT I 
LAHCNLHLLGSSNS 


11829 


25730 


A 


11935 


131 


19 


MYIGWVQWIiTPVI PALWEPKVGASLEAR 
SLRPARTSE 


11830 


25731 


A 


11936 


104 


3 


LKGQLIRAQWLMPVIPSLWEAEMGRSPE 
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Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamtc Acid, ^Phenylalanine, 
OGlycine, H=Histidine, Msoleucine, 
K=Lysine, L=Leucine, ^Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














VGSSR 1 


11831 


25732 


A 


11937 


146 


2 


GGRLGIIYHAWNPEGFFFFFBMESCSVG 
QAGVQWCNLSSIiQPPHPGFK 


11832 


25733 


A 


11938 


149 


1 


DKCFLCVCCLFIFIYLLIYFYPFETESC 
S IAQAGVQCCNLGSPQPIiPLA 


11833 


25734 


A 


11940 


176 


380 


QSVAQLPRIiECNGMIIiAHCSLHLLGFKR 
FSCX&LPSSWDYSRLRNYQGGRRWSKTH 
QAWCQSLT ILWN 


11834 


25735 


A 


11941 


133 


415 


KKKKKKKXKKKKiUUCKKKKKKKKKKJuQC 
KKKKKKSG 


11835 


25736 


A 


11942 


222 


420 


QGDKFLDNNNSTILYMfclKXKKKKKKKKK 
KKKKKKKKKKKKKKKKKKYSR 


11836 


25737 


A 


11943 


163 


425 


KNPPKNSQLSPSTIiKKKKKKKKKKKKKK 
KKKKKXKKKKKKJUCKTG 


11837 


25738 


A 


11944 


143 


3 


KMGKKQSRKTGNS KKQSASPP PKERSS S 
PAMEQSWTENDFDELREE 


11838 


25739 


A 


11945 


200 


3 


QNSQVGYKGAPHKEKTMALQARVNLGPP 
RGPLKRPALFFFFFETESPSVTQDGVQW 
HDLGSMQPP 


11839 


25740 


A 


11946 


221 


3 


RKIEGGPSTSRKSVRGVRCFQPLGLGFP 
GGLFF FFFFYETES CS VARLBCSGWSA 
HCILHLRSTNEGRRGR 


11840 


25741 


A 


11947 


68 


177 


I SSSFKNKCLCEKKKKKKiCKKKJOcKKKiC 
KKKKKRKR 


11841 


25742 


A 


11948 


210 


329 


RHTHTHTHTHTH I LSLS LSHTHTHAHTL 
MYFLAFFDLRS 


11842 


25743 


A 


11949 


183 


20 


LGTVAHS CNFVFS LS RGFLHVGKNGLRL 
PTSDDPPTSASQSAGTTGMSPCTWPE 


11843 


25744 


A 


11950 


232 


329 


VGIKMS ISSDEVNFLVYRYLQESGFSHS 
AFTF 


11844 


25745 


A 


11951 


412 


239 


LFYEKGSRFVSQAGLELLELKQS PCFGL 
PKCWDYRHEPPRQADLPFCVPSSAWASW 
L 


11845 


25746 


A 


11952 


310 


393 


PHTDISGTPE IMHYVHVHRVTTQPRNKP 


11846 


25747 


A 


11953 


193 


3 


IiSSWPTLISGAFKAEENAVEGNDDSRIL 
RGFFVLFCFVLFFRQSIiALSPRLECSGV 
ISAHCNL 


11847 


25748 


A 


11954 


185 


416 


SVQTHPNLRSCSVLKNAMHFYKLGTEPA 
KAIKPIDRKSVHQICSGPWLSLSTAVK 
KIVGNSLDAGATNIDLKLKDY 


11848 


25749 


A 


11955 


100 


1 


AHLSKVFP I FFFFFEMESHSVTRLECSG 
TISAH 


11849 


25750 


A 


11956 


35 


521 


KEKFFFHAGVYWGP PRNFLKRAPLFFFF 
FFFFFFFFFFSFFFS 


11850 


25751 


A 


11957 


214 


1 


" KKKIFFF KE I FYPKLSRKKKNFFRGAGK 
ILNLSIFPEKKKIFSPFSFFFETESCTV 
AQAGVQWHVLGSLQP 


11851 


25752 


A 


11958 


114 


1 


RYFSGQAQWLTPVI PALWEAEAGRS PEV 
SSSRPAWLTW 


11852 


25753 


A 


11959 


188 


12 


" SKCQCTFFSNFKKTEMESHYLAQAGLEL 
LGFSHLPTSVSQTVGITGVSHCAQPNAH 
LY 


11853 


25754 


A 


11960 


241 


556 


SS I PFLPNKHLLLSFSTLSSLGREVS IQ 
DMCQGTYQKQPFPFPLPHVFHLS IRDGE 
LCVGRDVHLTCQTDIiAQVFVCLFVLGVL 
I,SLPRAGVQWCHLSSLQPPPP 


11854 


25755 


A 


11962 


362 


462 


KI GQ ALTP IIP LL WEAEAGAS P EARS S R 
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N=Asparagine, P=Proline, 
Q=Olutamine, K^Arginine, S=Serme, 
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OJMriF 1 


11855 


25756 


A 


11963 


43 


2 


CCCYCCCCCCYCC 


11856 


25757 


A 


11964 


30 


415 


GLiGFKQICI FFF FFLiGQGPNFDLIjRAKKR 
PFLPPPFFPKNPKNRGVFLLGPSGVPKP 
RPRQGDVPPLKPRSLGVFLTSKKTPLGP 

MiMDODDOPHPO 

WAW e r if if r olAa e 


11857 


25758 


A 


11965 


149 


2 


SKTWLKLAKYLKNTYPGWVRWLIPIVPA 
Ii wQAGAGEXiHE PRNIiRPAWAT 


11858 


25759 


A 


11966 


179 


3 


KKNI FFP PPVKFGP PQGFFKRPPPLFFF 
FFFFFFFBTESRSVAQAGVQWLS IGSLQ 
AP 


11859 


25760 


A 


11967 


245 


382 


D VTDS KMHPDFRSS HVKMWIiGWAHACN 
PSTLGGSRGQEFKTSIiAN 


11860 


25761 


A 


11968 


264 


10 


LSTLECKGVSPGVFS PKGWRKGLGYIFS 
P FLE KPH I P PI/3 1 S MYFFF F FDGVS VAQ 
AGVQGRDLGSLHLPSPGSSYSCASASLS 
S 


11861 


25762 


A 


11969 


326 


406 


RLLKGWALWIiTPVIPTIiWEATVGRSPE 


11862 


25763 


A 


11970 


120 


1 


KGYGFPFPPKKPIX3PPFFFFFATESCSV 
AQDGVQWRDLGS 


11863 


25764 


A 


11971 


84 


370 


RNGAELKTPLAGQTQITINCRYSWVTLV 
LGDLESILCWHI FKKTNLFI CGFFSSFL 
LGAGRMKSHCVVKLECRGMX S AHRNVCL 
LGSNDS PCS AF 


11864 


25765 


A 


11972 


196 


3 


S RGKMGE YPFVL FBCNPS WAKAGGNKGNP 
FFFFFEAESCSVAQAAVQWCNLGSLQAP 
PPRFTPRA 


11865 


25766 


A 


11973 


559 


644 


tf Tr»r<o Ti ir/^i TT4 t^t t t*it ^rnnkTr/r twit r.rrrrf 

KIGCSAYGVAILLFLYFFNKLAFTLWKK 


11866 


25767 


A 


11974 


3 


391 


HEAQLPE PLMVLGTGHLAEMAGGSRTYL 
LLAFGLLCLPWLQEAGAAC?TVPLSRIjFD 
HAMLQAHRAHQLAIDTYQELBETYIPKD 
QKHS FLHDSQTSFCIjSDS IPTPSNMEET 
QQKJSMIjETjTiRXSIxIjIjIK 


11867 


25768 


A 


11975 


138 


365 


LKFECHSTLCANHGKQTFTFVFYNKFLL 
KYTFYFS VLTYAINWYI KNL INVFKKKK 


1 loOo 


257d9 


A 


11976 


275 


433 


MS X LJJLo KAKNFr liS FJjETGS CS I TQAG 
VQWLNHSSLQPQTPGLRDPPASASQ 


11869 


25770 


A 


11977 


108 


3 


RCGLFFFETRASPCRPGWHNLGSLQPLP 


11870 


25771 


A 


11978 


270 


408 


FENNVWLGRAQWLT P VI PALWEAQAGRS 
PGPENQNHPDYKGNPWPY 


Uo71 


25772 


TV 


11979 


AAA 

228 


A At 

441 


yAJjiVr V 1 ISM Li UjJjIU\.L.LiU Va v A r MKrlKJx 

SSRGKDS 














11872 


25773 


A 


11980 


2 


447 


GALALNKTTADVWRLNFLVSGLHWKRWL 
QQTSJUS KWV JsJL KxSCo i LiKKKiuUUuCKKK 
KKKKKKKKDKK 














11873 


25774 


A 


11981 


124 


445 


KYGEMSQNPARGGPKFSKNQKYSEHLRI 
HCCPPFTFLISKKBIGDRKYSICKSGCF 
YQKKEEDWFCCPCQKTKTSRRAKSLKRP 

Ivy IVtr v/vr rw w l\Mr'/\AJr Lurfir 


11874 


25775 


A 


11982 


48 


429 


KESNGSQDRLLPKIHSPDVSGGKSGGMS 
QNPARGGPKLSKNQKSSEHLRIHCCPPL 
PFIiI SKKE IGDRKYS FGKSGCFSQKKKE 
DWICEPCQKTKLKGKIRPPPKKNGPGGS 
LNGRTTWVSGLFVHN 
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a. — «j n Known, — oiop tuuuu, / jjtoaiuic 
nucleotide deletion, V=possible 
nucleotide insertion 


11875 


25776 


A 


11983 


419 


500 


SIiLVKVEKQWPGAVAHACNPSTLGGPG 


11876 


25777 


A 


11984 


362 


496 


JjW Or i.KllJLiljKWirVWVfxlV tf L VrrtinunJlv 

GRLLEPRSLCNMVRPIS 


11877 


25778 


A 


11985 


142 


433 


RFVCSTIKVLRDLSSDRSNPGRPLSTSN 
KKKKKKKKKGGGGVI KKFLGGP I FGGGG 


11878 


25779 


A 


11986 


177 i 


2 


GAVAHT YNP S FLGGQGG WVTYVHRTRGR 
TR 


11879 


25780 


A 


11987 


178 


443 


DKKKKKKKKKKKKKKKKKKKKKAGGGLF 
KKI 


11880 


25781 


A 


11988 


329 


410 


Llr r C i.r r iSKrioruD V/vj/iJ. v i^wxijjxjvjo 


11881 


25782 


A 


11989 


146 


1 


DRASALQPGRQGRNFVSKNKNNNVI Y 1 1 
NTLTTT I FLGTTYDALS PEL 


11882 


25783 


A 


11990 


240 


420 


FliELSCHAERTTALFRATOQGCIiSLQNIi 
SVAF 


11883 


25784 


A 


11991 


239 


410 


GTLFIYLLGDLFIYFRNKSLTLLPRLEC 
GGVI VAHCSLDLLGSG YPS I S ALPS S WD 
C 


11884 


25785 


A 


11992 


126 


3 


WNKPNGRVGGRV 


11885 


25786 


A 


11993 


300 


22 


SQLLGRLRQENHFNWGGRGCSEPRSCHC 
I P AWATRANS I FCGFQASS VEVRRSARK 

KJbr SIJXjuKKJriDJ I IbnK Vo<jxjJ-ixjVx<»0 irw 

RLATPVRAQ 


11886 


25787 


A 


11994 


303 


2 


EGEE I FLPS PPFP PGQENPGSNCS PPLF 
P FGTFLDIR VAGSHIwyAK.G V c IvKKr J/o 
LHLFLIKKRFFFFFFETESRSVPQAGVQ 

Vi KxJJ-ivto 1 lr Jr f W xMX 


11887 


25788 


A 


11995 


171 


457 


' SFSDHLILGWIGFO^SKIALCSSPIK 


11888 


25789 


A 


11996 


357 


204 


EKTGFPHVGQAGVDLLTSRSTRLSLPKC 
WDYRHE PMRLAGHL YYFYPQMK 


11889 


25790 


A 


11997 


219 


I 


nrjir cnr n ntlDD VDD/^2\r.T7WA DDDUT2\/'QT.I'l 
fit V If WAPf Jr K l lr lr\y/*JuJPWi4JrlCr VovOJjfV» 

ARAPTKAGPKQKGARGTFFFFFEMESHL 

ODDT J?f*Mf2 , TTWAUf , NT>P 
O irK_Ljd*_iMVj X J. WiUluVI XJ V« 


11890 


25791 


A 


11998 


254 


1 


AHLRGNRQLPKHTFFQYMTTNLKCAFSV 
fJP OQYQT PUP FYT i F FfTYK CVFHTHT 
HTHTHTHTHTLYFQIMVLLLPSLRKKGS 


11891 


25792 


A 


11999 


167 


2 


NFKSFFQGLSRGVLNPKVNLFFFLETES 
pq vQn a.nMnwfjDT .d^lo ppptrpptrp 


11892 


25793 


A 


12000 


104 


3 


RPWTFFFFSETKSRSVAQAGVQWRDLCS 


11893 


25794 


A 


12001 


178 


2 


KIFQRGAKNSPWGLPFRGGENKKGAPPC 
QKGFGRFFFFFETESRF ITQTGVQWCDL 


11894 


25795 


A 


12002 


163 


282 


GIGGE WCLSKYVT KYVKKKKKKKKKKKK 


1 1 one 


I j lyo 




1ZUUJ 


195 


2 


KWGRIPLKKGGFCFTKKKSGQTLLIKSP 
PPAFFVFFIFFFLETESRSVAQAXVQWC 
DLGSLQAP 


11896 


25797 


A 


12004 


152 


1 


CFVDVSAEKLRPKDFMTKTPKAMATKAK 
I DKWALTKLKS FCTAKET I IRV 


11897 


25798 


A 


12005 


359 


1 


EWPIWPPPKRRVLSKVSKQFFISAPIR 
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KKPFPCQVGLIIiAPFRVLIiIGRPFFFFF 
\j VJj V Va IQRWS r I S LVVVVVRAR IMMYA 
LFLLSVGLVMGFVEFSSKHCPVYGALVL 

VoJ,wKV 


11898 


25799 


A 


12006 


189 


12 


DGRLPGSVCYYLIFLRQSFTLLAQAGAQ 

UrVT /20 T y^O T3 D O/"* PVDP 15 'DXT'O t> T O TV T ITS /"» 

WCxlLrtjSL^oJ^ri'var KKCj KK V b K X KAVRG 
DQ 


11899 


25800 


A 


12007 


131 


3 


KI FFFFFFFEMESHSYAOAGVQWRDIiGS 
MQDPPPGFMPLLFY 


1 lOAA 


25801 


TV 

A 


12008 


150 


2 


1 1 LFIjVP IEAVFFFFETvAEAGVQWHNL 
SSLQPPPPGFKQSPASASRVA 


11901 


25802 


A 


12009 


305 


3 


KFFFSLKASIRGRWFGLFFTPPKKGFLP 
KI PHQVFKGGPLWEKLQLGKAGLNLGPY 
KGFFKGAARFFFFFFETBSRSVAQAGVQ 
WRDLAHASVHASAHAS 


11902 


25803 


A 


12010 


263 


517 


DKGFQLPPRWCPGQDPLFFYGGIVFHGG 
YGP P F FFFFFKABSRLVAQAGvQWQDIjG 
SLQPPPCRTRGSP 


1 IQfYl 

I \y\)j 


25804 


A 


12011 


107 


374 


WMGTWGSLPVLLTTRSTCPPRSPRLHRR 
TGATIWIFFRLFETGTVLPMLECSTVIM 
AHCRLQLPGS S S S PTIASQVAGTKAHLS 
DCFVY 


11904 


25805 


A 


12012 


144 


261 


FPKKIKNFYFTFLQDPTGI FSLDKTIGL 
GTYGRIYLVS 


11905 


25806 


A 


12013 


371 


3 


IFHLRKIFTFLPGIFWPIRLLSEKKKPK 
RFWVKI KKFKNG YPLKNP IN PKYKGPRF 
FFKGEKNPNGKVP VLKKI KI PS FSKALG 
FLQKKASKPFFFFFFDTESCTVDQSGVQ 
WCDlAsbLiQAP 


11906 


25807 


A 


12014 


101 


3 


RGAFFFFFFETRSRSVIQAGVQWCDLGS 
IiQAP 


11907 


25808 


A 


12015 


205 


2 


VFNPSGINVMYVDAI ILNKRLATRI QQH 
IKQRILHDQVGFIHGMQGWFSIRKSINV 
IQHINRPKDKN 


1 1QAQ 


25oU9 


A 


12016 


t /TO 

168 


1 


GCVCQFQKKLGNGGLNGFFFPIiFFFFFF 
EMKSRSWQAGVHWRYLSSLQTPPPEFK 


i iotio 




71 


12017 


311 


2 


RGLRrGQyvKTQMENPNTPFSAIiRPGQLS 
SIRSGLLHTYPGLGDRSBPLSCSILSSS 
KYWWQAVTS ALS SNKPGASQGHWKDDF 
FLFFETESCSVTQAGVQWH 


11910 


25811 


A 


12018 


383 


247 


LVEMGFRHVGQDGLDLLTSQSAHLGLPK 

UUnVDDPDDDT 71TTDI7T 

VvWUIKKiLr't'KiaAi liS VJj 


11911 


25812 


A 


12019 


2 


379 


RVLWTAHLAAMAAGSRTSLLLAFALLCL 
PWLQEAGAVQTVPLSRLFKEAMLQAHRA 
HQLAIDTYQEFISSWGMEAYITKEQKYS 
FLHDSQTSFCFSDS I PTSSNMEETQQKS 

1\TT t?T.T EfTCT TTTOC 


11912 


25813 


A 


12020 


3 


389 


PEPLRVLWTAHRGAKGAGSRTSLIiLGFA 
UiiV-ijP W JjUEAGA V QTVPLS KIiFDHAMLQ 
AHRAHQLAIDTYQE FEETYI PEDQKYS F - 
LHDSQTSFCFSDS I PTPSNMEETQQKTN 
LELLRI FLVLIELWLDP 


11913 


25814 


A 


12021 


?47 
z»*t / 




PMLGHVSOSGGNGVRDPIiEEAVCPr.AKT. 
KHCSGRSTALFRAGRQKR 


11914 


25815 


A 


12022 


17 


371 


PLRDLWTAHLEAMAPS FRTDLLLAYALL 
CLPWLQEARAVQTVSLYRLSDHAMLQAH 
RAHQIiAIDTYHEIEETYILKDQKYSFLH 
DSQTSFCFSDS IPTPSNKEETQQKFYLE 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
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LLPISL 


11915 


25816 


A 


12023 


3 


359 


LRALWTADLGAKAPGSRTFLLLASALLC 
LPWLEEAGAGQTVPLSKLFDHAILQAHR 
AHQLAIDTYQEIiEETYIPKDQKHS FLIiD 
SQTSFCFLDS IPTPSNMEETRQKSNLEL 
FRTSLLL 


11916 


2581 / 


7V 
i\ 


12U24 


2 




PEP LKGLWTDHLGALAP SSRTS LLLAYA 
LLCLPWLQEAGAGQTVPVYTLFDHAMLQ 
AHRAHQLAIDTYHELDETYMPEDHKYSF 
LHDSQTCFCFSDS ITTP YNMEETQHTSN 
VELLRISL 


11917 


25818 


A 


12025 


314 


393 


GQAWWLTPVI PAFWEAKGGRSLEARS 


11918 


25819 


A 


12U2t> 


oil 




PLIACSFFLFIiFFFFGKKILFLPPRWKG 
GGGIWVNGNPGFRGQAFFLASSSKKPGM 
GGPPLTPGKRFVFLKKKGFSPGGPGGSK 
FSAPGTPPTGPPKGWE IPAK 


11919 


25820 


7k 

A 


12U2/ 


lOD 


o 


WQKLLFHFGTESCSVARVGVQWRHFSSP 
KPPPPEFKQLSAPASRVADRRPPDAW 


11920 


25821 


A 


12028 


1 


338 


VFVALKFLMCLLSVCIFSLENIYSNSLP 
QKNCLPLIFFFLFFEMEACSVTQAGVQW 
GDLGSLQPPVSHIiNLGGGGCCEPRECHC 
TPCSRPGDRDFVFNKNTMIQDNHLMELT 


11921 


25822 


A 


12029 


2 


315 


HEERERERERERERERERERERVGRTVG 
GVGETYKAELPRVGGAAQKRAHFSARGR 
LFMEICGDMCGEKPGKPPHSLTEECLSR 
CGERFFOTS LAITRGCAQSV 


11922 


25523 


7V 
n 


12UJU 






ERI IPRGVRQRNRLFSGRDTASPPIiFYP 
PPRQIHKRGVEDREHBRRGERERERERE 
RERERERERARAAR 


11923 


25824 


TV 

A 


12031 


1 1** 




QTERNS ININKKDIHTKTPSVGHQHQRP 
KVDKTTKMGRNQSRKAENSKNWKASSPP 
KEHNSSPAREQNWMES 


11924 


25825 


A 


12032 


177 


1 


IKKKTFFWQNIPLFSKKKTCRQKFFFFF 
FFETGSLLPRLVCSGAVLAHCSLCLPGS 
AFI* 


11925 


25KZO 


■ft 


12U33 


ion 

12/ 


1 
1 


PS FFFFFFFNETBSRSVAQAGVQWRDLD 
SLQAPPPGFITPSC 


11926 


25827 


A 


12034 


47 


314 


GAPVASVSISCPSCSATDGWRNGKSTA 
GHQRYLCSHCRKTWQLQFTYTASQPCTH 
QKIIDMAMNGVGCRATARIMGVGLNTIIi 
RHLKN 


11927 


25828 


A 


12035 


15 


408 


GAI PGAMGHFTEEDKATITS LWGMVNAE 
D AGGETLERLLVDYP WTQRFFDS FGNLS 
SASAIMGNPKVMAHGKKVLTS LGDATKH 
LDDLKGTFAQLSEIiHCDKIiHYDPENFKL 
LGNVLVTGMAIHLGKEFTP 


11928 


25829 


A 


12036 


122 


2 


KXiVDVLPPPGGARVFFFFFFPLETEHCS 
VAKAGVQWHDFG 


11929 


25830 


A 


12037 


154 


1 


GKPP P KLG FWGGPRQRVRGRGP P FF FF F 
ETESPSVAKAGVQWCDLGSLHPL 


11930 


25831 


A 


12038 


58 


254 


DPRVRQQRTTAHCSLNLPGLQRPPSLSL 
PSGWDYRHAPLCATQKILAYRVAQRERS 
AHGSYYQAS 


11931 


25832 


A 


12039 


270 


124 


DGVLLLIiPRLECNCAI SAHGNLRLPGSS 
NSPASNKTKQNNNKKTLSNKF 


11932 


25833 


A 


12040 


63 


355 


LGRGTAHRRLWRPLSRPASRVSYIiSAAT 
NKRSFAPPSRAFPWDNKWERGGFYFFET 
GSHRRRPGRMECSGAITAHCSLDFPGSB 



1267 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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ing to first 

a ini no aciu 

■ WIUUC If I 

peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
Qing w 
fact amino 

acid 

residue of 

UCLrllUG 

'sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, (^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glu taurine, R=Arginine, S=Serine, 

* l ill cuillllc, t — Y 41111c, 

W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
uuucviiuc uci6(ivii, \ possioie 
nucleotide insertion 
















11933 


25834 


A 


12041 


184 


3 


GVS RL LGGVS QLG YMGVRD P LE EAVCL F 
SKLKHHPGRTTALFRAVRQGCLSLQKFI* 
LLFA 


11934 


25835 


A 


12042 


118 


3 


DRVLLCYIiGWECSGTISAyCNLHLPGSK 
rtr SCLSLiPSS 


11935 


25836 


A 


12043 


40 


436 


LSEGKLTNRKDNHTKTPSVHHHHQKPKV 

LJ1\L 1 KrHjK,Xyjo 1 iS-Ai^M^aK-NKS I SSPPKB 

HSTSPATEQSRMENDFDELREDFRRSVI 

X ISC o oXtRJ&U V xC 1 XJV>KJUYE\JN XJiSJiLKijUiSWxC 

TRINSMEKTI1TOLMELKTMA 


11936 


25837 


A 


'12044 


101 


2 


HLGQVRWLTPVIPILWEAEVGGSPEHRS 
SRPAW 


11937 


25838 


A 


12045 


123 


3 


LTVI KELPLGQARWIjTPVI PALWEVKVG 

r* O T3 T7T FO O <3 T5 Tl 7i 

V is V Ko oKFA 


11938 


25839 


A 


12046 


3 


432 


PIFGGGGERIFPFFFFFSQGGGVFFPKT 
KNKGFFPFLGFFKKIFTjRNFFFLFFPFF 
FIiNPFFYFGAPPFFFFFPGGGFFFPLFS 
RFFKFFFQNFPQGFFFGGGFFFFFLGFF 
err r r Jj 


11939 


25840 


A 


12047 


325 


164 


KNKRKNRGQARWLTPVIPALWEAEAGGS 
PEVRTTGVSHGTRPHLLLNTVSEYI 


11940 


25841 


A 


12048 


187 


381 


LMDKRVSLWGDENILKVESNSGCTTLYL 

LVCFEMESRSITQAGVQWRDLSSIjQPPP 
SRFKRFSCLi 


11941 

- 


25842 


A 


12049 


84 


464 


SYRVPSHPDTLVLSRISAQEAGEKSPFC 
FPERVWPCPRPLSDLGRRLKLECGPDLL 
Uc> X rXtor r rvjEGIilvrGSHSGAIiECSGVI 
KJJnVAvJjlJxjol^ovrfJJr Jr X oAClxK VAGTTGV 
SHHCI FCRQDLPGS 


11942 


25843 


A 


12050 


232 


20 


LLQCSSRAKIHTSLTGNQKLEMIKLSEE 
KVIRSATPEFRHAE 


11943 


25844 


A 


12051 


401 


518 


F FETES CS VTQAGVQWRNLS PIiRA 


11944 


25845 


A 


12052 


191 


1 


LFPLKKEKIFFLGPGTNGAPPMFFLKAP 
PLFFFFFFFFFFFFFFFFFFFFVNLFLL 
Dor xitlXlx&lr 


11945 


25846 


A 


12053 


12 


369 


PLPIYSACIEVGSNPQGPGIDAKSVSHN 
NCVl^KEKKKKKSNGSQNNLLPKIHTPNG 
SGKKTGEKIQNPPRGGPKLSKNQKYSKN 
ul%j\Jtl^l»lrlCLj X r Jj J. o JSJXrU^lNKKi 1 roKS 
GCFYQKK 


11946 


25847 






oy 




WRTT/PriQT'DT.T.KrKrTTJMaMT T T T T17DTT T 

AMAFLMLTERKI LGY I QLRKGPNWGPY 
FAAPAIiALPFALFLCPPRPIiGRP 


11947 


25848 


A 


12055 


158 


3 


SQLTWPIMAQTDSMCLWIVYFDDNIGWA 


11948 


25849 


A 


12056 


113 


1 


KBHGGHGVEVLVSVLVGLAiMEVGLVIiWV 


11949 


25850 


A 


12057 


44 


263 


ARIARSAHEGKMPRKYTGARKNAENRPQ 
REIjQLKASRSTIDLAEHAOTALVT^QIjFL 
I SVGS WKNYILFYLTYT 


11950 


25851 


A 


12058 


142 


1 


KKIFFRYKKEGFSFPKFFRISLFFFFFF 
ETESRSVAQAGVQWRDLGS 


11951 


25852 


A 


12059 


12 


347 


QTERNS ININKKDIHTETPSEGHQHQRP 
IVDKSTKMRKNQCKKAENSKNQSASSPA 
KDHNCL P AKEQNWTEN E FHKLTE VGFR I 
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peptide 
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Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
EpGlutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
V^iyJutamine, K— Arginine, o— oenne, 
i— i nreonine, v — valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possible 

UULIcullUc QcicUOIl) V — pOSSIvlc 

nucleotide insertion 














T TNQVPT.KRW T T.TnPRTPaiTNTj PnR T iRRT . 
j. liiij v LUiVfinxij ± yuxv r*rtj\x» ii ri\^t> 1 1 r%r» i > 


11952 


25853 


A 


12060 


245 


3 


MPEGS FFLE I PE ITRVFVKKPGKFLGGV 
r;p<3 t.t .TrppppwRF ppMRn«; T.a vq pp T.Rr* 

urjiiur c c tr c rrrrr iw^oiinv ortviiciv.. 

NSAISAHCNLCLLGSSDSRSSGSR 


11953 


25854 


A 


12061 


94 


3 


KGGI FFFFFFFFLRQSPSVAQAGVQWCD 
LG 


11954 


25855 


A 


12062 


108 


5 


HGGRHL 


11955 


25856 


A 


12063 


298 


1 


PPQRVTKKGGBKDSFIPKRREKKNPPPL 
ERERERERERERERAR 


11956 


25857 


A 


12064 


122 


1 


VCEGVCVGGWVRDRERERERERERERER 
ERERERERERAR 


11957 


2585S 


A 


12065 


126 


1 


ATY^ry-T?DMnKrDftT T P\ro\7FFPTJPirDPDl? 

RERERERERERERESR 


11958 


25859 


A 


12066 


48 


351 


FFFF FFFFKKGPKI VPPGGGGGGDP I FL 
EPPPPPLKKFWGPPLRGRGNKGGGPPGW 
VOTGIFLKKKGIiPWGPGGVKTPAFKGSP 
GFXP PK&GNNRKN P FP W 


11959 


25860 


A 


12067 


1 


365 


GTRLTVLWTAHLVAMAPGSRTSLLLDFA 
LLCLPVfLQEAGAVQTVPLS RLFDHAMLQ 

LHDCQTSFCFSDS IPTPSNMEETRQKYH 


11960 


25861 


A 


12068 


174 


1 


PAWVMQHNPVSLFFCFFETESRSLTRIiE 

ncnT T C AUPMUP T . PfZG QDQPVQ D QPV1V A 

RA 


11961 

i 


25862 


A 


12069 


270 


375 


TRI KRCNGGRAQWLTPVI PALWEAKAGE 
SPEVRSS 


11962 


25863 


A 


12070 


197 


350 


KKI YIFLTINTIMDLI IPFLI1DNII1GFW 
JjC^VAaACNr5TijGt7KUoWX lKo 


11963 


25864 


A 


12071 


3 


378 


HEGQ L P E P LKGL WTAHLLGMAPGSRTS L 
LLAI DLLCLP WLKEAVAVQTVPLSRL YD 

QKLS FLHEYQTSFCFSDSIPTPSNMEBT 

T .fTTf^KT .FT .T .P TVT . 


11964 


25865 


A 


12072 


219 


1 


FSHPFPPGVFQTPLVFPRPWEVLGSPGG 
GGGPPKKWAGVQKWVFFFFFFFWEMESC 
QVAOAfSVnWYTiT/SCPPA 


11965 


25866 


A 


12073 


134 


3 


VFGPPFFFFFFCETESHSVAQAGVQWRD 

T iTG TiO A P PPfiPMP <?P 


11966 


25867 


A 


12074 


175 


357 


MPINQPVK3CMOTCYOTCVYIYIHTPLYI 
RVYMTOCMYICVYIHVYIWMCIYIHTHV 
STYVW 


11967 


25868 


A 


12075 


193 


350 


KI PHLTS L YHTQNYLKYC IGQAWWLTP V 
TPAT.WPA F2Vf!r5CPFVP c 5 c 5P P AWPW 


11968 


25869 


A 


12076 


252 


1 


GVFGPFPKKGGFFWVKGFLGTPQIFWVG 
FETBS<^VARAGVQWRDSVKKKITIARA 


1 1969 


25870 


A 


12077 


273 


366 


TT7TVT TJTVT FF^FCPQt7'C/"i2if' , *J > nTjJPNT J"* 

RLR 


1 107H 


ZjoI I 


J± 




IZJ 


L 


FCKKKTT . 1? F FFFF F F EMESRS VTO AfTVOW 
CDLSSLQPPPPLV 


11971 


25872 


A 


12079 


206 


334 


HNRVTI VNNNL I VHFKITKRCWTQWLTP 
VT P ALWEAKAGGS S E 


11972 


25873 


A 


12080 


135 


2 


KMKRKFGEDSQIPKTLQTFFFLTESRSV 
AQAGMQW CDLGS LHS C 
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nucle- 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Add, 
E=Glutamic Add, ^Phenylalanine, 
G=Glyctne, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, IMProline, 

{~V=zfZ fiitn iti in a R=A roininp ^Qorlna 

T=Threonine, V=Valine, 

W=Trvntnnhjin Y=TvrncinA 

X=Un known, *=Stop cod on, ^possible 

mtc l&ntiriA HpIpHaii \ — ti nr r i h 1 r> 
uuvrevuuc UCICUVUf ir^UUaalUtc 

nucleotide insertion 


11973 


25874 


A 


12081 


147 


3 


ATTPGLFF I FYFFETESHS IAQAGVQCM 
ISVHCNLCLPGSSDS PVLV 


11974 


25875 


A 


12082 


106 


2 


YFIPINEGYCFIYLFFETESRSWQAGV 
QWCDLV 


11975 


25876 


A 


12083 


164 




CP YKKRKKANOAKWT.TPVT MT.PfiP PDDV 

DRPSSGVQDKPGQHGETPSLPKIQKKSQ 
AWWCA 


11976 


25877 


A 


12084 


186 


1 


KSLFSKGNPLFKGPPPPPFFFFFFFETR 
SRS VTOAGWGAVTMAHCCUJTjPGP 
TPAPRA 


11977 


25878 


A 


12085 


98 


2 


CPFFRVNFFFFEMESRSVAQAGVQWRDL 
GSRA 


11978 


25879 


A 


12086 




Im 


FLYKTBSGSVAQAGVQWRDLGS 


11979 


25880 


A 


12087 




J 


KT^S VT Tf t TJT 1 PMnWQWf3PPPWr , D!*lliir»irvo~ " 
CV CKFWANQTLKKNLGAGRFQKPFSAAA 
FLEMGGFPFKFFRFFQGSQFFFFFFFET 
BSCSVTOAGGOWCMTiGST.RPT.PP 


11980 ' 


25881 


A 


12088 


154 


3 


KEFSFFAPGGKQRGEIRSLRGPPPQVKP 
LFFFFFFETESRSVAQAGVQWR 


11981 


25882 


A 


12089 




7 


QWlTATr.nvCT.'RITTT VT DClwSTET r"T70tIT» " 
o w am x o ua x x jj x jjir or yl r\Liv..r orix 

HTHTHTHTHTHTLYSQIMVTaLLPSLRKK 
GS 


11982 


25883 


A 


12090 


265 


30 


WFI VYSEISQLWGLPS FNTLGTWQSLS F 

TTTVZLTTMPCf^n/TYMVfST T7T T Aoennmron 
xr VAArUUV.VvV 1 yAblilS JjIxRooUxt it Vo A 

SQSARIRAMSPSVAWVIPGRSR 


11983 


25884 


A 


12091 


257 


1 


R^VMRMHTFRnYPPKTTTVROQ Ccr*i?TDDU 
V30 vritu'uiiaay I rC4M|^ v ayooobr JLKr'.ri 

LVYRVCFYLS VCLS VCLS I YLS I YLS I Y 
LS FFLS FFLLRKSLYLS I FLS FFLLS PR 

v 


11984 


25885 


A 


12092 


311 


404 


LYHONPWrjYSVFFFTiKTRSCSVADAGVO 
• WCD 


11985 


25886 


A 


12093 


331 


83 


GDDYKGARENSRDDKIjFYVTj I MMUVTOT . 
HVFVKIHKTLQLNGYILLCKLYLINLTK 
NNFQKPKTNSQKTLiHIFHPDSTAANI 


11986 


25887 


A 


12094 


82 


2 


LRE PttLLNx^EGatfEARWCHCTPAL 


11987 


25888 


A 


12095 


255 


393 


HGILFSSFVOOiNQFIiSQHWIiMPVIPAL 
WEAE VGGSLE PRSSTIAW 


11988 


25889 


A 


12096 


80 


3 


PFFFFFFFFETESRSVARLECSGMT 


11989 


25890 


A 


12097 


367 


1 


PRVDKPTKMGKKQGRKTGNSKKQSPSAP 
PKERSSS PATEQSWMENDFAKliREEGFR 
RSNYSELQEE IQTKGKEVETFEKNLDEC 
ITRI PNTEKCLKELMELKPKAQELREEC 
RSLRSRCDQL 


11990 


25891 


A 


12098 


94 


236 


MWADYFCMSPSIDEGLKKKKKKKKKKKK 
KKKKKKKKKKKKKKKGGAP 


11991 


25892 


A 


12099 


410 




MCT,HP<?T < ?r.FRP < ?^HPHTKT»TQr , r»anwT 

lnounro x our ivroonr nj. J\_Lj J. ovj Vi-ivJri 

TPI I PVPWEAKAGGLLETRSSRLAWAAQ 
EDPISMPVCK 


"TT992 


25893 


A 


12100 


140 


324 


NLGQ AQWLMPV I PARPWEABVGRSLAVR 
SSSNSPASASQSAGPTOPSHHNWLMFLQ 
LLTRL 


TT993 


25894 


A 


12101 


247 


357 


KQIQGRAWWLTPVFS PLWEAKAGES PEV 
RSLRPAWPT 


11994 


25895 


A 


12102 


245 


2 


DVLVGGLCLPAGGTLHLTS I FLLSGCWC 
GSKDEEAPCKQRISVQRESQSRTLRAGV 
SPKKAHPCEMCGLILEDVLHFPDLQ 
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residue of 
pep tide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
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last amino 
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residue oi 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=Proline, 

j~\ _/~» T> A _ .I.*.,,;.. O Carina 

V=\9lutamine, lv=Arginine, o— oenne, 

Threonine, V=Valine, 

w— tryptophan, x— lyrosme, 

a.— un Known, — otop cocion, /^possiuie 

nucleotide deletion, V=possible 

nucieouae inscruon 


11995 


25896 


A 


12103 


230 


379 


KEAS FFETESHS VSQAGVQWRS LNPLGS 
GNPPTST ^RVAGTTGM^HHVWL 


11996 


25897 


A 


12104 


86 


2 


IX^SVI^PRLECNGLILAHCNLRL.PGSS 


11997 


25898 


A 


12105 


116 


2 


SQKTPFFFFETVSLLLPKLECNGAISAH 


11998 


25899 


A 


12106 


271 


416 


QFTFSKAFNLVFIKPSFHISFYRTHYSQ 

7VUIXTT T , T5t7T T5CT X.TP T\T?A ftLf^ C D 


11999 


25900 | 


A 


12107 


194 


3 


HVLGTVLVAEDTASNKIRSLQGCLLLPL 

Iir o V V JjJS V laAKAXKy&KJ£±l\. V XvXwCblS 

VNRRRGR 


I/UUU 


25901 


A 


111 AO 

12108 


142 


•> 
3 


t r«T t VT»Trv/*»TrDii7\rjTiror , r\X7DiATT tdittdrt 
JjUIxU X WlA.JvKliAV1 WXfciU VK wli i ir V ItfAJj 

WEAEAGGSPEVRSLRPAW 


12001 


25902 


A 


12109 


233 


346 


HQRSLIGWAQWLMPVI PALWEAEAGVSP 

PWDCT VTi 7\ LiTD 
J£ V KoLiXVF AW lr 


12002 


25903 


A 


12110 


234 


349 


GFSPFYDFCLFVFETESCCWQAGVRWH 
DLGSLQPPPP 


12003 


25904 


A 


12111 


136 


333 


RIiNFF YFXFFETAS CSVAQAGVQWHDUft 
HCIPAWETEQDSISKNTVQKRKKKOTRA 
GISKTKNENF 


12004 


25905 


A 


12112 


,345 


110 


QJXSIa-LVMTJjQJ^QP X JjAYSIjWFP x vfp 
TTKACOTQG1^KPLLK3CKx%GQVQWIMP 
VIPTLWGADVBGSPELRSLEPA 


12005 


25906 


A 


12113 


108 


285 


YNAMKmFIiKTILNNKNSIGW 

I PALWEAKAGRS PEVKS SRS RPGAVAQV 

DAA 


12006 


25907 


A 


12114 


1 


364 


RVVAAEMGKFMKPGKAALDIiAGRYSGRK 
AVX VKNXDLK»I SDKP YSHAIj VAGI DRY P 
RKETAAMGKKKIAKRSKIKSFVXV11NYH 

rtT MPTTOVQllT^TDTTVtf^T^A/MVnVTypnDAT. 
UUFlkrl K X o V U X r JLUJJS. 1 V VW rJJV cKXJirHli 

KRKARREAK 


12007 


25908 


A 


12115 


120 


3 


TQIWGAFLTLFFFFFFEAKSHSVAQAGV 


12008 


25909 


A 


12116 


120 


3 


TQIWGPFLTFFFFFFPEAKSHSVAQAGV 
QWCSLGSLQA 


12009 


25910 


A 


12117 


231 


1 


ii f etiDCnVOllT'PVT T OT CJVOT T PUTT T T 

FJjS PFS CYoAITKXxJjS I»S YFljLGHITiTiTi 
LGSTPEAAaQWQWVS FADSD IVP PAST 
WVFPTLGIMHHNKQATENAKE 


12010 


25911 


A 


12118 


216 


1 


LIFPPLLNFCYGEVIiGFQGFIaALFFACM 
rr WJrKr or Jvl rrrrr x iStvJSr noAAvALsV 
QWCDLGSLQPLPRGFK 


12011 


25912 


A 


12120 


267 


1 


KKKKKAFPSPPLLGCFIRIiQKHL 

VP Q WT . T C C T .T . fT .T?flT T T . GT.T7 T T 2k TT . T TT 
I XorLLi X oo XiuLXxavjX X Lto ult XXA.1JjX 1Jj 

ITTHSLIiANIvP IAI LVFAACEAAVGLAL 
LVSIS 


12012 


25913 


A 


12121 


167 


372 


ISGQDLLKETKRVKRPFQQDDVPFINIF 
DFNTLFSALDRS 


12013 


25914 


A 


12122 


124 


2 


GLLKFNI FAPFFFFFSETESHSVTQAGV 


12014 


25915 


A 


12123 


121 


1 


RFTAAS ANSGAVSAAVGYMGDPGS E I IE 

Q^AO DUfi DX7Q C TA7 
o V Jr IrAtjir V ao V V 






A 




1 /o 




RDIjRG fs rf s nykfwnfgrvgg t f kfvf 

GKGVKRFFFFFFETESCSVAQAGGQWRD 
LG 


12016 


25917 


A 


12125 


137 


2 


KLGFFGGGGGKIFAPQKNFFFFFFFFFE 
MESCSVAQAGVQWRD IG 


12017 


25918 


A 


12126 


103 


3 


LEENLGNTIQDIGRGKDFTSKTPKAMAT 
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Predict- 
ed end 
nucle- 
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location 
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ding to 
l«cf amino 
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peptide 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
\i vriuiamine, xv.— Arginme, o— oenne, 
x j. urcuiiine, v — valine, 
W=Tryptophan, Y=Tyrosine, 

viitvuuwiij — oiup ciiuuij, / — puss 1 Die 
nucleotide deletion, V=possible 
nucleotide insertion 














KAKID 


12018 


25919 


A 


12127 


212 


351 


LITFEILNICLGDNILDLGLARWLTPVI 
PALWEAKAGGS PEVRSLR 


12019 


25920 


A 


12129 


187 


3 


RGSTMQQTNTRTPGVRVLYYILMIATMA 

TPYTPMAMT.T.T.T.TVDTT T21MJM7TMT TPD 
J.XT X XlTtUfUNXjljXjXjX VtrXXjJLAfll/Uf XFAUxiiK 

KILGY 


12020 


25921 


A 


12130 


n 
j j 




ZxGP V T TTP\/MaQTCTCT , irci?T BnarvpuacT 
xiorv.v x j.iz.v ruxODiZ? 1 I\or Ja/\XJAo XoJiyiixJ 

DANSALMEU)KGIxRSGKLGEQCEAVVRF 
PRr.FOTrYT5PDTT.TMQaTrr.Trr arYtnTPirmr 

c m^xjc trv X^ XXjXHOMX? Xj1\Xa«XJ vl? K 

NFlVFiCVlxKVTQQSEKHLEKI 


12021 


25922 


A 


12131 


x, j i 


jj\J 


MnTnPTVT/^PAPWT/PD'XJTDJVT UURvn op 
SPEVR 


12022 


25923 


A 


12132 






JCf^TiffPP D RTTT? T?T?*T , Tr CD Ct /TV r» n r*l rouro r\r 
it tree rrciXXTioKo VivyAvjVKWKlJli 

SSL 


12023 


25924 


A 


12133 


"129 


1 
I 


PISASQIPGTIGMCH 


12024 


25925 


A 


12134 


196 


361 


TIJ^Ex^AAMKKKSESPLLHFQGKKIiNQD 

vavatfTrtW vu Xt X ir V X rAXjrfi\MJ\.l vJvjoJjllr'Ko 


12025 


25926 


A 


12135 


120 


1 


PPPPGLFFFFFFETESYSVAQAGVQWCN 
IjGSLHPPPPBFK 


12026 


25927 


A 






i 
j 


KJSXj J. KX lSKUKliJ\.yxjo VCQRr QIiAKFPPF 
PSGVGEKQNFFFKKKKKKKGRAQWLTPV 
X xr/iXiVV Aiu&AvjUo r is VKSoKPA 


12027 


25928 


A 


12137 


1 OJ 


i 


lxEAETOSFAOAGVQWC^LGSLQSSWVTE 
LAAAS 


12028 


25929 


A 


12138 


158 


1 


LRRGGVFS I FFCGGTMVI*S PADKTNVKA 
AWGKVGAHAGEYR A R AT .PP MPT ..Q 17 


12029 


25930 


A 


12139 


76 


3 


KVLARAIRQEKEIKDIQIGKEEVK 


12030 


25931 


A 


12140 


157 


2 


FFSSPRIjIaKRRPGNFTX3AT£j^ 
ETES PS IAQAGEQWRDLGSLQAP 


12031 


25932 


A 


12141 


234 


388 


I^QL»MSANLFFVLLFEAF,FHSVNRLECS 

r , MTtJ7MJr t Kn" trr nr»eet?on7i omr»/-\ 
x wiUiUNJUrixjPvjo bES PASTSQ 


12032 


25933 


A 


12142 


112 


453 


LGRRQAASMREG I S I HVGQAG VHIGNAC 
WELYCLEHGIQPDGHMPSDBCTIGGGDDS 
FOTFFX^xX^GKHVPPJVAFVDLEPTAID 
EVCTGTTOQLFHPEQLITGKEI1AA1>TNYA 
RG 


12033 ! 


25934 


A 


12143 


44 


443 


AKLGTRKLPLKAKMGKE ITL INI WIGH 
VDWGJTTxTTGHLIYKOGGIDKRTIEKLE 
KEAAEMGKGS FKYAWVLDKLKADRERG I 
TIDISLWKlxDTSKNYVTIIDAPGHRDFI 
KJ^ITGTSHADCAVLTnAARV 


12034 


25935 


A 


12144 


3 


386 


REAATMRECI CIHVGHAGVQNGNACWEL 
YCLEHGIQPDGHMPSDKTIGGGDDSFNT 
FFS ETGAGKHVPRAVFVDLE PTGI DEVR 
TGTYRQLIiHPVQL I TGKEDAANNYARGH 
YT T G KR T T DT .VT .TjR T P 

X X XVJXNJZi J. XLVXJ V XXU/lVXtV 


12035 


25936 


A 


12145 


3 


386 


AGATY I DRLRVALFEATTFTLWRP ELS 
SGEATTMRECMS IHVGQAGVQNGNACWE 
LYCLEHGVQPDGHMPSDKTIGGGDDALN 
TFFSETGAGKHVPRTEEVDLEPTE IGEE 
GTGTYRQLFHPEQHMM 


12036 


25937 


A 


12146 


3 


385 


GRATYIDRI^VALLEATTSTLRLRRELG 
SREATTMREG I C IHVGQAGVQNGNACWE 
LYCLEHGIQPDGQMPRDKTIGGGDDFFN 
TFF I ETGAGKHVPRAVFGDLEPTDIGE I 
RTGTYRQLLHPEQLI 
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M 
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location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, {^Phenylalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^Glutaraine, R— Arginine, S=Serine, 
^Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


12037 


25938 


A 


12147 


11 


381 


HTPEGRTRVPVL IAGWYCQEALRDWGR 
VTASSTGAMAFLRSMWGVLTALGRSGAK 
LCTGCGSRLRSPFSFVYLPRWVSSVLAR 
CPKKPVSS YLRPSKEQIiPI FKAQNPDAK 
TTELIRRIAQR 


12038 


25939 


A 


12148 


321 


509 


YSMDQYFIPPYCQIMHLCPFIiKTRVSLL 
FP WLE CSGATSAHCNFCLLS S SNS PAS A 
SRVTGIT 


12039 


25940 


A 


12149 


51 


400 


AATMRECIS IHVGQAGVH I GNACWELYC 
LEHGIQPDGQMPSDKTIGGGDDSFNTFF 
NETGAGKHVPRAVFVDLEPTVIDEVRTG 
TYRQLFHPDQL ITCKEDAANNYARGHYT 
IGKB 


12040 


25941 


A 


12150 


25 


399 


EATTSTLGIiRHQLGSRESTAMRECIS IH 
VGQAGVHIGNACWELYWLKHGLHPDGQM 
PGDQTIGGGDDSFDTFFSETGAGKHVPR 
AVLVDLEPTVIDEVRTGTYRRLFHPEQL 
ITGPEDAANNYAR 


12041 


25942 


A 


12151 


1 


402 


TKEEELCLACRQRRADGGWRSQMNAGSD 
PVGI VSAARTI IGSFNGALTAVPVQDLG 
STVT KKVliKRATVAPED VS EVT FGHVLA 
AGCGQNPVRQASGGAGIPYSVPAWSCHM 
ICGSGLKAVCLAVQSIGIGDST 


12042 


25943 


A 


12152 


3 


400 


TDRIiRVALKKTTTSS FCFPHHLGSRETA 
TMRECIS IHVGQAGDHIGYACWELYCLE 
HSLQPEGQMPGDKTIGGGDDYFNTFFSE 
TGAVKHVPRAVFVDLEPTVI DEVSTGTY 
RQLFRPERLITSKEDAVNIY 


12043 


25944 


A 


12153 


2 


398 


GRCALDRNTF I FARFGCYL I ASGHPGEK 
LMDMDMS PLRPQNYLFGCKLKAENDYHF 
IVANDENEHQISLTTASLRAGANDDMNI 
VEAEAMNYEGTPI KETLATLKMS VQATD 
SLGGSEITPPSVLRLKCGSR 


12044 


25945 


A 


12154 


203 


405 


FTCPSRI ICVISRRISPCCAPDIiNPMLS 
ANAMLDFVFTVEDPGGWDSKNLQKKWSH 
YSFLTGIRPKI 


12045 


25946 


A 


12155 


229 


397 


TKIAHHKAGFALISKKITKNLKMFLSKF 
LPIHALWVTGSSGMQPYPLVWGHYDLGK 


12046 


25947 


A 


12156 


2 


399 


GGVPHCVWATAWGMRPGLPGPTGLCAQT 
SSRGQKSVLKQKESCGIWQLYHFLSRKQ 
EPRWEPCVSGSSSGBGAVADLADELRGY 
PALCCTLPVHSYRSWAGIRPQIMNGPLH 
PRPLYALLDGRDCTVEMPIL 


12047 


25948 


A 


12157 


85 


414 


ALLPQSEALQGAVTMPHSYPALSAEQKK 
EVSDIALRIEAPGKSILAADESVGSMAK 
RLSQIGVENTEENRLLYRQVLFSADDRE 
KKCIGGVI FFHETLYQKDDHGVPFVR 


12048 


25949 


A 


12158 


85 


407 


GLLPHSEPDQRAVTMPHSDPALYAEHKK 
DVSDIALRIGSPGKGILAAYESMGSMAN 
PLNQMGVENTKQNRRLYRQDLFRAEDRE 
KKSIiAGVI FFHDTLYHKDDNGVP 


12049 


25950 


A 


12159 


85 


406 


VLLPHSEALEGAVTMPHSYPALSGEHNK 
ELSDIALRIVAPGKGILAADESVGSMAK 
RLSQ IGVENTEENRPLYRQDLFS ADDRG 
KKSIGGDLFFHDTLYQKDDNGGP 


12050 


25951 


A 


12160 


275 


146 


EKTFIISFFFFFETESHSVAQTGVQWHD 
LGSLQAPPPWFRRIA 


12051 


25952 


A 


12161 


212 


2 


KRSRPFFFFFKEMGSCYIAQVGMQWLFV 
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peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
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ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=Pro1ine, 
Q=Glutamine, R=Arginine, S=Serine, 
T^Tbreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 














GTITACYDPBLLGSKDPPTSASQVTGNT ' 
GTHTHTQLIFVFLV 


12052 


25953 


A 


12162 


253 


2 


GRVDS BCCLANGNAEKS I FCCLLVYLFY " 

LSSRQSHSVAQAGVQWHDLGSLQPPPPG 

FSLLSRVAGTTGARHHDWLIFVFLGET 


12053 


"25954 


A 


12163 


228 


1 


LLGLWFEQIiGRFTKIAKTKGERSQTQSG 
HSMDMRVPAQLLGLLLLWFPGSRCDIQM 
TQSPPAVSASVGDRVIITCR 


12054 


"25955 


A 


12164 


122 


1 


RLLFFE TESRS VARLECSGAI SAHCS LR 
LPGSGSSPASAS 


12055 


"25956 


A 


12165 


239 


350 


GQARWLTPVI PALWEAKASKSI.EVRSLR 
LAWAGHGGS 


""12056 


25957 


A 


12166 


235 


359 


TDKAISKRDLSVLREIIFFEMESRSVAR 
MECSG VI SARCNL 


12057 


25958 


A 


12167 


209 


396 


QFSGI PFDQYKCGFKNYI YIY I YIYI YI 
YIYIYRKYTRAYTKRYMCVFSRATHNIV 
YLCAYI 


12058 


25959 


A 


12168 


327 


130 


GGVGFLGGPGEKPPRPSPFFFFFFEKES 
RSVPQARVQWPDFGSLPAPPFGFTPFLR 
ESFLFSSLMI 


12059 


25960 


A 


12169 


317 


422 


RGEKPLFFFLQKKXKKXKKKKKKNIKKG 
GRRYKPS 


12060 


"25961 


A 


12170 


260 


375 


EKVTCS WTQWLTPVI PALWEAKAGRSLE 
FRSLRSAWAI 


12061 


25962 


A 


12171 


304 


3 


KFFFFFKGFFFLGGVGPIFPPPKKRFFS 
KIPPGVFFFPPLKKKIFFFFPPVIIJGPP 
RVFFKGAPLFFFFFFFFFFFSETEFRSC 
CPGRLECSVAVSAHCK 


12062 


25963 


A 


12172 


105 


3 


LI FLRQSS ALiLPRLE CNGAI SAHCNLCL "~ 
TGSSD 


12063 


25964 


A 


12173 


146 


17 


RIFFFFFLGQSLAVAQAGVQQRDLGSLK " 
APPPGVHAIIiLPQEY 


12064 


15965 


A 


12174 


234 


2 


LFPRKPKPSVSLSLSLSLSLFRLRKLSS" 

FMRHNNIEVRPINNPAMTSKCSSERKSL 

VSFTLNLKPC»4IRLSEEGMLK 


12065 


"25966 


A 


12175 


292 


380 


VRGLGRVAHTCNPSTLGGQSGWITGGQE 
F 


T2066 


"25967 


A 


12176 


109 


1 


GRIKKVIKGQAQWLMPVIPALWEAKVGG 
SPKVRSSR 


12067 


15968 


A 


12177 


170 


291 


LINFYFYFLRQSHSVTQTGVQWCNLGSL 
QPPPAARRRRRG 


12068 


"25969 


A 


12178 


260 


380 


LTYIALFI FLRRSLTIiLPRLECSGMISP 
HCNLCLLGS SDS 


T2069 


"25970 


A 


12179 


54 


166 


PKRGFHRVTQEGLNLLTLGSPRLGLPKG 
WDHRQDPLP 


12070 


25971 


A 


12180 


273 


359 


APAGHGGS CLQSQHFGRLRQADHLGSGV 
R 


12071 


"25972 


A 


12181 


27 


225 


IGQAQWLTPVILAFWEAKAGGSLEVRSS 
RPAWPIYFLVLLASYLRRFCLIQGHSDL 
LYTSSTRGS 


12072 


25973 


A 


12182 


1 


155 


VHKIFIAYKYSGFQTVFRGRAWWLTPVI 
PALWEAKANRSPEVRS SRSGRRG | 


12071 




M 


12183 


91 


2 


FNLFFFFFFETGS RSVARLE CNGAI SAC"" 
C 


12074 


25975 


A 


12184 


122 


2 


GQAQWLTP V I PPPWEAKVGKS PEVRS 3R ' ' 
RTRGRTRGRTRG 


12075 


25976 


A 


12185 


389 


20 


TDRGRRARRAC FCGKVFDGELS FALKLA 
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peptide 

sequence 
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nucle- 
otide 
location 
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*? ins in 
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last amino 
acid 

residue of 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
KNLysine, Lr=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 
v^vjiuiainiue, r-at gin we, o— oenne, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 

IIUCICOIIUG UCICIIUD, V~p05SIUIG 

nucleotide insertion 














REMGRPDWRAMLAGMSSTEYADWHRFYS 

DMHPLDFSLLNRREADBEPEDDVLMQKA 
AGIAGGVLFLI 


12076 


25977 


A 


12186 


180 


393 


LNPGSGGCGEPRSRHCTLAWVTERBSVS 
KNKIKIKINKQIEEGFLFQPFGQGFIWC 


12077 


25978 


A 


12187 


37 


443 


PDFPI PFPPRKVQHLARPS PKTARPAWT 
IRRILPPPPKKRHPPGLARGKFGPRE1A 
KAGLAKECPAGGPTPAVRFATPTRGGGG 
RVAGFQTPMGI PMGKARLGLLTFLGFAS 

un J.vji\ x KJr Kll JL liLXJijCLLi V iTi Jjy r 


12078 


25979 


A 


12188 


3 


411 


AFP ENAATGS T FQD P VPAS KGRAEVGNM 
RLS VAAPISHGRVFRRMGLGPE SR IHLL 
RNLLTGLVRHERI EAP WARVDEMRGYAE 
KLIYYGKLGDTNQRAMRMADFWLTEKDL 

x tr JVL» v y v LtfUr it X JSJJ^lVnar 1 KTUjU 


12079 


25980 


A 


12189 


2 


414 


QE FGTRKRATF I SLLFFFSS VYSRGVFR 
RDAHKSEVAHRFKDLAEBTLRALTLIAF 
AHYLHQCPFADHVXLGNEATEFAKTCVA 
DESAENCDKSLHTLFGDKLCTDAPLRET 
YGEMADCCAKQEPERNECLLQHKDD 


12080 


25981 


A 


12191 


3 


514 


PRLLMEAGPHPRPGHCCKPGGRLDMNHG 

RRHTPAPTRPRKPDIaQVYLPRHRDVSAH 
PRNPDYEESGESSSSGGSELEPSGHQLF 

VSAHTPLDPPMREALKLRIQEEIAKRQS 

QH 


12081 


25982 


A 


12192 


182 


3 


RGLFFIPPPPIKTKIWGPFWKVVFMGFG 
PPLFFFFEMESRSVAQAGVQWHDLGSLQ 

2VT.P 
nur 


12082 


25983 


A 


12193 


53 


400 


PETPSWIiGPVRRFYFIIIKSVWKWKKKK 
KKKKKKKKKKKKGGGPFKKTLGGDKFYW 

GGKCGAPPPKDISCLWGKKNFLGAIGEK 
TCCC 


















j± 






-a 
J 


VCIT P TOT APPKlTKTntfWRPT.WJrVATTi'KriFfS 

HP I F FFFEMESRS VAQAGVQWHDLGSLQ 
ALP 


12084 


25985 


A 


12195 


200 


378 


KSSKSQWLGYLFIGLLIYLFIKFFLETR 
QAA 


12085 


25986 


A 


12196 


341 


54 


SLSFHGGLSVLCIFSTYKKLSQEQWLTP 
yiSVLWEVKVSRSPQVRSLRPGDPPASA 
SQSAGITGVSHHARPHLSLNPRLEMIKL 
SEEGMLKRMTG 


lZUoO 




7\ 


iziy / 


1 


40Z 


nnPDPPPDMTTTVZVl/T *PT.ZV\7T PT HVCnTiDtJ 
PWHODEPPO *3 PWDR VMTiT ATA YVTV5T .KT> 
SGRDYASQFEGZAJ^KQlJ?imLl^WDS 
VTYTFSKLREQLGPVTQEFWDNIiDKETE 

QKELTFTAES EP VP 


12087 


25988 


A 


12198 


80 


1 


SNKVFFLETESCSVAQAGVQWCDLGS 


12088 


25989 


A 


12199 


221 


3 


TMGIMLD KKE IQAI FS FKFKMGHAVAET 
TCtnimTSGPGTANKVTVQWWFKKFCKG 
DESLEDEEHHGRRIiEV 


12089 


25990 


A 


12200 


239 


336 


FGALPITTCKFGKKKKKKKKKKKKKKKK 
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nucle- 
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peptide 
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Amino acid sequence (A=AIamne 
C=Cysteine t D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R D Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possibie 
nucleotide insertion 














KKKKKKKKKKK 


12090 


25991 


A 


12201 


279 


366 


KKKKXKXKXKIGCKiCKNKKXKKKKKKGF^ 


12091 


25992 


A 


12202 


153 


40 


GFLFLI FFFFETKSRSI IQAGVQWCHLG ~ 
CTLAWATETD 


12092 


25993 


A 


12203 


317 


406 


LGKLKOtfGAVSGHCNLRLPGSSDFSGSA 
S 


12093 


15994 


A 


12204 


2 

• 


925 


RSGPDHELTLDSGKGCTRSMVPASAASE 
DRRKLPIIVEDEGGPTRSRACSSPARGS 
RPPPSAIGCSPVAQASDSAAGPARRTAL 
QSLSSWLGYQIDRHSVPVYVFKSPLFSV 
IMAPKHKSSDAGNLDRPKRSRKVLPLSE 
KVKVLDLIRKDKKS YAEVAK" T YfiKMP Q e 
IRE I VKKEKE IRAS FAVS P PTAKVTATV 
RDKCLVKMEQALHLWVEEMNRICRVPIDS 
NMLRQKALSLYQDFSKGCSETDTKPFTA 
SKGWLHRFRHRFSHHYKKKKKGIMAQVA 
VSTLPVEEESS S ETRMWTFLVSALESM 


12094 


25995 


A 


12205 


272 


1 


KMARAGLLVIEGKVWRTVYYRFATREER 
EGKMSTNLMNKLDTIGFDNKKDLLISVG 
DLVDRGABNVECLELITFPWFRAVRGNH 
ERQPGQ 


12095 


25996 


A 


12206 


166 


478 


ISASGLLPTS PLTGTSKLQDPNEHLNLL " 
MLNRVSLLLPRLECNDTILAHYNLRLPG 
SSNS PDSAFQVAGITESFTCE INALKDS 
SQVALWS FRCEMDHS S I CNR 


12096 


25997 


A 


12207 


102 


497 


PLLALLFGSQKTFLCLQLSLKPTRSSLL 
SPGSAGNPENEAPCPQLNPEATSLKKKK 
KKKKKKKKKKKKKKKKKKKKKARG 


12097 


25998 


A 


12208 


131 


2 


FGFLGGDKAKGKGPGPPFFFFFETESPS 
VAKAGVQWCDLGSLH 


12098 


25999 


A 


12209 


221 


499 


QTRHKLTNSRSGRTGLQ I L I KEQHSSGR 
WCX3TGDILWQSLEDTICYSVPKSLLDFL 
LFI YFETESHFVTRIiECSGAI SAHCNLR 
IaLGSSDSPA 


T2099 


26000 


A 


12210 


297 


398 


HNF FFFLKTES CS I SQAGGHW YDLSS PQ 
PPPPG 


12100 


26001 


A 


12211 


341 


452 


NWSPGLKQSSRLSLPKCWDYRREPPCLiA 
GTiTiAYTiFRR 


12101 


26002 


A 


12213 


214 


468 


FFFFFFWGKGVFFFFPPLEGRGGNSVYW 
TPPPRGKKNFPPLGLGKMGNNPPPPPPR 
FFFFFFKKKGVFFLGPGGVKTPSLRETP 
P 


12102 [ 


26003 


A 


12214 


223 


477 


CPTETQLQLIFAREKHKPSPVLIiDLDLY 
GFFFEMESHSVS PLECSGPISAQSNLYL 
LGSHNSPA 


12103 


26004 


A 


12215 


263 


55 


TPKRYGKMQTATYLPKSKI YI I ISDLRK 
QDYI VYKQSKKPEGWAQWLT PVI PAL WE 
AKAGRSPKVRSSR 


12104 


26005 


A 


12216 


328 


471 j 


GVIHAQVLHAHTHTHTHTCTHTQVHAHS 


12105 


26006 


A 


12217 


434 


1 


WDHSAS PPTSVGTPLRADGPHPRliTAPG 
HWGSSAMLSAPAAGRRGLSLGRWCDRD 
KLSQRGPQPKWILWWGNLPGGGFIiHRHI 
HKPLPTIPIPSTVRGEEVLQQLLHSASF 
ILFIFI FLDRVSLCHPGWSAVARSRFTT 
TSTS 


12106 


26007 


A 


12218 


304 


43 


EKP IGVAPS RVKKRAGGGEKTECCIiLKG 
PriQGFQGVLKGPLKLNGPLFFFFFLRRS 
LALFRLECNGT I LAYCNLCLLGSSDSPA 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=G!ycine, H=Histidine, I=IsoJeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proiine, 

0=f*liifaminp T) =: A r*on nmp Q=Spnnp 
\£ VJlUUlUHlICj XV nl glUlllc, O — OCI 1I1C, 

X=Threonine V=Valine 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nitHp.nHrip rfplptf rtn V=nnc<ihlp 

■ IMUwVWUV UulVUVU) ~l>ll9olUlv 

nucleotide insertion 














SBI ! 


12107 


26008 


A 


12219 


171 


365 


ALVQPRAGREAVTVPACCSCSDVFLYET 
NKVARIOS INYGTI KFFHVT VFSYVS1TE 
HLKRKLEK 


101 Aft 




7V 
1 1 


tzzzu 


1 H\ 
1/1 




ARYKFIiHPKPATOIiYPS <1S O I OH A R MOT, 
WILIKCSFYLLFLRQCLTLLPQLECSNM 
ITAPLOPOLPGLKOSSHLSLPSSWDYRH 
EFP 


12109 


26010 


A 


12221 


323 


458 


FRVQRStiFLFIYFFETESHPWQAGVQW 
HDFGSLHRISP 


12110 


26011 


A 


12222 


189 


56 


DRVSLVQPTLECSGTISVYCSLNLLGSS 
DPPTSAS PVASTTGGI 


12111 


26012 


A 


12223 


246 


494 


FLCSSYAYDDH^LRPFFPIHLILDITAF 
WIAPVIPVLWEAEVGRLLEVRSSRPAW 


12112 


26013 


A 


12224 


55 


353 


RIQGCS CWVKLCPKVEKWVQRTDAEQES 
fn , in^PTonMK'OPT^AVNMMniri7ap qapt. 
ERKINKMMNra^xCTHVKV 
ISVAFYKLPGTVIRL 


12113 


26014 


A 


12225 


199 


23 


LKLSSIYYVPGTILKYFTGWARWLLPVI 
PALWEAEVGGSPEVRSSRPANSTSWIER 
HO 


12114 


26015 


A 


12226 


369 


14 


DCVRVGLSYPQTVCPSCESVRKAPFSSR 
PHRGVLESGQPRCKSRRVDI FFS MERER 

nRRRPPRRRRRR PRROTTRTPP VWrV5RP "R 

X^IvEjXVI-i rvC»IvCiI\ fi fx. n»X\.Vj^X\X\, X tvK V rVOVjuJvEi 

KLGYSHAWGWRGGRGLGWGGMCGIPPPC 
TSDVHQ 


12115 


26016 


A 


12227 


206 


2 


QNGHMNGGPCRVSLSLFLETRSFLFFLA 

<?T iRMT ,Ff*<5C3 A TT* AHTTOT.PT.Pf3 Q QMQ pjq 
0 1 h> 11 1 1 ri uovanl Xlrunv^xx) IJtl\ i-lJrVjQ 0 IN o cno 

ASQVAGITGACH 


12116 


26017 


A 


12228 


231 


501 


RSWEGQALHGSDPIjASCTRIQSNYMAIjQ 
RINQELEDKLYRMGQHYEEEKRALSHEI 
VAmSHLLEAKVTIDKLSBDIJVSAAHTW 


12117 


26018 


A 


12229 


86 


455 


DLPTFPVPPHPGFSGPLLGQS ELPSNCQ 
Tr^RSTTTJ^VfiVTir^TTiATTPT/^T.nPPTr 1 

GTSQLPLGPI^IiAfflvSELAJGIiSLPGl^^ 
NCTSQRLPDRPHRHSDAPHTffnrrOTHT 
HTHTHTPATVT 


12118 


26019 


A 


12231 


211 


96 


KSHFKAGGGGPPRYS PPFGGGGGGVFRV 
GGFPPPGLTQ 


12119 


26020 


A 


12232 


228 


3 


KRWFIKGFLSKGLGPPPPKLKFLGFFFF 
FFFEQSFVLVAQAGVQLECSGTILVHFN 
LYLPGSSNYPASASHPRV 


12120 


26021 


A 


12234 


137 


3 


VPLPGTLKNHSMVPCFLFVCLFVCYETE 
SHSVAQAECSGAISAH 


12121 


26022 


A 


12235 


139 


1 


FKLCFGQAWWFKPI IPALWETDASRSLE 
VRSSRPADAWADAWADAW 


12122 


26023 


A 


12236 


80 


1 


VLLI FI FLEEMGFHHVGQDGLDLLTL 


12123 


26024 


A 


12237 


187 


494 


TERNS FNINKKD I HTKTPFVGYQHQRPK 

DRSSLPATEQSWTENDFDELTEVGFRRS 
AITNFSELKEHVLTRRTE 


12124 


26025 


A 


12238 


252 


503 


CQGVGADDLQSQAEIQCELCAESIDRFD 
ILFFFFIiGLGPPHKNPPPPTPQKWKPGK 
APPLALWDIPRGAKGPGRQPGRPAIHTC 


12125 


26026 


A 


12239 


2 


471 


R IAIiC P AVRIRHEERE RJBRE RERERET P 
GHTQLS PGARRTPPLERERAFNDCFSLN 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leuctne, M=Methionine, 
N=Asparagine, P=Proline, 

0=i»llltnm Jn*» R=sA roinino C — CA-tna 

v \jiuimmnt?, ivvi.rginiDe, o — oerine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

fludPAtiHp rlplpfinn \rznnccihla 
uulicvuuc u etc 11 U 11, \r^UUiolUIc 

nucleotide insertion 














YPGNLCPGDLIEVFRSVYOHPATiYT/rnr' ' 
YFINITPVYGI PAFFTSAKSVFFSRALE 
KTHLLKDWGSDTYRINYICYETLPSLS 
VEEI ITRSEFVTGQEA 


12126 


26027 


A 


12240 


14 


149 


FMTACRIRHEMGRGLLRGLWPLHIVLWT 
RIASTIPPHVQRSALPH 


12127 


26028 


A 


12241 


588 


671 


RSRPSFQEQAFBSSQKyKEGKYI IELNH 


12128 


26029 


A 


12242 


384 


464 


RGGQVW WLKPWPALWEAKAGRS LEAG 


12129 


26030 


A 


12243 


325 


459 


TYL F I YLF I YFETE S CS VAO ACIVOWP r>T . 
VSLQPLPNSPALPTN 


12130 


26031 


A 


12244 


323 


457 


SLQRECGPADTFFEMDSCSVAQAGVRWC 
DLGSLQNSP 


12131 


26032 


A 


12245 


241 


466 


GALFFFFFFFFFFFFFFFFFFLSNKAVY 
F 


12132 


26033 


A 


12246 


385 


37 


QARRRCLQTSRMEALGRWSYKVTI EFP L 
W**J* r ** i-jjivx j\v Jir\jXrULl car tic uaB 
YEEGORLGKSTVSWVFNEDTRLINIWNR 

PLQA 


12133 


26034 


A 


12247 


318 


456 


LiLKYFMYVc^YVHMYvrr/rvTirQpQWAn 

±JJ- A. i 111 W V — 11 X V XUI t V K^Tl X J. Vii^ 

DGVQWYDLGSLQPP PPGL 


12134 


"26035 


A 


12248 


362 


473 


GAIIDALQK 


12135 


"26036 


A 


12249 


310 


463 


FS I KTRKNALGKLQS LLRCHQ FYL»YVCV 
CVCVCVCVCVCTRTYAI LTCIAV 


12136 


26037 


A 


12250 


169 


54 


EAKSGQARWLTLVI PALWKAKAGGCPEV - " 
R^QPT-AWPTW 

I^O Olvlinn JET X rl 


12137 


26038 


A 


12251 


49 


449 


GIPATSTSCVQVILLPQPPLLLGLQDSF 
LPQE III KVEGEDTGSLTI PSQEGVNFK 
IVTVDFTREEQGTCNPAQRTLDRDVILE 
NHRDTjV^wnTiATHvrsinrnQTcvoD t tmo 

« ns\±JXJ v n \j Ut\ X x* V "CT IxX\*sO X J Jxs/lC X C UK 

EPANGVKI ERFTRDDPWLSSC 


12138 


26039 


A 


12252 


130 


428 


RWGTjAMTiPRT.VT.NQlJCrw/TT DDr*nnT7r»Tv" 
TFGG F PNP PGHWKLLGG I SSGEEAG VEE 

RTTRTCRWSLIjWKAH 


"12139 


26040 


A 


12253 


204 


54 


PGME2CTQKSDDGQVQWLTP VI IjGLWETE 
AGDSPESRS SRPAWAKHKI VR 


12140 


26041 


A 


12254 


375 


470 


SLCLFFS FFFQTEPRCVAQAGVQWRHLS 
SLQP 


12141 


26042 


A 


12255 


432 


512 I 


GQ VRWLT P VI PALWEAEGGRS PEARS S 


12142 


26043 


A 


12256 


232 


1 


KGALSKKTQSFWGGQKFFPRPGVFKGVF 
FWGGGFCFFVFGGFFFFFFFFBTESCSV 
AQAGVQWCSLGSLQPLPPRFK 


12143 


26044 


A 


12257 


22 


123 1 


RFVC^TIKVLPJ5LSSDRQMpr3PPT^?TQM 
SSSTRP 


12144 


26045 


A 


12258 


71 


362 


QTERNS IS INKKDVHTETPSKGHQHQRP 
KVDKSTKMRKKQRKKAENSKNQNASSPP 
KDHNSLPAREQNWTENEFDELTEIDFRR 
WVITNSSELKEHV 


12145 


26046 


A 


12259 


375 


3 


SPNIEAPPKAFFFFNQRHFVFCFFSCRA 
ENTHEIVWVKVYPFSNQSKAKLFFSFFV 
QRDRS S PDS YLMPIiQLiQMWDTAGQERFR 
S IAQS YYRS ANAL I LAYG ITCEES FRCL 
PEWLRQIEQYA 


12146 


26047 


A 


12260 


98 


1 


KYQIDLGGRGSSNS PASASRVAG I TGTH 
HHTQ 
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peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine y M=Methionine, 
N=A$paragine, IMProIine, 
O^^Glutamine. R = Arj*ininc S=Serine. 
T=Threonine, V=Valine, 
W=TrvotODhan. Y=Tvrosine_ 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


12147 


26048 


A 


12261 


125 


3 


LFLIQKYITGQERWLTPVI PAPWEAEAG 
RSPKVRSSSPAW 


12148 


26049 


A 


12262 


228 


385 


SCLYLQMI PSSI EKSQGMYKKATRGRAR 
WLTPVIPALWEAGAGRSPEVRSSR 


12149 


26050 


A 


12263 


227 


369 


MLKHKSNKNLQDLGQMQGLTPVNPVLWE 
AKVGGSPEVKNSRPAWPTW 


12150 


26051 


A 






487 


OONRFFGTERGRGTGRGVSS FPOLSPG 

LELASSPSPSVGRIiGSAftSQRSRGGQBR 

PKAQLRRPPRPSSSPRLRGRKNCAHHAR 

TPGGILDRPPPTLGTNLLKGGRPAWMAG 

NPPEAHKTANGPGEQA 


12151 


26052 


A 


12265 


391 


476 


PRS YRVECSGS I S AHCNLGLLGSSNPS S 


12152 


26053 


A 


12266 


378 


476 


MSSRLGS IiEVGILGRAWWIiTPVI PAL WE 
AKEGG 


12153 


26054 


A 


12267 


166 


58 


NFFFFFFE TDS CS VAQAGVQ WHDLS HCN 
LCLPASSM 


12154 


26055 


A 


12268 


209 


49 


TPPNFYI KI SRSGOAOWliTPVI PAT i WE A 
EAGRS PEVRSSRPAS PTRRNS I SNS 


12155 


26056 


A 


12269 


127 


1 


S SSFFPLKTFGFLFFFLCFFESES RSVA 
QAGVQWCHIiGSIiQA 


12156 


26057 


A 


12270 


154 


27 


IQTPGKKEFFFFFFETKSCLVAQAGARL 
CPKKKEKENSNFVL 


12157 


26058 


A 


12271 


180 


22 


VLKIHNIGRAQWVTS VI PELWEVKAGGS 
QALRSSRAARTRWDACNFVEKTQVY 


12158 


26059 


A 


12272 


207 


422 


GDSSTRGADEKPKEGVKTDNNNIi INUCV 
MGQDCSMEQFKI KSHI PAGYGGSHRESQ 
HFVMPRWADHLRSGV 


12159 


26060 


A 


12273 


59 


479 


NSLGGGGVYGSRFRFTFPGCRALS PWRV 
RVQRRRCEMSTMFADTLLI VF I SVCTAL 
LAEGITWVLVYRTDKYKRIiKAEVEKQSK 
KLEKKKETITESAGRQQKKKIERQEEKL 
KNNNRDLSMVRMKSMFAIGFCFTALMGM 


12160 


26061 


A 


12274 


354 


439 


VKTGECS IAKTWNQPKCPTTIDWI KKMW 


12161 


26062 


A 


12275 


366 


499 


TRAQYFWWGIGSQCVTLECSGMIMAH 
CSLDLLGSSDPPTSAS 


12162 


26063 


A 


12276 


136 


37 


KFPGQVHWLKPVTPALWEVKAGGSPEVR 
SSRPA 


12163 


26064 


A 


12277 


419 


48 


DGKLNVSFLTVSSSGCFLIFIiLIVRPPY 
FLRYNNS E IRP INNPTMTCKCS S KRKSR 
ICLTFNQKLEIIKLSEEGMSKIKTGQNL 
GILCHVSQVVDEQHKFVKATKSATAKIN 
LSISKYIMSGA 


12164 


26065 


A 


12278 


190 


1 


RKGVFSPQMGPGLGKGKSPFLAKLNFPF 
KKTFFFFFETGSCFVAQAGMQWLDPSSP 
DAWADAW 


12165 


26066 


A 


12279 


276 


389 


WFNLSFFCFCFLFFETESCSVARIjECSG 
AISAHCNLT 


12166 


26067 


A 


12280 


495 


i 


PPHPPPGKPPPPPRPELFFPKTITMKTfPR ' 

VLFFSPPPKKNFFSLTTPRFFFSPPQKK 
KKKU^PPPOKGPPPP VP I KPPPP LPFFP 
FFFFFFFFFFFFFFFFFFFFFFFFFFFF 
FFPFFFFS FGLTVRVTMTCWLXVDIGIiIj 
I 


12167 


26068 


A 


12281 


253 


90 


IWPPQGSFKTAAPFFFFFVFLVEMGFCH 
WQAGVKCiI»TSSSPPASASQTAEKLF 


12168 


26069 


A 


12282 


61 


387 


PKSSSSAPFQISSWFSLPASTSASTMSI 
RVTQQPYKVFNSGPWAFSSSSYMSRPSA 
HISSLIVSRVGSSSTSFQGGLGAGMGLP 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HHHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
1 — inreonine, v = vaiine, 
W=Tryptophan, Y=Tyrosine, 
a— un Known, —Mop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














un^a jjANKAKljSy iUSJUCRGGRSKE P 


12169 


26070 


A 


12283 


186 


305 


bHHSWQVREEEHKSLHTQVHTHTHTHT 
HPHLVIMSSHS 


12170 


26071 






Z 


190 


PRVRFDRVNNRSLNPLISRTMRIEPIPE 
KKKKKKKKKGGGA 


12171 


26072 




!ZZo3 


1 7 


197 


Si^vijCjtjguuGGVHFHPCKPFIVLSPVS 
LSKHCSYLPKKKKKKKKKKKKKKKKKKK 
KKKKKKKKKKK 


12172 


26073 


A 


1ZZ5D 


I Jo 


405 


GVASYAEQRKYFLEMEYFPGGELAISIVE~~ 
MTTEDLHYS INLDDKAPTGFERVYSHFE 
RTCTDDKMLS YCITCYRE I FCERNKL 


12173 


26074 


A 


12287 


364 


142 


GCAFFLGGPLKERRKRRAVP PLFFFNFF 
FETES CS VTQAGVQWCNLGSLQPLLIDW 
wvjy pvjTvahachpgtlgg 


12174 


26075 


A 


12288 


171 


380 


VFEAGHAWLQENLTSPDFWGFSFFPTDT 
GMLEKKKKKKKK3CKKRKKKKKKDR 


12175 


26076 


A 


1990Q 
1ZZ07 




z 


GPRGFWQFWVTKGFFPKKG FVKFYP PAG 
GWFFFFPKRGFPFPSFLFFFDRVWLCHP 
GWISVARS 


12176 


26077 • 


A 


12290 


179 


416 


LIFLFTFACAVCFQLARFHFKIHIETAL 
RCLI I LS CPFTHPIiYKMYREGQARWLTP 
VIPALWEAEVGGSPEVRSLRPAC 


12177 


26078 


A 


12291 


3 


408 


1AYGFHDRTTYDTPSALMMVLLVDBLIT 
GNKNGSGEAPEYLPEDFRDGEDDAAVTL 
EKQEDLTTLVTLPVTIiGEHQRQRETQLE 
AKLLKKRLELGSLIiDLVEDLELr IQLKK 
lufcKKKKS PIjWGAP FKGPLGAP I 


12178 


26079 


A 


12292 


119 


278 


PKSIEAGVK5VLDHPLPNKAKTHLSCKK 
LKKKKKKKBCK3CKKKKKKKKKKJCK3CKKKK 


12179 


26080 


A 


12293 


119 


3 


MLFTGFHTKVMSLNSITGWAWWFTPVIP 
ALWEAEVGKS 


12180 


ZOUOl 






205 


1 


ILHFYSTETKQPKGLFLALFYSDCEQEK 
RRGGKLKKIFKKAQGQARWLTPVIPAFW 
EAKVGRS PEVGS 


12181 


26082' 




izzyj 


OK 
ZID 


1 
1 


AF K. J. KJL» i'MHl Fi> Si FltCLPQDE FHPF I EA 
LLPHVRAIAYTWFNLQARKRKYFKKHEK 

KTOo JM2i ir rxLtAKJ-ilSvjW \2 


12182 


26083 


A 


12296 


185 


411 


DVPLMFHWLHLHSDFRI ISTE IGVETVG"" 
KRVLHIRILLLLLLLFFEMESCSVTHAG 


12183 


26084 


A 


12297 


281 


388 


RILFFFFFETKFCSVPQAGVQGRDLNSL 
QPPPPGFM 


12184 


26085 


A 


izzi/o 


no 
1 /<s 


•a 
J 


tjjLiS vxy vfiJjPGSASSPIiIjFSRKDI ki 
PTSVFFSFQTEPRSVTQAGVSWCDLGSL 
QP 


12185 


26086 


A 


12299 


199 


450 


TLIRHMICNYFLLSHRSPFSSVDYFLCY 
REIYSLIKTLNKLGIGGFFLNIIKVMYD 
KFAAfal IRSSEKQKAFPtiRSGTRIRCPL 


12186 


26087 


A 


12300 


3 


407 


EFKDHSTAMDTEPNPGTSSVSTTTSSTT " 
TTTITTSSSRMQQPQISVYSGSDRHAVQ 
VIQQALHRPPSSAAQYLOGMYAAOOOHL 
MLHTAALQQQHLSSSQLQSLAAVQASLS 
SGRPSTS PTGSVTQQSSMVQTS I 


12187 


~2&m 


A 


12301 


227 


3 


KRGTEGQTL FFPAPRARKS F P P P KMP KK 
VEKEFPKGFFFFFFFETESAPSPRLECS 
GAISAHCSLNPHASAHAS 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Clutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N~Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, | 
X=Unkoown, *=Stop codon,/=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 


12188 


26089 


A 


12302 


292 


380 


KCYRWARWLMPVI PALWBAKAGGSLEPR 
N 


12189 


26090 


A 


12303 


13 


186 


VDKRHIRLSKMAWVALYLLSLLWATAGT 
STQTQSSCC32YDVFPRRSGKQLPWGRGC 
DL 


12190 


26091 


A 


12304 


253 


409 


ITAVIAAAAAWGVTI IMTNFRSLPRLE 
CNGAI S AHRNLLLIiGSS DS PASAS 


12191 


26092 


A 


12305 


153 


64 


GRAQWLMP I I PALWEAKAGG PLDVRSSR 
PA 


12192 


26093 


A 


12306 


265 


420 


GTLMSRNTKLQLCRMNKSRTLGQPWWIiT 
PVIPTLWEAKAGGSLEARCSRPAW 


12193 


26094 


A 


12307 


295 


1 


RPCLKKKKTSPHWKKKFANPLFGKAPVF 
PLDKKPFKPHYRESPRPLKKKNGPKGFF 
FFFFETESCSVTQAEVQWRDLGSLQAPP 
PGS YHSPSKLI I KN 


12194* 


26095 


A 


12308 


247 


421 


SWCCLKKKKKKKKKKKKKKKKKKKKKKK 
PGGGALKKKKIFS 


12195 


26096 


A 


12309 


168 


3 


GQALLLLCLTVAFSKTTVVCHAPILGWA 
WWLTPVIPALWEAEVGRSPEVRRSRP 


12196 


26097 


A 


12310 


117 


3 


VAYNIAVlfRGGVSLLLPRIiECSGVISAH 
CNLRLPCSS 


12197 


26098 


A 


12311 


256 


101 


EPPTPFFFFETGSCS ITQAGAQRRDHSP 
PQSPTPGIaSNTPAPASRACESGR 


12198 


26099 


A 


12312 


146 


3 


FVALCIQLFPLVNVICFHKKHKTGRARW 
LMPVIPALWEAKAGGSPBV 


12199 


26100 


A 


12313 


207 


3 


VRTOTTGVIMALRGLCFSVEESMTKDEL 
IARLRSLGEQLNRDVSLTGTKEELALRV 
AELKQELDDSR 


12200 


26101 


A 


12314 


282 


2 


NCLAEKVKFMVSAISLQPQGISILFLSF 
FFEKESCSVPQARVQWPNFSSLQSPPPG 
FKLFFCIiSPGGGGCSEPKSCRCIPGWQQ 
NETPSQKKK 


12201 


26102 


A 


12315 


298 


84 


KT PQP PVFALS KNFGPPKTAP I FFFFFF 
FETKS YSVAQAGVQWRDLGS LHS SLGHR 
ARLHLKKRKRKSINL 


12202 


26103 


A 


12316 


124 


1 


SKGHFFFFFFSETES CSYAQAGVQWHNL 
GSLQVPPPRFTPF 


12203 


26104 


A 


12317 


132 


398 


WTNAVFSIVTFSPSVCHSEVALAAYKW 
LVCYLLRETYQKLNQE I KP PTLS VP KKK 


12204 


26105 


A 


12318 


262 


392 


PHYCQNQPQARRSGSHQHFGKQRREDHL 
SliGVRDQPGQYSETP 


12205 


26106 


A 


12319 


168 


3 


EAPKAI SNCTTALQPRLQGKNLSQNKEG 
QVQCLTP VT PALWEAKVGGS PEVRSS 


12206 


26107 


A 


12320 


48 


385 


DliGLSGHSTLASSFISTJiTJiSYKKCLSP 
STMIMRLPLPCGSPIKPPFLPKKKKKKK 


12207 


26108 


A 


12321 


218 


83 


NLSPIFPEGNFFFFFFEMESRSVAQATV 
QWRDLGSLQAPPPRRKF 


12208 


26109 


A 


12322 


197 


1 


KETWPFPPKKEQGLFSFFKKRIFFFFSK 
KIPSPPFFFFFFFETESCSVAQAGVQWH 


12209 


26110 


A 


12323 


128 


36 


KSLFFFFEMGVSLLPRIiECSGTTSAHCN 
LP 


12210 


26111 


A 


12324 


140 


3 


I LNSNLKP IKKNTFFFETESRS IAQAGV 
QWHDPSSKQPPPLGFKR 


12211 


26112 


A 


12325 


273 


369 


RKEWFLKKKKKKXiOOQOKKKKKKKTKKK 



1281 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A»Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
OGtyrine, H=Histidine, I=Isoleucine, 
K-Lysine, L= Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=Glutamine, K^Arginine, S^erine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














KKK 


12212 


26113 


A 


12326 


140 


415 


RTGLLGPGRLQGPPYGSREYKGSGLLDL 
NASLLDPEVWAPSLSGPETKPPAHSPSD 
PGVHSSCSASQGSQGPGPLTPSSRIHEP 
KPPGPFFK 


12213 


26114 


A 


12327 


2 


506 


FVPLSETEAPGCSGSRPPELEPERSLGR 
FRGRFEDKDEQLBEEEELEEEEEEEEED 
MS HFS LRLEGGRQDS EDEEERL INLSEL 
TPYILCSICKGYLIDATTITECLHTFCK 
SCIVRHFYYSNRCPKCNIWHQTQPLYN 
IRLDRQLQDIVYKLVINIjEEREKKQMHD 


12214 


26115 


A 


12328 


1 


419 


FRVPS CARCQ YLPLLRGASQLG YSGVRD 
PVEEAVCPFSDLKLCAGRTTTLFKAVRQ 
GHLSLQRFLLPFVWLCPAPRSGVYRGRQ 
ASLSCGGLHPVGASWPRCLPTQASAMAG 
TPPPASLPPCSLISDCCASNERGSMGV 


12215 


26116 


A 


12329 


259 


414 


NMENSLRCVWVPKLAFVLFGASIiLSAHL 
QVTGFQI KAFTALRFLSBPSDAVT 


12216 


26117 


A 


12330 


302 


404 


S CYRVS VI SQARWLTPVT P ALWEAEVGR 
IPEVGS 


12217 


26118 


A 


12331 


94 


375 


SRDMS PGLLTTRKEALMAFRDVAVAFTQ 
KEWKLLS SAQRTL YREVMLENYSHLVS L 
GIAFSKPKLIEQLEQGDEPWREENEHLL 
DLCPGWSAMA 


12218 


26119 


A 


12332 


7 


246 


NPIVDPSPCGGIRVRTPAGRGGPALRVR 
PETWEEAGEKMPSESLCLAAQARLDSKW 
LKTDIQNCFITRKISLLPLFCHHri 


12219 


26120 


A 


12333 


150 


1 


YIYFFHLVRGLPGSGYDEYFVVVVVNTE 
S PS VTQAGVQWCHWSVQHKPP 


12220 


26121 


A 


12334 


34 


426 


EPGFLFVFFFLIRGKKTPLFFWGTRKNP 
KSGKPLFTPPGKS PSEPPKSRGGPLFFQ 
TRSRLTPQSWFGGAPKPPLGGEPPKFWE 
AGLGGDPPTPRAPHRADQTLEQPIQAPT 
RIIQNEQWAIKPFQPMPGG 


12221 


26122 


A 


12335 


185 


3 


TVTTPLHFSDGHRARLWSRVPAPPAENR 
SMEGGI/3RAVCLLTGASRGFGRTLAPLL 
ASHE 


12222 


26123 


A 


12336 


60 


316 


GWGPI PSTDVTVLS PQKPI LLQGHERAI 
TQIKYUREGDLLFTVAKDPVSVGWRGSG 
KCiCiXi-iiiPtfKWVil I P VLiPIjAGNQRASHCE 

tr 
Jv 


12223 


26124 


A 


12337 


3 


167 


SFRIQVQGNHTSKHYPKIQYSDQAQWLT 

o V JL trAJLiWISAIv 1 ajKo-Ue* VKo J-iKAt KAP 


12224 


26125 


A 


12338 


238 


401 


RPLS VMCGRLTS I RVEDSGARCWF YLF I 
I al JS rKb VAQAG VRKKDI/35 JjQT PPSG 


12225 


26126 


A 


12339 


354 


433 


GWAQWFMPVT PALWEAKAGVS PEVRS 


12226 


26127 


A 


12340 


238 


382 


LL I VYNMMLLPDAVAFTCN S S TLGGQGR 
RITRGQEFETSLANMVKFHL 


12227 


26128 


A 


12341 


101 


1 


KTKQTNKKTRSVGQTWWFTP 1 1 PAYWEA 
QAGEL 


12228 


26129 


A 


12342 


276 


23 


GSCLLEGKLTNRKDIHTKTPSVRHHCQR 
PKIDKTTKMGRNQSRKAENSKNQSSSSP 


12229 


26130 


A 


12343 


122 


I 


YMGVNERGCGQS I FKSSLSSQLWWWAP I 
IPATQEAEAGEW 


12230 


26131 


A 


12344 


180 


360 


LHWYFFSVGDFLPPPPPPLDDSSALPS 
ISGNFPPPPPLDEEAFKVQVRAEVKVML 
GKS 
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SEQH> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQD) 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^Alanine 
OCysteine, D=Aspartic Acid, 
E=Gtutamic Acid, F=Phenylalanine, 
G=Gtycine, H=Histidine, I=Isoleucine, 
K=Lysioe, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
rwninfn minp- P:=sA. rtrinine S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 




12231 


26132 


A 


12345 


35 


411 


TMTIVDKASESSDPSAYQNQPGSSEAVS 

PVPGAVVYSSSSVPDKSKPSPQKDQALG 
X\CIT &PPOKVT.FPSEKT CLKWGQTHRVGA 
GLQNLGNTCFANA 


12232 


26133 


A 


12346 


278 


386 


T T YCYHKOT T I GOARWLTTL IPS LWEAKV 
GGSPEVRS 


12233 


26134 


A 


12347 ' 


111 


2 


V <z OTfiOVOWTjTP VI PALWEAEVGGUjEIi 
RSSRLAWA 


12234 


26135 


A 


12348 


215 


316 


UsTKRLMlSFIKKKXKKKJ^ 
KKKKKKKGGPIi 


12235 


26136 


A 


12349 


265 


404 


SNISYFLNNNFSKALSIiNIKYMISQARW 


12236 


26137 


A 


12350 


372 


1 


PVFPLPPKKLGEQLPPPALRFIiAVSPItP 
tr a am?nR r tckkvlavhknp I D PVYGFP K 
KGPTNFFIiFPLFQRVIFlJGAQKGVSWFG 
FYWnTxSTJVPKLGGPPFFFFFETESRSV 
AQDGVQWCDDGS 


12237 


26138 


A 


12351 


318 


416 


Ar;D aoWTiTPVTLTIiWEAKAGGSSEVRSS 
RPAPP 


12238 


26139 


A 


12352 


144 


1 


vrrjvft TVR C*WXj FCFVL FLROS LS VAQAG 
VQWCMLGSMQPPSTSIiVQAI 


12239 


26140 


A 


12353 


116 


1 


SGRYS FKKMKCGQERWLAS VT PALWEAE 


12240 


26141 


A 


12354 


102 


2 


ETRYKKQPGGWERWIiTPI I PVLWBAEVG 

l99JraV 


12241 


26142 


A 


12355 


112 


294 


LISPSCPMWPSPHGPTWLSRLISQHSPS 




12242 


26143 


A 


12356 


138 


2 


SFFLKVFFFFFFEMESRSVAQAGVQWCN 
t ncpfMkPPPCiS'RHCPVD 




12243 


26144 


A 


12357 


148 


394 


PGBPPGEKGEGGKGKKPGGRGWGEPPCR 
GGFKEKFPRGKEKAPPLKENSRENLGWG 
TPfT" PPIjGKKKPPPPGEGBMILGPPQF 




12244 


26145 


A 


12358 


97 


3 


EKFSPCFVRARTHTHTHTHTHTLSLSLF 
GFH 




12245 


26146 


A 


12359 


204 


2 


tfirvTP AACFAP CIjPDEAP I IAAAKPATT 
TSEQKMAVPPKYANI/3KSARNVFNKGYG 
YGLINLLKTKS 




12246 


26147 


A 


12360 


156 


391 


* NRGIjFKVEBS FS I PCS VRCS IHPSALVS 
PTPPTTOTTNWALFFETESCSIHQTGMR 
WRDLNLIjQPLPPGFKRFSCLNIj 




12247 


26148 


A 


12361 


179 


1 


ALKKLLI I DDNSHKTNI VKHFSFRNFTF 
LFLLETESRSlARLECSnANPraCNLRL 
SGP 




12248 


26149 


A 


12362 


218 


391 


T I LKDCTFKM I CLSQAQWHI P WPATWE 




12249 


26150 


A 


12363 


2 


356 


" TNSHVDNS ITQKPEFCI PRCCGYI VCQL 
QLVERAF I FKFFFFFLKRS FVLLP RLEG 
RGAI FRVTQEGSNLLTLGPAPLGLPKGW 
NYRGDHPGPAYFYFLMGKRLLQIQGGRG 

T TTPWT.T. 






ZoiDl 


A 


12364 


94 


1 


KNPPLFFFFFFFESESCYVAQTAVQWHD 
LGL 




12251 


26152 


A 


12365 


237 


2 


YSS LLFKPAES AKGI LRHHVTP KSTS AI 
SKILFCFLFFVETEFLSPGLKCSGAIKV 
HCSLNLSGSSNSPTSASQVPGS 




12252 


26153 


A | 12366 1 340 


442 


GLFVCFFVW FLitTi fc»£> L-o V LVAij v txri v^ui_i | 
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SEQ1J> 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

iCslUUC OI 

peptide 

eaiitionna 
octJUCIICC 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I=IsoIeucine, 
K-Lysine, L=Leucine, M=Methionine, 
N»Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
a— threonine, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














SSLQPS 


12253 


26154 


A 


12367 


208 


45 


EGGEFFFFFFSVFFFVRAGVLPCCPGWS 
iilPGSbuPPAIiASQSAGITGATVNST 


1ZZ JH 


ZD 1 J J 


j± 


lz3oo 


oU 


443 


CSSCVSSKPQSPHIjKKQGDRMGAHLISG 
GCTEATAGKGMRKCLVAWSTCAGARPPS 

D>TD/T.COmiJO 7\T> CV^fTiTJ T nrnnrar nnnnn 

KWKUoy^noAKrvliPKLPTVVRU 
QAKPFCKKWMLGGPIS IEPKPQHPPGPG 

KTTinAT7Xf5in\ BDP t .vdt 
AXVUW /UJilVrvVtr KJJ Vrl 


12255 


26156 


A 


12370 


341 


421 


SGWVRWLTPVTPALWEAEAGGSPEVGR 


12256 


26157 


A 


12371 


280 


393 


NTIDHYVTQWIiTPVTRTLWEAEAGRS PE 
VRSSRSAWPT 


lzzj / 


ZD I JO 


A 


12372 


268 


399 


TLCSLSSAES I IHS YPEFSCKDNYLMTH 
NEAYKACMRRAFHI PL 


1ZZJO 


OZ 1 CQ 


a 

4* 


12373 


209 


468 


KAVPUETSHPS PADSFSTSSNLACGTHH 
LRRLLKLYVEEELEYYPRKIQCFHFSFF 
FFESRSLLLLPRLECSGS isahcnlcll 
GS 


12259 


26160 


A 


12374 


381 


488 


TITCLFQKCQRGRARWLTPVI PALWEAE 
AGGSQGQE 




ZOlO J 


a 


lzJ /D 


276 


395 


GHVVlCrKKLKLI FVFFEMES CS VAQAGV 
QWHDLGSLQPPP 




ZOIOZ 




IZj /O 




472 


VYS&VAIjNTFTIjLCNHHH^ 
NKKCRGAILAHCNLHLLGSSDS PTSASR 


12262 


26163 


A 


12377 


390 


511 


PGAVAHACNPSTLGGRGGQITRIPSL 




Z0I04 




lzo /O 


41 / 


CI 1 

511 


AHRS FFEMES CS VTQAGVQWRRIjGSIiQP 
SPP 




ZOIOj 




iZj/y 


133 


406 


KLS LNKRD YKKQVCKETKKKKKKKKKKK 
KKKKKKPGAQKKKSGG5 PR 


12265 


26166 


A 


12380 


247 


345 


HMWSSQLNKGLINKKKKKKIOCKKKKKKK 
KKKKKKKKG 


1ZZOO 


ZD ID / 


71 


12381 


156 


15 


NFFFFlSTlSisCSSVTC^rKVQWFDLGSIiQP 
GKQEQNYVSKNKQKEIilH 


12267 


26168 


A 


12382 


275 


1 


KXVFLKGPPLFFFFKKKFVFFLKVGFKG 
GLNFIKTFPPGGKPISKKRQIFFFFFFF 
ETE PHS VARLE CSGTIS PHCNLHLP SSN 
DSLASTS 


12268 


26169 


A 


12383 


43 


379 


LLGYDESRSLSWICLCL^CLiGQLLSFLN 
LLVYVFCQTCEVCFHYFFQYFSTLFLLT 
r TrtDoIM VIjNXRHFvj 1 V\PQVXiDDIiVFFS PN 
KEI FLGQLQWLMP VI PAFWE PEAGRS PE 


12269 


7617ft 

ZOi /U 


j± 


lzjot 




l 


luji/r r r Bx^Ltaifr rjfuSfJxycb r PJPFFIiRT 
PPFFFFPPFFKNFFPPPPKKKNPFFFFP 
PPPFFFFFFFFFETRSRSVAQARMQWGD 
ttuo Iry £*y f .Fiji £> I/PTKP 


12270 


26171 


A 


12385 


141 


1 


KKEPLRAPPPFQLFCFFYFETTPCSVTQ 

J\D17/"lT*Tr" r MT.CCT./"M3T5DT DVtr 
hK V U « J-io a JjU r 1 Jr f LiK I It 


12271 


26172 


A 


12386 


31 


417 


CRLADSPSPNDTGQDSRGRAGIKHIPPIi 
KKKKKSGGGA 














17777 
iz,z /z 


ZOI /J 


7S 


Iz3b/ 


291 


Aft 

412 


LSSEMIjIjFCFVYFLRWSFTFVAjQAGVQW 
CDLGLLQPPLPG 


12273 


26174 


A 


12388 


310 


1 


MTPCPFLFFCPKKRKAGGGFI KKALFFN 
PQKBVFLGPPPKGPFFLGFQPTPPWGKK 
I PRLNPLFWRSS IFFFFLLDGVSRLLPR 
LECNG VI SAHCNLRLLGSS 


12274 


26175 


A 


12389 


117 


2 


KRNPQRGGPLFFFFFFFETESRSWQAG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, ENBistidine, I=Iso!eucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














AQWHDIjGSXjQ 


12275 


26176 


A 


12390 


117 


319 


SFLKKPKDSTKNLLKLINEFSKGSGPKI 
TIQKSVAFIYPNGEHSEKKIREVITFTI 
AAKTI KCPPQA 


12276 


26177 


A 


12391 


95 


2 


SKSWTSLKIjRTIjGWARWLTP VI PALWEA 
KAG 


12277 


26178 


A 


12393 


127 


395 


I PGS QG FCL I KQ S ALHLLNKS S FS FLS F 
IETQSLSVTQAGVQWCYLSSLQPPPPRF 
KLRQKNRLNSGGRGCNKLRAHHYTPGWV 
TERDS 


12278 


26179 


A 


12394 


252 


391 


FFCFCFVLLCF FiiMESCS VAQAGLQWPG 
LGSLRQGEAMLARLVSSS 


12279 


26180 


A 


12395 


275 


3 


TFRLVILKKSKETVEINCLETKSHSVAR 
AGVQWCSQDWWEEKAGFNWRPSKARRCP 
TSVLKCHLGWAQ WIjMP V I PALWEAKAGG 
SPEVRS 


12280 


26181 


A 


12396 j 


111 


3 


GNNVKIRGLRIRPLKGWACWFTPVIPAF 
WEAEAGG 


12281 


26182 


A 


12397 


124 


1 


VSRGRTVHQEKPTERAAS PRLE CNGTIS 
AHHNLCLPGSSNS 


12282 


26183 


A 


12398 


259 


387 


YQQESFSCLFIFIFIiFFIFYDTESRSVA 
QAGMQWHDLGSLKPP 


12283 


26184 


A 


12399 


103 


1 


KGRFFFPFF lETESRS VAQAG Vy WRDIiD 
SLQAPP 


12284 


26185 


A 


12400 


308 


416 


OGNNFE KAGRVRWIjTPVI PALWEAKAGG 
SLEVRSLR 


12285 


26186 


A 


12401 


92 


1 


HTWGLTQWLMPVIPALQEAKVGKLFBPR 
SS 


12286 


26187 


A 


12402 


265 


388 


GLS WLFRRPGHLFYF I F FETMS VSITQA 
GVGCCDLGSLQPP 


12287 


26188 


A 


12403 


279 


1 


SLSSKMESGSLTFSTIDIWGWI ITLSCC 
PGHCRMFSSIRGIjYPLDASSTTSSSCNN 

qnspdmgtcslvjsqllrrlrwedclntg 
grgcsepgs 


12288 


26189 


A 


12404 


387 


152 


NPPPGFNFGGPLKKNFfS PPkGiSKFVFIj 
KRPPPFFFFFFFFFETESHSVAQAGVHW 
RDLGSLQALKS FCDLVSAFKGF 


12289 


26190 


A 


12405 


262 


464 


IIFHWLACILVIHSSFDGHGLFLPIiAVA 
DSAAVKHSC 1 £> I CliN IK Y 1 PWSKjI AjboL. 
GISVLNFLRNC 


12290 


26191 


A 


12406 


204 


67 


KCFGQAWWLMPVIPALWEAEAGRSQELR 

QOTJOHUAtnTT T MTOTD'PD'TVO 

SoRPAWArUsijijWriK 1 KAo 


12291 


26192 


A 


12407 


134 


1 


IQRPCIQLFSTFFLIL.FYFIFTEIDSCS 
VAQAEVQWHDFGSLQP 


12292 


26193 


A 


12408 


145 


403 


TYMYSLINYYKANSHVTKVNKHNTARFL 
ESLLCAHPNPPIKLiI ISTiTJiRKKKKKKK 
KKKKKKKKBCKKKKQK 


12293 


26194 


A 


12409 


65 


415 


RNKRI ISQSWNGPSRKi^SSAIjLCGLGH 
LTSGVRARVSPVQGCIiVRIKGGSGPSKP 
KKKKKKKKKKKKKKKKKS SKKKKAQKGG 
ALKKK 














12294 


26195 


A 


12410 


124 


404 


KKKKKKKKKKKKKKKKKKKSSSSSS 


12295 


26196 


A 


12411 


164 


2 


TGAGLT IMDMVGVGPDLLQLRFCFVLFC 
FLVFFFETESDSVAQAGVQWCNPSSL 


12296 


26197 


A 


12412 


114 


3 


PGMVAHACNPSTLGGQGGW I TRGQVYKT 
GLAKTVKP 



1285 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
Q=Glu famine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12297 


26198 


A 


12413 


583 


391 


LADTGFHHVCSQADIjKPIjTSCSTRLSLPK 
CWDYRCE P PHLAVP DTVLS TRGL ISEKS 
FLPLCTFR 




26199 


A 


12414 


280 


2 


KVSSMimQTSVLFSTMMISETLKyRGR 
KKRRBRGREQGREGRERKDECRKVRSER 
IEGGKKRGRKE YREEGRRR IEVSANE ET 
PRDHPANK 


12299 


26200 


A 


12415 


264 


352 


RFL VHGCD I YI C I N I YTHTHTHTHSHTH 
T 


12300 


26201 


A 


12416 


359 


485 


TDARMDWCYQEGTGSQAWWLTPVI PAFW 
EAEAGGSFEVRSLR 


12301 


26202 


A 


12417 


171 


1 


PLFRWWGFELPPPPGYQKKKGYPPFFFF 
ETEPCCVTQAGMQWYDLRSLQPPPPEFK 
R 




IblKiS 


A 


12418 


265 


1 


WHKSNEELAAEASAPVKARASNTIIjGPH 
THGKERCPVSLRNVIiDEGVKSTNFFSFF 
ETDSHSVTQAECSGAISAHCNLHLMGSR 
DSGA 


12303 


26204 


A 


12419 


278 


388 


SSLTGFF I RRRNEE PGQHGETPCIiLKIQ 
KLAGCGGVL 


12304 


26205 


A 


12420 


205 


413 


WNLFVYLYQTHKYARVYSYNIILFTNKK 
RAIGQAEWFTP 1 1 PAVWEAKVSRLPEVR 
SVRLSLPKWRITS 


12305 


26206 


A 


12421 


103 


2 


RSIiSKHRTETILGLPAAVLI ILFPPLLI 
PTSKY 


12306 


26207 


A 


12422 


372 


478 


MriFVFLKETPCQSQWIiMSVI PTLWEAEA 
GGSLEPR 


12307 


26208 


A 


12423 


174 


416 


KMHYCVI*S AFLI LHLDTVAL I LS TCSTI* 
DMDQFMRKRIEAIRGQtLSKLKLTSPPK 
NYSEPEEVPPEVISIYNSTRDLLQE 


12308 


26209 


A 


12424 


572 


733 


RFSCLSLPGSWDYKNCLNSGSRGYSEPR 
SHHCI PAWVTEQDSVS KQNKTKQSA 


12309 


26210 


A 


12425 | 


123 


2 


GYI FI FI YFFETGSCS VAQAGVQWRNLG 
SLQLPPPSSSDS 


12310 


26211 


A 


12426 


224 


417 


TADFYGVNYISIKLLPKKKKKRGGPFKE 
SKFTAAGLQKNIFFLSAPNFIALAWLK 
RRDWETPG 


12311 


26212 


A 


12427 


115 


3 


FFETGSH P VTQTGMQ W CDCNS LQPLTNR 
LKQSSHLSL 


12312 


26213 


A 


12428 


250 


398 


PWAGHLTSLGLSLP I FNIGIAGQACWLM 
PVIPAIiWEAEAGRSLEVRSSK 


12313 


26214 


A 


12429 


215 


3 


KKGLVVWGANKLWVFTPQKFFLNQFSGW 
ALTPRGKSWFSKRRAPAPFFPFFFWETE 

o 1 tot m n n. ^"*»* /"vfvir tt\t 

SHSVAEAGMQWHDL 


12314 


26215 ; 


A 


12430 


345 


2 


FSHHPAKPQGGLFYRQPLGLEGKETRPR 
LNDRFMAGPSFPAQQGLRMPFLGKEWER 
EGCSFVQCPVEGFAMWPYPLGSPPGPRL 
FFFFFFEAGSPSVTQAECSGS ITANCSL 
EL 


12315 


26216 


A 


12431 


348 


3 


LYSSLGDRVKLRLEKTKTNKWCLHSYIS 
TCKJCINLDHYLIPCTKTNSEWIKYLNVR 
TNSIKLLEEDISVNLHELGSDNSFIiAMT 
PKAE 1 1 KE KT YKTjHP T ITT TfT FWV<5 pn T T 
EK 


12316 


26217 


A 


12432 


160 


52 


NTAIGHPHQAI YLS YLS IIYLS I YLS I Y 
LSIYLSIYL 


12317 


26218 


A 


12433 


180 


2 


GRS KLHI CREHS I CAI E HVCG CDTENNT 
NLCQVQWLMPVILAIWEAEAWRSPEVRS 
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SEQ ID 
NO: of 
nucleotide 
sequence 
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NO: of 
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M 
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od 


SEQ ID 
NO: in 
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09/515,1 
26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OCIycine, H=Histidine, I=Isoleucine, 
KNLysine, L=Leucine, Wt=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LRA 


12318 


26219 


A 


12434 


193 


3 


PGAENPPPKGGPPFFFPFPRGFPPGVFF 
IiKKKGFPFFFFFFFDTESRSVAQAGVQW 
HDLGSLQ 


12319 


26220 


A 


12435 


321 


422 


NVEGDFLQRIKNIRYWPGAVAHTCGPST 
LGGRGG 


12320 


26221 


A 


12436 


116 


2 


HDRFCLFVYDRVSTLS PRIxE CS S AI SAH 
CNLRLPGSSN 


12321 


26222 


A 


12437 


302 


400 


THTHTHTHTHTHTHSGLLYYLPKGKQSH 
EFCW 


12322 


26223 


A 


12438 


329 


399 


LALQARLECNGI ISAHCSLCLPY 


12323 


26224 


A 


12439 


310 


2 


KPILSRGLFCARPCFRRSNMRSFNPSQS 
YQVGKDTFGFTVKVTKIiKVLGPSRGPKG 
ARjQGVHKGKCLLTLPKMLRFLFFFETES 
CSVTRLECSGAILAHCNL 


12324 


26225 


A 


12440 


151 


401 


EGMPHLPGSHSSSNILKRGRESGREAGR 
KGRKERRKEGRRERRKEEE KRGRGKIRR 
RKDRGENEEEENNRLSS KVSLE ISTCP 


12325 


26226 


A 


12441 


276 


1 


FHPFGRPS VSPRVQPKIGI I IHTCVARI 
EKGGSCKALSLCPQYVGSNFFFFFETRS 
FSVAQAGVQRCDLGSMQPPHFPGSSDPP 
APASRVAG 


12326 


26227 


A 


12442 


140 


1 


NSHFPKNLGSPSSFFFFFYLBTESRSVA 
QAGVQWLDLRSLQPPPPG 


12327 


26228 


A 


12443 


310 


3 


LPPCSWGLYGAFARCPLMDDKQLSQVPL 
WACPLSTAGRTRLCVACIKAAGKAQGFF 
FFFETESHSVTQAGVQWHDLGSLQPPTH 
AKPKHTHTHTQTHT I PDT | 


12328 


26229 


A 


12444 


337 


2 


GFFAIVPKKGQKKMFSPFPKEGIFSGPC 
FISSTPTLLGVKKILPPNTSEIRPVLSS 
I INTSVHIREQKKTTYVGIFTGKSRFFF 
FETDSRSIAQVGVRWRHLGSLQAPPPG 


12329 


26230 


A 


12445 


112 


9 


GRVRWLTPAI PALWEAKAGRS PEVRNSR 
PAWPTR 


12330 


26231 


A 


12446 


172 


3 


IPGELLIGRAKFKAKNWPWAPRGIPLGR 
QKFSFFFFFETESRSVAQAGMQWLNIX3S 


12331 


26232 


A 


12447 


221 


418 


RKAERLEVFFRRLCQRALRP FPHCLAAA 
PMPLIVLKKPILGRAWWLTPVI PALWEA 
EAGGWLESRS 


12332 


26233 


A 


12448 


280 


392 


PLGRVRWLTTVI PAHWEALAGGS PEFRN 
LRPAWPDMA 


12333 


26234 


A 


12449 


410 


129 


PSQEIFPYPGPPGKTPFPLKKKKKNRGG 
GGGLPFPPPQKVKKKKFFYPGRGRFKQP 
KFNPLPPPRGEKKNPKPPPPKKKKKKEK 
KRKEKKLVT 


12334 


26235 


A 


12450 


159 


1 


KKKRGGFLGLTFI KKTGPDPFFFFFETE 
SRSVAQAGVQWYDLGSLQAP PPGIM 


12335 


26236 


A 


12451 


270 


1 


NPLKFFFFSLFQNPLKKDPAI FNPFFFF 
FPRIFWPFFGGVSKAGIWFFFFPFFFFF 
FFFFFFETESRSVAQAGVQWRDLGSLRT 
RGRTRG 


12336 


26237 


A 


12452 


249 


33 


DKLVKLPLILKKLNSFFFF FETES RSVA 

r\ TV"*t 7**\t«TO ITT OCT ZTD "D'OTar* 17VDmmVHU 

Q ITjV U W KN Jjuo J-iflr Jr ir e o c Rir JMAWAXArl 
AHAWWPGIiPAQNLR 


12337 


26238 


A 


12453 


190 


86 


TNDLGVrTQWLTPVIPALWDAEAGRSLEL 
RSTRKA 


12338 


26239 


A 


12454 


318 


2 


P PTFPLKTIiFFP VFLP KLFSTLFS P KKK 
FSNFLVS AP PFFFLTI1GNSQGVLI1KGGP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
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od 
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NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A^AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Add, ^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine f V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=^Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FFFFFtfb'FETESRCVSQVGVQWRDLEKK 
EKKKCNSNGGRHYISTHLKYK 


12339 


26240 


A 


12455 


204 


288 


IQLNTHTHTHTHTHTHTHTHTNET6KMW ' ' 

I ' 


12340 


26241 


A 


12456 


409 


13 


QGGAPPIPIMLSFLQEAGLLHVAQPGVK " 
LLTSGDPPSPPSKSAWRQRQETRLNLGG 
RGRSEQRPPPRPPAWATQQDSVSKTNKQ 
TKNNKKIRKSLCKKTLEKEMLAVMI ITK 
YLLCTDVPSTVLCTHLYYLT 


12341 


26242 


A 


12457 


204 


3 


KKKSPTPQSAKFPLGGGVFFFFLVGMRS 
PFVAQAGFKPLGSSDPPIIASQSTAITG 
MNHHTSSAFH 


12342 


26243 


A 


12458 


289 


3 


KAREKKKVGATKNIRIVDLSKGRTSLCK 
KTEMPPVGNRNKQKPQNPSNLRISTRPE 
TESRSVTRIiECSGAISAQVILPGSSDSP 
ASNWRQSETLS 


12343 


26244 


A 


12459 


261 


388 


SWHFGRSKQVYCLSLGVRNQPGQNPVST 
KTTKISQAWWHAPW 


12344 


26245 


A 


12460 


313 


391 


RVGRVHWLKPVIPALWEAVAGGSPEV 


12345 


26246 


A 


12461 


112 


7 


HICIYFFETESRSVAQAGVQWHDLSSLQ" 
PPPAGFK 


12346 


26247 


A 


12462 


389 


3 


ALHEGAARGSPPPGGKKKKRGSPFPMGN 
QAPQVPGKLGGGFGFWFFKKSLPIiQPGG 
KPGVFYKPKKVPTPKDPPPPTLWGGWGT 
TGGPPGPLFFFFFFFETESRSVTQAGVQ 
WRHLGPLQPQPPAHAS 


12347 


26248 


A 


12463 


26 


349 


CIMIDSiTCI FIHCWWBCEMAHPFWKAV 
WQFLFFFFKRGVYFFFPGGIKGPGFGLR 
EPLPPGIKGVPRPNPLGGGDLRPPPPPP 
INLGFFKGKGVFMVWPGVGNFLI \ 


12348 


26249 


A 


12465 


187 


54 


GYKNKPGQAQWLTPVIPALWEAKAGGS P 
EVRSSRPAWP I WRTRG 


12349 


26250 


A 


12466 


199 


55 


1 1 QLRQQNS FL FI CFD IKS CS VAQAGVQ " 
WHNLS SLQLEVPGFKHAPSH 


12350 


26251 


A 


12467 


107 


3 


FAHCMGNS FFS FFFFFFE VGS CSVAQA 
GVQWSD 


12351 


26252 


A 


12468 


120 


1 


GNWCRAQWLTLVIPALWKAEVGRSPEVG™ 
SSTRLGLPTHRP 


12352 


26253 


A 


12469 


105 


2 


YYLYF I FFLFFLKTGSCS VTQAGVQWKD 
LRSVTS 


12353 


26254 


A 


12470 


238 


405 


FHI CL FI LKNQVS KMYTHTVS CSLLE 1 1 
YCCLLQWLTPVI PALWETKVGGSLEVRS 


12354 


26255 


A 


12471 | 


425 


1 


LNPAPAIFFGGPKKKNFFSPPRGDKFFF 
FKRAPPFFFFFFFFFFFFQKKKKPFFPA 
QKKGRGFPGRALSQWPLAGFLKITQFPP 
WSRVPFSPLLFFFFDMBSPSVAQAGVLW 
HDLLHLPGSSDPPTPASRYAGITGVCHH 
A 


12355 


26256 


A 


12472 


122 


2 


QNKYSELRMNTFGRPQWFTPVI palwea 
EAGGSPEVRSSR 


12356 


26257 


A 


12473 


144 


420 


GAPRP ISRPRGKTPGGGGKRGKPNKNGL 
GENPFFPKGVKGKNPPIFFPFLGQNGPT 
PLVLGSANMGFYLYLKRKGPGKGREALR 
AEPGQPFY 


12357 


26258 


A 


12474 


80 


1 


KGGQVQWLTPVI PAFWEAEAGGSPEV 


12358 


26259 


A 


12475 


54 


294 


LIKSWQHRQEYVSLMSFIFFPCHIWYQS 
QRSPGTTQIILEISDTGKNKYFNCTGSH 
RKHTHTHTHTHTHTHTHAHTHTL 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
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sequence 


M 

etfa 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

tng to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A- Ala nine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginiae, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, A=possibIe 
nucleotide deletion, Wpossible 
nucleotide insertion 


12359 


26260 


A 


12476 


364 


234 


EEMGFHHVSQDDLNIiLNSWSACIiGLPKC 
WDYRSETSRLASLPF 


12360 


26261 


A 


12477 


288 


450 


PGTVAHACNPSTLGGPGWSRTPGIiKRI I 
CLGLPKCWDFRRELLPSLPCLLIFL 


12361 


26262 


A 


12478 


271 


396 


TISFLSPFFLFLPFFFPFWRKSGRESRF 
SQKKKKYKKKNIYY 


12362 


26263 


A 


12479 


289 


2 


GSCLLERKLTYRKD IHTKNPS VPHHHQR 
PKVDKTPKTGKKQGRKTGNSKNQSASPP 
PKECS SS PATEQKWMENDFAKLREKGFR 
RSNYSEliQEEI 


12363 


26264 


A 


12480 


323 


1 


YSMVYPYVFLTAKKDQLQVNNTQLTCKS 
YQLYHCXNHSTLQTHNISTLMILGHIPG 
LCIPVNLSEAWAATPALHFVKLLIiTELT 
HHVCRAIiDI I ILAIVSLVALITS 


12364 


26265 


A 


12481 


279 


429 


MNGQRKCVYVYTLYTYTTEYYSTLKRRK 
WPGTMAHACKPNTLGGHGGWTA 


12365 


26266 


A * 


12482 


308 


423 


LTFFKNITGFFFFFFETEFCFVPQAGAQ 
GGDLSSLKAP 


12366 


26267 


A 


12483 


272 


474 


MVSWGLSRAKIPWVI^SFFFFFEKKSHFF 
FQGGGRGGDIiSWEQPIjPPRSKGFSCFTL 
PSTRDYRPAGP 


12367 


26268 


A 


12484 


84 


229 


LFKSVCHLLPLSSCSSHVRQACLPFTFC 
HDCKCSRASPAMLPVQPAEL 


12368 


26269 


A 


12485 


392 


3 


TGKKKRFPFFPKKKIFNSKPQFSWRKKK 
LFKIFSPPGGAPFPPGVQKKKTFFFFYP 
FSPRTPFFSIFFPGKGPPKGGGLGPPFF 
PQKRGGGFPNS KWETWEKFPPLFFFFFF 
ETESHSVAQAGVQWHDL 


12369 


26270 


A 


12486 


76 


1 


KLIS WT WWHMP WP ATWRAEAGD FS 


12370 


26271 


A 

* 


12487 


204 


8 


FFIFFFFFFFFFFFCFKTGSHSVAQAEQ 
BWHDHGSLQPQPPQSFHLSLPCATMPGL 
FFLFLFLFT 


12371 


26272 


A 


12488 


96 


3 


KKPLGQAMFFFFFETESCSVAQAGVQWC 
DL 


12372 


26273 


A 


12489 


126 


1 


KPPHP FLPFFFFFETESHSVAQAGVQWC 
YLGSLQAPPPGFTP 


12373 


26274 


A 


12490 


107 


2 


IAGRAWWLMLAI PTLWEAEAGGSLEPRS 
SQPTCAT 


12374 


26275 


A 


12491 


302 


2 


GGFGFFPPREKGGFFQTVLFGVPPGFFS 
PPVFKTGPGVFFLGAQKKKIFFPPPGGK 
IWFFLRGAPLFFFFFFFFFFFEMTSCSV 
AQAGLQWCDLGS LQHP 


12375 


26276 


A 


12492 


248 


398 


PTEQVTIiGITAQSYSRVHIKNRVYDLAV 
GSGHPDGAAAI KGS FVQRLKS Y 


12376 


26277 


A 


12493 


341 


1 


KTSHFREIYPLEHASSLSKKIETGALPC 
SQKLLNSQENSVMNKFPCLNQLPFQDVN 
SCI IRFQTKDTLKHTAIQPEAKTLSLPC 

N 


12377 


26278 


A 


12494 


95 


3 


PLFFFETKSCSVAQAGVQWQNLGSLQPP 
PP 


12378 


26279 


A 


12495 


62 


420 


CAAKLCTEAHTAAS ADTHTS PHGSVSGL 
FCFHFPPHRREQRSRPGRKPGAHRLAGR 
ALSQKPVGSGATPHNLHHQIRTQTNGLI 
QliLGDRQAPWVTPVLPVLWEAEAGGSLE 
ARSLRPA 


12379 


26280 


A 


12496 


58 


492 


NSPPPPAPSQRTSPPAAAASPTTGSSSA 
PCPASSSWPRSSIALITFYPDQPCGLSF 
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peptide 

sequence 
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nucle- 
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peptide 

sequence 


Amino acid sequence (A^AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=^Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 














NNLAWKQNVLCFKERLLEKMFRRIQVLG 
LLVWTLIAGTBYFRVPAFGWVMFVAVFY 
WVLTVFFLIIYITMTYTRIPQVPWTTVC 
LCFKR 


12380 


26281 


A 


12497 


2 


198 


RLQGILSAGFLVPGDTSVSSPRPRAAGS 
QSSTSRVLWKLSGAMLRLYVLVMGVSAF 
TLQPAAHTG 


12381 


26282 


A 


12498 


71 


346 


GMFMTSKETFTHYQPQGNSDPAHTATAP 
GGLSAKAPAMTPLMIJ>TSSRKIjVAWDGT 
TDGAAVGILAVAADQTSTTLTFYKSGTF 
RYEDVLWP 


12382 


26283 


A 


12499 


122 


59 


HQIYIYIHTHTHTHIYIYIYI 


12383 


26284 


A 


12500 


220 


3 


P KKES VRCWAMKW I LLILCQTS LHWS L 
PQSSLAATPCCLNIjGTVDYLQGIiGAVAH 
AWNPSTLGGRDGWITS 


12Jo4 


26285 


A 


12501 


125 


1 


HVLVLSHFLFVLFLFt'ijTi'KSRSVTRPKR 
SSAVSAHRNIRLS 


12385 


26286 


A 


12502 


373 


476 


ILTRDGVSLLPRLECSGTISAHHNLCLL 
GSGDYP 


12386 


26287 


A 


12503 


440 


573 


STWBDHLSLGNRGCSEPRLHLCTPTWVT 
E 


12387 


26288 


A 


12504 


146 


5 


HPHI#QTMNGAPI PQELTPUjEKERDGLR 
CRGNRSPVPGIPPRWSRDL 


12388 


26289 


A 


12505 


140 


44 


AQGLTPVI PALWEDEVGRSPEVRSSRSD 
WPTR 


12389 


26290 


A 


12506 


247 


420 


LBISVTFSCSFGCLPKKHIHTHTHTHTH 
THTHTHTHTHTYIRTIYTTLARPLSALH 
NF 


12390 


26291 


A 


12507 


346 


473 


KAY I IGLNCFYETKSHSVAQAGVQWRDL 
NSLYS 


12391 


26292 


A 


12508 


87 


414 


RTGVYRVGKDGRSRSPDLLICPPLGLPK 
CWDYRRE PPRPACLPLLSHPS PPPHFS F 
LSPFLSFSIiPCSPFLYIFFPPPLLPISL 
RWRLrl PEGRFMAPLCLQMPLGHDT 


12392 


26293 


A 


12509 


386 


467 


LLLFFEMESHS ITRLECSGAILAHCNL 


12393 


26294 


A 


12510 


344 


472 


KFLPFDPAVSLLGIYPKESKSFYQKDTC 
TRMFIAILFTIANTW 




26295 


A 


12511 


424 


132 


RGEGPPRSGILSPVGPPGETPPFLKKQK 
ITRGGGGGPLFPPLKRVRGENSFPPGGK 
SFHGAKFPFCPPPWATKRNSVSKKKKKQ 
TKKGSRI ILYNNB 




ZoZyo 


A 


12512 


351 


510 


GGTDFCKQS KLQS LQ PAVE I F I PGRVQW 
LTPVISALWEAEAGGSPEVRRSRPA i 


12396 


26297 


A 


12513 


362 


463 


NTITWLGAVAHACNPHTU3GPGGRITWG 
QEFETS 


12397 


26298 


A 


12514 


137 


1 


EEISLLLPRIiECNGAITAHHNLHLPCSR 
NRRPGKAVHVRQAYSFQ 


12398 


26299 


A 


12515 


251 


592 


GAFTGLAFTMAGGRPHIiKRSFS 1 1 PCFV 
FVESVLLG IVTLLAYRLEFTDTFPVHTQ 
GFFCYDSTYAKPYPGPEAASRVPPALVY 
ALVTAGPTLTILLGELARAFFPAP PLAG 

V\T 
rv 


12399 


26300 


A 


12516 


3 


413 


SWGGRKFLCPPRSLSGSRELHPAQGDRP 
GPLSSSGKRETGTHRETLGKKKVSSAPE 
AQGAGLRLSQALGGLCYDLSPLTEPRLP 
LAATAFPRPCPALPHPQPRVTMGSVSSL 
ISGHSFHSKHCRASQNKLRKSSHLK 
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amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acta 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D^Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleurine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Pro!ine, 
Q=Glutamine, R=Argintne, S=Serme, 
T^Threonine, V=Valine, 
w— Jtryptopnan, x— tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12400 


26301 


A 


12517 


115 


2 


KSHGGHGVEVLVSVIiVGIiAMEVGLVLWV 
KEYDGAAIiA 


12401 


26302 


A 


12519 


444 


34 


YCLRQKHTEYRRRSNIiBMASSCLSGRKT 
Ux c ex ii * yJw LilvN x> Kija msLr x w yHo w -uru> 
RPLAPNSKWEAKEKFLKKI KSVTS LNT 
YILRKPNSLIADMENILVIHIEYQTSYN 
IPSSQRLIQSKAI1CLFNYI1EMECRI 


12402 


26303 


A 


12520 


1088 


935 


ILIIFINYRCEFIiRSKKSSEEITQYIQS 
YKGFVDRTVMilNJoNr r xj1U?1i1JXxj 


12403 


26304 


A 


12521 


2 


347 


AMAAGSRTSUjLAFALLCLPWLQEAGAV 

r^rnrrrtT pm trTYTJfAlun" /~\7VLrD7\TJf\T .TV TTTTVO 

QTVPIiSRIirDHAMl^AHKiUiyijAJLlJl iy 
EFEETYIPKDQKYSFLHDSQTSFCFSDS 

WLE 


12404 


26305 


A 


12522 


50 


205 


WGESDRQSSGFWASSAHACNPTWEVET 

uuro X*JS>U<oWlr<Xjyi_. X i"ir*w V ADinr 


12405 


26306 


A 


12523 


105 


2 


RFVCS T I KVLRDLSSDRSNPGRFLSTSN 
SSIiXEK 


12406 


26307 


A 


12524 


279 


455 


KTKNWLADGGPPCFPGQYGETPSLLKNE 
RGC 


12407 


26308 


A 


12526 


305 


454 


Xjx U LiXir src\i X j.viKs'iy* iir r vr v^r c ra iilo 
HSVPQAGVQWRDLGSLQAPPPG 


12408 


26309 


A 


12527 


363 


459 


UT J. iAiyAW WXi'l'f V X XjAXjVV JVrtJ&fujrio tr ct v 

RGSR 


12409 


26310 


A 


12528 


182 


36 


Ulix, VV r liiVCixivjs^ v rvw xj x jt v xoAiiniVAuA 
GRSPEVRSSNAAWHACRNSA 


12410 ' 


26311 


A 


12529 


388 


530 


KSFLNFFPRDGILNCVPQAGIiELGSSDP 

T1TV 0 1\ T7»T3T7"A/""«U"1 'CT 7T«TlIIJO^\T 

PAfc»AJ? K VAla X XovWiltloyx 


12411 


26312 


A 


12530 


134 


1 


GHKKGVLFFFFKTESRSVAQAGVQWCTL 

PCTADDT.DIHATPPPV 
oo Liy Jr ir LuriUlA 1 lv fK. V 


12412 


26313 


A 


12531 


230 


1 


KKFGFFPQAGPQGGLHLFTTIiAPWGQAI 
SQKREKPFFFFFFETEPHSVARLECSGT 
ISPHOJLHLPSSNDSQASTS 


12413 


26314 


A 


12532 


145 


3 


KKLE I LSAI I PPKFRKEI FLFFFYETES 
CSVAQAGVQWRDJjGoIjUAP 


12414 


26315 


A 


12533 


365 


2 


LFFFTSLIWQITLGFLILNCPCIPGVCI 

VTnPAM'PPDgpa pr t?»FM17T 3VVWTT.TT.PT. 
X X U IT AN 1 J? tvKJ? AT lolT X JN IS X A I « XXlXXirXj 

TKQDYTNLI KLI KKHSFFSNFLKQLLFV 
LQPPPPRFK 


12415 


26316 


A 


12534 


313 


475 


FFWSCSSFQQGFGGGITI III ILRWIIiA 
LSPRLERKWRDLGSTjQPSPSGFKRFF 


12416 


26317 


A 


12535 


124 


3 


mpcjt TMf2ZiPT.<3VYPn\7prjr , TfPnr!Vf T VPTi 
CVCVCVCASTCM 


12417 


26318 


A 


12536 


333 


58 


AOIPSKVRSCRPAWSNMVKSCLSKNAKI 

«pv PLJuwaMiyp t*' v ^'vypT.VRMM'PtfHVflOTi^ 
XlVJSViWnrai.Ni; r f vr »r u v Aiui'ir xm voyuo 

IxKl^TSSDLAASASQSYGITGVNHYAQP 

AK1K.UOCK 


12418 


26319 


A 


12537 


214 


3 


' SDPRRACRCPKEAPEQQRRLPGGSVPAL 
SIFFFFKTESCSVPLSPRLECSGVISAH 


12419 




j± 


lZJJO 






MNFIiAFTNPPRPPQQILKPLLFILRRES 
HSVAQAGVQWCDLSSLLPPA 


12420 


26321 


A 


12539 


195 


1 


IFIPPQAQKRGDPFLSFFFFETRSPSPR 
LECSDAITAHCSLHLPGPGEPPTPPIPK 
SWDHSHVPP 


12421 


26322 


A 


12540 


151 


334 


LLGPJjRHKNHLNPGGGGCSGTIMALCNI 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
. nucleotide 
location 
correspond 
ing to first 
amino add 
residue of . 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A= Ala nine 
OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Glyctne, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIntamine, R=Arginine, S=Serine, 
T=*Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LLPRSSSSPTSASKVAGITGACHHACCL 
RYFLY 


12422 


26323 


A 


12541 


314 


402 


NLNSGPAWWLMP VI PALWEAEAGGS PEV 
G 


12423 


26324 


A 


12542 


339 


3 


GSCLLEGKLPNRKDIHTENPSVHHHHQR 
PKVDKPTKMGKKQNRKTGNSKTQSVSPP 
PKERS S SPATEQS WMENDFDELREGFRR 
SNYCELRED IQTKGKEVEI FEKNFERV 


12424 


26325 


A 


12543 


31 


365 


RSAVMPTEDRATWKFNYFLKI IQLLDDY 
PKCF I VGANDVGS KQMQQ IRMSLRGKAV 
ALKGNNTMMRKAIRGRLENNPALBKLLP 
HI RGNEGS VFTNEDLTE IRDMLLGNKV 


12425 


26326 


A 


12544 


278 


410 


FS FKNVTAGRVWWLMP VI PALWEAE VGG 
SPEVKRSRTTLANMVK 


12426 


26327 


A 


12545 


88 


276 


EVPQAHRKLPSAPQLRPLQAISTKGPCP 
PQSPQPSPRPEGSSAHSRGEKRTRERGK 
EEKRKKK 


t2427 


26328 


A 


12546 


251 


3 


PVEDNSKRRFQTGEEKVTQEEQEALTPW 
RTNFFCFSDRVSVAQARVQCCDSGSLQP 
QLPGSSDLPTSASQVAGTTGMRHHAC 


12428 


26329 


A 


12547 


106 


2 


DEHLPFLTS FETESCS VAQAGVQWRDLG 
SLQAPP 


12429 


26330 


A 


12548 


225 


1 


GFFKKEPHPRPCKKKTPPKLKPFWGGLN 
KKIPLSFVFSKKGPLPKKFFFPFFFFET 
GSCSCHPAGVQWCDHGSIiQ 


12430 


26331 


A 


12549 


218 


3 


VPPPGIiVPGLPQIIPKGFPKTKGPPKGP 
PPGDKQKSGPPNKKNFFFFFLRRESHSV 
AQAGVPWCDLGSLQP 


12431 


26332 


A 


12550 


276 


24 


GPLKMVEAPPQNPLSPKNKIFFFFFFET 
ESHSVAQAGVQWRNLRLPGSSDFPASDS 
RG AG ITGARPHVQL I FVFLVKTGRKRR 


12432 


26333 


A 


12551 


331 


440 


KSASSWSFNSKGWGWWLTFVIPALWEAE 
VGGSLEVR 


12433 


26334 


A 


12552 


136 


1 


RHTGSKTTATALVDVFLLLFFETESHSL 
SRLECSGTTPGHCNIiYE 


12434 


26335 


A 


12553 


84 


1 


I FFCXFS FEMESHS VAQTGVQWCDHGSIi 


12435 


26336 


A 


12554 


3 


349 


HASGPEELSKDPSLVSQGQPHRKPGLKR 
CSSRPLGPPDKLGGEGKQGLLGFTLWIjS 














GP FKKKLFFPPGGGRNFFFLGAPKFFWA 

grf 


12436 


26337 


A 


12555 


257 


417 


KQIiHTiTiQGRXjFS PS FLP S I S KLFFFEME 
SRSVTQAGVQGCDIiASLQAPPPGFT 


12437 


26338 


A 


12556 


268 


403 


YMRLSFQDLiVMFKIDVAVDFSQEKWECLiN 
SYQRNLYRDVILENYSN 


12438 


26339 


A 


12557 


286 


426 


DMLIKTCCHLVCEKKKKKKKKKKKKKKK 
KKKKKKRGGALKKKPLPPP 


12439 


26340 


A 


12558 


114 


1 


PLKE IANRHMKKCS S S LAIRQMQ I KTTM 
RYYFTPVTMA 


12440 


26341 


A 


12559 


106 


1 


FFFFFFFFFFFYFFFFFFEMESRSVAQA 
GVQWCDL 


12441 


26342 


A 


12560 


118 


2 


KN x bMJNiiiv 1 NAGKAQWJjTPAI PAL W bAE 
AGKSPEVRNW 


12442 


26343 


A 


12561 


196 


2 


MCCFSKNWQEGAQMTPGFEPPQDNSEKT 
KLSDTRLLFFFFERESRSVAQAGVQWRG 
LGSLQAPP 


12443 


26344 


A 


12562 


128 


3 


RAPPFFFFETEFRSLPRLECNGAISAHR 
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SEQII> 
NO: of 
nucleotide 
sequence 


SEQD) 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQBD 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Add, ^Phenylalanine, 
G=Glycine, BNHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methiontne, 
N=Asparagine, P=Proline, j 
Q=Clutamine, R=Arginiue, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














NLRIPGSSDSPAS 


12444 


26345 


A 


12563 


277 


370 


WSFTLVPQAGAQWCDLGSLQPALSRFKR 
FSR 


12445 


26346 


A 


12564 


55 


251 


AMVI PTVPFNITINSKPLGHISFQLFAD 
KFPKTGENFHTLNNKDKGFGSCFHRI I P 
EFICQGDDF 


12446 


26347 


A 


12565 


129 


3 


KYSVLRPGTVAHTCNPNTLGGRGGQITW 
GWEFETSLANVAK 


12447 


26348 


A 


12566 


172 


1 


PPPKKKKPRPPKKKNFFFFFGAISAHCN 
LCVPGSSDSPASPSRVAEITGSRCHAEP 
Y 


12448 


26349 


A 


12567 


257 


380 


GICRPLLGGVLQSGATGVRDPLEEAVCP 
LAEIiKHCAGRSTA 


12449 


26350 


A 


12568 


120 


r 


GVFLFCFVLFETESCSVARLECSGMISA 
HCNLRLLGSSNS 


12450 


26351 


A 


12569 


250 


2 


IGKPKTPQFWFIiKTKRGGPNAAFSFKKF 
FFQLKPPPRVFLKIFSCQKKIFFFFFFE 
TESRS VPQARVQWHDLGSLQAP P PGF 


12451 


26352 


A 


12570 


338 


3 


VSTPEKNFVFHTRGFS KKSSKYKVCNFF 
FQKKLGS PPFSLKKVFPGKPPYSCCSGR 
VSTAPYFLNPGPSTKIIiPPFFGPLGPEE 
KFLSFFFFFEMECRSVTQAGVQWCDLG 


12452 


26353 


A 


12571 


5 


556 


ICCCLCFKINIFQLHFNKINFFCVTRSG 
PVTQAG VQWCNLGSQQPQPLS S KQS 


12453 


26354 


A 


12572 


115 


2 


DRVSLLVLAHCNLCLRGSSDSPASASRV 
AGITGARHD 


12454 


26355 


A 


12573 


279 


377 


GQVQWLMPWPAFWEDEAEGLLQPSSSR 
PAWAT 


12455 


26356 


A 


12574 


117 


1 


XTPLFFFFFFFEMESRSVAQAGVQWYDIi 
GSLQAPPNGFT 


12456 


26357 


A 


12575 


253 


374. 


S LPGMWHACNP S P LGGRGRR ITWGRKF 
GTSLANMAKPCL 


12457 


26358 


A 


12576 


130 


2 


KFPGPFLGFFFFFFEMESRSFAQAEVQW 
RDLGPPQAPPPRFT 


12458 


26359 


A 


12577 


132 


3 


RVPGPPIiLFFFFFFEMESRSFAQAEVQW 
RDIjGPPQAPPPRFT 


12459 


26360 


A 


12578 


3 


370 


LRKNCXNLGGRECSEIiRSHHCTPAWVTE 
RDSVSKKKTLSVI^KWPNFPIJIPLALS 
RIOCFPRGPFEQVTHLVKEVVSLTKPCXA 
EGADLDC YD PRTS PLFAKSWEENFPFPV 
HPGPVEGCPK 


12460 


26361 


A 


12579 


131 


2 


FIYFLFYFLFLETESLSVAQAGVQWHDV 
GSLQTPPPRFTPTRA 


12461 


26362 


A 


12580 


202 


2 


RMRKNQHKRAENSKNQNTSFPPKDHNSS 
QTREQNWMENEFDELTEVGFRRWITNS 
SNLKEHVFSC 


12462 


26363 


A 


12581 


109 


2 


KIFFFFFFETESRSVPQARVQWHDLGSL 
QAPPPGF 


12463 


26364 


A 


12582 


105 


375 


LIMILLPLQILVHSGTFCFLFFGFETEF 
HSTPRIiECNGVMSAHCNIiCLP VKTS PAS 
TSQIS 


12464 


26365 


A 


12583 


106 


3 


RFXFlfFFri?"CrrifiSkTVARAGEQWCDLGS 
IQPPPP 


12465 


26366 


A 


12584 


484 


700 


SSSWNRAFSRKKDKTWMHTPEALSKHFI 
PYNAKFLGSTEVEQPKGTEWRDAVRKL 
KFARPIKKSEGQKKKK 


12466 


26367 


A 


12585 


301 


1 


TQPKVRTQMKNNI FEHAFLSSTFPWPLP 
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. SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino arid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

last amino 
acid 

■ muuv vjl 

peptide 
sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=H is tidine, l=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
V^vjiuxaraine, K— Arginine, b=oerine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
i^— u a Known, == ocop coaon, /— possi Die 
nucleotide deletion, \=possible 
nucleotide insertion 














NIiPADFYSATSQTKPDILQSMQKAFKNS 
ox vjyAW W UX PAX r AALWKAE VGGIujE PR 
NLRPAWVYGKTLSLQK 


12467 


26368 


A 


12586 


76 


379 


PLHPPASCPAPSLSCLIPGSPSDPLFSD 
SARPGWGRGWGRAPQLPPS PPFLYNLIK 

KGFFFWRAPPFFFFGGGFFS 


12468 


26369 


A 


12587 


109 


376 


LEFFKMSGWPTAPEQPAGBMENQTKPP 
DPRPDAPPEYNSHFLPGPPGTAVPPPTG 
x lAj^Jji/WCyi xSPQQPSTFPLYQPVGGIH 
PVRYQ 


T2469 


26370 


A 


12588 


323 


1 


KKFCPPRFFPPLFGFQKKGTNP ISPFSG 
GFKKNLGGFNPPPRPPWSPQIGFFLGRS 
JNJSXj j. iAxbr FTKxG I GGrFS F F FFCETKS 
HPVMRLECSGVISAHCNLRLLVS 


12470 


26371 


A 


i *«Joy 




i 
t 


vjUFFI lfa(ji?QTWQPKEHLMNFKSDSQLY 
EDTLAGRSVLIKNLTPQTLQPRWTGPYL 
v x X£> 1 Irl A VKi^UPPHWVHKSRIKLCPS 
EKK 


12471 


26372 


A 


12590 


284 


374 


TESRNWGWAQWIiTPVT LPALWEAEAGGS 

PR 


12472 


26373 


A 


12591 


297 


387 


GQAWWLTSVIPAFWEAKEGGS PDVRSSG 


12473 


26374 


A 


12592 


206 


407 


VKPQNPRKAQKLKGLVYIHI YVYI YVYV 
x x I jl x J. x N x Ilk x ISii IvVjTVNSKEKG 
LDTAAHTCNSR 


12474 


26375 


A 


12593 


222 


379 


LTTGS IMGNFSFLI LVYTRKVMGSVQWL 
TAVTDT.T.WTraiyanr'OT j^cdocdt>7v 


12475 


26376 


A 


12595 


301 


189 


KKIQSQAQWLPPI I PALLEAEAGRSLEL 
KNSRSAWAS 


12476 


26377 


A 


12596 


209 


3 


SVKPLFFINYQSQWLFLFFRGRLKRSG 
I x&AHUWIiH1jPGSRFSCLGLPKC!WDYRC 
APPTLKIDLTKD 


12477 


26378 


A 


12597 


306 


3 


NGKKGF YMAPEPPFFFHFFFFS PLFPTL 
r c e If X If PU PrCrALiS FLVGS S S LS PRLL 
CPMQHYF CDAWNTFDALIWGS I VDIAI 
TEVNVSTWRLSIiTVRV 


12478 


26379 


A 




i 
l 




■N i w*U^r liiirolUfcJjQLIjFLGGGEDGVS 
LLLLRLECNGAISAHCaJLRFPGSGDPPA 
SASRVAGITGACRDHTCEPPRPVPPSS 


12479 


26380 


A 


12599 


228 


404 


RLSSLSPVTEFGCLSPPDLILKCDPWPG 
WTHTCTPSTLGGRGGWITRGQEFKTSP 
VNM 


12480 


26381 


A 




mi 




JjoLi 1 xUN LAJjLrtjEnTT^EQciMTRIjHRRVS 
EVEAVLSQKEVELKAS ETQRS PLEQDLA 
TYITECSSLKRSLEQARMEVSQEDDKAL 
QLLHD IREQS RKLQE I KEQEYQAQVEEM 
RLMMNQLEEDLVSARRRSDLYESELRES 
RLAAEEFKRKATECQHKLLK 


12481 


26382 


A 




404 


J 


v-AxQO ri# i EHPMKSSRLGPTQLKI FTCE 
YCNKVFKFKHSLQAHLRIHTNEKPYKCP 
yu» XiwAI KAJMJLN V HJjRKH 1 XaB KFACDY 
CS FTCLSKGHLKVHIERVHKKI KQHCRF 
CKKKYSDVKNLIKHIRDAHDPQDMYCGR 
SRGSARMSRALPS 


12482 


26383 


A 


12602 


140 


I 


WIGSFLSSEEDGTWQKISGVQRSLSLY 
VYI HTHTHTHTHTHTHTV 


12483 


26384 | A 


12603 


112 


2 


ESLRPGAVAHTHNPSTLGGRDGWINKDK 
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SEQID 
NO: of 
nucleotide 
sequence 

■ 


SEQU> 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
OGrycine, H=Histidine, l=lso!eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginme, S=Senne, 
T=Threonme, V— Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














EFKMCERV 


12484 


26385 


A 


12604 


139 


1 


KKKVKRGPPGGFKGliGGKFFFSPKTFFF 
FF FETESCS VAQAGVQWR 


12485 


26386 


A 


12605 


204 


460 


SGQGNGEICHPRGFCTLPKAISSLLIAR 
VCUCHFNKGVASLSQTANGCFFEGFSET 

■ — ■ ■**■ -in _ i ■ ■ ■ _n_ .n. i jL.n.nri.n.rt rt t /\nnnT^n<i *~%T t"% 

ESRS VIQAGGQGGDPGSLQPPPPGS CLS 
L 


12486 


26387 


A 


12606 


317 


1 


KRPLFKNI FQKPFFKNGAPRKKGPRC I F 
GPNQKNPLFFPAPLSPPFWETKKGAGFF 
FWGFPFPPTKI IGGQKVPFPPTFPPFFF 
FFETESLSVTSLECSGAISAH 


12487 


26388 


A 


12607 


248 


367 


TQILSSGSVQSSLHHPPICFIHTHTHTH 
THKHKHPHS 


12488 


26389 


A 


12608 


2 


387 


QKQQQRAGRETSTCSLRI ISAPTMATFV 
ELSTKAKMPIVGLGTWKSPLGKVKEAVK 
VAIDAGYI^IDCAYVyQNEHEVGEAIQE 
KIQEKAVKREDLFIVSKVQWCIWLAFNT 
IVPKSQWQSLQTPQRC 


12489 


26390 


A 


12609 


258 


23 


MWERG I IRPQTQTLTGEPEGLDHGRS IS 
PNSVMNGIVPHISILTLNINGLNAPLKR 
PGIAEWIRIYQTSMCCLQETGR 


12490 


26391 


A 


12610 


176 


433 


DALSSAWGTVLGTQIPSHFTEILMLSPP 
AWGSSSLTQTLFYVPSGAKRTGSYVLAR 
VGQKYKTLWLGAVAHACN PS S LGGRGRW 
IT 


12491 


26392 


A 


12611 


312 


410 


AHMYRTCSRGWVS WLTPGI PALWEAEAG 
GSLEP 


12492 


26393 


A 


12612 


3 


289 


VFE FLS RKLS Y ILRMFWTFKEWFWLiKRF 
WLPPTIKWSDLEDHDGLVFVKPSHLYVT 
IPYAFLLLIIRRVFEKFVASPLAKSFGI 
KETVRKVTPNT 


12493 


26394 


A 


12613 


294 


432 


FMKLFFFFFFFERRSCFVAQSGMQGGYN 
ASLQPIiPFGSSNFSGLPL 


12494 


26395 


A 


12614 


138 


375 


SPNATCGHADPLPLCCLHI UQAJUfJ<t'L'K 
QT FFFEGKP YLG I LLYAVFFFLRLES CS 
VAQAGVQQPNLSSLETPPPSA 


12495 


26396 


A 


12615 


232 


1 


GVPQRAKLRAGLCPPQAMDTFSTKSIiAL 
QAQKKLLSKMASKAWAVLVDDTSSEVli 
DEL YRATRE FTRSRKEAQKML 


12496 


26397 


A 


12616 


236 


419 


LEMRNG WVW FFPP VNl^bWETlSUVRDQF 
GQHEETLSLKI I FKISRHGGMRLGSQLL 
RRLRQ 


12497 


26398 


A 


12617 


197 


1 


LCHES EGFLSRVS ALLFTLLLLVLMES P 
PSVSQPGVQWPHFSSLQPLItPRFKQFCN 


12498 


26399 


A 


12618 


132 


3 


NS FMFHLFTYFEMESCS VAQAGVQWRDP 
GPIiQAPPERVGGRV 


12499 


26400 


A 


12619 


255 


2 


MNSLNSFTDIAVIPLHYNRILPHFKIIL 
S Y r LKS KFQV KG I GK V KVvJj 1 r V 1 fx JjW a 
AEVGRSbEPRNCPAHQPGQYGKILLYKK 


12500 


26401 


A 


12620 


141 


2 


AISAHCKNHRPGSSNSPA 


12501 


26402 


A 


12621 


274 


421 


PDPRTDPRPDPRPDPRPEPRP 


12502 


26403 


A 


12622 


132 


2 


NQS I KKKI ELQG FQRSGGRG KGQ VQ WLT 
HI I PARWEAKVGRS P 


12503 


26404 


A 


12623 


390 


86 


I FP FF YQNTNGMAPGNQI PQGFSSRFFF 
FLRQVLALWPSLEYTGEITDQCRLKCSS 
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nucleotide insertion 














SLSLCSSWDNRCAHPPLVNILFYVETGS 
tlx V VJtlXvjJuAJjxtrloxv. I r\j 


12504 


26405 


A 


12624 


297 


396 


I IRVRGRAQWLTPVI PAL WEAKAGGS PG 
VRKSR 


12505 


26406 


A 


12625 


303 


381 


KFHLVPS INTMSGSQELQWMVQPHFL 


12506 


26407 


A 


12626 


318 


152 


GDKRRLVSKKKKNLWNFGRIQW VPPVI P 
ILLEGEAGGSPEVRSFRPTGQQSMTPF 


12507 


26408 


A 


12627 


48 

- 


388 


QLGNGKVRLLYQRHS S S FFFFWGEGFPP 
GPQGGGGGPNLGPRES PPPGVKGTPPPS 
LLEGGEPPGPPPPPGOTWGFWKKGGLPL 
YPGGPPTSGPKGEAPPTPPKGGGGIKGIj 

p 


12508 


26409 


A 


12628 


116 


3 


KVATNQAQWIiMPVI PAL WEAKAGGS S E V 
RSSRTADAW 


12509 


26410 


A 


12629 


227 


400 


VLFVLFFCKEWHVFIFLILPVYSYKKKK 
KKKKKKKKKKKKKKNKKDSQKKD 


12510 


26411 


A 


12630 


182 


3 


TNWLFPPPKFGPVPLFFLPPFFFFFFET 
GSHTVAQAGVCSGAILAHCNLRLPGSSN 
TPA 


12511 


26412 


A 


12631 


134 


473 


ASTIMDLLFGRRKTPKELLRQNQRALNR 
AMRELDRERLKLETQE KKI IAD I KKMAK 
QGQMDAVRIMAKDLVRTRRYVRKFVIMR 
ANIQAVSLKIQTLKSNNSMAQAMKGVTK 
A 


12512 


26413 


A 


12632 


100 


462 


QLLLCCCCCRQWTAFNVPAS PAPP PAWA 
HMAPSLLEPPSLLVTQICKLSAFSGPSI 
NAFLIjSKKKK3CKKKKKKKKDRGGAPIjKK 
KK 


12513 


26414 


A 


12633 


313 


441 


CWELLKWLVCFLCHI I KtNNCWAQWLTP 

viptlweaevggsle 


12514 


26415 


A 


12634 


129 


400 


VSQQCWPTPPALYS ISRQALAASPQGRP 
WDLQPTMPESPLLPPRAPAWPKPLRRAIi 
P PAPRCLGPSTAQGLRSAGAWCGTGRHI* 
HLRPQC 


12515 


26416 


A 


12635 


308 


399 


EIRGWTQWHVPVI PALWEABAGGLPEVG 
SS 


12516 


26417 


A 


12636 


285 


390 


IFT3VLINSFIYFETESRSVVRAGVQWRD 


12517 


26418 


A 


12637 


260 


2 


GQGPFFFGGRGSPKKKPPLKGFFLVGGS 
KTQFFFPRPKKGPFGVFFFFFSQTASRS 
VTRLBCSGMILAHCNLRLPGSSDSPASA 


12518 


26419 


A 


12638 


76 


3 


ALQVQWLMPVI PALWEAKAGGSLE 


12519 


26420 


A 


12639 


219 


484 


LGLQEPDLDPKP ILELPLAELAQQLQTE 
ELSLES ILCS YLKQALKVHQEVNCLMI F 
LGECEEELLALKKLKKSERGLLYGVPMS 
LKDT 


12520 


26421 


A 


12640 


33 


462 


EGLSWGYREHNGPIHWKEFFPIADSDQQ 
SPIEIKTKEVKYDSSLRPLSIKYDPSSA 
KXISNSGHSFNVDFDDTENKSVLRGGPL 
TGSYRLRQVHLHWGSADDHGSQHIVNGV 
S YAAELHWHWNSDKYPS FVEAAHEPDG 

T Ti.CZ 
Linu 


12521 


26422 


A 


12641 


2 


414 


SGPAAPATPMS IFPELYFNVDNGYLEGL 
VRGLKAGVLIQADYLNLEQGETLEDLKL 
HLQSTDYGNFLANEASPLTVSVTDDRLK 
EKMAVEFRHMRNHAYQPLAS FUDFITYS 
YMIDNVILLITGTLHQRSIAELVPK 
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E=Glutamic Acid, ^Phenylalanine, 
G=Glycine T H=Histidine, t=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 


12522 


26423 


A 


12642 


161 


463 


KEKARGRP KP JjUjLPITS ATTAMGLTI 3 
SLFSRLFGKKQMRI LMAGLDAAGKTT I L 
YKLKLGE I VTT I PT I VFNVETVE YNNI C 
FxvWDDGGQERI KPLiWN 


12523 


26424 


A 


12643 


124 


2 


TVFGGQAQWLMP 1 1 PALWEAEAGRS PEV 
TSLRPAWPRTRG 


12524 


26425 


A 


12644 


258 


454 


NRPELCDYQGLREDSKRKGE INQTACQF 
liWPRQKMSSQTKFKKDKEI IAEYEAQIK 
EIRTQLVEQ 


12525 


26426 


A 


12645 


636 


182 


APLERPSMIPPRCSGGQPSDLRSSPAQG 
TPRPALGDRKGTPRIAEPGGAGRAEAPP 
PRGSRQRFGPQRRQTNAHTKRVRTRLSA 
APARHS PAFIACRQVGRPRIjSARRSGKK 
PHLTVWCPFSFQDPIQDTTLQI ILLSLL 
GGDTFSDFPCS 


12526 


26427 


A 


12646 


199 


414 


IYHTSRSLLDRKLRAQRYDELPHYGGMD 
GVGVPASMYGDPHAPGP I P PVHHLNHGP 
P FHAKQNYGAHAPHPN 


12527 


26428 


A 


12647 


130 


3 


DRGFIXIVHTPLKXKKNTHTHTHTHTHTH 
TNTHTHSHTHTHTK 


12528 


26429 


A 


12648 


192 


443 


LLLiCWAQ ESLGTLGENTAS SHTAGLEWE 
SPLFACWFFWGFCLFVWLFVWFEPKSGS 
VLTLECS S VITAHCS LDLPGS RDPPAS A 


12529 


26430 


A 


12649 


393 


179 


LHSRVQRSGCFCSCVSNPVTGGMHFTRS 
S PQS NQEADGQARWLTPVI LALWEAKAG 
GLPELKSSRPAWATP 


12530 


26431 


A 


12650 


263 


2 


KAPLLVYKWE PNPGGGNFS PGMGTPNTF 
GVRFGGETKPLFFFFFETGSHSVIQAGV 
QLCNLDSLQPPPPSVKPSQYSWDHRKQS 
NTK 


12531 


26432 


A 


12651 


355 


111 


QDPCFIRIaHLPLSFIiSVFLECTKISLLD 
WRMLFPLLGDPPHI1AL1SSLVSPINTWNY 
SFQQLMMFRDVAVDFSQEEWECLDL 


12532 


26433 


A 


12652 


271 


3 


RQLKNILIFFSHKNMLSIKHIIYNYTLF 
LKSAF I FNLETDHLNCFPGS ISAKSYFI 
LFYF FETEPHS VAQAGVQ WRDL S TLQP P 
PPRFT 


12533 


26434 


A 


12653 


333 


2 


PPS PGVFGNPPS PGDPLGKTFFFPRGP P 
PRFWGGG r fr 1 liAJ?r Frooljr P Pr FJjPP 
SQGPPFFPFFFFFFFFEMESRSVSQAGV 


12534 


26435 


A 


12654 


310 


444 • 


TYLSNIFYFILFLVETESPSVAQDGVRW 
CGLGSLGPPPPGFGRFS 


12535 


26436 


A 


12655 


336 


411 


UyiroUrloiS lro JaJjJVxwlA-Li ViivJvjVjAri 


12536 


26437 


A 


12656 


114 


2 


GIFFFFFGLESRSVAQAGVQLCDLNSLQ 
SPPPGFKQF 


12537 


26438 


A 


12657 


191 


1 


KNFI LTFS FRGAKNGD VFFPP PGYIQTG 
ENFFFFFFLKWSFTL.VAQAGVQWCNLSS 
LQPPPPR 


12538 


26439 


A 


12658 


1 


445 


LRTGSEFSGRDSKGLAAAEPTANFGLLI* 
ASIEDQGAGGGGYCGSRDQVRRCLPANL 
LVLLTEAAGVAGVALGLGVSEAGGALAL 

CSIilGGANSLDPGAI/GRIX^WAIiLFFLG 
TTLLASAL 


12539 


26440 


A 


12659 


2 


224 


ESTASRI AFEAWQPETLPKGLNYSGAS P 
KKKGGGVFKKKKI KGGGG 
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C=Cystetne, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=<5lycine, H=Histidine, I=Isokucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=^Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


12540 


26441 


A • 


12660 


96 


1 


GLKDGRGQVQWLTPVISAPWEAKVGGS P~ 
EVMS 


12541 


26442 


A 


12661 


1 


457 


CWVHLQKGCHFTLGRLHSGQAKALLTSQ 
TGRPGRGAPHI PDGAARQRRS SHPRRGG 
QAEALLTSQTCQPGRGAPHLPDGAAGQR 
RSSPPRRRAAGQRGSSPPRRWAAGQRLS 
SLPRQGGRAEALLTFQTGRPGRGAPHIP 
DKAAGQRGGAG 


12542 


26443 


A 


12662 


175 


351 


KGIFFFPPSWKAGGAIWFKGTFAPRGKG 
NPPPNPSGEGGPKKKKPGPQI PRGPPLG 
PPV 


12543 


26444 


A 


12663 


247 


401 


SSNQBPGCKGPCVLTOFYFILGRAWWLM 
PVIPAIWEAGTGGSPEVRSWRPT 


12544 


26445 


A 


12664 


152 


251 


RENSCCFIKK3CKKKKKKKKKKKKKKKKK 
KRGAP 


12545 


26446 


A 


12665 


2 


365 


APTRSRCLRHAS CTACRTVS TDTS SLRR 
ADPKGRSALiLADIQQGTRLRKVTQINDR 
SAPQIESSKGTNKEGGGSANTRGASTPP 
TLGDLFAGGFPVLRPASQRDVAGKEEFI 
LAPLWTVSP 


12546 


26447 


A 


12666 


179 


1 


LBLSKSGSHWMRHFLSRSFYSLIS PQLN 
TTVWPTI ITPILLTLFLITQLKILNTNY 
HLP 


12547 


26448 


A 


12667 


279 


35 


NQAPFKARFFTTFKNVFLKBIKKKRCWS 
GAPPIIPPPWGGGGGGPPWGGNLSPPGP 
PVSTPFFNGAQKKKKKKDRYGGACL 


12548 


26449 


A 


1266S 


299 


399 


I S I KKQRKG P VQWLMP VI PALWEAKAVG 
SPEVR 


12549 


26450 


A 


12669 


1 


385 


KMQNRGFFFLTFLAI1I1GLTSGAAKKKNK 
GKKAGPGSKSPNWPWGPCPPSSKDCGGV 
FRKGTWGAQTHGIRGRGPCNWKKEFEAN 
CKSKFKNWGAGDGGPGTKVRQGTLKKAP 
S IAKGQETIRVTKPCT 


12550 


26451 


A 


12670 


357 


1 


AGKIAKI CPVSSMQAPTCGFPVGGNDNQ 
GQAPDGQFQPPLQQWQTSSPDFSNENSP 
ATPPNEQGQGDAPPQLEDEEPAFPHTDL 
AKLDDMINRPRWWPVLPKGELEVLLKK 
GIDFSKK 




26452 


A 


12671 


107 


373 


TGNYTPLEDCAQEQMRLIAQVHUjSTRV 
KQQQVKI KQLLQENEVQFLDKGDENTW 
DLGSKRQYADCSETFNDGYKLRGFYKIK 
PLLNP 


12552 


26453 


A 


12672 


196 


45 


YLYFHGSREKRLAKKYYDKLFKECCIAD 
IiSKYKBNKv^CFPHIYGKRFIjFK 


12553 


26454 


A 


12673 


33 


448 


KEGAERAGAAAPVWS FLLDRRDSARTRS 
GTSLGSADMGDMKTPDFDDLLAAFDIPD 
IDANEAIHSGPEENDVPGGPGKPQPCVG 
SES EHTASASAGDGPGVPAHASDHGIiPP 
PDISVLSVIVKNTGLSRAGWRPWVDV 


12554 


26455 


A 


12674 


250 


29 


KKQTNMWQS PYEDYR I FYTISS SLYHLS 
IYPPIYLSIYLSIYLSLYLPTHLSIITD 
YVS ILEHELRPLWRIQP 






j± 


l /.Of J 








12556 


26457 


A 


12676 


1 


553 


RIFPGRRFRVKLPSCPDPAMGTRLLFWV 
AFdiLGADHTGAGVSQS PSNKVTEKGKD 
VELRCDP I SGHTALYW YRQSLGQGLE FL 
IYFQGNSAPDKSGLPSDRFSABRTGGSV 
STLTIQRTQQEDSAVYLCASSLATAWHS 
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C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Add, ^Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serme, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














RLLPAHKPH P S LS LQLLDTLNRG FS LLL 
PPHGKQ VDLDLS LS FG 


12557 


26458 


A 


12677 


232 


3 


GWSSDCQQRARVIEEKHSRQKGHIAENP 
SVHRHHQRP KVD KTT KMG KKQNRKTGNS 
KTQSASPPPKERSSSPATCI 


12558 


26459 


A 


12678 


134 


3 


IFIVEFGGKPFSCTSISLSQWIjWCIiFIG 
igellwgqvstgtly 


12559 


26460 


A 


12679 


212 


47 


NGTHPHG YTLHTHKHTHS CAHTHTHTHS 
LLLVRKSFLPQKNTT ISFRSCCKLGKP 


12560 


26461 


A 


12680 


224 


2 


MKMAS FLAFLLLNFRVCLLLLQLLMPHS 
AQFSVLGPSGPILAMVGEDADLPCHLFP 
PMSAETMELKWVSSSQCI 


12561 


26462 


A 


12681 


35 


397 


KPLISQNKIHSVSSSQTGFFFFFFFFGE 
KKPPFAPRGGGKGPPPPLPPGGGNRAPP 
RGBKKGKGKPPPKNPGAFPPQKEKKTPG 
GGGGKKKTPAPPTPKEVDQPLPRREQRG 
PGRLPPRGA 


12562 


26463 


A 


12682 


188 


514 


HTTHLVYVLSMAAFFFFFFEKNFLFAPR 
VEKRGKDLGSIiKLPPPGFRHFSGLTLQG 
SGNNGAPPPSPVIFLVFFEKRGFPLVGR 
BGIiIIiPPIjQPAPFCIS FRGAINGPS 


12563 


26464 


A 


12683 


366 


3 


ISQTQTQWTDWVSLELQGCELAHRDTVA 
IPQTRS FPFPLMISCMTLFSLIiQMDRAF 
PPFPPQTPITIQGQISTSPATWPLTHLH 
STPGLSVEYSSNTHKSSLSPTSQWIRVD 
PMIiASPTV 


12564 


26465 


A 


12684 


99 


1 


SSCIRFFLEQAYGQQSYGTYGQPTDVSY 
TQACI 


12565 


26466 


A 


12685 


247 


31 


FLKEKRNQFYLFIFQRWLDPNKPIRKQL 
KRGSPYSLNFRVKFFVSDPTKLQEEYTR 
WVYGYIFLENIVKTLI 


12566 


26467 


A 


12686 


416 


253 


PSPMTI PVTGAPRDADLWSSHAKMLAQP 
LKDSDVEVRFLGSSQIFLCWRPSFNL 


12567 


26468 


A 


12687 


116 


3 


CLLGRMWWLMPVI PALWEAKAGRSLEVR 
SLRPARPMY 


12568 


26469 


A 


12688 


380 


1 


SPESSKIiGLWSKHVSPLIiCIGVCPPPVS 
MAELRQVPGGRETPQGELRPEWEDEVP 
RSPVAEEPGGGGSSSSEAKtiSPREEEDL 
DPRIQEETiKHTiNQASEEINQVELQLDEA 
RTTYRRILQESASV 


12569 


26470 


A 


12689 


157 


488 


REFVS QGGSHLKAQ VRLBALLLTIGT P P 
WAHLSILHMTAMGQRENKREAASSRIQV 
LLNHSQAIKQES I I LFFEMESGVWPRLQ 
YSRMISASCTLFLSGSNNSPVSAPRI 


12570 


26471 


A 


12690 


514 


3 


PS I RAGIiLCGSAENATP FLCGITMAAGP 
LYTYPENWRAFKALIAAQYSGAQVRVLS 
APPHFHFGQTNRTPEFLRKFPAGKVPAF 
EGDDGFCVFES IAIAYYVSNEGLRGSTP 
EAAAQWQWVSYADSDIVPPASTWVFST 
LG IMRHNKQATENAKEECMRPLRVS S L V 
GP 


12571 


26472 


A 


12691 


304 


4 


AESLRVAHERLDTRSTSSDI FNFPQTQS 
NIJSMWSE I IiEb WAN YQS b TS YS INTEIiS 
LFSKVNGKFSTEFQRMKTLQVKDQAITT 
RVQVRNLVYTVKIKPL 


12572 


26473 


A 


12692 


221 


2 


VGPATRDLCFADE P VVGRQKQKQ WWWVL 
CliSPNRGIiAGRPDTLHITCASAHMRTHT 
CMHTHRHAHAHVHTHCI 
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Amino acid sequence (A^Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Pbenylalanine, 
G==Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N-Asparagine, P=Proline, 
V^Ajriutamine, K— Argimne, o=6enne, 
T-Threonine, ( V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


12573 


26474 


A 


12fi<tt 


J 




xtnJjUolJLib rQEAFSICIiGEKFIYljFTYL 
FRDGVSLLLPRLECNGVILAHCNLRLGD 
RARLHLSKKQK 


12574 


26475 


A 


12694 


233 


3 


IMGPAHLFKGNHWNGGWGLMYKAFFFLF"" 
FFKTGSHS VAQAGTI PVHCNLRLPGS SD 
o JrAJfAij RVAGNTNVPHHTL Y 


12575 


26476 


A 


12695 


408 


3 


PAREMEKFRVCTRKRPLGMREVRRGEIN - 
I ITVEDKETLLVHEKKBAVDIjTQYILQH 
VFYFDEVFGEACTNQDVYMKTTHPLIQH 
IFNGGNATCFAYGQTGAGKTYTMIGTHE 
NPGLYALAAKDIFRQLEVSQPV 


12576 


26477 


A 


12696 


123 


631 


REAVQGGKGSGGGFGSRDSRSSGAVSAA 
VGDMGDPGSEIIESVPPAGPEASESTTD 
ENEDDIQFVSEGPSRPVLEYIDLVCGDD 
ENPSAYYSDILFPKMPKRQGDFLHFLNV 
KKVK^JjTENWiS VS KNHCRLS KAKE PHFE 
YIEQP I IEEKPSLSSKKE IDNLGASDCW 
D 


12577 


26478 


A 






ozy 


IiALLYPLKWVGKPVNFMEEDIIiGPLPPP 
LNEEEEEAEEEEEEEEEEENPVHKIPDS 
HE ITLKHGTKT 


12578 


26479 


A 




1 

i JO 


404 


LPLLS VERLVEI YAYTFIQDLFSASNI P 
YSSSVFRIEMDQPRTHSGPTTASNPAPS 
STNSSSAPSATNSKQERSSSSLSKPS 


12579 




A 




1 oc 
lO-> 


3 


HRRPIS IPSS VIHPAMVRLGLQYSQGLV 
SGSNARCIALLRALQQVCP ILLSLMIQP 
HLVY 


12580 


26481 


A 


12701 


181 


3 


TVWASMFLSAALRARAAGLAAHWGTHVR 
HLHKPDMQNGAGGALFVHRDPRENNRDT 
PCX 


12581 


26482 


A 


12702 


373 


1 


ILVQERDSQRVIRMIDKLVSSRDRVGRG 
VEKQMTDHRVI YSVRKLQKRITPEPFI P 
SRTTP KLIiFLLVLPSGKTPGHP VSS RTI 
PEPPLPTEPLERIPEHPVPSGTIPKPPE 

ppi»piephetmy 


12582 


26483 


A 


12703 


272 


508 


TKLGKKQNRKTGNS KQQSAS PPPKERSS~~ 

SPATEQSWMENDFDELREBGFRRSNYSE 

LREDIQTKGKEVENCEDNLEE 


12583 


26484 


A 


12704 j 


341 


1 


VPQHPHRPIiAPPSLGPQSWPLMEGSRPR 
SSLSLASSASTISSLSSLSPKKPTRAVN 

QDSSGLRLWKRRWFVLSGHCXiFYYKDSH 
V 


12584 


26485 


A 


12705 


187 


365 


WEPSCRGPSAFKPTRCQSYDWACMCGAE 
GRSAMEQPQEESPEVREEBEIEEMAHAE 

Ulr 


12585 


26486 


A 


12707 


238 


596 


LWLVRVKYSIMSGAALGLEIVFVFFLAL 

1?T .T .TST>"VflT\T?Wf\trOT \n T/^TT t tvt.tut r»ia 
r -LiLiftK i bUr xtKJj v J. JLt» 1 UaAW YJjCF 

LIVFILPLDVSTTIYNRCKHAAANSSPP 

cteiciaX I^IjzATANPVPSQHPCFKPWSYI 

PDGIMPI 


12586 


26487 


A 


12709 


i j i 


o 
JL 


SLTGSWDYTHAAPCI 


12587 


26488 


A 


12710 


151 


2 


YRQGLILLPRLECRVMIMALCNFESPGL 
SDPLTSASRVASTTATCHHTC 


12588 


26489 


A 


12711 


151 


1 


NFPEFDHFTVGCLRAGSSLIHYWWECKL 
VQSLWKAVWRFI KDLKIDLPLY 


"12589 


26490 


A 


12712 


28 


411 


RWPAR PAGE PRE PHVSWVMKLNPQQAP 
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SEQID 
NO: of 
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SEQID 
NO: of 
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sequence 


M 

eth 

od 
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NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=G!utamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, ]=Isoleucine, 
K=Lysine, L=Leucine, MNMethionine, 
N=Asparagine, PHProline, 
Q=Glutamine, R=Arginine, S=Senne, 
T=Threonine, V=Va!ine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=35top codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LYGDCWTVLLAEEDKAEDDWVYLVFL 
GSTLRHCTSTRKVSSDTLETIAPGHDCC 

I .l| f tAM T7T r/>T TTST^ fjl^i T f >t JL.VI VT Till nVT| fTlf T TNT^ 

ETMKVQLCAFKEGLFvFD VTK KYFLLPR 
MYRYYRCLYTYPSAAV 


12590 


26491 


A 


12713 


351 


675 


ENPRHT F I YSLPLGIiQKHQVLTVDIG FG 
GTAI MTVGKSS KMLQHTD YRKRWI LQDG 
RIFIGTFKAFDKHMNIjILCDCDEFRKIK 
PKKAKQ PERBE KRVIjeij VliIiRGEN 


12591 


26492 


A 


12714 


326 


457 


ACSHHQRKSQAPPPAHLWPDCPHPGSLP 
LLNPSVCNPGREVSPT 


12592 


26493 


A 


12715 


200 


1 


TITKVCVFTRTCXJLKGSLMAKALFDQRK 
DRESETECQVWWLTPVIPALSEAEVGGL 

IlEPRSSRPGV 


12593 


26494 


A 


12716 


208 


405 


KKERENKKHTNKKGRKKMVHICQWHSSIj 
HRKFQG I YRKTTGTG WAQWLTP VT PALW 
EAEVGGPPEV 


12594 


26495 


A 


12717 


157 


3 


AKETPTHKGSCKKKFFFFFFETESRSVA 
QAGVQWRGIiGSLQAPPPGFTPCI 


12595 


26496 


A 


12718 


1 


472 


SPAILPRLAILPYIiLFDWSGTGRADAHS 
LWYNFTI IHLPRHGQQWCBVQSQVDQKN 
FLSYDOGSDKVLSMGHLEEQLYATDAWG 
KQLEMLREVGQRLRLEIADTELEDFTPS 
GPLTLQVRMSCECEADGYIRGSWQFSFD 
GRKFLLFDSNNRKWTW 


12596 


26497 


A 


12719 


537 


1 


1HTMNGGNESSGADRAGGPVATSVPIGW 
QRCVREGAVLY ISP SGTE LS S LEQTRS Y 
LLSDGTCKCXSLECPIjNVPKTOKFDPLAP 
VTPGGAGVGPASEEDMTKLCNHRRKAVA 
MATLYRSMETTCSHSSPGEGASPQMFHT 
VSPGPPSARPPCRVPPTTPLNGGPGSLP 
PEPPS VSQAC I 


12597 


26498 


A 


12720 


369 


3 


AAKIIPATRKKASIiEIiELPSSPDSTGGT 
PKATISDTNDALQKNSNPYITPNNRYGH 
QnGAS YAwHFEARKSQILKOffiCX3SSHD 

IVETPVTPV 


12598 


26499 


A 


12721 


230 


3 


KFFWVI1AGI1TGKNSDASASLNQVHISPF 

T l?GT3Uli'DT7CT /inr Q ODTH.T /V^O VDOHTT 

Jj if 0 KnxlJr r 0 Uj Jr lio 0 4r V 1u4A*oXUCKU1uj 
ATLRALESASLSQHPHVPV 


12599 


26500 


A 


12722 


26 


110 


REQYAEGNMRGPAPGKKTSGLQQKNVEV 


12600 


26501 


A 


12723 


224 


1 


WQNQDQASDPPKYS FTSQCCLS FARLAR 
RYGDVFQIRLGSCP IWLNGERAI lqam 

V Uy uoiur AUKrA-rAo f K V 


12601 


26502 


A 


12724 


120 


530 


KKVARGRSRSRERS RRRRSRP KAI T \NR 
T*GTRCTPRRWRSTVLGMRSCTRAR*QR 

KbUL»oKOrll KSJUxUt-JJlltao VKiUjbtjLXiA 

DGTGGGDRRIiGLGRDSLLLSSSQSAAFS 
SS ASGSS S FFSATQPLRMLLE YFWL 


12602 


26503 


A 


12725 


434 


222 


KEEKS RGEVKEQD* EKDREEEVEKSRE/ 
RRRSRRRRRRSEV*YRKGKRRREREDIL 
VAERSHRSFRNSIPSTL 


12603 


26504 


A 


12726 


268 


370 


snilihmesmffffgtdipsv*knlpa* 

h*ftegavl™sfsk\aasitltpkpny 
dimrkenyspissyiwnqcsffleqifp 
lsfklkcpgliiph 


12604 


26505 


A 


12727 


103 


636 


VCFIS MKQPHGSRHP PCS / P PRPGS * VC 
QQSRPLGPQPSAFGQLA*HRRGLLGLQG 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQID 
NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycioe, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N~Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














PRSPMQPVPHGGRVSGQTLPKGPTPKRP 
PGPAPRHGGHSWLLK* LVCAQPPPRP * S 
*ASQAVFTLQVPGKPQVWTPCPVPVRAP 
TP\PLSNGGU5VSERH*DGH*QAPTSP* 
PRCQGGAGEGPQ 


12605 


26506 


A 


12728 


1193 


1638 


GLGFAMLPRLALNSWPQGDPSVPASRAA 
GTRGVHRHTQLQVSFNYYKVLAMHSGSQ 
L*SQHFRKPRRSDHLRSGVRDQSGQHGE 
TPSL/LKNTRISWAWWHTPLVPATREAE 
ARELPE\PGRQKIASEPRVKPICTASLG 
NTCETPPQKK 


12606 


26507 


A 


12729 


304 


101 


RHLHWPCPSPLAPTPDISHCPEPPKSQT 
SP/CPGT*CLGFLKCPSSCHFA+K/PLH 
PSEPARKSLAPDTCP 


12607 


26508 


A 


12730 


91 


264 


SQRSISGLRVKENLIFVMIMFPPIYSSQ 
TFWSQTFLMLKI VFMGISIS ICYIL/NT 
EKNLGQG WWLAP 1 1 PALWEAEAGGLL * L 
RVKENLIFVMIMFPPIYSSQTFWSQTFL 
MLKI VFMGISIS I CYILKLRKI 


12608 


26509 


A 


12731 


1026 


406, 


LAHFRSQIFSFSHILVHFERMVLNYRYY 
LACVRRWFRFLHVLAYFGRIVLKYRYF 
LVHFRREVFRFRHMIiACFRRMVLYCYFL 
VLRKILLIYVTFLTYFRKVFLRHNFLRG 
GY*RKYFGYAPRWLSFFLYRCQCFLHYF 
LFYLWLRLHLQF/ CCFWSFCLFDFLFL 
FSAACVFSVLQIAIVMFPFHALQLFFF* 
WIiCFSMSRFQ 


12609 


26510 


A 


12732 


1508 


157 


QDVGGSGFKVDTHPRGRMAS I FS KLLTG 
RNASIiFATMGTSVLTTGYLLNRQBCVCA 
EVREQPRLFPPSADYPDLRKHNNCMAEC 
LTPAI YAKLRNKVTPNGYTLDQC IQTGV 
DNPGHPFIKTVGMVAGDEESYEVFADLF 
DPVI KLRHNGY* PQG* LKHTTDLDASKI 
T\QGQFDEHYVLSSRVRTGRSIRGLSIiP 
PACTRAERREVENVAITALEGFKGDL\A 
GRY\YK\LSEMTEQDQQRLIDDHFLF\D 
KPVSPLLTO*GMARDWP\l^GI\WHNY 
DKT\ FLIWINEED \HTRVISMEKGG\NM 
KRVILSR\FCRG\IjKEVERL I \QERGWB 
F\MWK*AP*EYILT\CPSNLGT\GLRA\ 
GVHVRDPQSFSQDPTAFLKI LEKPRTPR 
KRGHKWWDIAA\VADVYD\ ISNIDRIG 
RSEGEL\VQIVIDGVNYLVDCEKKLERG 
QDIK\VPPPLPQFGKK 


12610 


26511 


A 


12733 


261 


487 


TGSETDCAKPQAS FLPREVPTVAEMKME 
LItKNKQ FW \RGVVAHACNPSTIjGGQGGW 
IT*GQEFETSLANMAKPHLY 


12611 


26512 


A 


12734 


382 


668 


YKRITDIFVDSETVHILINKRQSCRIPG 
F IQIiVQLI SHQLAAPRDYTVSHS VAQAG 
VQW / RNLGS LE PLP PGFKRFLCLS * HAL 
KNLSSCDTPPQY 


12612 


26513 


A 


12735 


401 


27 


GDRAEESAEPRAWSHSDNSHRYTTLFIC 

T T'TJ ' I ' U 1 7TJ~KT OT TXJ O \ T_T r Pt-IT t tJ l I iljfnu T^LTnil-Pl'tjr 

JjiHltlViiNPVtlb \nIHTHTHTHTHTHTH 
TVSHRHTETPPLIiLKQTGLKFY*NSRDD 
TPRSRPGSSGIjQRLSSSPPVPFQPGTVB 
ASADFCGHDLLTT 


12613 


26514 


A 


12736 


202 


182 


KYLPIFINLITMNILFFFETHSCSVGQA 
EVH*S*LKPMPPG\SSDSPASAFQVSGI 
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M 
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correspond 
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amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 

correspon 

j;_ _ ja- 
ding to 

last amino 

acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=AJanine 
OCysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I^lsoleucine, 
K=Lysine, L=Leucine, ^Methionine, 
N=Asparagme, P=Proline, 
Vf=Hjriutamine, K— Arginine, a— oerine, 
i— mreonine, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














TGSPSSSWLIIi* FFLKIGFTKKG 


12614 


26515 


A 


12738 


319 


2 


QNTPPHIYEYIPRVEftFLLFLEMSYSVY 
WAVLS PLYFVNKLALTLHCRLAIiNSFL 
HKIQEPS\LGSGSG\PLSCNTII*RLCL 
RSLHLVFRSLWILPICDSSSVF 


12615 


26516 


A 


12739 


1363 


2000 


DRVFVIPGWKCSGTIKVHCSLDLPGSSN 
PSTSAP*VAC\ATGTYPPCPGLNFLYF* 
MEERGFTQVA\QAGSQNSWGSKPNGTSE 
ASPKVTjGMNKP*NLPNPGRSIIiN*DGSI 
RYKDNYPA/ WEPPKRS YKCLRQKKS I * L 
SAGPRDYEASQPRKNYPISSLLTFCTSS 
LHFNPTBLTCSSQLQKSIRRKLKSEESC 
PPLPACSLKHTQAI IKITF*RLHRTA/I* 
FYLLK*NCFRN*S * * KYFLLS VSNLKI / 
QWVRSSIjHPGAPGYTPTV*RPKHKSNVL 
i* iqs / itrclfyilvymrtifplqlfl 
SKHPSTYKEDFA*VLPLSFFFFEMESRS 
AARRSLSSLQPLPPRLKRFSRIiSLLSSW 
DYRCAPPGI*ASPSILSS*GSCCSIQFIL 
RMS T I SHAINVLVLKNTYLVL VS AS EHS 
LIKKPC 


12616 


26517 


A 


12740 


1696 


743 


GGQIMRSGVHDQPDQHGBTLSIiIiKIQKS 
AGPGGMHL*SQLiLRRLRQENRL\NRGCS 
EPR/SRHCTPAWATEQDSVSKKIKK*KK 
♦NHLESKQQGQPALEPPBQAGGQLRKTE 
QLQDGRRELAADMTL*PGHSKRLGALPR 
PI,LSAYYFNKVHAARRQAYLETPGTFTS 
YQREEGEQAL I LGVEAQAS S P TVFHRER 
RQSQALCSHRKSQEAPVRPAHPPRRVPG 
LGKPSSQGLSAHLGQDRAPAPRRASWDQ 
RSQAPISVTFPSVLDKEESVPCGPGFPG 
HAPAPRGIHGATWEGAHSRGYPGHFLAL 
PQHNSDEQRPN 


12617 


26518 


A 


12742 


2473 


445 


RGARRSRRRSRHRRRRHQSRPVRAAPRQ 
PEQGRRRGAPTHGPQLIMMDLELPPPGL 
PSQQDMDLIDILWRQDIDLGVSREVFDF 
SQRRKEYELEKQKKLEKERQEQLQKEQE 
KAFFAQLQLDEETGEFLP IQ PAQHIQS E 
TSGSANYSQVAHIPKSDALYFDDCMQLL 
AQTF PF\VDDNEVS SATFQS LVPWYS PG 
HIESPVFIA\TNQA\QSPETSVAQVAPV 
DLDGMQQDIEQVWEEIaLSIPELQCLNIE 
\NDKLVETTMVPS PEAKLTEVDNYHF\ Y 
SSIPSMBKEVGNCS\PHFLNAFEDSFSK 
HPLHKNDPNQLTVNSLNSRM/ PTVNTDF 
G*WNF ILVF\MAEPSIRQQAWPS PATLS 
HSLS \ELLNGAHGCFLDLFTL / CKAFNQ 
NHPBGTA\EFHGF\DSGISLNTSP\SVA 
SP\EHS\VESSSYGDTLLGIiSDSEVEBL 
DSAPG\SVKQNGS *NTMYSSSGDM\VQP 
LSPSQGAEHFTCMDAQCEEHTRGKDLPV 
\SPG\HRKNPISQKDKHSSPLGGLISQR 

FNEMMSKEQENEAQL\ALIR\DIRREGV 
RNRS/AAASGICRKKENWENIVEIiEQDL 
DHLKDEK\EKLLK\EKGENDKSLH\LLK 
KQLSTLYL\EVFQHAYRDEDGKPYSPSE 
YSUXJTRIXaiVFLVPKSKKPDVKKN 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Ghitamic Acid, F=Pheny!alanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12618 


26519 


A 


12743 


6 


457 


RPRNRPG I RVD PRVRGRVGHAPHEGLVP~ 

ERMYRGSPTACETQAAA*ERAFGPSSPT 

C\RLPI PRMSTS VPQGftTWTQRVKKDDE 

EEDPLDQLISRSGCAASHFAVQECMAQH 

QDWRQCQPQVQAFKDCMSEQQARRQEEL 

QRRQEQAGAHH 


12619 


26520 


A 


12745 


297 


12 


QERPVRMLVLHFGRLRLADHLKLGVRDQ 
PGQYGETPSLLKI PKLSGM\MWAP* I C 


12620 


26521 


A 


12746 


98 


428 


YNTS FNETVLLLTT P I TIDCL YTRKDAI 
APESACGVCPCSLLGVGA* IPKWIRLD 
LSKKHVTTAYGG FMCAKCVLDR INRAFL 
IDEH\KI VPKV* KALSQSSKAFCEHET 


12621 


26522 


A 


12747 


3 


638 


LWLWSLCWVWAVSLPLQFILGSIjHPCQG 
QASWREVDLLHEGSQEASSKPSSGSVPV 
. GLLLDQEMVTPLLVCICGILLHQEMVSP 
LLVYTRG I LLHQEMVTPH PGL YPVJDS P P 
LDGHSPPGLF\RGAAPPS*GG/PLPVLV 
YTRGAAPPSGDS\PPPPGLYPWDT\PPS 
GDRHS PPGLYPWDTPPSGDGHSPPALYP 
WDSPPSGDSHSPPGLFPRG 


12622 


26523 


A 


12748 I 


2 


333 


DMVLLCHPGWSAGS I * KKTKERERMMNL 
HR* * I DRERERERERQIQ* KE * KRINE F 
EGS KERFTQSVKQRNK/ ele kirtldts 
ETIYLSIYRN*SLRRGQERRTERERMRH 


12623 


26524 


A 


12749 


30 


333 


KTSYLLPVQWWAQNDNERYSSSKNTIMA 
LPLPLPVFPRSPSDAERKLDCSAAISAH 
CNLPA\DS PASACRVPAIAGARRHA* lv 
FGFFWWRRRFAVMAGLVS 


12624 


26525 


A 


12750 


231 


39 


INDLL/CXiF*FKKLIWGRVW*LMPIIPA 
RWEAKAGGSFEPRSLKLQ*AMITPMYSS 
MGGREQDPVS 


12625 


26526 


A 


12751 


49 


273 


HLQVTEVFWFWCVFFRRWGGSHCV/AQ 
AGV* WLFTGTWPCCS PELLGSRDPPAS 
AS * VAGTTGACLAANCRGF 


12626 


26527 


A 


12752 


505 


897 


SYLRVQFYSAFSLFFRHEVLPCHTGWNA 
W*SQLTTASN/CLGPSNLLKAPHLAN* 
KKI F * RQGLSMLPRLVLNS WPHMI LLP * 
LPE*LGLQARATAPGSGFTFCLTQDSIL 
MCS PTVHKLSDS IALETKQ 


12627 


26528 


A 


12753 


335 


542 


CCNE FFLSQVWWLMP WPATR/ SAEAGG 
LPGPGSQRLR*ARS KPVNSHCSAGGRCG 
IDPISIKQQKNNNR 


12628 


26529 


A 


12754 


356 


72 


WHEYYLLMEHTKKCHLS*GLYNGLNN*Q 
/WCTHPWPATWEAKAGESLEPRSSRL* 
CTMITPANSHCPPAWATARSCLNQSINQ 
SSNNWQGMITGGK 


12629 


26530 


A 


12755 


340 


127 


NYLFI YFRDSLTMIiPRLECSN* FTGSI I 
VHITLKLLGSSLP / AS ASQVAGTTGTRH 
HVQLSNYFKI KI KPKP 


12630 


26531 


A 


12756 


290 


511 


KKNQPGT / CGSRL * S QRFGRLRQADHL \ 
DQLGQHGKAPSL/ LKNTKISWALWRTPV 
FPSSQEAEMEELIEPTSSRLQ 




26532 






254 


549 


YPGAKKQRPGGDS VKGj. HiA2£>RCW CVLL 
HNCQ*PKSNTQTFP*PKNLRE*NAT*KS 
TDL/WPGAAAHLCNPSTLEG*GEWIT*G 
QEFETSLGNIPRPQ I Y 


12632 


26533 


A 


12758 


617 


451 


NKKRRGLTLLSRLKCNGVIIAHCSLKYP 
GSSEPPTSAF*VAGITY\GT*HHAWAY 
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SEQID 
NO: of 
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sequence 


M 
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SEQID 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino flCtH 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, {^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G lutein ine, Rr^Argtnine, S^Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib1e 
nucleotide deletion, V=possible 
nucleotide insertion 


12633 


26534 


A 


12759 


91 


393 


KWYTKECLYLLVS I FSRYLLS IYYREVL 
AQTLGKLS KTVMCPALEETVAVNGLQI K 
SKK* / WLGMVTHACDPSTLGG*GGWIAW 
GREFGTSIASMAKPCLC 




96^1 S 




12760 


10 


221 


QSHNTNGCLDLHFQKGPWQSVGETLNL* 
TIFLCVCTCVCVCVRHVYHM/CYIYSDF 
I CPS I ECEYFSFAHR 


12635 


26536 


A 


12761 


309 


55 


I IRYKNYISIKLEGWKRLYHATTHEKKV 
GVAVLISEKVDFRAKN /MYQDKEGRFLL 
*R 


12636 


26537 


A 


12762 


640 


896 


TIVLKCNFVLFCFRDEVPLCCPGWSAVA 
/QSQTQLW*TTQPGLKQSSCLSLLSSWN 
* RHVPPHLAMFLKTTFTLLYTCGHLETN 
IC 


12637 


26538 


A 


12763 


1 


357 


IVPLHSSLGDRVKPHLLKKKKTCSTSLV 
IRETQIKITRYTYSNG*S*RKMKAGNNK 
CWHGYTATGTLIAD /CKSKQMLWKTF* Q 
FHIKLNIHY*YNSEIQFLGLYPRDKNIA 
RCGGLRL 


1ZOJ5 




j± 


19764. 


19Q 


19ft 


RWKCVKLKNTGQAWWIjIPVI palwknq \ 
AEGSLEARSLRLQ*TLIAPLGYSIiPG*W 
SGTLSLNK 






A 


1976^ 


617 


41 S 


GVKGLILPKLQKLLKSIE/ i/e*klpks 
fydasltmipklhkd\rpishvkieari 
lhkisinii 




96^1 


A 


1Z/OD 


941 


a! 


QKIKKRIKKNKQNL*EIWDYVK*PNL* I 

igipergekvyntenifegiiq/esvpn 
l/ aylvrevdnqiqe iqrtpvryytk 


12641 


26542 


A 


12767 


101 


16 


rtqlfvsvfwfcflfcfetgscsitqvg 
vqwchlrslqlpppp \gsrds pppssqn 

RG*TNI 


12642 


26543 


A 


12768 


362 


92 


RVPHMDLGEECTS P YMYI * PMTCVFKHG 
♦DGNFCVTYFATI I IYTYI / YLSVCVSI 
* I *CYLCVCVYI YI YIYIYI YI YFI YKY 
MKERRXi 


12643 


26544 


A 


12769 


1161 


824 


QAKD3uSTHFTKENM*MG/NKQMKRLSES 
LVISEMQIK3TMRSHLTHTRITIIKMTS 
DKICP*RCX3*MRA/liIiQC*WKYKIVSPL 
WKIWPFLKT\INMNISCNLMILFLGFY 
PIY 


17644 


XUJtJ 


A 


12770 


241 


237 


K* RIWA\GVMAHACKPSTLAGCDGWIA* 
SQEFETNLSNTGRPCLYER 


12645 


26546 


A 


12771 


326 


6 


DTVSRKNKSGKI FQLSSRV* I YERSQSG 
VKVYKCKTFGKAFTQ/HF*AHMRMYTGE 
KPYKY*ECGKFFILVLLLLLMIQKYFHL 
IKTVRLYLIRKKVSCKQPSNKILQS 


12646 


26547 


A 


12772 


413 


1 


iCKTFYFFFFETTQAGV* * CDHNS P W P* P 
P\GSDNPLTSAS *MA\GLQS CMHHHAQL 
S * FVFLEAGSHYVAQTDGDSPTSASQSA 
GITAVSHHTRPILYILIVLSYAISFFSI 
LYSAYFLQPHLPIHNNFLFSYISSSC 


12647 


26548 


A 


12773 


2 


1055 


FFFFIiRLNLTLLPRIiE/CVILAHCNLHF 
PG* SNS PASASRVAE ITGSRHQARI IFV 
FLVEMGFHHVSQAGLE/ SPDLK*SAHLG 
LPKC*DYRRDITPGHH*SF**RYFLKTY 
ISRVL/WCSLLGTFMFMIAFDHM*TW*H 
* QNHFHFLLQ I WKL I SERLRS AQGQQAL 
KTVTSTFSSKYVCRAFLAEPGWIAILYV 
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ed end 
nucle- 
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Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso1eucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, \=possibie 
nucleotide insertion 














TF* SLDSRRTCLWAVSNFFYSTLFLKL 
RQQVLLTQ/ PYHLFSARPSSOGSDKKAR 
ELNLSLSL\CFFFFGKRVSGTQAGVQ*H 
DHS SLQLWPSRLKQCSHLSLXjSSWDYRS 
ITPHLANF/CNFLSGWSRTPGLK 


12648 


26549 


A 


12774 


315 


98 


FRCFIiNS INPLES PHVRKTCAGRAQWLA 
FVIPPFWEAEA\SQMPQVIHSR+LEARS 
SRPAWPTWQKPCLKYKN 


12649 


26550 


A 


12775 


275 


59 


NPSPYKIQKISWAWWYMPVVLATR\RLQ 
* AB IAPLHSSVGDRARPCLPKKQKTNKQ 
TKTSGEPVLVQGPATVC 


12650 


26551 


A 


12776 


73 


381 


SLLNSELSS I PRPTWES PSTSDS AWPMQ 
NLASEPLPLP*TTPPPDFPALSA/PE/P 
LPFTPSPRDTLPPPLSPSYKQAPVLSVH 
PPVMPLKYVPSSPLIiLSFLYL 


12651 


26552 


A 


12777 


274 


342 


LNCVIA* WLMP 1 1 SALWBAE VGVSFA/ P 
/RSLRPAWATW 


12652 


26553 


A 


12778 


2 


478 


TIYYTKYTTFRVPNLGLQLLTAFFCVCV 
CVSLNGFKNAKDYGGSHLKYSVYITGFL 
LQFSLKFDCFLCVCVCVFP*MA*KMPKI 
M/RGSHLKYS VYMT* FLF * FSLKLKYFC 
DPI LRKR I /WPGAMAYTCNPSTLGG * GG 
QIT*AQEFETSIiAKWKPCLY 


12653 


26554 


A 


12779 


680 


453 


SETOWRKNKFFPSNFPSNLRTNSWTNSC 
SKKTFKKH/RVGLGVLAHTCNPSTIiGGR 
GGWSP+GQEFENSLTNMVNHFS 


12654 


26555 


A 


12780 


59 


487 


SLHKHPERGLQSFQGCWTQEMLGGSHVQ 
QRAWELCAPPNLTCPGYLS FFLRQDQAL 
LSKLECSGTI PAHRNPSLPS PQLKVSSH 
LSFPSSWNSRCTTPHQLLLLFF*DRIRL 
FCPSWSAVAQSQPTVT/SSLPSPQLKVS 
SHLSFPSSWNSRCTTPHQLLLLFFNFL* 
RQVFTMLPRLVSNS * VQAILPPWPPKVL 
RLQA 


12655 


26556 


A 


12781 


109 


361 


LSPWHHSRNYTLIYPYTLNFWMSAFPTP 
KEEFFFS FPETRPHS VAQAGAHW\AI IA 
HCJGLDLLGS\SDPPTSAS*AAGTTAI I P 
G 


12656 


26557 


A 


12782 


3 


204 


LIDGSLALSC*LFCGGAIIAHCSLELLG 
LGQ \S PVS ATRVAETTGVCHHAQLAFVN 
LHLRTRSKHCGH 


12657 


26558 


A 


12783 


2357 


6366 


LTGS\NSHITILTIiNI/NMGLNAPI*RH 
RL\ANWI KSQDPSVCCIQBTHLT\CRI)T 
HRLKIKGWRKIYPSPMGKQKKKAGVAI \ 
LVS DKTD FNPTKI KRDKEGHY I MVKGS I 
QQEELTIIjNIYAP\NTGAPRFIKQVLSD 
LQRDLDSHTLI\MGD\FNTPLSTLDRST 
RQKVNKDTQELNS ALHQADL \ I D I YRTL 
HP\KSTEYTFF\SAPHHTYSQNWTTIVG 
SKALLSKCKRTEIIT\NYLSD\HSA\IK 

ijclriknltqnhstt\wklnnlllndy\ 
wvhnemkaeikmffetne/nkktptyqn 

FWDAFKAVCRGKFI \AIjNAHKRKQE\RS 
KIDTLTSQIiKKTREAKSKQHSKASRRQE 
ITKIRA\EIiKEIETQKKTLQP\LKKISE 
SRSWFF\ERINKIS\RPLARIiIKKKREK 
NQIDTIKNDKGDI\TTDLTEIQTTIREY 
YKHLYA\NKLENLEGMDKFLDTYVSLPR 
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residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, ^Phenylalanine, 
G=Giycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=Proline, 
Q=GIutamine, R=Arginine, S=denne f 
T=Tnreonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














LNQEEVESLNRP\ITG\SAIVAIINS \Ii 
PTKKSP\GPDGF\TAEF\YQRYKEBIiVP 
\FLLKLFQLIEKEGILPNSFYEASIILI 
PKPGRDTIKKBNFRPISLMNIDAKILNK 
ILAKRIQQHIKKLIHHDQVGFIPG\MQG 
WFNIRKSINVIQHINRAKDKNHMIISID 
AEKGFDKIQQPFMLKTXNKIGIDGT/YY 
FKIIRRAIYDKPTANIILNGQKLEAFPL 
KTGTRTGMPSLTSPIiLFNIVFCPIVFWA 
R\AIRQEKEIKG IQLGKEEVKLSLFADD 
MIVDLENPIVSAQNLLKLISNFSKVSGY 
KINAQKSQ AFL YTNNRQT / EEAKS * LMS 
ELMSELPFTIASKRIKyiiGIQLTRDVKD 
LFKENYKL\PI*IKEIKED\TNKWKNIFI 
PCLWVGRIS\IMKMAIL\PKVIYRFNAI 
PIKLPMTFFTELEK\TTIiKFIWNQKRS\ 
RIAKS ILSQKNRAGGITLSDFKLYYKAT 
VTKTAWHWYQNSMVLVPKQRYI DQWNRT 
EPSEIIPHIYTILI\F\DKPLLEKNKQW 
GKXDSLVFIKWXC^XENWLAXlCaElKI^ 
\DPF\LTPYTKINSRWIKKKDLNVRPKT 
IKTLEENLGITIQDIGVGKDFMSKTPKA 
\MATKAKIDKWDLIKLKSFC/TICTAKE 
TTIRVWRQPTKWEKI FATYSSDKGLISR 
I YNELKQ I YKKKKKTTPS KKWTK\DMNR 
HFSKE\DIYAA\KKHMKK\CSS\SLAIR 
EMQ\IKTT\MRYHLTPVRMAIIKK\SGN 
NRCWRGCGE IGTLLHCWWDCKLVQPLWK 
SLWRFLRDLELBIPVDPVIPLLGIYPED 
YESCCYKDTCTRMFIAALFTIAICTWNQP 
KCTTMIDWIKKMWHIYTMEYYAAIKNDE 
FMSFVGTWMKLBTI ILSKLSQEQKTKPR 
IFSLIGGN 


12658 


26559 


A 


12784 


787 


926 


PQAIRRPRPPKALG*HNSVDLGWAWWFT 
PAI PTLWD YTHBPLYLAKI S * CFKKVYK 
E^VLNCIQNCPGPHAARHS/SGKTS*V*S 
NYLFFSFCFETKFCSVTQARLQWHDLNS 
LQPPPPGFKRFSCLSLPGGWDYRRLHTR 
SANFCIFSRNRVSPSWPGWSPTPDLRRS 
AVLGL PKRWDYKRE PP CPAKX IT X MAPQ 
KVNSHSSQPVSFLS 


12659 


26560 


A 


12785 


248 


285 


C*VIF*CVMILYVLIYVEKITSIRTiAVY 

t jt t //it T'l'UT OX TV* T^O/T T y /"»* T?T DPVn T 

W/ CI I TRLirVK* X Co CXd-ioC w rbcrWV 
C*LYI FI I FVFFFFFFFFFFFFFFFELN 
ILY 


12660 


26561 


A 


12786 


243 


37 


KKi»A\jxiLiVji> Uxj "byi rljKF 1 rCAUxlA. V Ki> 

N/RGETLSLVKIQKVSQAWW*EPWPAT 
RQAEAGADAWVDR 


12661 


26562 


A 


12787 


75 


289 


D YRHE PRFFFLYTFFEMDSHb v ARAGfiH 
WCDIiGSL*PST\*GSSDSSASTSR\QRH 
HAWLIFGFL*RRGFAC 


12662 


26563 


A 


12788 


83 


256 


RKMYWGL FVTAQN* K* PKYPSTEE/Y I 

Y /DT.WVT.U*PMr7VVQATTrTPKT/3T T P7VnWR 
IV/ KJjW 1 J_ul X ClCi X X OrVl JVXOAJjvjXivn\^iMX\ 

LILY 


12663 


26564 


A 


12789 


197 


499 


QSLKSVEETVFRNNKQ/SIPTFQVILMP 
WWLI P 1 1 * ATWEG*GRRSPLNPGV*DQP 
GQHSKTPSLQKKTKFS *AWWRVPWPA\ 
I GEAEVSG I PLS PERS RL 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Grycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G lutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


12664 


26565 


A 


12790 


394 


91 


FYNSTKFKWKNIQFIiITRCRGDQIQVIR 
AHAVQLGFHVLYF/CLSQNLSIYLYISI 
YVYTYRYIHTHMCIHII*YLYIYTYIVI 
KH* KBHWW I KHCFPTERK 


12665 


26566 


A 


12791 


735 


75 


FFFRQEGFLSPPKRGGVPPPHTKKYFSP 
RGSFLWGWGTSRPPPRKCFSPPPPPVFL 
PPPKKKKIISFSPTKLAPPPEYFKSPPP 
PS\PPPPSSPSPSTFFFFKNFPFSPSSF 
S SPPFFFPS PAWEDP PVQSRRI Y * FLPP 
PF* I PEI PR\KVLKKKPGEGRSWGLFPP 
PKGPKRRVPLGPBFKTRLGHKTKPRFFK 
KKKKKKRFLISCLWWHMPAI PSTLGG 


12666 


26567 


A 


12792 


482 


342 


MGVFVFFFETGSHSS VTWAGVQ * CNLSL 
LQP/PASSDPPTSDSRVAGST 


12667 


26568 


A 


12793 


405 


169 


F YKKKKTGSGLALS PRLECSGT I IAHCN 
I*ELLGSRSLRL\SNTPrLAS*AARTVSV 
CHQTHL IF* FFVETGS CYVAS A 


12668 


26569 


A 


12794 


270 


94 


YVCYSYKQIKICSHVKRRE*NTFKIYLL 
KYI ALGLTGHFFPL * TFSKTVTLWPGAV 
AHACNLS PLEGQGRWI I /R/GQE I KTIIi 
ANM* IPVFLHVNI FLSVCSYS IHTTLDY 
LFYYSCYILISL 


12669 


26570 


A 


12795 


284 


415 


NFIKI I KKKVMGLFYGKTFLLNLKKKKR 
KKK*WPGTVPHA*NPST\LGGLGGWITR 
GQEFETSTLATN*NPTKSISRGF 


12670 


26571 


A 


12796 


1387 


32 


APSS FAI RS FFSGPMNAFFSSMVWKRP W 
PNLEWSMNLRSI FSRARRFVCTSKDIiR 
RVSTRFLVHIiTAFLGYKAGMTHIVREVD 
RPGSKVNKKEWEAVTIVETPPMVWGI 
VGYVETPRGLRTFKTVFAEHISDECKRR 
FYKNWHKS KKKAFTKYCKKWQDEDGKKQ 
LEKDFSSMKKYCQVIRVIAHTQMRIiLPL 
RQKKAHLMGDQVERGAPVPEKAD\WAPR 
EALSS KVLVTQ VFWAG I K* SNF IGGDPR 
AKGYKGGQPVCWAHPRKLPPQRPHPRAL 
RKGGPVLGAW\HP \ARVA\FS VARRWGR 
KGLPFHRTE INKKI YKIGQGYLIKDGKL 
I KNNASTDYDLSDKS INPLGGFVHYGEV 
TNDFVMLKGCWGTKKRVLTLRKSLLVQ 
TKRRALBKIDLKFIDTTSKFGHGRFQTM 
EEKKAFMGPLKKDRIAKEEGA 1 


12671 


26572 


A 


12797 


86 


364 


EQDNRIFFSFLSLFFFFGETEFGPFAQG 
GGQGAILGPLKPPPRGL\SSFPT*GSQE 
VGTTGAPHD I FCF FNKKGET PRLYKKNK 
NNRGGGATP 


12672 


26573 


A 


12798 


101 


440 


HCSRYATGIFEWYSGILVLILLRFGIYE 
PMNLNIGDPYDSPPSRLYKMRLWERMAK 
SLIEVSLKHSHFWLGMLAHACNPRTLRG 
IGGRIA*TWGF\KTSLGNIARPHLYLKK 
I 


12673 


26574 


A 


12799 


364 


195 


NPPGDLGEETLLWGGEIMGTTPP* FPPP 
KEGFLPQDPPGGFNSPPI/QGRSFSFPP 
PGKFGPPQGFFKRPPPFFFFLNNNQ 


12674 


26575 


A 


12800 


86 


489 


PTAMAEEG IAAGGVMDINAALQELLKTA 
L I HDGLAHG I RKATKALDKRQAHLCVLA 
SNGDETVYVXLVEALCAEHQINLIKVDD 
/NKKLGEWVGLCQIDREGKPRKWGCSC 
IWKDYGKESQAKHVIRENF*CKK 
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peptide 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
£>=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, l=IsoIeucine, 
K=Lysine, L=Leuctne, M=Methionine, 
N=Asparagine, P=Proline, 
rv=£*ltriamine_ R— Arpininc &=SerinB_ 

^^JIUUlIUlllV) M.X 4>l £^1J-I 1 IJ w, \* uVI 111!*, 

T=Threonine, V=Valine, 
W^Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 


J 20 fJ 


Jxo fo 




IZoUl 


(LA 


711 


GNHKKSNYSLAPWW YTTWPAT * EAEAG 
GSUE \L* CAMITPVNSHSSLA 


12676 


26577 


A 


12802 


300 


3 


NNVTTGSNIQFHSFPLIRKDICVSYIKN 
DAWNYI KNSNPNGETQVYQ* YSFSN*NL 
KIWLGKVA\NPSTLGGHGGRIA*A*KFK 
TSMGNIVTPCLINKQK 


12677 


26578 


A 


12803 


208 


394 


SQHFERPRREDCLS \QEGQYSEIPSVQK 
NVKI S WAW* CMPAI PATREAEAGGPLEP 
RRSRLE 


12678 


26579 


A 


12804 


183 


348 


AFILGKGKYEP YNPCLGWA* WCMPWLA 
SWEAEAGG\SLEPRSSEL*CAMLIGCLH 


\ 2.0/7 




TV 

xi 






lift 


ETKF P FCCPGWKWGDIiGSOOPLP PGFKR 
FSCFSLPSK\WGQ*SLAFVGKGCK 


12680 


26581 


A 


12806 


1 


293 


PTRPRTS YEKQGYLLLPPVFS I VLEVLA 
RAISQETEIKSVQMGKEEVKLSL\FVCV 
YI YA* EKTPVESTKTVDANLFDTRDWFHA 
IFPQTVGEVGMVLG 


12681 


26582 


A 


12807 


297 


281 


AKNPRRQPREIPGGVFPTPGFPPYFKAK 
MPEGPFPGGFFFKGGVLWGTPPPFFFFF 
FFSETESRFVTR\IjQCSGAISAHCKLRL 
PGSCHSPETC**R*G 


12682 


26583 


A 


12808 


255 


59 


AGCSGSQLFGRPRGVDHLSPGV* DQS VQ 
HGEILSTKNTKI SWVWWSVP WG\EVGR 
SIiEPGR*RLE 


12683 


26584 


A 


12809 


247 


310 


PLHS I LGTPTTP APT PGKPLL I RTS I LY 
* KPILSQAWWCAPLVPATQ/EAEARGSL 
EPRDSRRS *AMIKPVKTSHCTPSWALPRP 
LPQPLESHC 


12684 


26585 


A 


12810 


101 


477 


IGKEE I KLLLFPDNLMEYTVNF * MYKPL 
EhXS *FSKVTEYQVNTEK/SNCI yt*ql 
QIENEIAKTI*FMIASKSIKYLKISLTK 
CN\ KWRDILCICIGRLSI I KVLVLPKIiM 
CGGEKI FNPNPTGFW 


12685 


26586 


A 


12811 


1022 


696 


CFFVL FFKEMGSHYVAQAGVRWCDHGSL 
YPPTP\GSSDPPTSAS*VAGTIGAHHHT 
WLIFKFFVETRSCFLIQAGLKKLASSYP 
SQPRTSPKCIjGFTDVKSLHIiAWIiCP 


12686 


26587 


A 


12812 


166 


143 


RKEERFQINNLGFHVEKLEKEEQI \NPK 
TNRKKKI IKTRV* INE IEYRKTIGK\ IN 
KNKSWFF* KINKIDKPLAMLDSGRRDSN 
Y*NITKS IEEGRRRKEEAGKRKEEGRKR 
RRRWRQRWR 


12687 


26588 


A 


12813 


163 


423 


KAFWEFINHKKTGPF*RGPIGKVSPPGG 
NLASSI * KG/DPFPSPLKGPQGSSGGPP 
PKKPPTGFNLAFGGLRGRETKRGPFFLA 
GLLP 


12688 


26589 


A. 


12814 


2521 


2842 


IOTMYFPP* EMLVGWAWWF/RASNPQHF 
GRIjRQADRLRSGVRDQSGQHGETPRLLK 
NTKISWAWWRAPVTPATWEAEAGESLEP 
GREGCSEPRSCHRTPVWVTRMRLY 


12689 


26590 


A 


12815 


173 


395 


LDQPGQQRETI^ LLKXECQTNKQTK3CKNG 
FKKH'L*LTyrwVHTY\WPTPTiGGRGGGFF 
*GQDFETTLANMGKRGFY 


12690 


26591 


A 


12816 


43 


346 


CVYVS /GVOTCTCVCVCLASVCMYVCLH 
L*VLFGIFYV*DHNICKXiRSFCFLFKNT 
FSHVYF*KIIWPGVEL*VTVFLFLSSLL 
HTSCVYEKSYAILIFSAL 


12691 


26592 


A 


12817 


647 


931 


SQHFGRPRRADHLRSGVQDQPGHHGETP 
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N^Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X^Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














PLLKI \QKLAGRGGA*L*SQLLGRLKQE 
NRLNPGIRSCSEPGSHHCTPAWATE *DS 
VLKKKIGADKY 


12692 


26593 


A 


12819 


392 


2 


PLFFFFFFFLETESHSVTQAGVHNPPGP 
SKPPTSAP * AAGTTGMHYS AHAQ/L I FF 
L* TEL PS CRPGCS AMAVHRHD P FTDQHG 
SASFLIWASSPLLRQTGDSSLIiGGHHVD 
AKFSVDTQLALCTKAQNS 


12693 


26594 


A 


12820 


1183 


870 


DRVS LLTP RliG VAVARNLAQWKPLP PGF 
KRFSCLSLPSSWDYKHTPPTPG*FH/RG 
*FFVFLVETGFRHASQ\AGLNSLPQSDP 
PTLA/ FPKCW\DYR\HDHLAWPRKM 


12694 


26595 


A 


12821 


172 


298 


I YLS IYIiPI / YLCIHLSIHPS I YPYLYL 
SINPSYWVSFSREL*LT*L*YLSIYLPI 
IYVSIYLSIHPSIHISIYPLIHPIGFLF 
LENFD 


12695 


26596 


A 


12822 


51 


254 


YDFLKKHLLSSYRAKDLESLFLLHLHI F 
SGN*SVSYICVCVCVCVCICVCVYICIL 
IKSWCS*S/CGVL*AICAGVGWC*LIFG 
VL 


12696 


26597 


A 


12823 


199 


668 


QPLPAPS PSAQKGRAHPTPS PARPQAE / 
SLPSQSLGTRTRPPLGRPP * SP PWGQVD 
LWGWLGPSPTLPAPESLSPASLSTEWL 
CGTRYLVWGRSGAGQA 


12697 


26598 


A 


12824 


33 


442 


GRGKLLKKSQKQRKNALLKYDS KTRVGT 
LFHWTSVPVTRPLLL+INIRLIKLDVIK 
CSIHKIFHSEFLFFKQHSGL/ CI*LSK* 
LCS * KIHQSGLGTVAHVCNPSALGGQDR 
RIT*GQEFETN*VNMVGPCLYPPLPP 


12698 


26599 


A 


12826 


363 


75 


HWEPRPAGPTLPSAPGSPSGRPCPPHTT 
SG/PPPPNTHTPMSRSPWSWEGSDQRPQ 
PPHTDTGS PPN* GSRLRHERPP WGRERQ 
RPPPTTLVRPLHSH 


12699 


26600 


A 


12827 


76 


486 


CTVNLFI YFEKBS CS WQTGVQWCDLSS 
LQPLIWFSCVPTQISS*IPMCCGRDLVE 
GP*MMGAINSFLCCSYDSK* VS /ARSDL 
FFFFFFFF 


12700 


26601 


A 


12828 


345 


672 


KMKCTADLSLLEKDT* IKLEENR*QERI 
MLRAEVNEIENKIENINKTGGSSSEKYY- 
KIDNPLTRSIRRKVN\KITS IRNEKGSV 
TTNPTE I KRTVKE Y IMKNFMP INLTT 


12701 


26602 


A 


12829 


208 


1 


PSNSTPI FKRNENICPHKDPCTLMLMAA 
LFVMSKK* KQLKCSSNDE\KLWYIHMME 
♦YIAIKRNKLLI 


12702 


26603 


A 


12830 


649 


172 


FPLKYSLSTTSVMAPSFTSCGGSQQVLL 
APLSRLLAWPNV/HPPPQPRHPAPHSFL 
SPPRSPPNYSP*SSQGPLLQPQSKHVRP 
LPQANARLPSAKNPARGPPGAAQAPRDR 
PPA/VCPPLPPHSPLRHERAPRLFLPLP 
ATEPAALHAVHPGRRQERVT 


12703 


26604 


A 


12831 


1068 


667 


KLSWFFCSRLRLHFGS IFKVPSL/PVPK 
LLLPIPSLCPSLPLQLQPAQPVPFNTAT 
WLiUru*/ P*KV*PAPGYQRVGHFNFFPL 
QIiNFIPIILQ*HLGF*KSASLSSSSSPK 
KVSLCHPGCS AWRS AHCNLKLLI S *SA 
CLGLPKC * D YRLQS PRP P/PT I FKVTKM 
CLLK5LTGSVSF 


12704 


26605 


A 


12832 


268 


1 


KCFFLLALNWPEFILDYFILNW*NSLRT 
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VTEKFLF>rrLFLTY*NMNRPGMVAHACN 
PS / TIjGGSGGR I T * AQ&S * x oL»vjN 1 Vi P 
CLYQKL 


12705 


26606 


A 


12833 


654 


321 


LVGIRFIiKVKYFNIKFPHLLAI ISPSLE 
VFNLQHEYIYFIiLFIYF*ETRSCSVA\R 
LECSGVI I VHCSLKLLGSSNPPALASQV 
AGTITAHHCAQQHFCILEKSNLCTYFPL 


12706 


26607 


A 


12834 


867 


201 


RQMS PTLRTKSRDVASRS S FS AWAWGKL 
FTFWDFSMSLVSLQGPGGPFHS PQP * TQ 
GR* VENNQE PLATPFPGPERS LPGQPRL 
EPAHSGKRRGLPLEIRPPVNVSDIHGDP 
SGAQWLVRGVEAG P WAVGGQAP * SEG P S 
PSPPLDVSPDPGSSLPRSSSP*MDLRA* 
LSLTF*TFQKGSGN/DPASSPA\GKPG* 
EWRKIKGPPQGLLIjGPEMWGSQARVLS 


12707 


26608 


A 


12835 


328 


1365 


YAFCLMMLDKKQ I \ * AVFLFEFKMVHKA 
AKTTQNISOTFGPGTANKH7VQWWLKKF 

ckeesledeerdmghgkvdndpi*ep\y 
*tt*kiteelsvdystvirwqhskqig 
kvkkldkp^helsgnqnyrfev*ssfm 
iirnhnpflnrivtcnek* ilyn/rpaqw 
l^prgqlqsqfpkpniiapkkkrslvhw/ 

SPVGLLAPILDPTYSFLNP /GGETITSE 
KLCSGKLGKMHGKLQYIiLPALVNRKGPX 
\LIiHDNT/RDCVFAQPVLQKLiKEIiG\YK 
VLPHPPYSPDLS PSDYHFC* HLDNFLQG 
KH / SQHGAENAFQEFVKS + STDFYATGI 
NK/LFSHWQKCVT3CNGSCFD 


12708 


26609 


A 


12836 


7 


328 


RRERERERERERERERERERERERERER 
ERERERERERERERERERERERERERER 

i mem i us i ii u jiiwrmm ji mi mil -m mrxnT/ri m 

ERERERERERERERERERERARV* E KST 
RYKSPPPRIPPLRR/RRV*RKHPHARAL 
SFFLYKKCTGGERAPSPVCVGGKFTW 


12709 


26610 


A 


12837 


219 


350 


FifNUTzATWoll 1SN vQvC* TlRAUul ii> 
t,IYLFEAGSCSVA\RLECSDMIMI*HCSL 
DLPGSNNPPISAT* I 


12710 


26611 


A 


12838 


189 


359 


LGLDGVMRVGPCRALGPS * EEKS SRVQW 
LTPVI PA/ LiLBAEAGQS PEVRS LRPAWP 
TW 


12711 


26612 


A 


12839 


368 


77 


l^PNFJjKJJTiSllFrotjfc'.fc'J. XiSFF \ JjKU w A<J7 
GFINPGF*TPPGYMGKPPFFLKYSNLPG 
LAAPGGCSPFPGGLGRKI SFTPEMEVS I 
NPGS PLSLPPGEQN 


12712 


26613 


A 


12840 


16 


173 


KTDVHSKTCTKLFTAALFLIVKKWKHLK 

FF* VI// J\P1W XrJrlx VIS 1 1 ioflJL t\o 


12713 


26614 


A 


12841 


425 


45 


NSFVFFFFLFENRVTiTPYAPRIiE \CTGV 
iJj/AFJlt-llj£ l iUjr'J!ir l^rljvFdiUjdjooW 
DYRPVPVMPWLNFCI FVETGFH\HVAQS 
VLKLVS LKL/PS PTWDVPKC * D FRYSVR 
CGGLFF 


12714 


26615 


A 


12842 


237 


375 


LLLVVVY iJjKK-bW \iK»VVAHACNPb iliG 
GQGGWIL*GREFETSQVNM 


12715 


26616 


A 


12&4J 


21 


•IOC 


PG/SQ/DNPPTSG*VAVTTG/IWHHARL 
IFVFFCRDGILLYCPGRSKI IFSSCIRE 
LFKKI SDFFLLTMbl FCNSNKG 


12716 


26617 


A 


12844 


369 


58 


PISPLQFMLPFLKNQSPYGPFFWKKKNF 
PPPFLGGAVLKTPGF*NFLFKKIQRGFP 
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PFPKKKKPRERPKPCPPKFG/ S PPFFPY 
PGFPPPKKGPPPPFFFFFFF 


12717 


26618 


A 


12845 


211 


409 


NALKPKIHFFFSGGGLKGIWGNTLLDI 
GLHKTFFF\*GDFYYAL*I *AENALFGG 
GGDCAPHLQVK 


12718 


26619 


A 


12846 


200 


33 


QKTNNTKCWQG WQATGTLI HF * GE FTRV 
QSHWKIVWKFL* / SLNILP/PSSSSSSS 
SSSSSSSVKNLCLHKACTWMFIAALIiliL 
AKTWKQP*CE 


12719 


26620 


A 


12847 


416 


125 


KIPTRPGKGGPPLYPRPFRGIiNKQIGLT 
PEFGTPLGNKGKSFPFFQKNKN\NWPRG 
GGP IGPPS PGG* VGGLTLKGBVSTKLDS 
PPAPPPGGQKWPLP 


12720 


26621 


A 


12848 


85 


397 


KIHIFFIFLVSLLKGIiSFLLTFSKN*VF 
/SFIDFLILCVCVC I EFYCFI L I LFHYF 
CLFGFCLFLMFVS * CQSLDFYFNSF/Y I 
F* YKI QCYKF/CILI F 


12721 


26622 


A 


12849 


1 


246 


RPRRHLVASLVLTQLNGSSPLPIKI ISW 
AWWLMPVI PTFWEAEVGGLLBPRSSRQ* 
*AMI\SPCPPAWAT 


12722 


26623 


A 


12850 


161 


1 


NKTTSSQ I W* PMP I VSTT*EAE VGGSLE 
PRS/L/KLKCAMISPVNSHCTPTWTTK 


12723 


26624 


A 


12851 


65 


244 


STYYLHSLPFLDISYKWNHTTC\SRYFM 
GALFIi IAIK*KQPKRPSADEQI *T/IMW 

yiytleydlaikrneilihaatwmnlkn 
iklskrsqtqkt\hw* fhl 


12724 


26625 


A 


12852 


235 


34 


TTIAVSVLIIjKEIiMEP*TLQPDFLG/WS 
L ITES VDHVQWHAPVI PALWEGEVEGLL 
EPRSLRPAWATW 


12725 


26626 


A 


12853 


168 


453 


CQLRGASGTQGPGLSESQCQQPCP/ PEC 
RPSKPRPCGCRI *SPARTSPQPSPAASM 
ALPTNERTPPPALKW* PK/ CPKQSPQSA 
KSKSPVKSTERTAK 


12726 


26627 


A 


12854 


244 


3 


ELQVYMHI * I YI/C*S I YTKANIYIKYM 
HMCVYIYTHTHTHVCIHLHIYLLRHGFG 
CWLMLGKVRGCEGMAGLMKAQQGWGW 


12727 


26628 


A 


12855 


376 


428 


KWE PGGHTS FFFFKGLVLNFGRGVLQKT 
RP*GGGQGGKFNPNFSGPREPPPP/PPP 
GGGKKGPPPPPGVFVFFLEKGGSPILPR 
GVLNPGAPKGVYTQRGG I KSGNQGATPL 
SFFuKVWF 


12728 


26629 


A 


12856 


129 


395 


APPNTHPFLSEAPQSLSLEARPPSAPSN 
SSSR/PPVWRASQTHPPQDWSSHCSPPW 
GAPPQI*PFSDSLAHPPQDWSSHCSPPW 
GAPPQI * PFSDSIiAHPPQDWSSHCS PPW 
GAPPQI 


12729 


26630 


A 


12857 


363 


97 


GLATLRRLVSNSQAQVIRP PQPPEVLS L 
PKC*DYRPGRPPASLIHVNSRQT* KVAP 
/CSKPLAAAAHQGSPGATETTRRCPPSR 
CGPIF 


12730 


26631 


A 


12858 


322 


361 


KSEKQS*VMIAVCTLDMMKMT\FISVVF 
IiPKTHN/LMSNYKKNTRQIPMEGHSTIY 


12731 


26632 


A 


12859 


88 


417 


HFTFFFFFFFFKRGGGFFSPRLKRLGKN 
FFFIiDPPPPG*RDFSPSPFKEGGF*KP/ 
SPPPLVFFFFF*KKKGFPLGAGGF*TSG 
PGETPPLYS PEVWFFYGGTPGPTPFFF 


12732 


26633 


A 


12860 


217 


2 


LILINKLANMHCFLFVFETSLT*AGVQW 
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HDL/GLLLRLAPPRFKQFFCLILPSSWD 
YRCALLHPATREDEA 


12733 


26634 


A 


12861 


375 


145 


LS F FFF F * KPGS CS VAQAGVQ WHAHS SL 
QLQTP\GFSDSLNSAFSVA£PWVHCHT 
RPLSPFNICWRARPKQKVFL 


12734 


26635 


A 


12862 


525 


852 


NIiLSIYFFi2ii.isSRSYAQAGVQWCHIjGSIj 
QPLPPRF* IKKPSNTAIjFM*NIaLSIYFF 
ETESRSVAVRIiBCSGAILAHCNLCLPGS 
SDSPASTSRVAGVHHHSHLSSWDYSCAP 
PCLAKFFVFF*VETGFHYVSQDGL/DTS 
♦PQVICPSRPPKVLGLQA 


12735 


26636 


A 


12863 


3093 


204 


EPDKTGPVLWKVGGGARVPGMAETLSGIi 
GDSGAAGAAALSSASSETGTRRLSDLRV 
IDLRAELRKRNVDS SGNKSVLMERIiKKA 
IEDEGGNPDEIE ITSEGNKKTSKRSSKG 
RKPEEEGVEDNGLEENSGDGQEDVETSL 
ENLQDIDIMDISVIiDEAEIDXNGSVMC 
VEDD\DADNLQESLSDSRELV\EGBMKE 
\LPEQLQ\EHAIEDKETINNIiDTSSSDF 
TILQEIEEPSLEP\ENEKILDIL\GET* 
RSEFVN* ESSELEQPFAQDTSSVGPDRK 
LAEEEDLFDSAHP \EEGDL\DLASEST\ 
AHAQSSKADSLLAWKREPAEQPGDGER 
TDCEPVGLEPAVEQS /SVAASELAEASS 
EELABAPTEAPSPEARDSKEDGRKFDFD 
A\CNEVPPAPKESSTSEGAD\QKMSSPE 
DDSDTKRLSKEBKGRSSCG\RKFLGLVG 
LSSTTRATDIiK\NLFRQIWGRVGGAPRL 

wtnars pgs favyg fvtmstaeeatkc i 
nhlhktelhgkmisvekaknepvvkkts 
\pkrgsdgkkekssnsdrstnlkr\ddk 
cdrtddakrcddesveksk\dkndqk\p 
gpsersratbcsrksrgpkrtw\mdksk 

\GVPVISVKTSGSKERA\SKSQ\DR\KS 
ASREKRSWSFDKVKEPRKSRDS\ESHS 
RVSERSEREQR\MQA\QWEREERERLEI 
ARERLAF\QRQRLERE\RMERERLERER 
MHVEHERRREQER I \HRE\REELRRQQE 
LRYEQERRPAVRRPYDLDRRDDAYWPEA 
\ KRAALDER YHSD FNRQERFHD FDHRDR 
GRYPDHSV\DRREGSRSMMG/S IREGQH 
YTjERH \ SDPEPH \GQDS LiK * Jjia Wu x xsJj * 
Q*RLS\EGRGLPSFPPGAGRDWGDTWLE 
DEDDPVMGKGTAERGPMMDR\DHK\RWQ 
\GGERSMSGHSGPGHMMNRGGMSGRGSF 
APGGASRGHPIPHGGMQGGFGGQSRGSR 
PSDARFTRRY 


12736 


26637 


A 


12864 


2 


456 


X HI XjSGSEGlAjSaCKjJ^KljiNori IbKlfoo 

SSWARGDNVPRPPPPAVCSWISEGDVQN 
PGLGEAGAGSSTPGDGGELRY/WPGLLG 
A*GRGIGGDDDDLRTLGLAGVQQCGKG/ 
RRGPRGPGRGQEPRRPRDLGLRGPRRAQ 
GAA/SATAAPPP 


12737 


26638 


A 


12865 


388 


1 


LI KMTKVKKTAITKC* RGWGRTGAFIHC 
SWECKMIQPLWQTVWQLLKKLNIYL\WK 
QPKCPU4/VKWIKKIiYISMDSYSAVKKK 
ETIMYTTA*MTLEVAIiIiSERSQTRKGER 
GHIIiWDS I YINFQKMQTNL 
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12738 


26639 


A 


12866 


198 


3 


KKQKQTKNPPKIS *ALWCVPWPATWGA 
KVBGS PE\PRRSRLQ*AMITPLHSSLCN 
RVRTRGRTRG 


12739 


26640 


A 


12867 


415 


973 


NPVNCAQFSLLPTTEYMGHRVEGATGHG 
TCPSVPPNTHLHTGWWCLQHHRSCRAWG 
RGGSHTHRCPQRVPDGEHP * IYIAVHGE 
PP/ESPPQPSPLSCCPPQGNYIALREPP 
QGLPLPGTLPSHPPPFWHI CKTHSSES R 
HPPFPGFCGLELEKGVGDFI IHPPLTLP 
KPLPNPLKPEPTPTPQTHI 


12740 


26641 


A 


12869 


715 


893 


PCVAPHSIREERFRLTSESTNQKVLWWG 
ID*T/VILILTGNWQKRHLKSFFLGQKP 
GGMAP 


12741 


26642 


A 


12870 


342 


58 


KKKKPKENEQS I WDMWDNFRTS DI * TTG 
VLBEQFRE*GRKIIFKEIVRENSSNLTK 
/ Q INLLTQEV* TTT/HKNMNKS TP IHI I - 
IKLMKTTTKEKI 


12742 


26643 


A 


12871 


110 


401 


S FYSVMILKASRI VI CLFT * LYAYIHTL 
YI YTLCI YYAYYVNI YICTHI \NIHI 1 1 
HVYTYYIVYASIYYSMYSAWYNIKVSKD 
RQTYENIAIiITYY 


12743 


26644 


A 


12872 


439 


256 


S YFMRFGGDKI S KLYQFILSRRWKQSTC 
PSMDEWINKMWH/ITIMEYYSAK*RNEY 
SYMRHE 


12744 


26645 


A 


12873 


356 


43 


GTPERHSHSHSQGSPIIGAFMGIRASFV 
YSUiSCVLLMLSPQRVIiFYFSAFKQQQS 
\WSGKVAQAYNSNTYGKAGSIT*GQEFK 
TILGNIARPCFYKKKFLANS 


12745 


26646 


A 


12874 


341 


205 


REIKKKVLNKKFFSLQFLGKKKNFWEFF 
LKKKKKVFKAPPVFQF*VPIF/CSF*KK 
KVFGFFFSPKI FFFFFFFFFFETESTSV 
TQEENGTILAHCNLGLQGSSSSPASAF* 
VAVSQDCAIELLGDRGRLCLEKKKKKKK 
KKNFGGKKKS KNLFFLKRKKMGT 


12746 


26647 


A 


12875 


474 


206 


PIRANGPVFSPPPPPPAPGTCPHPLVIM 
IiSRHLRQSHPHTTVQPWTSGLLPS PPA* 
LSLETQHFKTAHFI FP*H*SSGLGPGWG 
S PVGEGPG/ HPKLCLQPAAAAEAWPPRE 
GTKARRHEGLPPAPCAWGPAPRDAGSRL 
EFPTLPTI PVSQSPGR*HGAGGRPSCLR 
AFVPSRGGQASAAAAGCRQS FGDLALLP 
LGSPILGQGQNFSARGR 


12747 


26648 


A 


12876 


257 


467 


YRVFTLLPRLE YIMAHCNLDLGSSDP/ P 
QVAGTTLK* LSRASRVAGTTGVHHHSWL 
IFK/YFL+RRWRRFSL 


12748 


26649 


A 


12877 


867 


1409 


GPTVIiVGGQDWNSVLQQPRAHSFTPRGG 
AGGLGGGYVLHKPIPFSQELLGAFPCQK 
QYRVPPPLEAQLLRQARSPAFIY*AAVL 
CQAQGPSPS PCS PMVS PRPSGRAPHLGA 
APQSGQKALASTSSPDAEPV/SVG*WCP 
SNPALQGGQNGAE PRAVS ITS GALS LYD 
CFLDGWWEGRSLGP 


1Z/49 


26650 


A 


12878 


1416 


740 


IRNPRWVQLDVFAAAPMGRSS PLFLLLF 
SFLKTVLRGSMIPSTVPS/FHKGPQACL 
YSLSLRS*SQAAPSPAPSSLPTPSPHPG 
LTQPGRGRRKAD IS /CSHSWAFYVP/SL 
PHLPWACPLSLPQPVLFSILFCFVLF*D 
GVLLCHPG\WSAMARSWAHCNLHLPGFK 
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nucleotide insertion 














QFSCI*SIiPSSW\DYRCMPP\GDG*FLEF 
* VEDGGFTI LVRLI WNS * PQVIHLPQPP 
KVLGLQA 


12750 


26651 


A 


12879 


961 


31 


PSPVRDSCVAWAGERRGRLGGSGVWRSH 
ARGPGLMRGLPASEPAPCVLGSRPCFRV 
AFETPWTSPASCMFERCPLWRPPALVAA 
GLDPFLLSSTLTIAVCPRDLIRSWKKGC 
DVCDSRNCWSTHSRPACRDAILAPILAG 
RGECIGIiWG/AAAACTGRV/TPVSGLTF 
STVFDFSSARG/ V* GGLPRSTGGPRVRG 
PGQPVPLPGLPAGVALAPLEGRPQEPAP 
ALSHSLTGSLFLGAWLCGCTRHGRAPVL 
LEV S I VACGQIiCEAAPQ I F I Q PCxLQQLiP 
TPSATSPAYLWVDVLLTLPSVIKQKRVI 
SFLTKE 


12751 


26652 


A 


12880 


232 


28 


PALWVPWARFDGMEETGQNHWSKEKAW 
PGAVAHTCNPNTLiGG * GGH / Mi * G(J Kr fc. 
TSLANMAKPRIPP 


12752 


26653 


A 


12881 


331 


1 


KNGPFFKNPPEFFPPPKWGVPPNPTPYF 
PWPEPIF*LTPGPLKKQKIPFQ\KGEIL 
GWGGKMGNPIjP rWKWGGP PKJEo ivbo FKr 
YKTPPPI FKKKKKGGSSRSRTSPRV 


12753 


26654 


A 


12882 


370 


287 


VSIIAGCOJVAMIDlSrrVLVHKEVKILRK 
*IIKK*CI**FRGHSK/HNFSG*AWWLM 
PVI QHFGRPR* GSLEARSLRPAWATWQD 
PHLYEK 


12754 


26655 


A 


12883 


656 


1168 


EPSRGVWPHEDARINGSKKKKSKDKKRK 
REDEETQIiD/lVGIW*TVTNFGEISGTI 
AIEVDEGTYIHALNNGLFTLGAPHK/ER 
IALKPGYGKYLS INSDELW/GRSDAIG 
PRBQWEPVFQNyEA/RNGGPAJfiMUEJts KKX4 
GTKWREDTDHTSFPLFPSTGGQPKAHSN 
WRKVCH 


12755 


26656 


A 


12884 


205 


452 


ASGSAHRFRTFSFETGRVLDKPSQTDHT 
NRKRGGPNVK* SFL/ WLGAAAHACNPNT 
LGGRGRRTA*AQEFRTSLGNTVRPCLY 


12756 


26657 


A 


12885 


112 


290 


KYNI IQYNIVFS CHDPLVS FNLFFFFFF 
GKGGFPLS PRLKARGRVWG YIiNP PliPGK 
RNFWAPPPQGFEIKGLPPP*SPPFGGVG 
QAKNLGPGFQPPPAPMGKTPFF * KNQNN 

GLNNPKPSPGPSTWGKGGNPLSQKKKKK 
KD 


12757 


26658 


A 


12886 


406 


190 


TRSLARGGFGRPLQKPYWNEQSFIMAKK 

\ 1. iLsJ^ir iSX^ ir rt JLUlS.f* V In ra*! Wo X J_i i V v» " x/ift 

MKRRDVLTCATITCR 


12758 


26659 


A 


12887 


440 


160 


RVCVCVCVCVCVGVFVCFALWLSSLIYN 

♦ T7VT ITT TTrtTNTT Y"*'D ATC! CCVCT "MT T* CPCLI 

* Villi; lir VHNiAjK Vor 0 X r liW ul C r iSri 

VNHV/HNYVC^IQNTKLYLKSYLETFH/ 
CPC*FPPILLH 


12759 


26660 


A 


12888 


201 


411 


HIWDVILIRHALGNFLTSISCYIVKIWL 
DAVAHTCInPNTIjGGQGG/WIT*SQEFKX 
IIjGNIGDPLSTKKNK 


12760 


26661 


A 


12889 


138 


464 


sctmnpsemqrkappwrwrhrs*apsth 
kmnrmvmseck^pstkkaepptwaqlk 

BOiTELA/TKKKSLENTRGTQTSKNMLFA 
ALMXVSTGCAGVPSSSKBTATIEDKP 


12761 


26662 


A 


12890 


146 


366 


FSVSMKYFV* IH* FYHL/KLLLMVYL* Y 
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X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possib(e 
nucleotide insertion 














GFLAINTSQNISCSQKIMKLHIi*GRGVV 
AHACNPGTLGG*GEWMA*AQEFNTGI,SN 
TAKPYLDF 


12762 


26663 


A 


12891 


368 


50 


KGRQSGVSSFPVGTVTNYTNLVAHNNRY 
LFPHS PGCQTSEVKVLA\GCSLLKASRG 
EGFFPLPGSGGSKCPLAYGCITPISASS 
CDLLLWVCVFSSYLAY*DTCHWI 


12763 


26664 


A 


12892 


227 


456 


KLTTLLKLFEKX IEDRTLSKSFYEVS IT 
*IPSSSSSSSSP/ISLINTDAKKINKIP 
RNRIQQCIKKI IFHKFSTGI YS 


12764 


26665 


A 


12893 


461 


89 


IYQNIiSRKPDNIKCW\*RCTEIQTLIHC 
W*QCKIMQLLWiaiWQFL*/EIiN , rEVPY 

dsiplldmy/ pkktec/ytstqtliycm 
npieyysaikrdkvlihattwmnleni/ 
iskrrqsqkstycmipli 


12765 


26666 


A 


12894 


222 


185 


RI WQ Y1»HLNLGACVCVCVFVCD/ CVCL 
CV*MCVHIHIYISTHTYIHIYVCM*ICR 
LS**CEFYS 


12766 


26667 


A 


12895 


186 


166 


I FFFFFEMGSHS VTQARVQWHNHSSKQH 
PPP\GASDPPTSAY*VAKINIFFFF 


12767 


26668 


A 


12896 


55 


485 


TCWDCRDEPPRPAI1ESVFLTSFL1EDEFS 
IEIADLVPHPTPCSHPLQPP**VIIYFC 
S\KPLVTGSSSPPRKPFPSLHEPKSASL 
*PPSSTIQPVRSSPKG*PSPPLA*VQFP 
KGMTI PTLCLSPVLPVPSTIPLPNVSKP 
LPFY 


12768 


26669 


A 


12897 


923 


312 


VDVI RKVQIE ITLRCHLIHVRIAI IKKT 
RANKCW*GCGEEGALVHC*WEC/NYILV 
QPLWKTVWRFLSKLKIELPYDPAIPLLG 
ISPKGNEII/CCTPKFIAAQFTIVKIQT 
QPRCSSMGKWIKKL/W/HIERSIDR*LE 
YYLAFKKKAVLSFAKTWIDLEDIMIiNE I 
SQTQKEKYCVISLICX3IKKKKVKYIEIE 
NKTVITKVRIRWRK 


12769 


26670 


A 


12898 


320 


3 


ESWRWGRIVRPHSVPGETWLCPVPPGPP 
NAPDGSFPLAICPVPPSCRSL/CPP/DD 
CLASAGV*GGFPLPRTEASPHHSWACPQ 
EQPWPPQCKLKVGAPNSTALSEGS 


12770 


26671 


A 


12899 


138 


447 


I FVFHYKTQNI YNEDTLHVI I INLWCYP 
GSYIKSTGRLRPEVERGLGPTTMCC*FS 
SIKNIFYF*KLKSYRKL/WLGTVAHAYN 
PSTLGG*GRWIT*GQKFNRD 


12771 


26672 


A 


12900 


279 


1 


INQS IHNNIKSTYISQCKL IQP * WKI I W 
RFIR/NLKIELLYVPATPPVGV* LKELK 
SAC*RDNCTT/M/FTAALFCIGYGSNLG 
NSTALIGSRVLFP 


12772 


26673 


A 


12901 


235 


183 


I I NYGGG P PEKKKKGGKK I PLI FYKKIK 
KFLWPKKKKLFGGAKNPIV*AHHIFTV* 
I DVDTRAD FTSATI \IIAI PTGVKVFS * 
LATLHGSNMK* S AAVL * ALGF I 


12773 


26674 


A 


12902 


180 


19 


LFIYLFIYLFICEIRCHFVTQAGLQW\L 
I IAS CS FALLGSSDPPTSAS * VKGI 


12774 


26675 


A 


12903 


406 


949 


APVPGHGFCVSF/ CGENGVSLYCPGRS* 
TPGLKLCSCL\CLPESWDYR/R*APVPG 
\LGFVFHLTL+KPPFFKDYMKSLFQFLK 
YTjIQG*CTLV*GVRYRSSFIFFFFFLRR 
SLA/SVTQAGVQWRDLGSLQSPPPGFTP 
FSRLSLLSSWDYRRPPPCPANFAFVFLV 
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E=Glutamic Acid, ^Phenylalanine, 
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A— un Known, "^otop coaon, /=pos5iuie 
nucleotide deletion, \=possible 
nucleotide insertion 




• 










EMGF \ TVLTRMVW IS* PCD P PAS AS QS A 
GITGVSHRARPKFVLYFKDNGEPLGGVL 
HRSHXAVHWM 


12775 


26676 


A 


12904 


265 


385 


GNGSPSI^WIGVLiRGKb.fc' 1 JjJSJjKn^ir J. r 
LTAAS RRSLDRVS VPMWGTFLSBPLS I E 
GLVSRyiiTNNIirlEKlFAljxKW/ frJNNJSl 
M/PBSLCYRVLI FlfSKGYPRVKGRLDTC 
YSPVRRS PAKKAS FLPDAPRLACVKPVA 

o VHlr D^boWK X Kvj 


12776 


26677 


A 


12905 


363 


137 


APGMQA*WCM/HCIPGAQEAEAGEWTEP 
RSSRLWCTVIMPMSSHCTPAWATW 


12777 


26678 


A 


12906 


315 


565 


PASASRVAGTTGVCHHSQLIFLFFVGTR 
SNFYAQAGIEIiLGX *VILLPQSPKVLGIi 


12778 


26679 


A 


12907 


376 


412 


DAW k 1 WRN Li* c\3 V x VPTlUK w JL if XyiYLU X X 

LTA\ PLKYFKSGVMAHSCN\ PSTLGGQG 


12779 


26680 


A 


12908 


215 


2 


NIHINGQKLFIi/ WLGWAHTYNISTLGG 
IALPLPLPLPLPTVR 


12780 


26681 


A 


12909 


307 


379 


KR*KQPQYPLTEEWINKMRHV\LQYYSI 

T If If tfPTT. 


12781 


26682 


A 


12910 


279 


123 


KMILSWARWLMS IIPAAWEAEAGGSVEL 
RS* \LRLE * AMTVPLRHFTPAWARE 


12782 


26683 


A 


12911 


1 


194 


J- r I" tf\Klir.l.J / vjU r IVi i. V IhnwU 
ELLAS S HP PASAS * VARMTECVHRAWLE 
WIFYLCVIL 


12783 


26684 


A 


12912 


206 


388 


ETGSCSVT\RLECS / GTITAHCSLNLPD 
LSYSPTAAS*VAW 


12784 


26685 


A 


12913 


238 


426 


TP^PSQQWIiKKESIiWLGRVVHACNPST 
LKG*GGRDS\RGQ*FETSLGNIARPCLI 
REKFLKI 


12785 


26686 


A 


12914 


115 


1436 


AKJJKrlJvl V 1AW rTIVrlv-o 1 olAjrfTLKJSri 

QSKIYRYHLSE * /R* LFFFNGGNCWQG/ 
CKK/IQDLIHCWECKMVQPLWNIIW\RF 
L*NHTCICTMGTJ^T^IYPREMKTCVH 

TITVf TPMT? T a fiT.T?UMVl?TWR O P T VP^T JC1 

G 


12786 


26687 


A 


12915 


282 


21 


IKFPPCPSTWGTKKKLFPKKKKEVMKML 
CFHFNliLVQNIF*YRlJGNEISDKAG\PV 

a HIVUMP ^TJT * OP T . TT* nOF FDTQT AT 
Alli-lNINir 0I>JJLA7O UJEiilll UV" 1 L'iOi-irii 

IVKL 


12787 


26688 


A 


12916 


254 




1 UrUXvOijUr Oi-ltrE* V / X JULCVXVLl 1 1 X t/ F1LUV w 

AHSCNPSTLGGKGGWIT*GPPHMLKVSF 
LPTDPTSRVKQLYNWVPHSTVLIALFPT 


12788 


26689 


A 


12917 


181 


24 


AQWClX5IPVSKGI^/>«^VAHAC3«iST 
LGGPGRW IT* GQE FE TSIiANE FHRTD 


12789 


26690 


A 


12918 


75 


298 


NPNVHLIQ / KHPHRHSQNRVW PNVWAPH 
G\QSSCHIKMTIMARPSGSQL 


12790 


26691 


7V 

x\ 


1 OQ1 Q 




o 

y 


KFMFLARCGGSRL* SQLSGRPRRADHLR 
PGVQNQPGQHSESQYLPKSTNFCRVWWC 
TPWPATREAEA\GESTEPGRQRLQ 


12791 


26692 


A 


12920 


198 


211 


rfcfs vgderegkra/rernreterkkq 
tedtetqreerdrdrkr\eretdterer 

PREERHTEREKK* KAAFPFQTLGCRAQD 
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N=Asparagine, P=Proline, 
Q=Glutamine, R=Argimne, S=Sertne, 
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X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














FPKQLFPSRGWGVGPMTVLL 


12792 


26693 


A 


12921 


438 


127 


RVVMERIFVRPHIKDSMWWVRSTLLGRG 
DALNDLFLYCWEKEDPRYKERP\RERER 
ERERERERDRCWEK*APGWRERPVQSIiR 
ACHPPWHRFALAGPVPSAQ 


12793 


26694 


A 


12922 


158 | 


460 


EARVELLLSSRCKPRAVITAETQPAVYR 
LQLEIENFPGLSPRLECSGAITAHCSLD 
LLG* SDPPNSASR\AGTTGMCHAQIVNS 
CKADWLVLLSSRGLTHR 


12794 


26695 


A 


12923 


33 


484 


GIQAACIPGLRLSPPADAHCHRHIPSPP 
PSSTAPRDDVGPMQITPDNFPPQDPPL\ 
TPPSQSLLPHRAP * PQGSS VDLPGLLFK 
LPQMPQTPGPEEVDTCVQIHKTSKARED 
VQ^KTPMAREGRHRAAHRWIGAIiGKAP 
QSQRRLRHTVG 


12795 


26696 


A 


12924 


335 


332 


RIKVYVRGHAKE*LRPGP/VAHTCNCHT 
LGGQGMRITQAQEFETSLGDIGRPHLYK 
KKKKKGGPF 


12796 


26697 


A 


12925 


302 


407 


AQWLMAI I PAPWEAEAGGLLEPRS * \ AW 
VTW 


12797 


26698 


A 


12926 


766 


244 


RVPRLPPPAL*LEFSPSGLRLARSHSFQ 
LRSSPPPVSPPST/PFSPHRSPPLSLLS 
LSVPS /PPLHHSFFSQRAPHILPPPPLI 
FPSSPPPAPPPPPPSSSSPSSSSSPPPP 
/PPINWGTPRFFPPPPFKNPPPEINFGG 
PKKKKIFSPPRA*KFVFLKGPPPFFFFF 
FFFFLGINLL 


12798 


26699 


A 


12927 


127 


720 


WCGLLESTELQLQSLPGLYVSVIFGKIF 
YISGLGICCTKIFPSALPSAQPAASLPL 
PARSALGIVFFLHFC* IE/CNYKKLPIF 
HHI * L I KTFS YGLT * LFFFFFKTKSLI F 
SPRLNCRGPFLVN*NLCLRG*GNSPA*P 
FGGAG I EGAPHKAGLI FGGGGGAFLKKT 
G\LHHVAPGGASNSGTGNISHPPLNPPK 
GAGE 


12799 


26700 


A 


12928 


308 


49 


KEHW * AVGHFNI CVTGGP *GERKKGTBR 
VFKEI ITPNFPNLIQTIKPKTQETQHTP 
SRRNTKI /TPRYI 1 1 KFLKTSNKEKILT 
TAR 


12800 


26701 


A 


12929 


375 


2 


GCMLSLKTINILEVIiANVLAQKKIKRRK 
LND\ EQADTTFTI * FCVRQKTKKSTDKL 
VKJjI / ERFYKVAT YMINFItKKSNS FHTP 
ATINC*KQI PFIMATKTIKYLRINPRKK 
CVRFVGKKIjTFPER 


12801 


26702 


A 


12930 


202 


379 


SSEOTSGKTVNIiCCLGRLHRMISWLHTV 
AHACNPIT\GWIT*GQEFETSIjADMVKP 
CLY 


12802 


26703 


A 


12931 


209 


376 


TVGE KMYLKVKRI \ WLG WAHASNPNTL 
GGQGG WI T * PQE FETS LGNMAKPHLYKK 


12803 


26704 


A 


12932 


280 


31 


ISCKVTKKVTLFELKYIWNKFYIYERVS 
IVGTTEDAACEVTSNVITGPGAVAHACN 
PST/RGRWIT*GRELKTSPANIAKPRLY 




26705 


7V 
*\ 




1 


1 OA 

189 


yMx lJSdJSF w lJKJLiVlI iAjUiWVWrlb 1 / 

IQCTTQAGNVNIRE /RLCMCSGGE YMRN 
LCAFLSILL 


12805 


26706 


A 


12934 


108 


409 


HHSHLLKLPNVWFSFCHFIVTVFLSL*R 
VFFVFY* IFLSSSHFLFSFCFCPIMFIM 
FPFKFILSCNIVFI*NIIFFLF/VHV*G 
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nucleotide deletion, \=possible 
nucleotide insertion 














* STIiVSIiFVFFFFIiLQF 


12806 


26707 


A 


12935 


225 


11 


I LLC34EPS FLFWFGF ICFVKTGSHYIAQ 
VEVLWLFSGMI KVHCSL/KLLHSSNPPT 
SAS * VAGNTGSSYCTW 


12807 


26708 


A 


12936 


74 


12 


GTTHASAGVIN I LVF I hSVF FLTHSLC I 
HISKYMFVYICVCICTRTDFYTEFKSFC 
TFCFFGSSFFFLEKKFPLVAGLEEQGKD 
LG*LKPPPPRIiKKFSCIjTLPG/AWE*RT 
NG 


12808 


26709 


A 


12937 


166 


311 


phs*wecqmvrapaiienwqfltkfirgl 
pl\daalplhm/chrksctqlfi ialli 
vakkcnhhknl*anewinkm*yihttdy 
plaikqne ilihattwmnvvtii 


12809 


26710 


A 


12938 


51 


296 


LCMYTLHTLYMHTC I YAYIC YTCKYIHR 
Y/LHT* I PEDANS VKHKYTHTHI YVHTL 
VYPHTTYIiMCSYIYSWVCVCVCVYIYT 


12810 


26711 


A 


12939 


426 


28 


KGFNLVNPRVFPS PRF* TPAPVFYFCGP 
IKKIFFFKAGGVKFDSLKRAPLFFFFLK 
TGSCFVP \RVECNGI ITAHCNLCLPRSN 
DPPPSDLQVTGVKP PP\AKLFFFFLW*M 
GFHHILDYQGETCLSQLU3WR 


12811 


26712 


A 


12940 


11 


213 


ATAPGLFS FFETGFCSVDQAGVQ* HDHG 
SLQP*PP\GSSDPPALASQSARITGVSH 
CKGPEHTSLLT 


12812 


26713 


A 


12941 


416 


38 


WLNNHSRLGFPNCWDYICKLPRIiGPDVA 
SUjIHNSS\GTWYGATKLESNWYFSIPI 
YKDHQK* FAFTWQGQQYTFTVLPQDYVS 
SAL*HNTVHRDLDHLDNPQNITLVHYSD 
DTMLLDLMSRKYQAL 


12813 


26714 


A 


12942 


361 


74 


I IHPSFSVSSRNIHKNAYPAALGGYSVY 
GVAILLFHYFINKLAFTLHCGLA/LNSF 
LR* IQEPPLG\SGSGPFSCNVFLAPTEG 
TIVQKPDPMATFG 


12814 


26715 


A 


12943 


125 


401 


SSFEPRGIPLNPBVAMQKESVNILCSPR 
SQE FLESRI KIKVIiTDLTQDEL*GQAQ * 
LTPI I PS \ I WEAKAGGFLEPRSLRLRSC 
HCTPAWVTE 


12815 


26716 


A 


12944 


92 


306 


KRTTNSPWGCSAY/GVAILLFIi* IL/KL 
LAFTIiIiHGLPSNSFLRKIQEPSLG\SGS 
GP I FGNKLVAFGGLM I ws 


12816 


26717 


A 


12945 


322 


24 


NFKKNINLNPFLRGYFLFLKGGFGLTKK 
KVGEPLKKKKKKLNQQFSYFLSTYTVK* 
NN*/WLGAVARTYNPSTIiGGRGGWIT*G 
QEFGTSIANMVKKIIiAK 


12817 


26718 


A 


12946 


2 


448 


GGAPMKRHGSLDRNYRVUriKPSQLFIiF 
QLYHNYLKFAYEATKEYMHRKETLCLPC 
PAIQFS^PKSNLCYRGYLCMRTCAHTQM 
*MYRYIYTYVYMHIYICM/YRAICTYYT 
FE * KY I LQKCILLC I FIjFFI^2 YLKYVS Y 
LNRHHFLIAA 


12818 


26719 


A 


12947 


155 


415 


LLETGGNAGCSSWMHVLLGKIHRYGMGS 
RP I SLMAG * \GPPWliRRVAGLRGRPATL 
nT.PVfiPDQYGROOWX EYCTMGASLMORR 
RVRD 


12819 


26720 


A 


12948 


247 


250 


KYVYKY I H I VYKCVY I FTHVYG YTYL YT 
DVYKYTYLYTDV* I CICI YKCI YLYWYI 
YIHLCINMYI YIHLCINMYIHIHV* I YI 
HIYI\HVYIFIYIFLYTVYK*IHIYEVC 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
NKAsparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
I— mreonine, v=valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














EYTYTIiWKL 


12820 


26721 


A 


12949 


25 


407 


EALSLIiLLLVLWGLLLGIARKMYQQMNL 
VQYCLLVL* P*NL *EQKRFERE VGLDHR 
EP*FHVLGCIHSKKSFRFWSW*KL*IRK 
LTLVA\GGVAHTYNPNTLGHRGRRIT*G 
QEFETSLGNWRPLFL 


12821 


26722 


A 


12950 


370 


106 


ILYLETLLKSFISIRSFSAVDSHFEKSS 
TVGKMLPNSITCTDP I \ FNKKKSQLMWQ 
SSLLPFSQKLPQS PQPVSQQPSPSRQGS 
I PATR* QLA*GSVQVMLFGS ILPTVELF 
SKWESTAEKLLILINDFSKVSRYKIYV 


12822 


26723 


A 


12951 


280 


319 


DQPGQHGKTIiF*VSH* KKKKGWM/ PGVY 
AYNPRFLGGRGGW IT*GQGFETSLANMG 
KPCFY 


12823 


26724 


A 


12952 


337 


113 


IQLAISLLGICPRDLNTVTCS \SDTCTQ 
MF1*SVLLMVAKR*KQPKYLQIDEWINKM 
QCIHMESYLVSKGRKYLCKL 


12824 


26725 


A 


12953 


9 


186 


DPTVCCLQDTHFTYKDTNKLKVMGWTKY 
/CHANGNQKRAGWIQIEYP * SKSLKSE 
IVQN 


12825 


26726 


A 


12954 


46 


381 


NENTYSYKKNTQMFIAAIjFVVTPNWKQP 
TCPS/SGEWINKLL\YHLMVYYSAVKTN 
/ ILVYVATS VNLR 1 1 ILGKQPDKKRVLT 
V* FHLYRKYKLISSDRKQIHGRLEVENK 
GK 


12826 


26727 


A 


12955 


19 


250 


CS DMVLKAC I KKIiMSYS KMGH I QAMDYY 
TAL/NKE h/ LMYAT I *ANLEDIMLS KIR 
QTQKDNCLMVDCIYIRYAEQSSL 


12827 


26728 


A 


12956 


173 


163 


GAKHSASGTGWIRKDFTRPGAVAHACNP 
STLGG*GGWLSRSEFETSLTNMVKL/CL 
YXSICCTRCkLGCFPQALSKAID 


12828 


26729 


A 


12957 


301 


79 


EKDNQPLLTPRNCNVIjHAIKESENHRIT 
ELKVTLRPDAVAHTCNPSTLGGRSG/WI 
A*GQEFE ISIAKKVKPCLY 


12829 


26730 


A 


12958 


314 


126 


KHHHFKKHNFRVLCVCTCVCVCVCYKHF 
YLFSLS FS Q * P/C I CC* CVCCS FLLCCF 
♦SATYLKII 


12830 


26731 


A 


12959 


529 


1697 


vpfmghisnfffspflrqkylallprlg 
vqwq*srahcnlrfpg\ssnshasasrv 
agitgachhgwllfvflvetwfhhvg\q 
AGLKTP* pqv\sacfslpkccdrrcepl 

CPATSS ILNL* TMDRL/ PGEPKNQ *NLL 
VSFMCAVFYREGQQHVYFQRSLWCKREF 
I VRSPWAINLEKGGLRMGAV* I RIMLNL 
TLT*GFCTEDEKTVNSRVNQKLTPT*LG 
DE*DVKREKNINDEIKLLTCNSEYTKSY 
R LGR * I KLG AF CF FM \ VPMGQTGRNGQ L 
KVQV*TGTVAHVCNPSTLGG*DRRITRV 
QE FETNLGDHSESSSLQKI KLARHGAAR 
L*SLSL*VSWDYRCSPPWP\SNFCIFK\ 
RDGVSPYWPKA/WS * TPDLQRSAHFSLP 
KVLG 








nnrn 


l 




PPGPKKQTL/ C*KQTTPTKPQKTNLGPK 
KFGGQPKKPFKISHPQIGPPKKKNYNP* 
GQPVWNPPGEKKEGPTPPKFKDPGANLK 
VMGLKGGVKRKRGGKTFPQRA 


12832 


26733 


A 


12961 


90 


436 


LWRLTSPKSAGWAGNLETQEESMLQFKS 
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Amino acid sequence (A-AIanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, t=IsoIeucine, 
K=Lysine, L^Leucine, M=Methionine, 
N^Asparagine, P=ProIine, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, ^possible 
nucleotide deletion, \=possibIe 
nucleotide insertion 














EGHR / PAES PLA* SS VLFS LS TDWMRP I 
LI / MEGDLLYS KP IHFNVKX S S KNTYKN 
IWNNVWPHV*TL*PSQSD/CLK*TITPI 
VLLTCTH 


12833 


26734 


A 


12962 


415 


463 


lrakdqvlpkilvc*tllffcaqikkkk 
k/wpgavahaynpstlegqggwit *gqe 

FKTSLGNMVKPSGIPPHWI 


12834 


26735 


A 


12963 


774 


316 


S I S PTCSGKIGGKLNRHFS KEDKTG \RY 
MNKCFISLVISECTLKP*EDTH*TP\LR 
MAKIKRADH/DKC*QRYKMTGTLIQC*R 
ERKMVQPLWKTV*QFLKRLNIHLPDDS I 
PLLG\IY*RKMKACVHTNTIIWMPIASL 
LVKAK\AKKQPKCPSTR 


12835 


26736 


A 


12964 


44 


395 


MYFTLVWGSLGPKFLGGHQPLQGQGPFL 
HFLKAL**FPFPQTPGNGGGTPPAKTFP 
PFPTPLSSSSPCPSPAEGGSREPSLSTP 
/SCLHLYIGGAASNPSPPSPLPPPQCSG 
LGYPVC 


12836 


26737 


A 


12965 


424 


50 


MAFPFLSCAPDAINNNNKKCCREINNEE 
PPNEPIiKWRHIMIFFFEAESHSFAR\LE 
CSDAI S AHCKLCL PGSRHS PASAYRCPP 
RHIiANFSAFLVQTGPPRVSQDGKDLQTS 
*SALLMPPLDPAIP 


12837 


26738 


A 


12966 


245 


44 


QS KELGNYPLPSPSPSSPSPAFHGiaPPP 
SSPS PAFHGLPLLP\S WTVQPQ* RLTAT 
SLPDPPASPRPG 


12838 


26739 


A 


12967 


205 


30 


QEGASLSRIKRGPGVGAWLGMVAHACNP 
STLGGQG/SWIT*AQEFETSLGNPHLYQ 
KYKN 


12839 


26740 


A 


12968 


114 


465 


DPVSNPETTKPEPPKKTTES*P/EPKSL 
LWLPS\P*NSS*CANREQQPPSPQPHLP 
HPLPTSLQVHRLP * *SLPPPNLFTPLYD 
/SPBPSLLPVIQFPA*TSPP*PPVYSSP 
PSPTCPIiH 


12840 


26741 


A 


12969 


1618 


1038 


VHMVQDKDINLNNQFLSGTMLLFFKETS 
HR*DRGPGDF * S CLYS API PTQPLLF* Y 
LPQNFIEG*PS*NSRIKGTSLKTYISRV 
PS FFFS FL * DGVS \ TVPQAGVQWHDLGS 
LQPPP PGFKRFS CL»SLiLjo b WD X KiiPPLiC 
PANFLDF * * RRGFTMLARLVSNS * PCDL 
PASASQIAGITRMSHHTLPHVFPLNV 


12841 


26742 


A 


12970 


197 


411 


EQALRSTVCYCLNPLP PAPLRPRQDC \G 
PCRFRWMPGGRTRWLAPVIPTL*EAEAQ 
VWLEARS LRQAWATW 


12842 


26743 


A 


12971 


616 


777 


MGIHGVRHNAQLNFVFFVEDGGEWQLTC 
TGAGDSSWVLAaEP* VFAC/ FKJM VJjrx Litl 
P*IiFG* I * IYQLKITF*NYEGMVP/FVF 
RFLFETRSLLSPQLECSGTITTHCSLEL 

VFFVEDGGEWQLTCTGAGDSSWVLGEPL 
RYYKBRLIKWYL 


12843 


26744 


A 


12972 


222 


389 


VITEALGSDELEGRQLWW* IKIHSQVQW 
L I PVI PAL / LEAKAGGLLEAKTS RPAWE 
T 


12844 


26745 


A 


12973 


425 


145 


QIIFFLFQKIKNQFNKASFFKKMTLEQM 
TSRQKNSAQISVFKKKNSKWITDLNVTH 
*TIQLTGKN\NIGKNLQDQGLGKE ILNL 
TLKAQSTKG 
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E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, R=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
vr=Lpiutaraine, K^Arginine, cF^&erine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


12845 


26746 


A 


12974 


322 


411 


NNKKIFMSTS*KFFYCFIVQLIFETILS 
VRVLESIILINYL*yCIICIFPFLFL*T 
* EI RKVTKFTS FPQIEEK* KCRR/ WLGT 
VAHTYNPSTLGG*GGRIA*AHEFETSLG 
NTGRLPSVQKIL 


12846 


26747 


A 


12975 


428 


1 


SRKSSSQFRLSFFPSHFLDLGAKGAVSQ 
DRAIALQPPGPRSR*S*RPPSRPRCFRR 
PRPSRSRSNKERA3EKRRPRRSLQPRPP 
HIAGP / VPRRQGS PS AGPEAPLATDEP P 
PQRRS G I RGGS ANI RS KQTS S LRPRVRG 
RVG 


12847 


26748 


A 


12976 


60 


294 


NHCTETVLFQ* LTCVCVCVCVCVCVCVT 
ERI/CFFKFLIGKGTYGCIYLWPKTFIL 
I F I YIIi x SPRGRS I LKKKFFFLT 


12848 


26749 


A 


12977 


6 


341 


DSLTLLPGLECSGAISAHCSLCLSG/FK 
QFSCFSLPSSWGFTG/VPPRMPQLIFPY 
FSVKTGFH\HVG*AGLKLLI PPASASPK 
eWDYRREPPRPGDLWNF* PTGFKLRFPQ 
FPL 


12849 


26750 


A 


12978 


296 


42 


SQPASLGFKSVDLPLIiAVCPLGKITLTL 
WDFS FI I YIKQE * WPRAGIFMPY\NPST 
LGGRGGR IS * AQEFKTSLGNI VKPHYIiV 
K 


12850 


26751 


A 


12979 


20 


373 


KLYGGIWGFFFFRRGLNSATQAEMQGHN 
LG*LKSPPPG\*RNFPASASGEPGITGS 
HHKAGIVFYIKKKPGLKTFTLGKPTPPP 
S FRHWHQPAPMGVYSEI LGE KS LTNGWD 
PQTKKG 


12851 


26752 


A 


12980 


380 


96 


PVFPPPWRAQKGGSIGREIRPPRGNQGK 
PGPP*KKKKNYPGGGGGPPSPPFPEG\* 
PKKWHNPGGGTPP*TEFPPWPSTRGGEK 
KPPSKKKKKEK 


12852 


26753 


A 


12981 


331 


71 


ELLIYSWVWWCTPWPATW*GRELVGRI 
T \KSRRjjRIjQ * AKI VPIjHSSIjGDRvRPC 
PPPPTKKKLANLCPWTEGSLSFSYLLEQ 

COOK* 


12853 


26754 


A 


12982 


383 


189 


RRALKSCAHSTPAGPKNAALSQAQWLTP 
VTPTL*EAEAGGSLELRSPRI*L*AMIBP 

\\— X rxin VXD 


12854 


26755 


A 


12983 


144 


6 


ILAQ* KTWLSWAHTL*S /TLGGQDGRI 
TGAQE FKTSLGNTRRPCLYK 


12855 


26756 


A 


12984 


341 


509 


ETL YTME * YTAI E KNE IMS FAGTWLELE 

-H. \ V X JU \oXNl_iV ^ISy IV. 1 rU v Hjf t».£j ir larllj Willi 

K 


12856 


26757 


A 


12985 


387 


125 


DLPPTPPKLEPRTCSVAQVEVWCIi/GSL 

*EMFPF*VYFVQTSEHILGINPPGRHTK 
VAWBV 




ZO J DO 


a 




101 




LrKvoOr w Vi-it** 1 FPPGQKKFG.r TP P KAP P 
PLGFFFPG/RFFFFLRRSLALSPRKWRD 
LGPLQPPPPPPTPGY 


12858 


26759 


A 


12987 


251 


1 


RMSLLEKRRSNN* EARV/ CGRVI YVATE 
SAQBFETSIjGNMTNPCLYKKYTHKKSRA 


12859 


26760 


A 


12988 


320 


3 


TPWGVKDFSPPPPPGEGVFPGRAPPPR* 
KFFGNFWKKRGFPNWVRGVLKFWG\KNF 
FPPNLPKSLGLRFKPPGPP*KAPPFFPL 
FFFFFEMEFRLCCPGWSAVAQS 
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C=Cysteine, D=Aspartic Acid, 
E=€lutamic Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
KHLysine, L=Leucine, MNSfethionine, 
N-Asparagine, P=Proline, 
Q=Gltitamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, Y=possible 
nucleotide insertion 


12860 


26761 


A 


12989 


353 


183 


E YQTHILTAAIFS IAKRWAHR* GSLTDE 
WVRKNS S LQSMEYY \ SAI KKNEVUjHAA 
P 


12861 


26762 


A 


12990 


377 


2 

- 


LSIILINFPRFSLQRGKYPGPPPPKKNP 
PQKTPPRAPKKKKPFFFFFFFSFFKKNF 
PFFFKGQSKGGFFGPP* PLIFRFK\NPP 
PPPPQKIGFKAWVFSTPPPPIWGFFFFF 
FLRWSLATiTiPRLEC 


12862 


26763 


A 


12991 


134 


298 


PPNSYCDGS ITLI PKSDKNITRKEKYL/ 
P/IYLMTIDAKVFNKVLPNRIQQ*IKRI 
I 


12863 


26764 


A 


12992 


892 


171 


QT *SH/ S LSRLEYSDT I IAHCSLKLMGS 
SDPPASASQVAKPTGHCTTMPQPHTEGF 


12864 


26765 


A 


12993 


269 


2 


GLWCFVKQLKLTETCPHFAVPQFPLGIS 
KIPISILFFKKLLL/WPGTVAHTCNPST 
LGGRGGWTT +GQE FGLMFHHWPS KFHHG 
QHGET 


12865 


26766 


A 


12994 


324 


4 


SLYPLLFQAEIYFLVTIHRSSMESCIYF 
FSVWPLDSLVQIHVKI /HIKGIGML*V 
N* QNCFKINISE*L* YI WPSTVADTI/3C 
QGEITRAQGFKTSLSNMTKPHLY 


12866 


26767 


A 


12995 


106 


531 


WLLNLFAFLNDKGFLVPMNRSRIFSQQK 
VKQRS FEGCS S FNDVGSCYVTQAGEQWL 
FTGVIGTL+ PGTPG\SSDPPASASQAAG 
TTGS *HHIQLNIiILYSSG*QTI * LL*TT 
DSTHLI FASLYAKEQIiITFYFQATSLQP 
CT 


12867 


26768 


A 


12996 


1 


367 


NTEPFLNI FFKDKVS / CS I AQAGGQWRL 
YSRCSLKLLGSSNSPTSASLVAGSKGRH 
HYTRL*LFFYVSRPQDYLPWSPVFFFSF 
FLSFFFWKGSFFLPPSWKARAPIWVNGS 
LPPRVKPIFLA 


12868 


26769 


A 


12997 


3 


379 


YRPSPSETKAVLVFFS FFS FFFLGTPGG 
KGPPKGAPNPTLTWRGKENFF I * KKKKP 
/RLI * KKGPPPKMGFKRERFWVLKPKGP 
QFGKGEKG/SPNYPHPRGDPKAPSLWKK 
PQFPGGGGGGAPKKAS 


12869 


26770 


A 


12998 


230 


409 


SCRPKKLLAFIiKSSTEISQRWKQPIYI/ 
PSTDE * INKMWHIHTME YYS ALKRTKVL 
IHATT 


12870 


26771 


A 


12999 


310 


343 


DPVCTII*IICDIINL*IECSKVW*SLKF 
YKI LILFNVCVWPSVGNCHKWLCDCW* 
KQW/LWPGMVAHTCNPSTVRGGGGRIV* 
GQEFETSLGNIVRPSLYNNKSCI 


12871 


26772 


A 


13000 


390 


289 


IKLCQPRGVKLDSFKRAPPFFFFFETRS 
CFVS * AGVQWCN/LGSLQPQTSGPNNPA 
TSA*LSLIFMCSNSYVCIVK 


12872 


26773 


A 


13001 


1 


289 


GILRQISVNYLPVGRSLAQAGVQRCDLG 
SL* PLPPKVKQFSCVAGTTRGCHHAWLI 
FI FSREG\GHPVKGASSDSPTFGFSKAG 
ITRLSPRALACT 


12873 


26774 


A 


13002 


359 


105 


KQKKNPSPPPGGGGGQGKPG/PPKPFPN 
£>KJJr irir IsJNyor'r r ±r JS-Lir\J\JVr irlNr I 
NP* KPNPPLKKGKKPKGFFSPPPPLKKP 
PNV 


12874 


26775 


A 


13003 


243 


293 


MNSQFLKRL\RIESPYGSVrSLLGTYST 
EK* KACPCKTCTQMFTSALYI IAKKW\N 
KCPSTVRW/ IKLWHINTOEFFCQ* KKFH 
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N=Asparagine, P=Proline, 
Q=Glutarnuie, R=*Arginine, S=Serine, 
T=Threonine, V=Valme, 
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X=Un known, *=Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 














CIDMPQFVHLTVEGHIjSPLFCYYI 


12875 


26776 


A 


13004 


465 


307 


SEACFI LKLFMFCG* CCLFVC \ S ILIiVF 
YEVFCFVLFCFAYYMQFFNQCLFG 


12876 


26777 


A 


13005 


207 


3 


IX5SLTMLPKLBC/RWLFTGVIMAHCHLE 
/ RLGSND P PAS AS RVAGTTGVCHHA* L I 
FIFFVKMRSHYPTRP 


12877 


26778 


A 


13006 


2 


56 


DIALLFRL*CNGMITAHCNIjK/RPG*LL 
GSRDPPTMVS * STGITGMSHEAHRPS PS 
CDF*RSCSWQAW1QWHDYSSL 


12878 


26779 


A 


13007 


402 


74 


FGASLLKRHFLFFLSKPLVYNKKIPRPL 
VFLKKT\PPPPGRATPFYYCCFFHKEGP 
GFGVKKEG*PKG/ SPPPPQRGSTPPFFL 
GAFKKP KVTTPFFKKNPPKTLKGPPFF 


12879 


26780 


A 


13008 


356 


145 


RPPPFFFFFETVSHPVAHAGVQWHDHSS 
IiQL* TP\ SSREPPASAS *VAGTTGVHYL 
PGSQLPILEHSPRR 


12880 


26781 


A 


13009 


227 


353 


GWSELT* / SN* AQWLTPVILAL* EAEAG 
GSSEPRSLRSAWATW 


12881 


26782 


A 


13010 


249 


32 


NPDSHSWAWWCASWPGTQKARVGGSLE 
SRSLRLR*AMIVP\CTPAWVAQEPLSLK 
KPPTIETLFPSPQDPSQ 


12882 


26783 


A 


13011 


247 


403 


TPGGGGKYFFFWGPQKKNRGGGFKNGGG 
GKTRGPP*KQPASSSPFAPPPFFWGAPG 
FSPPPRF\KPPPPVFFLGPPKKKIFPPP 
PGGLFFFFLGGPPSPFFFFFFFIiGGGPL 
FFPGLGAMGPMAGFRSLAPPGNSLSKKK 
KSEGLGEGGNSVLTRVLLISSYQIPGNP 
R 


12883 


26784 


A 


13012 


206 


386 


TREAQTFWFPLYS * KAL / WLGTVAHT * N 
PSTLGGQGRRITSSQESETSLANMVKPH 
LLLEI 


12884 


26785 


A 


13013 


354 


31 


YKMNIWDIIHI*HT\YIPWPKGKERKR 
KKLEBYL/QDIMTKNFLNLMKNMNtYFQ 
EVQQTPSRINSQTSTPRHI I I KLSKYKT 
GREQWILSVMLALWEAEWGP PEVRS 


12885 


26786 


A 


13014 


1 


419 


EETSFGKARCLPGYPGFHPRRLRSRSPL 
LS*MPLSHPQGPSQPPAGSMSSSPPATT 
S TCSCS P PAAGCTRGRTTGRP FWTASG P 
AELDWASGSSV/EALTSGIiPARGLWGQY 
AGLGVPPRLLGPMTSCCMSLPFPSPARP 


12886 


26787 


A 


13015 


109 


426 


TSLGNIARPH*QTHMATIRS VP PPS WKD 
ST*LRLHPPPPHMCF/PD*SLKSN*Y*H 
YLVNP\IWPMLTLLFVCLFVCFRDRVSL 
CCPGWSAWPSRLTAASTLWVQVILPP\ 
RLAN* IFFLRWGLAMLPRLV*NSWAGDI 
IiPP*PPKVLGIjQA 


12887 


26788 


A 


13016 


149 


422 


LKRI FFLK. WFTHDTAPGDPDNMCP YSC 
F*NQI IKSGLFFFKKPHL/WLGEGAYPC 
NPTLLGGQGGRIT+ FQEFETTPANMAKP 
LLYKKYKN 


12888 


26789 


A 


13018 


162 


377 


QS PGYPERVS KLPT ITQRGMWMHQPSFV 
CTSL*NV*TGWAQWLTPVIPAF*ED\AL 


12889 


26790 


A 


13019 


205 


427 


IQTGGPPARPFLWPWAPGKK*RVPPEGG 
PPKKKKKTPPPHPGGEREGS/PFSKKKK 
KKTPKKRAGVFFPGGPRPKEGPGWGPPR 
LDS KPVGAPPEPGRLGNGPGKI FFGPGP 
FFPLQPQKGRP*PGGEGGSFFFFLGVPP 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K^Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Vr=(jrlutamine, K— Arginine, o-oenne, 
i— inreonine, v— valine, 
W^Tryptophan, Y=Tyrosine, 
A— unknown, = otop coaon, /^possioie 
nucleotide deletion, \=possible 
nucleotide insertion 
















12890 


26791 


A 


13020 


248 


358 


SRCGGEAQWLLPV/I/ PAF*EAEAGGSF 
EPKSLRPAWAT 


12891 


26792 


A 


13021 


2 


434 


ITKAGSiJrQEXHU J. u x I1UA. v x WKr srl iv 
KAQVKKKVWWHM* PQLLGRLEMGGSLEP 
RSIiRLQGAMTLPVTSTT\QAWAT 


12892 


26793 


A 


13022 


162 


410 


IiTQIvX AAiby± llib r oo 1 otiKlv J. riooISo 
GGGGSVSVTQAAEQWRYHSLIiQP*TPGD 
KRSSHLGIPHP\SSWDYRISKDWVMPV 


12893 


26794 


A 


13023 


250 


40 


LIKLNIFSCSLAIWTSSCMTSLFKYFAH 
FSFF* IDFxSSYYIIjDTNrr cRa/vsl xJM 
ILSSSVTCSFILFW 


12894 


26795 


A 


13025 


264 


1 


VSSTKHGKLRLEPEY *CFIHH* EEKVFE 
SYFEY/PCQTFLRKETVRPGAVAYACNP 
STLGGQCRWI I *EQEFETSLTCIAAALR 
VTLR 


12895 


26796 


A 


13026 


110 


523 


CIDSSSWTQTDRCKEURbCPGTTGTCAP 
GLLFLSAPPLGPF * PSQAHPS FHVSIiSG 
PSPGLRPQLD I PLHPSTS IAPRPLPGPA 

S \liPKJ-ifa»i , HJrb JjFoJrTvjAU.fc'r Liff U-JJjU. 

QRHWPIiPTSIiLWLGUXSHCLATPSVS 


12896 


26797 


A 


13027 


52 


53 


CPTLIiQLT3 J C/LPPQASAMVX>APP*ARP 
IjPPSS IIACCTSGEQGS VWGIiAEPVKG 

tnrr rur<nrT out utrrVTUimwCDPCT.VC 
YNLLVCKIiIjKlrJ-ifci K.Uo J. W V AvoKroiilo 

LSWLPLARKG\NPLTACASWVR 


12897 


26798 


A 


13028 


262 


381 


HFGRLRQVDRLS PGVRDS LCNKVKP * Q I 
IYTF* FSVSLSVINTYLKGPSTVAHTYN 
PSTSGG*GKWI A* AQEFE/TSLCNKVKP 
CLY 


12898 


26799 


A 


13030 


1 


846 


FRPIQAGRAPWALTLPCMGFTEGSPALL 
SHSSRCDIQMGCNTHAVDDSTGEGPFHK 
GRMRESCWLRTPRCGLYTKGVGGPGMBG 
RERRSRSRRQHMPGRGHSTMASWSLRQC 
MRLKLQWASDLLSQLEMQIPSLPPGESG 
SLGLTGPRSIjH/ CEQTIGIiSPTPC»Ei3 Wb 
EELG\GGIVGGGRGTVAGRGGHREAGNK 
PSGTWGRQCCASGDKAMSI*LTSLY\AR 
RGEPGPGRPVRKWLRESSQNRIGGGEGR 

LMWS 


12899 


26800 


A 


13032 


308 


1 


KHYHDFWCWIILALSDAQSLDHTCV\CV 
CVCVCVCVCVLRDQGSHSCL* CEG*LVP 
HFVPGPVLCLLSPVIYVCFICSLTCKVE 
SLSNFFKKLNKALLRHCI 


12900 


26801 


A 


13033 


763 


244 


FLGPRIIGLRHEISVETQDHKSAVRGNN 

QSNFEEHIYGNETSSDYYNFQKPRPSEV 
PQDEDIYILPDSY*L/ GQNIDFCYWMIN 
IHCNFSTAKTRNQTKC*STVDWIKKMWY 

SDKSGS 


12901 


26802 


A 


13034 


326 


58 


EHYKCNYCQKIFQKL* /HL*SKKHVYAE 
DKYYKYKDGWPLLVSQI ILYTFYTRGK 
P*SSCSNFVQNQI I YIGQKSCKCNKPGN 
TFPKKP 


12902 


26803 


A 


13035 


199 


3 


RFIYNHKVCITTFLPHFYF*KLYTi/WLG 
TVAHAWNTSTLGG*GGRIT*GQELETGL 
DNTARPHLHTV 
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SEQID 
NO: of 
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sequence 


M 
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od 
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NO: in 
USSN 
09/515,1 
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Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino aciu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

umg lO 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, HNHistidine, I=Isoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
V£— vjiuiamine, t\— Arginine, o— oerine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, ^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


12903 


26804 


A 


13036 


94 


448 


GSCATSMAFKDTRKTAVEPGVWHGIRI 
KPLEKVCADLITGKKFKNL/KYKGPVCM 
PTKTLKITTRKTPCGESSKA\WDGFQMR 
IQK*LTNLHSPSKIVKQIISISGAPGID 
SEVTNADG 


12904 


26805 


A 


13038 


964 


1717 


HKLSCVKDLKDFMRQAGE \VTFA\DAHR 
PK\LNEGVVEFASYGDIiKNAIEKLSGK^ 

eingkninlikgsk\rpsrsrsrssdpg 
tqkspiigprsrspspwrkp*qpvqear 
srbpgarsqvpvpvk*gpppcpekepqk 
\rgslqskilsppsilwirpevsgprsk 
vprsvdsgn 


12905 


26806 


A 


13039 


2 


725 


SLLFSPVLPLFLFLPLAYILQNLKPLQF 
SPDLKSKHLI FFCNAA*PQYKLDRSSK* 
PENGTLNTFSILQDLiNNSCRKIGKWSEVP 
EVQAFF\SHQSLPSIjCRQCNS/CPNLPS 
FPP \PVPSVPTPSVTESF* SS FSTDPSD 
LSLPPQALLLARPS*VPILPQPPLLHPI 
IL3^SPPLLTPRPAYSFIP*I./CPPPPAQ 
QFTLKKVAGAKGIVKVNAPFSLSQIR*H 
LGSFSSNIKIQPSSWLVWQQP 


12906 


26807 


A 


13040 


1131 


2 


EXJVSIiLSPRLECSGTISAHCNLHIiLCLN 
DSPASAS*VAGITGPATTPPLIFVYFFF 
LVBTGFHHVGQSGLELPTD\SDEPPASD 
SQSVRITGVSH/AYLACSQF/CTRC*FP 
II*EQCVNMNINSSK*IFILISYLYRWM 
NKDPI /VIIERLI*VRMPEGQSHAAW*NK 
TNKQT* ILELVLS VTSCGVSVRFNLS IP 
*VFPSTK*EGWMRCVNPSTLT* FIPQVA 
* TCQEHGLMKIQT *NPGLD PS IVSDHQT 
Q*EPSVKQVLDISHDKLPMSLEELYHCR 

/CIFEMESCSV/ SQAG/VQWRDLSLLQA 

\CIFSGDGVSPC*PGWSQSPDLVIR 


12907 


26808 


A 


13041 


97 


626 


ICFTQVKVIiNESQVQWCMPWPATWEAG 
AGGrVLEPGS*EL* / CPVNSHCPPAWAT 


12908 


26809 


A 


13042 


2 


431 


HDPDTHSQIKVIKLMQSEFCWGWAW*LT 
PVI PALWEAE / AGGS PEVRSS 


12909 


26810 


A 


13043 


405 


1319 


RYRKKLQNKPLWLH/CLLLLPNSATR* T 
bQK+SK*ISPLESSA**KEPPFLSVLSV 
TSVKT/SAIPMPPHWENVNTQVPYQLI P 
LHNQTHEYNEVANLFGKTMDRNRIKRIQ 
RIQNLDLWEFFCRKKAQLKKKRGVPQIN 

T?nMT .TPtt^TC G T? PT71? MP TITMTmfiTD TMP T 
JSUrcjJlTJaul ooJQ IT ViiAJ-L-XxlNr UWKlWljJ. 

HGAVFGKGTYFARDAAYSSRFCKDDIKH 
GNTFQ IHGVSLQQRHLFRTYKSMFLARV 
LIGDYINGDSKYMRPPSKDGSYVNIjYDS 
CVDDTONPKIFVVFDANQIYPEYLIDFH 
* FHFQISWKEALFFFAGRFALQSSSH 


1 ocm n 
12910 


26811 


A 


13044 


23 


450 


RbKAAAIQLGQRIjRijPGPDS PRLTGRQL 
EPGTGRGSRAKRGPCSIFRFRRETPPQR 
EGGEALGATSROAPPPCPPROPWELDAS 
AKAAAVAPPKKVGFKAVSRDPRAEALGG 
KGSP\GGKGSLKGVAHR*GGLGPL*GAP 
EAI 


12911 


26812 


A - 


13045 


172 


16 


AHVSPEVDKTKFW\LGPVAHICNSSTLG 
GRGGQIT/R/ GQEFETNLTNMEKPHLY 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQID 
NO: in 
USSN 
09^15,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KHLysine, LHLeucine, M=Methionine, 
N-Asparagine, P=Proline, 
Q=G luta mine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=atop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


12912 


26813 


A 


13046 


181 


9 


YKWGPAMIiPRIiKCRAI \ IGSNMAHCRLD 
LPGSSDPPSSASQVAGTTGTCHLGARGG 
MV 


12913 


26814 


A 


13048 


2 


3507 


YVRVSLPPPPPAAGRPGAAVADDAREEE . 

EEAAPPPPPPPPPRLAAARPPGSQPRPP 

AAGEAQAAADMNHQQQQQQQKAGEQQI^ 

EPEDMEMEAGDTDDPPRITQNPVINGOT 

ALSDGHNTAEEDMEDDTSWRSEATFQFT 

VERFSRLSESVI^PPCFVRNLPWKIMVM 

PRFYPDRPHQK\SVGFFLQ\CNAESDST 

SWSCHAQAVLKIINYRDDEKSFSRRXSH 

LFFHKENDWGFSNFMAWSEVTDPEKGFI 

DDDKVTFEVFVQADAPHGVAWDSKKHTG 

YVGLKNQGATCYMNSLLQTLFFTNQLRK 

AVYMMPTEGDDS S KS VPLALQRVFYELQ 

HSDKPVGTKKLTKSFGWETLDSFMQHDV 

QELCRVLIaDNVENKMKGTCVEGTI PKLF 

RGKMVSYIQCKEVDYRSDRREDYYDIQL 

SIKGKKNIFESFVDYVAVEQLDGDNKYD 

AGEHGLQEAEKGVKPLTLPPVLHLQLMR 

FMYDPQTDQNIKINDRFEFPEQLPLDEF 

LQKTDPKDPANYILHAVLVHSGDNHGGH 

YVVYIiNPKGDGKWCKFDDDVVSRCTKEE 

AIEHNYGGHDDDLSVRHCTNAYMLVYIR 

ESKLSEVLQAVTDHDIPQQLVERLQEEK 

RI EAQKRKERQEAHL YMQVQ I VAEDQFC 

GHQGOTMYDEEKVKYTVFKVLKNSSLAE 

FVQSLSQTMGFPQDQIRLWPMQARSNGT 

KRPAMIiDNEADGNKTMIEIiSDNENPVfTI 

FLETVDPELAASGATLPKFDKDHDVMLF 

LKMYDPKTRSLNYCGHIYTPISCKIRDL 

LPVMCDRAGFIQDTSLILYEEVKPNLTE 

RIQDYDVSLDKALDELMDGDI IVFQKDD 

PENDNSEIiPTAKEYFRDL YHRVDVI FCD 

KTI PNDPGFVVTLSNRMNYFQVAKTVAQ 

RLNTDPMI^FFKSQGYRDGPGNPLRHN 

YEGTIiRDIiljQFFKPRQPKKLyYQQLKMK 

ITDFENRRSFKCIWLNSQFREEEITLYP 

DKHGCVRDLLBECKKAVELGEKASGKLR 

LLE I VSYKl IGVHQEDELLECLS PATSR 

TFRIEEIPLDQVDIDKENEMLVTV7VHFH 

KEVFGTFGI PFLLRIHQGEHFREVMKRI 

QSLLD I QE KEFE KFKFATVMMGRHQY I N 

EDE YEVNLKDFE PQPGNMSHPRPWLGLD 

HFNKAPKRSRYTYLEKAIKIHN 


12914 


26815 


A 


13049 


30 


477 


VRAEHC^VWERNFEfcTVRWTo ViCr UA 1 £> 
PEIASLSWGQMKVKGSNTTYKDCKVWPG 
GSRTWDWRETGTEHSPGVQPAAC\DVKB 
WEK.G VQT. Li V IGRGWvj EALK V Fo o i v E I 
LKKHGIDMRVLQTEQAVKEYNALVAC5GV 
RVGGVFHSTC 


12915 


26816 


A 


13050 


179 


389 


NIETIQSMFPI/DNEMKLEINLKKAIWG 

XfXTvJLAO^ V Uric nrJL rrl LtTVIUu^VVJOr 1 iPi 

PRSSRSVWETWRDPI 


12916 


26817 


A 


13052 


466 


1489 


PGKGEMRTRS PSPLAI VPRPQRASRPLL 
CAVSPMASASGATAKHEQILVLDPPIDL 
KFKGPFTDVVTTNLKIiRNPSDRKVCFKV 
KTTVPHRYO^RPKSGIIDPGSTVTVSVM 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 
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NO. in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

am inn nriri 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

Hint* to 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=A!anine 
OCysteine, D=Aspartic Acid, 
E=G)utamic Acid, F=Pheny (alanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine,R=Arginine, S=Serine, 
T=Tbreonine, V=Valine, 
W*Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, possible 
nucleotide insertion 














LQPFDYDPNEKSKHKFMV/QIFAPLNTS 
DMEAVWNEAKPHELMDSKWR/CPNENDK 
LNDME PSKAVPLNAS KQDGPTPQPHSAS 
LHDTETRKIiTEECKRLQGEMMKLSEENQ 
HLRDEGLRLRKVAYSDKPGSTSTASFRD 
VTSLLPSLLWI AAI FTGFLFFFWRRSL 
TLVAQAGVQRRDDGSLQPPSPGFKQFSH 
LSLPSSWDYRRPPPHPANFCIFSREGVS 
PCWPGWS 


12917 


26818 


A 


13053 


892 


189 


SYDPGALGCRPAPRSGRGKGSGATRDMH 
PMSRERAKFVKSGL\YCKTTRELMLHAR 
CCLNQKGT ILGLDLQNCSLEDPGPNFKQ 
AHTTVT I DLQANPLKGDLANTFRGFTQL 
QTLILPQHVNCPGGINAWNTITSYIDNQ 
I CQGQKNIiCNNTGDPEMCPKNGSCVPDG 
PGLLQCVCADGFH/GIQVYAPGIiVLTAY 
VLRDSGSHHS I RLHSAL£DPAP KSQDFM 
NYIGLTIDLRS I 


12918 


26819 


A 


13054 


423 


12 


SFNQKNPLRQKCSLRPTRPASQSLATWP 
GQSTRFWEQALPWALWDPQSKRAE \MQL 
EPPHAHTWTHAHTCGHTGA\FCSAHTEV 
YVHLHSPVHAHTHAHTHSPPVHTCGCRL 
HAHTH\PQTPSPPGLLSLLAYARPQS 


nniQ 

12919 


268ZU 


TV 




10 




RRSGKNDP WTDQS PRAAASHTRCPHPAV 
AAAAMPKRKTEGDAKGDKAKVKDESQKT 
S /ARMS S KSAAS KAYAKPI KAPAMNGEN 
VPNG I QRKIiEAFYDLQACRHLTRL IDF A 


12920 


26821 


c 


13056 


716 


594 


MMQTETGVMPPQARQCIiEPPGAGTGRKD 
SPLYPSEQSWPL* 


12921 


26822 


A 


13057 


353 


2 


IGNPGKTTFNKPPPPKKFIFSKFFMFLF 
FLVKTRS CYVAQAGLQTPG \S SDPSASA 
NQSAG I TGVSHHTQS EILKYTVFCLTFQ 
TPEYT I PSTWS I PCCSHFS CYLCFLFAC 
FLRQSF 


12922 


26823 


A 


13058 


864 


529 


EVSTS KTAGCRGCALVISMLCI I FPQGL 
FTRALLRTCSALLHADWPE ISFVWWF 
FKTESCSVAQDGVQWCDLRSLKPPPP\G 
SSDSPASASPVAKITGMHHHARHGLKS 


12923 


26824 


A 


13059 


2714 


1376 


GR YDGLVEQLGGRATPAVG FAMGLERLV 
LLVQAVNPEFKAD P WD I YLVASGADTQ 
SAAMAIAERLRDELPGVKLMTNHGGGMF 
KKQFARADKWGARVAWLGESEVANGTA 
WKDLRSALAVGVILGVGALIGWRYWNS 
HQVDSARSASLAYQNAVTAVSEGKPDSI 
PAAE KFAAENKNTYGALAS LELAQQFVD 
ICNELEKAAAQLQQGLADTSDENLKAVIN 
LRLARVQVQLKQADAALKTLD/TPIKGE 
GWAAIVADLRGEALLSKGDKQGLLSVTL 
LSGCSLFNSEEDWKMSPLPTVENQFTP 
TTAWSTSDGSGIGNFYSNLHPALADNW 
YAADRAGLVKALNADDGKE IWSVSLTE K 
DGWFSKE PALLS CRVTSWIPNNQTEKFN 
KQRTEGPAVAEPEQAGVRRHYLYPFTGG 
YRRRRAACQAERPAARCATEEHGSRN 


12924 


26825 


A 


13060 


99 


254 


RNHHLVAFKE LS ALKVKRVT) WAPWLMP V 
IPAF/LEAKVGRSLEPRSSRPAWAT 


12925 


26826 


A 


13062 


127 


386 


FSYYYYFFFLFFLCVFVIPGEPFLKMKL 
GKPKTIMLKEKKGGESYSELFFETDQME 
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SEQID 
NO: of 
nucleotide 
sequence 



SEQID 
NO: of 
peptide 
sequence 



M 

eth 

od 



SEQfl) 
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26 



Predicted . 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 



Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 



Amino acid sequence (A— Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KNLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y^Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 



WL/T P VI PTLWEAEVGGS FEHRSSR 



12926 



26827 



13063 



209 



GKyilxLLKVFCCFLRQGLCHLAHCSLRLi 
PGSSDSRASASQVAG/ IS ITGVHHHARL 
NFVFCFFFLVCFCF 



SHAMCCNYLKRFGWARWLMPV/ IPALWE 
AEAGRSPEVRSSRSAL 



12927 



26828 



13064 



132 



12928 



26829 



13065 



269 



134 



TLWSVFASS / WAWWLTPVILAPWBAKAG 
GSPEVRSLRICKQKVLS 



12929 



26830 



13066 



853 



461 



RLWRSVSQFIiCACVHMCVPSVCVCGIiAH 
AGTYAVSTCVH/MCVCVC3^CARITLAKI 
CSETSGNLLFRKI ISLLDLFTLNFS IKY 
KVQLVLKSRSLAWWGRIKLLTLSRPFSD 
VNTFERINYWLINDIS I IR 



12930 



26831 



13067 



57 



2066 



AQPTGRQTSTVAPTPATARSRGGRVP I F 
PCPHPAPTLLSTSRWTPASPAASMKAIi 
RLSASAIiFCLLLINGLGAAPPGRPEAQP 
P PLSSEHKE PVAGDAVPGPKDGSAPEVR 
GARNSEPQDEGELFQGVDPRALAAVLLQ 
ALDRPASPPAPSGSQQGPEEEAAEALLT 
ETVRSQTHSLPAAGEP\EPAAPPRPQTP 
ENGPEASDPSEELEALASLI/QELRDFSP 
SSAKRQQETAAAETETRTHTLTRVNLES 
PGPERVWRASWGEFQARVPERAPLPPPA 
PSQFQARMPDSGPLPBTHKFGEGVSSPK 
THLGEAIiAPLSKAYQGVAAPFPKARRPE 
SAIiLGGSEAGERLLQQGLAQVEAGRRQA 
EATRQAAAQEERIADLASDLLLQYIiLQG 
GARQRGLGGRGLQEAAEERESAREEEEA 
EQERRGGEERVGEEDEEAAEAAEAEADE 
AERARQNALLFAEEEDGEAGAEDKRSQE 
ETPGHRRKEAEGTEEGGEEEDDEEMDPQ 
TIDSLIELSTKLHLPADDWSIIEEVEE 
KRNRKKKAPPEPVPPPRAAPAPTHVRSP 
QPPPPAPAPARDELPDWNEVLPPWDREE 
DEVYPPGPYHPFPNYIRPRTLQPPSALR 
RRHYHHALPPSRHYPGREAQARHAQQEE 
AEAEERRLQEQEELiENYIEHVLLRRP 



12931 



26832 



13068 



12932 



26833 



13069 



12933 



26834 



13070 



282 



53 



PGF PQGKFRFKKKI LPQDYP FQGAPNS K 
ARPGQGVP PGI PA\LWRVKKGGPLRSGG 
PGPPGAKGGTPFLPIPNTKKS 



177 
789" 



VSLCRPGWMECSGVISAHCNLRLPGSSN 
S / P ALASRVAG I TGARHHALLNSLFKVL 
SRA 



302 



PHPEFYYSFQYFIYFFISFHFTRHFIVH 
YFISSHFISCHLASSSSSSSSSSSSSSS 
SSPPAFHYSFHHFISFHLAIHY\HFIIS 
FHLFHLFS S FHS I S FRDRDLLYCPGWS V 
PPHPAFHYSFHH/VHVMSRHFILRQTRS 
IAQAGIQWRDLCSLQPEVGSSRSA 



12934 



26835 



13071 



193 



P PYLKSHSQVDLS PGVQDQPGQHRETPS 
/LTKN/TKVSWVWWCTPVIPARWEVEVR 
ESLEPRCSTRA 



12935 



26836 



13072 



122 



1495 



IiliSDFFFFETE/SRSIAQACMQWCYLSS 
LQSLPPGFKRFSCLSLPSSWDDRCPPPC 
LANYCIFSRDRVLPCWPGWSRTPDLR 



12936 



26837 



13073 



178 



347 



CCL CNDNS VLLCMYYLLVCMHACMHALY 
IKTWK/MQIiGAVSHACNPSTLGGQGEWI 
T 
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amino acid 

residue of 

pepuae 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

pep nut: 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Le urine, M==Methionine, 
N=Asparagine, P=Proline, 
i^vriutarnine, K— Arginine, o— serine, 
T=Threonine > V=Valine, 
tt— x ryptopnan, i— jyrosine, 
X=Ud known, *=Stop codon, /^possible 
nucieouuc deletion, v—possiDie 
nucleotide insertion 


12937 


26838 


A 


13074 


165 


34 


PILSFPFETVSYFVAQAGVQW\LIIAHC 
SLDLPRSSDSPTSAP 


12938 


26839 


A 


13075 


205 


105 


GGWVQ WFMS IIP / AI WEAKAGGLLEPRN 


12939 


26840 


A 


13076 


3 


357 


HERE YTQAEKRKFIAFFFFKMGSCSVI Q 
ARVQRHDLS SLQP P / RVAGTTGTHYHTW 
LIFVFSRTDSRLPSSPRSPLTSVISLI I 
PLSLI PTTYLTTPSRPSSTSPPSHPSFR 
APLTSDP 


I2J/4U 


26841 


A 


13077 


3 


477 


STPALTARGLSGGSRASVAAMVLLES BQ 
FLTELTRLFQKCRTSGSVYITLKKYDGR 
TKP I P KKGTVEGFE PADNKCLLRATDGK 
KKISTVVSSKBVNKFQMAYSNLLRANMD 
GLKKRDKKNKTKKTKAGSNSS / ATTAAP 
AAAATAATTAATTAATAAQ 


12941 


26842 


A 


13078 


1077 


1566 


RQVTSLDPLKTFPRTSLEFTS PTGQLDS 
GATAFTWWPSGAPTPRGRESAAHQGN/ 
YPVGQRS\RMAGLSPRSFCWWEVSFSR/ 
SHSC/ICWVQLGKSAVLPQQSRGPGAHE 
GSRRRGGSSPPCPPLQTQLVFSPQELSM 
S AP IHRLS VTLQGDC ISTGGAPSRCRLA 


12942 


26843 


A 


13079 


3 


396 


HERGRKMVSVT \RL IQRLRNWASGHDLQ 
GKLQLRYQEISKRTQPTTKLPEGPSHKL 
FNNYY\CTRDGRRESVPPFI IMSSHKAL 
VSRMPAYS FARAATDKKAVTPAP S I YRG 

DrtCOHtinVT DWWl'f TT J»l HIT T 

JlUbSDHP x LiRHtri Li T VTKLL 


12943 


26844 


A 


13080 


467 


109 


SLVPWSPHLKEPSCRYLIENVMLVPKE 
QEFWSKEAGSLFSPPVPFYCWTPSIVFS 
SSFFFLFRCVCFFFFFFCFFFFFF\LLF 
FGTKKALYLLRARGCKQYKNQKLIWYLT 


12944 


26845 


A 


13081 


458 


51 


VSLFCTVAPLLiLLPEGIFFSCSLLEVKA 
PMRDVSGVFIiTLFLYSVLFFPLGVFFFL 
FFFIiILPPPKIFFFFFVCVALFFFSPAF 
GAVFLF \SLWSS VLPGVGFFTACSFFFF 
FFFFFFFFLRQSFALVAQARMQWR 


12945 


26846 


A 


13082 


23 


327 


RSDSEGGRRGHKYIFIPIYTHTHTYIFI 

VmTVT^T TJTV71 C*T ri If TfTUT nTf T17T 

HTYIYISIYAHIYLYIETTHIYLHIYTHT 
YIYIYLHIHIYSSWVCKGS 


12946 


26847 


A 


13083 


352 


153 


KKLVBYYSAIKKNSVLIH/VSTWMRLKN 
HKYRS RLVDSED 


12947 


26848 


A 


13084 


121 


417 


DYDYYCYYYPKI YLTKLLYFY \ VGNYIY 

lrlJ.xV.KIiV XXUSl XWirv X lrlJLril XlLlx 

IYIWKETTWLFSCGISKTYRTHDLAYTY 
LVFCLTHVT FHVS S T 


12948 


26849 


A 


13085 


242 


12 


LSSRIjPMLLETKE I TS KILPSRKL/ LP Y 
GWCFKSFKITWVSQIRNRL\AQWLTTVI 


12949 


26850 


A 


13086 


274 


340 


GGEKKTPGGFLEKKSFFGGGIFGPPP/P 
TKRGVFPPSPKKFFFPPKTKNFGRGGGP 
KIPPPKKDFFSKNPPGVFFSPP / YKKKK 
I IFPPPVKLGPPKDFLKSPPPFFFFFYF 
FFFFFFSSQEVNLKALSSTSE 1 1 FFYIiV 


12950 


26851 


A 


13087 


241 


405 


P P PS KI H IGNFRKMS D VKNATG WAWWLT 
P VT PARWEAEAG \ VS LRPRNS RP AW AA 


12951 


26852 


A 


13088 


537 


375 


KHSLTLSLRIiECRGTI LAHCS/ L / CTSL 
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nucle- 
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peptide 
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Amino acid sequence (A-AIanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=G!ycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N-Asparagine, P=Proline, 

n=nintflmin(> 1}= A raininp S=Serine 

V£ — VJrlUUIUlIIiC, IV ArglMI»l«5j ^ 

T=Tbreonine, V=Valine, 

W=Trvn tn nhan Y=Tvr fKin e~ 
TT — JL 1 jr JptU^f Hall, X — ijrivaiii%*> 

X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














F 


12952 


26853 


A 


13089 


448 


186 


GALVFPSSQAPPCPPKMEQSFFFFFFFL 

w r m QT? QVQTf ZVnVOT.HHR^TiOPPPPGFER 

CiXudKO V SAciU vyunnuwiJy * **** 

SSHFSVTTI /TYLYHHRLALHVFELHIN 
GIMHR 


12953 


26854 


A 


13090 


365 


191 


riOPPFTTRPPT^K iTPPOKXRKVE /REEEKE 
RVRDLESESNRALERERSALVDRAPLSR 
PG 


12954 


26855 


A 


13091 


148 


316 


DKMESCSLAQTGVQWHD\LG\YIiQPSPL 
GFKRF CLSLS S S WD YRWETSTE PFQQAV 
F 


12955 


26856 


A 


13092 


712 


229 


rv AW V AMAiS i ii/Vo J. ro 1 a lUJivviMV-or i r 
KIGVCRHQDRCSRIiHNKPTFSQEVFTEL 
QEKYGEIEEMNVCDNLGDHLVGNVYVKF 
RREEDGERAVAELSNRWFNGQAVH/GEC 
TRGGFCNFMHLRPISQNLQRQLYGRGPR 
RRSPPRFHTGHHPRERNHRVFP 


12956 


26857 


A 


13093 


835 


405 


SAALLCLLLTAAAFSPQGIiAQPVQINTS 
TTCCYRFINKKI PKQRLES YRRTTSSHC 
PREAVIFKTKLDK\EICADPTQKWVQDF 
MKELDKKTQ\TPKIJ^IHDWNLENQAMT 
LRKPN 


12957 


26858 


A 


13094 


1137 


424 


LCPSHFAPTTLTQPGAHKNMCCIKSRFK 
RDLGLCRTCLVNKMFTSSILGKSHRHSL 
VSINQGNNALWKAAG\PLSWKAGYC\QG 

SVLRWISKRGKPLAVEIEGRATGLPGLA 
PWGTE\CLGYKTPIV\HLFNSEMG\ENR 
PYGGEARHVCSNAALLFFTPLRCLGGEK 
HKSGLPAHPVIVLSLEIiNYDIDSFAHMF 
ITATjT.T.TiT TTLJ.SCY I PFC 

r/iU A-i 1 j Jj JL JL X l J I to X x.trr *~ 


12958 


26859 


A 


13095 


1 


324 


ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERERP /RPRVLFFVRGREGSHPEKCEN 
KKERAPFSRALRKI SIETRGGVS ITTEA 
rVFVTGAHRVCRS VYI YI 


12959 


26860 


A 


13096 


508 


840 


DGVSLCRPGRTADCSGAISAHCKLRFPG 
SRQSPGLSLPSSWDYRRLKPMRPANFFF 
PTFF\ VKTGFHIiVSO /AMGLDLLNS /SI 

p prlglp kcw \ d yrreatapgqelllnr 
tgm 


12960 


26861 


A 


13097 


53 


254 


WPQTASCLVAQAGVQWSDHSSLQRQTPG 
\SSHPPASASQVARITC31HHCIAWLHLI I 
IiLRAHGSSPIiC 


12961 


26862 


A 


13098 


329 


67 


GPGAPLLKGEKLLPKEPPPRISPIVSAF 

fawr pppgycflickggpffX vfpps ffg 

KKKNS PPP PHINGGAKTFPNPPLliFSSF 
FFF 


12962 


26863 


A 


13099 


253 


AAA 

444 


FRAPS TjGT FYF I FFLVLSNTFLLS I KNO 
QIWNKKTVW\PSPFLPTLIALTTLliIiPI 
SPFILIIL 


12963 


26864 


A 


13100 


341 


590 


PMASSQICIGRIPPFLYIiKWYFWPGKVV 
HTCNPITITjGGRGALITQGQKFEWSIiAN^ 
MAKPHLY 


12964 


26865 


A 


13101 


55 


184 


" " PJjREPTPLYSE/HAPTKYYRMABHNYPH 
SPPIPQPRHHYRVFLL 
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ing to first 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProIine, 
Q=Glutamine, R=Argmine, S=*Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Oyrosine, 
X=Unknown, *=Stop codon,/=possib!e 
nucleotide deletion, V=possibIe 
nucleotide insertion 


12965 


26866 


A 


13102 


433 


762 


RNLIiNCKRGLBRKGCRASWPAVPLFLQR 
RVWPRPTAELNEACMLAQHQNCQNVKME 
YPQLS FTKRQE I /WLGWAHACNPSTLG 
G \QGGG I ACSQE FKTIPSS KVRPCFYLK 


12966 


26867 


A 


13103 


349 


250 


DLFFSTKPPPPPPPPKNLGPP\GPFGPP 
FQEG 


12967 


26868 


A 


13104 


3 


236 


YPQHYPS CPTFQ YTLKTKNHNFL I FFBV 
GSCSWQARVQWCHHGSLQPP/NSSDPP 
ASAFQVAGTPGVSHRAPPKNQNF 


12968 


26869 


A 


13106 


154 


4 


KKVKNTVHYNGKYFLKSQNQSTFLKEIL 
/ WliGGVAHTCNP STLGGRSGR I T 


12969 


26870 


A 


13107 


391 


150 


R PP KKPNKKYS Q I TS KG I KMVYTRKYLT 
QKKKAMSQALWY/V/HAVPATQEAEVGG 
SCEPGSS RPRCTMIMPMNTHCAPAWAT 


12970 


26871 


A 


13108 


90 


252 


KVGDGGRCLVFCFYFKRQGFTVAQAGVQ 
W\ATIAHCSLKLLGSSDPQNYEDSWGT 


12971 


26872 


A 


13109 


197 


414 


LYEFI KITVLLGLGCPIiGRYGCSNQRFF 
SLEMEFHTVAWARVQWHDLDSLQPLPPR 
PNRFSCLR/YPQSSWDYR 


12972 


26873 


A 


13110 


2 


317 


GRVGANVHKGHRQRTYGSVI PHILPLHV 
LK3CTFSLRDFHFSVSLKK\NLVLTCLHL 
FL/GVRTPRNDPFVSMMLLFTAFLDRPS 
TILjGTGLLYTEGLTVALRLAYIiR 


12973 


26874 


A 


13111 


413 


162 


LILLPQPAECLQLQASAATLDCLGLPRC 
RDCRRS LVHSVLNGAQAGVQWRDLGSLQ 
PPPPS\TCLGLPKYRDCSLCPAATPSGK 


12974 


26875 


A 


13112 


40 


296 


ESRLSALYIYHICVCVYNRKHLLYGIHM 
/SIHTYTHTHTHTHIYIHMRAFALEDKF 
I CSLLCSRQDNAFILVSVKLQRKNYFLT 
RR 


12975 


26876 


A 


13113 


2 


288 


FEPRCKNSARGKVPRGFSPNLPHVTVET 
PEGSKTGFELESR/RRHRQIHSSAQCPL 
TAVPGAGDAI PBDASGWHTWLPLHAQNC 
FLLYIQAPEQPPA 


12976 


26877 


A 


13114 


685 


340 


LS PPRAGSARPTGP PTDAPGQRSTWTCG 
ALKPRRRALRDAAENLFQELQEHFOALT 
ATLNLRMEEMGNRI EDLQK\NVNDL\MV 
QAGIENS I KEQMLKTVTANMSVFGDGAS 
YRSH 


12977 


26878 


A 


13115 


65 


3021 


GVLTQMGDE KDS WKVKTLDE I LQE KKRR 
KEQEEKAEIKRLKNSDDRDSKRDSLEEG 
ELRDHCME ITIRNS PYRREDSMEDRGEE 
DDSLAI KP PQQMS WKE KVHHRKDEKRKE 
KCRHHSHSAEGGKHARVKERBHERRKRH 
REEQDKARREWERQKRREMAREHSRRER 
DRLEQLERKRERERKMREQQKEQREQKE 
RERRAEERRKERBARREVSAHHRTMRED 
YSDKVKASHWSRSPPRPPRERFELGDGR 
KP VKEEKMEERDLLS DLQDI S DS ER KTS 
SAESSSAESGSGSEEEEEEEBEEEEEGS 
TSEESEEEEEEEEEEEEETGSNSEEASE 
QSAEEVSEEEMSEDEERENENHLLVGKN 

EGTPQ\SSAIiTEGDYVPDSLPLSPIELK 
QELPKYLPALQGCRSVDEFQCLNRIEEG 
T YGWYRAKDKKTDE I VALKRLKMEKEK 
EGFPITSLREINTILKAQHPNIVTVREI 
WGSNMDKIYIVMNYVEHDLKSLMETMK 
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peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
i^^VYiUiaiuioe, iv— fti gin iu c, o uci tuc, 
T=Threonine, V=Valioe, 
W-Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














QP FLPGEVKTLM I QKLRGVKHLHDNWI L 
HRDLKTSNIiLLSHAGILKVGDFGLAREY 
GSPLKAYTPVWTQWYRAPELLLGAKEY 
STAVDMWS VGCI FGELLTQKPLFPGNSE 
IDQINKVFKELGTPSEKIWPGYSELPW 
KKMTFSEHPYNNLRKRFGALLSDQGFDL 
MNKFLTYFPGRRI SAEDGLKHEYFRETP 
LPIDPSMFPTWPAKSEQQRV\KRGTSPR 
PP\EGGLGYSQLGDDDLKETG\FHLTTT 
NPGGLLPRGPGFSIiRVAEVSELDPVIGR 
NSAGDHRRGYLPAWNCDGTRNSSSYFVL 
PCr V c vr u V L-ivuoK JL x\»3 x? oxjxjWl*U\isxj\~r 
LRDLPRAERVGI FRGAHVGQHKPHHHPL 
PbSTRTGLGWAVIWKGTGGSRVDCLIFG 
AGDLFLCWDERCPLAPTHSSRPALSTTS 

rKr if x I ui^iiiirnvjJUCiU v r x x jcvv vvjv xjxv 
CIKIFFEE 


12978 


26879 


A 


13116 


469 


202 


Mil IVGLCEYTKSAVLVQHSDPLPPAPG 
tn?T7T a "iTT n\7im t it Tin/HP vf rrr ■ fvctl ye 
MESCPVAQAGVQWRDI/3SLQPPP\PGSS 
TSCF 


12979 


26880 


A 


13117 


193 


383 


AAKNAo Iff vrwtc WlfoAr \oinoi\.vv.» 
CVCVCVCVCRMHTVQLYFFQVGCQCSSL 
T.KTPT.FPMKT 

1_ixm cur jri*UM 


12980 


26881 


A 


13118 


146 


350 


HYLVKFIiLLAHPGS KAPPLNTLQPPPLA 
TDTJnPPT.TVT FHYP PTS YKMAP\PYLPS 
LTLFGLSPPAPR 


12981 


26882 


A 


13119 


124 


254 


tttvt ,t jypt .pnT.Qri AWWHALVI P ATW\ RAE 

XlrVLJXjy 1 iJi^iJXJO V**" Mricii-l VAT .nx r» 

AGGSLEPRSS RPAWAT 


12982 


26883 


A 


13120 


150 


45 


CT PPT»T?Pc;i?T\7T»AH/P/NIiHLIiGSSDSP 
ASASQWGGI 


12983 


26884 


A 


13121 


336 


446 


IPLKISSWAQWLTPV/ IMPAVWEAEAGG 
SLEVRSSGPA 


12984 


26885 


A 


13122 


181 


39 


Ql^TAQDGVQWCDIiGSIjQPP\I*PSSWDY 
RRESLCPSSFFIFWKRWGFT 


12985 


26886 


A 


13123 


463 


2 


\yrr/W jrOVxLJx xjxVLxtvtr pirtKxjV-rtOxao tr dooo 

HDKGQGTLPRI PQ/ PLxLGEGGSR/ PIiAW 
navarsT .p qktr pp t vpTjPAPTRSGTRVRP 
HTGHQPGMSPGCXjCESKWPHSAAAWPEA 
YPHFFIiIjPFPTQGCEVIjGPLYTADPWVI 
CVSLLPSCPNSTAVDVD 


12986 


26887 


A 


13124 


429 


263 


DHFS FLCVTGSQS VAQAGVQW \L I IAHC 
SLKlxLASSDPPASVFQSTRITGPFLSFT 


12987 


26888 


A 


13125 


310 


486 


" nkkttrgesskqrqprvshhpgwgtvap 
sgftaasasXrarvtllprppe 


12988 


26889 


A 


13126 


124 


461 


GSPLQLLPSPIxAALTRDCSKAPMGSCSV 
annnvnWPDljGSLORLPPGFEPFSCLSP 
TPAP\FPSGWDYR 


12989 


26890 


A 


13127 


2 


337 


RGAAPAAMAVTAIxAARTWLGWGVRTMQ 
ARGFGSDQSENVDRGAGSIREAGGAFGK 
•pTcnziPWPP vttr Rn<;PT^IiAAIjKK\HHDE 

XCJZA^nXaXaXjIvX X? IxriwOXVC* wXJi»*»xjx\a.\. \juil/jj 

E I VHHKKE I ERLQKE I ERHKQKI KMLKP 


12990 








OO 1 


422 


CRSDRWAKEHRGKRGQDSS KDVMARLME 
APKQTAQYFFI FYFFETKS YS VTQAGVQ 
WIxDLGSIiQRPPPG\SSDSPASASCAWPQ 
TAH 


12991 


26892 


A 


13129 


299 


579 


LVMFCKVTKIERLNLWRPGTVAHVLWSQ 
HFGRPRWAVHVGWGVRDP/RLTQHGETP 
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nucleotide 
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ing to first 
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residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, Lr^Leucine, MNMetbiomne, 
N-Asparagine, P=Proline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














SLLE \NRNISQAWWCMPWPATREAKAR 
ESLEPGGQRLQ 


12992 


26893 


A 


13130 


268 


438 


KVDKSMKMRKNRPKNAENSKNKKASSPP 
NDCNSSPARVQ/SWTENEFDKLTEVGFR 
KW 


12993 


26894 


A 


13132 


1108 


1529 


KIHSS FVLLNAKLVAVAAKTPKFYKKKT " 
DLPQTYHSAQTAVPEIPVSAHSTFIRSF 
IHSFI FETES YSVAQAGIQWHDLPS \ PP 
GS\SDSP\ASVSRVAGTIGVCQQAWLFF 
V\FLVET\GS FPHLSPLTMIRNKKQLFK 
KGH 


12994 


26895 


A 


13133 


497 


370 


RPSTVSPRIiECSGLILAHCS \LDPPASA 
SQTAETTYGHHHAS 


12995 


26896 


A 


13134 


217 


373 


CNHTEITYVNKKKHRPGTVAH/SLYNPS 
TLGGQGGLITQAQEFDTSLGNMTLS 


12996 


26897 


A 


13135 


257 


507 


PVSPGFPPL\CPVPHTPHLCPPCRYPES 
PGQPAAQHCGAPRQPQPNPRACSSRGLF 
RCPPACHHAPSCLLCVCPPLPQCCRVQG 


12997 


26898 


A 


13136 


137 


309 


PSFPTHPQFCFIiFCFLFVFETE/SSIVA 
QAEVQWCDLGSPDQGSSASPPSASSLPL 
SIW 


12998 


26899 


A 


13137 


297 


419 


GGSAVYYIWEIVQVLWLMPVI PT\WEAY 
AGGLFEPRSSRPA 


12999 


26900 


A 


13138 


213 


492 


QVGINYQPPTWPVGDLAKVQRAVCMLS 
NTTAI TEAWGLPGAI SSAKCALVHW \ YV 
GEGMAVGE FSEAREDLAALED YETGGPW 
NPVETEAERR 


13000 


26901 


A 


13139 


246 


534 


DRVLPLLPRLECSSAILGHCP/APASQV 
TGTTGTHHHTOL I FLL YCWLC P P PLAS E 
ASAES PPLLPRGFLCFSSTTSS ITAI PA 
TTRDYVDVSRSGS 


13001 


26902 


A 


13140 


1 


206 


SLEWPFFFSFS ETGSHS FTQAI VQWCNH 
SSL\SPTSASQIAGTTGVHNHAQL/VFV 
FCFAFMS PQLSNSC 


13002 | 


26903 


A 


13141 


164 


582 


GLVLVNLYLRS YLMVF I YINS RLLQVTD 
ASRERRENESSWFCRS VCRM/ PPVGPGP 
SCVHPCSPRGSISPCTPHCPLKTTQAPA 
\ PPRPLPRVGPGPHLDQSCCVAARPLYD 
P I LVAAHSS WPAPPS PVP VPQEDTEHGA 
G 


13003 


26904 


A 


13142 


235 


367 


LALNTFCWIjGVYAHACNPSTLGGQGEWV 
TRGQE \QANMAKTCLY 


13004 


26905 


A 


13143 


91 


403 


GALQPATAPWEPLSGIiAEAGAISPCLQG 
G/SGGRGASENRGCTRPGRVPGGRRIiNR 
PRAWSSWPAPPAPGSERPSAINCPRAEE 
CGRRVWDWQAALPAAPAWVPV 


13005 


26906 


A 


13144 


558 


147 


GHS FQNWLLGCPLQLQRHLLHP PDAS EE 
CGSPPWTPLRPHIHPSAGPHGSTHK\HA 
GGCLFPLRPSPTHPCGPPVPLPWSQ\QA 
PLCPKP P IHSPHDWECDPGKS ALI P PPA 
AQSPSRLIKHAANEPECSPRNRRPGIP 


13006 


26907 


A 


13145 


321 


421 


GCAQWLTPV/ IPALWEAKAGESLLFRSL 
RPAWAI 


13007 


26908 


A 


13146 


3 


398 


IELLQSHDKTLTDKKLLLLDEQRKWFLB 
MESTPGEDVNNVELTTKDLEYHINLDD\ 
KAGFERI DSNFERVSAVG/ KMLSNS I AC 
YRE I FHE RS VKVAN FTVAI LPQ PHKLS V 
TINLIIQQPSTLRQDLLLAKR 
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SEQID 
NO: of 
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od 
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USSN 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

rlino to 

last amino 
acid 

residue of 

UClf UUC 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G^Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
O^Tihi famine. R=Art?inine» S=Serine« 
^Threonine, V==Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /=possib!e 
niirl^ntide deletion. V=Dossible 
nucleotide insertion 


13008 


26909 


A 


13147 


228 


261 


SKTQRSSEYYSIiSVCVCVCVRVCIYIYI 
FFFFETEFbS\APQAGVQRCDLSSLRLP 
PP\GSSDS PFSTSQSAWDYRREPQRPTS 
LHYFVQLKHNLHRVK31RDLRNTTR 


13009 


26910 


A 


13148 


n 
J 


LI J 


KHLSDLQILRLGLSS KTRKTRYSHFAQG 
RS I FFLLRP S VDW I KP ACMDGNLLFSKF 
TNLNVNI \TKKHPHRNVQNNVWPNVWAP 
WPKEVDT 


13010 


26911 


A 


13149 


218 


382 


AWDHIKLESFYTAKETT/RMKRQPTEWE 
KI FANYP SDEGLVTRMYKELKQL YRKKI 


13011 


26912 


A 


13150 


391 


29 


FCFLFLFMFLFFFEMQDGVQWCNIiSSPQ 
PPPPG\SSHSPASASQWEABNCLNPGG 
GGCSLGDRARLPQRKRERERETERKRKR 
ERASCAFCX3AITFVNMWCX3KHHTCNHLP 
WLTVFETNKL 


13012 


26913 


A 


13151 


351 


157 


fgysrflcvv infrtnlsiynyiciyin 
lyl\nihtytliytyvhtythihifyiy 
igiltgit 


13013 


26914 


A 


13152 


402 


111 


TFFGGEKPPFTLRPGPPRREGGGGPPGP 
REGGFFRVPLGAFGPSRPK/RPEKSGPV 
WAPQIFLWGGKSRAIKESGKGFFFSRIT 
T.t.vp&tk k K KKTTLF 


13014 


26915 


A 


13153 


310 


141 


REHEKQKPLSQVRWCTPLVQAAQEAEAG 
GELEPRSSRLQCSMT\TPVNSHRPPARV 
T 


13015 


26916 


A 


13154 


153 


21 


LNKCVISWLGAVAHACNAPSTLEGRWIT 
QGQEFKTSLGSLAKPHLY 


13016 


26917 


7V 

A 


nice 

13155 


429 




SVDIPLEKTLINKDVFKEAAFKCKVS /R 
QARVKFKKRYKPVKTKWFFQKLWF 


13017 


26918 


A 


13156 


303 


42 


YRERAP VYP 1 1 IN IAFG PHKQ \ TRAHTH 
THTHTRMHAHTHTHPAI I PAFAKNGLGT 
FHHKGRTYNDKVKPRLSCEAAIPSNLKS 
REFR 


13018 


26919 


A 


1315/ 


lyo 


ii 
ij 


shgvaillffhflnkraft\yspamnss 
lceiqepslg\sgsgplsgnrmsgkghr 

KYFAKI 


13019 


26920 


A 


13158 


192 


2 


I SNSHRKD I KDPEVLLIEQKY ILSGQMQ 
WLTPVTPAL/V/EAEAGRSLEFRSLRPA 
WATDRDSVSK 


13020 


26921 


A 


13159 


40 


190 


RVDPRVRESRS VAQAGMQWRDLGS LQAP 
P\PGSRHSPASASQVALFLNRK 


13021 


26922 


A 


13160 


1 


178 


PTMVLS PADKTNVKA/ AWGKVG AHAGEY 
GAEALERMFLSFPTTKTYFPHFDLSHGV 
SSYL 


13022 


26923 


A 


13161 


411 


41 


ESLQICVGLHAKPSWDACRPWLGQAWFR 
YFLLTFLGPLVFLLVFWFSKIYFLYHLK 
NPSLSSCYNLD\FNTQYLKYCKKKKRLG 
AVAYACNPS ILGSQGGRITRGQEFETRL 
GKMAKPDAWVDP 


13023 


26924 


A 


13162 


372 


218 


FWKRSWISKKGLFKRGVSLSLSIbS/CDT 
HTHTHTHTHTHAHSGYLCVLRKVK 


13024 


26925 


TV 


111 £Sl 


2AI 


CO 


QGLVLS PWLKCRGGITAHCGLLGSSHPT 
TSA\QVAGTTGTQHHAQLKFFILI il 


13025 


26926 


A 


13164 


122 


409 


FPSTAIANSH/ CSS PQGCS AYGVAIRLF 
LRSSNKIiAL\LYGLAL/NSFFCNFQEPS 
\LGSLSGPI*SDNKSNNLCCFKSPSSWLF 
AI AVLGNYYSLNTVRS GRRDPGTPNACS 


13026 


26927 


A 


13165 


69 


313 


PSVSLFFPPNHNTVIHMGLLKINS I FGE 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leu ci n e, M=Meth ion ine, 
N=Asparagine, P=Proline, 
0=Glutamine R=Aroininp &=S<M-in» 
T=Tbreonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














GLALS S RMECS VT I TVHCGLDLLGSGGP 
PSAAS / RVAVTAGTCHHS WLMFLS FV 


13027 


26928 


A 


13166 


310 


26 


IKSMTPLPALNVSLSYYLPPLRPCRERG 
MVQDRFFCFLFCCCCFIFS ETGSHSAAQ 
AVSQNKKTTGWAWWLTPVI PATHEAEAG 
\ESLERGRQRLQ 


13028 


26929 


A 


13167 


296 




NCSKLKKNKIKTKKWGWAPWLTPILPA/ 
LLKAEAGRSIiEARNLRPVWT 


13029 


26930 


A 


13168 


2 


235 


GATPCPREPSLFPLAGWAPLSCWAPPPH 
CTQGTTPPGLMGTG\APIAI IGGNARSP 
PMPFLPLSGTRRALSAQECPTF 


13030 


26931 


A 


13169 


248 


406 


TERCGFDLIjSIjGOVOWLTSVTPATjR /fa 
EAGELLESRSWRPAWATWQNLISLKK 


13031 


26932 


A 


13170 


33 


308 


NSLQGAEMAAAQSSLFSASEGVTRGERGR 
GRGRGRPGRG /RSEGDRGGRGRPGRflPfi 
RKVLLLRI CVWRRVLRL WNRGRIGVGGR 
VTYLMESEI 


13032 


26933 


A 


13171 


294 


56 


ICKRNYIiFFIYFIiREGIjTLLPRIiECSGA 
ITAQCSL\GSSDPPTSASTVSGTTGACQ 
CAIiSSRDATiT < >Tir , YPf;W < ?nAPfiT.Tf 


13033 


26934 


A 


13172 


71 


476 


DSLNYDNHIYQMPLIYINMGIAVTISLL 
G\ILVYRSHLISSLLCliEGIILSLFIIA 
TL I TLNTHS LLSNTVP TAT T.VP AAi"T? A A 
VGWEECGFRSRPTALILBGPLRSCQPRT 
CVRRP ACTRRG PAFVG KNRHS IGG 


13034 


26935 


A 


13173 


86 


318 


NNCNVCVCVCVCVCVCVCIHICIYV/YV 
YICIYTYKYICIYTYKYICICVYTHVYM 
YMCVYVYICICMCVYICIDSKGR 


13035 


26936 


A 


13174 


466 


726 


NCMKFGAVTRIG\DLPWEINPLSSCSLL 
REKDPPTTSOPryrnnPTTCTrT^TN'PKQrtirD 

PLFTLFSNLPHYPSTSFS FQSWRHTS I S 
PFS 


13036 


26937 


A 


13175 


128 


325 


FLRVI LI CHKM YGCVCT/CM / Y ACT PNfJM 
CI YMCVCVKS WKLKP ELKCLC I SDEVGW 
VLTQWAGREEN 


13037 


26938 


A 


13176 


134 


311 


INSNSEKMGSHYVAQDGVB/WQGLFTGH 
GI AHCNLKLLGS S NFS AS I SQGAGTTG I 
VANG 


13038 


26939 


A 


13177 


577 


1052 


SPTSTRTGSGVAMWIiSOPMGRTV CR<51CV 
ASKSRMGPLIWAHPPIVRCPMLRHHY\ 
KAQAGRGLS LE E \ LRVAG I YKKVAQT I G 
IS\E0ARRRNQST\QALQAXVQRrjK\BD 
RSSLILFPPJC\PLAPKKGDSSAEELELD 
TQLTGPEMPIGNVYKEKARVIAD 


13039 


26940 


A 


13178 


496 


678 


TtiNFW?AQGDKITRPWKNAPVFPVTW\R 
LRRKNLLTLKGE ICSDPKWPYCLPPWKT 
KKNLV 


13040 


26941 


A 


13179 


192 


421 


GI YTFFLS FFIjFLRQS LTLSPRAGVQWC 
NLCLPGSSDSPASASRVPG\ATGT\CHH 
GSANFLYFLVGDRVFGYVGPR 


13041 


26942 


A 


13180 


144 


5 


LOLAIKLLKMRr«flVVAHTrNPCl / TT fZTlXd 
G\GRSQGQEFETSLANIVKP 


13042 


26943 


A 


13181 


351 


51 


KKKKKI FFGTKKRRFFFLKGVWGP PPPK 
VSPPPPPKISPNPPKNOTLLKA/IKSPQ 
NWLFFFGPPSCKFFPPFFLNPKNS PWEN 
PPSGGEFSPKKKKKKSAI 


13043 


26944 


A 


13182 


2 


218 


IHPGCFVFLFFETEFCSCHPGCRAGVQW 
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nucle- 
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peptide 

sequence 


Amino acid sequence (A=A!anine ' 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Ctycine, H=Histidine, !=Isoleucine, 
K=Lysine, LHLeucine, M=Metbionine, 
N=Asparagine, P=Proline, 

ftniiilaminp R= A roiniriC- S^^Serine. 

T^hreonine, V=Valine, 

W=Trvntnnh»n Y=Tvrosine. 

X=Unknown, *=Stop codon, /=possib!e 
nucleotide deletion, V=possible 
nucleotide insertion 














'» n\ T . ATTNQT ,T»P P PIjG \ SSDP PTS AS OS A 
E ITGMSHHACP I FKKHS 


13044 


26945 


A 


13183 


33 


304 


EKHG I FPJWKI LCM I LQRWT YATHLSRP 
TGYTAPRVNPKLWTLGDNDVSM/SGSL I 
irTTSTTAT J^MTiT.T.TPP <; Q AC94GOGDNK3NIjC 

V XIN X AiAJrVolJlJ JLtvlvOOXl»-*"VJ^vji-'i \\jxv*jj\~ 

TFLSILL 


13045 


26946 


A 


13184 


277 


13 


FSFFLIVEMGSCYIAQAGIiELLGSSNPP 
c ca QfiQ unTTTJM^ WWP MflTiATIjiCRAFVS 
D\NSLSFPSSENAH^ILAGHFCLMGFF 
FFQ 


13046 


26947 


A 


13185 


140 


361 


ITCCCVCIYTHIHTYVYVCVYIYTHTYI 
RVCV/YYIHTHTYIHIYIHVFIYIYIYT 
FMHTYIYPYVCMVIiIiQIiTL 


13047 


2694$ 


A 


13186 


64 


759 


CLSAESAPTSTMPMTDGYWD IRGLAHAI 
RLLLEYTDSSYVEKKYTLGDAPDYDRSQ 
WLOTKFKI/SLDFPNLtPYLIDGAHKITQS 

xnvTr OVTJVDiniMT fT'PTRITPTrTPVnTT.R 

Mft | | »K X JLAlUVtlHijvA7EiX&Z2^.n^ItVl^XJJC 

NQVMDNHMELVRLC YDPDFE KLKPKYLE 
ELPEKI»KALTS\EFSGGKRPWF/SQGDK 
ITFVDFLAyDVl^MKPaFEPKCLDAFm 
LKDF I SRFEGLKKI SAYMKS SQFLRGLL 


1304$ 


26949 


A 


13187 


41 


426 


LFFFFFESGPCWVTQAGGRR/WNDHGSL 
QPGFPGLKGSPCLTLRRSWNYRPGMCRH 
TQLWAFFFFPREKKFLLGAPTGFKTRG 
LRGPSRFGFQKGGNKGQEPPPRAKGLIF 
T iRPRRGOAFFGVFFGPTO 

JLI Cm IT ul\w<^ili> CVIVi 1 we -*» ^< 


13049 


26950 


A 


13188 


39 


197 


FFFLFKTRSCS F IQAGVQWCEHS S LQPQ 
TPG\SSAAPASASCKAGTTKAALKL 


13050 


26951 


A 


13189 


221 


396 


GIQLQGAEGLWDVTNSPFCWRIiYDQKTV 
YHEO^ANSHCP\PPKGLIiRDITPRCW 
AP 


13051 


26952 


A 


13190 


2 


826 


PGSTI SSRRRGACGSRGGHFPSPRGGSG 
VASIiERAES WSTEPAKAIKP IDRKSVHQ 
I CSGQVVLSLSTAVKKTVENSLDAGATN 
IDLKLKDYGMDLIEVSGNGCGVEEENFK 

r*T \TT.CAT KnUTQTTTnRFAr)T»TRVP!TFG 
yjij \ X i >nnl itvnn X olxXyD* xaI^aj ± x\- v oifu 

FRGEAx^SMAx^DVTISTCHVSAKVGT 
RLVFDHDGKI IQKTPYPHPRGTTVSVKQ 
TiRQTTiPVRWfTRFORNTKKKRACFPFAFC 

Xjr iZj x XJ*r V rvnivPi f yi\il X ivxvxvxvrxv-. x. x» l n— 

RDCQFIjEGSPAMLPVQP\AKLTPRSTPP 
HPCSLEDNVTTVFSSVKNGPGSSR 


13052 


26953 


A 


13191 


235 


1 


SPCARQCCPPNPAGQEPRRRLERGPGKW 
WPRSI KFPLPAV/RARFPLLPSAPLRQL 
VSGRVGGRVGRPGKAVOVSGGLN 


13053 


26954 


A 


13192 


2 


386 


VIYLLLFFETESCCVAQARDGVSPCWPG 
WFQTPG\SSDLPASAPK/VAGITSVSHR 
TRPDVR GFK3CC / GLOLEMSHLALiLL / TT 
SHRFWYFVLLFSFVSKGFFIFFFFLFFF 
FFxjGKGFFFFFPRGGGGGQI 


13054 


26955 


A 


13193 


749 


506 


" rVYi^AR\7T.<3YCrcAQPQTKQ/PCPOAKDH 
PLEPSMHPEGTQLQSCSIMLGPRQIjSSE 
KQPlVLPPRSHlxKSSPMIiRACKGLTS 


13055 


26956 


A 


13194 


73 


297 


RMPGFGALGSPFSCQRIECPC\AGAGSC 
TYASFCKCKEYKCTSCKKSECGAFPOJQ 
GAGQSQRREPRAQQAGAGQ 


13056 


26957 


A 


13195 


670. 


390 


PRETYI KPFARS IGCQPPKTIHPTSRRP 
STRPPARPHARPPVHTP/APSVHMSDRP 
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peptide 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
OGIycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, MhMethionine, 
N=Asparagine, P=Proline, 
v^Hjiiutaraine, k— Arginine, o— oerine, 
T=Threonine, V=Valine, 
»" — lrypiopnan, i—iyrosine, 
X^Un known, *=Stop codon, /-possible 
nucleotide deletion, ^possible 
nucleotide insertion 














STCPPSIH\RPPSTDVSRPQYSLSPSTS 


13057 


26958 


A 


13196 


2 


438 


LS FTS I EFCHLLFLNQLICNYVT FTKS S 
TISCEALGRLLWYPWTQRFFESFGDLS 
TPDAVMGNPKVKAHGKKVLGAFSDGLAH 
L\PPLKPTFATVSELHCAKLHVDPENFT 
LLGNVLVCV/MKHNFGK 


13058 






lily/ 


J/ 


499 


oPSWPRSSARPCSGHCLLPGLWDHGKAP 
CLRAARHSQGHGFRKGRKEGCGLWAAGW 
TKNP VPTWRAG / S RRGPGVTS P P P VPS V 
QNPP \ P PQRRHGGSQATRPVLFTPPPKH 
MEGPNAP I LGDQR ITAPPPB I LYRPPRA 
ATCDALRRG 


JJUJ7 






1 1 1QQ 




A/ZC 

465 


GILPYSPCVLRAPNSKRVRKCGCTTPRH 
It/ CKATRKOGCTTPRAPLNETRKCGCTT 
PRHLSRQI/3NMALPRGHLNETRKOGCT 
TPRAPLNATRKOGCSTPRHLSRQLGNAG 
ALPRGHPLKETRKCGGATPADAFYETKK 
IGGGL 


13060 


26961 


A 


13200 


177 


1075 


PTSSSMAFPKKKLQGLVAATITPMTENG 
EINFSVIGQYVDYLVKEQGVKNIFVNGT 
TGBGLSLSVSERRQVAEEWVTKGKDKLD 
QVUHVGALSLKESQELAQHAAEIGADG 
IAVIAPFFLKPWTKDILINFLKEVAAAA 
P ALP F YYYHI PALTGVKIRAEELLDG I L 
DKIPTFQGLKFSDTDLLDFGQCVDQNRQ 
QQFAFLFGVDEQLLSAIiVMGATGAVGST 
YNYLGKKTNQIV\EAFEQKDFSLALNYQ 
FCIQRFINFWKLGFGVSQTKAIMTLVS 
GDSNGPTPASTCRKPPGEFT 


IJVAJl 










A£A 

464 


V QE FKTS IjGNVAKPCLYKKKKKKS PLGG 
GVPPPPKKTWVGGTLSPQKIRPAGPLIF 
PPPLPRGQQGEPIiSPIKGGGGGGSS\YP 
PPPPPCLTGRA/PPPSPISAKSSPSPPP 
CQYVYHPPLLPLFFFIiHALRRATAPLFL 
FLVRA 


13062 


26963 


A 


13202 


1153 


259 


AGGVLRLG WTGS RMAS DSGNQGTLCTIi 
EFAVQMTCQSCVDAVRKSLQGVAGVQDV 
EVHLEDQMVLVHTTLPSQEVQALL\EGT 
GRQPVLKGMG\TGQFSESGGQPVAI PGG 
GLGTVQGWRFLQLTPERCLIEGTI\PG 

PDGAS\HGGPQDSDRH\RGDLG\NVGAN 

EG\EDDPGAREGQSLYPKITRELPGSRL 
SCGI\IAKSAGLFPEPQSRICSCDGLT\ 
I WEERGRP IAGKG\RKESAQPPAHL 


13063 


26964 


A 


13203 


281 


420 


VDGSKKYNEMPVIPALWEAKAGGLL\EP 
RKSRTAWATREDSVSTKN 


13064 i 


26965 


A 


13204 


3 


1125 


SDSPQTPRMRVMAPRTLILLLSGALALT" 
ETWACSHSMRYFYTAVSRPGRGEPRFIA 
VGYVDDTQFVRFDSDAASPRGEPRAP\W 
VEQEGPEYWDRETQKYKRQAQTDRVSLR 
NIiRGYYNQS EAGSHTLQWMYGCDLGPDG 
RLLRGYDQSAYDGKDYIALNEHLRSCTA 
ADTAAQ ITQRKWEAARAAEQWRAYLEGT 
CVEWLRRYLENGKET1X2RAEHPKTHVTH 
HPVSDHEATLRCWALGFYPAE ITLTWQR 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamie Acid, F=Phenylalanine, 
G=Glycine, H-Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M-Methionine, 
N=Asparagine, PHProline, 
Q=Glutaraine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














DGEDQTQDTELVBTRPAGDGTFQKWAAV 
WPSGEEQRYTCHVQHEGL PE PLTIiRWE 
PSSQPTIPTVGIVAGl^VLAVIAVLGAV 
MAVVMCRRKSSGGKGGSCSQAASSNSAQ 
GSDESLIACKA 


13065 


26966 


A 


13205 


417 


13 


GHPRVMIjQCHMGS AS L»S IMT P P P IRS YQ 
PALTRSSATKPA\TSAPSITPMTPVTAN 
I S /WPTANTTTTKLSNPTS S/STLSTP 
IWHPSILPAPPSTPAPIiTTVTNSATTST 
ICFHVQLFQAHHCHHCIAAALEYTS 


13066 


26967 


A 


13206 


1139 


559 


GLS ELS IHPSVARVQEGHGATPERS IPW 
PGGKPRRGGRGTGLREWGGGSHRSLGQT 
KKKGAKAKRDHCAHGPIAAFQGPFPRAT 
QLLSPCHGCSWKENQPP *GPPGHACTAW 
EKTVTHGSCRPVTKPSPDPGFVGQAPGT 
KGNPAPGMDG CAGFRI S CHPVP I TKPLS 
NTGICVLLCYEFGFLIAIVGCWGFK 


13067 


26968 


A 


13207 


306 


332 


ENGEIRIYIYICLYLIKKHWBEAGCGSV 
HL* SQHLKRLKWEDYLRPGVPDQPGQHR 
ETPSQKLNK* INNDF 


13068 


26969 


A 


13208 


48 


233 


GKQNHLKCLWCRLLFLIiSAVISRNVYIL 
T*LSLSLSQCVCVCVCVCVCVCVCVAIIi 
ENYPR 


13069 


26970 


A 


13209 


250 


2 


YCEGDLGSSRPGVSKCREFSPRRDKRIM 
PWPGAVTYAYNPSTLGGRDGWIA*A*EF 
ETS LGNI VTCPHLYTHTHTHTHTHTHT 


13070 


26971 


A 


13210 


615 


120 


SVFWPLLGGVSWSGYTEVRGPIiEKAVCP 
LSELEYCAGRSAALFRAIRLFKDALSLL 
KLCPQPPFPLGAPLSP* EALVCWPLHV 
STCLS*WCPTPCPPERSWCVLFPSMYPR 
VCPDGLLPPVPLRGPGVCWPLHVSMCL 
LSLFNSLLRLRTCGWFSVPVLVC 


13071 


26972 


A 


13211 


239 


429 


FILLRNKLQPGTVACGCNPSTLGSQGKW 
IA*AQGFETSLDNMVKPCFYLPQIiKKKK 
KIKNLGV 


13072 


26973 


A 


13212 


954 


503 


vqllplcgrdlylfafnlptyrcwqhfl 
skprapilaggsdrwpcpccpfwlp*wc 
hvsahapqpfpcakpeaplpvgcpcspa 
psleaallilraglwlrapfcfsahiiii 
schsqccqfpcsekrsclggrdrhaasn 
plsavwppge 


13073 


26974 


A 


13213 


1844 


1147 


lhsqiystakkaslsmkgsrdktraass 
rpvpsvlgvppwstllqhpqnmwpgpaq 
qqgqpsgrqawctpgeapgaeaapq*qp 

HPEEDHSGGPQASAALALPPSPPSQRDV 
QGETGMQGRSAPRSASSSACCACRWSRL 
PCPQLLQRHPGLRLVSPAHSRPPGPAPS 
SSSGSGLVPGYLPQKGLAGLSAGAVLCP 
PGLLRVAHGAGYGPSAQMLHARLSSSSL 
RGKRFLRF 


13074 


26975 


A 


13214 


413 


60 


LCSKSCHGSFAMDCETLLTFPRLFYPIL 
HMPTVPGQALPVHTPFL*LKLPLLPGIP 
SCHSPI*FLD*AQESPPPGGIiPGHPPGSG 
VLCLWFHSLKHPVLSGSSVSTPGSASPS 
RSRAL 


13075 


26976 


A 


13215 


67 


196 


GLSLSSPKVTHNATLMGCI FQN* KAFVY 
HSMKQKKIILLFNMA*P*YPLDFGEQWP 
LHGSHAYSTIL * LDLFCKKEEE * DE I PY 
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OCysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
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K=Lysine, L= Leucine, M=Methionine, 
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T=Threonine. V= Valine. 
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X~Unk.nown. *=Ston cod on 7=nossihfo 
nucleotide deletion, \=possib le 
nucleotide insertion 














V* CFMLLWKSTTM * KKI ILLFNMA 


13076 


26977 


A 


13216 


161 


2 


SVFFLFIDRRLYLIKSVWAAYNSSTLGG 
RGRRIT * AQBFKTSIjGNKARPRVYQ | 


13077 


26978 


A 


13217 


57 


184 


CLTVCKDKVSLCYPGWSPW*SYLTAAS 
NSWSKAILPPQPPE 


13078 


26979 


A 


13218 


292 


400 


IiKVKQGRGHLDLNQGPLDL* SNSLPLSH 
TPSHRRTV 


13079 


26980 


A 


13219 


186 


394 


YLVLSFLFYFLFIFLSRVSLPIFFILEI 
*LGIYAYTCNPSTLGSQGGRIA*AQBFD 
TSLGNIVRPPSLI 


13080 


26981 


A 


13220 




AAQ 
ttO 


t\. ± ivu* v vun v hh jtro j. r\j\j " V3Vii\.x X vjy 
KFETSLDNRARSCLYK 


13081 


26982 


A 


13221 


3 


13 


LRPIiRSFVSVERLWASCCLGSEIiWDKT 
KRKKRRELSEKHKQEIKDAFELFDTDKD 
EVIDYHELKVAMIJ^FDVKKADVLNIL 
KDYDR*VIGKISFGHSIEIVTDFQL*R 


13082 


26983 


A 


13222 


1 


715 


VDMHSRGSPELTHASTHASGKMAAPWAS 
LRLVAPMWNGRIRGIHRLGAAVAPEGNQ 
KKKRTILQFLTNYFYDVEALRDYTJjQRE 

MV V \7TTI7 VTSTD CV'I'lJT TP ^nUP TDW' npn T?TPT 

r J I x\ v rlEj J>JN Ko 1 l WJjJSlSJUilvjrir ItjALiAr r X 
LKQGGAVKFRDKEWIRPDKYGHFSQEFW 
NFCEVPVEAVDAGDCDINYEGLDNLLRL 
ICRT.OQT j^T/^ppnn/nnupT.cpT vdt am? 

VAGAFRWAGCPRI FRTGAKHGFHHLQ* D 
LSSGWATCPGTSPT 


13083 


26984 


A 


13223 


1019 


1238 


LTGAKYKKSDGLTESVTANIiEPEAPKVF 
SRIiDDEAPVTVLPARLPLPAP *HTARPD 
PPORRRSMOTJUPAPPFTj 


13084 


26985 


A 


13224 


1534 


1181 


RWNSEPDYSAGGPRAGRGARLLPVPKSP 
GGGGAVAPGPQTPLRAERTAVRLSISS* 
NSSTAPADPGAAPPLHPQPAEAGTWVSA 

FTjRVT FPFP^T/iFVtT'WFKK'T A PVDPPDT 
* j-ixvw -l *. x-c c J i-frj r v^i^rviZiXvlvX rxT v c c c If X 

DEKCL 


13085 


26986 


A 


13225 


328 


22 


CLESPRDRPFQGPAFPKHPPPPLAFLSQ 

SPPAPSPPPPPFLYFSIiTAEGGEV*KET 
LDSDRAPPSSREAPICQV 


13086 


26987 


A 


13226 


180 


5 


NSIDYKINVKNSNRPGAVAHTCNPSTLG 
SOGRRIT* AKE FETmiGNT VT.T* T YTT T Y 
YI 


13087 


26988 


A 


13227 


237 


528 


VGLCQDPLLLTLTDS FCSVLWGGSHLAF 
HKNLYVVHIDLVMYFSRIS FYFTLS AAT 

********* w V AA^X/XJ Vl)4i> U1VX4J4 A -* IXJuiUll 

CLSLYLGSHILSPFL*LFYCAYTTLYLC 
LITTLFFIilPKV 


13088 . 


26989 


A 


13228 


140 


366 


NATCLWHPMS PYAPTY*C*LS PGIit^RWV 
IiAITPIVLMILISSYNHVS IS YGGTFRS 
NFFLDQI ILTFLI YELHIV 


13089 


26990 


A 


13229 


257 


25 


LPAHSLGRDLSAQPYSMPRPGGELRAEG 
QSSLCSVARSSL * LHVCVCVCVCVCART 
C^TIWFAC^^TKVCLHPLPE 


13090 


26991 


A 


13230 


561 


52 


WPSCSSSGSPMLLPPVPPGSLG* *SFSP 

Xj rtt/ifuui Ir Air wUiFKirHrXjKl^UtjPAPx P 

AAAPAPAPPAAAS LSAARAAAGSPPGSR 
PSARGARRRPSGPCAPPEPRLGPRAPSS 
APRTRTPS *GRARSGGSAGNAPSARRTP 
QGPPRAACSLARSWI5fPAGRGGSCGRAL 
GA 


13091 


26992 


A 


13231 


335 


103 


CQKYSTS YCNKWLATLVI ICSRKI * DGL 
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E=Glutaraic Acid, F=Phenyla!anine, 
OGIycine, H=Histidine, I=lsoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P=ProIine, 
Q=GIutamine, R==Arginine, S=Serine, 
T=Threonine, V=Valine, 
>V=Tryptophan, Y=Tyrosine, 
X-Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














KGQAWWLVPVI PMLWKAEAVGLPEARSL 
SS PCNMARAPALKKIFLIGQV 


13092 


26993 


A 


13232 


452 


712 


VGLEGED* PPFSPAQSQSLSCWPDCSCP 
PSWPPDQRPGERPQPPDPPESHGAEEEG 
LPERPSDA*ARPQPFSLHPGVFPPQNAS 
TQ 


13093 


26994 


A 


13233 


480 


2 


SCFSEDWNPLKFKLQIiSTSLSYRKSHTL 
HCAHHFRGQCHSD*AFLVFSLTCILQVG 
SLRFTAGDVSPSYPEKLRVL* EGRVADF 
HVMARKGQQAALAILLQNARTGyvWWLT 
PVIPAIiWKAKASGSVKPRSLRQTWQNGE 
TRLLDRGESASQSAGMTGM 


13094 


26995 


A 


13234 


692 


824 


QIMALPFPSPPVPPLVM*CKGGKIPPPP 
SKNPRTCVGNTVIIKA 


13095 


26996 


A 


13235 


657 


872 


PRGPRLDVLYACEPQLITCLELGTPLKV 
RFWLEWDHTYNPSTLGDQGGRTA*GQE 
FKTSVGNSKTLSLKKK 


13096 


26997 


A 


13236 


168 


365 


QESLCIIrQTCTCAHQNLK*KFGGKSRLG 
AVAHACNPSTLGGQGGRI I *GQEFETSM 
VNMVKPYLY 


13097 


26998 


A 


13237 


31 


338 


WYIVYICVCVCVCIYIYEIYMIYI*YI * 
YIYIKHWQ*H*HIiAFSTFTLIiC3»CHHC 
PSLELFHYPTLTYHTH* I ITPHCSS C* Y 
QPLFYFHLYVFDYSRYIiV 


13098 


26999 


A 


13238 


714 


1001 


SSGVS CLNRAGWITGVNHHIHSLVLLKR 
EK*TQNIITEVCLMFLLLPHIiAIEATVT 
WPGMVAHACNPSTLGSQGRRSV*AREFE 
F S LDN IARPCL 


13099 


27000 


A 


13239 


250 


406 


NLA*HGGVHMWSSASQVAGITGTRCHAQ 
LIFIFLVKMGFHHVGQDGLNLLTL 


13100 


27001 


A 


13240 


256 


193 


WINFISLPLSSIiT*TLSLCVCACVYVCV 
CVCLRPCVHIVGGNLKRYIVFARSVTLG 
CKFTHLHISLI INNDT 


13101 


27002 


A 


13241 


1 


257 


GMDLWQMCHSLSLPLI FFFFFFSEKTRS 
NFVPQAGFQLRALRDPPA*ASREAGITG 
LTHHVRPGQIFKKIGTLKPTILLPLLPR 
K 


13102 


27003 


A 


13242 


319 


406 


KRGWT* WLMPVI PTLWEAKAGGS PDVRS 
L 


13103 


27004 


A 


13243 


228 


56 


KGAPPVLKPGYPKAAKNPTPFKPPGSKV 
GENPPLF*NPCPGGGNPFPTFFFFFFFF 
F 


13104 


27005 


A 


13244 


1687 


612 


IJ^WU^CiCt'uAyiJlAliAKES™ 

*VREREREKEKEKERVRERKEEREKKKL 

TSSKGTGSTATFHV 


13105 


27006 


A 


13245 


220 


1 


TKDPS STMPPQPNPLLS FKS Q F I F JjX FIi 
FIKKYVGWMQWIiLPVIPAL*EAEAGGSL 
QPRSSRPAWATWRNP X r 


13106 


27007 


A 


13246 


204 


375 


AS W P PGLQ YELRI I KCLLHCWTRAMI FR 
ERERRERERERERERERERE * VHLKRKG 
A 


13107 


27008 


A 


13247 


158 


2 


IWVFKETFFFQNFFFFFGKENNFNGFFF 

PT7T?P*nPV<! T .TT ■HP YT? 
rrrrrrrf r ut\v Dinnnr jjlivjxiv 


13108 


27009 


A 


13248 


221 


3 


NPSQGFPLLKIF FFFLRWSIiALS PRLE * 
NSI * KH*KI * KIS WAWWRTPWPGAWEA 
EPGBSLEPGRQRLQNE 


13109 


27010 


A 


13249 


81 


341 


GELNDTIHVKYLG* YIKVL* RNRTNRVC 
VCVCVCVCVCVYACTQKDLF*GTGSCNC 
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C^Cysteine, D=Aspartic Acid, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine,M=Methionine, 
N=Asparagine, P=Proline, 
Q=ulutamine, K=Arginine, S=Senne, 
T=Threonine, V=Valine, 
w— tryptophan, Y— tyrosine, 
X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, Impossible 
nucleotide insertion 














GALVRPKSARQASRXiKTQKTWIRSLKA 
VC 


13110 


27011 


A 


13250 


299 


1 


ATARLRFLHGAWWYASWPPTQEAEARG 
FLDPRSSRME*AMILPRCTPASVTDKIP 
SLASQIHRPSRVACVTLLSSHAPTEAPS 
RWPPPSPLCPAGSRA 


13111 


27012 


A 


13251 


106 


146 


LCVLKQ VKQATWTKYS S SQ I FAILDCMT 
TLSSHPALKYCNYVLISNQS PNHLRL * C 
SGW ISAHCNLCLPG* SNSPASASRVAVC 
FKASQASYLD 


13112 


27013 


A 


13252 


217 


158 


LFEIFCIPFFFFFFNFFFETRSHSVSQA 
GVQWCKQGSLQP * I P INFFWGKEGVIDK 
LILISYGNAQGFKVAVTPVCTTALQPGR 
QSETLSQKRN 


13113 


27014 


A 


13253 


150 


56 


TILILLLLYETESQSVAQDGVQWCDLGS 
IQSPPRSSCHSPA*AGVNRLRQENGMSF 
EAEIVLS PDRTTALHPGLQIETLSHI 1 1 
IILILSTISFHQLLH 


1^114 


ooni < 
Z/UI J 


A 


13254 


174 


145 


PCLKMTTYDFKSALE IAFFCFTAILRYK 
liLI KVTHFKWLRVAAHDCNPNTLGSGGG 
RITWGHEFKTSIiANMA* PTTRS YSQRVG 
IMGITIPDEVWSRTQPSHIRSGPSKVYP 


13115 


27016 


A 


13255 


179 


273 


GQWLTPVI SAL*BVE VGGLLBTRSLKQA 
WAT 


13116 


27017 


A 


13256 


264 


382 


GLKIGKGCS WWFTP VI PTL* EVKARGLL 
EPRSLKSAWAK 


13117 


27018 


A 


13257 


1199 


1563 


YL WGP AVP PLD LE PGQAGATCG PVS HY 
KI FAKLWAVGS I LDLMPGPE * GS FLSVF 
TSLCFI YGVTYPFHSEQPHPAS PSASGL 
LLLKGRAAWLVSTLCIGSPAPMLTGIFP 
HKVYNQLFI 


1 J 1 10 


JLIKiiy 




1325o 


194 


242 


I L YVB TGS HYVAQAGLE LLGS GN P P AS T 
SEIAGIEA*ADEFIYVGVSWQDECIKSM 
KQVAHASIPAISBVEAGGLPBPRSSRPA 


11110 




7\ 
t\ 




322 


120 


GPTPVLDAFQKLCLS FTSGQQADG 1 1 PA 
FPIRKQGGTQWPKPVIPTLWEAKAGRFL 
* PRSFRPAWAT 


13120 


27021 


A 


13260 


237 


1 


TSKKI SLGWWGKQGI PGTWEG*AKRPFE 
PGKPRVQWTQVPALD FSLGGKARLCLKK 
KKKKQKQKTLKLCTHSRITYSRA 


13121 


27022 


A 


13261 


52 


318 


SAVGIHRCDDGSH* P*TPEHKQLSFLSL 
PSSWDYRGITRELFQRFPWIFLQLITAV 
X 5 o iso TVJj KNLELAA VRGSHVRVI MMAV 
PINPF 


13122 




& 


I jZOZ 




l 


iuuvJvN i> V V W W W TP VI PGS PG * AGEIiLEP 
WRQKVQLAQWPLAKWPQTKWQGCPL 

TV T /^XTV" 1 OTTO/"* T OX TfirtTfirirtTrtTyfiiii t\ Trtrr.T« ^- 

AIjU^KHQGI FVSQKKKNPKTKRKYWAL 
FCSLPSC 


13123 


27024 


A 


13263 


717 


877 


NSTKEMAHWPGWAHTCNPSTFGG *GGW 
IT*AHEFETSLPNMVKPHLYKKYKK 


13124 


27025 


A 


13264 


288 


131 


SLYIWHSKRLITITNNKISGVWWCLPW 
SSTWEVBAGGSLEPRR*RPAWATK 


13125 


27026 


A 


13265 


80 


287 


FMNGEAS * KT*Z TT^DAWWPJVP TVT>7Aan* 
NEAGEUjE PRS EWLVWATRALR I S SRGL 
RFRFRLRRFTSTR 


13126 


27027 


A 


13266 


3 


241 


VGLFLFFFETESCSVTQAGVQWCGLS*Ij 
* PPPPGIRDS PASASQVAGTTGTHHHTW 
LIFLYF* +SSGFHYVGLGRSSNS 
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Amino add sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Add, ^Phenylalanine, 
G=G!ycine, H=Histidine, l=lsofeucine, 
K=Lysine, L=Leudne, M=Metbionine, 
l^Asparagine, P^Proline, 
Q=Glutamine, R^Arginine, S=Senne, 
T^hreonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 


13127 


27028 


A 


13267 


103 


359 


ICVYVYVCI YVYEMRATSKDFIjNQVWFS 
FLRTVAVLFEEYLIVL*MWLSFHIHIYI 
HTHIHIFIYVYTHVLIYVYIHTYSYMCI 
HIYI 


13128 


27029 


A 


13268 


172 


12 


SARSSQAIQIQKSVTSGQVXPQW * QS * P 
TPFLPTLIALTTLLiLPISPFILI IL 


13129 


27030 


A 


13269 


2 


203 


RIENIRSYKNIiAAIiPMTAKKMGTIQMPH 
NFEWINKMWYIHTLEH*SDKKKKKKKVN 
RAEIPKLIIQK 


13130 : 


27031 


A 


13270 


236 


365 


KIYRQGAVAHTCNPNTLGGQDGWIT*GQ 
KFETRLANMVKPHLY 


13131 


27032 


A 


13271 


371 


15 


QNQINFAIiHiIINTLLAIiLLI I ITF*LP 
QltfGYIBKSTPYBCGFDPISPARVPFS I 
KFFLVAITFLLFDLEIAIiLLPLP*ALQT 
TOLPLIVMS SLLL 1 1 ILALSLAYE* LQK 
GLD*AE 


13132 


27033 


A 


13272 


2 


16 


PRVRTS SRS RAALELI FFFFLGPPNLPV 
YNGPLGRTKPGTGELDTGGSP ILCVGQG 
RHPYWKGGAKPLAPFGKGGGP *GTRPLA 
GPIVPPKAGLQSEMPGMAPFCGSFG*PT 
RPD 


13133 


27034 


A 


13273 


329 


290 


SSALVPRLECKGI FSAHCNLCLMGSSNS 
PTS ASRVAG ITGVAGQATDKTPQTLS *R 
SQPSHMDPVRWSP 


13134 


27035 


A 


13274 


175 


13 


APCDHRPCPPENNPL*L*FSITFPNPIK 
RPHP YLPLLTLFSDSAHLHPGE IEQR 


13135 


27036 


A 


13275 


355 


861 


PLTTTPAAPRAPCPPSRLSGQPLTGPTE 
GSRSRLSPNISBQGEPPLALTVGHPLST 
QPGPTVPSELEPIQGPRG*GDCPTPSQS 
A*GGVLS CTPESHTEFKPPPTGGGRRWA 
RLGLNGAT*GREEPLQTRLPAEYPGPGP 
IDPLQPPPI STASMATAFSDFUiLGRDP 
A 


13136 


27037 


A 


13276 


865 


667 


KFLCILLDFLFFIF*EMRSQHRLECSCA 
CHCAQLFLTF 


13137 


27038 


A 


13277 


103 


375 


WSRRLPWRRGU3YIELFQGLE IRHHFLF 
GPHYJjRRTQCQGPVX PSELDGQGWDYMS 
PGV*DQPGQHDETPSLQKI*KTSSAWWH 
APWPAT 


13138 


27039 


A 


13278 


177 


3 


GGRGG*NI *sqkfktslvnmvkpclygr 

V 


13139 


27040 


A 


13279 


379 


372 


SR*WVCMVAHACDPSILGGQGGRIT*AQ 
EFETSIiRNMVRPCLCLGNTNIYIYIHTH 
TYIFIQN 


13140 


27041 


A 


13280 


83 


1184 


PPAHAARAS PPSYTWLC YEVKI KRGRSN 
LLWDTGVFRGPVLPKRQSNHRQEVSSWE 
CRKHJS KMSGGGLSTVYFFHRRFQ ITWF 
VSWNPCLPCVVKVTKFLAEHPNVTLTIS 
AARLYYYRDRDWRWVMiRLHKAGARVKl 
MDYEGERCRGQGSMTGRNSLRDGWICNA 
XNYASIiHR 1 liKE X IjR V GS PSGLXVSIiliS 
PPAHPPEDSPGNESWLCFTMEVTKHHSA 
VFRKRGVFRNQVAPKSYLHPK*ELSSWE 
RRKHNT*HTNYEVTWYTSWSPCPECAGE 
VAE FLARHSNVNLT I FTARLC YFWDTDY 
QEGLCSLSQEGASVKIMGYKDFVSCWKN 
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Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=lso!eudne, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arg»nine, S=Serine, 
T=Threonine, V=Valine, 
vr— irypiopnan, i— lyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 1 














r v xSDLUsFl? KPWKibiJylriFKijIjKRRLRE 
ILQ 


13141 


27042 


A 


13282 


1558 


1824 


SFPyLASFPFCLGLPLETLPHPLGL*KD 
♦SDTHTHTHTOTHTHSHTHAHFPSFPDP 
iiiry S S P FSSG F I DE YKYPHLW P VMS VTC 
CRFCV 


13142 


27043 


A 


13283 


1 


519 


PHFFIjPQGFWGFFSPFPL*KSSSPLKAL 
iflggfspffpppkkrffski prgvffp 
PPKGKKFFFPPPG* I WPPQGFFLKGPPS 


13143 


27044 


A 


13284 


349 


159 


CPLEPKKNYPFCFFF*ETGSCYVA*AGL 
v jj VAiS w f FAS AS Q IAG IQGTSRHAQPC 
RIFLSKT 


13144 


27045 


A 


13285 


748 


869 


WLGWAHACNPSSLGGRGGQIV*PQKFE 

L oKljiNWAKlry Si» 


13145 


"27046 


A 


13286 


315 


410 


VQWFTPVNSALWESEAGGLLE IRS*RTT 
WAT 


13146 


27047 


A 


1"*7ft7 


IKK 
JJJ 


499 


ARCGGMHIjYSQI fgrlrwehclr*kvls 

QVWWHAPVFADIWEVEVGAIiLRARSLRL 
*CAMLI»PVNSHGPPTWAMQ 


13147 






1398ft 
1 jZoo 


ftftA 


701 


KS YFSSHLHFEGKKCVSS I LD + S ILVLM 
CSWLMNYTHTHTYIYIYTRTHI CVHVYV 
HNF 


13148 


97049 

X. I\rt7 


A 




J 17 


434 


KRGPPPPKKRGCFFSEKlfF*GPPKTPFF 
RLADAWADAW 


13149 




A 




OKI 


344 


GLFYNLQKIKI FYVKNLFFFFSNS ITEA 
GVQWPNLGSLQPPPSGSNDSPASE*RIV 
V 


13150 


27051 


A 


13291 


561 


745 


AWEPSLVGETNVNSFNQKYINWPGAVAH 
TYNAGTLGGQGGWI T*GQEFETTLANMV 
KPSPY 


13151 


27052 


A 






Z 


xixl/yui£Ul^WJU^Vl?X5SIKQKIjQIl^*NTD 
FIGNVFQSWAQQYTPWPPSWVAEVEGS 
JjAARSSRPIjCTI itplnshcslawatqq 
DPAGRVGRPRV 1 


13152 


27053 


A 


13293 


7R1 

Z.O 1 




JjJMN JCUN V W KJS KioKUvj V VAHACNP S I LGGR 
GGWIT*GEM 


13153 


27054 


A 


13294 


276 


441 


GLFPKFLIQKZNQNWPDAWHAYNPSTIi 


13154 


27055 


A 


13295 • 


258 


389 


NMVEKRLGQAHAYNLSTLRGQGERIT*A 
WEFETSLGNWRPCI 


13155 


27056 


A ! 


13296 


J -JO 




ocimat r a^ArLiir Utixyc r Ml r r rXJGVCW 
RRPPL*RKKXKKK10CKKKKKKKOiCKKE 
MFKRNIGGEERGGGG ! 


13156 


27057 


A 


11997 




1 
1 


liMUf^Af JfXAvliif J. F FAi?"FFF JjKKKKGGPG 
AVAYTCNLSTIjGGRGRWIT*GQEFETSL 
ANIAKPCSC 


13157 


27058 


A 


13298 


182 


436 


gvtilnvrhrhrkkvtlyrdlkkvrkcp 
klhgnlrkvfqaegiassktlkghvwwl 
mpvt pal* eakmegllearslrs awatq 


13158 




A 


1 j Ay 7 


J34 


440 


RHYWLNF*AFYHSSIiAPAPQGGGHWPPP 
GITPLNSLHVPLLNTSA*LASGV* LT* A 
HHRLVPNNRALilQALRITRILGLYSTL 
LPA*KNFEAPFTISDGVYGSAFF*ATGC 
HGLNVI IGSTFLTICFIRQLI FHCTSKQ 
*LWLEPPAWDW 


13159 


27060 | A 


13300 


359 


372 


KKKKICGGKKNPPNKKKVKPRGEKTPLK 
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GV*PFFCLEGFFSPHKFFFFFIFSPNIC 
SPKKS FFIjKFPPALFFFNIjPFLKNI PS P 
PFLKKASF*BIFFCGPPPFFFFFQNTLF 
FFPPFFFFFFFFFFFFFFLDFCFAVY 


13160 


27061 


A 


13301 


233 


406 


VTVEGMEEVLAGS SEGMGSRPTS * V* FL 
lCVTVT.GAVAHAHKPSTTjGGRGGPJr , F*AOE 
LKTSLCNKVRPRLYKINK 


13I61 


27062 


TV 


i lino 


GO 

yL 


*»zy 


R«?HFM<5Rr.TRMKFYT TKRNT FT TLNCFY 
LIKHQHFHYTKFLLVSVINKF * LATVAH 
AYNLNTLGGCK3GRSA*THEFKTSLGNTV 
R5CLYKNLKMSWVWWCTAIWEAEVRGLP 


13162 


27063 


A 


13303 


302 


140 


GGAVNSVQSQAWWHMPWPVPWEAETGL 
1VKPRSLST.* CPMIVHVNSHCTPAWAT 


13163 


27064 


A 


13304 


115 


312 


LRSPSFVLSPPOjKCXSGTIMVHCSFDLLG 
*RNP PALPS KVVETIGLHHHALLRIiLIF 
FF*KWWLFF 


13164 


27065 


A 


13305 


90 


246 


PGVVAHACNPSTIiGGRGGRIT*GQEFKS 


13165 


27066 


A 


13306 


19 


310 


DSSFSPIMCFGMSVNKTGLWIjGKKKKKK 
KKKKGGPP * KKPIjGGPNLPGGGKKKFFP 
*RGAiOCNPPGDFWKKTLFLGGEKI/3PPP 
PQK*NPFFGGGKIGPTPPPKIKPFGBKK 
KF 


1J loo 


z/Uo/ 


TV 




1 fy 


JO 


WVTjVYKKOS I LG* AOWVMS VI SALWSAE 
AGGSLVSTSIiRLA 


lJlO/ 


z7Uoo 


TV 


nine 


4zD 


OQA 
ZW 


TTKTJT YTi APPGYFWPPOP FFT J? PPPP PNV 
xxxaxi X liinrrui x ?vxt xr^xvx x xxwxr xtxt xtxtx'I v 

VTFFFFLFFFFFLFFFFFFLIQYAEGIG 
VMRGRGEGKMGR*W*KFFGRRVLNIRSI 
LLANFKHWPYW 


13168 


27069 


A 


13309 


214 


371 


IYF* *RWISQAWWlxMPVIPxaiWEAEAGG 

* T .P PM *?T iPGOYNETS PT. 
1 ir>iri T i>j| ixrVJ\^Xx^xa x oxrxj 


13169 


27070 


A 


13310 


975 


572 


AWGMWGVGKHS LVS 1*3 I EE CQASTALS 
LDKSGWWEEAARREDV1XEDRRCLLCHV 
PAGVRGSLKPELGSRKGKNGQQSGSKPS 
VPSl^PPpry3PGNPALSKGTPPN*AICF 
LCQTPADHSAKKQAPHTLIPIR 


13170 


27071 


A 


13311 


270 


10 


TPNKSLI*PO^FSNIKlxAl^l^TMA^ 
CMLNTIiGGOGARI TSGEEFTCTGLGNI DP 
IPIESCTIxCTlxAGlWSPDAWVDAIKRTG 
TSG 


13171 


27072 


A 


13312 


103 


290 


LTEWSGVIL*NCIYLLPQYTSTRVHAY 
JJlTYIxmiraTSGIxSSTSVGSTNHKSKI 
FGGKKG 


13172 


27073 


A 


13313 


136 


1 


SOTSSSLVTQAGGQWCExjGSL*PLPPKF 
KRFSCLSLPSSWDYRRL 




Z/U/4 




ITUxt 




I ©J 


RVSTiiLKKSCFF FFFFF FFFFFFFFFF F 
FFFFSSQKKKl^KN10a^*KKSCFFFFF 
FFFFFFFFFFFFFFFFF 


i 7.11 A 
1^1/4 


Z/U/J 


7A 




*t-Ol 


M7 
Jt / 


LIxSP * RPKGKPKGKKRKl^NIJCEKKIGEA 
RGGKERQPJO'PQRTCKQKKRRKYRRIGR 
RNYR 


I j 1 /> 


z/U/o 




mi £ 

135 lO 




iy& 


GN1HLYLPRTO*WVRVIPGSREIEVHAPL~ 

EPGSQRlxLVAEMTPMHSSLDNMNXSPFP 

QPPKTEKQTKNPPKPHT1A. 


13176 


27077 


A 


13317 


2 


209 


PARALDIiKGSPYMES CSDAQAGVQDS I Y 
GDHL * IiRAP AV * *GQTVFVASPSKVGGI 
TGASHHPGLFFLF 


13177 


27078 


A 


13318 


3 


178 


SFCFSVTiREIIGQSLIMKTLP*ITHTHT 
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HTHTHTHTHTQRE I ERERDRERGESQVL 
GC 


13178 


27079 


A 


13319 


40 


382 


DSVPSLLLKLLMETGPHHWQASTTTPS 
* FFILEGFHHVAHAGLKLPSSSRPPHLA 
SQSAGIYGCEPLRSARLIINYIGSRLWG 
IaRNICLSQDHEECLIiCFILQAISMYRFC 
LG 


13179 


27080 


A 


13320 


107 


i 


PPLIMANSGWAOWIjTPWPTTiWF aTranr; 
*SEVRSS 


13180 


27081 


A 


13321 


111 


382 


ASSOLLELAIOVFPGIWIPPTTJCnK'fiVP "" 
SGKTLVTS KI IAGLKMPKDSRSRPRTVA 
HTYNSSTLRGQDRRIT*AQEFKTSLGKI 
GRPCLF 


13181 


27082 


A 


13322 


49 


10ft 


FPASASS VAGTTGMCHHTQLHY* F* V 


13182 


27083 


A 


13323 


1 


257 


ARGERERERERERERERERERERERERE 
REREPJ2PJ2RJ3RJ2RF.RERFPRPFPFPFPF 

RERARERE*ECV*EGETVSDRERDFERA 
PPRVLGAVAIYIEWCLASLIQ 


13183 


27084 


A 


13324 


1 


248 


EFQYILCSALIFGRSKILK*ATKEVKKS 
KHIPRISCX2DTRKYNWSAKAKPJRNTTGT 
GRMRHLKIVYRRFPHGr.WFRNT.FT.KTT 


13184 


27085 


A 


13326 


375 


1 


PPKRRGKTHCSPPKVFPPPQNN* TTPPP 
QLIICYL*RGGVFFSLPLII*APPAFFF 
FFQFFFFFFFFFFFFFFLQGQYWQFRDT 
DAEX3HLASSRPSGAVFPSQGMHTTGSTR 
RWGOCVPFPPTRP 


13185 


27086 


A 

- 


13327 


382 


15 


NWPKNCPIiHFQNWFljRKTPQI FFCLKNF " 
FFFFPKKVFYPPKKPFLEKPWPPVLN* K 
KPPP P FFFGWGPGGPfWPP FVir a ddt.ft 

RGKERFPLWGKGDFFQIPCPPGPLKKKK 
RAAARDLGTS 


13186 


27087 


A 


13328 


2 


220 


GRVGSSRARAVALFFFFFFFFCFFFFFF 
QNPPLKRGEBCKKNTPPLKKKRPLRGGFK 
KQKEC*EKKKKLSGPK 


13187 


27088 


A 


13329 


427 


289 


I QLWSSL * K*LHTHTHlTOin , HTNTCFL 
TVMKBLSTHPGNK 


13188 


27089 


A 


13330 


203 


269 


INi?GPPREPGGFN'OTCPr>FP<;apr3FFDWM 
PPQGPKP * KKXKRKKPFKVWAPGGGSKV 
QNPGLRETGVFG 


13189 


27090 


A 


13331 


216 


418 


D VPVH YCRL YKP CTLRLHS I Y KKQ I LL W 
PGAVAHACNPTTLGGRGLWIT*GQEFEA 
SLANWKPHLY 


13190 


27091 


a ; 


13332 


160 


325 


WERQLFKIAQSGLARWLTPVI PALWEAE 
EGGLFEYTSLR*LWATQQDPISTKMFK 


13191 


27092 


A 


13333 


339 


62 


GMLPLFVPPQKRGS PP YPCYGVYNS PPlf " 
KKQRFFSSLGIVLPPIVFITPPPPAFFF 
FFFCFFFFFFFFFFFFFLT *RRT.MPRRM 
FSCFLHCP 


13192 


27093 


A 


13334 


45 


384 


DPSVRINTLLALLLIRITF*LPQLNGYI 
EKSTP YECGFDP I S PARVP FS IKFFLGA 
ITFIiLFDLEIALLLPLP*ALQTTWLPLI 
GMASLLLIIILALSLAYE*LQKGLH*AE 


13193 


27094 


A 


13335 


270 


660 


AGSRRPLRVPGFSLSMMSPSLCRPVCVA 
QCIRVSFLLTVGAVLRPGFQCLDWPCLV 
YARLSGWTGFPPCRYGKGCCRYDEGCCR 
FGEGCCRCDDRCCRCGEGCCRCDDGCCR 
YDEG* CRCDDGCCH YGE 
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13194 


27095 


A 


13336 


298 


349 


KTT P FYGLRGSKIGTL* CSGVI XAH*SL 
KLLGS SDPLSLVAQAARTTGACHHDQLT 
NAFFFFPLERVPTPLKNNPFLWVKGVKW 
WDLD 


13195 


27096 


A 


13337 


865 


518 


KSATMRMVEDTHKMPAWGG YSGS LQQER 
GELSSLPCP* PES PPPliLI FLKLPSNPL 
PCS PLPCTQGPGQPAPYVS IHTSLKFPN 
AKLSVCLSRAIEFS 


13196 


27097 


A 


13338 


154 


354 


FNLQVKPDGYFVLFICHILPTDSKSGKA 
PGAPCHSRNFHIRKTLGWVQV7LTPVILA 
LWET * FSHQENSGLGAVAHTCWP STLGN 
QGGWIA 


13197 


27098 


A 


13339 


262 


11 


QFHFSRKLTP * KM * LSAHS SNTHEAYVA 
FPKAHKPGLNSKTAF*I/3TEAHAC3JIPNI 
LGGQGGQITRGQAFKTSLANMAKPRLY 


13198 


27099 


A 


13340 


358 


164 


TKKGKPRFFLKNSKISRGWWWAPVISGP 
WEGEGGAFV*TGRPKIWLTEVGPLPFNL 
GKKKENPF 


13199 


27100 


A 


13341 


147 


2 


FRPQKVSVRVGVLAHACNPTILGAQGGH 
II*GQEF*ASMANMVKPSSC 


13200 


27101 


A 


13342 


367 


17 


TSRSWNLFHVLVRFPTADKDIREIGSFT 
KKRSLIDLTIPYAWASETLMMEGKKEQV 
TSS INGSRQKVRSQSGELLSLTPSDFSW 
AHWLMPVFPALW*SETGSIiFEVRPSRPA 
WPTW 


13201 


27102 


A 


13343 


166 


492 


EGTQETIiCGCI I CLVRGDALNLFHLKCS 
WVGWRGAICCMELRETAKQKLSVWKLYP 
FEIVFS FSNNNPRPGAVAHI CNPNTLGG 
*GGRIARAQEFETNLCNIVRPHLFR 


13202 


27103 


A 


13344 


804 


1124 


TFFFCFFL*DRVWDVAPGWESSDMIMGP 
LQALTS WGSTDP PTLGLQSGLGDLQDTW 
PPYPTSFYYFFLQGRGLTMLPRLVTiNSC 
TQAILPPQPGQRSKTPISLKTN 


13203 


27104 


A 


13345 


280 


448 


GGGGKKKKTPKPPLiEKNNFS P PP * FFPP 
KKQIKPPPPF*GGGGKKKKTPKPPLEKN 
NFSPPPFKPRKGI FFLFPPLSWVKKKGD 
PPGGTRPIoAPL 


13204 


27105 


A 


13346 


232 


400 


AELLRYCSCGHFL*SMARYKTKPGVRWL 
TPVI PVFWEAEAR* IiHEPRS SRPASATQ 


13205 


27106 


A 


13347 


138 


332 


RAVVKPSFEIFYFFF*RAHTF**SCVCL 


13206 


27107 


A 


13348 


314 


395 


WLGIVAHAYNPSTLGT*GGWIT*GPEV 


13207 


27108 


A 


13349 


388 


371 


IMNGWQDKELVSRVIQTGIKK* KPRNRP 
EFRAPP * WHERNGRKERERMEGRKERER 
KKGKGKERKRKKBACIYKVKIEISNNLA 
NLI INVER*DRRRPDILSWVSLSSRVSS 
VFIALVTCLTSPQFFLSLHLNQILFPLG 
KNTj 


13208 


27109 


A 


13350 


412 


40 


LVFRFWMCLFSRELFGCFSHLLTKSM3Q 
M*WAFGDIMYTFDLLFIKKTKNNCKLW 
QGCKEEGSLIHCWRECKLVQPIi*RPI*R 
SLKKLQMGLPYDPAISLLETYPKERKSV 
I * RN J. CTSGR VG 


13209 


27110 


A 


13351 


372 


148 


FFFFFFFFFFFFFFFFFFATGSCRVA*P 
GVKWL I SGTVPLL I STGVLTCS I SDLGQ 
PVHTSLGNIjWPYSQEIiPY 


13210 


27111 


A 


13352 


209 


206 


KKKKKNF FS FHGKNPGNGGP FGG P P P P P 
F*TFFKKGGGLPKGPPPKGFFWNPPQNG 
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GGGP*PGGEKKTPKPSLGVLFFFFFKFF 
KKRPTAFFFLKKKIFF 


13211 


27112 


A 


13353 


43 


334 


KKKKKKKKKKKKKKKKKRGGGPFIKISG 
GAQFFRGEBCKFFFFFLGGGKKTPRGFFE 
KKPFFWGGNFWPPPPQKNFPFGEKKKFL 
GGGGGKNRVFFWGEKIFSIiGFPLKKFF* 
KKPEGKNFFPPKKNPVFSPPPP*KFFFF 
PKGE I FLGGGGPKI PPPKKRFFFKKPPG 
GFFS PP * KKKKKFFLPPEKLGPPRNFYK 
RPPPPF^FFFFFFFFFFFFFFF 


inn 
Villi 


27113 


A 


13354 


264 


496 


KPTILRVTHFLEGCE KYEFLLPVAFSNF 
SEPLKRYYYWLGAEAHVCNPSTLEGRGG 
QIT*GQEFETSLVNMVKTCLY 


i ioi •a 
I5l\5 


27114 


A 


13355 


183 


9 


I RS PKI PEIRVFFFSRWS YALVAQVGVQ 
WCjJN LiS * MHPPRARFQPLiSS tiS VPST* KY 
R 


13214 


27115 . 


A 


13356 


208 


378 


EICIiE FKI IDEMDKCX3KLWLGTGAHT* N 
PITLGGQGGQTT*GRAFETSLDIMEKRY 
L 


13215 


27116 


A 


13357 


256 


341 


KGQTQWLTPVIPTLWEA*AGGLLESRSS 


1 11 1 c 

152 lo 


27117 


A 


13358 


78 


344 


IVGLFESESKKGQHIVSGWIYLRLLIYR 
FLFPLFSCLCKFFFFFFSPBTEFCFVPQ 
AGVQ*HNLG*LKPPPPQLKQFSCLTLPS 
SWNYR 


I32I7 


27118 


A 


13359 


259 


291 


AQ*LMPWPALYEAEAl3GSSLEPRSLIiS 
VWAT 


1 1ll o 


27119 


A 


13360 


448 


121 


RLFDiiGNKKTSS FVQNSENAKYEDSLQF 
YCRIYIYVYIYVYICIHICVYM*TYMCM 
YICI YMYTYTHVYTHVYMCIHTYMCI * A 
YICVYIYIF*KACLPLAISTLSSGH 




27120 


A 


13361 


125 


262 


FSFQASVEFTSKTVWSWVQWLTPIISTT 
* EAQAGGS LEARNS RPD 


13220 


27121 


A 


13362 


362 


46 


ARAKGPKKIGFSGKMGPP*GAPPPKMGK 
KI * ITPPPKFLFFFLGKTKI KNPPWGFW 
PFGFPKKKRGGGARR*KTPS PFREKPPP 
QKLKRFKTPFPPLFFFKNPRP 


13221 


27122 


A 


13363 


378 


48 


FKKAAREKVSTRKFRGFVCLFWDFFWRQ 
SHSVTQAGVQ*CDLNLHPPGSSDSPAAA 
SQVAWTTGTHHHTQLIFIFI FCRNKIS P 
SLLKKYKKNLPGVVAGACNPSHLGG 




llilD 


a 
*\ 




1 HA 

174 


396 


o Li! if 1 JjJS JvKEC YY PRSJjFTVAN FTvAKR 
♦QTLNCPSTDKWINKMCHIHTVEYYSDI 
KRNE I PMRATCRQTLKA 


13223 


27124 


A 


13365 


76 


280 


PLLPFKAWQWVGCHYW * KLD* INCYFR 

TV T T 7AT?QPVMVTVTVTD > rtJ T rU' r r* > TVT VT V 

I YKLTYSHLKVG j 


13224 


27125 


A 


13366 


133 


324 


YLL* ILVYCSHLI FSLLCLEGI ILWVFI 
IATLITLm?HSLLINIVPIAIIiDLAAGQ 

AAVGXjALLVSKKKKKGPPLKKPPLGAQ I 

S PANQRKKFPPKRKP IKTRRGTF 






TV 


IJJO/ 


/too 
488 


190 


Fb YiU^CFJjPQCVFv CTFIYLHADGFLLL 
I XFFENSVFILCHSCVfVFCFLLQWFLLM 

IFHFSLMLLFFCFMI 


13226 


27127 


A 


13363 


229 


372 


YYATKDFSRPRAVAHTCNPSTLGGRGRQ 
IT*GRQFETSLANWKPQLY 


13227 


27128 


A 


13369 


122 


2 


KEECVGSGTWWVMP VI PAL * EAEGRGSL 
E PRS LRPAWTKK 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenyIalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=rVoline, 
Q=Glutainine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=5top codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 


13228 


27129 


A 


13370 


43 


412 


RPTRPKRQNYGDSENISGCQGLGKKGMN 
GKSTEDFEGSETTIiYDAITVGTCHNKFI 
RSH IVYNTKSGP * HKLWTWGGHDVSV* V 
PQLRKTYHPGERCli * *GRLSMYRGRBYI 
GNLCI FCSLLL 


13229 


27130 


A 


13371 


136 


1 


KSPTWPGAMAHTCNLSTLEGQGGWIT * G 
QGFETSLANMVKPSDAW 


13230 


27131 


A 


13372 


228 


385 


GLLIHKLEFKHFWLGTVAHTCNPSTLGG 
QGRHIT*GQGFETSLANMAEPCLY 


13231 


27132 


A 


13373 


277 


429 


LETTTKAGLPTI I FTSSGQMS I WFFQNK 
PWDSKKLFNLLSISSP**SSKNY*PWQS 
TVftHTCNPSTLGGQGRWIT*GQEFEISL 
ANMVKPARVGRHVIRGLQVS 


13232 


27133 


A 


13374 


194 


3 


NQENYIYIYTHTHTYTHTYIYIIi*LLLQ 
WVYIAYFLCLSYPSLFAQGFTHTTYTHT 
HMHTLIL 


13233 


27134 


A 


13375 


377 


228 


DRVLLLIiPRLECRGIIMAHCRLPRLVSN 
S * APALS FQSAESTGVPNVPS 


13234 


27135 


A 


13376 


255 


465 


NAWKCPFI YKITFVI FLIHVS CKKFRNY 
RQREWKLPTVRPLS PS * AI I FPVTCTYT 
SRWPEATKDPQKK 


13235 


27136 


A 


13377 


336 


38 


VWWCTPWPATQEAEVGGSLESGRIiRIiQ 
*AVITLVNEHRESALASRRGPEBTSSVK 
PPLPT ILAHTCFSLPRTGQDITSRFLAQ 
RNTEENIiELQMEARA 


13236 


27137 


A 


13378 


376 


293 


FFFFFFFFYYFFFFK*KFLTKKKILSSQ 
YI 


13237 


27138 


A 


13379 


215 


16 


HLTWS FTTATEGSKTVTQHS VYRKAKLG 
LGAVTHTCNP STLGGLGRW I T* GQE YKG 
IPPHGLEDVQ 


13238 


27139 


A 


13380 


184 


64 


VDESLEGWMMDEWMSGWRGGCINRYMHA 
WMDG+GDGWIGG 


13239 


27140 


A 


13381 


233 


417 


LKPITKGRKPRGFFLPFKPKQKKYFWGF 
PQKTP 














13240 


27141 


A 


13382 


34 


360 


RWNTTNADHDLKDNILSPPQINLYIRQN 
SSRI*FOUI*QMDSKSPVKIPAGFVIiYI * 
I YRYTHTHTHTHTHTD S EL YMETHTRMA 
NTIVRKNNSLEHSYYLMLRMTIKLP 


13241 


27142 


A 


13383 


214 


71 


QTLNTDSGPGWIHACNPSNIiGGHGGKI 
I *GQGFETSLANMVKRCLY 


13242 


27143 


A 


13384 


352 


312 


DKQLTLHRTDSLYIFYTY*PPQSCEPIS 
YNOTPCLSQYLYLCPSIiHTHTHTHTHTH 
THTHTHTHTHTHVSVGRRSLFFSGPHTE 
APRSRVSVYT 














13243 


27144 


A 


13385 


393 


271 


IEGQIQYTSTIGNKFKDFYLILCKEGIM 
SREISPPSSCHLRQQ*RVRLRERDRERK 
RQRERQRERE *GRS VLHPHVT 


13244 


27145 


A 


13386 


180 


440 


PVEERTLCEDILCFPFSVLIjCIQFHIiLI 
QHACf KYPNPNSRFGSWPGAVAHICNPS 
TLGGGGRWIT*NQEFEARLSNMVKPRLY 
KNI 


13245 


27146 


A 


13387 


190 


47 


EGEKG VP S T I LKME TL LGTVAYP CHPS T 
LGGQGGRIAEAQEF*DHLE 


13246 


27147 


A 


13388 


188 


470 


ARPPCKGRDSSAEGPPGPPFPSWSSI/3C 
WTREPPGRGEPIQVAVRREESAQDWARP 
ELI I KEWWPGLVAHTCNPSTLGGRGGWI 
A*A*EFENSQ * 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ 10 
NO: of 
peptide 
sequence 


M 
eth 
od 


SEQ 10 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methioniae, 
N=Asparagine, P=Proline, 
v=ijriutaniine, K— Argmine, 5=oerine, 
"-^Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stopcodon,/=possib!e 
nucleotide deletion, \=possible 
nucleotide insertion 


13247 


27148 


A 


13389 


454 


367 


NKKKPPPRPP*PPPPPI^KTPFFKKKI>rp~ 
NFPRGGGNSPGSPS FGGGKGKDPPPGGF 
FPPPPYPRGEGGNPFFEGKKKKKKRASIiD 
V w Wb i>y V U rAXB KJs UN r AP V 1_A 1 V TLP P 
GHSS 


13248 


27149 


A 


13390 


34 


354 


HTLLALLLI I ITF* LPQLNGYIEBCSTP Y 
EOGFDPISPARVPFSIKFFLVAITFLIjF 
Ei Liii 1 iLblili P Li P * AljQ 1 1 NL PL I VMS S LL 
LIIILALSLAYE*LQKGLD*TEY 


13249 


27150 


A 


13391 


1089 


792 


TI^FGPQVGSPLDLKQPQKAMPRGPSIrG 
NPPPGCQQHSHGAGGP*VPPGPRQPVMA 
£>KN JjCJbPIiALPRTHP PS PGQTRDKPRTQ 
DRSCWRDPRNGPGV 


13250 


27151 


A 


13392 


299 


428 


WIRC_^GRGRSC^PVPHSAPALLSPLVVD 
GTRRREAGGGTCQGGSGCM_lRHWGIiGGG 
EGSGCRSQALPREAAEARREFKHGGIPR 
H* LTESLVS WEFRHA 


13251 


27152 


A 


13393 


382 


413 


HGKTHLYKKFKNKKFKATMPA*W1_TPVI 

PTI_QI_Ai_AIsASLKPKSSR^ 

LQKI 




z/IDJ 


2_ 


13394 


64 


401 


GGIPKEDSQAGAFTGIGERQQGLVIPLL 
KNGQLSTDACVPPWGRVGVERAGPPNQP 
AGEGGGOGQECSMIiLPElANRCHPCNPS 
TLGGRSGQIA*GQEFEISLGNTVRSCPY 


13253 


27154 


A 


13395 


146 


18 


FSPGVVAHACNPSTLGGQGGQIT*GQAF 
KTSLANTVKPKRPQ 


13254 


27155 


A 


13396 


1056 


679 


D IQVPERPL* NVPE PEAKGEP PDRAVGE 
HI DRDCRSDPAQQKRKI FTNKCERAGCR 
QREMMKLTCERCSRMFCIKHRHPLDHDC 
SGtsGHPTSRAGLAAI SRAQAVASTSTVP 
SPSQTMPSCTSPSR 


13255 


27156 


A 


13397 


440 


423 


IHSPPTQRRV*QRERERERERERERERH 
ALAERNRTREG I SGTTGERGNLMIiVGHH 
LPGPLSSRRI^FCPDGFGGQHCIiAQGDQ 
VRL 




-71D 1 




i lino 


2 


ICC 

256 


I ETLG S AVE F J. PYEI^YQTYLKKKNRNS 
HMQSQPNRKGHIRLKYIAAWA*WLTPVI 


13257 


27158 


A 


13399 


102 


922 


LSFFLFSETGSCSVAQPEVQWCNHSSLQ 
P * TPGGPSMQFQLPQECX3QRTAVWHGGR 
A 


13258 


27159 


A 


13400 


198 


55 


KDYSAWHTCNPSTIiGGGGGWIT*GQEL 
KTSQANMVKPHLYKNPKIEF 




Z/IOU 


TV 


1 1A(\\ 


in 


a An 

447 


bb AUNIjPS AGI IGVSHRTQP I FC I IiNAIi 

ALGVLILERSPLQSPQVPPSHSHTKPGS 

LPSVTPGRGPGRPRAAEPTAQGRRYNSN 

PVLISPGSVHPASFALPAEPP*TGAPST 

FKPUVLkPJLLIjGNAP-DLLCT 

YRT 


13260 


27161 


A 


13402 


93 


410 


DSNLNYSLFFHGEADIjGTNQVLTHPSTT 
AMYFEHYCQPP* XVHGTINT* PPWHKN 
P IHI ITPS PCLRASTAINLLLSHINCYS 
♦ATPHPLGYQQTYLPLTVHST 


13261 


27162 


A 


13403 


55 


387 


SNSHTYSLKKSAGITKFQNWHIVNCICI 
CQVVTOLEMVNRHTVILCQYPVKPRILY 
QHHTAILVTILTFTLRPGWSHACNPST 
LGGQGGRI I * AQEFKISLGNIVRPCL Y 


13262 


27163 


A 


13404 


39 i 


365 


SGDRRVRLLLKIITF*LSQLNGYIEKST 
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M 
etii 
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nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, HNHistidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q==GIutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possible 
nucleotide deletion, V=possible 
nucleotide insertion 














RYBCGFDPISPARVPFSIKFFLAAITFIi 

T* n 1 \Y C TATTY TiT Ti*-^* t hmt nT *r^^&«n « 

1^DIjEIAIjLiIjPLP*AIjQT^ 
LLLI I IIiALSLAYB*LQKGLD* AE 


13263 


27164 


A 


13405 


32 


350 


HEFATSTSEMTKGTS * IVKRRIKTHTLC 
RRCGSKAYHLQKSTCGKCGYPAKRKRKY 
NWSATAKRRNTTGTGRMRHIjKIVYRRFR 
H(i r REGTTP KP KRAA VAAS S S S 


13264 


27165 


A 


13406 


326 


668 


LQFKIFCMPGIFTISEVSSMSN*RTAYG 
SQSFHKIVLI*LATFGQALSC*IL*IHL 
CTLKNKTKHRALVAHACNPNTLGSPRGR 
IA* GQEFETSLGS I VRLLLSTKNLKKKK 
NM 


13265 


27166 


A 


13407 


170 


472 


KLISVWSEESSYCVEHGEPWPQEKEYS 
WIiE IGKAE^RSTLESLLGATLEALSSN 
PTRKMQGLKLKLQRASAFKEEPWLGVVA 
HACNPS TLGG* GGRIA 


1 JfcUD 


Z/ 10/ 


A 




1*7© 


422 


^TVy*fcTT> XT' T3 ^ T TfT T./TtTTr >"Vf III |>tTt7^1T/T W/1 1 Jl \ T v *v 

FYNR z F * YTKTYQTENI YCKIMSFRLLV 
*KI * SLPGAMAHACNPSTLGGRGGRI I * 
GREFETRNS 




7716ft 
x / 10o 






2/4 


AAA 

444 


LFSQYI V b'VHYSSFVLILFV* *LFSQYI 
VFVHYSSFVLILFVILHNYTYIYIHTHT 
HTYTLIFLLIFSQISLGWMKLIL 


13268 


27169 


A 


13410 


205 


23 


KEFLKFHRKCICQQVKKIWPRAVAHACN 
PSTLGGQGKRIT*GQEIiETSLANVEKPH 
FSNS 


13269 


27170 


A 


13411 


25 


426 


SVWWNSIiETRSSRIiRLLKKI *NLQNKKR 
KFEHRHAQRKYDLETQGBDAIYKLRRER 
RKEEE*EKEEKKRRRRKEBCEKKKKKPTL 
LCLKKQRNQHLDLRLPASRTV1CKLISVV 
♦ATHSWPCYGSPSKLIQQSIH 


13270 


27171 


A 


13412 


227 


52 


EIGSHS CHPGWSTVAQS *LTLTSNS * TQ 
IiILPPQPPRELHYNCVPPCPAFCRQALA 
MF 


13271 


27172 


A 


13413 


345 


67 


SKCCYWAHDC I UaDL I CL YFLELVGCV 
DCFSSSLGHY*PLFV*IFFLSPSLIiFWY 
b Hi EH I CVPHFS KAPFI FLHFFPSPEFP 
FLCIEMVYV 


13272 


27173 


A 


13414 


345 


33 


KYFFFLI ICKMRMQNKQKNKHGICLLNS 
AMfl r FltKUU I M 1 rWWLEVTQ IWGV 
LFYLFHFI EMGS CPGWSAVA* S QPSATP 
vqwqovt r.T/DriDDT?rjT/2MQ 

v onoyv i.xjXjf\^c ifXUJLAaBto 


13273 


27174 


A 


13415 


187 


415 


FNQGWAFLFPFCVI FSYPWKDRNRTI FS 
QVRWLTPVT PALRRPRQADH 


13274 


27175 


A 


13416 


it j 




WIT * GQGGWIT 


13275 


27176 


A 


13417 


170 


17 


KQMREKEGNGLLTI SKVWKQPKS PSQS * 


13276 


27177 


A 


13418 


7 


163 


IK*IWYt *TMEYYSAMKRKKIMSSAAIW 
MKLEAI ILGEVMQEWKTKNLMFSFISGS 


13277 


7717ft 


A 






ivy 


b UKJs. 1 blAjN XflKb" (JLi YTKb' FKiHCLP W 
PATQEAEAGGSLKPRSLRLQ*ARSHHCS 
PVWVTTRP 


13278 


27179 


A 


13420 


236 


4 


VTRAKTGSHNSLFPPHTFFFQGKNLCFF 
QKKKKKXIYIQSKPGAMAHACNPSTLGG 
QIT *DQEFETSLATMVKPNLY 


13279 


27180 


A 


13421 


46 


141 


NSVISAHCNLCLPGSSDSPSSASRVAGI 
TGWC*HCNLCLPGSSDSPSSASRVAGIT 
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nucleotide 
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ing to first 

amino a rid 
residue of 
peptide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A»Alanine 
OCysteine, D=Aspartic Acid, 
&=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Meth4onine, 
N=Asparagine, P=Proline, 

O^CwIll m inf R=Araininp ^Q»rinp 
T=Threonine V=Valinp 
W=Tryptophan, Y=Tyrosine, 
X=Ud known, *=Stop codon, /=possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














GWC 


13280 


27181 


A 


13422 


604 


1121 


LGSGDLP * E INPLSS CSLLREKDPPTTS 

U X U\i JrISJ\xli_i 1 JM ITJXO Air 1 1V1 XlUAcllK 

VTLTVEAHLHPGEINSHVAHTKPVWWSL 
HTDAHE I WCRDSDLKLVPTLPLI PLEAA 

TjRNTTTTQT.QTDPDTTWRDDO'MTCI'PT ETTTT 
JJiviNX lnotioif tr lr AT* r KKJrii lo X LiC V*.V JL 

FLINIRRQEyQASEPKPSHRI PCDLHVY 
AQMA 


13281 


27182 


A 


13423 


214 


22 


IRESSSTPSSPTLPESCNSIPFPF*KFI 
NSTRAGLW 


13282 


27183 


A 


13424 


397 


330 


SEYNSECS * VQ WHTL WSATQEAEAEGS 

T.RATJ Q QPTjT »PATiT T PVMnifTD ZVTJA17DM 
XJla/AX\.0»J.rv 1 11 )V^riJj JL JL tr V J.N v-ilL. JL tr /\Wftr XT L I 


13283 


27184 


A 


13425 


729 


885 


AKCY P VRN I S L W PG AVAHACN P S TLGGQ 
GGRIT*GOEFRT<?T . AMMVTfPWT .V 

WJ\li VJ\^E»J7X\ A OJJLTllMl'J V Ivrtlll X 


13284 


27185 


A 


13426 


231 


2 


ASVFFKATLVDLHW* IY IYTYI YTYIHI 
YTHTYTHIYIYIHTYIYLNVNQYICLYV 
HTNANOOGYTYTCRTPTrPP 

XUlVUiyyUl XXX UK i. tr X ^»X7 t 


13285 


27186 


A 


13427 


184 


171 


l^*YIYIYTHxllTHTHTHTYILYIYTYE 
QPSSQSS 


13286 


27187 


A 


114.9R 


111 


loo 


HWLGTVAHACNPSTLGGC3GRWTI * VWEF 
KASLANLLTPLLQ 


13287 


27188 




1149Q 


1Q1 




MVQT jl\/riT« , T7I4Q'TT * T .T .VnT ITtJCfimratIT 

nnsLAjvijf r no i x LiUt\x/J.r WoljvVimi 
CNPSTL * GQDGR IT * GRKLETRPGNRAK 
TjYTjYKTOJSW 

U X XJ X XVXX-L 1 ! Oli 


13288 


27189 


A 


13430 


39 


254 


EFIPRAQDIxETSQGNRVRPRLYRKFKNW 
*VWWCASWPAMWEAEAGGLSEPGRIiRL 
HSNMDNGERS CLKKK 


13289 


27190 

X / A 71/ 


j± 




116 


AG ft 


FETSl^NMARPWLKKKIiARHSGA 


13290 


27191 


A 


13432 


154 


481 


FFFFFFFQRQKNGFF*KGIFFFFKLEGR 

UrlXX? V x INlJXJJXrVJ Ol\^xr>iO "Jj X\JoOi\J\. 

GGAPPP * LI FFFLOKKLPLLGQGGLKL 
RALGKP PPFPSORGGTNGVKTP RfiT. 


13291 


27192 


A 


13433 


388 


280 


KSKLKNGCCVALCLLGIiI * SVLLA* FKT 
*LFCVSLDMYIDTTCSIiSISIYLSIYLS 
I YLS I YLS IV*SVS 


13292 


27193 


A 


13434 


187 


48 


S ISSTKGPGAVAHAYNPNTWGGGGGR IT 
♦GQEFETSLANMVKPRNS 


13293 j 


27194 


A 


13435 


71 


309 


DSVSEEEEEEEVELAHPLAAERYHCERA 
ET*VKALLWLELCDERVSSRSHTEEDCT 
E ELFDFlxHARDHCVDHKT iPSNTi 


13294 j 


27195 


A 


13436 


18 


375 


FJ>AVPGRPTRPINTLLGLLI,IIITF*LP 
QLSGYIEKSTPYECGFDPISPARVPFSI 
KFOxIxAMTFLLFDLExALLLPLP *ALQT 
TNLPLIVMASLLLI I ILALSLAYE*LQK 
GLD*AE 


13295 


x / i yv> 




1 1/117 


1*71 
J/J 


JLI 


THPW^WnT?YriQP* PT.TViaT CAT T MTOr 1 
x rur x x oxivfc* l^o ir fLi ivtHXio AJLtiirl X oVj 

IxAM*FHr^ITLLILGI^TNTLTIYQ-*W 
RDVTRP^TYnnpwTPPvnirrsT.pvr'TTT t? 

ITSEWFFAGFF*AFDHSSLAPTPQIjGG 
HWPPxXSITPxWxxEVPlxlOTSVxilxASGV 
SIT*AHDP 


13296 


27197 


A 


13438 


575 


699 


IiGTVAHTCNSSSLGGRGGWIT*GQEFET 
SLANMVKTCPQKK 


13297 


27198 


A 


13439 


247 


168 


IxENLIYTRVLEPJttREAKVHFPFSNlSYS 
DKRTDTFVLTKTH* HTHTHTHTHTHTHT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
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sequence 


M 

eth 
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NO: in 
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26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine j 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
Odycine, H=Histidine, I^Isoleucine, 
K=Lysine, L= Leu cine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopnan, Y— lyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 














HTRGSHFFY* HTHTHTHTHTHTHTHTRG 

otiuuv 
onrri 


13298 | 


27199 


A 


13440 


391 


168 


TKKKKHPSPRGAREVFF*QPPPPPFCVL 

*jc jfx. Vr r JLioyUoVjrtji^v^JSr l^Fr r r irctri. 
SPPSPPPRGVWKKTPPFS 


13299 


27200 


A 


13441 


403 


246 


LtC PxAJ. Hxdf ±\jr ir xr r* sf r w IN JrAJrlSr r rw 

GPKKKKKLSPPPAI KMVFFKGPPP 


13300 


27201 


A 


13442 


305 


414 


KNNFHPSILGG*GGWIT*GQALETTSLG 
NMVKPHIY 


13301 


27202 


A 


13443 


269 


409 


PGTVAHAZNJ-^TJjtlti'y'lTaXTyiiUlfit?'* JL 
SLVNTVIPHLYQKKKKKGR 


13302 


27203 


A 


13444 


295 


37 


DPSYIiAWHMVTPHQVSTQHTLPRPRPCL 
PEMNFLVSLKIWQCWPGTVAHAOTPSTL 
RGRGGWIT*GQEFETSIiANMTKTKIKGS 
AR 


13303 


27204 


A 


13445 


256 


388 


FCGFWPGAEAHACCPL. 1 LbbKWjWl 1 *(j 
QKFRTSIiAHMVKPRVC 


13304 


27205 


A 


13446 


277 


374 


SRLTATS S RGGWI T * GQEFVTTLANMVK 
PXjLY 


13305 


27206 ! 


A 


13447 


14 


387 


PQVIHSPWPPKVLGIjQA*TTSQAWWLTP 
VI PALWEAKANGSLEVRSS 


13306 


27207 


A 


13448 


185 


379 


HFGRPRPADHLRPGATNHPGPQGETPSL 
I**NTKKBCKRGRPFKKKKIiQSLVWQNBCIF 
FLAELKKLW 


13307 


27208 


A 


13449 


168 


346 


ISILRTNNMISIKINIiRLFIDELKKRDP 
FIFPYFFFFFFFFFFGGNHSIi* SJjIVIF 
LIC 


13308 


27209 


A 


13450 


57 


176 


VAES RPGPGGTATELVPPSTRLLTRAPR 
DLLTGKRKPPPJjGRMS RARVQWHNHF* t 
ESCSVAQARVQQRNLGSL*TLPPKIKQF 
SCLSLLSSWEDCIiSQRVRDCSEK 


13309 


27210 


A 


13451 


411 


241 


P PLFFFFFFKQKFPS VP *GGGQGWDFGS 
LQPPPPRVKQIFCPKIPPPWPPKKGGVP 
G 


13310 


27211 


A 


13452 


307 


298 


PTHSIVRNNKSL* IN*TSSMCSGIjKvMR 

r*sqkqedp*sgwahacnlstlggrgg 


13311 


27212 


A 


13454 


191 


108 


gwsqtpglgrssllslpqwwdyrmqcrg 
vilaphcgldllgsgdppasasqvagta 
afencatsfwlp* flntqcfkiyktkn 


13312 


27213 


A 


13455 


242 


400 


IATLLDYLQ I PNTGPGAVAHACNPSTLG 

/widh T T */3T\PT?PN , C T - IV WPXTT? G f^T . 


13313 


27214 


A 


13456 


1 


346 


yvttarcscwagteshmgakdfyrqekh 
swargagekhsi^*aqwlmpvvttfwev 

KKG 


13314 


27215 


A 


13457 


166 


289 


VRSSKLNTWVRWLVPI It.TLWEA*AGGF 

t oTJP T CP T ATWIiTO 


13315 


27216 


A 


13458 


220 


76 


TASLKLSKSWLDVVAHACNPSTLGGPGR 


13316 


27217 


A 


13459 


219 


46 


SPPLKKKNNFFPPGVMGAPPRFFLKGPP 
QNFFFFFFFFFFFFFFFFFLKQ* EQTFL 
LIY 


13317 


27218 


A 


13460 


126 


1 


WSLGAVARS CS PSTLGVQGG * IT*SQE 
FETSLANMVRTLTS 


13318 


27219 


A 


13461 


330 


420 


NICSWAQWLTPVTPAL*EAEAGGSPEVT 
SS 
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Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P^Proline, 
Q=Glutamine, R=Arginine, S=Serine, 
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X=Unknown, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 


1 11 1 o 


Z/ZZO 


A 

A 


13462 


577 


766 


AQWHTS I VXiAA* EAAAAGLLE PRS * S RL 
KCTMI I P INSHCTPAWQHMRPCLLKKNF 


13320 


27221 


A 


13463 


206 


387 


VTVGETKAEAGRGHGLLTCCVLKTQKWL 
GTVAHAYNPSTLGG*GGRIT*DQEFETS 
FLGS 


13321 


27222 


A 


13464 


167 


41 


DFWLGAVTHACNPNSVGG*GGQIT*GQE 
FETSLANMVKPWNS 


13322 


27223 


A 


13465 


480 


79 


SRVISPLLILNSVISAKSLLPVGSQGKS 
KGQVWSTQEERLEPGVMGEFTPGPCPCL 
VSQGPSLCPSPPPYSFRWSRLSQAPGSI 
RVLI * RPAAATSAP P PAQLGLPDTAFPP 
PLLSRDGQWAAGRQGDRDKGLP 


13323 

• 


27224 


A 


13466 


3 


373 


DAWVAHASAHASVCLIIDTLLALLLIII 
TF*IiPQLNGYIEESTPXEOGLEPIS PAR 
VPLYTKFFLVAITFLLLDIiERALLLPLP 
*AIiLTTNLPLIVMSSLLLIIILALSLGY 
E * LRKGLD * AE 


13324 


27225 


A 


13467 


214 


55 


TRQKI FNIiFS S * KKFLI PPARAKDFLFF 
IFFFFFFFFFFFFFFFSCSRILLHR 


13325 


27226 


A 


13468 


111 


358 


VMKVFYI* IGELVTQAYIFVKSHQIAHI 
LFHSMYVLP*IKMFWPGTVAYTYNPSTL 
GGHSRRIS * AQEFETS LGNTVS PHL Y 


13326 


27227 


A 


13469 


199 


607 


RIRGTSQKT.T.TiTiKKLISLIPKSQPTRGT 
P*TTFPPPNTTNFPPPPVAATAPDPSPA 
HFVSSPYNPDLGSP*PECPSPGRLQREI 
EQCKKDIQNFPFPTTSRICSNDLSIiKGS 
ASRRRGHIiFCERPVNQFRSPKPKK 


13327 


27228 


A 


13470 


128 


256 


EDWMWWLTPVI PTPQEVE VRGSLEPRSS 
ELQ*AVIVPCTPSE 


13328 


27229 


A 


13471 


94 


388 


KGBGKTEE LWS WRSHRGTHLCLLAD I S A 
LPLHYYYYYYYFWQRWSLPDQPGQHGET 
PCLPKIQI INRAYWHVPWPTDQEADAR 
* LPEPNRQRSQRPE 


13329 


27230 


A 


13472 | 


359 


73 


ATEPGQLFYF* ESESHNVTQAGVQWLDH 
GSLQPPTPGLKPSFPLIHTGITAASHYT 
WLnQLLPMNVSLLTFYESPQLKIQRSHI 
LCNSIHIKFLB 


13330 


27231 


A 


13473 


332 


10 


ILSKRGFPKPGRYCEVNPFLCVQGAK*V 
RP * EDTSLHCE I YHTHTHTHTHTHTHTN 

LKP*TSKELIPQKTESLFFSLCIKKSLL 

o iwYPtpnnxr* oivxTG'Tr/"»/"»n r*j?*r o ir 
KJf y 1 r FDAtji> 1 ajIMo VVaV-AUJf JL oK 


13331 


27232 


A 


13474 


355 


385 


IMVCIKQSCVH*KNN*FWPGTVARVCOT 
YKKFF 


13332 


27233 


A 


13475 


297 


425 


RKSAWWIiMPWSPL*EAKAGGFLEPGSL 
RSAWATWGDPHLYKK 


i im 


27234 


A 


13476 


160 


i 


KSTGRLGMVAHICI PSTLGSQGGWIA*A 
QEFETSLGKILSLPE IQKSASLITW 


13434 


27235 


A 


13477 


334 


79 


GMKEQINFLYQKRGI KPNKLQAENWLI 
HYN* KLLSNS FK*LQDTASI * YFFGNYL 
KRPGMVAHACNPSTLGGRGGWIT*GQEF 
L 


13335 


27236 


A 


13478 


234 


385 


YAKFHVS INEKKPDVAHACNPGTLGGQG 
KWIT*GQEFKTSLANMTKPRIY 


13336 


27237 


A 


13479 


408 


129 


KRSINRKKGGELSSSHFLLLLLPPLDEE 
PSPPPFCPPLPPFPPPCPRLPQSG*GAC 
GRLHPC * MDLGPCKCPARKVFSHLSCSL 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
R=Lysine, L=Leucine, M=Methionine, 
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VLQVCRSP 


13337 


27238 


A 


13480 


251 


384 


TITPWRGAVAHVCNPSTLGGQSRQMT*G 
QEFKTSLSNMVKPRLY 


13338 


27239 


A 


13481 


164 


300 


AGDDS DNSMGLLGRIHE I KFFKCLGPCM 
TTKFIVPALWEAEVGGWLEPTSLRLR*A 
LIALLQCTPA* GTVYDYKVYRPSTLGSR 
GRRMA 


13339 


27240 


A 


13482 


2 


30 


TGCHQFRFLNNPFSPHPVLQLVSPGPDC 
LIFRYFKF*LHQLM* INPNANFCGARSF 
LCKFRVWIS*LRYPSLVALSVLFVFK* S 


13340 


27241 


A 


13483 


39 


199 


IiDVGTCSPIVPASQEAEAGRLHEPTSLR 
LW*AMIVQLYYSLGDRARPCLYIiKK 


13341 


27242 


A 


13484 


18 


72 


TRTRSYPGSTGCQLHTFGLVSLYNCVSQ 
FLITNLYISIYRDIY*YIYIYLYREIYL 
YIYIEI YTYIBRYVYIHI * ILGLQVANC 
IPLD 


13342 


27243 


A 


13485 


266 


496 


I^SASDITQAGVQWCMftSFL*PQTPGLK 
*ASTSAS*VAEITGLSQHVWVKIIMSVS 
LSSYTFIQLPSILLYGNTAP 


13343 


27244 


A 


13486 


286 


72 


HILSHCTVLWFLTPHHGTTHYKNTTGQP 
SrVAHTHNLSTIiGG*GRWTT*GQEFKTS 
LANTVKCSDAWVDRD 


13344 


27245 


A 


13487 


309 


6 


RFFFFFPPGKKGVFFQRFFFGFPRGFPP 
PRFFKTGPEIFFFGPLKKKKNFPPPGGK 
IVFF*RGAPPFFFFFFFFLFFFFFFFLK 
KNYLLLEKVQS PEKEG 


13345 


27246 


A 


13488 


183 


459 


RRVKKICKCIIWQNNYL*KGINICSMPQ 
KKKKKKKKKKKKKGAPFKKKKKKKIS *G 
GGGPFF*KTKIKPPGGRGFFFFFGEKIM 
LFPAVFI 


13346 


27247 


A 


13489 


348 


1 


CEGKDGLPIQFASWFKYAGFHSIiESIFQ 
SFCQTKKLEAPKELSPCSQLYRYNWQLT 
CRRMKLDPHLLPYTKINSRPGAAAHA*N 
PSTLGG*GGWI I *GQEFETSLTNMGETP 
PTRP 


13347 


27248 


A 


13490 


313 


320 


KQGGPPLAQARVQWYNLSSGETSLPLPG 
SSNPP PS ATGVAGTTGS PPLA* LIFFFL 
* DRENGDQRKL 


13348 


27249 


A 


13491 


187 


350 


GNPVKK* IKNTTSSSGWVWWLMPI IPAL 
WDAEVGGLLEPRS LRPAWATERDS VS 


13349 


27250 


A 


13492 


211 


83 


E YAKKNGKLLS WRKYL* SVCVCVCVTTC 
VCVCVCVTTCIKNI 


13350 


27251 


A 


13493 


286 


407 


LGVEAYACNLNTLGG * GGR IT* GQE FET 
GLGNTARPCLYF 


13351 


27252 


A 


13494 


897 


1033 


KFIFKQMQWDDHSSLQP*TPEIiKQSSCL 
RLPSTWDYRREPLHLAM 


13352 


27253 


A 


13495 


265 


3 


HLILLEEDLTWQRGKWIiKGRVSLCSLGS 
♦TQSLTVLKKGRGIiGVVAVAPSTLGGRG 
RWVALAQELETSLGNMVKLSLSKIQKIiA 
GMV 


13353 


27254 


A 


13496 


319 


1 


CFKLWDTCAQCAGLLHRYTCVMWCYTH 
HPEFCTYNliaGLPKiMWSYbWrKCCHAVY 
HFRYIKNLLPGCVAHACNPSTLGGRGGW 
IT*GKEFKRRMKTKV* IGIYFI 


13354 


27255 


A 


13497 


265 


11 


VU3EVSNQHLLGAPSNRTPSEVPLAQAA 
QGRPSLVPPSAIVFPSYFTICYAFLSIS 
MPIIP*VWEAEVGELLEPRSLRQA*ATW 



1355 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 
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Amino acid sequence (A=Alanine 
C=Cysteine, D= As par tic Acid, 
E=Glutaraic Acid, F=Phenylalanine, 
G=Glycine, H~Htstidine, I=Isoleucine, S 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
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X=Un known, *=Stop codon , /=possib!e 
nucleotide deletion, ^possible 
nucleotide insertion 


13355 


27256 


A 


13498 


164 


22 


GIKRKLIKSGWAHARNSSSLAGRGGWL 
T*GQEFETSLAKMMEIPSV 


13356 


27257 


A 


13499 


277 


363 


TYLCYI FNI CLFVGE I EKQR* LI * FHRS 
TLNGHI LYVRP VS PNPSQAvASPVTCTY 
MPTWPEWBESQKK 


13357 


27258 


A 


13500 


1 


270 


YTCSVTiLLVTVNLLLCLIYKLNFI igml 
RKKTVYIYIERIQYYLKFQASTX3CLGVY 
PTV* PYMYTYTYMYMYMYIYIYIYIYIL 
CLCLDG 


13358 


27259 


A 


13501 


170 


383 


ILYSS KLTYPS PQI VI FFCGKST *GWVW 
WLMPVI PAI * EAE VGGSLESTSLGQAWA 
TGRKAFFVFYKEYKN 


13359 


27260 


A 


13502 


31 


402 


GDYLYLREGWQE PARREQHAQNLEPNRT 
FTPHLICLTNYRLFSLAWSECEERRNET 
TDKNQTVKDLALDMKRVLPFS*GHQKVI 
KVGWAYACNPSTLGGQGQWITRGBEFK 
TSL INMAKPYLY 


13360 


27261 


A 


13503 


232 


446 


RNRPGAVAHACNPSTFRGRGGRIMRLGD 
*RS 


13361 


27262 


A 


13504 


196 


390 


VNKAE KKHLY FVARKE I LR PG WVHTCN 
PSTLGGRGGWII*GQEFETSLPLPKTQK 
LAECGRGGL 


13362 


27263 


A 


13505 


146 


7 


LKYVLYWLGAMAYTCNTSTLGGQGGWIS 
★GQEFETSIiANMVKPCLF 


13363 


27264 


A 


13506 


177 


18 


SQNFGRPRQQDHFRLDGRHSETPSLQEK 
VKKLTRHGGACVQSQLLERIiRREDHLGP 
GGGGCSVP*SVNI I 


13364 


27265 


A 


13507 


117 


338 


NKILKKKKGGGRFKES KFTS PGLQGNS F 
FMGPPKLNSRAGV+QRREGKNLGVPQLK 
P FEAMP L FARG PNTKNP 


13365 


27266 


A 


13508 


306 


443 


LARYE PAVRTRAC * AQWLTAVT FVL * EA 
EAGGPLEARNLRPAWAT 


imac 

1JJOO 


2726/ 


A 


13509 


146 


390 


KMF KGH E QAAHRKKKKRGGRF KGS KFTS 
ACLQRNIFFLGPPKLNSRAGV*QRGDWK 
NPGVTQFNRFEENPLFARGPNTQKP 


13367 


27268 


A 


13510 


119 


356 


NEDRNLRGGCPGR* LLRTEGVCSNPAGW 
SGIRWCEESGGLFWRWRGTESVLFSGL 
r V Cv- V r AQE KAKATGRAE VSLCP 


13368 


27269 


A 


13511 


221 


21 


EDIiQRDKPLGSCY STCGWAEQ WYLQHPG 
GAGSKCRIWLGAVAHAYNPSTLGGQGRW 
X X oyiir Ltoli 


13369 


27270 


A 


13512 


163 


418 


TFPDDQCLMLQDHACVKRS IQSA* YLTP 
ii V XuwWISAl&AuVjJjluSlr J. O iiKirAWAl 


13370 


27271 


A 


13513 


239 


21 


FGNLGGPGGRTACIQFSLGNIVRPYFYK 
I KPKPKPNQNKTKIS *AWWCJflP I VPATR 
KABVGESLEPRRSRLQ 


13371 


27272 


A 


13514 


135 


11 


KFFFWPGVLAHAYNPSSLGAQGGRIT *G 
QEYETSLANMVKS 


13372 


27273 


A 


13515 


162 


57 


EGTLRSRRPPLGGWVT*GQEFETSPANM 
VKRCLY 


13373 


27274 


A 


13516 


424 


54 


PKRGFFPTPFIWVPPVFPLPPFFKPPPR 
IFFLGPPKKKFFSPPPGLKIFFF*KGPP 
rr rce t r r r iir 1? iTr I^J^J^iU^JtCLA^oc JP r rf 
FWARPFS FLSLFFFBCET I PL YYNI WMHK 
DSCKAVIHHYH 


13374 


27275 


A 


13517 


310 


349 


GGGPLKKKLFFSRGGERFFFF*GAPPFF 
FFFFL+SSKIFIFFLKSFFFFFFFFFFC 
FFFFFSQILFFISSPCFVFFFLNYTSRT 



1356 



WO 01/64835 



PCT/US01/04927 



SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
add 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
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nucleotide deletion, V=possibIe 
nucleotide insertion 














CSYVCAIiILFKGNTYSRKARHILW 


13375 


27276 


A 


13518 


273 


393 


I KCLS YRKKATTLGGQGKVl IT* bUK r is 1 
SLSNM*KPQLY 


13376 


27277 


A 


13519 


450 


139 


I PP IALVS PFAKHTWQQQMGLVRRSS KG 
NNMSKGWEIIHPSRIFCSNHTLLRIALW 
LGAWHTCNPSTLVGRGGW IT* GWEFKP 
SKTSMERFCILSRHKRTRGS 


13377 


27278 


A 


13520 


47 


427 


KMKQLFEMLVFQNWTI FLSHSI FRTIR* 
LTLE PLMFRYYFKQ *RTLL P PKF* DRVW 
LCHAGGCL*CSGTVSAHCNLRLLGSSDP 
STSAS * VAGTVGAGHHIWL I LNNRELPK 
QKFF INS SRNLFFT 


13378 


27279 


A 


13521 


199 


98 


SLPSVKLSICCEFPEDIVSKDPIPWLP 
WKRESCSLCALPNLLPK* EKTNIQVP * 
VFQSTIKAKAQLGMVAHACNPS ruGGRG 
GWIT*GLR 


13379 


27280 


A 


13522 


314 


18 


QTLNFYILKIYNI ISLVHC* IFEARFQI 
LV* V * *NY PF* SLI CFTr S WHETQNT * * 
KNLI I WAGAVAHTCNCNTLGDOGRRIAL 
AQEIEPSLGNIAIQ 


13380 


27281 


A 


13523 


18 


231 


CPPAVFGTS IEQ IQLKYD I KD * KIGQAW 

l»Tf*1\'mrTTV*1Vl7V'iyTr/V*GT TJTDOCDr.irPBM 
WC-Air V J. FvjJLyAlS VCjVjiLiE. IKooKii^UHn 

STSMNRHCVPAWAT 


13381 


27282 


A 


13524 


239 


465 


RVQVSECLLHREKCLYFQIiGLS * LI PQT 

j-iT.TT vr\rw~r ty f PxrftTTT*^*^TfV? , PT ,T>nx>riTTO T 
GWIiKyy lujlro 1 VAni KJEiirlsX IjKvsKoJVK ± 

T*GQEFETSLANIVKPRLY 


13382 


27283 


A 


13525 


388 


468 


S LRLGT VAHTCNP S TLGG * GGW I TRSG 


13383 


27284 


A 


13526 


342 


369 


CTLNTLLLYS I SDFGVS AFLATGGDI TR 
NKVRKT* LRLGTVAHTYNPNTLGGQGRR 
LT*AQEFKNSLSNIGRPHLYKTKKTKKT 


13384 


27285 


A 


13527 


241 


231 


EYVCI WSHSSLYFSS SLY IHM* VCVCvY 
IYVHTNSHIYTHMYIHTHTHTHICIYKL 
EEKYKEL 


13385 


27286 


A 


13528 


241 


231 


EYVCIWSHSSLYFSSSLYIHM*VCVCVY 
IYVHTNSHIYTHMYIHTHTHTHICIYKL 
EEKYKEL 


13386 


27287 


A 


13529 


34 


299 


SiU?ASASPWGIGVLiRPLACPEDRFSPG 
PEANCX3EIETTELRVTLSPRLECSGTIT 
APCSLDLQGSRDPPPSAS*VSETTGASH 
HAQL 


13387 


27288 


A 


13530 


158 


45 


SGHRSRARWL* PVI SALWEASAGGS PE I 
KSLRNRRPCj 


13388 


27289 


A 


13531 


342 


1 


CFFLKGAPPFFFFYFFNFFFFFFLVGTD 

ITT TT V^nFT*TVT*nf/^ DXTTr^iraWDJf FTftTKinftDf^ 
jlLLiXJjl\.V 1 WAJniJWjlr JVJLlJINAVlrf\.IU>iriyrtKV3 

LP*PDIKIFYKVWLGAVANAYNPSNLGG 
*GGWIA*E*EFETSLANMTKPQYKKYKK 
LA 


13389 


27290 


A 


13532 


589 


742 


RIMKMLRI KICGTGPGMVAHAVNPS SIX5 
DGWIT * DQE FKAS LANMI KPiib x 


13390 


27291 


A 


13533 


74 


446 


HVGIPSRLTSVSHPRHCKHPRTESEHVY 
CAIEARHYQCKVSRPSDAAGSEGRPRDM 
TQDTEVADTPPPNISS ITATRTMAT IGV 
GVVAHAYNPNTLGGOGRRIA* AOE FKTS 
LGNLAKPCLYKH 


13391 


27292 


A 


13534 


1941 


1040 


AFHLLP YLSSGFYCS LGPCFGLGS SHMA 
AWHKE PLHNACSDSRRSAPIRQGLGSPS 
ATHDTHTRTLAHICKE IFKERLHE I KER 
E IDS *RG*NTQR* VEREEAENKYKPTEI 
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W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 
nucleotide deletion, V=possible 
nucleotide insertion 














NTE I ENH KQQEMLRHAGEERENS PREAE 
RKCRNEKCEIKSRNSRKEARLRKKVKFK 
TNEKGKG*QIITVHKEKREVGRERRGRQ 
NKKRTKSTQHVTQKGDMR* SVKEEPHLR 
RAKRNSNKMKNERYVLRQDTDHS CEQKE 
KMRQRHTRKKLARETWKWVRHTLNREQR 
AKPRKKETKEIHIHSAILKC 


13392 


27293 
• 


A 


13535 


1192 


1476 


EIKRKWGPGASCHACNPITGRPRRVDHL 
RSGVQGQPGQRGETPSLLKNIQISWAWW 
CAPVI * E I KRKWGPGASCHACNP ITGRP 
RRVDHLRSGVQGQPGQRGETPSLLKNIQ 
ISWAWWCAPVIPATWEAEAGGSIiETGSG 
GCTEPRSHHCTQAWVTE 


13393 


27294 


A 


13536 


55 


132 


RAESCSLAEDGVQ*GNLSSLQPPPPK 


13394 


27295 


A 


13537 


7 . 


325 


SPVEFRLGEPTKGTSSFGKRRNKTHTLC 
RRCGSKAYHLQKSTCGK*GYPAKRKRKY 
NWSAKAKRRNTTGTGRMRHLKIVYRTFK 
HGFREGTTPKPKRAAVAASSSS 


13395 


27296 


A 


13538 


259 


274 


KFEAAVCYHCTTLSL* *CHTVCFIiSFFL 
IiEIGYHYITQAQSGAMIAHCSLKIiLGSS 
NPPVPAF*VAETRSACYNAWLVFVKFLM 
GL*ELWKDMS 


13396 


27297 


A 


13539 


3 


68 


QASVLPQKQLMSWQKLSESTFVKFWNLV 
KTGRLGEE ISCCLWREYYS ISDCLATI 
KLPASHLSMRKPRHKDFKSTFLISTKNQ 
GNHGHGTKGNEVNNFQQRISV* KTYFKK 
YNMRPGAVAHTCNPSTLGG *DWWIT*GK 
NCQNQlaS 


13397 


27298 


A 


13540 


268 


468 


YRRLISKKFFKHRWKSVSFFKFSFFWLG 
EVAHACNPSTLGGQGGKIA*AQBFLTSL 
GNIVGPCLYKK 


13398 


27299 


TV 


13541 


308 


437 


FSWAFEIGSQAEVQWHDLHSL* PLPPRF 
KRFSHLSLPSSWDYR 


13399 


27300 


A 


13542 


7 


244 


AKTAPLFF* FETVS C F VAQARGQ WHDPG 
SLQPRPPGQETSMIKTSSDPPPPASQVA 
GATGMGHHAQKI * FLVETGS PHVSQGGL 
ALLK 


13400 


27301 


A 


13543 


50 


175 


ALPAIHTAVGQCSVEGFCVLSDPEGDGA 
ARMKLVRFLMKLSHETVTI ELKNGTQVH 
GTITGVDVSMNTHLKAVKMTLKNREPVQ 
LETLS I LGNNIRYFI LPDSLPLDTLLAD 
AifiPNVKbrWREAVAG* R*G*AREI FDEI 
ES 


13401 


27302 


A 


13544 


47 


362 


VAFQGLQVPATNPANFFFFWKGGFFFVP 
yc^KyCjrlsKlGIjRDPWPPGVRKI PGPTLS 
GTREKGAPPPPPI YF* FFWKKGGQKGGP 
GGF*TWDPKGPPSPTPPKGGD 




27303 


A 


13545 


97 


351 


NKKKKGP P P F F* KKRVGKKKKKGARAGG 
PPPKPPPFGGPRGGGPRGQNSKTPRPKG 
GNPPFKKKKKKRNI FPAjGPMGLKGGGHL 




27304 




13546 


214 


iZ 


W x XKISSVDGGAKIK* WT*NLKCL VI*SV 
KNITRWVDCLSSGVQDQPGQHGETPSLL 
KTRKTS W A * WR T /?T . T OR WR KM P W P Q P 


13404 


27305 


A 


13547 


350 


159 


QTGTLSQKTKQKKQAQWLMTVILGL* EA 
KAGGLLE P KS LRPAWATWQD P I YKKKFF 
LERKTWR 


13405 


27306 


A 


13548 


367 


149 


FFFFFFFF*TGSHYVAQNWTQIPGLNLP 
ASASQVTATTGMHHHTRHTPFPSLTSHT 



1358 
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nucle- 
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peptide | 
sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=IsoIeucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, PHProline, 
Q=GIotamine t R=Arginine, S=Serine, 
•^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














QSVSTASWTYLQNSGRH 


13406 


27307 


A 


13549 


321 


2 


GLVKVTHHGYDRNLGYPDSQASVLSSFY 
TA*MVQ*LQLPDKVQIYPAHPI»FINDMP 
P * WLLKIPS FGPGTVAHPYNPNTLGGRG 
RRTTRDREFBTSLANWKPHLH * 


13407 


27308 


A 


13550 


216 


13 


GYSGKETVFSMRKGQQSGPCIiQGPS I PM 
GKAGINQIIPKTTLGGRGGWIA*GQEFK 
TSLANTVKPCLY 


13408 


27309 


A 


13551 


199 


391 


KDFS I FIACFKRPFITGSQGQEQWLI FP 
HAYNPS ILGGRGRQIT*GQEFLTPARLG 
NMVKPCLY 


13409 


27310 


A 


13552 


70 


239 


ATCVKN* RTSWAWWCVP WPATHEAAAG 
GLLEPRSSRLQCSYXASVERHRTSVWTT 


13410 


27311 


A 


13553 


105 


257 


GQAQWLMPVI PALWEAEAGGS FAPRSLR 
LA* ATVPGPNNFLKRAFHS LLNIiFLP 


13411 


27312 


A 


13554 


499 


165 


Y*MESHFVTQGGLQWCSLGSLQAPPPEV 
GGWLELGRQRLPMKPKIAP I WTPSPG 


13412 


27313 


A 


13555 


415 


719 


YTDKSIRSISLLPVKGSTHSMCPVKFIQ 
LQKAGG I HMAKLSGQRVDREWRLGTVAH 
TCNRSHLEGKGGQIT*AQVFKTSIjGNMA 
KSCLCKKYQKLSWIWQH 


13413 


27314 


A 


13556 


224 


523 


DKVSLCHPGWECGWVQSQLTTASISQG* 
TILFPQSPE 


13414 


27315 


A 


13558 


355 


1 


QVSLQLPYCVLFQFHIYNKQAALLQRCY 
VSATNLLINAIWQYVNSLKMCEENQNVS 
LQNMP F * HRNYFGEAGHGGS CL * SQLIjG 
RLRH*NCLNLGGGGCSELRLRHCTPAWA 
TKASPS 


13415 


27316 


A 


13559 


73 


259 


KYVPHKGKISERRLCSVAQAGVQWCDHG 
SRSLEVLGSNDLLASAS * VAGITGCWGS 
SDPHTT 


13416 


27317 


A 


13560 


205 


21 


WKI S EDFLLGDAKKWAYQED I KRLRLG 
RGAVSHTCNPS TWRGRGGW IT *DQEFET 
SLPKC 


13417 


27318 


A 


13561 


254 


481 


LMALLPGSSDVDLSEYGWEHFNFVILIS 
KEI FILLFF*DGVSHIjLPRLECSGMISA 
YCHIiCLQGSSDRRDFLKKQ 


13418 


27319 


A 


13562 


174 


369 


TFFSPSMFVEPGPYYIAQAGVRWLFTGA 
IIWCSLKI*LALSDSPTSAS*VTATTGM 
PPHLAFTVK 


13419 


27320 


A 


13563 


371 


109 


LFSKAGRCILQNLLCCMVSGKCWPMNGI 
DVXREYPTFIAHRVLTYAATSGSDFLGQ 
AQWLMPVTPAI *EANAGBSLEPKSLRLA 
*ATVPGPRSLSQTLLHRSELCGL 


13420 


27321 


A 


13564 


206 


486 


RQLAAVH I LVTPLPDHVTWANYVTSLS L 
LSSSENGVNNDNTHFIGLLLRDYK* GWV 
* WLTPVI PALWEAKVGRIARGQELENSL 
GSKARPRFV 


13421 


27322 


A 


13565 


212 


398 


SRVRGCFPFNLPKSSCI*ECGQGATOHA 
CNPSTLGGRGGQIA*AQEFKTSQGWVAK 
PHLYKI 


13422 


27323 


A 


13566 


26 


395 


YSPVHTDKCSGVRKLGLFFFFFFEKEFH 

KPPKKRGKRGRAPQPGKILNFKKKRGFS 
MGAKVNPNFGPKGNPPP*PPKGAGKKGG 
TPRPGPFFFFF 


13423 


27324 


A 


13567 


400 


195 


" ATRVSLAPTKNKINWGWWPTPVIPPPQE 
GEAGEFL* PQKWRVRLNKIVPLPSRPRD 
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K=Lysine, L=Leucine t M=Methionine, 
N=Asparagine, P=Proline, 

C#— {« 111 t* A m 1 n p RsArotnino CsaCawlMA 

>c v*iui*tiuiiic, xv~/vrginine, o^^crine, 
T»Threonine. V^Valine 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 

nucleotide deletion \=nn«ihlp 

nucleotide insertion 














KTRLCFKKKKKN 


13424 


27325 


.A 


13568 


316 


154 


SO P F PQLR WEnR T . G Rf3 VflMft Dlfn * t ' mdi t 
I PAF S GAEVGGS LE PRS S ECP A* ATNNLK 
SKVIIGGQICDKASKVSIVESSWWGWGC 


13425 


27326 


A 


13569 


297 


475 


QP* PPGLKQSLLPWPPKSAGLTGVSYGT 
WPI 


13426 


27327 


A 


13570 


116 


JJZ. 


PPTSAA* GDDETTGVCHHTPV* FIiKTPF 
IEIRPGYVPRVGHWRF 


13427 


27328 


A 


13571 


233 


405 


TVPELRASCRLKKKKKKKKGEKNLKKIK " 
GFPGPGPRV*GPEAKNPGGGGPGIPKGP 


13428 


27329 


A 


13572 


408 


3 


TFCDYFKTPPPGKFFFINGPPSFFLNRG 

PSFFFFP(^SRPPPB*T.inNnri7l7i?DT trtrcvnr* 
r «* r c r tr \jr trrrc JjJSJNrr r r Fjuv Vj? KJLrCj 

GYKRGVFPPKKKGPPQPLFFFFFFFLI * 

KKTRPGTVAHPCNPSTLGGQGRW IT*GQ 

EFLSSLAKMAGRWRPGNCESV 


13429 


27330 


A 


13573 


273 


406 


IKEFKAERGGHS *NTSTFGGQGRWIA*S 
OEFETS T . aNMWDHT . V 


13430 


27331 


A 


13574 


385 


38 


KKNSPPPPGKNFFFF*GPPPSPFFFFFF 
FFFFFFFIiGGKKTFFFPPPSRFFFFLKP 

GGAKGAPLFKKKKKKKKLKVRPGAVAHT 
CNS 


13431 


27332 


A 


13575 


230 


53 


HWKVLKENKQI FTKIS FKMNAI KDS VGQ 
AQWLMPVIPAL*EPDMGGPVEPKSLRPA 
WAT 


13432 


27333 


A 


13576 


406 


33 


VNSIVWVSSPFRVSTFISFFELKSCSVT 

LSFYLSSIFYPHSSQTELFVGTLNAASF 
TNSRLLYLSLLLCX3KYHPLVPSLPLSTI 
WYFSLRVSHHFP 


13433 


27334 


A 


13577 


426 


294 


GGFPSPPVT* , fCPPPni?17TrT« , r i DT irV7TO nn 
wvjr r orr vr rvJErirfyr Jc r rwc\LiJ\2\J\. loir Ir 

PPGEKKI FF* KAPPP 


13434 


27335 


A 


13578 


268 


3 


LQAKGPAMGEAGARCSSEVWGWPRKGFD 
L*VSR*PSFDSHVRIGRVQRLMPWPVL 
WETEADISFEPCSSRPAWASWRKTYIQK 
PQRI 


13435 


27336 


A 


13579 


395 


49 


EKKKFFVCFPGFPGNPPNPSPQFF*KGL 
ISPSGSI1RTRRGFFP* RVFFFFLKKQFP 
hS PRVECNGI I PDP * PPFGEVTSTPQVA 
GAIGPPPPPRVNFYFFVKKGFLNVGPMF 
PFF 


13436 


27337 


A 


13580 


68 


466 


GASPAQGSTLHLVILPRYFFKIPTVRTE 
SFSGSLVTSPPPLHFLPLWKERGGPG I P 

LPCPO IaLOVT XZCZTCZ T. D D VD G T .DTYT C rsTSV 

WPLHGVPPGHVCS*PLAGDGAWPPSPHW 
IPLWPGTSKSLQPAPPWNSA 


13437 


27338 


A 


13581 


67 


249 


ATAPGLLCSYKIFSCQLQWCVPVTQLAW 
ETEAGGSLEARSSRL* *TMITPVNRHCI 
LAWAT 


13438 


27339 


A 


13582 


359 


3 


KQDSQWVAAAPASVAVSTCWRGLPLPWS 
GGQNRGKGSLPLQVHRGA*GTENKNQGG 
TPRPGGGPGPSAPRGSSLGAHRKLPVHH 
TTLS SSRSALPPTPRHPAPSS PPCTEE F 
HRTRPI 


13439 


27340 


A 


13583 


174 


1 


FFF F F VE TESHS VAQARVLE YSGAI PAQ 
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CNLDNIiSSSELPASAS*AS * VAAXTGT* 
KR 


13440 


27341 


A 


13584 


192 


206 


LFFLEVGSCYVAQAGr Kr FUfaijHijfc'JjiaA 
S*VVGSTGVPHHTQLV*SFSSLYNLBRN 
C 


13441 


27342 


A 


13585 


244 


28 


KKPENQEINYIFYHLFNVHEQIRISLGP 
GAVAYNCNPSTLRD*GGRIT*AQEFKTN 
LGNMARPRTRGSTRTF 


13442 


27343 


A 


13586 


312 


701 


RGPEBT* PRTRKSS PRPTRSCES VS KLA 
RPPRRCSAAPAQVPRIiSIiRbPKJ'lJVi'Kv 
SSAEKCAPLPLLPECSETGALPRNSFLC 
• QNASS PLLSbGLPPS PTVQALQPRALHQ 
HLGSTNKEDAHVAPAKKK 


13443 


27344 | 


A 


13587 


410 


292 


GYMIKQQTVYHECRM*ANSHLHLPPEGL 
LRAVTLWRRAP 


13444 


27345 


A 


13588 


428 


36 


TPPPFFFFWAQKKKKIFPPPGGKIFFFL 
KGPPPLFFFFFFFFFFFFFFLKKFFGLK 
GPFFFVQD*QGIiENPKEGSSLGWEKLGR 
ALLPQQRS FVPGLGPTRLFWGLVFWFFG 
FFFLPFGLCFFFALI FNC 


13445 


27346 


A 


13589 


3 


364 


TAPDDQGRPYQAGEPAAHVRADSTHGIjG 
A*RRVRGHLPPRVGAPHHPGAbbKJst3Ai' 
PLHPPPNL* HRRPNGS CRFI>PG PAAPPQ 
GAEGGRGRDRRGTHSVAQTGGPGGFGSG 
VTPSWRSS 


13446 


27347 


A 


13590 


778 


910 


DRIRRSGAVAHTCNPSTIiGGQGERTA*G 
QEFKTSLGDMVRFCLY 


13447 


27348 


A 


13591 


230 


3 


KLVN I LVE KSLVAMKS1 v fa isljlun V XKfa if 
ASLGMVAHTHSPSTLGGQGRQIA+AQEF 
ETILGNMAEPCLYKKHSVY 


13448 


27349 


A 


13592 


223 


340 


LTP 1 1 PE PWES EWGS LiE * I»TPGAVTHA 
YNPRTLGIRSGRIT*GHEFQTTLSHTMK 
TRLYHELL 


13449 


27350 


A 


13593 


35 


220 


DRASLSPRLECSGMIVAHCS INLSGSSD 
PLTPAS * GDGTTDTCHHARL I FTGADirL. 
IDASS 


13450 


27351 


A 


13594 


87 


369 


NVQKTECEISGKMQIWQKGSVKTKCSNG 
LFNFP I FS KKIAS CKvKJsIj 1 KiS *w RoU 
ARWCT PVI PATEAEVRGQLEPRS LRPAW 
ATYQPHFKS 


13451 


27352 


A 


13595 


261 


489 


gijxTMT?PMTn7n**tfirgTCOar>MT.MPTTPA 
SWVNERMWU* *KHJ£ 1 bUAU wumi?±Xk'j\ 

LCEAEVGGSLEARSWETSLCNIVRLHLS 
KKKRGKGRKRGGNQIAPSRE 


13452 


27353 


A 


13596 


238 


2 


NDCLWWLFRI PANVSTFGLLHMSLKVNT 
PGNNRFKSELGTRCL IHTCNPS ILGGQG 
GWIT*GQEFVTSMQKGSAVKNQ 


13453 


27354 


A 


13597 


36 


687 


RDVHRSTY QAGS KQD WGPGEAERLS S SR 
RGAYSCPVP ITFAEGKTRMGRDKMRLIL 
GI*SWGPSLTIiIjI^ITNAPRRPoM*EF/Uj 
PGNSTS * SGAVRFPGGGRK*EDSAESWS 

r^T?/^oooe T5 o O/^ t3TvTD13 C T3DC Ji"\7T*MT.Df2r J5 
Cr (Jonr or* oovjFiN Jr ro r tf ortv Ll T LUxrvjlJO 

GIiQR* PKQLS PPPAIiSLPVSSLLVRLSP 

WPPTSSHLLPQALPQSLHPQGSHRAVND 

SFSM*GMVLGSRRNRGCMEA 


13454 


27355 


A 


13598 


367 


468 


KKQRRGRE PWLTPAI PALWKAKAGGS P * 
VQSSRP 


13455 


27356 


A 


13599 


183 


380 


LTSMLAVDNSSVEKTC PQAMRI SRPGAV 
AHTCNPS ILGGQGGLIA* VQEYETNLGS 
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MAKTHLYKN 


13456 


27357 


A 


13600 


161 


657 


LIQGCWCSLFFSTRVFLAAIGMKSQWCR 
PVAMDLGVYQLRHPS ISFLSSLL6TENA 
SVRLDNSSSGASWAIDNKIEQAMDLVK 
SHLMYAV*EEVEVLKSQIKELIEKNSQL 
EQENNLLKTLAS PEQLAQFQAQLQTGS P 
PATTQ PQGTTQPPAQPASQGSGPTA 


13457 


27358 


A 


13602 


121 


423 


MIKVNSFGLYLYSQVTYIYIMKYYTAIY 
LKDFFFKDGVLTRHPRWNTVTTQCSLEI 
LGSTD PPASAS * AAGTTNTSHHGQLYS Y 
FKKRPTYAEMEQYKI KK 


13458 


27359 


A 


13603 


1889 


1717 


VPRIRPLSWTPPSSI *RPKPPSSTFSHP 
GKPSMGPPGLNRCPVQRALLPGWYQHCG 
R 


13459 


27360 


A 


13604 


81 


267 


ETCCIKNNTLK* I *SQMLWCAPIVPATW 
QAEVGGSLDPRSSSL* CAMITSVNNHCT 
PAWAR 


13460 


27361 


A 


13605 


267 


418 


THASGMVTHTOjJPGTIjRVSAGRIT*GQE 
FKTSUSNIARPHLYKI I INKY I 


13461 


27362 


A 


13606 


428 


24 


CPESNPRKREPTLKSEPILTLPITGLIL 
DDGNS PTAAAGSVFAETLQPQCSSLCC* 
VKSSPWPTASLPQPFGSAPQTLPVRYAL 
IPGPSLVSTSLGPRLAPGLPGSPSPPSP 
QVTLDSERYPSPTPCIAPALED 


13462 


27363 


A 


13607 


267 


442 


RPPPTLKVPWLGVEVHACNPRTLGGQGG 
SIS*AQKLENSLSNILRPHLYQKQQKQQ 
KK 


13463 


27364 


A 


13608 


3 


280 


YRLSVI CEDPMREREQTCFPPPPSFPPN 
RYCSTRDGGMEGSQGDFRKGFPA*APCR 
SGGGWRGCCSPAGAPGSPLCGSSQGGPV 
SPWGQKRS 


13464 


27365 


A 


13609 


291 


248 


EKRAHGS QPMKGRERNSMCKSSE FRWA 
WSLAGVKGDGGR*GSGAGGSPKLG*AV* 
LDPEGHAGQAVFCGEAGARTQIGGKG*R 
RDQAPLPLSPNLCSGSCLPTKHSLARMP 
JjK V Jvb Y £> b AQI/3RTTC1 HPS PPT ITFDS 
S 


13465 


27366 


A 


13610 


267 


3 


HTDI»LYNTPTHPISPRCDPKHTAI PDKQ 
SLIjFFFFEMESCSVARAGVRWHDIiRSLR 
PLPPRF*KFFCCSIPSIiYOGRSRGSSQT 
YTS 
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Er=Glutamic Acid, F=Phenylalanioe, 
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nucleotide insertion 


13470 


27371 


A 


13615 


197 


428 


KTMVGLKTSAS FHQDF IQ I PLNHNLS VP 
*TMLGSQPRFLFPPGTVSHICNPSTLGG 
RGGWIT*GQEFETSLATMVKP 


13471 


27372 


A 


13616 


445 


550 


I S I TRGRARWLTP VTL VLWEAEAGGS P * 
GRSSRLP 


13472 


27373 


A 


13617 


3 


435 


TMVLS PADKTNVKAAWRKVGAHAGEYGA 
EALERMFLSFPTTKTYFPHFDLSHGSAQ 
VKGHGKKEADALTNAVAHVDDMPNALSV 
LSDLHAHKLRVDPVNFKLLSHCLLVTLD 
RHLPAE PTPAVHAS LDKFLAS VSTVLTS 
KYR 


13473 


27374 


A 


13618 


719 


918 


Lll^RkKKWKIGRERKiGKGKKittji^^ 
GKGREDGGRKRPCTSRPRSSSRDRSNI I 
ISEHASAVEA 


13474 


27375 


A 


13619 


847 


938 


WLMPAVPALLEAEVGISLEAQSCRPPWA 
TQ 


13475 


27376 


A 


13620 


349 


409 


NLWSWPGAVAHVCNPSTLGG 


13476 


27377 


B 


13621 


25 


189 


MVLSPADETNVKAAWGKVGAHAGEYGAE 
ALERMFIiCFPTTMTYFPDFDLIHGSAQ* 


13477 


27378 


A 


13622 


142 


356 


RIVENEKINAEKSSKQKVDLQSLPTRAY 
LDQTVVPILLQGLAVLAKBRPPNPIEFL 
ASYLLKNKAQFEDRN 


13478 


27379 


A 


13623 


18 


1353 


AGAAQCEWSAGEAGARTMSEADGLRQR 
RPLRPQWTDDDGQAPEAKDGSSFSGRV 
FRVTFLMLAVSLTVPLLGAMMLLESPID 
PQPLRQISGIALFCSFKBPPLLTiGVmP 
NTKLRQAERLFENQLVGPES IAHIGDVM 
FTGTADGRVVKLENGE IETIARPGSGPC 
KTRDDEPVCGRPLGIRAGPNGTLFVADA 
YKGLFEVNPWKREVKLLLSSETPIEGKN 
MSFVNDIiTVTQDGRKIYFTDSSSKWQRR 
DYLLLVMEGTDDGRLLEYDTVTREVKVL 
LDQLRFPNGVQLS PAEDFVLVAETTMAR 
IRRVYVSGLMKGGADLFVENMPGFPDNI 
RPSSSGGYWVGMSTIRPNPGFSMLDFLS 
ERPWIKRMIFKLFSQETVMKFVPRYSLV 
LELSDSGAFRRS LHDPDGLVATYI SEVH 
EHDGHL YLGS FRS P FLCRLS LQAV 


13479 


27380 


A 


13624 


18 


1353 


AGAAQCEWSAGEAGARTMSEADGLRQR 
RPLRPQWTDDDGQAPEAKDGSSFSGRV 
FRVTFLMIAVSLTVPLLGAMMLLESPID 
PQPLRQISGIALFCSFKEPPLLLGVLHP 
NTKLRQAERLFENQLVGPES IAHIGDVM 
FTGTADGRWKLENGEIETIARFGSGPC 
KTRDDEPVCGRPLGIRAGPNGTLFVADA 
YKGLFEVNPWKREVKLLLSSETPIEGKN 
MSFVNDLTVTQDGRKIYFTDSSSKWQRR 
DYLLLVMEGTDDGRLLEYDTVTREVKVL 
LDQLRFPNGVQLS PAEDFVLVAETTMAR 
IRRVYVSGLMKGGADLFVENMPGFPDNI 
RPSSSGGYWVGMSTIRPNPGFSMLDFLS 
ERPWIKRMIFKLFSQETVMKFVPRYSLV 
LELSDSGAFRRS LHDPDGLVATYTSEVH 
EHDGHLYLGS FRS P FLCRLS LQAV 


13480 


27381 


A 


13625 


1 


384 


QSFRGTGRKRERERKRMSLSDWHLAAKL 
ADQPLTPKSILRLPETELGEYSLGGYSI 
SFLKQLIAGKLQESVPDPELIDLIYCGR 
KLLDDQTLDFYGIQPGSTVHVLRKSWPE 
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PDQKPEPVDKEAAMRD 


13481 


27382 


A 


13626 


980 


1089 


KRIRIQLTGGLYPVPPPHPPPQSPPIFP 
RPTSPTRT 


13482 


27383 


A 


13627 


116 


587 


VCX5ELRADSWPVPSQPEQASGPQKQAFI 
WPEAPSRSARLPITYTDYWDSRLQTQES 
QMLGSMARKKPRNTSRLPLALNPLKSKD 
VLAVLAERNEAIVPVGAWVEPASPGSSE 
IPAYTSAYLIEEELKEQLRKKQEALKHF 
QKQ VKYRVNQQ I TLRKQ 


13483 


27384 


A 


13628 


150 


484 


VAF PQAS RGVRKTE VGEQQGQGTACGGC 
GHQ CP P PTWH I QRAHCVPS TCGAGGWHL 
QGRASCRRPPSQSPQVYQTQVGRQDPHF 
GVGRDSRGELAWS S FIHPYLLS VCNPL 


13484 


27385 


A 


13629 


330 


397 


ARCEWLTPVI PAIiWEAEAGGSH 


13485 


27386 


A 


13630 


10 


137 


SFTGAVILI IAHGLTSSLLFCLANSNYE 
RTHNLFFLNDFFFS 


13486 


27387 


A 


13631 


380 


98 


FQWLVSFTATLWFEERHRKNKI EREERR 
AKGERGDRKEERRE EGGEERGRRGGERS 
DKREPKKKSKEESNHPKWKFS FICSFC 
FLTPFFPVFF 


13487 


27388 


A 


13632 


56 


827 


PLFEAFTACGFVHDCXSLLIHPEETCXJLQ 
PISSDYIEAILQSELKRCPSGDMKGQWI 
VPCLSCSDNRTCDWREITWQPHNCQYGV 
LTKPQLQQCLGGRKILFIGDSTNRGIMY 
YLIERLNETLQEWQKVHGTKFYHNVNGG 
KTLISYSYYPQFWISPSLRPTFENALEH 
LLQRSRPIuENTGQTVLVVGGVQWLNSNH 
LQI IHKVLKSPFTTLNQPVTKSCLQAIY 
FPRLS PTLHSNCLDLVYS FTKS FNIYFV 
VQFLN 


13488 


27389 


A 


13634 


3 


2718 


SGPCRTTVAPIiLRAAPVEHCVAAfcRPTD 
STMLKKFDKKDEESGGGSNPFQHLEKSA 
VTjQEARVFNETP INPRKCAHILTKILYL 
INQGEHLGTTEATEAFFAMTKLFQSNDP 
TLRRMCYLTIKEMSCIAEDVI IVTSSLT 
KDMTGIGSDNYRGPAVRALCQITDSTMLQ 
AIERYMKQAIVDKVPSVSSSALVSSLHL 
LKCSFDWKRWVNEAQEAASSDNIMVQY 
HALGLL YHVRKNDRLAVNKM I S KVTRHG 
LKSPFAYCMM IRVASKQLEEEDGSRDS P 
LFDFIESCLRNKHEMVVYEAASAIVNLP 
GCS AKEIAP AVSVLQLFCS S PKAALRYA 
AVRTLNKVAMKHPSAVTACNLDLENLVT 
DSNRS IATLAITTLLKTGSES S IDRttfiC 
QISSFMSEISDEFKWWQAISALCQKY 
PRKHAVLMNFLFTMLREEGGFEYKRAIV 
DCI IS I IEENSESKETGLSHLCEFIEDC 
EFTVLATRILHLLGQEGPKTTOPSKYIR 
FIYNRWLEHEEVRAGAVSALAKFGAQN 
EEMLPS ILVLLKRCVMDDDNEVRDRATF 














YLNVLEQKQKALNAGYILNGLTVS I PGL 
ERALQQYTLEPSEKPFDLKSVPLA.TAPM 
ABQRTES TPITAVKQPEKVAATRQE I FQ 
EQLAAVPEFRGLGPLFKSS PE PVALTES 
ETEYVTRCTKHTFTNHMVFQFDCTNTLN 
DQTLENVTVQME PTEAYEVLCYVPARSL 
P YNQ PGTCYTLVALP KEDPTAVACTFS C 
MMKFTVKDCDPTTGETDDEGYEDEYVLB 
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DLEVTVADHIQKVMKLNFEAAWDEVGDE 
FEKEETFTLSTIKTLEEAVGNIVKFLGM 
HPCBRSDKVPDNKNTHTLLLAGVFRGGH 
DILVRSRLLIiLDTVTMQVTARSLEELPV 
DIILASVG 


13489 


27390 


A 


13635 


118 


629 


LWALQLHPEPELPSRRGTGAAGVWTGVA 

uTTiTJTT TftWDTir'VPDT A VWVM/^lTTirYrYUTr 
MIRr X Jj I QNKAIjK. 1 KJuAivW X W V r Uuuct IN. 

QKLIEEVHAVVTVRDAKHTNFVEFRNFK 

X 1 X KK, x AVjIj X r L. JL V- V U V JNL/riX>iXirt.X JjxxfiX 

HNFVEVLNEYFHNVCEUDLVFNFYKVYT 
VVDEMFLAGEIRETSQTKVLKQLIiMLQS 

T I? 


13490 


27391 


A 


13636 


57 


221 


LHHCTPPWAEVEETLKRLQSQKGVQGI I 
WNTEGGWEPLGHCGDRSRPPAQGCP 


13491 


27392 


A 


13639 


344 


544 


LSGGHAGADSSLVrVHLYCXiLSSQQLLGN 
VLtV 1 V ixAXrlr vjiVIj r i tr Cj V \i±\z> ri y rvn v x vji 

VC&ALCFRYH 


13492 


27393 


A 


13640 


319 


623 


DMEEASEGGGNDRVRNLQSEVEGVKNIM 
TQNVERILARGENLEHLRNKTEDLEATS 
EHFKTTSQKVARKFWWKNVKMIVLICVI 
VFIIILFIVLFATGAFS 


13493 


27394 


A 


13641 


2099 


769 


TRLAGRVSVASRPCRGPAVGGLLVERSK 
ARRPLLESRVAMAAVPELLQQQEEDRSK 
LRSVSVDLNVDPSLQIDIPDALSERDKV 
KFTVHTKTTLPTFQSPEFSVTRQHEDFV 
WLHDTL I ETTD YAGL I IPPAPTKPDFDG 
PREKMQKLGEGEGSMTKEEFAKMKQELE 
AExTjAVFKKTVSSHEVFLQRLSSHPVLS 
KDRNFHVFIiEYDQDLSVRRKNTKEMFGG 
FFKSVVKSADEVLFTGVKEVDDFFEQEK 
NFL1^YI^IKDSCVKADK^^ , RSHKNVA 
DD YXH rAACJurto LiA-LuS c* JrX V J. JUS. X JLtUJV V 

aelfbklrkvegrvssdedlkltellry 
ymlnieaakdllyrrtkalidyensnka 
ldkarlkskdvklaeahqqeccqkfeql 
sesakeelinfkrkrvaafrknliemse 
leikharnnvsllqscidlfknn 


13494 


27395 


A 


13642 


210 


772 


SVKMVRYSLDPENPTKSCKSRGSNIiRVH 

i?xrKT»T>Cii?n*7\^7VTV , /* , MIITPVTV r PTirVT.VTi\7Tf , T. 
r i\JM JLKJS x AyAl INvjIYitl X i\J\rt x Jx i i_i wj v xxi 

QKQCVPFRRYNGGVGRCAQAKQWGWTQG 
RWPKKSAEFLLHMLKNAESNAELKGLDV 
DSLVIEHIQVNKAPKMRRRTYRAHGRIN 
PYMSSPCHIEMILTEKEQIVPKPEEEVA 
QKKKISQKJCLKKQKLMARE 


13495 


27396 


A 


13643 


168 


2172 


S P LCEVS VP S FCFRVF CKKHKKHKS DKH 
LYEEYVEKPLiCLVIiK^GGNEVTELSTGS 
SGHDSSLFEDKOTHDKHKDRKRKKRKKG 
EKQIPGEEKGRKRRRVKEDKKKRDRDRV 
ENEAEKDLQCHAPVRLDLPPEKPLTSSL 
AKQEE VEQT P LQEALNQ LMRQLQR KD PS 
AFFSFPVTDFIAPGYSMI IKHPMDFSTM 
KEKIKNOTYQSIEELKDHFKLMCTNAMI 
YNKPETIYYKAAKKLLHSGMKILSQERI 
QSLKQS IDFMADLQKTRKQKDGTDTSQS 
GEDGGCWQREREDSGDAEAHAFKSPSKE 
WKKKDKD^EDKFKSNNLSREQEQLDRI 
VKSSGGKLTRRXjVNSQCEFERRKPDGTT 
TLGLLHP VDP I VGE PG YCPVRLGMTTGR 
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LQSGVNTLiQGFKEDKRNKVTPVLYLNYG 
PYSSYAPHYDSTFANISKDDSDLIYSTY 
GEDSDLPSDFSIHEFLATCQDYPYVMAD 
SUjDVLTKGGHSRTLQEMEMSLPEDEGH 
TRTLDTAKEMEQITEVEPPGRLDSSTQD 
RLIALKAVTNFGVPVEVFDSEEAE I FQK 
KLDETTRLLRELQEAQNBRLSTRPPPNM 
ICLLGPSYREMHLAEQVTNNLKELAQQV 
TPGDI VSTYGVRKAMGIS I PS PVMENNF 
VDLTEDTEEPKKTDVAECGPGGS | 


13496 


27397 


A 


13644 


1032 


207 


PADVT P KPATEAVQSEHS DAS PMSINF.V 
ILSASGAJCKLIDSLHSYCFSSRQNKSQV 
CCLREQVEKKNGELKSLRQRVSRSDSQV 
RKLQEKLDELRRVSVPYPSSLLSPSREP 
PKMNPVVEPLSWMLCTWLSDPPGAGTYP 
TLQPFQYLEEVHISHVGQPMLNFSFNSF 
HPDTRKPMHRECGFIRLKPDTNKVAFVS 
AQNTGWEVEEGEVNGQELCIASHSIAR 
IS FAKE PHVEQ I TRKFRLNSEGKLEQTV 
SMATTTQPMTQHLHVTYKKVTP 


13497 


27398 


A 


13646 


148 


380 


RGSWREVPESASLPSRGABCGKWRGLCCC 
CCCCCCCCCCCCYHCHQEQGDLKHQADL 
WRSGRTQNQAGIWQEHQTLEG 


13498 


27399 


A 


13647 


2099 


769 


TRLAGRVSVAS RPCRGPAVGGLLVERS K 
ARRPLLESRVAMAAVPELLQQQEEDRSK 
LRSVSVDLNVDPSLQIDIPDALSERDKV 
KFTVHTKTTLPTFQS PEFSVTRQHEDFV 
WLHDTLIETTDYAGLI I PPAPTKPDFDG 
PREKMQKLGBGEGSMTKEEFAKMKQBLE 
AEYLAVFKKTVSSHEVFLQRIiSSHPVLS 
KDRNFHVFIiEYDQDIiSVRRKNTKEMFGG 
FFKSWKSADEVLFTGVKEVDDFFEQEK 
NFLIOTYNRIKDSCVKADKMTRSHKNVA 
DDYIHTAACLHSLAIiEEPTVIKKYIiliKV' 
AELFEKLRKVEGRVSSDEDLKLTELLRY 
YMLNIEAAKDLLYRRTKALIDYENSNKA 
LDKARLKSKDVKliAEAHQQECCQKFEQL 
SESAKEELINFKRKRVAAFRKNLIEMSE 
LEIKHARNNVSLLQSCIDLFKNN 


13499 


27400 


A 


13648 


1 


1206 


MSTSQS PCES ICDYVTSHDKSNFTDMI K 
LNILRCEVILDYPAPGGGSLGAKHCCSC 
YTVSSGVTEGERNAGEKGVKLNADGARI 
RGTPGRGRRAEAEASSPAPAAVAAACW 
AAAAASRQLASGNRTRVSSGVPAPAFLG 
TMNPNCARCX3KIVYPTEKVWCLDKFWHK 
ACFHCETCKMTLNMKNYKGYEKKPYCNA 
HYPKQSFTMVADTPENLRLKQQSELQSQ 
VRYKEEFEKNKGKGFSWADTPELQRI K 
KTQDQISNIKYHEEFEKSRMGPSGGEGM 
EPERRDSQDGSSYRRPLEQQQPHHIPTS 
APVYQQPQQQPVAQS YGGYKEPAAPVS I 
QRSAPGGGGKRYRAVYDYSAADEDEVS F 
QIXjUL I V N VUU 1UDGWMYG T VERTGDTG 
MLPANYVEAI 


13500 


27401 


A 


13649 


3 


394 


GDGGGHLGSGRNGGGSMNAP PAFESFLL 
FEGEK I T INKDTKVPNACLFTMNKEDHT 
LGNI I KSQLLKDPQVLFAGYKVPHPLEH 
KIIIRVQTTPDYSPQEAFTNAITDLISE 
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nucleotide insertion 














LSLLEERFRVRAGPGGAD 


13501 


27402 


A 


13650 


1 


471 


SRPTGLREAAGSGPREAPRRSGCKSPGL, 
GTVGMLRPKALTQVI^QANTGGVQSTLL 
LNNEGSLLAYSGYGDTDARVTAAIASNI 
WAAYDRNGNQAFNEDNLKF ILMDCMEGR 
VAITRVANLLLCMYAKETVGFGMLKAKA 
QALVQYLEEPLTQVAAS 


13502 


27403 


A 


13651 


1334 


82 


CYTGGTQSLWPGSSCASSVARPSSLFRS 
AWSCEWSVRCARACTMSVPAFIDISEED 
QAAELRAYLKSKGAEISEENSEGGLHVD 
LAQI IEACDVCLKEDDKDVESVMNS WS 
LLLIIjEPDKQKALIESLCBKLVKFREGE 
RPSLRLQIiLSNLFHGMDKNTPVRYTVYC 
SLIKVAASCX3AIQYIPTELDQVRKWISD 
WI^TTEKKEmiliRIiLYEALVDCKKSDAA 
SKVMVELLGSYTEDNASQARVDAHRCIV 
RALKDPNAFLFDHLLTLKP VKFLEGEL I 
HDLLTIFVSAKLASYVKFYQNNKDFIDS 
LGLLHEQNMAKMRLLTFMGMAVENKEIS 
FDTMQQELQIGADDVEAFVIDAVRTKMV 
YCKIDQTQRKWVSHSTHRT FGKQQWQQ 
LYDTLNAWKQNLNKVKNS LLSLSDT 


13503 


27404 


A 


13652 


1 


377 


TTASGRSGVKGSTMSAEVPEAASAEEQK 
EMEDKVTSPEKAEEAKLKARYPHLGQKP 
GGSDFLRKRLQKGQKYFDSGDYNMAKAK 
MKNKQLPTAAPDKTEVTGDHIPTPQDLP 
QRKPSLVASKLAG S 


13504 


27405 


A 


13653 


424 


596 


SLKNIYGLSCRKKKKGAVKKI ILVQAWW 
LMPVI TVLWEAEVGGLLEARGLRPTRAT 
W 


13505 


27406 


A 


13654 


190 


448 


LRSYPAPHLGSPELRIRKGRGHSHCLAG 
AAGPQRTALCGLSAPLCPPS PTPPGAGA 
PRYCSGSDAPPCLIiRGAGPP I PGMGDPE 
TS 


13506 


27407 


A 


13657 


1612 


563 


SMPGWRIiLTQVGAQVLGRLGDGLGAALG 
PGNRTHIWLFVRGLHGKSGTWWDEHLSE 
ENVPFIKQLVSDEDKAQLASKLCPIiKDE 
P WP IHPWEPGS FRVGLIALKLGMMPLWT 
KDGQKHVVTIXQVQDCHVIiKYTSKENCN 
GKMATLS VGGKTVSRFRKATS ILEFYRE 
LGLP PKQTVKI FNT TDNAAI KPGT PLYA 
AHFRJPGQYVDOTAKTIGKGFQGVMKRWG 
FKGQPATHGQTKTHRRPGAVATGDIGRV 
WPGTKMPGKMGNIYRTEYGLKVWRINTK 
HNI I YVNGS VPGHKNCLVKVKDS KLPAY 
KDLGKNLPFPTYFPDGDEEELPEDLYDE 
NVCQPGAPS ITFA 


13507 


27408 


A 


13658 


128 


2626 


~ NSHRWVYVRARRWRRRGKQREQPEDRGV 
PMKRAAMAIiHSPQYIFGDFSPDEFNQFF 
VTPRSSVELPPYSGTVLCGTQAVDKLPD 
GQEYQRIEFGVDEVTEPSDTLPRTPSYS 
ISSTLNPQAPEFILGCTASKITPDGITK 

LENIXTVSGGLGQRERiOCfCKKRP PGYYS Y 
JjKDGGDDS I STEAL VNGHANSAVPNSVS 
AEDAEFMGDMP PS VT PRTCNS PQNSTDS 
VSDIVPDS PFPGALGSDTRTAGQPEGGP 
GADFGQSCFPAEAGRDTLSRTAGAQPCV 
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GTDTTENLGVANGQ I LESSGEGTATNGV 
ELHTTESIDLDPTKPESASPPADGTGSA 
SGTLPVSQPKSWASLFHDSKPSSSSPVA 
YVETKYS PPAI S PLVSEKQVEVKEGLVP 
VSEDPVAIFOABLLENVTLIHKPVSLQP 
RGLINKGNWCYINATLQALVACPPMYHL 
MKF I PLYS KVQRPCTSTPMI DS FVRLMN 
EFTNMPVPPKPRQALGDKIVRDIRPGAA 
FEPTYIYRLLTVNKSSLSEKGRQEDAEE 
YLGFILNGLHEEMLNLKKLLSPSNEKLT 
ISNGPKNHSVNEEEQEEQGEGSEDEWEQ 
VGPRNKTS VTRQAD FVQTP I TG I FGGH I 
RSWYQQSSKESATLQPFFTLQLDIQSD 
KIRTVQDALESLVARESVQGYTTKTKQE 
VEISRRVTLEKLPPVX.VLHLKRFVYEKT 
GGCQKLIKNIEYPVDIiEISKELLSPGVK 
N KN F KCHRTYRL FAWYHHGNS ATGGH Y 
TTDVFQIGLNGWLRIDDQTVKVINQYQV 
VKPTAERTAYLLYYRRVDLL 


13508 


27409 


A 


13659 


42 


382 


TLWLKTIQIYLTISLGLGSDYGIiAGFSA 
LGCHQAAVKMLTAFI LIQGLDLGRSHFQ 
AHSGCWQDSFPCDSRIHGGLLLHVQQES 
P WFQSAERVSC I TKCNHRSDTHHLCS I L 

! 


13509 


27410 


A 


13660 


178 


349 


DMGPCYVAQTVLKLLGSSDPPTSASQEA 
GTTCHHAHLLSHPFLMIYRNFSFIQCLE 
T 


13510 


27411 


A 


13661 


268 


2 


RQQKVILSSSGSLMSEMGRTVLPLKSPG 
MNPSSPLLASGGCWKSLACGHIIPFSSF 
ILMWPSPLCVSVSSSLPMRKPLLUILRS 
TLFQ 


13511 


27412 


A 


13662 


112 


483 


AGVGALRMVQRLTYRRRLSYNTASNKTR 
LSRTPGNRIVYLYTKKVGKAPKSACGVC 
PGRLRGVRAVRPKVLMRLSKTKKHVSRA 
YGGSMCAKCVRDRIKRAFLIEEQKIWK 
VTjKAQAQSQKAK 


13512 


27413 


A 


13663 


2 


873 


SVEEFDRGCTGRGCGADARAGAAMVKIS 
FQPAVAGIKGDKADKASASAPAPASATE 
I LLTPAREEQPPQHRS KRGSSVGGVCYL 
SMGMWLLMGLVFASVYIYRYFFLAQLA 
RDNFFRCGVLYEDSLSSQVRTQMELEED 
VKIYLDENYERINVPVPQFGGGDPADI I 
HDFQRGLTAYHD I SLDKCYVI ELNTTI V 
LPPRNFWELLMNVKRGTYLPQTYI iqee 
MWTEHVSDKEALGS FIYHLCNGKDTYR 
LRRRATRRRINKRGAKNCNAIRHFENTF 
WETLICGW 


13513 


27414 


A 


13664 


118 


3 


AWSLI PVIP WREAKAGGLLEPRSLRPT 
WATWQDPVST 


13514 


27415 


A 


13665 


1 


2876 


I RQRINFSRKNKWDSRKLDQAGVSELAT 
NQKLILVCGRYEGIDERVIQTEIDEEWS 
IGDYVLSGGELPAMTLIDSVSRFIPGVL 
GHEAS ATEDS FAEGLLDCPHYTRPEVLE 
GMEVPPVLLSGNHAE I RRWRLKQSLGRT 
WLRRPELLENLALTEEQARLLAEFKTEH 
AQQQHKHDGMAADEAGRTFLRADFNMIE 
EGDR IMVCLSGGKDS YTMLEI LRNLQQS 
APINFSLVAVNLDQKQPGFPEHVLPEYL 
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G=Glycine, H=Histidine, I=Isoieucine, 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














EKH3VE YKI VEENTYG I VKEKI PEGKTT 
CSLCSRLRRGILYRTATELGATKIALGH 
HRDDILQTLFLNMFYGGKMKGMPPKIjMS 
DDGKHIVIRPLAYCREKDIQRFADAKAP 
PI IPCNLCGSQPNLQRQVIADMLRDWDK 
RYPGRIETMFSAMQNWPSHLCDTNLFD 
FKGITHGSEWNGGDLAFDREEIPLQPA 
CWQPEEDENQIjDELRLNWEPDGPRHRH 
APDARRFTOYIiRHQARYLLSVVPNLDVA 
VTE PLANGDGLNVLI KREWGFRANTVE 
KTGENQYRVWPNEMPADLHKIRPHHPIiN 
RNLDHFFPLDLTNSTFSGGYVHVIjKGVL 
SDDLLLKSFQKMGYVRRDSHRLMVTALP 
PACQLVQVAI1GCFAI1RI1ECE ILGEVLAQ 
LGTSVLPAEELLQARRASGDVASCQRLA 
QDEEPPPLP PRGSPAAYRAPLDLYRDLQ 
EDEGSEDAS L YGE PS PGPDS P PAELAYR 
PPIiWEQSAKLWGTGGRAWEPPAEELPQA 
SSPPYGALEEGLEPEPSAFSFLSLRREL 
SRPGDLATPESSAAAS PRRIRAEGVPAS 
AYRSVSEP PGYQGTQLPVPWRPAHPLLR 
HLSPAACCPLCSPARLPSRPLAACAAWR 
RPAALVATACTDGHSALQQPRGPALGQL 
GPRSRALGRQLPWCFSIiI/SRGLGLWIiPC 
PWGSPGHSRGLES PGQPLLEVGIoS VYGR 
LQGQQA 


13515 


27416 


A 


13667 


76 


289 


SGTPAS PCLEMDPNCSCSPVGSCACAGS 
CKCKECKCTSCKKS CCS CCPVGCAKCAQ 
GCICKGTSDKCSCCA 


13516 


27417 


A 


13669 


2 


667 


GRVDAEQSRLGATBRAAAAAMNPEYDYL 
FKLI^IGDSGVGKSCIjIiLRFADDTYTES 
YI STIGVDFKIRTIELDGKTI KLQI WDT 
AGQERFRTITSSYYRGAHGIIWYDVTD 
QESYANVKQWLQEIDRYASENVNKLLVG 
NKSDLTTKKVVDNTTAKEFADSLGI PFL 
ETSAKNATNVEQAFMTMAAEI KKRMGPG 
AASGGERPNLKIDSTPVKPAGGGCC 


13517 


27418 


A 


13670 


176 


398 


RIIiKTQLQENNQPTTTTKNRQKTRTDTIi 
PKGI YkRHISLWIUjGTWL I P V IPALiUKA 
NAGGRLESRSSRPVWATY 


13518 


27419 


A 


13671 


149 


247 


RKGLALS PRLDCSGAI TVNCS XiNXiQUijb 
NIPI 


13519 


27420 


A 


13672 


257 


359 


VASLHGWAQWLTPVI PTLWKTKVGEFFE 
PRCSRP 


13520 


27421 


A 


13673 


142 


1 


HLKGSGLAKHQWLMPVI PALWEAEVGGS 
LEPRSSRPVWATLQDPISS 


13521 


27422 


A 


13674 


619 


168 


MFI ESCCTPKI YTS FVLRQSHSFAQAGV 
QWLDLGSLQPLSLGSSNSPASASCIQVL 
FVIYTSVKLGVGGRLGNKKVMSSNKKFV 
CECRGWI ILPFSVPSLRFQDGGWDHGVS 
SAEKALQAGRTAGHRVGVPPLGAPCGGA 
PCTTPPAWRS 


13522 


27423 


A 


13675 


3 


370 


PSSEGRADLLSPALEEGYPPPASGCFIP 
HFLCGCSIFNCLPPIMRHAHKSPDALLE 
TLGPCRAWRLMPVT PTLWEAKVDRLLiES 
RSSRPSWPTW 


13523 


27424 


A 


13676 


3 


281 


IGIiRPELSVSIiGRQRLTAIRLQPLLPAR 
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C=Cysteine, D-Aspartic Acid, 
E=Glutamic Acid, {^Phenylalanine, 
G=Glycine, H=Histidine, I=Isofeucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R-Arginine, S=Serine, 
T^Tbreonine, V^Valine, 
W=Tryptophan, Y=*Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \= possible 
nucleotide insertion 














TPCX3FLRARVALALMADETDLPLLPRLE 
YSGMITAYCSLDFLGSSDPPTLGSGVAG 
TTENCCED 


13524 


27425 


A 


13677 


272 


389 


YCRQ I E KS KKKRCWLGTVAHACNPSTWG 
GRGGRITRSRD 


13525 


27426 


A 


13678 


261 


363 


NSSWGHARWLTS VI PALWEAEVGGSPEV 
RSLRPV 


13526 


27427 


A 


13679 


149 


44 


SCVMSVSVLKKMYWIiGAVAHACNPSTLG 
GQGGRTA 


13527 


27428 


C 


13680 


118 


267 


MSHCTWI^CTCVCVCVCGVYAYMYTYS F 
LRWDLPMLSRLECSGYSQAQ* * 


13528 


27429 


A 


13682 


16 

» 


346 


NHHLTQPHPHLDINDFFFFFFFFFFFFF 
FFFFFFFFFFFLKKS PRPAGKKKKKKKK 
KKKPPRFFFLGGAPPPLFFFLINFFFFF 
LCKI ISLVYNRRGGPCLSKRFFLKERRS 
PL | 


13529 


27430 


A 


13683 


244 ■ 


140 


GRVDVFHHVAQAGLKLLSSGDLPAWPPK 
ELRLQV 


13530 


27431 


A 


13684 


191 


777 • 


NSDBHVYRCLYGHKLSFLLS KYLKSGLA 
GVLLPLDAAVDMEK I EEQFANLHI VKCS 
LGTKEPTYI1LGIDTSKTVQAGKENLVAV 
LCSNGSIRIYDKERLNVLREFSGYPGLL 
NGVRFANS CDS VYS ACTDGTVKCWDARV 
AREKPVQLFKGYPSNIFISFDINCNDHI 
I CAGTE KVDDDAL FGGFGDARMS FLQN 


13531 


27432 


A 


13685 


150 


253 


VGWGLWLVPVIPAFWEAKMGESLDPRNL 
LPCWAT 


13532 


27433 


A 


13686 


500 


321 


IEMGSHYVAQAGIjELLSSSNPPTSVSQS 
AGIVGMSHHALAAISKSSDASITSHHPM 
TTP 


13533 


27434 


A 


13687 


6 


424 


MSLLQCGGIRCFKMPEPAKSAPAPKKGS 
KKAVTKAQKKDGKKRKRSRKESYSVYVY 
KVLKQVHPDTG I S S KAMGIMNS FVND I F 
ERIAGEASRLAHYNKRSTITSREIQTAV 
RLLLPGEIAKHAVSEGTKAVTKYTSSK 


13534 


27435 


A 


13688 


178 


293 


TGYSSQAQWLMTVLTATWEAEAGGSLQP 
RSLRPAWATQ 


13535 


27436 


A 


13689 


663 


144 


KELSAVSAGIPHSCGSQGCGGGSVAACV 
PAAPAAAGLCSGRAQECVPPPPSLAGWPP 
GVNAPPPPVCSSVRLHVCQSDRLWVRLA 
ARRGILALLRSALKAATTiAGCQSVRWSV 
RPSESLRPTSNAASLFRSSVPTVLSHSV 
PLAASLGKRRACGGREHASVAVYLSVCL 
SLPT 


15536 


27437 


A 


13690 


125 


3 


FAHQGHAPGQAWWLRP VI PVLREAELGG 
RPEPRSSRTARA 


13537 


27438 


A 


13691 


136 


24 


GGPPFFFFFFFFFFFFFFFFFFFFFFFF 
FFPGKLQQDFQYYVK 


13538 


27439 


A 


13692 


144 


I 


MLKIVQSGECLTLKFKNFCLIjSLCTLFP 
TLIALTTLLLPISPFILIIIi 


13539 


27440 


A 


13693 


1476 


380 


NSWSTLASELTLWAATSADPERKSQAAS 
AAMWATLPLLCAGAWLLGVPVCGAAELC 
WSLEKFHFKSWMSKHRKTYSTEEYHHR 

lqtfasnwrkinahnngnhtfkmalnqf 
sdmsfae i khkylwsepqncsatksnyl 
rgtgpyppsvdwrkkgnfvspvkctqgac 
gscwtfsttgalesaiaiatgkmlslae 
qqlvdcaqdfnnhgcqgglpsqafeyiii 
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X=Un known, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














FAFEVTQDFMMYRTGIYSSTSCHKTPDK 
VNHAVIiAVGYGEKNGIPYWIVKNSWGPO 
WGMNGYFLIERGKNMCGIiAACAS YP IPL 
V 


13540 


Z7441 






Of 


z 


PGWAWWFTPVI LALWETEAGGS PEFKS L 


13541 


27442 


A 


13696 


130 


2 


CNILI 1 1 IQVCWAWWLMPVI PALWHAEP 
GGSPEVESSRPALV 


13542 


27443 


TV 


ISbyf 




lU/ 


\jXj X jV*~. ±r \j X v v_l\.o Jj xv v viol ^ i_»o -L J. v — uoi i 

MEHSGTILAHCSLKLLGSSDPPTSVSRV 

AO 


13543 


27444 


A 


13698 


198 


3 


C^ITVNKVKVMTLFRSSMSLEIVCXJIDL 
AVLRRDLGSLQDPPPRFTPIiSCLSLPSS 
WDYRRPLV 


13544 


27445 


A 


13699 


*> 


1Z0 


GSSDKNFKTRF 


13545 


27446 


A 


13700 


60 


341 


PDMGLEDERKMLTESGDPEEEEEEQEEL 
VDPLTTVREQCEQLrQKCVKARERLEIiCD 
ERVSSRSHTDDDCPEEIiFDFLHAKDHCV 
ALKLFNNLQ 


13546 


27447 


A 


13701 


6 


540 


KNSRTLGQCGGIRGSLCRPRKPGVGGTQ 

LVLLGRVPAHPDSRCWFLAWNPAGTLLA 

QRTVRKVAWS PCGNYLAS AS FDATTCI W 
KKTjTIjR TYNI IiRKLEACI KP LCCALKYG 
CLEEKQLHS 


13547 


27448 


A 


13702 


239 


346 


AKRGWLMPVI PALWEAKVGKSLEPRISR 
PLWATWQN 


13548 


27449 


A 


13703 


210 


300 


J-tt*ih* V X OiUJnJUUxnKwXiii Xa xrlv X UX\JTJt\rlf\± 

w 


13549 


27450 


A 


13704 


141 


440 


PSAFEHFEKINLYFLKFCISQGFPERQN 

LAHVIVEASKPHHRPFITWETLGCWWSG 
S VQVQKP WE PGKLMV 


13550 


27451 


A 


13705 


204 


438 


UiNVLLTQLFFLFLSLRQSCSVAQARGQ 
WYNHGSTjOP S THGASNP P TS AS OSVGTT 
GMSHDHGLFLHFSTLDI FSSL 


ncc i 


Z/4DZ 


^ 


I J /vO 


1707 




AKAGGSQHLE I LANAVKP CL YWECKMAG 
PWWCAPVGG 


13552 


27453 


A 


13707 


167 


334 


IRRANFKI LTE IGWTQWLTP VI PALWEA 
EGGGLl^PGSPSLVFLFFILTTIMRKK 


13553 


27454 


A 


13708 


224 


344 


KDTAMEEEIKDTEKTGKPWLGAVAHACN 
PSTLGDQGRQIA 


13554 


27455 


A 


13709 


1476 


380 


NSWSTLASELTLWAATSADPERKSQAAS 
AAl^ATLPLLCAGAWlxLGVPVGGAAELC 
TOSIiEKFOTKSWMSKHRKTYSTEEYHHR 
U^FASNWRKINAHNNGl^FKMALNQF 
SDMS FAEI KHKYLWSEPQNCSATKSNYL 
RGTGP YPPSVDWRKKGNFVS PVKNQGAC 
GSCWTFSTTGAiESAIAIATGKMl^IiAE 
QQLVDCAjQDFNNHGCQGGLPSQAFEYIL 
YI^GIMGEDTYPYQGKDGYCKFQPGKAI 
GFVKDVANIT I YDE EAMVEAVALYNP VS 
FAFEVTQDFMMYRTGIYSSTSCHKTPDK 
VNHAVLAVGYGEKNGIPYWIVKNSWGPQ 
WGMNGYFLIERGKNMCGLAACASYPIPL 
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G=Giycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=ProIine, 
v^Hjriutamine, K^Arginine, o— oenne, 

T—Thronnina V— \7nlSno 

i — inreonine, v— valine, 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 














V 




2/4DO 


A 


13710 


763 


257 


Y EK1 Li KJjTADAKFE SGD VKATVaVLS F I 
LSSAAKHSVDGBSLSSEIiQQLGLPKEHA 
ASLCRCYEEKQS PLQKHLRVCSLRMNRL 
AGVGWRVDYTLS SSLLQIRGKS PWCHCG 
W K.V A x v-Py GPy AF ACLtPt-P r fa fa KTKFPR 
SSLARTESRPKTLMKIiPWABEKGVPRPV 






A 


1371 1 


133 


401 


VLifa KoGNP iWGiiSTNHB IKGQEGCAWQL 
TPVIPALWEAEAGGSLEPRSLRPAWATS 
WLPREIKLDLLKNGCGPSKITQGGPMVA 


13557 


27458 


A 


13712 


212 


74 


RRINLAPPRVFLKGPPFFFFFFFFFFFF 

T? C*3^ T . T7T .T7M Q T T?/"* DT 

r r rtr» vjiM jr J_> r i_ir iNo i rvjKl 


13558 


27459 


A 


13713 


324 


141 


DRVLPRLECSGVTTARYRLNLLGSRDPP 
FIIL 


UJJ7 






13/14 




Alt 


J. rCLiJL W JLio A v UHAL.IM fa fa 1 J-AjOriv* Vy M A JMj 

QDFETNLANTANPHPY 








13/1 j 


133 


1 


X JjV WAW W JJplPA J. P ViiWlSAci 1 rAjlxLibPKb 
LRPAWATWKNTISTKN 


13561 


27462 


A 


13716 


18 


165 


I PLHTHTHTHTHTHTHGYIiLALSGTEFL 


13562 


27463 


A 


13717 


119 


1 


KPNVSQAQWLAPVI PALWEARVGVLiFKP 
RSLRPTWATQQ 


1 J JO J 


27464 


A 


13718 


271 


64 


VJjRS I STIjNEILAKTRNPPIiQILMKPRK 
RWGTVAHLRAPVGDQPDQHSETPSLIiKI 
TKISWVWWRTPVI 


133D4 


^74oD 


A 


13719 


21 


209 


KXJGPKG 1 GGSETNES VEMTECCSVTQAG 

VQWHDLGSLQPPPPGFKRFSCLGLPSSW 
nvQDMn 


13565 


27466 


A 


13720 


391 


151 


RGFFLKGEKKKNFFPIILGKKILGSPGK 
KGEIKKKKKRGKI FFFIiKNPLGFFPKKKV 
LGEKKNPYSGVWGGKKKRGPPPHP 


1 J 000 


z/4o/ 


A 


13721 


109 


2 


KKI^VvaHAC^PSTI/jGEVGGSQGPGNC 
GQVTKVS 


13567 


27468 


A 


13722 


69 


369 


RDILQADGAHSLFPTTPSLNAYIIFSSP 
IGPHTHRPYHATPTYLPFYTNNLIKKKK 
KKKKKKKKKKKKKKKKKKKKKKKGGGPF 
KKTPGG AP I FGGGG k 


13568 


27469 


A 


13723 


58 


1208 


FWNENS PAS ELAPNGGGS VTS VPRLED Y 
LTRPQLYKLTGVEGPSRANSRDS FHDRK 
TYKTPSANMMVLKVEEIiVTGKKNGNGEA 
GEFLPEDFRIXjEYKAAVTLEKQEDLKTL 
LAHPVTLGEQQWKSEKQREAELKKKKLE 
QRSKLENLEDLEI I IQLKKRKKYRKTKV 
P WKEPEPE I ITEPVDVPTFLKAALENK 
LPWEKFLSDKNNPDVCDEYKRTALHRA 
{.JuttuJklUAl V nKXiMEAGAQXE r RDMLEST 
AIHWASRGGNLDVLKLLLNKGAKISARD 
KLLST ALHVAVRTGHYE CAEHL IACEAD 
LNAKDREGDTPLHDAVRLNRYKMIRLLI 














M YGADJaNI KNCAGKTPMDLVLHWQNGTK 
AIFDSLRENSYKTSRIATF 


13569 


27470 


A 


13724 


400 


135 


KWNGKBLNGVDGIGWDWNEMYWKGLEFH 
ETEWNBLERNGLEWNGMSWDGI KCNGI K 
WNGIECSGMLWNGMEClGMECPIiMEWIR 
VEWN 


13570 


27471 


A 


13726 


186 


344 


SAStiGLWRCRDCRRSLVHSVLNVAQAGV 
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nucleotide deletion, V=possibIe 
nucleotide insertion 














QWRDLSSLQPPPPSRLPWPPKVIjR j 


13571 


27472 


A 


13727 


380 


168 


APPQKKVFSSKFPQGFFFAPPNRKKLFF 
SPPRKIVPPPKFFLWAPPLFFFFFFFFF 
FFFFFFWYSRTDYK 


11S79 


<£ l*f IJ 




1179R 


194 


219 


RIMFFI FFYFFFFFFFFVFFLFFFFFKR 
PPPFFY 


11S71 


11 Aid 






9 


970 


RLQEASIiSPVPEIIjGGALPSAKRTSYIjL 

IaSTLCLIjIiSETAS rgnsltglghrsdhy 

KS VRSGGQSLYSACP I VTTIHGTCYGGK 
AKCCK 


13574 


27475 


A 


13730 


134 


3 


AVHRCKKWGRGWGRWLTPVI PTLWEAE V 
GGPU2PRSLRPALAT 


13575 


27476 


A 


13731 


429 


641 


RKKAVCFMNDLICFLDNTFKNNVLiSQAW 
WCVHLVPTIWKAEAGGSLEPRSLKLQCP 
WAPVWNCTPAWAT 


13576 


01 All 

It 1 1 


A 


117^9 


188 


32 


IFFFFLFFYFLFFFFFFFPRSRHCTLQP 
GQQSETRSQNKKQKTKNKQTKNIiP 


13577 


27478 


A 


13733 


341 


3 


GTVTPBEPKKMGDQTPRGLPRPERTLGR 
LSTS ESRVDPTPRVGGGSSGGRTAPWGP 
PRERGRVAGSSLHRACNFSTSNTIISVG 
LWLTPVI PTLGRAEAGGSPEVRS STHAS 


11578 

IDJ i o 


97470 






208 


9 


KLYKNFFFRFTKNPPPLLKKGFYLYNFF 
FKKKKKIVKKKKKKKKKKKKQKKKKKKK 
GRPLAI KNYPRV 


11V79 


974X0 




11715 


16 

1U 


919 


EGVSLLLPRLECRGT ISAHCNLRHRGRR 
CTPASASQILRRLKQENHLNLGGGGYRD 
PIKLQRTAS 


13580 


27481 


A 


13736 


22 


214 


PDRPALPTRPGGWITWGREVRDQPGQCG 
ETPSLLKIEKLAGCDGTCLEQNCLQNNN 
SNNKVGIR 


13581 


27482 


A 


13737 


386 


51 


WGNSITQQALFEHLCFSGHTSLPAVPCP 
HQACFPTLPLICRGIIIAHCNLGIjIiGSS 
DS P I S AS P IDGNTGLHPHARL FS FLLWY 
RWGLTMMPKRSLS PGLKPSSRLADAWV 


13582 


27483 


A 


13738 


204 


431 


S QHFGI S RRADKLS SNVGDQ PEQHGKNS 
LYKYKKINQGWAWWLTP VI P PLWEGQVG 
GSPEGKELKTPLRNLGKPHL 


13583 


27484 


A 


13739 


69 


159 


IKKEALSWAWWLTPVIPALWKAKAGGSP 
E 


13584 


27485 


A 


13740 


32 


285 


FFLSFFLFFFFFFGGGGKKKTKKFFSQD 
FLGPPKILGGGGFFSFFFGGPFFLKKGK 
KSPQNLFFLKKNPPPFFFPPPKKPLWGF 


13585 


27486 


A 


13741 


372 


213 


YRSCVLXiQQNPPLFYQSWWNFSLCVCVC 
VCVCTCTCETGSLKKFLLTQKKKR 


13586 


27487 


A 


13742 


29 


237 


'TKRKGYKTNKGYWAWAWWLT PVI PALFI 
AEGNLILQMQKQAQPEAGCGCVCVCVCA 
CMCVCVCVFKVFF 


13587 


27488 


A 


13743 


322 


183 


HTHTHTHTHTHTHTHTNPIHVFWLLLKY 
PKIQSHQFHISAFQDPLL 


13588 


27489 


A 


13744 


305 


373 


VGWGWWLTP VI PALWEAKVGGS 


13589 


27490 


A 


13745 


172 


1 


KNKIEEQMKWEMRQEAVAHSCNPSTLGG 
QGEFETSLVNMMKPCLYKKMQKISQVCS 
C 


13590 


27491 


A 


13746 


285 


1045 


FQHPFGLSQSEMAAVKASTSKATRPWYS 
HPVYARYWQHYHQAMAWMQSHHNAYRKA 
VESCFNLPWYLPSALLPQSSYDNEAAYP 
QSFYDHHVAWQDYPCSSSHFRRSGQHPR 
YSSRIQASTKEDQALSKEEEMETESDAE 
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M 
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od 


SEQID 
NO: in 
USSN 
09/515,1 
26 
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beginning 

nucleotide 

location 

correspond 

ing to first 

omjnn qpiH 
ailllUU dClU 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
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correspon 
uing w 

last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F-Phenylalanine, 
G=Glycine, H=Histidine, l=Iso leucine, 
K=Lysine, L=Leucine, M=Metbionine, 
N=Asparagine, P=Proline, 
V^HjiuiarninG, tv— /vrginine, o — oerine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, Wpossible 
nucleotide insertion 














VECDIiSNMEITEELRQYFAETERHREER 
RRQQQLDAERLDS YVNADHDLYCNTRRS 
VEAPTERPGERRQAEMKRLYGDSAAKIQ 

F 


13591 


27492 


A 


13747 


2 


305 


GRVGSFSVRDVELSDPARERGEMPVAVG 
PYGQSQPSCFDRVKMGFVMGCAVGMAAG 

^T.T7n'P'CCr^T 13 TPMDPDt?! M/"V» Trf™* VT»KnuH"> 

-ftJUr ij 1 r oULjKlUMKtjKJliiMtjvjlGKTMMQ 
SGGTFGTFMAIGMGIRC 


13592 


27493 


A 


13748 


2 


305 


GRVGSFSVRDVELSDPARERGEMPVAVG 

ALFGTFSCLRIGMRGRELMGGIGKTMMQ 
SGGTFGTFMAIGMGIRC 


13593 




A 




7**« 




AVijWlA^ij iTWGTGVQBKKIQAQAWGLMP V 
IPTLWEGEVGGSPEVKSSRPACPHGKTP 


13594 


27495 


A 


13751 


455 


248 


ISVGPGIFQLNFCFLEPYGTFLSEKKFF 
IMETGSRFVIQAGVQWGS YSS PQPQFFG 

r JaOOXlJSJrlrClOVyV IV 


13595 


27496 


A 


13752 


2 


98 


IPTPTIHTHTHSHTHTHTHTHTSFLYMP 
PDLK 


13596 


27497 


A 


13753 


196 


2 


I WAPPKI FLKKPPLFFFFFFFFFFFFFF 
FFFFPFFIISFIPWPLTRKQKLERSWVF 
FKDSACSA 


13597 


27498 


A 


13754 


917 


379 


KYKKCS LQKNLLLVGCKKYS LCYRRHKL 
HJj V THGERKKP AAVNS F FL F FS FFFFLN 
LVRNTEITKKRVWLHETKADAESCNQDC 
l lobJl lAbttJjKUXKbCaKiirlbVjKWAGEGA 
AG6RREGERMEWTEMRRGARGRRGGRER 

ETKLGNLLKLG 


13598 


27499 


A 


13755 


175 


21 


KKKNFFFPPRVILGPPKVFFKRAPLFFF 
FFFFFFFFFFFFFLKKSWRPLAI 


13599 


27500 


A 


13757 


234 


341 


EIGWVQWLMPVI PAVWEAEVGGLLEPRS 

T.R7A7AP7AT 


13600 


27501 


A 


13758 


1 


365 


PAPNRRGGHIQDRATNSTELGANQCFFF 
o fx\jr f oJj IS l\J\. 1 Iris J..N KJS f K irAx'yfc'oJNPG 
NI/5TREGGDSWAGTTRCLRRTDEGDTYR 
TEPPTALSWGQTRAFFFPPALPAGKKRH 


13601 


27502 


A 


13759 


87 


181 


SHTHTHTHTHTHTHTF YIiHVVIHFDME I 
LGL 


13602 




A 








ZVPfTRPPPT?PPPT?'DI?0'E , DPPPDl?D'E»Dt?D'C 

RYREGGRLLLTMGEGETERATDLYHTPP 
PSIKAWRLCEPRPRAGILCGRNIF 


13603 


27504 


A 


13761 




1Q7 


MGEEDFLEDPVIAFVRLAPAVLLSKLSE 
VSVAMT 


13604 


27505 


A 


13762 


276 


57 


YAVLGGGGGRKNLFVFVFFLFFFFFFFF 
FKKQGFGLMPLGEDRGPNMGFFYFPMVG 

DKTKTiPFTCK'K'WirP DP 


13605 


27506 


A 


13763 


377 


209 


PRPAYGPAFLFTEGFSWNPFFFFQMESR 
S VAQAGVQWCYLGS LQ PPPPWF PGTLLN 


13606 


27507 


A 


13764 


160 


202 


MEKYNVHPHSGILHSHEKEQAALFTIAK 
RWRQPS YPS IDEWKNIMS IHTVE YYTAM 
KRNKQLYSQ 


13607 


27508 


A 


13765 


390 


284 


ESGGHFLSLSLSIYIYIHIYIYVCVYIY 
THTSHTI 
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nucleotide 

location 

correspond 
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peptide 
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ed end 
nucle- 
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ding to 
last amino 
add 
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sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenyla!anine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proiine, 
0=Cwlutamine. R— Ar^inine. S=Serine. 
T=Threonine, V=Va1ine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V^possible 
nucleotide insertion 


13608 


27509 


A 


13766 


350 


41 


WVIIiQENRNTQNRKLLPKSTEIiCMLIiQ 
MLLFIIjNTTOYPLPLWQCYHRFCSFFLK 
TGSFYVTQAKVRWLFTGMITVHCSLKLL 
DSSHP PKWLVLQELATVAS 


13609 ! 


27510 


A 


13767 


330 


156 


KITQAWWCVHWPDNREAEVTGVIRLGR 
MRLPLEGACTNCVPSWEKEQDPVSHNEK 
K 


1 QA1 A 
UOIU 


77^1 1 


a 


1 3 /DO 


7^ 


709 


KCFFLSWRGGSRLOSOHFGRPRRADHLR 
PGVPDQPGQHGFLLVKMSDKPDLSEVEK 
FDRS KLKKTNTEEKNTLPSKK I FFSLVG 
VNIQD 


13611 


27512 


A 


13769 


585 


680 


KCLGSRTRWLTPVI PTLWEAEAGGSLRP 
RSS 


1 JOIZ 


£. I J ID 




1^77ft 

13/ 71/ 


too 


JJ7 


PSPRERKFLPFFKKKKGWGPPPP PKKNR 
GRGPQQKEGPQKPKP 


13613 


27514 


A 


13771 


794 


515 


PDMGLBDEQKMLTESGDPEEEEEEEEBL 
VDPLTTVREQCEQLEKCVKARERLELCD 
ERVSSRSHTEBDCTEELFDFLHARDHCV 
AHKLFNNLK 


13614 


27515 


A 


13772 


130 


397 


WGLTPLiCFCMRASLPRHS KPQARKTKK 
KKKKKKKKKKKKKGGSLKKKIFGGAKKS 
GATKKKNFPKKRGQNKKNPGFFKNEIFF 
GGGAI 


HAH 
1 Jul J 


ZiJ 10 


j\ 








TRVGGGCS EQRLCHCTPS WATE PNPVSN 
NNKIPLEIiFPSNQTFFHFVRNWKKRGNN 
WLSNYS I S STVTRHFPTLFQF 


1 3OI0 


77<517 






1 




GLQS LADLLSGLLQKVCLPLSNG I MDL Y 
LLYYLFTFETGSCSVAQARVQWCEHGSL 
QPHTSASSDPPTSATTTCHHARLIIW 


13617 


27518 


A 


13775 


376 


145 


TRVGGGCSEQRLCHCTPSWATEPNPVSN 
NNKI PLELFPSNQTFFHFVRNWKKRGNN 
WLSNYS I SSTVTRHFPTTiPOF 

njjun -± %m* ^ a * a wn^ *- Alii \£J> 


13618 


27519 


A 


13776 


107 


46S 


EKARSPERARDQEGGTERDRNSEREKIL 
PKLREELPWVSGGWRCPWPHQGWSHWED 
NPGWGIPTGPSVGWGEKKGPGEGRSHKY 
GTGRTKCEIjGVS IGNSAFT FLLHFYFKH 
RKRERKQI 


13619 


27520 


A 


13777 


125 


2 


NIFFPLFELYYFKCHWARWLTPVI PALW 
EAKEGGSPEVRSS 


13620 


27521 


A 


13778 


166 


74 


GRICYSHSLSKHTHTHTHTHTHTSHISF 
IP 


1 "}£7l 




A 


1 V77Q 


i 
i 




ARGERERERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERERLS FSLGGGGALKKKRI FLCVTLB 
GKHLPPLTHTGFFGENiiiTiiSMKTREGG 
FICAGGDERRSASVMRAYIYRECEPHTH 
V 


13622 


27523 


A 


13780 


'614 


337 


RRGS S ALCYRRHGNHKVKJ RS KQGASV S 
PHEHLRILSLLEVRNVGLGHSIFPHSLR 
MYVCMYVCMYVCISIYtiSIYLSIYIiSiy 
LSISISSR 


13623 


27524 


A 


13781 


390 


158 


VLIRLSWGRICFQAHSGCWQNSGSCSHK 
TEGLCIiLLAGSCSWLLGGALKGWASKEE 
FYTKQHSCQSDTLSPLPYSIH 


13624 


27525 


A 


13782 


308 


163 


TKFY LLFFFFFFFLiFI FFFFFFFLKKKF 
LFFIQGGGKIKTPGTLTFLG 


13625 


27526 


A 


13783 


45 


277 


IASGRPFFFFLHLPPFQKAFVFGGGGGT 
PFEYQNFVAYTKGKQGNPFLFGCGDLFN 
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Amino acid sequence (A=Alanine 
0=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pheny (alanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 

(~\ — (~Z lilt') rrt I n o DsArainino CsCavSnA 

v^— unu uuaine, rv^Arginine, o— oerme, 
T=Thref»nin<> V— V^linp 

W=Tryptophan, Y=Tyrosine, 

X=Un known, *=Stop codon, /^possible 

iiuncvuuc ucrcLIOil} \— possiDie 

nucleotide insertion 














APQLI KQLSQLGQKYPKKPLL 


13626 


27527 


A 


13784 


413 


218 


PNFPPFLFFSGKFFFFAQNLPFGFFPPP 
GEGKAFPGQGCPV6GGGFAGPTFFFGPP 
APPRTI.YPP 


13627 


27528 


A 


13785 


1011 


1368 


IDWENDLTPMVLCHGPSI YNKYS FSQNI 
FPISANFFFIAHINLTLLQLVSELSHKI 

ILRHFYIKPIKDILNLYLILPGVWLFDL 

nfCTi Tr'DVT.nrr'TVTUT vTUTU'i'UTtrT 
iaj.ui<.x±a,vv,x xXnlji xHIHirilHI 

YTHIRL 


13628 


27529 


A 


13786 


161 


425 


RWGS CYCPGYPHLKI FKLLLLLVLVLEM 

uV ijlrJjJjirKJLuSVJoVTn J. X VrlCW liiii? JUG TS x 

PPISVSQTKKKKRPFTLPPLPNLQIGLP 
LRGE 


13629 


27530 


A 


13787 


70 


ji i 


nTiT^TTATiVT^P'Knr'R'VTVT QTVT CTVT O 
XAUXOX JL/WiVXOlriMivJ: XXXJj&X XJ-ioX Xiao 

I YLS I IYLIFISTNLLWVLSLWETLNKT 
RLLNLFIjTIQNNI wnpkllvsali 


13630 


27531 


A 


13788 


25 


219 


RNMAAATLTS KLYSLLFRRTSTFALTI I 

Vf7VMFl?'R'P21TnVV22vr»2i TvninATPr'm' xatr 
vvavnr r Eit\tw u\J\M\Ut\l. lUti VffEAjJvLnlx 

HIKHKYENK 


13631 


27532 


A 


13789 


279 


1 JO 


FTPFS CLS LPSSWDYRPF 


T3632 


27533 


A 


13790 


138 


28 


GGWVRWLTPVI PALWEARVGGLLEPGSS 


13633 


27534 


A 


13791 


133 


7 


YSCQRCLTHGQAQWLTTVISAPWEAKAG 

KuJjurlvOVKDiinni. 


13634 


27535 


A 


13792 


86 


290 


EHVHRTMEASVSGWGWWLTPVISAAWE 
TTES LGS FAATS 


13635 


27536 


A 


13793 




17 


PSSSRIAWAV 


13636 


27537 


A 


13794 


216 


77 


CLLPEGLIjRAVTPWCHAP 


13637 


27538 


A 


13795 


251 


347 


VARAWGLAPVI PALWEAEVGGq PRfiR <3T 
KPAL 


13638 


27539 


A 


13797 


177 


274 


VQWHMSWPAIQEAEVGGSPEPRSLRPA 
WATY 


13639 


27540 


A 


13798 


307 


406 


KFKKGWAWWLMPVI PALWEAKAGGSPES 
RSLRP 


13640 


27541 


A 


13799 


194 


3 


FFFFFFFFFEMGSHYIVQARVQWLFTDA 
NIVHCSLQLLASSDPPVSTSQVGLQACA 
DDAQNPE 


13641 


27542 


A 


13800 


333 


198 


LPS VF FFF FQTES CS VTQAGVE WCDLGS 
TjHS SLGNRARLCIjKT K 


13642 


27543 


A 


13801 


1 


335 


ETERGRERETERGRERETERGRERETER 
GRERERERERERERERDREEGRERQRGE 
GKKNRDREENRRRETERVGDVVCVCPHP 
NLILNCS PYISHVLREGLGKTYLNYGGS 
FPTIiCSG 


13643 




A 




Id. 




K. v if r JJKo K I KoK 1 Xiljro 1 nAo AXiiAvJ. XjVY 

p^hlisslix:legiii^lfiiatlitln 
thsllani vp iailvfaaceaavglall 

ffpyrepnknlpgnlgkepflgggdilg 

QPPYKN 


13644 


2754,5 


A 


13804 


1474 


1661 


TDFHNSHLLKCCKNCIFSTLNYYIRREH 
FS IVFIFCYICWKVHYI RRELFRCLYF 
LLYMCC 


13645 


27546 


A 


13805 


360 


3 


L WG KRGG P FS P P P FF FFLE LFF FLKKMG 
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N=Asparagine, ^Proline, 
v==v>rlu famine, K— Argimne, ^^^rine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=*Stop cod on, /^possible 
nucleotide deletion impossible 
nucleotide insertion 














GGGFFGGFPWGPPKGEIFFFVFFWAPQK 
KKKKKPPPFIFWGGGPFFFFFFPPPKQK 
KKKNKXKKKKKQKKKKKKKKKKAARWPA 
RSRSSA 


13646 


27547 


A 


13806 


1 


425 


Or p/ilfAT D O 7V 7» TV TV ~CX 7C? f , 'CC/~V i /- , '^'T T "D TYT 

KIAjO v A 1 ikSAAIK-iAK v £? yj£j r UvjHaJcvLiIjK W 

CLVTDFPPDSCTTCSYSRRSTPGCSPGG 
SRGLSEGBGSSVSLQRSRVLSJ^IKHVIiN 
LYLLGWLTLLS I FVRVMES LEGLLES P 
SPGTSWTTRSQLANTEPTKGLPDHPSRS 
M 


13647 


27548 


A 


13807 


5 


313 


DVTIVT /VIT'PUT'O yi*'T*LTT T VP WDVPT TU 

hilsJ? YLtytjilrloKi xi-Lbx xlsrxi'x 1 XI V 
KEYSTNYVLLIGNTQITKIYHIVFRKFS 
MTP I YKFQDLAQWIjTPVI ptlweakpgg 
LLBPRSLQSSYSLIXLIL 


13648 


27549 


A 


13808 


182 


314 


vitqldkteraqwlmpvilvlyeaevgg 

LLEAMSSRPAWATKTW 


13649 


27550 


A 


13809 


219 


1812 


lpppesgamsgfnfggtgaptggftfgt 

AKTATTTPATOF S F ST SGTGGFNFGAP F 
QPATSTPSTGIxFSLATQTPATQTTGFTF 
GTATLASGGTGFFIjGIGAS KLNIiSNTAA 
TPAMANPSGFGLGSSNLTNAI SSTVTSS 
QGTAPTGFVFGPSTTSVAPATTSGGFSF 
TGGSTAQPSGFNIGSAGNSAQPTAPATL 
PLTPATPAATTAGATQPAAPTPTATITS 
TGPSLFASIATAPTSSATTGLSLCTPVT 
TAGAPTAGTQGFSIjKAPGAASGTSTTTS 

taatatattttsssttgfalnlkplapa 
gipsntaaavtappgpgaaagaaassam 
tyaqleslinkwsleledqerhflqqat 

QVNAWDRTLIENGEKiTSLHRKVEiwKIi 
DQKRLDQELDFILSQQKELEDLLSPLEE 

T TnTDrtCP'PTVT fVTT'JV | mji/O IP U 'I'VTTT TkTJXTT 

jjVJUsyjo^lXxAj^nAUiSiSKaivl X&xuiMSnl, 
DAQLKRMAQDLKDX IEHLNTSGAPADTS 

UCl^^i X ^XXiTlMluTllJO Xiy " X X/^ .W CXrtlJl l\t 

RKVEEVTKVCEGRRKEQERSFRITFD 


13650 


27551 


A 


13810 




OCT 

Zoo 


KT^^T.WKTTFnVINriP PWPT JVIPVT PttT .ttTR 2XRA 

GGLLEPRGLRPTWVT 


13651 


27552 


A 


13811 


210 


51 


TLSHRKPISFEAYFARVRGF ILEVSETR 
NPPIGWTRWLTP VI PALWEAEAGG 


13652 


27553 


A 


13812 


241 


139 


T .HR BriT ,7X DM T D7A T ,LJ CT\ /WflflT .T .RDP Q DP 
J, in PftVTiJrHrrax cAJun IS V JVV VjOlxuIlxr Ko irlv 

PAWASW 


13653* 


27554 


A 


13813 


539 


256 


RTTQMSTAAGFTEAPNWKQPRCPWTEEQ 
1WMWSLHAMECA9AMKWNEVLIQPAVR 

LGRSTGKGSD 


13654 


27555 


A 


13815 


63 


369 


VRETPLKTHYIiEEISSPASPTAIPQSLL 
roil xoirir j Jiiftxvyjvjnjunf vnoumr 
PETEPSPSVCLWAGPKVPPGAAGKGSPD 
3NTPT.VI RSLTiAPP PAST. 


1.3035 


27556 


TV 


10017 


ZDO . 




AES AP PFHTNii I.F KTHTHTHTHTHTHTU 
OTlSlxRTTRWAPSQVCRPQTSQRGGECVG 
PTAFAPSPTLLKPHHPSSHVHLPSQPRR 




i/jj / 


A 


11818 


516 




WFKSGSFWLGWAHrasrPSTIX^GQGGWI 
TSGRSRTSVTTTTTSQTCAPA 


13657 


27558 


A 


13819 


295 


361 


WLTPVIPALLEAVTGGSLEPRS 


13658 


27559 


A 


13820 


166 


368 


GWWATQHSAQYVVSSSSIGAxjCLGESCS 
VTQAGVQWCDLSS PQPSHPGFKRFLCLS 
LPSSWDHRGDL 
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residue of 

peptide 
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Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Aianine 
OCysteine, D=Aspartic Acid, 
E=Glutam ic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R=Arginme, S=Serine, 
T=Threonine, V=Valine, 
W=0Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /-possible 
nucleotide deletion, \=possib!e 
nucleotide insertion 


13659 


27560 


A 


13821 


133 


530 


RKYCGQCLNMALNNVSLSSGDQRSRVAY 
RSSHGDLRPRASALAMVSGDGFLVSRPE 
AIHLGPRQAVRPSVRAESRRVDGGGRSP 
REPDGRGRSRQARFSPYPIPAVEPDLLR 
SVLQQRLIALGGVIAARISV 


13660 


27561 


A 


13822 


76 


1958 


RQEI* I WPLCS P PQGDRFLQKS W I FFRPV 
MADKIjTRIAIVNHDKCKPKKCRQECKKS 
CPWRMGKLCI EVTPQS KIAWI S ETLC I 
GCGI CI KKCPFGALS IVNLPSNLEKETT 
HRYCANAFKLHRLPI PRPGEVLGLVGTN 
GIGKSTALKILAGKQKPNLGKYDDPPDW 
QEILTYFRGSELQNYFTKILEDDLKAI I 
KPQYVDQIPKAAJCGTVGS ILDRKDETKT 
QAIVCQQLDLTHLKERNVEDLSGGELQR 
FACAWCIQKADIFMFDBPSSYLDVKQR 
liKAAITIRSLINPDRYI I WEHDLSVLD 
YLSDFICCLYGVPSAYGWTMP FSVREG 
INIFLDGYVPTENLRFRDASLVFKVAET 
ANEEEVKKMCMYKYPCTKKKMGEFELAI 
VAGEFTDSE IMVMLGENGTGKTTFIRML 
AGRLKPDEGGEVPVLNVSYKPQKISPKS 
TGSVRQLLHEKIRDAYTHPQFVTDVMKP 
LQIENI IDQEVQTLSGGBLQRVAIiALCL 
GKP ADVYL IDE P S AYLDS EQRLMAAR W 
KRFILHAKKTAFVVEHDFIMATYLADRV 
rVFDGVPSKNTVANSPOTLLAGMNK^ 
QLE ITFRRDPNNYRPRINKLNS I KDVEQ 
KKSGNYFFLDD 


13661 


27562 


A 


13823 


3 


292 


KCWDYVHEPPRPAPMQTYQVDLRCCFVS 
TINLLVCGERASLPSVESLVFSGLAEVW 
MKAVLAPSRESGKLVLQQVPEYAEAIWG 
ARPACRGIjVFSA 


13662 


27563 


A 


13824 


245 


91 


YKVNFGP PRGS FFRPPPFFFFEVFHFFF 
IIFFFFFFFFFFFSFFLGGFMYV 


13663 


27564 


A 


13825 


276 


100 


PGQKKKTLSPKKKKSIYLYIHTHIYIYT 
HTHYIYKYIFVCVYICVSIYIHTHIVPR 
IFS 


13664 


27565 


A 


13826 


202 


375 


IYNTYIHTHTHTHTHTHNVYLYIQSGDC 
IYAVLLGELSFIYIFVFLSSLISILFHL 
TY 


J 3665 


27566 


A 


13827 


278 


34 


KPRGLSPAGVSAQRAWGPHSCPRLPDPR 
TELQVRQI PALVGSSAFQGRRGRKGPVG 
FiiKPKAPPQ PRPGCCQCTIiLQI 


13666 


27567 


A 


13828 


62 


307 


GGARAI FLNVPHLKRPMFFLELCLSTKC 
P VGTE FALiDSLLRIj YNSNTFCWVQWLMP 
TIPALCKAEMGGLLEARSLIPAWAT 


13667 


27568 


A 


13829 


386 


478 


NRIGMVSHACNPSTIjGGQGRRIMRSGDQ 
DBS 


13668 


27569 


A 


13830 


364 


484 


NFNIWLGWAWWLMPVIPALWEAKAGGSL 
EVRGSGPAANMV 


13669 


27570 


A 


13831 


33 


486 


PARSAEFGTRRERERERERERERERERE 
RERERERERERERERERERERERERERG 

\ t"t <T TC* "D o a t t (^raPOHTiT c r\ w~vr*\ Tl**\ n? 
ViCV L.XrLAi ujUVKIS&PFAjOWJUxx Vr 

WRYTES LALS P FCLGRCLALLI FLEGM 

WRKIWAPSFLSLCVGALFFSQHPPPFWC 

LCVLSLVRPLPLSLSGGVFVSQRALI 


13670 


27571 


A 


13832 


6 


718 


YSAVEFAMAGVGAGPLRAMGRQALLLLA 
LCATGAQGLYFHIGETEKRCFIEEIPDE 



1378 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteii\e, D=Aspartic Add, 
E=GIutamic Acid, F=Phenyla!anine, 
G=GIycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M-Methionine, 
N=Asparagine, P=Proline, 

T=Threonine, V=Valine, 
W=Tryptophan, Y==Tyrosine, 
X-Unknown, *=Stop codon, /^possible 
nucleotide deletion, \=possible 
nucleotide insertion 














TMVIGNYRTQMWDKQKEVFLPSTPGLGM 
rl ViS V jsJUtriAjjbJyilAl V V uoKU x uo eajKvx r 
TSHTPGDHQICLHSNSTRMALFAGGKLR 

RARQLLDQVEQIQKEQDYQRYREERFRL 
TSESTNQRVLWWSIAQTVILILTGIWQM 


13671 


27572 


A 


13833 


17 


130 


RLQSFGTRRKRERERERERERKRERERE 
RDTHSFLHG 


13672 


27573 


A 


13834 


283 


460 


LWRLAI CKNYS LKRETE PGS VAHVYNP 
STLGGOGGRTARGQEFKTSLDNIARPPS 
LQK 


13673 


27574 


A 


13835 


3 


435 


W PKr CI AbQ E r G I RRE RERE RE RE R E RE 
RERERERERERBRERERATS LS RARPS F 
L FP S ACV SHTPYVGERENLS LHTLKHTH 
TREKHSLSYTNASARDTRARFSEGRAPP 

TVOT7VT TTTfM/PPPTPTPPQT #T OirDU7VI70T 
JL xo V 1 1* Vr if r r r doJ-iwIio VKHAcfcii 

AHV 


13674 


27575 


A 


13837 


351 


57 


RTLVH I S KFTGNLS DLYYVQS E WQDLGT 

TV pvT OTMTCW.Uf'T PCT.POTtfKrPngMPCC 

VAHTCKPSTIjGGRGRCRDQHGQHSKQPS 


13675 


27576 


A 


13838 


336 


150 


TFCYKKYLWGAHCFI FRTTGLP IKNFYF 
VAWAS 


13676 


27577 


A 


13839 


310 


454 


HCSLGLPQVLLI ILYKILLGWARWLTPV 
IPALWEVEAGGLLEARSSRP 


lio/7 


27578 




1384U 


163 


297 


A JL t? Jf iiijUliJuAJJutAn 1 JL x Vis JL x xioXxJjoX 

YLS I YLYTHTLWNTLQLj 


13678 


27579 


A 


13841 


33 


316 


LDQHPTPRSPl^CHSLRKTSSSQGGKSE 
LVKQSLKKPKLPEGRFDAPEDSHLEKEP 
LEKFPDDVNPVTKEKGGPRGPEPTRYGD 
WERKGRCIDF 


13679 


27580 


A 


13842 


4474 


2586 


DGGSGCVKMEFPGGNDNYLTITGPSHPF 
LSGAETFHTPSLGDEEFEI PPISLDSDP 
SLAVSDWGHFDDLADPSSSQDGSFSAQ 
YGVQTLDMPVGMTHGLMEQGGGLLSGGL 
TMDLDHS IGTQYSANPPVTXDVPMTDMT 
SGLMGHSQLTTIDQSELSSQLGIiSLGGG 
TxXPPAQSPEDRLSTTPSPTSSLHEDGV 
EDFRRQLPSQKTVVVEAGKKQKAPKKRK 
KKDPNEPQKPVSAYALFFRDTQAAIKGQ 
NPNATFGEVSKI VAS MWDS LGE EQKQ I Y 
KRKTEAAKKExXKAIAAYKDNQETOATV 
ETVELDPAPPSQTPSPPPMATVDPASPA 
PASIEPPALSPSIWNSTLSSYVANQAS 
SGAGGQPNITKLI ITKQMLPSS ITMSQG 
GMVTVT PATWTSRGLQLGQTSTATIQP 
SQQAQIVTRSVLQAAAAAAAAASMQLPP 
PPJL^PPPLCKWPQPPTQQQVTIIiQQPPP 

PLPTLKMQTTLVPPTVESSPERPMNNS P 
EAHTVEAPS PETI CEMITDWPEVESPS 
QMDVELVSGSPVALS PQPRCVRSGCENP 
PrVSKBTOlWi'CSNEaA^C^VFTJW 
VASRNSNTWFVK 


13680 


27581 


A 


13843 


54 


251 


EFYTOINSPSPLWLCVCVCVCACLFMCLC 
AliTCMCWxTCSLljHSNLTQVLSSKFYDS 



1379 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue oi 
peptide 

CO/1 llAft ftik 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamic Acid, {^Phenylalanine, 
0=G!ycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Olutamine, R=Arginioe, S=Senne, 
l— tnreonine, v— valine, 
W=Tryptophan, Y=Tyrosine, 
X— Unknown, *=^Stop cod on, /—possible 
nucleotide deletion, V=possib!e 
nucleotide insertion 














J. o rAon ruA. 


13681 


27582 


A 


13844 


154 


26 


DPHVPPTERSEGSQARWLPPVIPTLWEA 


13682 


27583 


A 


13845 


475 


225 


GKTtiLi YGGALFQLLQAAI PEGGGTSCLF 
NPSSDGGSSAAGGWGGAGDLARSSRVP 
SPTSSRPVPSPDPS PPSLYLTPWKETH 


1 JUOJ 




7\ 
±X 


iio4o 


479 


319 


RD CRRS LTHS VLS G AQAGV QWRD LGS LQ 
PPPPSRLPWPPKAPRWQPLPGHHPI 


1 1£QA 


27585 


7\ 

A 


13847 


182 


499 


LLCVKLCDS CNENKALREQTRAQVQGGH 
SRAQVQQVQGGHS AHTWQ IRS AASRRK 
Ab bTraSHLCTwljLFyGTGSSTYMRPTT 
RYSPLEGRLAAVFYSILIPTLN 


13685 


27586 


A 


13848 


388 


494 


RDS WERWLE PL I P VLWEAQAGRLLE PRS 
T iH LAWAT 


13686 


27587 


A 


13849 


198 


63 


HHATTWMNLEDIMFRKVSQTQKNKYYMI 

rujLnA JL XVJV V UOtuUiyJV 


13687 


27588 


A 


13850 


555 


353 


RCPPGAQAPLPAP S PAPPVHIALS PLSC 
GVSWPRBLTAHPPE PPPPPFFKIHPHPQ 
LSVGKRGQMTP 


iJUOO 






1 IOC 1 




66 


laliAGSTGQARWLTPVI PELWKAE VGGSL 
EIRSSR 


13689 


27590 


A 


13853 


12 


105 


AL 


13690 


27591 


A 


13854 


256 


363 


I CLF I KDS DRGQTQWLTPVILTLWKAEA 
GGSLEPKS 




27592 


A 


13856 


23 


436 


IDRLSDSHEERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERESSLLCVRHTCFAPPTYIFLWETLR 
VCPTPCVWGALSVFSHTERARIFFYTOS 
GSVARAPVCAHTRI FSSRARARALCARP 
PLSRERRAHTRVFFI 


13692 


27593 


A 


13857 


91 


428 


PDPDTSPLTTDRGETPLGATLPSCYCGG 
QQSFEPKTKKTTNQKKKKKGKKFTPQKK 
NPKIFPPKPFCPKGPRGENQPKKTQNQP 
KKKKKI LGGGPAPNS PLGGGKKNS WAGF 




27594 


A 


13859 


465 


43 


KRVTTENPQHICSAETAIPWSLSQNCNF 
HILWWGHPNHPIYITTSIYIYIYIPHIY 
GMVYIYTRICMCVCIYVHICIHTHTHIY 

TV TtTn^tTMTTTyTTKJI 1 Jl >nw I'll MI Tfi /snira ■/-»■» nr T A 

X Y IHTHTHI YMERERETVSCS VAQAGVQ 
WCDLGSLQPPNKRLKRFSCLSLPSSWDY 


13694 


27595 


A 


13860 


290 


52 


WYKYLPGTRGFIilWLNYLKCIGRIELLV 

/TSVI7G17TVT CTVT r? T VT oTVT conn TT^nn 

GVKRCNHSSLQPRTPGLMHLSNS 


13695 




& 


1 

1 JOOI 


ins 


£.£. 


xr PLroAVMKro iAr yJvarFi KMWHVo 
I 


13fiQ6 

l JU7U 




A 


11R£7 


10/ 


11 


vr i VlttjJL IMLVbVRDIjGQAWWIiTPVIPA 
LWEAEAGGSLEEFLRT 


13697 


27598 


A 


13863 


448 


355 


FFFFFFFFFFFIFFFFFYFFFFFFFAKN 
PLN 


i Joys 


27599 




13864 


321 


445 


SQSTQAWCLMPVI PALWETKSSESFEPR 
SLRPAWATGHNPI 


13699 


27600 


A 


IjOOJ 


JOU 




PFLSNVGKGMQNDPVGLSPTSTPSDLSR 
IYLFFCFFVCLFVCFETGFPSVAQAGVQ 
WCDPLGLKWFLPPQSP 


13700 


2?601 


A 


13866 


372 


264 


KMENPEDKNFFIFFMFFIVFLLFFFFFL 
FCLTQFF 



1380 
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SEQ ID 
NO: of 
nucleotide 
sequence 


SEQ ID 
NO: of 
peptide 
sequence 


M 

etfa 

od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

ftminn iff. 

residue of 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

nentide 

UWllUUV 

sequence 


Amino acid sequence (A=AIanine 
C=Cysteine, D=Aspartic Acid, 
E=GIutamic Acid, F=Phenylalanine, 
G=G)ycine, H=Histidine, I=Isoleucine, 
K-Lysine, L=Lencine, M=Methionine, 
N=Asparagine, P=Pro!ine, 

0=f*lu taming A rtxi ninf*. Ss=Serinp 

T=Threonine, V=Vaiine, 

W~Trvntonhan Y=Tvrosine- 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion^ \=DOSsible 
nucleotide insertion 


13701 


27602 


A 


11R/V7 


161 


104 

Jut 


LF IAEMTSHYIAQAGLELIiASSNPPTSA 
SKRTGITDGATEILPQLRIi 


13702 




A 


llR/ft 

lJOUO 


230 


14X 


VSF IQNLVFCVERVYRVPDFGVWERGSK 
YNNGSTELHSK 


11701 

1 J 1VJ 


07/CA4 


A 


JJ007 


16R 




EVIAECYNGCFQTLSLCVCVCVCVCVCV 
CY 


11704 


Z> IXjSjJ 


A 


J JO /V/ 




16 


LKYGQWRD3jGSLQPPPPRFKQFSCLSI*L 
S SRNCTTMTRDHE 


13705 




A 


IJO/ 1 


141 


10 


KI LKVWSQAWWLIPVT PALWEAEVGGS 
LEPRISRLA 


13706 


27607 


A 


13872 


158 


59 


HLHGC IHLLKIiRLDAVSHACNSSTLGGQ 
GGRIA 


13707 


27608 


A 


13873 


232 


115 


APFFFKGAP IFFFFFFFFFFFIFFFLIR 
GKPGNDVGVEG 


13708 


27609 


A 


13874 


1619 


1375 


KTTWRRHLEKLQDMSLLRHRVHVDSSQG 
LFPFFQRQGLLPRIiKCNGDI I PYCNLKL 
LGWSNS PASAFRVARTTSLCRHTWL 


1 17ft0 
Ijfsjy 


77£1 A 


A 


17 57^ 
I JO fj 


L\y 


0/1 


RWEDHLSPGVPSCSAL 


13710 


27611 


A 


13876 


166 


496 


KKKKKKKKKGGGP LKKTPGGAKINRGRK 
KKTFPOKGGOKKTPGGIIJCKICPFLGGGK 

JAJtv JL X? i ^/ ivJVJ^ XVXV X JET VJV7 J- JJXVXV£VJr JC, J— I>JVJ\J Xv 

KGPNPPKKIKGLREKKKFKRGKGAKPAQ 
NPWGKKISPPGFFIjKKFFPRGRGFFI 


13711 


27612 


A 


13877 


132 


348 


PSKKKKPjGGPFKRTPGGPKFNGGGKGNS 
FPLMGGGIKPNLGI LGKNP YIjGGGTOGN 
NPPTEIKGFGBKKKF 


13712 


27613 


A 


13878 


287 


134 


QQGPP ISPGPKEENRMGPPSPGWGNPGI 
RSPPGAGPNPGAFQDGMFPSKKQ 


13713 


27614 


A 


13879 


333 


480 


VHPLRSAEGPRPESNMLHLTLKSKQQWL 
TP VI PALWEAKAGGSOGOE IE 


13714 


27615 


A 


13880 


1 


227 


PKIHCKIQHIWLVSFFPSFFLSFFQGE 
SCSAAQAGVQWHDLGSLQAPPPGFKWFR 
FIELLGLGFHSRRMRGKDS 


11715 


Z /OlO 


A 


JJ00J 


776 


77ft 


GWPWWLTPVI P ALI1EABVGGI1I1EPRSI1R 
IALAT 


13716 


27617 


A 


13882 


126 


389 


GSVLGTGCCGSVLGTVCRGSVLGTVLIA 
SATPDRYLRLSSGSSLTRITNSDLVPWL 
TPVI PALWEAEAGVSLE PRI WRPARETW 
EDPH 


13717 




A 


11RR1 


414 


isn 


ARQAPKWGNPTGSPPPGFIiNPPPKNFFL 
GPQKKKIFFPTPPPPHFFFFKGPPPFFF 
FFFFFFFSPHASGI LLCHPGRIAVAQS W 
LTE 


13718 


27619 


A 


13884 


197 


363 


DLGVVVVNTFMLKFKCLNVSRGHVQWLI 
IPVI P ILWKAEAGRSLKPRSLRPAWAT 


13719 


27620 


A 


13885 


101 


216 


NIHYANFPYFGLACPEYIVRALQKKYTQ 
TJiTHTHTHTHTRTl'HSL 


13720 


27621 


A 


13887 


17 




T^/HLKMVXIGLHMVAHACNPSTLGGRGG 
PIMRSGALGNFCL 


13721 


27622 


A 


13888 




1 


VLHAGLKLLGS STS PTSAS KGAGTVG IH 
YHTQLAFQFFLFCFLIR 


13722 


27623 


A 


13889 


411 


276 


TLYFKHGEMGFHHVGQAGLELLAS SDPP 
ALEFETSLHNKVRPRL 


13723 


27624 


A 


13890 


216 


54 


EFKLCYTLFDHSYFPLEELVSGQVWRLR 
PAIPS FWEAKEGGS LEPRS SKHE IV 


13724 


27625 


A 


13891 


3 


149 


SSDPRPSSSWLAWVGLWSMVWRVPPFLL 
PILFLASHVGKSPLSPSPHF 



1381 



WO 01/64835 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaraic Acid, ^Phenylalanine, 
G=Glycine, HNHistidine, l=Iso)eucine, 
K=Lysine, L=Leucine, M=Methionine, 
N^Asparagine, P=Proline, 
Vr^viutamine, k— Arginine, 5— oenne, 
T=Threonine, V=Valine, 
▼v — lryptopnan, x— tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 


1 1*70 <5 
1 J ILJ 


Z/OZO 




t 7 BOO 


loo 


i 
l 


SSRPAWATWKNPVYTKNTKISQAWQCT ■ 


13726 


27627 


A 


13893 


128 


2 


FSTTYPNPIKLPHPYLPLLTLFLDSAHL 
UOC17TWCT invovmr 

tl Jr o IS X N o li V Ay JvtSJS. 


13727 


27628 


A 


13894 


225 


391 


HLSIHILTFPFSLPEQWWLVPIIPVLWE 

V?TW3f2T.r.PPP C CPD7AVJ7ATW , ir3Pl'M7QPir 


13728 


27629 


A 


13895 


192 


1 


TIiVCVSSSVPKNYNNICLACWFMPI IPA 

T .WT<*\n? CPOT T T7T VOOT5DS WIVPUTMTJTIT V<? 

KNTKISLA 




27630 


A 


13896 


152 


, 245 


TlOQ T 7A TM7T MVVnnT*fTO'I*LT , l , IJ , l , LJ*PIJ*T , T -/*\XT/% 

lA2kxAUvJ_iMx i lnlHIHIHiHxHlJjQNQ 
VPG 


Li /3U 


27631 


A 


13897 


270 


406 


IoTKr rijxKDKAoljCHFvjWoAiSVUSWljP 
GSSNSQAQAILLPQPPB 


1 9*7*1 t 

13731 


27632 


A 


13898 


232 


73 


iiAJ2>GPKlJV 1 b iTFHWRJj YDQ 1 1 NxHEGR 
MWANSRLRLTPEGLLKVITPWCHAH 


13732 


27633 


A 


13899 


107 


7 


LGQAQWLMPI IPALWEAKAGGSLEPRTS 
RPAWA 


13733 


27634 


A 


13900 


62 


224 


PTRP I UAHCNI^CVIiGSDDP PAS ASQS AG 
VTGMSHCAWSNLI IHLPSRSTAICTL 


13734 


27635 


A 


13901 


450 


194 


KRQVKS HS E VKTTHVQWPGID PGS PAWE 
ARILPLNHQRLSGTTPWRITEKSNHKDL 
ETVLS GFFKCRLKANKD I QTKCVF I GNF 

y 


13735 


27636 


A 


13902 


300 


404 


AAPGRAPQKKKKKKKKKKKKKKKKKKAP 


13736 


27637 


A 


13903 


131 


29 


ALGSLQPLSPRFKRFSCLSLPSSWNSTA 
LTVTTM 


13737 


27638 


A 


13904 


267 


74 


VWVLVPPFTSYGVLDKPILIVNNIYVHV 
CVCIjCMCVC^CVCV CC TI v IRRHCWHNMI 
IIISQHDQ | 


13738 


27639 


A 


13905 


193 


64 


XKbHy IHKJUHFKKFRXIjCjW IWWLiTPVIP 
APWEAEEGRSPEPSS 


13739 


27640 


A 


13906 


309 


424 


QSTDFLENRRVGWAWWIiTPVIPALWEAE 
VGGLPEIKNS 


13740 


27641 


A 


13907 


404 


251 


SCLISLRSWDYRHVPPQPANFVFFFFEM 
ESCS VAQSGVQWQEENS IS KINK 


13741 


27642 


A 


13908 


471 


59 


GPPQAKKGKGFPTPGPPQKKGGGQPKSG 
VLGFSPPPLGPCPGLGPPKGWGFWGGSF 
erect r KKWEHKNEJjrPr 5KTEKPRRAG 
EARKGSQSTKYAGLSRGERQASPTLRPG 

nxjAn^iOrvaVJlrfXA^X J.XvryiUv>AXtirCir\jr( 


13742 


27643 


A 


13909 


479 


231 


QEYCMLIFCEHFGDFDLNIVFCLEKKMV 

rrw*T7VnT?VT T CQTTTYlPDnWT.MDITT'D'PT HP 
uviiwr X i_iJ_ioo c X\jlCfc*yvi Jjl*lJr V Xlr XXiWlS 

ABVGGSLEAKSFTPAWATWQDPISMF 


13743 


27644 


A 


13910 


167 


19 


CIVLGIMLDSLWEPRQWVPSWVRWFTPV 
IPALWEAEAGGSPEVRSSNSS 


1 11 A A 

13 744 


27645 


A 


13911 


too 

328 


A AC 

445 


oo r o JVX X ULioU I N W x V P w V LiAxiUEoE VG 

RSLEPRSSRLQ 


i in ac 

13745 


27646 


A 


13912 


145 


460 


NAFMS TGCEVLS YSEIjRQRNIiVWWFLP V 
IPATPQAEAGRLLGPRSSRPPWATKGVR 
ALSHKQTNTKTNKLSHSTGVS FVFTHHI 
HS PGI LWRLSOWIjIiAMNIATN 


13746 


27647 


A 


13913 


319 


210 


KLGFFKGPPLFFFFFFFFFYFFFFFGFL 
TVSCNFTL 


13747 


27648 


A 


13914 


372 


10 


PNTTCTPITYGKKCCPREISYYLCHEFI 
RIRILTFFDGESCYIAQAGLKIiLGSRSP 
PTSASGVAGNTDVCHHTQPILFYTLiETG 



1382 
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SEQID 
NO: of ! 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

n m inn 5»piH 

41111111/ «UU 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

unig iv 

last amino 
acid 

residue of 

UvU UMv 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=PhenylaIanine, 
G^GIycine, H=Histidine, I=Iso leucine, 
K=Lysine, L=Leucine, M=Methtonine, 
N=Asparagine, P=Proline, 

0=4*liii9minp R=Ar<rinin<» S=55*»rine 

\^ UIU lAlllltltT, f%l gllJIIIC, iJ^fcMSI lUCj ] 

T=Threonine, V=VaIine, 
W=Trvn to nh a n . Y=Tvros in e 
X=Unknown, *=Stop codon, /=possible 

nurlefktide deletion V—nos^ihlf 

nucleotide insertion 














LAIiLPRLILNSWSQAILLPPQPLRSYFN 
FWLFFCLYN 


13748 


27649 


A 


13915 


594 


471 


PSYIAYMCICMCVCVYVYINICVCIYTH 
VHTPYNPVTS KN 


13749 


27650 


A 


13916 


235 


3 


EANNL I RKTYTSGWFQRTS KRP PAVRRM 
KVLFFETESCSVAPVGVEWHDHSIjLQPR 
PPGLNRSSHLSFPSSSDYRWA 


13750 




A 




77? 


1 OA 
lO** 


KTTAW AQWI/T PV I STLWEAKVGGSIiEAG 

s 


1^7^)1 

ij i J J 




A 


1 J7lo 




^71 


SRVOGTGGHRGRGCIOAGVGFOEAKRRT 
EG P P WRWLT P VT S KLWEAKAGGS LEP VS 
SRRATWNLLKRIYYTDEREYLIPLALNT 
SQGSTFSF 


13752 


27653 


A 


13919 


387 


484 


FIEIIGWARWLMPVIPTLWEAEAGGSPE 
VRSS 


13753 


27654 


A 


13920 


142 


321 


FLFSIHFHLSCWGLFSNVWQSLSIFSYL 
KVRL YGO AWWFT P I IIiALWEDKAGGSLD 
PRSL 


13754 


27655 


A 


13921 


270 


412 


IEDKKI YS IIiHTETLERGQARWLTPVI P 
AFWEVKAGGSFEPRSLRPV 


13755 


27656 


A 


13922 


304 


396 


NFGWAQWLTPVI PALWESEAGGSLEPRG 
LRS 


13756 


27657 


A 






JO / 


VWWLTPVIPALWEAEAGGSPEPRSSRP 


13757 


27658 


A 


13924 


103 


3 


NKSPELGWAWWLAPVTPALWEAEftGGSP 
EVRSS 


13758 


27659 


A 


no75 




4ft0 

HUV/ 


CLLILSGLVASCI I ERHGKGQAWWLMP V 
IPTLWEAKVGGS LE LRNLRS AWGTWRNS 
IS I KYTEQLAQG 


13759 


27660 


A 


13926 


294 


489 


KWAKMQTLHKGRYACVC I CV CVCVCVYT 
YTDIYIFYIHIWPMNTKCSMLLVIGEMK 

JL i RJ XVI 1 ft OIH^ 


13760 


27661 


A 


13927 


363 


2 


NWGPPGFPPPPFLKTGPVFFIFGAPKKK 
FFLSTPRALKFVLLKGGPLFFFFFFLWV 
FLVKTGSHCVAOPDIjELIiS SSWPVLVSO 
GAGITGMSPQVQPLDLFFFFSFFETGSR 
SYAWADAW 


13761 


27662 


A 




i 
i 


7ia 

ZlO 


LRFKKRKTblKPKMWSVHTMEYHSALKRK 
EIRTQTTRWINLEDIVLSEIRVAVTKGR 
ILYDSTQMRNLEESNL 


13762 


27663 


A 


13929 


219 


408 


TFCHFIAFLHVYKMFVVFFFSFIMFFFP 
I^CNKAFERKKKXNKKNNKXKKXRKKI S 
LSPPAPP 


13763 


27664 


A 


13930 


161 


22 


WHPPAS ASHVAGTTGACHHERLRQEDHF 
SLRVQGCSEPRLCHHTPP 


13764 


27665 


A 


13931 


116 


I 


INTGWARWLTPVI PAFWEAAVGGSLEPR 
STSQAWATWK 


13765 


27666 


A 


13932 


181 


38 


PWPQRATQPKIS PLPPAISALWEAEAGG 
ALEPRSSRPAWAKSDAWVD 


13766 


27667 


A 


13933 


267 


397 


TVFDRTLFFIKKI IHSWAPWLTPWPAL 
WEAEVCGS PEVRSSG 


13767 


27668 


A 


13934 


129 


12 


NIGVGWALWLTS VI PALWEAEAGGPP KV 
RSSRTARPMS 


13768 


27669 


A 


13935 


179 


1 


KTYYGGIPPPGPPKKGEKKPPFFFKKFF 
LFFISFFIFFFFFFLRQSLAKMQSASVW 
KVS 


13769 


27670 


A 


13936 


245 


616 


DDKKKKEAAQKKATEQKIKVPEQIKPSV 
SQPQPANSJQNGTSTATSTNNNAKRATAN 



1383 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

annuo auu 

residue of 
peptide 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
aing iu 
last amino 
acid 

i*Ari/l ft a nf 

rtMuuc ui 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutamtc Acid, F=Pheny (alanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N=Asparagine, P^Proline, 
v=vjiuiaijiine, k— Arginine, o—oer ine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, - 
a.— un Known, — otop cooon, /=possiDle 
nucleotide deletion, V=possibIe 
nucleotide insertion 














QQPQQQPQALPRYPREVPPRFRHQEHKQ 
LLKRGQHFPVEF 


13770 


97671 


A 




171 
1/1 




m Jl UiiULiALiAlj JL VKVJU 1 oUJJJN r JsJN WGWVQ 

WLMPVFPAPWEAEVGGSSEATNLRPAWA 
T 


13771 


27672 


A 


13938 


421 


202 


GPREAPLFPSTQGISRPLFFFFFFETES 
RTVTRAAAQWHNLGSPQPPLCSLDILKT 


13772 


27673 


A 


13939 


205 


543 


ILESHRRLGWSEALPBNQLLCRASSWES 
LrtjFPJUPPC IjKPPS pafwgaaggwrakst 
SVSEPVPVGTLLVAIiPLEMAVGLLLGLP 
FCFHSLGAPHLLCEPDGPSVLPABARQS 
R 


Will 




A 


1 jyH\) 


i An 


JL 


JL&UH&iSisx I IiF FPIjxIjE JUsSSRPPPPQH 
TPHPPPPWPWHPDIADIQ 


13774 


27675 


A 


13941 


47 


303 


GLI Fill IWTCI VKTSTDFPRMEDCSQCI 
nyv l JfiCoNJ^JttrivJr x iANPHHCjoSRIiIj 
WPQHAAPWDDGRRGKPVFSLGFVSFPFP 
Q 


13775 


27676 


A 


13942 


1488 


1719 


PLVSFSKNTYPCIX3NVKQNQKTTFCSRW 

tfT inTI.COMT QOPi\C*W1MfTtnvOTXTT TftrPPA 

JSXixlV JjooMJbooFiVCiV X V VAd JUNJUVVSEQ 

SFQNSQLPAMLCLIVHLQMS 


13776 


27677 


A 


13943 


127 


402 


QPVTHPPNRTMCSSMIFLFLYLCFLSLS 
PFPSHLSLYAPPKKKKKKKKKKKKKKKK 

KKIHFPPGGGRIIFFY 














13777 


27678 


A 


13944 


852 


209 


EQTVYCTRlDLQQKiVFl^RLEAFCLLSQ 
GDPGSPMMCQLQQFDLWVLRGVLNFGGE 
TCPGJjF Jb YTKVEDYS KW ITS KABRAGPP 
IiSSLHHWEKLISFSHHGPNATMTQKTYS 
DSELGHVGSYLQGQRRTITHSRLGNSSR 
Lto JbU V J^KJUVJKESGRS PKASVQPLiYYDY 
YGGEVGKGRIFAGQNRLYQPEE IILGSL 
RACFLFAAVSSPGATPPN 


13778 


27679 


A 


13945 


68 


260 


INIFWHCVLLYVIIjYAISQFLLFPCVKN 

X/vb n iAio v ArlAuW iro i JjUUsJjAJb JSSJbB PR 
NLRPAWAT 


13779 


27680 


A 


13946 


420 


156 


FLPPGVKYGSFKRAPPFFFFFFFFLVET 
LSCYIAHTGLELPDSSNYPTSASQSAVI 

TGMQHHTWDT.PfilPTATDriftT.PTT T \TT *T 
x vofiMO run a. W crixilvj>\i^LirtJ. Ir yivLiK J. iaJLiVJjV 

PLH 


J 3780 


27681 


A 


11Q47 


117 


17 


AGHGGS TCNTPSOP T VTA VP TNTT. WttPnr.r . 
QQWGAQAF I PEQLYS PQSQHTMQEMGYV 
PGMGLEKNLKGLKKPLQAEGQNSHQGLG 
YNFSWQPLLSLQNIjYL 


13781 


27682 


A 


13948 


55 


224 


NFRLTSSFSYLKMSFWLGMWLTPVIPAL 
WEAKTGGLFEARSLRLVWWQSKKAGSHL 


13782 | 


27683 


A 


13949 


1464 


805 


RATS VRGAGRERS CGAAWS PAS IGPFLR 
RSVLPNAHFRSSREGGMAASTDMAGLEE 
b rKAr iU.HGUPKASGQEMNGKNwAKLCK 
DCKVADGKSVTGTDVDIWSKVTCGKSAR 
VINYEEFKKALEELATKRFKGKS KEEAF 
DAI CQLVAGKEPANVG VTKAKTGGAVDR 
LTDTSRYTGSHKERFDESGKGKGIAGRQ 
D I LDDSG YVS AYKNAGTYDAKVKK 


13783 


27684 


A 


13950 


356 


98 


l^PGIYFWGPIKKILPCPPAGVKLGSL 
KRAPLFFFFVEMGFHHVAQAGLKIiLSSR 



1384 . 
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S£QD> 
NO: of 
nucleotide 
sequence 


SEQI0 
NO: of 
peptide 
sequence 


M 

etfa 

od 


SEQ n> 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
tng to first 
amino aciu 
residue of 

pcpUUc 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
oing to 
last amino 
acid 

residue of 

pepuuc 

JCUUCUVv 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, ENHistidine, l=lsoleucine, 
K=Lysine, L= Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
v^vriuiamine, Jrv.— /v rgi n 1 n e, o-oermc, 
T^Threonine, V=Valine, 

W=T rvntn nhon V=*T vr i n p 
tt~— x rjrpiop 112111, I — J y> ualuC) 

X=Un known, *=Stop codon,/=possible 

niiplonli/lii flalai'iAn ACCI nip 

nucicuiiue deletion, v— puiaiuic 
niMMPntiHp in coffin n 














DSPIPVSQSIRITSTCEPPCIiAKIKSSL 
RL 


13784 


27685 


A 


13951 


194 


48 


NFLEGWPGQRLT P VI PAL W KAEAGGSLE 
PKTFRPVWAMWRDRLRKPQH 


13785 


27686 


A 


13952 


341 


418 


DQPGQHGETPS ILKIQKLAGRGSTCL 


13786 


27687 


A 


13953 


158 


323 


TWLKKYFQSTNQLPPAHLLNVQPLLPFG 


13787 


27688 


A 


13954 


231 


355 


VFLPQKQKQNQCPLTGKWVNNSWNIHTL 
EYYSAVKGMKQLI 


13788 


27689 


A 


13955 


493 


39 


LPPAPVSSMAAPDIiDSFPLPLCWTFLRV 

SNIFSISLPHWASPPLPTARLTSAVSPE 
PT i P PT ,P <5 RT <OR O PROAP SHPGMSMGP T Ii 
DATSNSTTRSR 


13789 


27690 


A 


13956 


497 j 


372 


NTKISWASWYAPVIPATWEVSNSRPQAI 
HSHQPPKRIGLQA 


13790 


27691 


A 


13957 


438 


301 


GTTYLDIDMTCDyVSVCLCIYICTRQQS 
11111 vi J. n x xi x n. 1 run x x 


13791 


27692 


A 


13958 


84 


466 


QPLGRSGKVPQLHLWEMQRLPAAFFRS F 
AAQGLGESVHINTPTSLSLRGRPFISSS 

HDfilD API/TP ft PPT .W A AR TT.PPFWR(TT.R 

RLKCSNVISAHCNLRFSGSSDSPASASQ 
VAGITGRSSLBQLLE 


13792 


27693 


A 


13959 


216 


28 


KYVKNYVIiSS 1 1 FWMQE I FHNFKKS S RT 
QWLTPVI PALWEAE AGGS PE PRSSRPAS 
SIQLENS 


13793 


27694 


A 


13960 


320 


943 


VLSFSLSDRAQRCNCRWGKQNSKLAPEV 
MEDLVKSTEFNEHELKQWYKGFLKDCPS 
uKJUxN JLi ttCj r Xj x V J\J? r Ir X laUAo JVT AyxiA 
FRTFDKNGDGTIDFREF ICALS ITSRGS 
FEQKl^NWAFMra^LDGDGKI TRVEMLE I 

T I? A T Y IfMURTV T MMTTMNT? T\CZT .TP RP»R VD 

KZFSKOTKNKDIX3ITLDEFKEAAKSDPS 
IVLLLQCDIQK 




27695 


n 
A 


IJ7OI 


101 


Q 
O 


CCTCCCCCXXCTLRLAPQSPAAPAPELH 
TPDPGSRRNR 


13795 


27696 


A 


13962 


203 


28 


VWGNQOTCFVPVFRKKVFCFFPLNKLLR 
GGFFFFFFFFFLVEIDNSWVFLGEGDLA 
GS 


13796 


27697 


A 


13963 


18 


171 


GIRHEERERERBRERERERERERERERH 
PAREIDMCVSKRDTRGALCALF 


13797 


27698 


A 


13964 


25 


299 


HDFCTRRERERERERERERERERERERE 
RERERERERERERERERERERERERERE 
RERERERERERERERETHTLSLYI YMCV 
FLYIPLYFSFLYIHTQREAPPPLSATLS 
LSIL 


13798 


27699 


A 


13965 


246 


410 


AYNGQGSRGVTPVI PSIjWEAKTGGSLEP 
RSWRSAWATWHTPPVPGPSKEISLYI 


13799 


27700 


A 


13966 


91 


26 


FCXZCGCCCCXrCFYCKKEKTLV 


13800 


27701 


A 


13967 


158 


24 


PFCHIYIYFLESCFVAMLECSGKILAHC 
YLRLLGSSNSLVPSLA 


13801 


27702 


A 


13968 


129 


7 


SWAQWLMPIIPTIjWEHEAGRSLEAWSSR 
PAWATQQDPVSKK ' 


13802 


27703 


A 


13969 


445 


268 


KIQLTKLKNASESLPSRINQAEEKISGW 
AWCLMPVI PVLWEAKAGGSLEPRS LRRG 
QVT 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

a mi [in ttuu 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 

/lino fct 

UIIIH Li/ 

last amino 
acid 

residue of 

UcUUUC 

sequence 


Amino acid sequence (A-Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Glutaratc Acid, F=PfaenyIalanine, 
G=Glycine, H=Histidine, I-Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
\^ uiu latum Vj *v~-'/krgHiiiicj »>~oEriiie, 
T=»Threonine, V~Valioe, 
W^Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /-possible 

llULiCvUUC UclcllUN, \— pUoalilie 

nucleotide insertion 


13803 


27704 


A 


13970 


1628 


1309 


HRRKKCAGYSSLSSALLRERERERKRER 
ER ERE S VCVCVLGTT TCPP P nVHV T Ort T 

SGVRQNCVQIPALSDPDLPLFLSLSVLH 
SNTWVVRPTLRVTERNKGAMCV 




9770S 




110*71 


11 A 




LGAKAGGSLEAGSLRSAWATWRAPMSTK 
IF 


11 QAC 


X f /UO 




l /z 


zyz 


4oZ 


vif QVT?TnnQpaw^7TiDT.'\rriv'TT? v^/ivrr m; 
xxvo vrviuuOwUiuvurui voi^jlki vjNxjr V 

YICQHSDGR TWWLTPVI PPLWETKGGGS 

SBVGRSG 


13806 


27707 


A 






Ifll 


STAIGPSFSPPI.CTHfi c ;PW2PT.PPPPr , P 
CPFLVEMGSHYVAPAGLKIJiGSSDPPAS 
ASPSHLGLQV 


13807 


27708 


A 


13974 


493 


289 


IPPTALGTSFSPFIjCTHGSFHGFLFFFF 
r , pr , PPT.\7PM<i<2Tiv^7TiD&r , T vr t rconDD 

AS ASPSHLGLQV | 




977A0 

A. f /Uy 


A 






191 
IZJ 


XKiUi/iuadour V2Cf\M±rn.l\jXjA,oozrl Lie. 

IYAVKDTRCMQRRILVRELECAAHGLTE 

f^T.Q TD'l l l-4'PM , ri4'l , UM , M i ri-l , l , 14 , l l U , l , IJ , l'LI , 'l , U l l* 
v«uoxi/A£ixnxni.iiininAnimninlal 

HGTRWEAGQSLKGES 


1JOU7 




A 


1 J7/U 


IIS 


9S/1 
Zj*» 


TVTCTNS53 W T/F PT^ PT. PT ,M PTJW T rvTf fiP 
RVCVCVCVCVC 


13810 


2771 1 


A 


11077 


170 


S8 

JO 


VPLSINSPOROCTPCCdPPCTf^PPT.VT.n 
KIISERSWAEIG 


13811 


27712 


A 


13978 


97 


186 


KI I FWLGVVAHACNPSTLGGQGGRIMRG 
RD 




9*77 n 






zoy 




irT/lT.irTIIfiDDOrtSDUTrMOTTTJTVT UT?7V t? 7\ 

EixA7XjruJ.£lLsr iroyAIr Wl? Mf J. x. FAJjWiv/UiA. 
GGLLESRSSRIAWAMW 


13813 


27714 


A 


13980 


421 


106 


SGGRNPFFFLGGGFYKKKFPPPPFLRGK 

PKKGGGKIFFFFGGKNFIMQNFGGPPPL 
WGKKKKFLPKKKNPLFFFFFF 




9771 * 




noci 

uyoi 


1AQ 


ii* 
i Jj 


VUTVTVT.CTVT.CTVT GTVT CTVT 137V VT 15 

i\xlx IX Ixjox xxjo 1 x xjox I JjoX XxjirAxxjir 

TYLHAHPLIHANAYKNIHIDYPFKKGFE 
Y 


13815 


27716 


A 


13982 


231 


21 


QHMYCLKS I WHKKLSRKPYCY 1 1 SB YSS 
MVKLRVS T SOVOWTiMP VI PAT .WP ATT AOG, 

l i V XUJnvoxouVynunJCr V X ItxAXj rt C*^iXVf\UrU3 

LLKARSSRPMWAI 


13816 


27717 


A 


13983 


209 


359 


mHTHTHTHTPNGVKQIMTQIPILPLTG 
PLI LE KFLNFSKPLSHPGKRDP 


13817 


27718 


A 


11084 


301 


249 


IWP ILPPQKI FFFFFETES CSVTQARVQ 
WCNLGSLQPPNPVFPSLAK 


13818 


27719 


A 


13985 


181 


403 


YLVLDS PES *TLG ILSQPKDY ITKDQTLT 
FRPVVVAi^O^STLGSOGKWITSVOEV 
ETSLCNTARPCLYQKYKN 


13819 


27720 


A 


1 J70U 


441 


358 


ETGSRCVIIiAGAOWRDLSSVOPlxPPRFK 


13820 


27721 


A 


13987 


7 


233 


AS I PC I YQKTKI PNTKQTKNKNTKITPA 
WWCTPIAPATQKAEIPPLHSSLGDPLRL 
S PKKKKGGPNWAPTPSKPF 


13821 


27722 


A 


13988 


152 


44 


PKAPLFFFFFFFFFFYETRSHSVAQAKV 
QRHDHGSL 




97701 
Z/ /Zj 




1 inon 

uyov 


loo 


no 

j/y 


vi v o urlAMlS 1 K V W W rUSKFxj F V F r VaAxixu 
DGMPLVWLLSASSILSIWLSIYLSIYLS 
IYLSIYF 


13823 


27724 


A 


13991 


296 


53 


SSILFHASKYHFSGRLFSLLSCKRRNIL 
QLVPHMYSPYLxWIHTHKHIYEYMYVH 
IYIYMNLRNCRIYLHSCTPKENQAR 


13824 


27725 


A 


13992 


37 


290 


P^GSPASRHQQIQCLIRACLKDGTLGVrv 
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SEQ 10 
NO: of 
nucleotide 
sequence 


SEQD) 
NO: of 
peptide 
sequence 


M 
etb 
od 


SEQ ID 
NO: in 
USSN 
09/515,1 
26 


Predicted 
beginning 
nucleotide 
location 
correspond 
ing to first 
amino acid 
residue of 
oen tide 
sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Gtyctne, H=Histidine, I-Isoleucine, 
K=Lysine, L^Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q^Glutamine, R=Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














WWLIPVIPALWBAKVGGSLEPRSSTPAW 
ATQGDLILKKDGTLLQCPHVAKGHSSHS 


13825 


27726 


A 


13993 


189 


370 


SRNAI FKKRWESSCQSSFNSLGVFPQAA 
TKGWVRWLTLVI PALWEAKAGGSLEPRS 
SRPA 


13826 


27727 


A 


13994 


179 


25 


SKHNKKNIQICLLCSKRSMFVHVCIHIH 
ICVYVYRYAYIYIGMCIHIYIHS 


13827 


27728 


A ; 


13995 


159 


2 


EDPLS PGVQDQHGHHRKSLSGRGGKISL 
GQEFKSSMAAMYFGRSGGCKLERR 




9779Q 


A 


13996 


307 


99 


FIWEHFVNYTMFFTYKKPTSQAWWLMPV 
VPALWETEAGGSPEVRSPRPPGQHDDVR 
PLGPSLHVYLQMD 


1189Q 


9771ft 


A 


1 J77/ 


218 


21 


FQHFGRPRRVDHLSLGVQDQPGQHGETP 
SPLASSILDLIPPPFPIDEPVDLAFQSV 
TLGVQDTTS 


13830 


27731 


A 


13998 


227 


404 


G YS VTLLLGLTQLTQR LAHGTRKGERER 
ERERBRERERVCS CLMHELASHLLFFFL 
FRE 


13831 


27732 


A 


13999 


400 


668 


PVGRRWELQGMGLGVRTWQSSLPCQGPT 
SSSVLSLGRGTPGISNTCLPPRSATELG 
LPAQVPFPNVQSQDQDLSFSLGCFPRTY 
NRQTL 


13832 


27733 


A 


14000 


225 


414 


NRVXIiCHTGWSTVAIIiAYCSFKLLGSRD 
SPTLASRAARTIWCRNVWLIKKKKKKK 
KGGRRL 


11811 


z/ / j 1 * 


A 


14001 


397 


118 


LFSGPFFFDKSECFQE PTVPFCLLLYNQ 
MQPYLLRSAFVDSYYMYDSFLRDRVSFC 
QSGWSTVAPSRLTAALNSLAQVILLPQP 
LNVLCGISR 


iiri4 


27735 


A 


14002 


3 


173 


YVFYRQKQMVCF YKI R I KNMFNRNRLNL 
GQWCVPIVPATQVAEAGGSPBPTSLRPI 


11815 


27736 


A 


14003 


440 


290 


YTHTLYIYIYIHTHTHTHTYIYLCTYWG 
QFDLLVLDNKLYKE I YYLS VF I 


13836 


27737 


A 


14004 


2 


116 


ARLVAMP FKYEE I KDFLLTARRKDAKS V 
KIKKNKSAAV 


13837 


27738 


A 


14005 


279 


422 


SAKAPLSCLRTNSLLLKNGLRGWAWWIT 
PVIPTLWEAEVGGSPDVTSP 








14006 


3 


497 


GGIGDSRCGSTRASSSPQLAGRSSSVLP 
AAAQPCTPTMDVFKKGFS IAKEGWGAV 
EKTKQGVTEAAE KTKEGVMYVGAKTKEN 
WQSVTSVAEKTKEQANAVSEAWSSVN 
TVATKTVEEAENIAVTSGVVRKEDLRPS 
APQQEGEASKEKEEVAEEAQSGGD 


13839 


27740 


A 


14007 


1 


292 


SLGGGGCNDPRSYHCTPAWATEQDTVSL 
SLSIYMSVRVCIYMYMCVCVCIRIYTYI 
HTHIHIRVYIYTIPYIWGIYIYIYVDLV 
IYMGWCGCPPPSI 




77741 


A 


14008 


57 


250 


KIFFFLMNHTWGGVPWLATWEAEVGGS 
LEPRSLGLQETMMTSLLSRLPRQQGEIL 
SLILKRNY 


13841 


27742 


A 


14009 


184 


2 


AVPGRWPCQQVPGLLPS IQDTLKQGRTG 
QWLTP VTPMPWEAEAGGSLDARS LRPAW 
ATGK 


13842 


27743 


A 


14010 


123 


3 


KEPSRPGRWLTSIIPACWEAEAGGPPEA 
RSSTREFRTAK 


13843 


27744 


A 


14011 


403 


168 


GTGLRASRCISQPVMSGLLSPSEFPFPE 
IESHLHVSQRETAARLRDRERERARERE 
RERERGKNASSGKPPGSASCVP 
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NO: of 
nucleotide 
sequence 
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SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
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09/515,1 
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nucleotide 
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correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=>Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutaraic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutamine, R^Arginine, S=Serine, 
T==Threonine, V=VaIine, 
W=Tryptopban t Y=Oyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, V=possible 
nucleotide insertion 




27745 


A 


14012 


207 


422 


EIRRGSLAGPLSLNFILSVMGSHSKYLR 
KVIGRAQWLMPVVPALWEAEVGKLHELR 
RLGHTWAAW 


13545 


27746 


A 


14013 


200 


405 


WLCVLGWSLLALS PRLECYGP ISAHCNL 
HLLRSSDSPASASQALQEAWYQHLLLMM 
PQAAS TQVGRKK 


13846 


27747 


A 


14014 


319 


176 


KPSTITFLFLHLFISSRESRPVTQTEVK ' 
WLNHSSPQPPTPRFSLPSS 


13847 


27748 


A 


14015 


63 


401 


EQENKNTiiNiiPGWDGVHIHVIiRFIiNTRM 
PLGHLS FSEGVSGSGSSGRRCQAGAGGR 
SATAAARPCQCLGLKGLPPAVPSCGLEG 
KAPERAGYSASHPPHGCBTGNGWVLYVN 


13848 

"13849 


27749 


A 


14016 


416 


63 


VSKVYPKINHTQRTVRHETFKIAMPIOry " " 
SNLCS ILRFNKDRLAFMRHYTKQCS RTS 
ITEYIAMFFFVCLFVLKIGAHCHPGWNE 
WLVCSQVTAASTSRAQAILPPQPPKIL 
GMQQA 




27750 


A 


14017 


~196 


425 


SLMFRPPLFS FCCFCIRYPFIFVFFSLL 
CFAFSWCSLLLFFFHYLKLFIiLLFLLFF 
LPVLI»CLDS FLVHDFFLLSS 


13850 ~~ 


27751 


A 


14018 


3 


233 


S SRMGRGTKT IAFDMRWS CEVDRARHRD 
REREREREREREREKGAEPQEFPVISPW 
VDRPCIAGILHLKCGCGHRL 


13851 


27752 


A 


14019 


163 


I 


AERWHDQICVERE I WG WVQ WLMPVI P VL ' 
WEAKAKNLLEPKNLKLQRNRRPGNSR 


13852 


27753 


A 


14020 


"203 




I*KKKKNKSQAQWLTSWLASQEAEAGGS 
LEPRNSRPAWAT 


13853 
IJoj4 


27754 
27755 


A 
A 


14021 
14022 


179 
358 


3 

431 


VTWQRELRU2MELRLLICRAWWLTPVI P 

VPWEAKAGGSPEPRSSRPAWETRNRRPG 
NS 


13855 


"27756 


A 


14023 


98 


3 


HHKKKNQNWU3TGAHACNPSTLGG 

GRPAGWLTPVAPTLWEAEMGGSLEPRSLH 
GHE 


T3856 


27757 


A 


14024 


254 


385 


RENWABPSSHSLIiNI FLQLGTVARACNP 
STLGGRGGWITRSGD 


13857 


27758 


A 


14025 


140 


1 


KGVLDGPRGLCVFCFCFFFLCVCVCVCV 
CVCVWRRSLTLSPRLQCI ] 


13858 
"13859 


27759 


A 


14026 


579 


852 


QREWVGWAGKEGEGWVSHVPASAQLPAN 
GQRGQPHPS PLGGTGS TWAQEGAYCCLS 
SCSHCCCCSSCCCCCPPGFCLFPSLGAI 
CHLYHL 


13860 


27760 


A 


14027 


30 


663 


LRIRALREL PASHI PGS LTTCCVPRPPL 
PCSSTKPDAKGYKPLAQQKLGSQRTGPC 
LGHAPCYSPLWELRGHGCSPYGPSPSGS 
VSLWQEAMRLPKNTPBEKDRRTAALQEG 
LRRAVSVPLTLAETVASLWPALQELARC 
GNLACRS DLQVAAKALEMGVFGAYFNVL 
INLRDITDEAFKDQIHHRVSSLLQEAKT 
QAALVLDCLETRQE 




27761 


A 


14028 


I 


267 


avgvhhafhlphcffasllespvsprlaH 

MD PNCS CAAGV S CTCAGS CKCKECKCTS | 

CKKSCCSCCPVGCSKCAQGCVCKGASEK 
CSCCD 1 


13861 


27762 


A 




AC 

46 


1746 


paagaatmefrqee frklagralgklhr" ' I 

LLEKRQEGAETLELSADGRPVTTQTRDP 
PWDCTCFGLPRRYI tAIMSGLGFCISF 
GIRCNLGVAIVSMVNNSTTHRGGHVWQ 
KAQFSWDPETVGLIHGSFFWGYIVTQIP 
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Ulllg IV 
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peptide 

sequence 


Amino acid sequence (A— Alanine 
OCysteine, D=Aspartic Acid, . 
E=Glutamic Acid, ^Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
0=Oliitarninp R=A rein in fL. S^Serine 

T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possible 
nucleotide deletion, \=possible 
nucleotide insertion 














GG PI CQKFAANRVFGFAI VATS TLNML I 
PSAARVHYGCVI FVRILQGLVEGVT YPA 
CHGIWSKWAPPLERSRLATTAFCGSYAG 
AVVAMPLAGVLVQYSGWS SVFYVYGS FG 
I FWYLFWLLVS YES PALHPS ISEEERKY 
IEDAIGESAKLMNPLTKFSTPWRRFFTS 
MPVYAI I VANFCRS WTFYLLL I SQPAYF 
EEVFGFEISKVGLVSALPHLVMTI I VP I 
GGQIADFLRSRRIMSTTNVRKLMNCGGF 
GMEATLLLWG YSHSKGVAI S FLVLAVG 

NGVGTLSGMV CP 1 I VGAMTKHKTREEWQ 
YVFL IAS LVHYGGVI FYGVFAS GEKQ P W 
aRPiypMQpRTrrvTi^OHnrx^T.aoQnn^RNrR 

nQr lit Ci l lu3 rt IJ Xvv—VJ C V \JlXLJ\^ Lu^\J C3 LJ LJ <J J_»i JUCj 

DEAEPPGAPPAPPPSYGATHSTFQPPRP 
PPPVR0Y 


13862 


27763 


A 


14030 


240 


63 


VFEAQSNGGVELNRFNEGPI*SAVIiKGRV 
AWAQWLTP VI PTLWEVKAGGLLEARSLS 
PA 


13863 


27764 


A 


14031 


289 


414 


HAEMGRVQWLTPAIPALCKAEAGGPPEP 
RSLRPAWATY 


13864 


27765 


A 


14032 


386 


1765 


IiGDARAPEKMSAIQAAWPSGTECIAKYN 
FHGTAEQDLP FCKGDVLTI VAVTKD PNW 
YKAKNKVGREGI I PANYVQKREGVKAGT 
KLSLMPWFHGKITREQAERLLYPPETGL 
FLVRESTNYPGDYTLCVSCDGKVEHYRI 
MYHASKLSIDBEVYFENLMQLVEHYTSD 
ADGLCTRLIKPKVMEGTVAAQDEFYRSG 
WALNMKELKLLQTIGKGEFGDVMLGDYR 
GNKVAVKCIKNDATAQAFLAEASVMTQIi 
RHSNLVQLLGVIVEEKGGLYIVTEYMAK 
GSLVDYLRSRGRSVLGGDCLLKFSLDVC 

AKVSDFGLTKEASSTQDTGKLPVKWTAP 
EALRE KKF STKSDVWS FG I LLWE I YS FG 
RVP YPR T PT.KDVVPRVEKGYKMDAPDGC 
PPAVYEVMKNCWHLDAAMRPSFLQLREQ 
LEHIKTHELHL 


13865 


27766 


A 


14033 


619 


414 


EQARCIiEQHACTSPRQPRPQLQCSELKG 
HSLGMEHQGHVPCLHPCSPCPPVKASPA 
CWSSNPAPLCPH 


13866 


27767 


A 


14034 


2 


614 


IiESRPDGRPSSTHPASPSAFSAPGKPHP 
PEAKMSSKRAKAKTTKKRPQRATSNVFA 
MFDQSQIQEFKEAFNMIDQNRDGFIDKE 
DLHDMLASLGKNPTDEYLEGMMSEAPGP 
INFTMFLTMFGEKLNGTDPEDVIRNAFA 
CFDEEASGFIHBDHLRELLTTMGDRFTD 
EEVDEMYREAP IDKKGNFNYVEFTRILK 
HGAKDKDD 


13867 


27768 


A 


14035 


1 


254 


AVEFGPAGPGSLGRAAAMIIPVRCFTCX3 
KIVGNKWEAYLGLLQAEYTEGDALDAI.G 
LKRYCCIRPJ4LIAHVDLIEKIJ 1 NYAPLEK 


IJOuO 




A 


14036 


494 


148 


FRFLSDCGVFAEGHIELQVESGVPLGFS 
TMAEDMETKIKNYKTAPFDSRFPNQNQT 
RNCWQNYLDFHRCQ KAMTAKGGDI S VCE 
WYQRVYQSLCPTSWVTDWDEQRAEGTFP 
GKI 


13869 


27770 


A 


14037 


130 


3969 


I MGD VKNF L Y AWCGKRKMT P S YE I RAVG 
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ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Add, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Iso leucine, 
K-Lysine, L=Lcucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=l*lutamine, K— Arginine, cF'oerine, 
T=Threonine, V=Valine, 
W=Tryptopban, Y=Tyrosine, 
X-Un known, *=Stop codon, /=possible 
nucleotide deletion, ^possible 
nucleotide insertion 


• 












NKNRQKFMCEVQVEGYNYTQMGNSTNKK 
DAQSNAARDFVNYLVRXNEIKSEEVPAF 
GVAS P PPLTDTPDTTANAEGDLPTTMGG 
PLP PHLAIiKAENNSBVGASGYGVPGPTW 
DRGANLKDYYSRKEEQEVQATLESEEVD 
LNAGLHGNWTLENAKARI^QYFQKEKIQ 
GEYKYTQVGPDHNRS FIAEMTI YIKQLG 
RRI FAREHGSNKKLAAQSCALSLVRQLY 
HLGVVEAYSGLTKKKEGETVEPYKVNLS 
QDLEHQLQNIIQELNLEILPPPEDPSVP 
VALNI GKLAQFEPSQRQNQVGVVPWSPP 
QSNWNPWTSSNIDEGPLAPATPEQISMD 
LKNELMYQLEQDHDLQAILQERELLPVK 
KPESEILEAISQNSWIIRGATGCGKTT 
QVPQFILDDFIQNDRAAECNIVVTQPRR 
I S AVS VAERVAFERGEEPGKS CGYS VR F 
ESVLPRPHASIMFCTVGVLLRKLEAGIR 
GISHVTVDEIHBRDINTSFLLWLRDW 
QAYPEVRIVFMSATIDTSMFCEYFFNCP 
SLKLWRTYPVQBYFLEDCIQMTHFVPPP 
KDKKKKDKDDDGGEDDDANCNLICGDEY 
GPETRLSMSQLNEKETPFELIEALLKYI 
ETLNVPGAVLVFLPGWNIiIYTMQKHLEM 
NPHFGSHRYQILPLHSQIPREEQRKVFD 
PVPVGVTKVILSTNIAETSITINDWYV 
IDSCKQKVKLFTAENNMTNYSTVWASKT 
NLEQRKGRAGRSTAGFCFHLCSRARFER 
LETHMTPEMFRTPLHE IALS I KLLRLGG 
IGQFLAKAI EPP PLDAVTEABHTLRELD 
AIiDANDELTPLGRILAKLPIEPRFGKMM 
IMGC I FYVGDAI CTIAAATCFPE PFVNE 
GKQLGYIHRNFAGNRFSDHVALLSVFQA 
WDDARMGGEEAE I RFCEHKRLNMATLRM 
TOEAKVQLKEILINSGFPEDCLLTQVFT 
NTGPDNNLDWISLLAFGVYPNVCYHKE 
KRK I LTTEGRNAL IHKS S VNCP FS S QDM 
NYPSPFFVFGEKIRTRAISAKGMTLVPP 
LQIjLLFASKKVQSDGQIVLVDDWIKLQI 
SHEAAACITGLRAAMEALWEVTKQPAI 
ISQLDPVNERMLNMIRQISEPSAAGINL 
MIGSTRYGDGPRPPKMARYDNGSGYRRG 

uoo X oljlib i bub I boliu ibobb X uub Al 

PSGR I CAGVGGGYRGVSRGGFRGNSGGD 
DIEEEVAAIKLGYVSSVCRQ 


13870 


27771 


A 


14038 


431 


542 


EGITPGWARWLTPVI PGLWEBEAGGSPG 
REIDIGLAN 


13871 


27772 


A 


14040 


238 


362 


RRCTFLS FRMLAPRVYS WGKRAFS PS V 
CVRAHGKCDYSYS 


13872 


27773 


A 


14041 


6290 


3514 


FRAAGSSSTNSGRICPLPSGALLYQSEG 
LIARPHGKGSPQVGRQQHQAVRVGSAHA 
SSAGYVCVSAFPFVFLRFCVGGGPGISR 

NKMNKVEQKSQESVSFKDVTVGFTQEEW 
QHLDPSQRALYRDVMLENYSNLVSVGYC 
VHKPEVI FRLQQGEEPWKQBEBFPSQSF 
PEVWTADHLKERSQENQSKHLWEWFIN 
NEMLTKEQGDVTGIPFNVDVSSFPSRKM 
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nucleotide 
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peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
C=Cysteine, D=Aspartic Acid, 
E=Giutamic Acid, F=Phenylalanine, 
G=Glycine, JBNHistidine, t=Isoleucine, 
K=Lysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=Glutaraine, R—Arginine, S=Serine, 
T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon, /^possible 
nucleotide deletion, ^possible 
nucleotide insertion 














FCQCDSCGMSFNTVSELVISKINYLGKK 
SDEFNACGKLLLNIKHDETHTREKNEVL 
KNRNTLSHRENTLQHEKIQTIiDHNFEYS 
I CQETLLE KAVFNTRKRENAEENNCDYN 
EPGRTFCDSSSLLFHQISPSRDNHYEFS 
DCEKFLCVKSTLS KPHGVSMKHYDOGES 
GNNFRRKLCLSHLQKGDKGEKHFECNEC 
GKAFWEKSHLTRHQRVHTGQKPFQCNEC 
EKAFWDKSNLTKHQRSHTGEKPFECNEC 
GKAFSHKSALTLHQRTHTGEKPYQCNAC 
GETFYQKSDLTKHQRTHTGQKPYECYEC 
GKSFCmSHLTVHQRTHTGEKPFECLEC 
GKSFCQKSHLTQHQRTHIGDKPYECNAC 
GKTFYHKSVLTRHQI IHTGLKP YECYEC 
GKTFCLKSDLTIHQRTHTGBKPFACPEC 
GKFFSHKSTLSQHYRTHTGEKPYECHEC 
GKIFYNKSYLTKHNRTHTGEKPYECNEC 
GKTFCQKSQLTQHQRIKIGEKPYECNEC 
GKAFCHKSALIVHQRTHTQEKPYKCNEC 
GKSFCVKSGLI FHERKHTGEKPYECNEC 
GKFFRHKSSLTVHHRAHTGEKSCQCNEC 
GKIFYRKSEIAQHQRSHTGEKPYECNTC 
RKTFSQKSNLIVHQRKHlGENIjMNEf^ I 
RNFQPQVSLHNASEYSHCGESPDDILNV 
Q 


13873 


27774 


A 


14043 


187 


422 


LTIPKLHSAQEPAGPVSPPPPPSPTFQI 
GGRSRAEPRNQYCPQVAPVPALRGCLPL 
SPGEPPHDTSSPRHLTCGGIVF ! 


13874 


27775 


A 


14044 


340 


442 


NKFLSWVRLI14PVIPTFWEAKYGGWLEA 
RSLRMQ 


13875 


27776 


A 


14045 


3 


1240 


LVEGAAGQGVSDGARLRKCGTRSFPGSE 
EVLSSMARGSAFPAAALWLWS ILLCLLA 
LRAEAGPPQEESIiYLW I DAHQARVLIGF 
EED I L I VS EGKMAP FTHDFRKAQQRMPA 
ISVNIHSMNFTWQAAGQAEYFYEFLSLR 
SLDKGIMADPTVNVPLIiGTVPHKASWQ 
VGFP CLGKQDGVAAFEVD VI VMNS EGNT 
ILQTPQNAI FFKTCQQAECPGGCRNGGF 
CNERRICECPDGFHGPHCEKALCTPRCM 
NGGLiCVTPG FC I CP PGF YG VNCDKANCS 
TTCFNGGTCFYPGKCICPPGLEGEQCEI 
SKCPQPCRNGGKCIGKSKCKCSKGYQGD 
LCSKPVCE PGCGAHGTCHEPNKCQCQEG 
ri rlUKrl V-JN JSJx I n»Holj InAl^Kir ALiAyiiKyil 
TPSLKKAEERRDPPESNYIW 


13876 


27777 


A 


14046 


18 


274 


YSQEVLCEMNRSGWSNI KRGWAWWQLT 
TTIPALWEAEVGGLLEPRSLRLASPQPP 

F 


13877 


27778 


A 


14047 


177 


441 


GGSHYAWPNITFI FTKLIAQFFCFFEMQ 

VMAQILKNQRKSNFDEDATLLGLSYTLL 
GEI 


13878 


27779 


A 


14048 


136 


319 


DVFLLLFHSNYFLFFETGSHSVSQAGVK 
WCDLSSLQPPPLGSSDPPTSASRVPALF 
FFLEF 


13879 


27780 


A 


14049 


388 


128 


ELKLPSYKGQSPQLSLRRYFADIiIAIVS 
NRFTLCPSARHLAVYLLDLFMDRYDISI 
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OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, ^Phenylalanine, 
G=GIycine, ENHistidine, I=Isoleucine, 
K=Lysine, L=Leucine, M^Methionine, 
N=Asparagine, P=Proline, 
^poiutamine, K— Arginine, o=oenne, 
T=Threonine, V=Valine, 
w— i ryptopnan, i— tyrosine, 
X=Un known, *=Stop codon, /=possibIe 
nucleotide deletion, \=possible 
nucleotide insertion 














yUi-tiiJj VAbbCXiJjIjAi> Ivx ESDJjHDWKFIiM 
FI 


13880 


27781 


A 


14050 


262 


3 


VIYNYISVRPFPNIYHTFYTLSLFLYLS 
THTHTHTHTHTHTQRETGHVCVCIYTYM 
YMCM X AKWWYKFT F L»I SGKARI HTHXiHP 
CI 


13881 


27782 


A 


14051 


149 


423 


YPHLCAFLLLPSISSFRAGVFFFSYIiSL 
TAYPLTWKPPEPPATPQPTPNPPPGHPA 
TRGPKSALLDHSYACASAPPLGTEGTAP 
SPSFALI 


13882 


27783 


A 


14052 


2 


1440 


FVAVTAMAAPCLLRQGRAGALKTMLQEA 
QVFRQIiASTVSIiSAESGKSEKGQPQNSK 
KQSPPKNWEPKERGKLLATQTAAELSK 
NLSSPSSYPPAVNKGRKVRSPSPSGSVL 
FTDEGVPKFLSRKTLVEFPQKVLSPFRK 
QGSDSEARQVGRKVTSPSSSSSSSSSDS 
ESDDEADVSBVTPRWSKGRGGLRKPEA 
SHS FENRAFRVTVS AKEKTLLQKPHVDI 
TDPEKPHQPKKKGSPAKPSEGRENARPK 
TTMPRSQVDEEFLKQSLKBKQLQKXFRL 
NEIDKESQKPFEVKGPLPVHTKSGLSAP 
PKGS PAP AVLABEARAEGQLQAS PPGAA 
EGHLEKPVPEPQRKAAPPLPRKETSGTQ 
G I EGHLKGGQAIVEDQI PPSNLETVPVE 
NNHGFHEKTAALKLEAEGEAMEDAAAPG 
NDRGGTQEPAPVPAEPFDNTTYKNLQHH 
DYSTYTFLDLNLELSKFRMPQPSSGRES 
PRH 


13883 


27784 


A 


14053 


177 


1253 


EKKRTLPVRSVTGELQGKSLSDLAAGTM 
DSEKKRFTEEATKYFRERVSPVHLQILL 
TNNEAW KRFVTAAELPRDEADAL YEALK 
KLRTYAAIEDEYVQQKDEQFREWFLKEF 
PQVKRKI QE S I E KLRALANG I EEVHRGC 
TISNWSSSTGAASGIMSLAGLVLAPFT 
AGTSIiALTAAGVGLGAASAVTGITTSIV 
EHSYTSSAEAEASRLTATSIDRLKVFKE 
VMRDITPNLLSLLNNYYEATQTIGSEIR 
AIRQARARARLPVTTWRISAGSGGQAER 
T IAGTTRAVS RGAR I LSATTSG I FLALD 
WNLVYES KHLHEGAKSAS AEELRRQAQ 
ELEENLMELTQIYQRLNPCHTH 


13884 


27785 


A 


14054 


1585 


2099 


ICVKTFPPIiALQVRMAAAEHHHSSGLPY 
WP xliTAETLKNRMGHQPPP PTQQHS I TD 
NSLSLKTPSERLLYPLPPSAPPSADDNL 
KTPPECLLTPLPPSALPSADDNLKTPAE 
CLLYPLPPSADDNLKTPPECLFTPLPPS 
APPSVDDNLKTPPECVCSLPPHPQRMI I 
SRN 


13885 


27786 


A 


14055 


2 


2865 


ALPDGGASVASDRAEGRPAKPSKTAARE 
KTEGAVAAVGGGPSSFRCCYGCCHEARL 
GRTSLPRGVIMLTEASLSIWGWGSLG IV 
LFLITFGPFVIFYLTFYILCFVGGGLW 

x uur \jiv 1 si oars. X Jj tt\i K Xiir Jr X o Jf\j V 

PKCLEEMKREARTIKIDRRLTGANIIDE 
PliQQVIQFSLRDYVQYWYYTIiSDDESFL 
LE I RQTLQNALIQFATRSKE IDWQPYFT 
TRIVDDFGTHLRVFRKAQQKITEKDDQV 
KGTAEDLVDTFFEVEVEMEKEVCRDLVC 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 
etfa 
od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence <A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Pbenylalanine, 
G=Glycine, H=Histidine, I=lsoleucine, 
K=Lysine, L=Leucine, MNMethiontne, 
N=Asparagine, P=Proline, 
Q^GIutamine, R— Arginine, S = Serine, 
T=Threonme, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Unknown, *=Stop codon,/=possibIe 
nucleotide deletion, V=possible 
nucleotide insertion 






4 








TSPKDESGFLRDLCEVLLYTjIiLPPGDFQ 
NKIMRYFVRBILARGILLPLINQLSDPD 
YINQYVIWMIRDSNCNYEAFMNIIKLSD 
NIGELEAVRDKAAEELQYLRSLDTAGDD 
INTIKNQINSLLFVKKVCDSRIQRLQSG 
KEItrrVKLAANFGKLCTVPLDSILVDNV 
ALQFFMDYMQQTGGQAHLFFWMTVBGYR 
VTAQQQIiEVLLSRQRDGKHQTNQTKGLL 
RAAAVGI YEQYLSEKAS PRVTVDDYLVA 
KIiADTLNHEDPTPEIFDDIQRKVYBLML 
RDERFYPSFRQNALYVRMLAELDMLKDP 
SFRGSDDGDGESFNGSPTGSINLSLDDL 
SNVSSDDSVQLHAYISDTVYADYDPYAV 
AGVCNDHGKTYALYAITVHRRNLNSEEM 
WKTYRRYSDFHDFHMRITEQFESLSSIL 
KLPEKXTFNNMDRDFTjEKRKKDLNAYLQ 
LLLAPEMMKAS PAIiAHYVYDFLENKAYS 
KGKGDFARKMDTFVNPLRNSMRNVSNAV 
KSLPDSLAEGMTKMSDNMGKMSERLGQD 
IKQb FFKVPPIi I P Ub DP JsHKK V SAQJj 
DDNVDDNIPLRVMLLLMDEVFDLKERNQ 

HVDWMTS PEQVADSVKRFRDAFWPNGIL 

GE 


13886 


27787 


A 


14057 


311 


150 


FLCFKYRPJGFATIiHSGLKLLGS SDPPS S 
TSQIPGITGTSHCVQPTHLFFLALS 


13887 


27788 


A 


14058 


53 


211 


RQHITCLDIFFIHMNYKYKYINIYLSIY 
LSIYLSIYLIYLSISHSSYITERA 


13888 


27789 


A 


14059 


1 


1642 


RDGRKMATATIALQVNGQQGGGSEPAAA 
AAWAAGDKWKP PQGTDS I KMENGQSTA 
AKLGLPPLTPEQQEALQKAKKYAMEQSI 
KSVLVKQTTAHQQQQLTNLQMAAQRQRA 
LAIMCRVYVGSIYYELGEDTIRQAFAPF 
GPIKSIDMSWDSVTMKHRGFAFVEYBVP 
EAAQLALEQMN S VMLGGRN I KVGRPSN I 
GQAQP I JJJQTjAEEARAFNRI YVASVHQD 
LSDDDI KSWEAFGKIKSCTLARDPTTG 
KHKGYGFIEYEKAQSSQDAVSSMNLFDL 
GGQYLRVGKAVTPPMPLLTPATPGGLPP 
AAAVAAAAATAKITAQEAVAGAAVIiGTL 
GTPGLVSPAliTLAQPLGTLPQAVMAAQA 
PGVITGVTPARPPIPVTIPSVGWNPIL 

IV ci p PTT jRI .T »"RPK'in?Trn , RFT?TiFPF <?RP PR 
MLSEQEHMSISGSSARHMVMQKLLRKQE 
STVMVLRKMVDPKDIDDDLEGEVTEECG 
TTFnaVNPVT T YOEJCOGEEEDAE T TVKT F 
VEFS IASETHKAIQALNGRWFAGRKWA 
EVYDOERFDNSDTjSA 


13889 


27790 


A 


14060 


1238 


1474 


VLALQEGRP WRRE PAS IDACRLNFQRLR 
RGKFSNVLFPGLAQEALYSGGYHLKFAD 
ELMGGNLKKSTADASG5RGHQL 


13890 


27791 


A 


14061 


266 


400 


GQWARPVIPALWEAKAGGSLEPSMLRPA 
GQHIKTPSVLITSKKT 


13891 


27792 


A 


14062 


2178 


1881 


VLQAPS ILLDAPRTDGGDMGRAMVARLG 
LWlxLLl^LLl^IYSSETTTGTSSNSS 
QSTSNTCLAPNPTNATTKAAGGALQSTA 
SLFWSLSLLHLYS 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

eth 

od 


SEQID 
NO: in 
USSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acia 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
last amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Acid, 
E=Glutamic Acid, F=Phenylalanine, 
G=Glycine, HHHistidine, Hsoleucine, 
K=Lysine, LHLeucine, M=Methionine, 
N^Asparagine, P=Proline, 
v=VFiutamine, K— Arginine, serine, 
^Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /=possib!e 
nucleotide deletion, V^possible 
nucleotide insertion 


13892 


27793 


A 


14063 


261 


1 


NSVLTLALLFSLHVHLPPPPSGVTDTAG 
ALSTGGPPSPTSVTGRGPAHSHASQLPP 
APGEFAPLNESCRGWAGEAFLERFPDAW 
VDP 


13893 


27794 


A 


14064 


2384 

• 


1115 


QHFSRRGLCWEQRS S VTSSWTSGAWS P 
PCPPSNASCNTLHTRDWASPDPGGQGSL 
GESPGPAPPGQIiHTLDTDLHSLAQIGGK 
SPVAGVGNGGSLWPRESPGTANGHSPEH 
TPPGPGPPGPCPTKRRLLPAGEAPDVSS 
EEEGPAPRRRRGSLGHPTAANS SDAKAT 
P FWSHLL PGPKE P VLDPTDCG PMGRRLK 
GARRLKLS PLRS LRKGPGLLS PPRASPV 1 
PTPAVSRTLLGNFEESLLRGRFAPSGHI 
EGFTAE IGASGS YCPQHVTLPVTVTFFD 
VSEQNAPAP FLG IVDLNPLGRKGYSVPK 
VGTVQVTLFNPNQTVVKMFLVTFDFSDM 
PAAHMTFLRHRLFLVPVGEEGNANPTHR 

SRRSLELDTGLPYELQAVTEAPHNPRYS 
PLP 




z / /yj 




!*RK>J 


LSI 




vj WAVQiiGXGTBGKEGTSSS ERQREGRGD 
GNRKTGRRREGRRSTKTDAPSPYPAHRP 
RSKLI 


13895 


27796 


A 


14067 


238 


45 


IQTCGSHS FCI PS I WEAKAGGLLEPRSL 
SPACPTQKDPISHLGSELPQSFRLLIRL 
VPLFLSAC 


13896 


27797 


A 


14068 


3 


1705 


SCESKATPWRAVSASQELQHPQGGQRSP 
LPGDLTPARPNPAYPLTVECQRSCSRPV 
PAPLPPHTHPPGSSCFYSSFSFITKATA 
PGAQRRAVTQAERGRMGFLGTGTWILVL 
VLPIQAFPKPGGSQDKSLHNRELSAERP 
LNEQIAEAEEDKI KKTYPPENKPGQSNY 
SFVDNLNLLKAITEKEKIEKBRQSIRSS 
PLDNKLNVEDVDSTKNRKLIDDYDSTKS 
GLDHKFQDDPDGLHQLDGTPLTAEDIVH 
KIAAR I YEENDRAVFDKI VS KLLNLGL I 
TESQAHTLEDEV7VEVLQKLISKEANNYE 
EDPNKPTSWTENQAGKIPEKVTPMAAIQ 
DGLAKGENDETVSNTLTLTNGLERRTKT 
YSEDNFEELQYFPNFYALLKS IDSEKEA 
KEKETLITIMKTLIDFVKMMVKYGTISP 
nnijvo i Jb IjUh M I AIjQTKNKLE KNATD 
NISKLFPAPSEKSHEETDSTKEEAAKME 

EAYLEAIRKNIEWLKKHDKKGNKEDYDL 
S KMRD F INKO ADAYVPTf nT T .r>KT?T? a P* A T 
KRIYSSL 


13897 


27798 


A 


14069 


1 


140 


NTSVAIQTIEYYSAFKRKETLTHVPLWM 


13898 


27799 


A 


14071 


212 


104 


HTNHQCQTVLPAGQALATPQGIiAPS PVF 


13899 


27800 


A 


14072 


358 


112 


FPLTVMLNVFLWWQNLKHFCWFIQEQTjM 
FFFFF FLRTGSHYWQAGLELLGSTNLP 
ASVSRVAGTTGIHNCTQFNPLHTLHL 


13900 


27801 


A 


14073 


3 


151 


YMGFHHVGYAGLELLTSSDLPPWPPKCW 
DYRHEPSCLAMFFYFALIAPE 


13901 


27802 


A 


14074 


8 


1493 


VTIHHLFVGQAVRADTLKKKSPSCLLCI 
VFIPVPRIDLTNIDQVAVIFKHHFPVGR 
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SEQID 
NO: of 
nucleotide 
sequence 


SEQID 
NO: of 
peptide 
sequence 


M 

etb 

od 


SEQID 
NO: in 
DSSN 
09/515,1 
26 


Predicted 

beginning 

nucleotide 

location 

correspond 

ing to first 

amino acid 

residue of 

peptide 

sequence 


Predict- 
ed end 
nucle- 
otide 
location 
correspon 
ding to 
fast amino 
acid 

residue of 

peptide 

sequence 


Amino acid sequence (A=Alanine 
OCysteine, D=Aspartic Add, 
E=G!utamic Acid, F=Phenylalanine, 
G=Glycine, H=Histidine, I=Isoleucine, 
KHLysine, L=Leucine, M=Methionine, 
N=Asparagine, P=Proline, 
Q=G!utamine, R=Arginine, S=Serine, 
T=Threonine, V=VaIine, 
W=Tryptophan, Y=Tyrosine, 
X=Un known, *=Stop codon, /^possible 
nucleotide deletion, V=possibIe 
nucleotide insertion 














GDAVLKTOAPAQCLCSRMGPAWLWLLGT 
GILASVHCQPLLAHGDKSLQGPQPPRHQ 
LSEPAPAYHRITPTITNFALRLYKELAA 
DAPGNI FFS P VS I S TTLALLS LGAQANT 
SALILEGLGFNLTKTPEADIHQGFRSLL 
HTLALPSPKLELKYGNSLFLDKRIiKPRQ 
HYLDSIKELYGAFAFSANFTDSVTTGRQ 
INDYLRRQTYGQWDCLPEFSQDTFMVL 
ANYIFFKAKWKHPFSRYQTQKQKSFFVD 
ERTSLQVPMMHQKEMHRFLYDQDLACTV 
LQIEYRGNALALLVLPDPGKMKQVEAAL 
QPQTLRKWGQLLLPSLLDLHLPRFSISG 
rifNLBDILPQIGLTNTLNLEADFSGVTG 
QLNKTISKVSHKAMVDMSEKGTEAGAAS 
GIiLSQPPSLNTMSDPHAHFNRPFLLIjLW 
EVTTQSLLFDGKWNPVAG 
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WHAT IS CLAIMED IS: 

1 . An isolated polynucleotide comprising a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 1-13901 , a mature protein coding portion of SEQ ED NO: 1-13901 , 
an active domain of SEQ ID NO: 1-13901 , and complementary sequences thereof. 

5 

2. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide hybridizes to the polynucleotide of claim 1 under stringent hybridization 
conditions. 

10 3. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said 
polynucleotide has greater than about 90% sequence identity with the polynucleotide of claim 1. 

4. The polynucleotide of claim 1 wherein said polynucleotide is DNA. 

15 5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the 
complementary sequences. 

6. A vector comprising the polynucleotide of claim 1 . 
20 7. An expression vector comprising the polynucleotide of claim 1 . 

8. A host cell genetically engineered to comprise the polynucleotide of claim 1 . 

9. A host cell genetically engineered to comprise the polynucleotide of claim 1 operatively 
25 associated with a regulatory sequence that modulates expression of the polynucleotide in the host 

cell. 

10. An isolated polypeptide, wherein the polypeptide is selected from the group consisting of: 

(a) a polypeptide encoded by any one of the polynucleotides of claim 1 ; and 
30 (b) a polypeptide encoded by a polynucleotide hybridizing under stringent 

conditions with any one of SEQ ID NOS: 1-13901. 

11. A composition comprising the polypeptide of claim 1 0 and a carrier. 
35 12. An antibody directed against the polypeptide of claim 10. 
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13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample with a compound that binds to and forms a complex 
with the polynucleotide of claim 1 for a period sufficient to form the complex; and 
t 5 b) detecting the complex, so that if a complex is detected, the polynucleotide 

of claim 1 is detected. 

14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: 

a) contacting the sample under stringent hybridization conditions with 
1 0 nucleic acid primers that anneal to the polynucleotide of claim 1 under such conditions; 

b) amplifying a product comprising at least a portion of the polynucleotide of 

claim 1; and ^ 

c) detecting said product and thereby the polynucleotide of claim 1 in the 

15 sample. — * 

1 

15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the method p 
further comprises reverse transcribing an annealed RNA molecule into a cDNA polynucleotide. ^ 

20 16. A method for detecting the polypeptide of claim 1 0 in a sample, comprising: Q 

a) contacting the sample with a compound that binds to and forms a complex 
with the polypeptide under conditions and for a period sufficient to form the complex; and 

b) detecting formation of the complex, so that if a complex formation is 
detected, the polypeptide of claim 10 is detected. 

25 

1 7. A method for identifying a compound that binds to the polypeptide of claim 1 0, 
comprising: 

a) contacting the compound with the polypeptide of claim 1 0 under 
conditions sufficient to form a polypeptide/compound complex; and 
30 b) detecting the complex, so that if the polypeptide/compound complex is 

detected, a compound that binds to the polypeptide of claim 10 is identified. 

1 8. A method for identifying a compound that binds to the polypeptide of claim 1 0, 
comprising: 
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a) contacting the compound with the polypeptide of claim 10, in a cell, under 
conditions sufficient to form a polypeptide/compound complex, wherein the complex drives 
expression of a reporter gene sequence in the cell; and 

b) detecting the complex by detecting reporter gene sequence expression, so 
that if the polypeptide/compound complex is detected, a compound that binds to the polypeptide 
of claim 10 is identified. 

19. A method of producing the polypeptide of claim 10, comprising, 

a) culturing a host cell comprising a polynucleotide sequence selected from 
the group consisting of a polynucleotide sequence of SEQ ID NO: 1-13901 , a mature protein 
codingportionofSEQEDNO: 1-13901 , an active domain of SEQ ID NO: 1-13901 , 
complementary sequences thereof and a polynucleotide sequence hybridizing under stringent 
conditions to SEQ ID NO: 1-13901 , under conditions sufficient to express the polypeptide in 
said cell; and 

b) isolating the polypeptide from the cell culture or cells of step (a). 

20. An isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NOS: 13902-27802, the mature protein portion 
thereof, or the active domain thereof. 

21 . The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide 
array. 

22. A collection of polynucleotides, wherein the collection comprises the 
sequence information of at least one of SEQ ID NOS: 1-13901. 

23. The collection of claim 22, wherein the collection is provided on a nucleic 
acid array. 

24. The collection of claim 23, wherein the array detects full-matches to any 
one of the polynucleotides in the collection. 

25. The collection of claim 23, wherein the array detects mismatches to any 
one of the polynucleotides in the collection. 

/ 
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27. 

5 

28. 

10 



The collection of claim 22, wherein the collection is provided in a 
computer-readable format 

A method of treatment comprising administering to a mammalian subject in need 
thereof a therapeutic amount of a composition comprising a polypeptide of claim 
10 or 20 and a pharmaceutical^ acceptable carrier. 

A method of treatment comprising administering to a mammalian subject in need 
thereof a therapeutic amount of a composition comprising an antibody that 
specifically binds to a polypeptide of claim 10 or 20 and a pharmaceutically 
acceptable carrier. 
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